Category: Machine Learning
- Breaking! Meta Open-Sources Its Latest World Model
- SLOT: Sample-Specific Inference Optimization Tool Arrives, Boosting Accuracy by 10% Without SFT or RL
- After ZeroSearch, Tongyi's Latest Work MaskSearch Proposes a New Framework for Reasoning-Search Pre-training
- 35% Accuracy Evaporates! ByteDance & HUST's WildDoc Reveals Robustness Shortcomings in Multimodal Document Understanding
- Google Research Finds: Prompt Design is the Core of Multi-Agent Systems!
- The Sky Has Fallen! Apple Just Proved: DeepSeek, o3, Claude and Other "Reasoning" Models Lack True Reasoning Ability
- R1-like Training No Longer Just Focuses on Result Correctness! CUHK Launches SophiaVL-R1 Model
- Agent Zero: An Open-Source, Free, Evolving, and Learning Agent
- DeepMind's Latest Research: Agents Are World Models!
- Google Open-Sources Gemini-Level AI Research Capabilities: Is Deep Research Becoming Commoditized?
- Reviewing the Progress of RL-Reasoning
- OPA-DPO: An Efficient Solution for the Hallucination Problem in Multimodal Large Models
- AI Learns Reasoning Solely by "Confidence": Zhejiang University Alumnus Replicates DeepSeek's Long Chain-of-Thought Emergence, Reinforcement Learning Needs No External Reward Signals
- No Manual Annotation Needed! AI Self-Generates Training Data, Unlocking Reasoning Capabilities via "Deduction-Induction-Abduction"
- Sakana AI's New Research: The Birth of the Darwin-Gödel Machine with Self-Encoding Improvement and Self-Referential Open-Ended Evolution
- LLM + RL Questioned: Deliberately Using Incorrect Rewards Still Significantly Boosts Math Benchmarks, Causing a Stir in the AI Community
- Alibaba Open-Sources New Qwen Model: A Dragon Boat Festival Gift!
- Mixture-of-Thought (MoT) Framework: Enabling Models to Learn "Human-like Thinking"
- 312 Trajectories Boost Performance by 241%! SJTU and SII Open-Source Computer Agent Surpasses Claude 3.7
- Claude 4 Completely Out of Control! Self-Replicating Madly to Escape Humans, Netizens Exclaim: Pull the Plug!
- Interpretation of Seed1.5-VL Technical Report
- Train a Tiny LLM from Scratch for Just ¥8 in 9 Hours! Full Tutorial Including Reasoning, MoE, and More
- More Capable Than Gemini Diffusion! The First Multimodal Large Diffusion Language Model MMaDA Released, Achieving Strong Reasoning and High Controllability
- OpenAI's Big Move! Core API Now Supports MCP, Revolutionizing Agent Development Overnight
- Does AI Know When to "Think"? Thinkless Teaches Large Language Models When to Reason