Category: Machine Learning Research

Anthropic Participates in New Paper: Why Do Larger Models Learn More? The Answer Lies in Scaling
Why Agent Training Always Crashes on Long-Horizon Tasks
The First Spatio-Temporal Reasoning Framework: Enabling Large Models to Truly Understand Spatio-Temporal Data | ACL'26
Are Your Custom Skills Slowing Down the Model? Strategy Genes Are the Real Answer
Sharing Two Latest Harness Papers: One from Google, One from Microsoft
The 'Car Wash Dilemma' That Stumped AI Across the Web Has Finally Been Solved
Large Models Can Now Modify Parameters 'In-Place'! ByteDance Seed & Peking University Paper: Test-Time Inference Requires No Extra Layers or Retraining
ASI-Evolve: AI Accelerates AI
MSA Code Officially Open-Sourced!
Is Synthetic Data Better Than Real Data?
Top Models Like GPT-5.4 and Claude Opus Exposed for 'Fake Reasoning': Is the Problem-Solving Process Just a 'Performance'?
Inference No Longer Wastes Cycles on Logits: FlashSampling Accelerates Decoding by 19%
Injecting Continuous New Knowledge into Large Models: Beihang's CASE Framework Edits Thousands of Times Without Forgetting, Adding Less Than 1MB of Parameters | WWW'26
Why Does Long-Video Reasoning Always Fail? Symphony's Answer Is Cognitive Division of Labor
ICLR 2026 | How Far Can Unsupervised Reinforcement Learning Go for Large Models? A Systematic Answer from the Tsinghua Team
Stop Obsessing Over Outcome Rewards! CUHK Identifies and Solves the "Information Self-Locking" Problem in RL!
Mind-Blowing! MIT Researcher Builds a Computer Inside Transformer: Do LLMs Still Need External Tools?
Transformer Authors Lead Sakana AI to Release Three Papers: Completely Reconstructing Long-Text Memory Mechanisms
True Hack! MIT New Research: Zero Architecture Changes, Unlocking Million-Level Context for Large Models
Tsinghua Research: A Reversal? Confirming RL Doesn't Truly Enhance Base Model Reasoning Ability!
Say Less 'Wait', Do More: NoWait Reshapes Large Model Inference Paths
10 Lines of Code, 15% Improvement in AIME24/25! Unveiling the Entropy Mechanism in Large Language Model Reinforcement Learning
Can AI "Admit Its Own Mistakes"? Solving the "Rashomon" of Multi-Agent Collaboration, Earning ICML 2025 Spotlight