Category: Machine Learning
- MMLU is Dead? 'Humanity's Last Exam' Published in Nature: Global AI Models Collectively Fail!
- Anthropic CEO: The Data Bottleneck for Large Models No Longer Exists, Models Are Training Themselves
- Google AI Conquers 6 World-Class Problems, More Shocking Than an IMO Gold Medal! Terence Tao Points the Way to a New Game
- OpenAI Legend Reveals: Undergraduate Lands Job at OpenAI with Just One Blog Post! No PhD, Zero Papers
- Are LLM RL Training Trajectories Actually Linear? Miaow Lab's Latest Work: Directly 'Predict' Future Models Without Further Training!
- Qwen3.5: Towards Native Multimodal Agents
- Xiaomi Introduces JudgeRLVR: Judge First, Generate Second — Breaking the Efficiency Paradox of "Long Chain-of-Thought" in Reasoning Models
- Stable-DiffCoder Surpasses Autoregressive Models! New Breakthrough in Code Generation with Diffusion Models
- Stop Clipping Aggressively! Qwen Proposes GatedNorm, Unifying the Perspective on Residual Flow Mysteries
- Less is More: Recursive Reasoning with Tiny Networks
- GPT-5.3-Codex Released: The First Self-Training Model
- Is PPO Dead? The Reinforcement Learning Foundation Used by DeepSeek Has Major Flaws!
- Meituan Quietly Launches New Model! Hands-On Test of the First Open-Source "Heavy Thinking" Model: 8-Way Parallel, Agent Goes Head-to-Head with Claude
- Google's New Discovery: DeepSeek Reasoning Splits into Multiple Personalities, Left and Right Brain Competing for Intelligence
- Open-Source Framework Enables Code AI to Learn from GitHub! Bug Fix Rate Soars to 69.8%, Performance Sets New Records
- Google Just Overturned Model Memory, and Nvidia Revolutionized Attention | Hao Good Chat Paper
- From 'LLM-as-a-Judge' to 'Agent-as-a-Judge': A Review of the Three-Stage Evolution of AI Evaluation Paradigms
- Optimization is Geometry, Geometry is Inference: Using Mathematics to End the Transformer Black Box Era
- Let AI Level Itself Up: Meta Pushes Coding to Superintelligence with Self-play RL
- Attention Is Not What You Need? Reframing Sequence Modeling with Geometric Aesthetics via Grassmann Manifolds
- From 'Titans+MIRAS & Nested' Architectural Innovations to NeurIPS 2025 Best Paper 'Gated Attention'
- Under $8,000! Sina Weibo's 1.5B Small Model Surpasses Near-Trillion Parameter Models
- AI Cracks 18th-Century "Mystery" Ledger in Seconds! Google's New Model Blind Test Goes Viral
- SJTU PhD's Latest Insights: Clarifying Reinforcement Learning with Just Two Questions
- Meta Discovers: Slow RAG Systems Are Doing Too Much Unnecessary Work