Category: Machine Learning

Mamba-3
Someone Actually Built the AI Research Community from Karpathy's Side Project...
OpenClaw-RL: Allowing AI Agents to Self-Evolve Through Chat
a16z: Agents Perform Poorly Due to Lack of Correct Data Context
4B Model Surpasses GPT-5 in Hallucination Suppression: CMU and Others Propose New Behaviorally Calibrated Reinforcement Learning Method
Jensen Huang Enters the OpenClaw Arena! Most Powerful Open-Source 'Lobster' Model Rivals Opus 4.6
Masterpiece! MIT and Google Train an LLM Capable of Rigorous Bayesian Inference
Karpathy Slept, AI Ran 100 Experiments for Him
MMLU is Dead? 'Humanity's Last Exam' Published in Nature: Global AI Models Collectively Fail!
Anthropic CEO: The Data Bottleneck for Large Models No Longer Exists, Models Are Training Themselves
Google AI Conquers 6 World-Class Problems, More Shocking Than an IMO Gold Medal! Terence Tao Points the Way to a New Game
OpenAI Legend Reveals: Undergraduate Lands Job at OpenAI with Just One Blog Post! No PhD, Zero Papers
Are LLM RL Training Trajectories Actually Linear? Miaow Lab's Latest Work: Directly 'Predict' Future Models Without Further Training!
Qwen3.5: Towards Native Multimodal Agents
Xiaomi Introduces JudgeRLVR: Judge First, Generate Second — Breaking the Efficiency Paradox of "Long Chain-of-Thought" in Reasoning Models
Stable-DiffCoder Surpasses Autoregressive Models! New Breakthrough in Code Generation with Diffusion Models
Stop Clipping Aggressively! Qwen Proposes GatedNorm, Unifying the Perspective on Residual Flow Mysteries
Less is More: Recursive Reasoning with Tiny Networks
GPT-5.3-Codex Released: The First Self-Training Model
Is PPO Dead? The Reinforcement Learning Foundation Used by DeepSeek Has Major Flaws!
Meituan Quietly Launches New Model! Real-Test of First Open-Source "Heavy Thinking" Model: 8-Way Parallel, Agent Hard-Clashes with Claude
Google's New Discovery: DeepSeek Reasoning Splits into Multiple Personalities, Left and Right Brain Competing for Intelligence
Open-Source Framework Enables Code AI to Learn from GitHub! Bug Fix Rate Soars to 69.8%, Performance Sets New Records
Google Just Overturned Model Memory, and Nvidia Revolutionized Attention｜Hao Good Chat Paper
From 'LLM-as-a-Judge' to 'Agent-as-a-Judge': A Review of the Three-Stage Evolution of AI Evaluation Paradigms