Category: Machine Learning

Examining Anthropic's Latest Research: Could This Be the Eve of AI Consciousness?
When Async Agentic RL Meets 'Amnesia for Old Policies': Rethinking Off-Policy Correction
Latest Research from Yann LeCun's Team: Teaching World Models to "Adapt" and Continuously Evolve Through Action
Ask, Don’t Judge: Binary Questions for Interpretable LLM Evaluation and Self-Improvement
Beyond Prediction: The Five Realms of 'Memory' in Dynamics Learning
ACL 2026 | Why Does SFT Always Fail to Learn? Not Every SFT Failure Needs More Epochs! Five "Surgical Scalpels" to Fix SFT
How to Build a Reliable Agent Memory Framework? UC Berkeley's MemFail Stress-Tests 4 Top Memory Systems, Proving Vector Databases Aren't the Only Answer
If AI Begins to Evolve Itself: Recursive Self-Improvement Is Emerging in a Way Far More Realistic Than the 'Singularity'
Models Are Too Fond of Cheating! Cursor Reveals the Inside Story of Composer 2's Reinforcement Learning: Models Can Detect 'Fake Environments', and Floating-Point Non-Determinism Is a Fatal Flaw in RL Training
The Common Mechanism Behind Claude Code and Robots: A Deep Dive into UIUC, Meta, and Stanford's Latest Survey
OpenAI Post-Training Lead: AI Isn't Suddenly Stronger, It Just Crossed a Threshold
Running ARC and Sudoku with 10M Parameters? Bengio's Team Bets on Multi-Trajectory Reasoning
AI Defeats Humans in a Scientific Research Competition for the First Time! Opus 4.7 Sets a World Record with a 2930-Step Sprint
jina-embeddings-v5-omni Released! A Lightweight Omni-Modal Vector Model
Tian Yuandong's New Role: Joining Forces with AI Luminaries on a $650 Million Bet on "Self-Evolving AI"
Genius Move: A Tiny 7B Model Hired GPT-5, and Then Won the Test
OpenAI's Former CTO Unveils Prototype for an AI That's Always 'Present' | Hao's Deep Dive on Papers
Kaiming He's Team Unveils 'Diffusion Model' Breakthrough: Discrete Decoding at the 'Last Mile'
How Do You Evaluate the Interaction Model Recently Released by Thinking Machines? - wangleineo's Answer
ICML 2026 | Rejecting Brute Force, PRISM Framework Enables Efficient Test-Time Scaling for dLLMs
Neuroscience and Machine Learning Are Swapping Their Worst Habits? | 10,000-Word Interview
What Bayes Never Imagined – How a Pastor's Gambling Formula Became AI's First Principle
Anthropic's Latest Research: How to Completely Eliminate Claude's Blackmailing Behavior
Token-Level Precise Length Control: 3B Model Beats GPT 5.4 and Claude
Newly Open-Sourced Small Model with Under 1B Active Parameters Outperforms the High-End Version of GPT-5 in Math