Category: Artificial Intelligence
- 88-Year-Old Algorithm Pioneer Stunned! Claude Teams Up with GPT to Solve 30-Year Problem; 14-Page Paper Published with Zero Edits
- Intriguing Move: OpenAI Turns Codex into a Plugin and Embeds It Directly into Claude Code
- Demystifying the Sparse LLM Innovation by NVIDIA and Sakana AI
- Demis Hassabis's Stunning Confession: The AI I Built Could Extinguish Humanity, But No One Can Stop It Now
- The True Capabilities of LLMs Exposed: 90% in Python, 0% in Whitespace! The 'Top Student' Persona of AI Crumbles
- AI Is Exceptionally Savvy in Social Niceties, and Humans Fall for It: Research on AI Sycophancy Published in Science
- Is Synthetic Data Better Than Real Data?
- 500 Seed Samples, Four Self-Evolving Agents: Reasoning Capability Surges by 10.7%
- SortedRL: Accelerating Large Model RL Training by 50% and Boosting Efficiency by 18%
- Claude Uncovers 20-Year Vulnerability in 90 Minutes! 50k-Star 'Secure' System Dethroned, Linux Kernel Not Spared
- Reconstructing Native Multimodality! Meituan Releases Purely Discrete Base Model, Truly Achieving 'Everything is Token'
- Google's $90 Billion AI Paper That Wiped Out Memory Stocks Accused of Academic Fraud
- AI Can Already Write 80% of Code, But Agents Have a Fatal Flaw! OpenAI Codex Tech Lead: Asking the Wrong Question is Worse Than Not Knowing How to Write
- InCoder-32B: Beihang Team Generates 2.5M Verified Industrial Code Samples in Real Simulation Environments, Topping Open-Source Industrial Coding Capabilities
- Top Models Like GPT-5.4 and Claude Opus Exposed for 'Fake Reasoning': Is the Problem-Solving Process Just a 'Performance'?
- Agentic Software Engineering #6 | buffa: A Methodological Sample from Anthropic AI Coding (Rust)
- vLLM's Hardcore Quadruple Release!
- Is The Matrix Coming True at Google? Top-Secret AI Exposed: Servers Overloaded, Brin Coding Frenzy
- Agentic Software Engineering #4 | When the Agent Writes the Code, Who Says 'Merge Ready'?
- Agentic Software Engineering #2 | Rethinking Code Review
- Agentic Software Engineering | Stop Measuring AI's Work with Human Rulers: Native Agent Work Estimation
- Inference No Longer Wastes Cycles on Logits: FlashSampling Accelerates Decoding by 19%
- Models Have Gained Introspective Capabilities, But Their Inner Doors Were Locked | Hao's Paper Talk
- Multi-Agent Orchestration Too Tedious? MASFactory Uses Vibe Graphing to Simply 'Speak' It Into Existence
- Anthropic Engineering Blog: How Anthropic Designed Claude Code's Auto Mode