Category: Large Language Models
- Can AI "Admit Its Own Mistakes"? Solving the "Rashomon" of Multi-Agent Collaboration, Earning ICML 2025 Spotlight
- Stanford Chinese Team's Surprise Upset! AI Writes Pure CUDA-C Kernels, Outperforming PyTorch?
- Large Models Struggling with Sudoku?! Transformer Author's Startup Releases Ranking: o3-mini-high's "Variant Sudoku" Accuracy Only 2.9%
- Andrej Karpathy Praises Stanford Team's New Work: Achieving Millisecond-Level Inference with Llama-1B
- Tsinghua University's New RAG Framework: DO-RAG Accuracy Soars by 33%!
- LLM + RL Questioned: Deliberately Using Incorrect Rewards Still Significantly Boosts Math Benchmarks, Causing a Stir in the AI Community
- Qwen Team Releases Long-Context Reasoning Model QwenLong-L1, Surpassing o3-mini
- All-In Podcast Transcript: Gemini Leads "Infinite Context," AI Ascends from Tool to Cognitive Collaborator
- Llama Paper Authors "Flee," Only 3 Remaining from 14-Person Team, French Unicorn Mistral Becomes the Biggest Winner
- Prolonged Reasoning ≠ High Accuracy! Adaptive Switching Between "Quick Response" and "In-Depth Thought": A Win-Win for Saving Tokens and Improving Accuracy
- ICML 2025 | Bursting the AI Bubble with "Human Testing Methods": Building a New Paradigm for Capability-Oriented Adaptive Assessment
- Alibaba Open-Sources New Qwen Model: A Dragon Boat Festival Gift!
- ICML 2025 | Fast and Powerful Liger! Transformer Instantly Switches to Linear RNN with Only 20M Tokens of Fine-Tuning
- Can LLMs Understand Math? Latest Research Reveals Fatal Flaws in Large Models' Mathematical Reasoning
- How She Brought "System 2" to Large Language Models | An Interview with Dr. Li Zhang from Microsoft Research Asia
- 312 Trajectories Boost Performance by 241%! SJTU and SII Open-Source Computer Agent Surpasses Claude 3.7
- Historic First! o3 Finds Linux Kernel Zero-Day Vulnerability, Uncovered After 100 Scans of 12,000 Lines of Code, No Tools Required
- Statistically Controllable Data Synthesis! New Framework Breaks LLM Data Generation Limitations, McGill University Team Launches LLMSynthor
- Deep Dive | Interview with Character.AI CEO: The Best Applications Haven't Been Invented Yet, AI Field is Like Alchemy, Nobody Knows Exactly What Will Work
- The Smarter AI Gets, The Less Obedient It Becomes! New Study: Strongest Reasoning Models Only Follow Instructions 50% of the Time
- Breaking the Chain-of-Thought Reasoning Bottleneck! "Soft Thinking" Enables LLMs to Learn Human Abstract Abilities, with Reduced Token Usage
- How Does Claude 4 Think? Senior Researchers Respond: RLHF Paradigm is Out, RLVR Proven in Programming/Mathematics
- Interpretation of Seed1.5-VL Technical Report
- Train a Tiny LLM from Scratch for Just ¥8 in 9 Hours! Full Tutorial Including Reasoning, MoE, and More
- Gemini Diffusion: 1500 tokens/sec, Lightning Fast!