Category: Large Language Models
- Google's New Research Identifies the Crucial Tokens Where Large Models Ponder Deeply!
- Anthropic CEO: The Data Bottleneck for Large Models No Longer Exists, Models Are Training Themselves
- Anthropic's Latest Paper: Internet Anonymity Ends in the AI Era | Hao's Paper Chat
- Not All Tokens Are Equal! Google Proposes True Deep Thinking: Long Chain of Thought ≠ Deep Reasoning
- Qwen3.5-Flash Arrives! Three Medium-Scale Models Open-Sourced
- Are LLM RL Training Trajectories Actually Linear? Miaow Lab's Latest Work: Directly 'Predict' Future Models Without Further Training!
- What Exactly Is On-Policy Distillation? An In-Depth Interpretation of On-Policy/Self-Distillation
- Behind the $1 Trillion Evaporation: The Moats of Vertical Software Are Being Rewritten by Large Models
- Latest Interview with Jeff Dean, the Soul of Gemini and Legendary Engineer: In the Future, Everyone Will Have 50 Virtual Interns, No Need for Experts Anymore!
- Qwen3.5: Towards Native Multimodal Agents
- The Bitter Lesson! ROLL Team Shares: Practical Experience in Agentic RL Training
- Xiaomi Introduces JudgeRLVR: Judge First, Generate Second — Breaking the Efficiency Paradox of "Long Chain-of-Thought" in Reasoning Models
- ModelBest's SALA Architecture Is Tearing Down the Transformer's Wall
- Nvidia's New Technique Cuts LLM Reasoning Costs by 8x Without Losing Accuracy
- Stable-DiffCoder Surpasses Autoregressive Models! New Breakthrough in Code Generation with Diffusion Models
- ModelBest Open-Sources 9B On-Device Omni-Modal Model: See, Listen, Interrupt Anytime, Instant Interaction
- Just Released: Claude 4.6 and GPT-5.3-Codex Launch Simultaneously!
- Zhipu's New Model Also Uses DeepSeek's MLA, Runs on Apple M5
- True Hack! MIT's New Research: Zero Architecture Changes, Unlocking Million-Level Context for Large Models
- Is Agentic RAG Worth It? A Four-Dimensional Real-World Test Reveals the Answer!
- From 'LLM-as-a-Judge' to 'Agent-as-a-Judge': A Review of the Three-Stage Evolution of AI Evaluation Paradigms
- Just Now, a New Paper Co-Authored by Liang Wenfeng Goes Viral Late at Night! DeepSeek-V4's New Architecture Revealed: Proposes a New Sparsity Direction, Complements MoE, Significantly Extends Long-Context Capabilities, Stronger Reasoning and Coding Abilities
- LAMER: Meta-Reinforcement Learning Enables Language Agents to Perform Active Exploration
- Stanford's Latest Course Release: Hand-Writing Code is Banned, AI is Mandatory
- From 'Titans+MIRAS & Nested' Architectural Innovations to NeurIPS 2025 Best Paper 'Gated Attention'