Category: Natural Language Processing

Ask, Don’t Judge: Binary Questions for Interpretable LLM Evaluation and Self-Improvement
Enterprise Text-to-SQL: 5 Disruptive Insights from LinkedIn and Top Labs
RAG Context Stuck at 512 for Too Long: The 32K Context Era for Embedding Models Begins with Granite R2
Kaiming He's Team Unveils 'Diffusion Model' Breakthrough: Discrete Decoding at the 'Last Mile'
How Do You Evaluate the Interaction Model Recently Released by Thinking Machines? - wangleineo's Answer
ICML 2026 | Rejecting Brute Force, PRISM Framework Enables Efficient Test-Time Scaling for dLLMs
Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies in LLM Reinforcement Learning
Newly Open-Sourced Small Model with Under 1B Active Parameters Outperforms GPT-5 High-End Version in Math
Perhaps the Most Impressive AI Paper of Recent Years: After Giving AI Reasoning Real-Time Subtitles, Its Inner Thoughts Are Shocking!
Subquadratic — Efficiency is Intelligence
Abstract-CoT: Reasoning Tokens Slashed 11.6x, Chain-of-Thought Without Words Shatters LLM Efficiency Ceiling
Paper Brief | Automated Knowledge Graph Enrichment Using Multi-Agent Large Language Models (NeurIPS 2025)
Thinking Without Words: Efficient Latent Reasoning with Abstract Chain-of-Thought
The World's Most Notorious Forum Uncovered AI's Most Crucial 'Thinking' Ability
MSA Code Officially Open-Sourced!
Reconstructing Native Multimodality! Meituan Releases Purely Discrete Base Model, Truly Achieving 'Everything is Token'
How Can a Model Trained on 200M Real Tokens Match the Performance of 360M Data?
One Year into RAG: My Biggest Regret Was Adopting Knowledge Graphs
Masterpiece! MIT and Google Train an LLM Capable of Rigorous Bayesian Inference
MMLU is Dead? 'Humanity's Last Exam' Published in Nature: Global AI Models Collectively Fail!
Google's New Research Identifies the Crucial Tokens Where Large Models Ponder Deeply!
Is PPO Dead? The Reinforcement Learning Foundation Used by DeepSeek Has Major Flaws!
Transformer Authors Lead Sakana AI to Release Three Papers: Completely Reconstructing Long-Text Memory Mechanisms
Google's New Discovery: DeepSeek Reasoning Splits into Multiple Personalities, Left and Right Brain Competing for Intelligence
True Hack! MIT New Research: Zero Architecture Changes, Unlocking Million-Level Context for Large Models