Category: Model Optimization

35B-Parameter Science Model Rivals Trillion-Parameter Giants: 'Intern' Science Model Intern-S2-Preview Open-Sourced
ICML 2026 | Rejecting Brute Force, PRISM Framework Enables Efficient Test-Time Scaling for dLLMs
Qwen-Scope: Seeing Through the 'Hidden Thoughts' of Large Models
Thinking Without Words: Efficient Latent Reasoning with Abstract Chain-of-Thought
Scaling Laws for Looped Transformers
Long Context Reduced by 60% + 95% Sparsity: A Double Breakthrough Today Sets New Records in Inference Efficiency
No Reinforcement Learning Needed! Apple's 'Simple Self-Distillation' Achieves Self-Evolution for Coding Models
Multimodal Video Streaming Inference Efficiency Boosted by 56%: Unveiling TWW's Segment-Level Dynamic Memory Mechanism
Are LLM RL Training Trajectories Actually Linear? Miaow Lab's Latest Work: Directly 'Predict' Future Models Without Further Training!
A New Revolution in Reward Models! SWIFT Reads 'Inner Voice' Instead of Text, Creating a Faster, Stronger, and More Cost-Effective AI Judge
Is Your Model's Attention Drifting? RUC and Tsinghua University Introduce LeaF: Pruning Distracting Tokens for Focused Learning
Kimi K2's Key Training Technique: QK-Clip!
Mianbi MiniCPM4: 3x Inference Speed, Outperforming Same-Size Qwen3, Putting Pressure on Alibaba
SLOT: Sample-Specific Inference Optimization Tool Arrives, Boosting Accuracy by 10% Without SFT or RL
Ushering in the Era of On-Device Long Text! OpenBMB's New Architecture Makes MiniCPM up to 220x Faster
Deep Learning: Mamba Core Author's New Work Replaces DeepSeek's Attention Mechanism, Designed for Inference