Latest Articles
- Xiaohongshu Open-Sources First Multimodal Large Model, dots.vlm1, Performance Rivals SOTA!Multimodal AIVisual Language ModelsAI ResearchDeep LearningOpen Source AI...
- Altman Reveals Stunning Prediction: GPT-8 to Cure Cancer by 2035! Humanity Might Wage WWIII Over Compute PowerArtificial IntelligenceOpenAIHealthcare AIAI EthicsFuture TechnologySam Altman...
- ARPO: Agentic Reinforced Policy Optimization, Enabling Agents to Explore One Step Further at Critical MomentsReinforcement LearningLarge Language ModelsTool UsePolicy OptimizationAI Agents...
- Open-Sourcing the Largest High-Quality Scientific Reasoning Post-Training Dataset to Quickly Turn Qwen3 and Others into "Scientists"Artificial IntelligenceScientific ReasoningOpen SourceLarge Language ModelsDataset...
- Wang Mengdi's Team Review of "Self-Evolving Agents": From Static LLMs to Artificial Superintelligence (ASI)Self-Evolving AI AgentsLarge Language ModelsArtificial SuperintelligenceFuture AI ResearchAI Applications...
- Anthropic Team Uncovers 'Persona Variables' to Control Large Language Model Behavior, Cracking the Black Box of AI MadnessLarge Language ModelsAI SafetyBehavior ControlFine-tuningPersona Vectors...
- Google Open-Sources DeepPolisher, Halving Genome Assembly Error Rates; Jeff Dean: "Exciting!"GenomicsDeep LearningHuman Genome ProjectArtificial IntelligenceBioinformaticsGenome Assembly...
- AI's New SOTA in Bug Fixing: ExpeRepair Achieves 60.33% Fix Rate on SWE-Bench Lite, Learns from Experience Like Humans – Developed by Institute of Software, Chinese Academy of SciencesArtificial IntelligenceSoftware DevelopmentAutomated RepairMachine LearningBug Fixing...
- Oxford Anthropologist Anna Machin: Dating Apps Are Making Your Brain's "Mate Selection Algorithm" FailRelationshipsDating AppsNeuroscienceHuman EvolutionPsychology...
- Is Your Model's Attention Drifting? RUC and Tsinghua University Introduce LeaF: Pruning Distracting Tokens for Focused LearningLarge Language ModelsKnowledge DistillationModel OptimizationCausal InferenceAttention Mechanism...
- Can Models Truly "Reflect on Code"? Beihang University Releases Repository-Level Understanding and Generation Benchmark, Refreshing the LLM Understanding Evaluation ParadigmLarge Language ModelsCode ReflectionCode GenerationCode UnderstandingBenchmarking...
- ReaGAN: Empowering Each Node as an Intelligent Reasoning Expert in GraphsArtificial IntelligenceGraph Neural NetworksMachine LearningAgent-based AILarge Language Models...
- Google's Challenge: DeepSeek, Kimi and More to Compete in First Large Model Showdown Starting TomorrowAI BenchmarkingLarge Language ModelsModel EvaluationKaggle Game ArenaAI Chess...
- RAG Revolution! Graph-R1, the First RL-driven Graph Reasoning AgentGraphRAGReinforcement LearningAI AgentKnowledge GraphLarge Language Models...
- Alibaba Just Open-Sourced Qwen-Image: Free GPT-4o Ghibli-Style Model, Best in ChineseGenerative AIText-to-ImageMultimodal ModelsOpen-Source AIImage Generation...
- Replicating the AlphaGo Moment? Google Unveils New LLM Evaluation Paradigm Game Arena: Eight Models Compete, Chess King as JudgeArtificial IntelligenceLLM EvaluationChessKaggleGame AI...
- RAG Can Also Reason! Thoroughly Solving the Multi-Source Heterogeneous Knowledge ChallengeRetrieval Augmented GenerationLarge Language ModelsAI AgentsHeterogeneous DataMulti-hop Reasoning...
- Beyond Human Annotation: Meta Introduces CoT-Self-Instruct – Reshaping LLM Training with 'Reasoning-Driven Self-Evolution'Large Language ModelsAI TrainingData GenerationChain of ThoughtSynthetic Data...
- A Deep Dive: Where Does Large Model Training Time Go?Large Language ModelsPerformance OptimizationMachine Learning EngineeringHardware LimitationsDistributed Training...
- Revisiting Qwen3's Abandoned Mixed Inference ModeLarge Language ModelsAdaptive ReasoningChain-of-Thought (CoT)Reinforcement LearningModel Training...