Machine Learning

NVIDIA Introduces ProRL: Long-Horizon Reinforcement Learning Boosts Reasoning and Generalization

June 6

RXTX: A Machine Learning-Guided Algorithm for Efficient Structured Matrix Multiplication

2 weeks ago

Meta Researchers Introduced J1: A Reinforcement Learning Framework That Trains Language...

2 weeks ago

Machine Learning

Malaysia withdraws plan to deploy Huawei AI servers

2 weeks ago

Malaysia withdraws plan to deploy Huawei AI servers

Omni-R1: Advancing Audio Question Answering with Text-Driven Reinforcement Learning and Auto-Generated...

2 weeks ago

LLMs Struggle to Act on What They Know: Google DeepMind Researchers...

2 weeks ago

Reinforcement Learning Makes LLMs Search-Savvy: Ant Group Researchers Introduce SEM to...

2 weeks ago

DanceGRPO: A Unified Framework for Reinforcement Learning in Visual Generation Across...

3 weeks ago

Georgia Tech and Stanford Researchers Introduce MLE-Dojo: A Gym-Style Framework Designed...

3 weeks ago

Meta AI Introduces CATransformers: A Carbon-Aware Machine Learning Framework to Co-Optimize...

3 weeks ago

Machine Learning

Taiwanese electronics maker invests $85m in improving AI servers

3 weeks ago

Taiwanese electronics maker invests $85m in improving AI servers

1 2 3 4 5 Page 2 of 5

Featured

NVIDIA Introduces ProRL: Long-Horizon Reinforcement Learning Boosts Reasoning and Generalization

7 hours ago

Top Artificial Intelligence AI Books to Read in 2025

7 hours ago

Salesforce AI Introduces CRMArena-Pro: The First Multi-Turn and Enterprise-Grade Benchmark for...

7 hours ago

From Clicking to Reasoning: WebChoreArena Benchmark Challenges Agents with Memory-Heavy and...

7 hours ago

7 hours ago

NVIDIA Introduces ProRL: Long-Horizon Reinforcement Learning Boosts Reasoning and Generalization

Recent advances in reasoning-focused language models have marked a major change in AI by scaling test-time computation. Reinforcement learning (RL) is crucial in developing reasoning capabilities and mitigating reward hacking pitfalls. However, a fundamental debate remains: whether RL provides new reasoning capabilities from a base model or just helps...