Math Reasoning Dataset

APTO Releases Training Dataset to Enhance the Mathematical Reasoning Capabilities of Large Language Models (LLMs)

TOKYO, Sept. 30, 2025 /PRNewswire/ -- As generative AI use continues to increase, accuracy has become the most important metric and a key factor in decisions around adoption and utilization. APTO is ...

VentureBeat

Meet LLEMMA, the math-focused open source AI that outperforms rivals

Join the event trusted by enterprise leaders for nearly two decades. VB Transform brings together the people building real enterprise AI strategy. Learn more In a new paper, researchers from various ...

VentureBeat

New open-source math model Light-R1-32B surpasses equivalent DeepSeek performance with only $1000 in training costs

Researchers have introduced Light-R1-32B, a new open-source AI model optimized to solve advanced math problems. It is now available on Hugging Face under a permissive Apache 2.0 license — free for ...

EurekAlert!

MathEval: a comprehensive benchmark for evaluating large language models on mathematical reasoning capabilities

This study introduces MathEval, a comprehensive benchmarking framework designed to systematically evaluate the mathematical reasoning capabilities of large language models (LLMs). Addressing key ...

Searchenginejournal.com

OpenAI Secretly Funded Benchmarking Dataset Linked To o3 Model

OpenAI secretly funded and had access to a benchmarking dataset, raising questions about high scores achieved by its new o3 AI model. Revelations that OpenAI secretly funded and had access to the ...

Hosted on MSN

Math lovers, your ultimate problem-solving playground awaits

Global math archive: MIT and collaborators launched MathNet, the largest Olympiad-level problem collection, spanning 47 ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results