Researchers from Meta and Google built AutoTTS to automatically discover optimal LLM reasoning strategies, cutting token ...
New research on so-called “negation neglect” finds that LLMs in a roughly analogous situation don’t behave that way. They appear to learn from the statistical patterns in their training text more than ...
DeepSWE puts GPT-5.5 atop the AI coding leaderboard while raising new questions about Claude Opus, SWE-Bench Pro, and ...
OpenBMB's 1B-parameter model MiniCPM 5 brings MCP support and agentic tool use to on-device AI, but it has trouble with logic ...