API Performance Benchmark

Kimi K2.7-Code cuts thinking tokens 30% — but practitioners say the benchmarks don't check out

Kimi K2.7-Code claims 30% fewer thinking tokens and a drop-in API swap path, but independent benchmarks show kernel ...

KushoAI Benchmark Finds AI Coding Tools Struggle With Complex API Bugs

This report follows KushoAI's earlier launch of APIEval-20, the industry's first open benchmark for evaluating AI agents on ...

OfficeChai

OpenRouter Launches Fusion API, Which Uses A Combination Of Models To Achieve Fable-Like Performance At Half The Price

Anthropic’s Fable and Mythos models have been withdrawn following US export controls, but OpenRouter might have a solution. The company has launched ...

11h

Fusion API delivers frontier-level AI performance at half the cost, says OpenRoute

Fusion uses multiple AI models together to improve output quality at lower cost. OpenRouter debuts Fusion, a multi-model composite AI Fusion outperforms single models, scoring 69% on DRACO benchmark ...

13d

MiniMax-M3 debuts, eclipsing GPT-5.5 and Gemini 3.1 Pro on key benchmark performance for just 5-10% of the cost

M3 demonstrates that the next phase of agent development will not just be driven by larger datasets, but by efficient ...

Morningstar

KushoAI Unveils APIEval-20 to Benchmark AI Agents in API Testing

SAN FRANCISCO, April 8, 2026 /PRNewswire/ -- KushoAI, an AI-native platform for API testing and software reliability, has introduced APIEval-20, an open benchmark designed to evaluate how effectively ...

ZDNet

Databricks' TPC-DS benchmarks fuel analytics platform wars

As data sources and volumes grow, and as a data-driven orientation is increasingly deemed to be a competitive necessity, the war between platform vendors to provide the primary repository for our data ...

Hosted on MSN

Grok Voice Agent API sets a new benchmark for real-time audio AI

Today marks an exciting moment for the developer community as xAI officially introduces the Grok Voice Agent API, opening the door for anyone to build powerful, real-time voice agents with ease.

TweakTown

UL announces its 3DMark benchmark suite now runs natively on macOS, using Metal API

TL;DR: UL has launched a full native 3DMark benchmark suite for macOS, eliminating iOS frame rate limits and enhancing performance testing on powerful Macs. It includes advanced benchmarks like Steel ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results