🏆 Model Benchmark Picker

Which LLM wins for your task?

Your task-weighted ranking (Jan 2026)
Scores synthesize public benchmarks (HumanEval, MMLU, MT-Bench, SWE-bench, LongBench) weighted by your task.

Pick the best LLM for YOUR task — not just the global leaderboard winner.

How to use this tool

  1. Pick your task type

    Code, reasoning, creative, long-doc, multilingual.

  2. Set budget + latency

    Constraints for your use case.

  3. See ranked models

    Not just MMLU: task-weighted picks (see the sketch after this list).
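
A minimal sketch of those three steps in TypeScript, assuming invented model entries, benchmark weights, and field names; the tool's actual data, weights, and scoring logic may differ:

```typescript
// Hypothetical sketch only: the weights and fields below are invented
// for illustration, not the tool's real dataset or scoring code.

type Task = "code" | "reasoning" | "creative" | "longDoc" | "multilingual";

interface Model {
  name: string;
  scores: Record<string, number>; // benchmark name -> normalized 0-100 score
  pricePerMTok: number;           // USD per million tokens (assumed field)
  p50LatencyMs: number;           // median response latency (assumed field)
}

// Illustrative weights: which benchmarks matter most for each task type.
const TASK_WEIGHTS: Record<Task, Record<string, number>> = {
  code:         { HumanEval: 0.5, "SWE-bench": 0.4, MMLU: 0.1 },
  reasoning:    { MMLU: 0.6, "MT-Bench": 0.4 },
  creative:     { "MT-Bench": 0.8, MMLU: 0.2 },
  longDoc:      { LongBench: 0.7, MMLU: 0.3 },
  multilingual: { MGSM: 0.7, MMLU: 0.3 },
};

// Step 1 picks `task`; step 2 sets the budget and latency caps.
function rankModels(
  models: Model[],
  task: Task,
  maxPricePerMTok: number,
  maxLatencyMs: number,
): Model[] {
  const weights = TASK_WEIGHTS[task];
  return models
    // Step 2: enforce the budget and latency constraints.
    .filter((m) => m.pricePerMTok <= maxPricePerMTok && m.p50LatencyMs <= maxLatencyMs)
    // Step 3: rank by task-weighted benchmark score, best first.
    .sort((a, b) => weightedScore(b, weights) - weightedScore(a, weights));
}

function weightedScore(m: Model, weights: Record<string, number>): number {
  let total = 0;
  for (const [bench, w] of Object.entries(weights)) {
    total += w * (m.scores[bench] ?? 0); // missing benchmark scores count as 0
  }
  return total;
}
```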

Frequently Asked Questions

Where does the data come from?
Scores are aggregated from public benchmarks (MMLU, HumanEval, SWE-bench, MT-Bench, LongBench, MGSM) as of January 2026, weighted by task relevance. Not real-time.
Why not just use leaderboards?
Leaderboards average across all task types. The best coding model may be mediocre at creative writing. We weight scores by YOUR task, not the global average.
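
To make the difference concrete, here is a toy comparison with invented scores (not real models or benchmark results):

```typescript
// Invented scores, purely to illustrate the weighting effect.
const a = { humanEval: 90, mtBench: 60 }; // strong coder, weaker chat
const b = { humanEval: 70, mtBench: 85 }; // weaker coder, stronger chat

const average   = (m: typeof a) => (m.humanEval + m.mtBench) / 2;
const codeScore = (m: typeof a) => 0.8 * m.humanEval + 0.2 * m.mtBench;

console.log(average(a), average(b));     // 75 vs 77.5 -> leaderboard favors b
console.log(codeScore(a), codeScore(b)); // 84 vs 73   -> for coding, a wins
```
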
Is the pricing accurate?
Yes, as of January 2026. Always verify current pricing, since providers adjust it monthly. This tool is decision-support, not billing-grade.

🔒 100% Privacy. This tool runs entirely in your browser. Your data is never uploaded to any server.