Simon Willison's AI Notes · May 21, 2026

How fast is 10 tokens per second really?

Simon Willison's AI Notes highlights a neat little HTML app by Mike Veerman (source code is linked from the original post) that simulates LLM token output speeds from 5 to 800 tokens per second. It is useful if you see a model advertised as "30 tokens/second" and want to get a feel for what that actually looks like. Via Hacker News. Tags: ai, generative-ai, llms
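To get a rough feel for the same idea in a terminal, here is a minimal Python sketch. It is not Mike Veerman's implementation. The function names (`seconds_per_token`, `total_seconds`, `stream`) are hypothetical, and it assumes whitespace-separated words are a fair stand-in for tokens, which real tokenizers only approximate.

```python
import time
from typing import Callable, Iterable, Optional


def seconds_per_token(tokens_per_second: float) -> float:
    """Delay between consecutive tokens at a given rate."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return 1.0 / tokens_per_second


def total_seconds(token_count: int, tokens_per_second: float) -> float:
    """Time needed to emit token_count tokens at a steady rate."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return token_count / tokens_per_second


def stream(
    tokens: Iterable[str],
    tokens_per_second: float,
    sleep: Callable[[float], None] = time.sleep,
    write: Optional[Callable[[str], None]] = None,
) -> int:
    """Emit tokens one at a time, pausing after each; returns the count emitted.

    Assumption: a constant rate with no initial latency, which real
    model servers do not guarantee.
    """
    if write is None:
        # Flush after every token so the pacing is visible on screen.
        write = lambda text: print(text, end="", flush=True)
    delay = seconds_per_token(tokens_per_second)
    emitted = 0
    for token in tokens:
        write(token)
        sleep(delay)
        emitted += 1
    return emitted


if __name__ == "__main__":
    sample = "A model advertised at thirty tokens per second looks like this. " * 5
    # Treat each word (plus a trailing space) as one "token" for illustration.
    words = [word + " " for word in sample.split()]
    stream(words, tokens_per_second=30)
    print()
```

Changing `tokens_per_second` to values such as 5, 10, or 800 shows the range the app covers. Under the same constant-rate assumption, `total_seconds(300, 30)` gives the time a 300-token reply would take at the advertised 30 tokens/second.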


Why it matters

Advertised tokens-per-second figures can shape how developers judge AI product capabilities, which models they choose, and when they adopt them. Review the original source for exact claims and availability.

This page is an independent summary. Facts and availability should be verified in the original publication.
