Simon Willison's AI Notes, May 21, 2026

How fast is 10 tokens per second really?

From Simon Willison's AI Notes: How fast is 10 tokens per second really? A neat little HTML app by Mike Veerman (source code here) that simulates LLM token output speeds from 5/second to 800/second. Useful if you see a model advertised as "30 tokens/second" and want to get a feel for what that actually looks like. Via Hacker News. Tags: ai, generative-ai, llms
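To make the advertised numbers concrete, here is a minimal Python sketch of the same idea. It is not Mike Veerman's implementation (his app is HTML and its source is not reproduced here); the function names `seconds_to_emit` and `simulate_stream` are hypothetical, and the sketch assumes, for simplicity, that one whitespace-delimited word counts as one token, which real tokenizers do not guarantee.

```python
import time


def seconds_to_emit(num_tokens, tokens_per_second):
    """Wall-clock seconds needed to emit num_tokens at a steady rate."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return num_tokens / tokens_per_second


def simulate_stream(text, tokens_per_second, sleep=time.sleep, emit=None):
    """Emit text one word at a time, pausing 1/rate seconds after each word.

    Assumption: one whitespace-delimited word == one token. Real LLM
    tokenizers split text differently, so this only approximates the feel.
    `sleep` and `emit` are injectable so the pacing can be checked without
    actually waiting. Returns the number of words emitted.
    """
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    if emit is None:
        # Default: print each word immediately, without buffering.
        def emit(word):
            print(word, end=" ", flush=True)
    delay = 1.0 / tokens_per_second
    words = text.split()
    for word in words:
        emit(word)
        sleep(delay)
    return len(words)
```

Under these assumptions, `seconds_to_emit(500, 10)` gives 50.0 seconds for a 500-token reply, while the same reply at 800 tokens/second takes 0.625 seconds. Calling `simulate_stream("some sample text to stream", 30)` prints the words at roughly the pace a "30 tokens/second" model would produce them.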


Why it matters

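This item may affect AI product capabilities, developer tool selection, or adoption timing. For specific conclusions and scope of availability, defer to the original post.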

This page is an independently compiled summary; for specific facts and scope of availability, refer to the original publication.
