Simon Willison's AI Notes

How fast is 10 tokens per second really?

Media coverage published by Simon Willison's AI Notes: How fast is 10 tokens per second really? Neat little HTML app by Mike Veerman (source code here) which simulates LLM token output speeds from 5/second to 800/second. Useful if you see a model advertised as "30 tokens/second" and want to get a feel for what that actually looks like. Via Hacker News. Tags: llms, ai, generative-ai
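For a rough feel of what a fixed token rate means in practice, here is a minimal Python sketch of the same idea. It is not Mike Veerman's code. The function names `seconds_to_stream` and `stream_tokens` are hypothetical, and the sketch assumes a perfectly steady rate, which real models do not guarantee.

```python
import time
from typing import Callable, Iterable, Iterator


def seconds_to_stream(token_count: int, tokens_per_second: float) -> float:
    """Wall-clock seconds needed to emit token_count tokens at a steady rate."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return token_count / tokens_per_second


def stream_tokens(
    tokens: Iterable[str],
    tokens_per_second: float,
    sleep: Callable[[float], None] = time.sleep,
) -> Iterator[str]:
    """Yield tokens one at a time, pausing 1/rate seconds before each one.

    The sleep function is injectable so the pacing can be checked
    without actually waiting (an assumption made for testability).
    """
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    delay = 1.0 / tokens_per_second
    for token in tokens:
        sleep(delay)
        yield token


# A 300-token answer at the advertised "30 tokens/second" takes 10.0 seconds.
print(seconds_to_stream(300, 30))
```

To watch it in a terminal, loop over `stream_tokens(text.split(), 10)` and print each item with `end=" "` and `flush=True`. Splitting on whitespace is only a stand-in for real tokenization, since actual tokens are usually shorter than words.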

Read the original

Why it matters

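This coverage may affect AI product capabilities, developer tool selection, or adoption timing. For specific conclusions and the scope of availability, defer to the original post.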

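This page is an independently compiled summary. For specific facts and the scope of availability, refer to the original published content.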
