by isaacfrond on 7/30/25, 8:37 AM with 50 comments
by thyristan on 7/30/25, 10:35 AM
Let's say you are a power user, so your queries and responses are complex and numerous, say 1000 tokens per query+response and 1 query every 10 minutes of an 8h workday. That's 48k tokens per workday, at 20 workdays per month that's 960k tokens per month.
So the cost (not sales price!) for those 960k tokens (roughly 1M) a month should be $4.50.
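As a quick sanity check on the usage arithmetic, here is a minimal Python sketch. The constant and function names are made up for illustration; the figures are the ones assumed above (1000 tokens per query+response, one query every 10 minutes, 8h workday, 20 workdays per month).

```python
# Usage assumptions from the comment; names are illustrative only.
TOKENS_PER_QUERY = 1000       # query + response combined
MINUTES_PER_QUERY = 10        # one query every 10 minutes
HOURS_PER_WORKDAY = 8
WORKDAYS_PER_MONTH = 20


def tokens_per_workday() -> int:
    """Tokens a single power user consumes in one workday."""
    queries_per_day = HOURS_PER_WORKDAY * 60 // MINUTES_PER_QUERY
    return queries_per_day * TOKENS_PER_QUERY


def tokens_per_month() -> int:
    """Tokens a single power user consumes in one month of workdays."""
    return tokens_per_workday() * WORKDAYS_PER_MONTH


print(tokens_per_workday())  # 48000
print(tokens_per_month())    # 960000
```

Changing any one constant shows how sensitive the monthly total is to the usage guess.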
Now you can go over the numbers again and think about where they might be wrong: Maybe a typical query is more than 1000 tokens. Maybe power users issue more queries. You might very well multiply by a factor of 10 here. Nvidia getting greedier on new GPUs? Add 50%. Data center and power cost too conservative, and network and storage also matter? Add 50%. 3 years of use for a GPU too long, because the field is quickly moving to ever larger models? Add 50%. Usage factor not 100%, but a more realistic 50%? Double the cost. Llama4 not good enough, so you need a more advanced model? That may produce far fewer tokens per GPU-hour, but numbers are hard to come by.
With that, it's easy to imagine that one might still lose money at $200 per month.
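To make the compounding concrete, here is a rough Python sketch that stacks the quantified adjustments on the $4.5 baseline. The names are hypothetical, the factors are just the guesses listed above, and the "more advanced model" penalty is left out because no number is given for it.

```python
from math import prod

BASE_MONTHLY_COST_USD = 4.5   # baseline cost for roughly 1M tokens per month
SUBSCRIPTION_PRICE_USD = 200  # the monthly price under discussion

# Multipliers guessed in the comment; labels are illustrative.
ADJUSTMENTS = {
    "longer or more frequent queries": 10.0,
    "pricier GPUs": 1.5,
    "data center, power, network, storage": 1.5,
    "GPU lifetime shorter than 3 years": 1.5,
    "50% utilization instead of 100%": 2.0,
}


def adjusted_monthly_cost(base: float, factors: dict[str, float]) -> float:
    """Multiply the baseline cost by every adjustment factor."""
    return base * prod(factors.values())


cost = adjusted_monthly_cost(BASE_MONTHLY_COST_USD, ADJUSTMENTS)
print(f"adjusted cost: ${cost:.2f}")                    # adjusted cost: $303.75
print("loses money:", cost > SUBSCRIPTION_PRICE_USD)    # loses money: True
```

With all five guesses applied the combined multiplier is 67.5x, which already puts the cost above $200 before accounting for a heavier model.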
For comparison, Azure sells OpenAI models in 1M-token batches, which line up directly with the monthly cost above.
https://developer.nvidia.com/blog/blackwell-breaks-the-1000-...
https://azure.microsoft.com/en-us/pricing/details/cognitive-...
by hermitcrab on 7/30/25, 9:17 AM
by ChrisMarshallNY on 7/30/25, 9:19 AM
The article says that it's a money loser, though, so I suspect that a lot of AI-based businesses run just fine at the lower-tier price point.
They might want to consider adding an “in-between” pricing tier.
by Spivak on 7/30/25, 1:29 PM
by pulse7 on 7/30/25, 9:56 AM
by desktopninja on 7/30/25, 12:57 PM
by add-sub-mul-div on 7/30/25, 1:06 PM
by bertil on 7/30/25, 9:29 AM
by glimshe on 7/30/25, 10:43 AM
It costs $200 because the chatty little bot knows a surprising number of things amazingly well, and does decent work pretty darn fast.
by lifestyleguru on 7/30/25, 10:44 AM
by joos3 on 7/30/25, 8:40 AM
by skeezyboy on 7/30/25, 10:38 AM
by jaggs on 7/30/25, 9:25 AM
by poulpy123 on 7/30/25, 12:01 PM