What's your monthly LLM bill? How Much Are You Really Spending.

5mo ago

8 replies

We’re a data company that recently interviewed over 60 large enterprise VPs and 70+ startup engineers and CTOs. Here’s what’s intriguing: Despite the massive hype around Large Language Models (LLMs), some of the largest tech enterprises (5k+ employees) report LLM bills of less than $1000 per month. Meanwhile, some startups are burning through over $100k on LLMs like OpenAI, Claude, or even self-hosting models like LLaMA. Is this a case of enterprises being cautious, or are they simply seeing through the LLM hype that startups are buying into? Are you spending big on LLMs, or are you getting by with cheaper alternatives like LLaMA-CPP and other quantized models that run on CPUs? As a prelaunch team, we started with running llama cpp on a macbook to $500-$1k monthly openai bills over the past 6 months. How much are you spending and what are your cost saving tricks!?

Replies

zane @zane12580

Now the server is deployed, and now almost the bill is zero

Aug 20, 2024

Steven Shen @stevenisnull

Some of our team's internal applications, such as data analysis and automation programs, consume about $200 per month on openAI. The new project we are working on is expected to consume $200-300 per day.

Aug 21, 2024

Lihong Hicken @lihong_hicken

Theysaid

the cost of AI is high for us but what is more, is the annoying rate limit and we are slowing increasing rate limit. Not ideal.

Aug 20, 2024

Tariq Waseem @tariq_waseem

I am spending a lot, but my monthly bill is zero, because I use free version.

Aug 20, 2024

Naruto Asao @naruto_asao

FESCO Online Bills is committed Electricity Bill to revolutionizing the way users handle their electricity bills by offering a seamless, user-centric experience.

Oct 10, 2024

Daniel James Harris @danieljamesharris

My LLM bill used to be crazy high, like hundreds per month. But then I optimized my prompts, fine-tuned my model, and moved to a cheaper provider. Now it's way more manageable, under $50/mo usually. Def took some trial and error to get the cost down while keeping the performance I needed tho.

Aug 23, 2024

Steven Granata @deleted-7465902

consider alternatives like quantized models for cost savings.

Aug 21, 2024