HN comments for: Accident Forgiveness

bitbasher

53 replies

23h26m

2024-08-23 19:03:23 UTC

There is a very obvious fix for surprise billing. Enforce a billing cap and terminate service if it's met. Even better if you send alerts when the cap is approaching.

If I pay $39/month, a default cap should be $39 per-month. Otherwise, let me set a cap I am comfortable with.

Surprise billing is never good for customers, only the business.

tptacek

42 replies

23h22m

2024-08-23 19:07:30 UTC

I promise, you are not the first person to have thought of this, and, believe it or not, there are reasons other than malice and avarice that cloud providers don't terminate service based on billing caps. Terminating service is a big deal.

We agree completely about surprise billing.

karmajunkie

18 replies

23h17m

2024-08-23 19:12:03 UTC

The only thing in cloud that's worse than a surprise bill is a surprise outage.

lkbm

12 replies

23h5m

2024-08-23 19:23:49 UTC

Depends. For my hobby project that hasn't been monetized, being charged $100k is way worse than it going down.

If it were critical infrastructure, or monetized in a way that brought in revenue to cover the charge, then maybe I don't want it to shut down despite skyrocketing costs, but that's hardly the only situation you could be in.

tptacek

10 replies

23h1m

2024-08-23 19:28:33 UTC

Nobody is going to charge you $100k.

immibis

6 replies

22h50m

2024-08-23 19:39:34 UTC

(ignores the multiple hobbyists who accidentally got charged $100k)

tptacek

4 replies

22h46m

2024-08-23 19:43:29 UTC

My belief, and you can correct me on this if you have evidence to the contrary, is that nobody actually pays those bills.

withinboredom

3 replies

22h23m

2024-08-23 20:05:46 UTC

Someone paid those bills to somebody. AWS has costs incurred from it (support, opportunity, etc) and they paid them.

bee_rider

2 replies

21h52m

2024-08-23 20:36:51 UTC

From that point of view, the cap seems more like a common-sense self defense feature that the cloud provider would want to implement. But we have cloud providers in this thread saying they don’t want to implement caps, so, I dunno.

withinboredom

1 replies

8h59m

2024-08-24 09:29:54 UTC

From my technical understanding from a few friends who may or may not know what they are talking about, the caps issue has to do with the processing delay between billing events being emitted and understood. That by the time the billing events have been processed and action taken, the damage would already be done in all but the most extreme cases.

Secondly, the best solution is to simply stop everything. But now the customer has to cold-start their entire infrastructure, which may actually cost more than paying the bill.

Thirdly, it is likely that customers will set a billing limit and then forget about it years later. Suddenly, they've got a complicated infrastructure setup spanning the globe. They finally hit a scale where they hit their billing limit that they had completely forgotten that Bill configured in their early days (who doesn't even work there anymore). Suddenly, the entire global infrastructure is shut down in the middle of the night.

That's the gist of what my friends said.

tptacek

0 replies

2h30m

2024-08-24 15:59:35 UTC

I'm just going to refer people to this comment.

srockets

0 replies

22h35m

2024-08-23 19:54:26 UTC

Getting charged is not the same as paying those charges. I'm willing to bet real money that all who, in good faith, went and asked not to pay those bills ended up not paying them (if they didn't ask, that's a separate issue).

bitbasher

2 replies

22h40m

2024-08-23 19:49:16 UTC

Perhaps that is true, but the stress and anxiety from seeing 100K bill is real and has an impact.

https://www.forbes.com/sites/sergeiklebnikov/2020/06/17/20-y...

tobyjsullivan

0 replies

21h42m

2024-08-23 20:47:01 UTC

If you're stressed over lighting $100k on fire, then you're not the target market. /s

gary_0

0 replies

20h54m

2024-08-23 21:35:30 UTC

Just in case you don't know, the person you're replying to is the author of the blog post, and they mention in it that being stressed out by (potential) unexpected large bills is something they are aware of.

Although even then, "we will only threaten to charge you $100k without meaning it" isn't much of a reassurance.

brundolf

0 replies

22h47m

2024-08-23 19:41:59 UTC

But more importantly: just let the customer decide! Let them decide whether there's a threshold where an outage is less costly than the hosting, and what that threshold is

moring

4 replies

22h48m

2024-08-23 19:40:51 UTC

This made me think. There is usually some "hard" cost limit X that you cannot / don't want to afford, so terminating the service is preferable.

There is also usually some "soft" limit Y < X that you don't want to exceed, and don't plan to exceed, but you'd rather pay >Y than face an outage.

But a hard limit would have to be set to X to avoid that outage, and if it gets exceeded, you'll face a bill of X and an outage.

So what a customer would actually need is to specify both X and Y, with the rule: If the cost would exceed X, then terminate it early so the cost doesn't actually exceed Y.

Sounds complicated to implement, but then, the current practice of waiving the bill is complicated too if you tried to formalize it.

(For the sake of this discussion, I'm ignoring all the technical difficulties of terminating a high-availability service at all.)

jameshart

1 replies

20h37m

2024-08-23 21:51:48 UTC

It would, of course, be just mean and unscrupulous for a cloud vendor to look at the number you have set as being ‘the absolute most I am willing to pay for this service’ and then optimize their pricing offer to you specifically to make sure they go right up to that line and no further.

moring

0 replies

9h56m

2024-08-24 08:33:38 UTC

I didn't mean to imply that. A hard limit at X that stays inactive <X, and at >X, leaves you with a bill of X and an outage is the easiest approach from a technical side: Terminate the service when X is reached, and bill exactly what was provided. It is something you would instantly come up with when asked to implement a cost limit, and you don't for a second put yourself in the customer's position.

Of course cloud vendors do put themselves in the customer's position, and that's why they say that customers would not be happy with a limit, even though they are asking for it.

bee_rider

1 replies

21h43m

2024-08-23 20:45:49 UTC

Is this a soft limit or a trajectory prediction? I think there isn’t such a thing as a soft limit. Nobody wants to spend any money really right? But you need to spend some to avoid losing service. That’s just a cost you don’t like but need to pay.

I definitely get the idea of: I don’t want to spend X so if it looks like I will, terminate service at Y. But I think that’s a special case of the general situation, I want to know how much I’m on track to spend, right?

But I don’t know much about this at all. My whole experience was accidentally getting my own personal self a $500 AWS charge and then deciding they cloud services were dumb.

moring

0 replies

9h34m

2024-08-24 08:55:21 UTC

Is this a soft limit or a trajectory prediction?

I don't know. I just tried to frame the problem from a customer's point of view, because cloud vendors' statement that customers would not like a limit is (IMHO) limited by their POV. Customers do want a limit, but not the way that cloud vendors would implement it. I think a huge part of the problem is understanding what exactly it is that you need when you use a cloud service. (This is varying from customer to customer, and from service to service, of course. You usually have important services that must be running, and others where an outage would be unpleasant but not critical.)

Nobody wants to spend any money really right? But you need to spend some to avoid losing service. That’s just a cost you don’t like but need to pay.

That is not the issue. From a customer's POV, I would be ready to spend extra to keep the service running, but there is a limit where I'd prefer an outage because I can't bear that much. There are two problems with that: First, the limit is blurry. Second, a simple hard limit would leave me with a huge bill AND an outage. I would want to be able to choose one of those evils, not be left with both. And these two problems compound.

But I don’t know much about this at all. My whole experience was accidentally getting my own personal self a $500 AWS charge and then deciding they cloud services were dumb.

I don't think they are dumber than the alternative. If you run your own hardware, you have a hard limit in both cost and computing power. You could technically get that with the cloud too, but it is not usually offered because it doesn't really solve the problem, but neither does it for for your own hardware.

That said, it would be nice if the major clouds would offer a "hard limit" option, but it really only works for "unimportant" applications that are cost-sensitive and can take an outage.

tmnvix

4 replies

19h57m

2024-08-23 22:32:29 UTC

My preferred option would be to have an optional billing cap that I can enable knowing full well that if it is exceeded the service would be terminated (obviously with notifications as that cap is approached). I could then apply this to simple hobby projects and such, while not having the risk of termination apply to more serious applications (though a 'soft cap' would be nice here so that I could still receive notifications as it approaches).

jiveturkey

2 replies

17h37m

2024-08-24 00:51:54 UTC

As a certain kind of user, you probably do think that. But I also think I should be able to have a root level admin account without MFA. The consensus is that no, that should not be up to the customer.

It's different here, sure, but the providers optimize for not letting customers shoot themselves in the foot, and remediation via bill forgiveness is a fine solution -- from the provider POV.

genewitch

0 replies

15h22m

2024-08-24 03:07:26 UTC

I don't have MFA on my root level account, is it because my account is 16 years old or so at this point? Like my personal AWS account is tied directly to my "order more dish soap" amazon account, because that's how it worked back then, i guess.

franga2000

0 replies

11h4m

2024-08-24 07:24:49 UTC

You should be able to have a root level admin account with no 2FA! I would print mine and keep it in a tamper-evident envelope in a safe at my lawyer's office with instructions for when and who can get it.

A company isn't liable if their customer gets themselves hacked because they decided to not use any of the many MFA options available to them and neither is a company liable if the customer set a billing limit rule that they executed correctly.

Companies can simply not be trusted to tell the difference between a foot-gun and a..whatever a good kind of gun would be...

maccard

0 replies

54m

2024-08-24 17:34:47 UTC

aws has billing alerts that trigger lambdas.

aws ec2 describe-instances --query 'Reservations[].Instances[].InstanceId' --output text | xargs -n1 aws ec2 stop-instances --instance-ids

Will stop all ec2 instances.

The real fix is scoping credentials on aws - if you don’t use an account or role with limited permissions then even if they had this toggle the first step in an attack would be to disable this option.

aidos

4 replies

23h5m

2024-08-23 19:24:06 UTC

Having flashbacks to the time where we had paid for a server and were paying for rack space for a customer and they were refusing to pay their bill. Our lawyers told us in no uncertain terms that turning off the server would be a terrible idea. “Obstruction of service” is the term that comes to mind.

bitbasher

1 replies

23h3m

2024-08-23 19:26:34 UTC

Cloud squatting?

psd1

0 replies

5h46m

2024-08-24 12:43:09 UTC

Node eviction

fauigerzigerk

0 replies

6h19m

2024-08-24 12:10:21 UTC

Surely it depends on the type of contract. Prepaid SIM cards stop working the second you run out of credit.

alpinisme

0 replies

20h51m

2024-08-23 21:38:24 UTC

While the parent point about cloud providers having arrived at their policies thoughtfully, this particular issue is likely not part of the equation. There are plenty of services that run on a quota system (chat gpt, sentry, etc). There is a difference between shutting off a service the customer reasonably expected to be always on and shutting off a service when it reaches a threshold set by the customer as part of their purchase. The former is more like repossessing a physical good willy-nilly if the customer misses a payment or you find a check bounces…you can’t do that.

umanwizard

2 replies

18h51m

2024-08-23 23:37:47 UTC

Okay, what are the reasons?

ec109685

1 replies

18h27m

2024-08-24 00:02:34 UTC

Big spike of real traffic and your site / app / database / system goes down.

umanwizard

0 replies

18h20m

2024-08-24 00:09:06 UTC

Only if you chose to configure an unreasonably low cap.

tgsovlerkhgsel

2 replies

18h31m

2024-08-23 23:58:18 UTC

Terminating service is a big deal for commercial customers' production environments.

Reversibly (i.e. shut down compute, don't delete anything, allow the customer to review, fix and reinstate quickly) terminating service is a minor annoyance for hobby/experimental setups, and in those, it's much more preferable than having to open a support ticket to deal with a massive bill.

Having quotas that the customer can increase themselves (but has to manually choose to increase) on storage prevents storage related surprise bills, and the rest you can shut down (optionally, make the user choose up front what they would prefer).

What am I missing? Too many commercial customers picking "experimental" initially and forgetting to change it?

solidasparagus

1 replies

11h7m

2024-08-24 07:22:19 UTC

No one really cares enough about hobby developers as a customer segment to rebuild the billing infrastructure to make this possible. Scalable billing at huge scale is solved at the cost of latency and being "eventually correct" (unless it has changed). To add a price cap feature, eventually correct isn't enough and then you ask yourself who would actually use it and you have to scroll really far down your list of biggest customers until you get to someone who wants it.

tgsovlerkhgsel

0 replies

6h34m

2024-08-24 11:55:22 UTC

The problem with not caring about hobby developers is that it means developers won't be as familiar with your cloud environment when time comes to pick one for the next "real" project.

I would also expect a price cap feature to be useful for experimental/no-approval-required projects at work. In fact, if I ran a cloud project for work as a small team-internal project, a cost explosion would become an even bigger bureaucratic nightmare than if it happened at home.

wokwokwok

1 replies

12h31m

2024-08-24 05:58:23 UTC

Let's not sugar coat it.

The problem is 100% technical. Detecting unexpected charges, scaling and restriction in real time is hard.

It's easier to just charge people money than come up with good ways to avoid charging them, and deal with edge cases as a manual process.

Sure. I get it. What company has an internal team that's like "ooo... lets find ways to cap the amount of money people pay us".

No one.

That's why.

there are reasons other than malice and avarice

Right.

It's just avarice. There's no other reason.

tptacek

0 replies

12h23m

2024-08-24 06:05:53 UTC

The problem is 100% technical. Detecting unexpected charges, scaling and restriction in real time is hard.

It's just avarice. There's no other reason.

sigseg1v

1 replies

1h43m

2024-08-24 16:46:37 UTC

For many company's use cases (make EC2s a few times a year, otherwise leave them running) I would imagine they would really feel better if there was an option that said "if charges incurred is over (configurable threshold you set to 10k higher than your normal monthly spend), then robocall the customer, and if they do not reply after 3 automated calls then block all additional AWS API calls that would incur more than 10% of my monthly spend if left running for 24h."

This would still allow all production services to run, but would stop someone from spinning up 200 crypto miners. I'm sure AWS is capable of implementing this, and I don't want to say it's "easy" but I would be shocked if they lacked the technical expertise to do this.

maccard

0 replies

1h10m

2024-08-24 17:19:24 UTC

We’ve now gone from hard billing caps to “soft billing with alerting”.

If you look at lots of these threads you’ll see that many people don’t want to provide phone numbers, lots of people ignore emails, even repeated ones, directly to them, from billing.

This isn’t a technical problem, it’s a service problem. I can see the hn posts already “my site went viral and <HOST> shut me down”

figassis

1 replies

8h53m

2024-08-24 09:36:36 UTC

Here's the thing. We have apis for everything and their grandmother. You create an instance and there are apis for adding tags, labels, nicknames...but not for spending caps? I understand I don't know all of the complexities involved, but if you can bill by the second or by the hour, you can certainly alert by the same metrics.

We have been measuring CPU, MEM with extreme granularity, how about considering price as a resource and measuring the same way, so that a service with a price cap can self manage and self terminate according to some priority field?

This might not be the actual solution, but we have been at this for a very long time, seems like there is not even a hint of an attempt at solving it by the giants. This is about incentives, sorry.

mnahkies

0 replies

8h15m

2024-08-24 10:14:06 UTC

I was pleasantly surprised when I was messing around with the Google maps API and found I was able to adjust a quota to put an upper cap on daily spend.

It made me feel much more comfortable hacking around and not needing to worry that I'd accidentally create a render loop or something that could rack up a bill whilst I wasn't looking

hinkley

0 replies

21h47m

2024-08-23 20:41:44 UTC

When you build an app for resiliency you end up with classes of service where the app fails in stages.

But to extend that to the billing case, you’d have to have a partnership with your customers, not just a dashboard where they push buttons and an API where you add or delete machines.

Maybe the website goes read only except for admin traffic when the budget is exceeded, for instance. Not as a bespoke process each company has to reinvent, but as functionality provided by the vendor.

tonnydourado

4 replies

21h27m

2024-08-23 21:01:54 UTC

When you're paying $39/month for something that generates $0/month, that is a very sensible policy.

When you're paying $50,000/month for something that generates $200,000/month in value, or if an outage can generate $100,000/month in costs, or if the people that can fix an outage cost $100,000/year, then it's not.

bitbasher

3 replies

21h21m

2024-08-23 21:07:53 UTC

Then that company's threshold for "sensible monthly cost" would be a lot higher. 500k? 1m? Give them the option to set something.

foota

2 replies

19h9m

2024-08-23 23:19:56 UTC

Why would you prefer this when the alternative is no downtime and the provider forgiving the bill?

yyhhsj0521

0 replies

18h57m

2024-08-23 23:31:45 UTC

provider forgiving the bill

That eventually will be factor into the price like credit card fraud insurance. Better have it be more transparent.

naniwaduni

0 replies

17h26m

2024-08-24 01:02:41 UTC

Because you don't want to be relying on the service provider's whim in "forgiving" the bill?

lo3p0ez

1 replies

14h28m

2024-08-24 04:00:56 UTC

Yes, and it shouldn't be too hard to implement at the business side. Deno just announced it: https://deno.com/blog/deploy-spend-limits

tptacek

0 replies

14h24m

2024-08-24 04:04:44 UTC

(on their Pro plan)

I'm not talking it down. Maybe people are right about this. We'll see.

solatic

0 replies

14h15m

2024-08-24 04:14:23 UTC

Because data storage costs money, hard billing caps require deleting both your data and its backups to stay under the hard cap. There are very few use cases where that's actually acceptable, not even development environments where people will get upset for losing their work that they haven't pushed to somewhere else yet.

mcbain

0 replies

19h25m

2024-08-23 23:04:02 UTC

It's easy for (say) AWS to terminate your EC2 instances. Do they also delete your DB backups? Delete your S3 buckets?

All of these incur costs. How hard a cap do you want?

klabb3

0 replies

1h29m

2024-08-24 17:00:20 UTC

Yes, but I imagined that you’d have a ceiling for rolling average / token bucket over a sensible time window, with two limits:

- a soft alert limit, which you set to the threshold of “hmm something is wrong but we’ll bear the cost until we figure it out”

- a hard limit which fails until more tokens trickle back in, without shutting down service

I really don’t want to rely on forgiveness, it’s just encouraging reckless behavior and submitting to the incomprehensibility of cloud pricing.

Everyone wants these limits, why not design products with that in mind from the get go? It feels like such an afterthought

osbulbul

38 replies

23h34m

2024-08-23 18:54:59 UTC

the biggest reason of I am not using aws, google cloud, vercel etc... for my personal projects is surprise bills. I am not earning anything from them so I can only put $50 but can't afford $500 or $5000. so still i will feel much more secure if I can put hard limit and absolutely sure my bill will never go above $50. (or cloud billing insurance :))

but you got my attention, i can try fly.io

tptacek

32 replies

23h31m

2024-08-23 18:57:55 UTC

Except for things like student accounts, which we're working on, I don't think hard limits are coming any time soon. Our expectation is that the enforcement of hard billing limits would mostly make customers furious with us. If you read to the end of the post, the direction we're going is preemptive detection of weird billing spikes, so you don't even have to notice and ask us.

If you're really only looking to spend $50, we should put our cards on table and say that we're generally not making product pricing decisions with you in mind. If your needs are pretty straightforward, there are hosting providers that will do a better job of serving that business than we will.

osbulbul

26 replies

23h22m

2024-08-23 19:06:53 UTC

yeah I am pretty sure no cloud provider thinks people who can willing to spend small money. but guess what? if one of my projects start earning money, then I will continue to what I know best :) I think that's why every cloud provider still has configs for couple of $

tptacek

25 replies

23h17m

2024-08-23 19:12:13 UTC

We think all the time about people starting small projects here. I think a really good line to draw, if you want to understand how we think about this stuff, is between people who would be OK with us terminating their service because they mispredicted what their cap should be, and people who need their stuff to stay up and running and will just talk to support if they're concerned about billing. A shorter way to say this is that we making product pricing decisions for people who are doing this work professionally.

gwbas1c

23 replies

23h4m

2024-08-23 19:24:57 UTC

One thing I'm really curious about is why caps are so hard? (Perhaps this would result in a more technical blog post?)

IE, you clearly don't want to terminate or shut down an account if they get too close to a cap. But what about things like a warning email, service slowdown, ect?

Likewise, the old "slashdotted" or "hug of death" might be an appropriate result when something goes beyond a reasonable safety buffer?

Anyway, just curious. It's clear that it's a complicated topic, and the real constraints and challenges are interesting.

donavanm

13 replies

17h52m

2024-08-24 00:37:37 UTC

Metering, pricing, and billing is way more complicated than you might assume. There are historical posts here with more details if you search. In short its all async, theres variable lag, theres multiple “types” or dimensions to metering, the prices vary by SKU + customer + previous metering or billing value + other SKU usage, and billing is not uniform across customers. Imagine needing to accumulate all the metering events, apply pricing plans, modify price based on accumulated value, then periodically recalculate it all to compute a bill, then apply more transforms and credits, and that gets to the billable price.

Now evaluate your “cap” rules (which will be just as complex) and feed that back to the actual admin/control plane of the service.

naniwaduni

12 replies

16h27m

2024-08-24 02:02:20 UTC

No, this is motivated reasoning brainrot. It's overcomplicating the problem by hyperfocusing on a specific implementation that's already been judged infeasible to justify not doing it.

The actual problem that people want solved is "the customer wants predictable, budgetable upper bound periodic cost". You are not unique in offering a service where this is a desirable property. Realigning this sort of cost structure is the bread and butter of insurance industry, and no, as much as they'd like to, they don't actually do it by making sure to stop the earthquake before it knocks down more of your house than your price cap.

tptacek

9 replies

16h0m

2024-08-24 02:28:56 UTC

No, this is a much harder problem than you think it is; it's, like, CAP-theorem problematic. I think you should do the exercise of working through how these billing systems work.

naniwaduni

8 replies

15h49m

2024-08-24 02:39:54 UTC

I reiterate that you are not unique in offering a service where the customer's desire for consistent billing and your willingness to provide it are at odds.

tptacek

6 replies

15h39m

2024-08-24 02:49:50 UTC

I don't believe you saw me say that.

naniwaduni

3 replies

14h41m

2024-08-24 03:48:35 UTC

The technical problem is not "we are operationally incapable of, say, getting someone else to underwrite insurance contracts". That's not a technical constraint, It's a business decision.

I get that the underlying issue is that your target consumer is whales who eat orders-of-magnitude pricing spreads as normal opex, and that anyone who comes in with a budget is barely a consideration. It's still absurd to pretend that pricing is too hard. Like, I'm not going to confidently assert "lol you can give up C, it'll be a rounding error anyway" but like, billing cycles are absurdly long on the scale of your technical constraints.

tptacek

2 replies

14h34m

2024-08-24 03:55:02 UTC

I don't think you've thought very carefully about this, because there are real technical problems here. If we did a cap feature, enabling it would involve disabling parts of our platform. We're just going to go back and forth on this, because I'm not going to write you an essay on this, and you're going to keep saying "it's a business decision" and I'm going to keep knowing you're wrong about that.

For the nth time on this thread: we don't make our quarterly nut billing people looking to spend $50/mo an occasional extra $1000. The people who actually pay the bills here do not want this feature.

They definitely want Accident Forgiveness, though. Which means it's going to cost us money to do this. And that's fine; they're growing, we're growing.

naniwaduni

1 replies

14h16m

2024-08-24 04:13:08 UTC

If we did a cap feature, enabling it would involve disabling parts of our platform.

What I'm saying is that capping billing is not the same thing as shutting down parts of the platform. You're redirecting complaints about the latter by focusing on the difficulty of doing the latter in real time, which is granted but gross overkill.

The former is something that is not only doable, but something that already happens regularly in an informal capacity and nobody believes you don't have the data to price it.

donavanm

0 replies

8h51m

2024-08-24 09:37:55 UTC

Ok, so you think unintended usage should be costed out by the provider. Sure, T&S and support definitely have a handle on the topic. Now what? Because _today_ it’s already baked in to the P&L and pricing. You want the provider to give it to you as a line item that you dont control? Or to do actuarial evaluation of your footgun propensity to charge you more or less? Why? Im totally missing what solution your suggesting, the problem it solves, and why the provider _and revenue generating customers_ care.

genewitch

1 replies

15h23m

2024-08-24 03:05:43 UTC

If you're really only looking to spend $50, we should put our cards on table and say that we're generally not making product pricing decisions with you in mind.

i think is what gave that vibe off. I was on a read-only phone in bed and saw the quoted message. got up and logged into the PC to think about what to say. It may be time for cloud providers to dissuade small users away, instead.

donavanm

0 replies

8h43m

2024-08-24 09:46:37 UTC

Hes telling you the reality. Cloud providers dont want to _dissuade_ hobbyist users. But thats not their target market and they will make business decisions based on the revenue generating customer profile. Thats just being frank and honest.

Major cloud-style companies dont drive significant revenue from your $50/mo cohort. And a $5/mo dev account is basically courtesy for the sales pipeline. The vast vast majority of revenue is “enterprise” sales with private pricing and spend in the hundreds of thousands to millions range.

Jgrubb

0 replies

3h29m

2024-08-24 14:59:59 UTC

Define “consistent billing” please, I don’t even know what that means and I work in this space.

donavanm

1 replies

8h57m

2024-08-24 09:32:01 UTC

I dont understand what youre on about. The post I replied to, and many others, use “caps” to refer to limits of service based on billing. This is an endless source of comments on every “cloud” topic. I provided a very brief overview of why large billing systems are more complicated than expected and have an impedance mismatch to the stated desire. But sure, tautological brain rot. Got it. Im sure you have a wealth of experience with metering, billing, and pricing for billion dollar revenue streams.

Now if you have some _other_ proposal for how billing and service limits could function Im legitimately interested. But I dont see anything at all specific or actionable in your replies. Insurance is interesting for _some_ facets. Im curious how you think that aligns with dynamic resource utilization and what happens at the boundary.

Dylan16807

0 replies

1h18m

2024-08-24 17:11:13 UTC

if you have some _other_ proposal for how billing and service limits could function Im legitimately interested.

Billing doesn't have to be so complicated you can't calculate it in less than a minute. That's a technical failure. Surely you can imagine a better way? If you really think it can't be better, then it's hard to argue against "brain rot".

Also on most systems it will work perfectly fine to use an estimate of the price per unit when calculating the bill for the last couple hours.

tptacek

4 replies

22h58m

2024-08-23 19:31:16 UTC

We talk about warnings in the post. We'll do that at some point. The hard part about caps is what to do when someone hits them.

If there was a way to make caps work for our core customers, we'd do it. We're open to ideas. A theme of our work this past month and these next several months is extracting maximal value from ANFWWAONW, our new billing system. The thing you have to remember though is that our belief about our core customer is that they are averse to nothing more fiercely than service disruption.

We're not in principle opposed to caps. We just don't have a product story for them that we're comfortable with. Keeping you from spending more money than you wanted to is an explicit product goal of ours (again: see post); we're just very wary of trading availability off against that goal.

Aeolun

2 replies

18h29m

2024-08-24 00:00:27 UTC

You are comfortable scaling a service down to zero when configured to do so though.

Of course that comes up again when anyone sends a request, but that feels sort of in the same category.

That said, I do understand that building your service for people like me that’d rather be restricted to just $50/month doesn’t really make sense.

tptacek

1 replies

17h31m

2024-08-24 00:57:57 UTC

I don't think anyone with a serious app running on us will use a cap. Just stay fixated on this scenario: a deploy-only token gets stolen, and the attacker (like most cloud attackers) uses it to stand up a bunch of Monero miners. As a consequence... their main app goes down? Who would be OK with that?

Aeolun

0 replies

11h20m

2024-08-24 07:09:29 UTC

I think the cap (if they had any) would probably reflect the amount they stand to lose if their app goes down. If your app brings in $1000/h, you don’t necessarily care about spending that amount on servers. When your costs rise to $10k/hour, you might want to go with the nuclear option.

Of course it’s nicer if you can be certain that your provider is going to refund you the excess, but I feel like it’s hard to count on it. Or at least, harder than having explicit rules, which you just can’t really do for those sitations that are sensitive to fraud.

Honestly, if I did set a cap I’d be very much aware of the fact my app could suddenly die in a situation where my deploy token were stolen (but it wouldn’t matter for me, since it’s a hobby project, I care about controlling costs, not uptime).

Alacart

0 replies

21h12m

2024-08-23 21:17:36 UTC

Configurable warnings as webhooks would be pretty cool. Then I can automate whatever needs to happen on my side.

I already automate apps, machines, etc with the machines API and GraphQL, so my big worries in this area are:

- Woops, some bad logic deployed too many machines (sounds like this policy helps) - Some kind of mistake or attack that just explodes bandwidth usage suddenly

naniwaduni

2 replies

16h50m

2024-08-24 01:38:43 UTC

Sinclair effect in action: "It is difficult to get a man to understand something, when his salary depends on his not understanding it."

Obviously, cloud providers are capable of billing. It's not unreasonable to expect them to offer, at minimum, "Look, we're not evaluating billing continuously but we might, at our option, try to shut down your services after $X and won't bill you excess of $Y" as a (optional) hard contractual term. Having a larger, well-capitalized party absorb risk on behalf of a smaller party for a fee is a business model humans know how to price. It's just not the desired "product".

Note that this doesn't even per se require any technical artifact to implement, just very primitive metering and lush margins.

tptacek

1 replies

16h23m

2024-08-24 02:06:09 UTC

Seriously question: have you thought through what billing for bandwidth, storage, compute, and services actually entails? I'm having a bit of a reaction to the word "obviously" there.

naniwaduni

0 replies

16h22m

2024-08-24 02:07:26 UTC

I don't need to, the fact is manifest in your ability to send out the bills for them. This seems very obvious to me, and your incredulity is baffling.

rincebrain

0 replies

21h58m

2024-08-23 20:31:36 UTC

It "seems" simple, but people have really complicated needs, and the simplest answer if you don't have the time to build and maintain something to cover enough of the state space to make enough people happy is to not support it.

Some people are going to want you to go to 1Mb/s if you blow your bandwidth limits, or cash spend. Some might want you to go to 10 connections at once. Some might want you to just disable networking entirely.

And what happens to your storage when you blow your billing? Or CPU time?

It all becomes a hairy mess of state machines and companies wanting precisely _their_ requirements met, so you try to offer as much as you need to still have a compelling offering that enough people want to use, and no more.

Probably at best you could provide an API to make it easy for customers to build their own state machine, but that's fraught because then the customer will still blame you even if their own code did the wrong thing.

spion

0 replies

10h7m

2024-08-24 08:21:49 UTC

You're selecting for people who feel they're priviledged enough to reach out to support and ask for bill forgiveness even when they may've messed up, repeatedly. This does select for something but I'm not 100% sure "people who are doing this work professionally" is that clear-cut, especially once you move past western cultural norms.

ilaksh

2 replies

20h10m

2024-08-23 22:19:38 UTC

I love your service, but this erroneous take makes me think that even if I was looking to spend up to $500 with a "real" business, you don't have me in mind.

There should obviously be multiple types of caps. AWS and others have set a precedent that they can get away with "gotchas" for anyone who isn't paying attention.

It's really the primary business model. People that aren't watching costs have more and more sneak in there.

Does anyone know of a service like fly.io that has a perspective that's more friendly to bootstrapped startups?

tptacek

0 replies

20h5m

2024-08-23 22:23:57 UTC

Let me know if you find them!

sanbor

0 replies

18h12m

2024-08-24 00:17:39 UTC

Vercel has spend management and an option to pause pro when reaching the spend amount https://vercel.com/docs/pricing/spend-management#pausing-pro...

hinkley

0 replies

21h21m

2024-08-23 21:08:02 UTC

If there is a class of people who get steered to your service by a small number of organizations, you might. Universities, trade schools, a podcaster you’re sponsoring.

You can’t scale to individualized service for $50 per month/quarter/year users, that’s true. But you can shape policy for a demographic with… shall we call it flocking behavior, for lack of a better term?

donavanm

0 replies

17h32m

2024-08-24 00:57:03 UTC

FWIW I believe your reasoning is exactly what I saw/learned at AWS. The potential cost of losing/disappointing “real” customers greatly outweighed the notional cost of lost “scared” customers.

One interesting thought was trying to model some of this as an actual insurance method. Think of the cases where an adversary of the customer might inflate their costs through 3p usage/traffic. Providers dont want to incentivize those adversaries, deny the customer service, or charge them for unuseful service. Normally it devolved to credit/forgiveness, but then that moves the customers business model risks to the provider. What if this functioned similar to an insurance model; very cheap/baked in forgiveness for everyone (as today), then based on risk profile (porn/games/polical/gambling, or previous occurrence) the customer gets the option of buying forgiveness insurance or self funding their risk. The real sticking point is around perceived/potential conflict of interest and goodwill for the provider to say “pay us more for a thing that you cant directly control.”

reducesuffering

3 replies

19h0m

2024-08-23 23:29:09 UTC

Vercel has spending caps as of about a month ago.

tptacek

2 replies

17h31m

2024-08-24 00:58:12 UTC

Vercel is pretty great.

reducesuffering

1 replies

17h10m

2024-08-24 01:19:38 UTC

There are so many Next.js apps. Please make sure the Next.js deployment on Fly.io is smooth as you'll inevitably get a huge exodus (including me) if/when too many people pay high prices on Vercel.

tptacek

0 replies

17h2m

2024-08-24 01:27:16 UTC

We're not looking to pull people from Vercel. I'm not kidding, I like Vercel a lot.

isoprophlex

0 replies

10h30m

2024-08-24 07:59:05 UTC

Hetzner. 40$/mo gets you a 10-core, 20-thread intel box, 64 gb ram, 1 TB storage, unlimited traffic.

paxys

33 replies

2024-08-23 18:08:30 UTC

It's unfortunate that the solution to cloud pricing complexity that all providers are adopting is – add even more complexity on top.

The number you see on your bill is increasingly calculated by running some black box algorithm on top of the billing events your resources generate. Was it accidental or not? What is a "weird" deployment vs a normal deployment? By what factor should the spikes on your billing graph be smoothened? None of this can be deterministically calculated from the pricing page. And there's no way for you to do these checks yourself before deployment because you have no idea what this logic even is. So you are entirely at the mercy of the provider for "forgiveness".

Who wants to bet that some provider is going to launch "AI cloud billing" within the next year?

candiddevmike

21 replies

23h44m

2024-08-23 18:45:19 UTC

You'll pay exactly the amount they think you can afford to pay and not a penny less.

tptacek

20 replies

23h21m

2024-08-23 19:08:29 UTC

That would be what you would pay in the absence of competition. This is an incredibly competitive space.

paxys

9 replies

23h10m

2024-08-23 19:18:51 UTC

Competition only matters for new contracts. Once you have picked a provider and made a big enough infrastructure investment, there's no realistic path to switch to someone else.

javcasas

3 replies

22h31m

2024-08-23 19:58:15 UTC

They laugh at me for constructing as much as I can from the infra as plain kubernetes objects.

Keep laughing, I can switch clouds in less time than you take for figuring out cloud costs.

commercialnix

2 replies

19h23m

2024-08-23 23:06:30 UTC

Keeping everything, including mission critical databases, "inside" Kubernetes is smart. The smart people keep everything "in" Kubernetes. Everyone else can pay up.

filleokus

0 replies

18h20m

2024-08-24 00:08:58 UTC

Like everything else it's a tradeoff. If you are running _everything_ inside Kubernetes you can easily move away, but I think you're loosing much of the benefit of being in the $big_cloud to begin with. If you have the staffing to provide your own db's, block storages, caches, IAM, logging/monitoring, container registries etc in a reliable way "as a service", the jump to some much cheaper bare metal hosting is not that far.

For me the sweet spot is to have all compute in Kubernetes and stick to open source or "standard" (e.g S3) services for your auxiliary needs, but outsource it to the cloud provider. Fairly easy to move somewhere else, while still keeping the operational burden down.

But agree that having e.g all your databases in a proprietary solution from the cloud vendor seems sketchy.

felixgallo

0 replies

6h38m

2024-08-24 11:51:06 UTC

Don't be ridiculous. To the nearest 9 decimal points, nobody keeps their mission critical database inside Kubernetes, and the K8s tax is excessive for essentially all cloud users anyway.

zipmapfoldright

1 replies

19h25m

2024-08-23 23:03:47 UTC

This is absolutely not true. Customers regularly shift spends in the >= 7-digit between cloud providers. Yes, it takes planning, and it takes many engineer-months of work, but it definitely happens.

And also factor in (1) the claim that most cloud growth is ahead of us, eg. moving large customers from on-prem to cloud, and (2) it would be terrible policy to try to charge existing customers more than new customers.

ensignavenger

0 replies

2h21m

2024-08-24 16:07:42 UTC

> it would be terrible policy to try to charge existing customers more than new customers.

Companies do this very, very often. This is part of the reason why they have a "call the sales department for special pricing" option. They can give large contracts a nice discount to get them onboard, then slowly (or in some cases, if they really think they have you hooked, not so slowly) ratchet up the price. This is common in both B2c and B2B businesses.

toast0

1 replies

1h43m

2024-08-24 16:46:28 UTC

It really depends on what your application is and how much critical functionality relies on hard to replace proprietary services.

If you store a lot of data, transfering it is a big deal too, of course.

But really, computers are computers. It's easy to move software between them. It just takes motivation, effort, and time.

paxys

0 replies

53m

2024-08-24 17:36:07 UTC

It just takes motivation, effort, and time

In business terms, you just said "cost, cost and cost".

jgalt212

0 replies

16h51m

2024-08-24 01:38:29 UTC

True, margins are very high for existing customers. And customer acquisition costs are very high as well.

immibis

8 replies

22h52m

2024-08-23 19:37:40 UTC

Then how do AWS, Azure and GCP charge 100 times the amount for bandwidth as other hosting providers and the IP transit quote sitting in my email inbox?

tptacek

6 replies

22h45m

2024-08-23 19:44:37 UTC

Why isn't one of those providers roflstomping the other 3 (add OCI to the mix) with better pricing?

rendall

2 replies

20h1m

2024-08-23 22:28:04 UTC

"roflstomping"¿ What does it mean?

stackghost

0 replies

16h36m

2024-08-24 01:53:28 UTC

It's from online gaming. To stomp means to defeat soundly/easily. To ROFL is to roll on the floor laughing.

To roflstomp is to defeat someone so easily and completely that it becomes so comical you'll be ROFL.

ShroudedNight

0 replies

19h48m

2024-08-23 22:40:43 UTC

A beligerent that so completely outclasses their opponent that they can inflict existentially threatening losses with so little effort that its expenditure is indistinguishable from normal overhead

Alternatively, when Stitches managed to ambush your level appropriate character alone in Duskwood

talkin

0 replies

21h32m

2024-08-23 20:57:09 UTC

With such a small group, chances are they all figured out that a race to the bottom won’t reach the desired long term outcome.

pclmulqdq

0 replies

22h39m

2024-08-23 19:50:22 UTC

Because they land your data in their systems by offering you a sweetheart deal in year 1-2 and then they jack the price back to the extortionate level once you're stuck. Compute is portable between providers, but data is not.

jeremyjh

0 replies

3h16m

2024-08-24 15:13:23 UTC

I've often wondered about this. I'm guessing large customers can all negotiate extremely deep discounts on bandwidth from all three providers. Smaller customers may not be paying that much in bandwidth that it would be decisive. I think also that if say GCP cut its bandwidth charges by 10x they might attract customers they really don't want.

toast0

0 replies

1h9m

2024-08-24 17:20:15 UTC

IMHO, there's a couple parts to this.

Part 1 is bandwidth prices vary tremendously by location, but clouds would like to have customers in expensive regions too, and if US overcharges at 100x, and Brazil overcharges at 25x, the customer will only have to pay 2x for bandwidth in Brazil. Not a lot of cheap hosting in Brazil, from what I've seen, but there are a lot of users in Brazil, attracting cloud customers justifies putting more cloud hardware there which benefits the cloud.

The other part is that bandwidth is easy to measure and broadly correlates with general usage. On a shared system, you can't really meter watt hours, but you can meter bandwidth. Bandwidth charges are how the difficult accounting is reconciled so that the cloud can charge enough to hit their desired margins. If bandwidth was just a cost plus, other things would cost more; if everything had to be cost plus, accounting would be much more difficult for everyone (and it's already pretty non transparent)

kibwen

0 replies

21h27m

2024-08-23 21:01:46 UTC

If the cost calculation is complex and opaque, that successfully prevents anyone from evaluating what their costs would be on a competing service. And vendor lock-in makes it expensive and nontrivial to simply try it out.

ToucanLoucan

6 replies

23h30m

2024-08-23 18:59:30 UTC

Anyone else remember we had widely available and completely understandable options to rent everything from baseline web-hosting all the way up to private rack services for actual years before AWS came along and apparently all of that was forgotten, Warhammer 40K style?

I've rented a VPS from a vendor for going on 20 years now (Holy fuck I'm old) and I've never once been surprised at the bill.

maliker

3 replies

19h49m

2024-08-23 22:40:27 UTC

You know that book "JavaScript: The Good Parts"? There needs to be a similar one "AWS: just the good parts". It would probably talk about EC2 (VPS), S3 (cheap bulk storage), SES (emails), and that's about it. When folks get into the elastic-super-beanstalk-container-swarm-v0.3 products, that's when they really kill themselves on the bills.

That said, yes, just using a VPS vendor is the easy way to stick to the good parts.

otteromkram

0 replies

14h18m

2024-08-24 04:11:12 UTC

"AWS: just the good parts"

I can't imagine a zero-page book selling well...

ec109685

0 replies

18h28m

2024-08-24 00:00:58 UTC

AWS the Good Parts (a bit dated now): https://dvassallo.gumroad.com/l/aws-good-parts

Aeolun

0 replies

18h47m

2024-08-23 23:42:08 UTC

I’ll argue that while wonky sometimes, ECS is pretty decent. Especially once you use fargate.

pclmulqdq

0 replies

11h23m

2024-08-24 07:05:50 UTC

You still can. My company rents space from Cogent communications, and they are very good at customer service, although the offering of "rent a cage and get a pipe" is a lot more DIY than AWS.

They also offer VPSes.

LAC-Tech

0 replies

21h58m

2024-08-23 20:30:46 UTC

Remember? Your own comment tells us this is not a distant memory! It is still available! And a lot of people use it.

thenewnewguy

1 replies

23h44m

2024-08-23 18:45:33 UTC

I'd recommend thinking of this (and other accident forgiveness schemes from competitors) as a gesture of goodwill that rarely happens rather than an official part of the billing policy.

If you actually look at your contract, no cloud provider is going to contractually obligate themselves to forgive your bill, and you shouldn't be planning or predicting your bill based on it.

monooso

0 replies

4h16m

2024-08-24 14:13:26 UTC

Isn't the entire point of the article that Fly.io is making this an official policy?

rob

0 replies

21h56m

2024-08-23 20:33:19 UTC

Speaking of complexity, Fly.io recently introduced region-specific pricing as well:

https://community.fly.io/t/region-specific-machines-pricing/...

https://news.ycombinator.com/item?id=40873001

bdcravens

0 replies

2024-08-23 18:29:22 UTC

It's far from perfect, but the large providers do give you tools for categorizing your charges (tagging, etc). There's some fuzziness, especially around data transfer, but for the most part developers can look at an application and know what the cost of each resource is. The biggest risk seems to be out-of-control auto scaling turned on without doing reasonable analysis first.

tptacek

30 replies

22h17m

2024-08-23 20:11:41 UTC

Y'all really, really want caps. We think that's a crazy feature. We don't want to do it. But I just had this conversation on a Slack with a bunch of friends at other companies and they were just as yell-y about this. There's a threshold of feedback at which I think you could get us to do some kind of cap thingo. I'm just saying.

Yeesh.

ustad

19 replies

22h15m

2024-08-23 20:13:56 UTC

In your article or in the posts here you have not said why you think its a crazy idea.

tptacek

18 replies

22h12m

2024-08-23 20:17:04 UTC

I've said it over and over again: if you have a serious app, and someone somehow steals a credential from you and uses it to light up a bunch of crypto miners, you don't want us shutting down your main app in response. We perceive it mostly as a feature that will blow people up.

We agree about the underlying problem! You don't want to spend $5000 in a month for services you never wanted. We don't want you spending that either. We'd rather just improve our billing so that you can fix this after the fact without trading off availability.

But I have just planted a flag: we are prepared to be wrong about this. I don't think we are, but like, I'm the only one. :)

bee_rider

7 replies

21h31m

2024-08-23 20:58:22 UTC

I think your position is essentially that you mostly want to serve people who have serious apps, for whom the cap idea doesn’t work. And you definitely know more about the general trajectory of your customers than all of us sitting here randomly speculating. But, is it really so uncommon for an application to transition from unserious, I want to cap, to serious?

I guess I’m slightly confused because I thought one of the nice things about cloud services is that they give you the ability to fit your infrastructure to your size while you are trying things. If I’m still trying things, I might not even know if I’m serious yet, right?

tptacek

6 replies

20h51m

2024-08-23 21:38:04 UTC

Sure, but bear in mind that across the entire industry of public clouds, which has been around for something like 2 decades now, nobody really has this feature. In fact, didn't Google have this feature and then pull it?

bee_rider

3 replies

19h59m

2024-08-23 22:29:41 UTC

Yes, clearly is it not the convention to provide this feature. But the chain of thought seems so straightforward and obvious to me: cloud infrastructure is great for exploring what you’ll be doing, and guardrails make you more willing to explore fearlessly. I’m obviously missing something, though!

Aeolun

2 replies

18h8m

2024-08-24 00:20:42 UTC

It’s also easy to see why the convention would be to not have it. It earns the provider more money. After all, not every customer will ask for forgiveness for a surprise bill.

tptacek

1 replies

17h35m

2024-08-24 00:54:31 UTC

This is not why we don't have caps. Maybe it's why AWS doesn't. This simply isn't how we make our money. Our top line is dominated by companies growing businesses with us. Getting a hobbyist to cough up an extra $100 doesn't move a single dial for us.

Aeolun

0 replies

17h33m

2024-08-24 00:56:40 UTC

I didn’t mean to imply you did (or even AWS for that matter). Just that from the consumers perspective it’s impossible to see/know the difference.

myaccountonhn

1 replies

19h8m

2024-08-23 23:21:24 UTC

I've def. wanted to have spending caps on a lot of my projects. The ones I've tried: AWS and GCP made this ridiculously difficult. It might be that they will waiver the bill, but the general feeling that they get to decide if I go bankrupt or not is nightmare fuel.

In the end, I've just set up a 5$ vps where I self-host all my apps. That removes all the stress.

tptacek

0 replies

16h37m

2024-08-24 01:51:47 UTC

Yeah, so one issue we run into here is that we're not trying to get you to stop using $5 VPS's. That is not good business for us! There is nothing at all wrong with managing your own servers.

belthesar

3 replies

21h12m

2024-08-23 21:17:02 UTC

I'll be honest, I think you've got an uphill battle to convince so many of us that are concerned about accidental pricing that this is exactly the thing we actually want, vs. what we feel we want. This position parrots what Vercel's leadership said when someone had a massive surprise bill, and the story made the rounds both here and elsewhere.

To be super honest, you might be right too. You might go down a huge engineering effort to build this in only for 5% of your customers to ever engage it. I think the real question is what percentage of your customers will feel better knowing that the have the choice to set those limits, and how will that comfort actually improve their trust with Fly, and cause them to choose it over another cloud provider.

It may end up being a lot like this feature we had in a platform that I used to support. It had a just-in-time analytics pipeline that at one point required tens of thousands of dollars in compute, storage and network hardware alone to function. Based on our analytics, it was barely used compared to the usage of the rest of our app, which made the zounds of resources and fairly frequent support attention it needed feel silly in comparison, so I advocated to sunset it. Product assured me that, regardless of how silly it might be to continue supporting this feature, it was a dealmaker, and losing it would be a dealbreaker.

So yeah, y'all might be right in that the majority of your customers don't actually want it. But maybe what they do need is to know that it's there, ready for them if they ever need to engage with it.

dlisboa

2 replies

20h40m

2024-08-23 21:49:31 UTC

You might go down a huge engineering effort to build this

This is an overlooked issue: billing caps are hard to implement and will likely incur losses for the cloud company that does.

Take an object storage service as an example. Imagine Company X has a hard cap at US$ 1000 but some bug makes their software upload millions of files to a bucket and rack up their bill. Since objects are charged in GB/month they will not reach the cap until some time later that month. Then, when they do, what does termination of service mean? Does the cloud provider destroy every last resource associated with the account the second the hard cap is reached? If they don't, and they still have to store those files somewhere in their infra, then they'll start taking a loss while Company X just says "oops, sorry".

That's what tptacek is talking about: you want to NOT destroy the customers' resources because they can quickly figure out that something went wrong and then adjust while still maintaining service. But the longer you keep the resources the more you're paying out of pocket as a cloud provider. If you can't bill the overages to the customer, which a hard cap would imply, then you're at a loss. Reclaiming every resource associated to an account the moment a cap is reached is an extreme measure no one wants.

A hard cap then becomes only a "soft" cap, a mere suggestion, and cloud providers would then say "you hit the cap, but we had to keep your resources on the books for 12 hours, so here's the supplemental overages charges". Which would lead to probably just as many charges disputes we have today.

umanwizard

0 replies

18h34m

2024-08-23 23:55:19 UTC

$1000/mo in S3 deep glacier storage buys you a petabyte (a million gigabytes) of storage. It’s hard to imagine such a small customer uploading a petabyte without noticing, and part of what happens when you hit the cap could be moving things from normal object storage to glacier.

Dylan16807

0 replies

1h25m

2024-08-24 17:03:44 UTC

If you turn off servers and shut off bandwidth you get rid of the vast majority of expenses.

Storage fees are a lot less risk, but if you want to cap those then you should cap the number of gigabytes directly. That prevents the overage issues you describe.

umanwizard

2 replies

18h39m

2024-08-23 23:50:01 UTC

Giving people the ability to use caps as an optional feature doesn’t force anyone to use it.

Also, you could turn things off in the inverse order they were turned on until the cap is satisfied. So all the crypto miner instances would be turned off before database backups being deleted.

Aeolun

1 replies

18h13m

2024-08-24 00:16:13 UTC

When you go over the cap everything turns off, otherwise, what’s the point?

Should you turn things off until you reach a certain spend rate? Then you can set your cap high enough for your server +1, but no more.

umanwizard

0 replies

6h13m

2024-08-24 12:15:45 UTC

I was imagining the cap would be a rate, not a total.

ustad

1 replies

22h8m

2024-08-23 20:21:31 UTC

Crypto miners is the reason for not having a “crazy” option for “cap on/off” at sign up?

You know what you can’t fix after the fact of getting a huge bill? a heart attack.

tptacek

0 replies

22h6m

2024-08-23 20:22:41 UTC

This post is about us not giving you that heart attack.

I'm pushing back, and I'm going to keep pushing back; if we do this feature, it is going to be kicking and screaming. :)

lilyball

0 replies

20h24m

2024-08-23 22:04:44 UTC

I'm pretty sure I'm fairly far from your target customer, given that I'm not building products, and right now my only usage of fly.io is to host a tiny app that expects to be used a few times per year by my spouse and me.

Having said that, the reason why I personally would be happier with the ability to set a hard cap is that if I'm going to put a project on fly.io, I'm spending my own personal money on it. If I'm spending my own personal money on it, I want a guarantee that it cannot possibly cost more than a given amount. When it's my own money on the line, I absolutely want the service to shut down rather than have even 0.01% chance of costing me a lot of money.

The moment I'm actually building something for a business, the moment it's company money instead of personal money, then priorities change and everything you're saying makes sense. But as long as I'm just a single developer playing with stuff and billing it to my personal credit card, I want that guarantee that I won't accidentally make myself go broke.

umanwizard

2 replies

18h41m

2024-08-23 23:48:13 UTC

Why don’t you want to do caps?

tptacek

1 replies

17h36m

2024-08-24 00:52:59 UTC

When I was 15 years old, I was walking the mean streets of Chicago after attending an opera with my parents when a billing cap leapt from the alley and snatched my mom's purse. My dad fought it valiantly, but it prevailed, killing both my parents in the process. From that day forward...

umanwizard

0 replies

6h13m

2024-08-24 12:16:37 UTC

I was actually curious what your serious reason is.

tedunangst

1 replies

19h36m

2024-08-23 22:52:51 UTC

Okay, so now that it's inevitable you add caps, can we wager on how long between rollout and the first front page "I told fly I wanted a service cap, but not like this!" post?

tptacek

0 replies

17h37m

2024-08-24 00:51:47 UTC

I think I know how to message this one. "You fuckers, here you go, now leave me alone".

amirhirsch

1 replies

20h28m

2024-08-23 22:01:26 UTC

Sounds like I should just DM Kurt until it happens?

tptacek

0 replies

20h22m

2024-08-23 22:07:27 UTC

Absolutely you should. I'm intractable, he isn't.

mixmastamyk

0 replies

14h37m

2024-08-24 03:52:11 UTC

People keep implying a cap means “break/delete everything.” Not necessarily so?

It would make a lot more sense to throttle service, say 80% instead like ISPs do. Slow bandwidth/fewer cpus would raise flags without necessarily breaking anything.

christina97

0 replies

16h26m

2024-08-24 02:02:55 UTC

I totally sympathize with the “no caps” argument. I mean it’s literally just “kill random shit on error”.

Putting both my personal/private as well as work hats on: I’m not sure I would ever want to enable it.

aidenn0

0 replies

19h16m

2024-08-23 23:13:11 UTC

I think hobbyists 100% want caps, businesses don't know what they want (but would be happier without caps if they could experience the counterfactual) and former hobbyists hired to work for businesses will put caps on if they are available.

morningsam

21 replies

2024-08-23 17:51:23 UTC

If you do something luridly stupid and rack up costs, AWS and GCP will probably cut you a break. [...] Everyone does.

If the incidents that made the rounds here in the last few months are any indication, they'll start out insisting you pay no matter what. You'll then have to write a blog post about it, post it to Twitter, HN, and Reddit, get a couple hundred comments expressing anger at the provider, and wait for someone from their PR department to see it. Only then will they finally waive the costs.

Good on Fly.io for trying to handle such situations more sensibly.

bdcravens

6 replies

2024-08-23 18:02:27 UTC

There's plenty of situations you don't hear about.

A few years ago I f'ed up and accidentally pushed keys to a public repo, and by the next morning, we racked up $50k in AWS charges from crypto miners. We reached out, they gave us a security checklist that if we followed, they'd take off the charges. We did, and by Monday (my code push was Friday evening) the charges were taken off. No public shaming required.

perchlorate

2 replies

19h59m

2024-08-23 22:30:31 UTC

You likely already know that, but to anyone else interested: a good way to prevent these kinds of situations is to run 'nosey parker' on your git repo before pushing it to a remote. It will dig through your code and configs, looking at files and through all the git history, and highlight anything that looks like tokens, passwords, keys, etc. You can set it as a pre-commit hook to block the offending code from even being committed.

https://github.com/praetorian-inc/noseyparker

Tiberium

0 replies

11h3m

2024-08-24 07:26:03 UTC

https://github.com/trufflesecurity/trufflehog is a similar tool but checks for far more secrets, so I think it'd be a better choice.

Aeolun

0 replies

18h43m

2024-08-23 23:46:27 UTC

Github has a similar feature that’s free for public repositories IIRC.

paulgb

2 replies

2024-08-23 18:05:17 UTC

I’m curious what sorts of things were on the checklist. I wonder if it’s something they will share proactively?

bdcravens

1 replies

2024-08-23 18:15:15 UTC

It's been a few years, so I'm going off of memory, but it was mostly best practices stuff (enabling Cloudtrail, rotating older keys, etc). Anything to ensure that once the attackers no longer had access, removing/monitoring anything that would have longer term implications.

skeeter2020

0 replies

23h46m

2024-08-23 18:43:23 UTC

this makes sense; plug the hole and stop taking on water before you ask for a free cleanup.

AtlasBarfed

4 replies

23h50m

2024-08-23 18:39:08 UTC

Another reason to be cloud agnostic. They are counting that switching cost will make you eat the bill.

If you have to be cloud, do dev in one cloud and test/prod in another.

I know, I know, easier said than done.

srockets

3 replies

21h59m

2024-08-23 20:29:59 UTC

An argument that is often only made by folks who didn't ran anything at scale on a public cloud. If you've ever set in a room when a cloud deal was signed, you'd know it cable: you don't get to pay less by threatening to switch providers (I've seen people try to suggest that and get laughed out of the room), but by buying more from a single one.

Hence accident forgiveness is in the same marketing bucket as free credits for new customers: reducing the aversion to trying the cloud and putting more work on it.

And the switching costs, the biggest line item isn't building for multiple clouds (you can't be cloud agnostic: abstractions always leak somewhere, so you need to select a set of specific clouds to build for), but the cost of moving data between clouds. That's the real lock-in.

Aeolun

1 replies

18h40m

2024-08-23 23:48:46 UTC

AWS is something like an insurance service. We all pay more so that they can forgive the people that mess up xD

srockets

0 replies

17h1m

2024-08-24 01:28:04 UTC

That’s not how it works.

We all pay more [sic] so data centers would be over provisioned, allowing quickly expansion when we need it. I’ve been in a few incident that were root caused to a cloud provider lacking capacity to provision more instances.

As that over capacity would’ve been bought and installed in the cloud provider’s data centers regardless of the errors that are being forgiven, not billing for those errors is a net gain to the provider, at minuscule cost, if at all, to other users.

hinkley

0 replies

21h26m

2024-08-23 21:03:36 UTC

In the IBM, HP days it wasn’t about paying less per se by threatening to switch providers, it was getting more concessions (of which money might be one). Some big companies had both systems so they could play them off of each other. Time to order more hardware, who will kiss our butts more?

I’d be very surprised if that doesn’t still exist for the big boys. Though most of us are not big boys, and half of the biggest boys are cloud providers themselves.

jmathai

2 replies

14h51m

2024-08-24 03:37:48 UTC

I was a PM in GCP and refund requests due to customer misconfigurations would make it to me for approval.

Generally, I tried to make exceptions to grant the refund. It sucks to get these bills and they can be quite scary.

Unless you have a real chance of bursty traffic I suggest going with compute that has more predictable costs.

rawgabbit

1 replies

14h17m

2024-08-24 04:12:03 UTC

So no elastic anything. Just servers?

jmathai

0 replies

6h18m

2024-08-24 12:11:06 UTC

If you have elasticity then make sure you understand the limits. Even Serverless has these knobs but for various reasons they default to high elasticity.

But many folks don’t need the elasticity at all. So you should factor that into your architectural decisions.

JimDabell

2 replies

12h50m

2024-08-24 05:39:22 UTC

What are you referring to, exactly? That doesn’t sound like AWS at all.

AWS are very well-known for bill forgiveness. It’s not something set in stone, but if your bill explodes accidentally, even due to a mistake you made, they will normally forgive it if you ask them. You don’t need to go running to social media at all.

morningsam

1 replies

11h48m

2024-08-24 06:41:15 UTC

You're right, I was getting two relatively recent posts mixed up:

1. After a DDoS attack, someone got a $100k bill from Netlify for his static site and after he asked to have it waived, they generously reduced it to $5k. Only after his posts about it blew up did Netlify waive it completely. [1]

2. Someone got a $1k bill from AWS because lots of people made _unauthorized_ requests against his empty S3 bucket. AWS did agree to waive it immediately, prior to any social media posts. [2]

I probably just remembered (2) as "that ridiculous billing situation involving AWS" but got the details of what exactly happened mixed up with (1).

[1] https://news.ycombinator.com/item?id=39520776

[2] https://news.ycombinator.com/item?id=40203126

rmccue

0 replies

8h4m

2024-08-24 10:25:14 UTC

For (2), AWS _also_ has now shipped a change that makes those requests non-billable permanently: https://aws.amazon.com/about-aws/whats-new/2024/08/amazon-s3...

paxys

0 replies

2024-08-23 18:03:09 UTC

If the incidents that made the rounds here in the last few months are any indication

They really aren't. There's a huge world out there beyond the HN front page.

giraffe_lady

0 replies

23h48m

2024-08-23 18:41:37 UTC

I have personal knowledge of two cases of this happening and both were immediately dropped. One was a high school student's personal project gone awry for like $3k worth the other was a startup where it was $120k or so. Two different providers. Neither had to particularly argue their case, just ask and wait a few days.

ascorbic

0 replies

6h36m

2024-08-24 11:52:49 UTC

Well, the ones where they forgive it right away don't write blog posts about it, so you don't know about them. I guarantee that every case which makes the front page is either an example of a mistake made by an individual support engineer following the wrong script, or the person posting it is being dishonest about what actually happened (I recall a particularly suspicious recent one where they hid the name of their site). The cloud providers all have a policy of forgiving these sort of cases. The only speedbump is ascertaining if it was a genuine mistake.

tomjen3

16 replies

2024-08-23 18:27:28 UTC

I really, really wish there was just a way to put a hard limit on how much a cloud could charge me.

I am never going to want to spend 200k of my personal money on some project on a cloud. Never. I don't even want my ant-based basket viewing project simulator to cost me 1 thousand dollars because it went viral and all clouds overcharge for bandwith.

Just let me put in a limit.

cube2222

7 replies

23h45m

2024-08-23 18:44:08 UTC

I think this sounds easy, but it’s actually quite nontrivial in practice. You need special handling of such a limit for each kind of resource.

E.g. when my limit is reached to they remove the database, along with all backups, and all objects on S3? Since storage is billed, it should also be stopped when the limit is reached, right?

I think in practice among companies paying most of their revenue, there’d be zero interest in this, while it would be a lot of effort to implement.

This is really something specific to hobby projects, which just shouldn’t be using those “unlimited potential cost” services.

candiddevmike

5 replies

23h39m

2024-08-23 18:49:59 UTC

I don't think any of these requirements are necessary. A basic "if I reach my bill limit, turn off things that are billing" toggle would suffice for 80%+ of users, especially with better billing controls/per-team billing accounts. I think you run into more problems/user frustrations with a tenuous conditional-shut-off approach.

My tinfoil hat is that a lot of cloud billing is accidental, probably from "lab environments", and they don't want to provide a way to budget/limit these.

cube2222

2 replies

23h30m

2024-08-23 18:59:27 UTC

I mean, those S3 objects, those backups, those databases, are billing.

“Turning them off” means deleting them.

jabbany

0 replies

23h24m

2024-08-23 19:04:46 UTC

It doesn't have to be though?

The provider could reject further access to them (reads / writes) once the limit is reached. The cost of actually keeping objects as "cold" storage has a natural cap per billing cycle since those are billed based on time.

candiddevmike

0 replies

23h28m

2024-08-23 19:01:22 UTC

Sounds good to me, that's what offsite backups are for (if the data is even necessary)

tptacek

0 replies

23h34m

2024-08-23 18:54:42 UTC

The whole post is about this accidental billing situation.

bee_rider

0 replies

21h27m

2024-08-23 21:02:33 UTC

What if my bug was saving, like, way more data to the cloud than I expected, and suddenly having a big bill for the hard drive space my data is using up just sitting there? It would be a pretty dumb bug, but hey, you never know, right? In that case they have to choose to either delete my data or keep charging me, so I guess there isn’t an easy zero-cost “stop” option.

HeatrayEnjoyer

0 replies

23h31m

2024-08-23 18:58:14 UTC

It there was a law about this companies would find a way. Where there's any uncertainty they can simply eat those edge costs, the extremely fat profit margins cover it easily.

mappu

5 replies

20h40m

2024-08-23 21:49:08 UTC

You might be happier with a fixed-size VPS, instead of a cloud provider,

pocketarc

3 replies

20h22m

2024-08-23 22:06:59 UTC

I started writing this comment by saying:

Exactly - the whole point of a cloud provider is scalability. If you're doing a personal hobby project, get off big scalable clouds and get yourself one (or multiple!) fixed-price VPS or dedicated servers.

But as I think of it, I think what people really want, for hobby projects, is not so much the scalability, but the managed offerings. They want zero-ops, zero-maintenance, zero-server-updates hosting, with a fixed price and hard limits. It won't be infinitely scalable, but it doesn't need to be - it's a hobby project.

They just don't wanna sysadmin a server of their own. Which is completely understandable.

There's room in the market for something like this.

spencerchubb

0 replies

20h2m

2024-08-23 22:27:02 UTC

If you want a server that's fixed cost and zero maintenance, I recommend replit.

genewitch

0 replies

15h18m

2024-08-24 03:10:51 UTC

We tried this. I was tasked with automating popular software installs into fresh VMs. I think some of my scripts for doing so are on my github - wordpress and some dashboard software, at least.

offerings were published on the main page and afaik no takers. We migrated off whatever hypervisor we were using onto wok/kimchi and finally to proxmox, so my scripts still work, but proxmox has turnkey linux "quickstart" servers now as well as lxd, so there's less reason to use my scripts.

CaptainFever

0 replies

10h8m

2024-08-24 08:21:17 UTC

An important part of hobby projects is the scaling to zero part too. I wager a lot of hobby projects use cloud simply because it's free or almost free (e.g. 2 cents a month), which isn't the case if you rent a VPS.

tomjen3

0 replies

14h49m

2024-08-24 03:40:18 UTC

Thats a good point, however I might want to set limits higher than my normal usage, just not astronomically high.

commercialnix

1 replies

19h14m

2024-08-23 23:14:50 UTC

Would you be interested in a cloud provider that did not charge for inbound or responding (outbound in response) bandwidth?

tomjen3

0 replies

14h46m

2024-08-24 03:42:50 UTC

Yes. Very much so. Or at least one that charged SANE rates. I get 100 mb outbound on my resident ISP, and that is unmetered, but clouds that sit almost directly on the backbone somehow charge a nontrivial amount of dollars per GB of bandwidth.

I only know of Hertzner.

andrewstuart

7 replies

19h42m

2024-08-23 22:47:34 UTC

Just run Linux computers on ionos with unlimited traffic and get on with the job instead of worrying about silly things like how much traffic passed over some internal software system and will it bankrupt the company.

https://www.ionos.com/servers/vps

ilrwbwrkhv

6 replies

18h54m

2024-08-23 23:34:53 UTC

Same but on Hetzner. Incredibly beefy machines. Unlimited traffic. No fear of any sudden changes that I explicitly didn't act upon. I do not know why people fall for the shiny marketing so easily, that too hackers? Is it all the Javascript folks? Vercel I know is propped up by them for sure.

tptacek

5 replies

17h28m

2024-08-24 01:00:56 UTC

I am never going to get tired of telling people that if they can solve their problems running on Hetzner, they should do that. It's not like a dirty secret of our industry. By all means, deploy with VPSs! Or on bare metal servers!

ilrwbwrkhv

4 replies

15h32m

2024-08-24 02:56:42 UTC

Do you have a blog post on fly's blog where you tell people that?

genewitch

2 replies

15h0m

2024-08-24 03:29:10 UTC

i mentioned this as well in another comment to parent. Hopefully it catches on.

"Hey, SMB? you probably don't need our services. While AWS, GC, etc would be happy to take your money anyhow, [...]" I dunno. obviously any sort of thing like this has to clear all the departments because i imagine it increases support load.

tptacek

1 replies

14h53m

2024-08-24 03:35:55 UTC

No SMB is OK with us zapping their services because some screwed up deployment or stolen token exceeded a billing cap. You're not talking about SMBs, you're talking about people deploying random personal stuff. We love those people, we're happy to have them here, but we do not price the product specifically for them.

I get that this is a lot of venting about people's issues with cloud providers writ large, but damn.

genewitch

0 replies

10h19m

2024-08-24 08:10:11 UTC

i said SMB because SMB doesn't need cloud. if you have enough users to warrant cloud you're no longer S or M. I was actually going to start to concede that you might be right, maybe i should have said small-time users or something more eloquent; but no, i'll stand behind what i said. Dissuade all SMB and small users as much as possible. then there's 0% chance they get a surprise cloud bill. I solved the problem, yay.

as i mentioned elsewhere i'm intimately familiar with pretty much every intimate detail of "cloud" from hardware, software, network, cooling, and ops (i wouldn't call me a dev. I don't think anyone else would or should, either.) I've bootstrapped cloud services from empty racks twice and repurposed existing hardware for cloud once.

I understand why there's no "cap" available on any cloud services. I mostly have a problem with capitalism, which is ironic, considering this site.

tptacek

0 replies

15h30m

2024-08-24 02:58:46 UTC

We are incredibly backlogged on the blog. I've got 3000 words on Tigris rolling out next week, we've got the results of a $300K 3rd party GPU audit to write up, we wrote a replacement to Vault called Pet Semetary, we need to write up the billing system because, as you can see from this thread, people appear to think that's not a hard technical problem. I can imagine getting around to writing a post about when a VPS is a good alternative to launching a container on Fly.io, but it'll be aspirational; I doubt I'll ever get around to it.

But I'm happy to remind people here of it!

kasey_junk

6 replies

16h42m

2024-08-24 01:47:29 UTC

If I were fly I’d implement spending caps just to filter out unserious operators.

My implementation of that feature would be when you turned it on fly would just kick you off.

No serious operator wants a provider to turn their system off. So if you want that it’s pretty clear you are hosting a silly system. Which likely costs more to support and drives less margin.

_jab

2 replies

16h21m

2024-08-24 02:08:10 UTC

No serious operator would choose a provider with your implementation of that feature.

tptacek

1 replies

16h14m

2024-08-24 02:15:12 UTC

I agree that no serious operator would choose a provider whose spending cap feature was "enable this feature and we kick you off the platform".

psd1

0 replies

5h39m

2024-08-24 12:50:14 UTC

People will pay to be abused: c.f. Oracle

lo3p0ez

0 replies

14h24m

2024-08-24 04:05:28 UTC

Hu? For any business having a due to a cap throttled service is better than losing the money straight away due to an accidental cloud bill explosion.

langcss

0 replies

13h20m

2024-08-24 05:08:57 UTC

And yet every single cloud let's you scale from zero and play and experiment. They know today's tinkerers are tomorrow's CTO decision makers.

icedchai

0 replies

14h17m

2024-08-24 04:12:09 UTC

The real solution is to make it simpler to set up billing alerts. If my hobby project is ever projected to cost more than $0.50/hr, let me know. Billing alerts should be part of onboarding for all cloud providers, especially for individuals that don’t want surprises.

joshfraser

4 replies

23h35m

2024-08-23 18:54:20 UTC

In other words, they are socializing the costs. Servers and electricity aren't free. Wouldn't it be better for the customer if they had no accident forgiveness and passed those cost savings along? Instead, you are paying for other peoples mistakes plus all the extra overhead caused from fraud that they incentivized.

trjordan

2 replies

21h46m

2024-08-23 20:43:20 UTC

The servers are already bought; the electricity cost is negligible for actual accidents.

As mentioned in the post: the hosting providers don't actually pay marginal costs for transient mistakes, so neither should honest customers.

joshfraser

1 replies

20h57m

2024-08-23 21:31:41 UTC

And what about the extra costs fighting fraud now that they've advertised this policy?

DAlperin

0 replies

20h34m

2024-08-23 21:54:49 UTC

We have to invest in fighting fraud and abuse anyway, such is the public cloud business. We don't intend to diminish user experience in service of fighting it.

tptacek

0 replies

23h34m

2024-08-23 18:55:18 UTC

No, it would be worse if we "passed these cost savings along".

gwbas1c

4 replies

23h48m

2024-08-23 18:40:44 UTC

and may only come within tens of dollars of accurately predicting your water bill.

Whenever I've had to water new grass seed I've always been surprised by my water bill.

hinkley

1 replies

21h4m

2024-08-23 21:24:50 UTC

There’s a guy on Reddit a couple days ago showing off his softball sized watermelon he grew with a $100 watering bill. The struggle is real.

aidenn0

0 replies

19h19m

2024-08-23 23:09:59 UTC

$100 is like 14HCF where I live (though it includes sweage).

[edit]

14, not 12HCF.

tptacek

0 replies

23h45m

2024-08-23 18:44:24 UTC

There's a note on the draft of this post with like 6 comments from people on our team about how the water bill is a bad example. I stand by what I wrote!

genewitch

0 replies

15h11m

2024-08-24 03:18:11 UTC

my well only costs me electricity and has no meter - well, i have a meter somewhere, i just never installed it. I know for a fact it uses 9A to run, so it would cost me 30¢ or so, all-in, to run it for an hour. it's somewhere around 10-12GPM, so 600-700 gallons for 30c

I've wanted to put it on solar the entire time i've had it but the start current is 18A and an inverter that can handle that is (or maybe was, idk) real expensive, considering.

anyhow if you saw the area around my house, in the dog days of summer, right now, you'd think it had been raining non-stop all summer - in fact, it hasn't really rained at all since the beginning of june when we had 12" - i've personally watered 6+" over about 1/3rd of an acre, so like 50,000 gallons since then.

edit: oh crap i forgot the reason i wanted to comment on this at all! My neighbor recently had some issues with a pipe running along the bottom of his house that sprung a leak and they got a surprise $600 water bill - no forgiveness. the water company did, in fact, have the audacity to tell my neighbor "there's a light on the meter if you have a leak." His meter is like a quarter mile from his house, first of all. and second of all, when they read the meter and saw the light, why didn't they go across the street and knock on his door, or make a note to call/mail?

I'll tell you why. $600

bdcravens

4 replies

2024-08-23 18:03:00 UTC

To be clear, in the case of those abusing the policy:

We reserve the right to cut you off.

Aeolun

3 replies

18h27m

2024-08-24 00:02:39 UTC

While obvious, doesn’t this make it the same as before?

tptacek

1 replies

17h30m

2024-08-24 00:58:58 UTC

The sentence he's referring to says that we might tell you after you've, uh, aggressively asked us to forgive a series of bills, that future accidents are no longer on the house. So no, it's not the same as before.

Aeolun

0 replies

13h51m

2024-08-24 04:38:03 UTC

I say that because I’ve had extensive GPU bills credited or forgiven before.

Presumably there was already a point where you were going to say “you keep messing up with the auto shut-off, at some point we’re going to stop refunding you”

I’ll say it’s nice to have it explicit.

lo3p0ez

0 replies

14h16m

2024-08-24 04:13:40 UTC

I trust them to handle even edgy cases more generous after this announcement. Just a gut feeling, as if they would risk losing significant credibility. I've written elsewhere that I'm not a big fan of those over-marketing posts, but they are small and more trustworthy than any of the large providers.

theusus

3 replies

22h39m

2024-08-23 19:50:08 UTC

Their docs are all over the place. I don't understand what their default offerings are. And that makes it hard for me to chose them.

tptacek

2 replies

22h34m

2024-08-23 19:55:05 UTC

What do you mean by "default offerings"? Thanks!

theusus

1 replies

21h23m

2024-08-23 21:05:58 UTC

https://www.heroku.com/

Checkout their products

taormina

0 replies

1h18m

2024-08-24 17:10:45 UTC

Heroku is not Fly.io. Heroku was bought by the MBAs a while back, and they do something different than generic cloud hosting (they are closer to a hosted EBS). If you bother scrolling at all, you'll notice immediately below the fold that the pricing is not remotely as clear as you're pretending it is.

ctennis1

3 replies

20h22m

2024-08-23 22:07:36 UTC

Google Cloud will not cut you a break unless you have an incredible about of pull. Even then, you won't get much of one.

smithcoin

0 replies

19h55m

2024-08-23 22:34:36 UTC

I'm suffering for this now. They just erroneously charged me almost $4,000 out of nowhere because they decided to scan all of old container images for no reason and without notification. I have a support partner and everybody is just passing the buck around, it's super frustrating.

campers

0 replies

10h14m

2024-08-24 08:15:18 UTC

Being at the level where you have an account manager should help. Our technical account manager always said if we ever accidentally rack up a bunch of cloud costs we should at always ask if we can get a refund, and as long as its infrequent there's a good chance to get some credits.

aidenn0

0 replies

19h21m

2024-08-23 23:08:13 UTC

Other comments here (e.g. https://news.ycombinator.com/item?id=41333600) suggest that this is not universally true.

Aeolun

3 replies

18h55m

2024-08-23 23:33:53 UTC

may only come within tens of dollars of accurately predicting your water bill

Uh, what is the magnitude of these water bills? I’m fairly certain I can predict it to within a few dollars because it’s only $35 in the first place.

umanwizard

1 replies

18h46m

2024-08-23 23:43:35 UTC

I have absolutely no idea how much I pay for water, but vaguely think it’s probably between 0 and 100. Why does knowing it’s small mean you can predict it precisely, in a world where auto-payment exists so you never have to actually look at your bills?

Aeolun

0 replies

18h7m

2024-08-24 00:21:45 UTC

I mean that it’d be extremely unlikely for me to not be able to predict my water bill to within ‘tens of dollars’, because that is the entire amount.

Also, I get receipts by postal mail even if I auto pay them, so I guess that makes it easier.

lupire

0 replies

18h53m

2024-08-23 23:36:33 UTC

My water bill is $200/quarter.

spencerchubb

2 replies

20h5m

2024-08-23 22:24:02 UTC

When I was in highschool, my friends and I were doing a small project. My friend accidentally ran Google's image recognition service in a while loop for a whole minute, and racked up a $300 bill. Thankfully GCP waived it for us.

reaperman

1 replies

20h3m

2024-08-23 22:26:07 UTC

I ran up a $4,000 bill with like a week of TPU time bc I didn’t realize that they kept running after the attached CPU turned off (unlike GPU offerings).

GCP also waived the whole thing - even refunding the $4k they had already pulled from my payment method.

spencerchubb

0 replies

20h0m

2024-08-23 22:29:24 UTC

That's brutal, and if you're anything like me, probably learned a lesson very thoroughly for the future!

scop

2 replies

23h57m

2024-08-23 18:31:56 UTC

One of the things I really admire about Fly is "how human they come across". Fly is a collection of people providing a computing service to other people. As a fellow human, there is a very primitive part of my brain that will always prefer "tech from humans" over "tech".

Given that the very teleology of cloud computing is digital & ephemeral resources, having an actual human face associated with it makes it tangible in a way that is hard to place.

My little brain can briefly understand complex computer systems on a need-to-know basis, with knowledge constantly coming and going based on the demands of the day. I can never fully "understand" my cloud infrastructure. But hey, Kurt's standing there. I like Kurt. I can understand Kurt as he's a human and so am I and he's in the same exact place I am. Let's work hard and make cool stuff, what do ya say Kurt?

scop

1 replies

23h52m

2024-08-23 18:37:27 UTC

I should add that another way Fly distinguishes itself as a "tangible" cloud company is that their blog art is all based on physical mediums. It feels very familiar and quantifiable to my brain.

tptacek

0 replies

23h43m

2024-08-23 18:45:45 UTC

That's Annie Ruygt, our in-house illustrator. Check out her work; she's amazing.

https://annieruygtillustration.com/

lolinder

2 replies

5h48m

2024-08-24 12:41:19 UTC

The Fully Automated Accident-Forgiving Billing System of the Future (which we are in fact building and may even one day ship) will give you a line-item veto on your invoice.

For a lot of the personal projects and early stage startups that are most terrified of these kinds of mistakes and therefore avoid autoscaling products like fly.io, we sincerely would rather have the entire account shut down when it goes over budget than ever see a bill that's higher than our net worth. A line item veto somewhat alleviates that concern but not fully.

They indicate at the end that they're going to do something along these lines, but what they're describing there also seems over-engineered compared to a simple circuit breaker that kills the account or some subset of it. Is there a good reason for these providers to avoid implementing that feature, which on a naive look seems far simpler than what they've actually proposed?

tptacek

1 replies

2h24m

2024-08-24 16:04:47 UTC

Something like half the comments on this story are a discussion of why or why not cloud providers do or don't provide this simple circuit breaker feature.

lolinder

0 replies

1h57m

2024-08-24 16:32:27 UTC

I read yours after posting this and it provides exactly zero information:

I promise, you are not the first person to have thought of this, and, believe it or not, there are reasons other than malice and avarice that cloud providers don't terminate service based on billing caps. Terminating service is a big deal.

"Terminating service is a big deal" how? I can explicitly cancel a subscription after a certain date—what is the problem with me explicitly cancelling a subscription after a certain amount of spend?

No one is asking for automatic circuit breakers applied to all customers indiscriminately, but I'm not seeing any justification in here for why an opt-in circuit breaker is technically or legally challenging to implement.

lagniappe

2 replies

2024-08-23 17:40:04 UTC

This shouldn't be a selling point, it should just be table stakes.

tptacek

0 replies

23h43m

2024-08-23 18:45:52 UTC

We agree.

sealeck

0 replies

2024-08-23 18:11:46 UTC

You can say this for a whole bunch of new features people role out but this doesn't erase the fact that they are very much not the stakes and introducing them is great because it raises them!

csours

2 replies

20h34m

2024-08-23 21:54:57 UTC

" You probably can’t tell me how much electricity your home is using right now, and may only come within tens of dollars of accurately predicting your water bill. But neither of those bills are all that scary, because you assume there’s a limit to how much you could run them up in a single billing interval. "

I had a $600 surprise water bill. It was (partially) forgiven because the water department could drive to my house and see evidence of the leak next to my water meter. It did turn out to be on my side of the meter, so it is my responsibility.

If the water department had driven to my house and seen evidence of commercial agriculture (so to speak), then it would not have been forgiven.

---

The parallel here is that the water department can't come into my house uninvited - the cloud provider SHOULD NOT have intimate access to the running code, but they are able to observe some patterns without 'breaking in'.

---

Side note: the size of the bill and the amount of forgiveness was largely driven by waiving an 'excess usage' surcharge - similar to how you can get a discount for cloud service reservations.

wrs

1 replies

18h52m

2024-08-23 23:36:57 UTC

Water billing is indeed nonlinear with no cap. I had a surprise $3000 excess water bill due to a broken connection at the meter. Followed by a $3000 excess sewer bill (!) because the assumption is if I used $3000 of water, it must have gone down the drain. However, if you demonstrate to an inspector that the water must have leaked below ground (as in this case) there’s a process for getting the second charge waived. Unfortunately the leak was on our side of the meter, so the first charge was correct, though we got partial forgiveness.

genewitch

0 replies

15h4m

2024-08-24 03:25:05 UTC

So, when they read the meter, and saw that it was flooded at the meter or nearby, they didn't think to mention that to you? I ask because you said "surprise bill" - another comment i made explains my neighbor's water meters have a light on them that lights up if there's a leak, which the meter reader can see and the homeowner usually cannot - the only reason to not notify the homeowner of an issue is because of $.

aidenn0

2 replies

19h24m

2024-08-23 23:05:09 UTC

IANAF, but Lower Montana seems like a terrible place to farm coffee beans?

tptacek

0 replies

17h30m

2024-08-24 00:59:34 UTC

I asked him about this and he didn't explain, but did demand that I keep it. He has a weird thing about Montana.

maverwa

0 replies

17h55m

2024-08-24 00:34:00 UTC

So Kurt is going to hate that job, too?

figassis

1 replies

11h23m

2024-08-24 07:06:03 UTC

This is why I love Hetzner. It has billing alerts. What was so difficult about it that fly had to rebuild their system?

Also, do everything except databases inside kubernetes. Deploy kubernetes across multiple clouds via wireguard. Label your instances properly on each provider. Prefer bare metal instances where available. Migrate your workloads accordingly. Force cloud providers to earn your money. Don’t however have both workloads running at the same time in multiple providers as you will eat insane data costs.

psd1

0 replies

5h34m

2024-08-24 12:54:51 UTC

I can't read the tone, but perhaps you haven't seen much technical debt. It's possible to have a fairly good codebase, full of good decisions, yet you still can't "get there from here".

Second paragraph was useful, thanks!

brundolf

1 replies

22h50m

2024-08-23 19:39:25 UTC

I've never understood why providers don't just give you the option of setting a cost cap: "If this piece of infra exceeds [5x expected usage cost], shut it down and send me an email"

lo3p0ez

0 replies

14h15m

2024-08-24 04:14:14 UTC

Deno recently did: https://deno.com/blog/deploy-spend-limits

aeternum

1 replies

23h40m

2024-08-23 18:49:19 UTC

New startup idea:

Pool unused "accident" credits, automate the forgiveness request, and sell it to the highest bidder.

datadrivenangel

0 replies

23h13m

2024-08-23 19:15:56 UTC

I believe that's called a reserved instance.

Poiesis

1 replies

23h32m

2024-08-23 18:57:21 UTC

Thomas, every time I read something you write it's a delight. I love your writing style, and it reads to me like you're always putting effort into making it succinct yet unambiguous, without unnecessary embellishment.

tptacek

0 replies

23h30m

2024-08-23 18:58:58 UTC

The trick is just that I write specifically for this place.

veggieroll

0 replies

27m

2024-08-24 18:01:48 UTC

This thread is full of people complaining that they just want the ability to set a hard billing cap. And yet, providers continue to insist that no serious customer wants this and/or it's impossible to implement.

What would it take for providers to listen to real customers here?

I have $25k in cloud spend that we absolutely cannot go one single cent over due to the politics of internal budgeting. That's my reality. If you want my $25k, I need to ensure that I don't spend more than this amount.

As is, my solution is to use old-school pre-rented, long-term contracted commitment VM hosting providers. This is really the only way to guarantee that you are paying an exact amount and no more.

But, I really would like to use a more scalable system that didn't require pre-provisioning. And, I wish people would believe the customer when they say something and not continue to gaslight us.

Providers say it's impossible, but I don't see how it would be so hard. Here's my sketch of how it could work:

The main component is a system that monitors billing events and watches for the slope of the bill to ensure that there is enough runway to stay under the cap. Optionally, they could implement rate limits on resource creation to ensure that a sudden surge doesn't outrun the monitoring.

You also need notifications for when the projected spend exceeds the cap. Optionally, you could implement a soft cap, where no new resources can be created.

And finally, you need the hard cap where things start to get deleted. If they're feeling generous, the provider could implement a period where VM's/lambda's/etc are shut off, blob storage is not accessible, and so on so that the account holder has some time to fund the account and/or fix whatever is causing the overage.

That set of features are all totally within the competency of a cloud provider. Knowing how much things cost, billing for them, and turning them on and off is their main business. And I can't believe that they expect us to believe that it's impossible to do that tracking.

This is how I do system migrations. There are escalating warnings until one day the service is shutoff for an hour and turned back on. That wakes up any laggards that missed the dozen or so communications over a three-to-six month period. Finally, after a few more days, the service is shutdown for good and then data is deleted a week after that. Though I almost always keep a copy in cold storage. But that isn't necessary as a provider with a limited relationship to the client.

ultra_nick

0 replies

19h44m

2024-08-23 22:45:15 UTC

Build everything on Docker, Kubernetes, and VPS.

Run on the cheapest VPS provider with human support.

Make a quick and easy switch if they get too shifty.

tonymet

0 replies

17h21m

2024-08-24 01:08:28 UTC

It's important that all utility metering and pricing is effectively made up to balance costs across customers.

That's why residents in california probably pay 100000x / unit of water than agriculture customers do .

The same works for the cloud. An operation that costs $100000 does not cost the operator $85000 in operations on the margins.

Yes the metering and pricing schedules are necessary, but punitive pricing for accidents really is just an artifact of the system.

I'm guessing the biggest reason AWS provides forgiveness is that if anyone took them to court over a bill, a court would throw out the charge once the wholesale cost was revealed.

srazzaque

0 replies

18h18m

2024-08-24 00:10:57 UTC

Whilst it doesn't have the mindshare or some features which are tablestakes for enterprise customers, I've found Linode's pricing to be extremely predictable for small projects. Even post Akamai acquisition. I'm sure other smaller players are also just fine.

Couple that with the fact you can achieve quite a bit on simple set-ups that are adequately sized to begin with, you can save quite a bit.

Not all of us need elasticity, or environments being spun up/down on commit.

nottorp

0 replies

8h18m

2024-08-24 10:10:59 UTC

Isn't it easier to set hard limits on the account and notify the client when they're close to them so they can decide to increase them on their own if needed?

But then you run into the risk that the client will never increase those hard limits and ... pay less than they could. Not good for business.

lucifargundam

0 replies

23h23m

2024-08-23 19:05:45 UTC

I just wanted to say I first heard of Fly.io from "The Changelog" podcast

knowitnone

0 replies

22h15m

2024-08-23 20:13:54 UTC

no matter what, you'll have some fraud. Perhaps a better solution is the customer has a know they can turn that sets max limits and some alerts on when usage is abnormal.

jowdones

0 replies

11h49m

2024-08-24 06:40:01 UTC

That's why my phone SIM is still not a subscription but rather I manually buy credit each month. The price is even lower than subscription and benefits exceed it, shitloads of Gb and minutes. If I'll use roaming I'll pay for a more expensive credit that month but I have piece of mind I won't be charged thousands of euro at the end of the month.

Like telecom providers of course cloud providers could have metered service billed under "get only as much as you paid" policy but it's obviously they totally and in bad faith don't want that.

campers

0 replies

10h13m

2024-08-24 08:16:22 UTC

Meanwhile: like every public cloud, we provision our own hardware, and we have excess capacity. Your messed-up CI/CD jobs didn’t really cost us anything

This^ Don't feel bad about asking for credits when you accidentally make a costly mistake.

bruce343434

0 replies

6h54m

2024-08-24 11:35:22 UTC

So AI is now being used on the Cloud to fight the Blockchain?