The average cloud customer is paying 65% more for Kubernetes compute services than needed

Cast AI uses Kubernetes automation technology to optimize spending and performance for cloud-native apps by matching the right amount of computing power and memory to those apps.

Cast AI co-founder and Chief Product Officer Laurent Gil

Cast AI was born out of its co-founders’ frustrations with their cloud bills while they operated a prior startup.

Photo: Cast AI

Cloud customers pay an average three times more on cloud compute costs for AWS, Microsoft Azure and Google Cloud than they should, according to Cast AI. Helping them manage those costs is turning into a business itself.

The startup specializes in Kubernetes automation and cost optimization and reporting for cloud-native applications. Its platform uses artificial intelligence to identify which compute resources are needed for specific Kubernetes workloads and automatically selects the best combinations, configuring CPUs and memory to prevent over-provisioning. It continuously adds or removes resources as needed, ensuring customers aren’t overspending without compromising workload availability or performance, according to the company.

“It's impossible to do this exercise as a human,” co-founder and Chief Product Officer Laurent Gil said. “We decomplexify capabilities. We make Kubernetes or containers serverless by saying we're going to take care of the servers, and we will make the servers cost-efficient.”

Cast AI was born out of its co-founders’ frustrations with their cloud bills while they operated a prior startup: Zenedge, a cloud-based, AI-driven cybersecurity startup acquired by Oracle in 2018.

“In the beginning of that company, I would spend about $1,000 to $2,000 a month on AWS,” Gil said. “Three years later … that became $2 million dollars — by far the highest cost of the company, and we were very, very frustrated. We had a nice ride with customers, but every time we would add a client, our AWS bill would go through the roof.”

AWS’ answer was for Zenedge to prepay for three years to cut their cloud bill by 40%, but Zenedge didn’t want to be locked in, according to Gil. With Cast AI, they built the spending-management product they wished they had at the time.

The overspending tax

Companies using Cast AI’s services can reduce their cloud compute spending by 65% on average, according to Gil. Those services work with Amazon Elastic Kubernetes Service (EKS), Google Kubernetes Engine (GKE), Azure Kubernetes Service (AKS) and Kubernetes Operations (kOps) on AWS.

“The engine is instantly going to understand what applications you have … and how much compute and memory they currently consume, and how much they cost to run based on the machine that these applications are installed on,” Gil told Protocol. “Then we are going to give you another number, which is, ‘Hey, considering what this application does and uses, this should really be the cost.’”

While there are no big differences between the Big Three cloud providers’ prices, Gil said, within each cloud itself, there are cost differences when it comes to processors.

“Most … are cheaper with AMD than they are with Intel,” Gil said. “That makes our engine use more AMD sometimes for compute-intensive [workloads]. But the machine has been trained to know this, so we will always select the lowest-cost option.”

Cast AI savings Image: Cast AI

Cast AI is currently optimizing about 1,000 applications for hundreds of customers, according to Gil.

“One thing that was very surprising to us … is that the average cost-savings we provide to anybody using us … is 65%,” he said. “Sixty-five percent means you are spending three times more than you should on Amazon. So if you think of this the other way, you say, 'Well, out of $100 of your cloud bill, $66 of this is for Mr. Bezos, because it does nothing for you … and $33 is what you really use.'”

Cast AI says that, on average, its customers weren’t using 37% of the CPUs that they were paying for. They could save an additional 7% by changing one type of virtual machine (VM) for another and another 22% by switching VMs to discounted spot instances.

“We're not changing anything [with] our customer environment,” Gil said. “It's like … how you defragment disk drives. We defragment your application by moving the boxes around so that you can fill the machine more [by] using all the empty space.”

This is the way

It’s a task that’s impossible for developers to tackle on their own, and the cloud providers don’t make it easy, according to Gil.

One of Cast AI’s customers — an adtech company with a large consumer app in India — saw 84% in compute savings after turning on its engine, according to Gil. Another publicly traded company, a SaaS business, saw its cloud compute costs reduced by 72%.

Branch, a late-stage startup specializing in deep linking, mobile analytics and attribution, is a Cast AI customer that sees about 25 billion events per day and is running all of its compute inside Kubernetes clusters.

“Our cloud hosting needs to be very efficient to be able to process all that data in real time to be able to make real-time decisions … as well as to be able to aggregate and show all of the statistics inside of the analytics,” said Mark Weiler, Branch’s head of Engineering.

Branch, which uses AWS as its preferred cloud provider, started a proof of concept with Cast AI in May of 2021 and deployed it across all of its clusters within two months.

“They have saved us on the order of a couple million dollars per year on our AWS cloud bill, which is one of the highest ROI cost-savings projects that we've done in the past five or six years,” Weiler said. “The promise was they would allow us to dynamically determine what sorts of optimal spot instances to use based on our workloads without incurring any negative effects on our uptime SLAs [service-level agreements] when Amazon revokes those instances. They came through.”

“Manually configuring all that, keeping that up to date, having all the fallback scenarios set up and up to date, is extremely complicated to do on your own. It's begging for an automated solution that can monitor the actual spot market and your instances and determine what the optimal reallocation would be,” Weiler said.

Cast AI is currently adding new features for observability and cost-reporting, but Gil sees an opportunity to even further reduce other areas of customers’ cloud bills.

“We’re just scratching the surface,” he said.


New Jersey could become an ocean energy hub

A first-in-the-nation bill would support wave and tidal energy as a way to meet the Garden State's climate goals.

Technological challenges mean wave and tidal power remain generally more expensive than their other renewable counterparts. But government support could help spur more innovation that brings down cost.

Photo: Jeremy Bishop via Unsplash

Move over, solar and wind. There’s a new kid on the renewable energy block: waves and tides.

Harnessing the ocean’s power is still in its early stages, but the industry is poised for a big legislative boost, with the potential for real investment down the line.

Keep Reading Show less
Lisa Martine Jenkins

Lisa Martine Jenkins is a senior reporter at Protocol covering climate. Lisa previously wrote for Morning Consult, Chemical Watch and the Associated Press. Lisa is currently based in Brooklyn, and is originally from the Bay Area. Find her on Twitter ( @l_m_j_) or reach out via email (ljenkins@protocol.com).

Every day, millions of us press the “order” button on our favorite coffee store's mobile application: Our chosen brew will be on the counter when we arrive. It’s a personalized, seamless experience that we have all come to expect. What we don’t know is what’s happening behind the scenes. The mobile application is sourcing data from a database that stores information about each customer and what their favorite coffee drinks are. It is also leveraging event-streaming data in real time to ensure the ingredients for your personal coffee are in supply at your local store.

Applications like this power our daily lives, and if they can’t access massive amounts of data stored in a database as well as stream data “in motion” instantaneously, you — and millions of customers — won’t have these in-the-moment experiences.

Keep Reading Show less
Jennifer Goforth Gregory
Jennifer Goforth Gregory has worked in the B2B technology industry for over 20 years. As a freelance writer she writes for top technology brands, including IBM, HPE, Adobe, AT&T, Verizon, Epson, Oracle, Intel and Square. She specializes in a wide range of technology, such as AI, IoT, cloud, cybersecurity, and CX. Jennifer also wrote a bestselling book The Freelance Content Marketing Writer to help other writers launch a high earning freelance business.

Watch 'Stranger Things,' play Neon White and more weekend recs

Don’t know what to do this weekend? We’ve got you covered.

Here are our picks for your long weekend.

Image: Annapurna Interactive; Wizard of the Coast; Netflix

Kick off your long weekend with an extra-long two-part “Stranger Things” finale; a deep dive into the deckbuilding games like Magic: The Gathering; and Neon White, which mashes up several genres, including a dating sim.

Keep Reading Show less
Nick Statt

Nick Statt is Protocol's video game reporter. Prior to joining Protocol, he was news editor at The Verge covering the gaming industry, mobile apps and antitrust out of San Francisco, in addition to managing coverage of Silicon Valley tech giants and startups. He now resides in Rochester, New York, home of the garbage plate and, completely coincidentally, the World Video Game Hall of Fame. He can be reached at nstatt@protocol.com.


Debt fueled crypto mining’s boom — and now, its bust

Leverage helped mining operations expand as they borrowed against their hardware or the crypto it generated.

Dropping crypto prices have upended the economics of mining.

Photo: Lars Hagberg/AFP via Getty Images

As bitcoin boomed, crypto mining seemed almost like printing money. But in reality, miners have always had to juggle the cost of hardware, electricity and operations against the tokens their work yielded. Often miners held onto their crypto, betting it would appreciate, or borrowed against it to buy more mining rigs. Now all those bills are coming due: The industry has accumulated as much as $4 billion in debt, according to some estimates.

The crypto boom encouraged excess. “The approach was get rich quick, build it big, build it fast, use leverage. Do it now,” said Andrew Webber, founder and CEO at crypto mining service provider Digital Power Optimization.

Keep Reading Show less
Tomio Geron

Tomio Geron ( @tomiogeron) is a San Francisco-based reporter covering fintech. He was previously a reporter and editor at The Wall Street Journal, covering venture capital and startups. Before that, he worked as a staff writer at Forbes, covering social media and venture capital, and also edited the Midas List of top tech investors. He has also worked at newspapers covering crime, courts, health and other topics. He can be reached at tgeron@protocol.com or tgeron@protonmail.com.


How lax social media policies help fuel a prescription drug boom

Prescription drug ads are all over TikTok, Facebook and Instagram. As the potential harms become clear, why haven’t the companies updated their advertising policies?

Even as providers like Cerebral draw federal attention, Meta’s and TikTok’s advertising policies still allow telehealth providers to turbocharge their marketing efforts.

Illustration: Overearth/iStock/Getty Images Plus

In the United States, prescription drug advertisements are as commonplace as drive-thru lanes and Pete Davidson relationship updates. We’re told every day — often multiple times a day — to ask our doctor if some new medication is right for us. Saturday Night Live has for decades parodied the breathless parade of side effect warnings tacked onto drug commercials. Here in New York, even our subway swipes are subsidized by advertisements that deliver the good news: We can last longer in bed and keep our hair, if only we turn to the latest VC-backed telehealth service.

The U.S. is almost alone in embracing direct-to-consumer prescription drug advertisements. Nations as disparate as Saudi Arabia, France and China all find common ground in banning such ads. In fact, of all developed nations, only New Zealand joins the U.S. in giving pharmaceutical companies a direct line to consumers.

Keep Reading Show less
Hirsh Chitkara

Hirsh Chitkara ( @HirshChitkara) is a reporter at Protocol focused on the intersection of politics, technology and society. Before joining Protocol, he helped write a daily newsletter at Insider that covered all things Big Tech. He's based in New York and can be reached at hchitkara@protocol.com.

Latest Stories