Databases will be a $100 billion market. Neo4j’s CEO just needs a sliver.

The graph database company raised $325 million in June, a round that Neo4j said is the largest ever for a database startup.

Neo4j CEO Emil Eifrem is eyeing the database industry split.

Photo: Neo4j

In Neo4j CEO Emil Eifrem's mind, the database industry is split into two camps: systems that deal with historical data and those that support real-time processing.

Neo4j would be in the latter category. It's a subsector that Eifrem said is dominated by six players: Microsoft, Google Cloud, AWS, Redis Labs, MongoDB and, of course, Neo4j. But Eifrem is betting that owning just a sliver of the booming market will be lucrative.

"The database market is the single biggest one in all of enterprise software. It's about $50 billion today. But it's going to be $100 billion" in just a few years, he told Protocol. "If you are the leader of one of these big, new segments, those are massive categories. They're way bigger than what the relational database was, for example, in the late '80s, early '90s."

Other leaders in the real-time processing sector are likely to disagree with Eifrem's viewpoint. But Neo4j is a bit different in that it's peddling a newer type of architecture called graph databases, systems that take the growing volume of information companies are collecting and draw immediate connections within it, something that Eifrem argued is impossible with the tabular databases of the past.

The company is not alone among graph database companies; AWS, for example, launched its own graph database called Neptune in 2018. But Neo4j is well on the way to establishing itself as a leader. The company raised $325 million in June — the most ever for a database startup, according to Neo4j — and is considering an IPO in the next few years, according to a marketing slide viewed by Protocol.

In an interview with Protocol, Eifrem talked about why he thinks the split within the market will grow more pronounced and why enterprises are increasingly picking databases tailored to specific end applications.

This interview has been edited and condensed for clarity.

What are graph databases and why did you have to create a new category within this industry? What is it about the historical systems that make them unable to meet the demands of today?

The modern landscape I think of in two broad buckets. On the left side is the operational data stores: developers are building applications, those applications use the database. On the right is data warehouses; that's the analytical data stores. They store historical data. On the left-hand side, we have the system of record for now. On the right-hand side, it's the system of record for history.

On that right-hand side, there's five platforms emerging — Snowflake, Databricks, Microsoft, AWS and Google Cloud — and everything else circulates around them. And there's a ton of innovation happening right outside of those five. On the left-hand side, you have the cloud platforms, one company that has gone public, which is MongoDB, and two companies that are coming up behind: Redis Labs and Neo4j.

Graph databases are focused on connected data. As the real world is becoming more connected, the data is becoming more connected. And the challenge with that is that you can't store and connect the data in a good way in a tabular database. Fundamentally that's what we're optimized for: finding patterns in connected data.

You've discussed a future where you can have very application-specific databases based on the best fit for whatever that application might be. Can you unpack that? Ultimately, how many database vendors do you think enterprises are going to use?

This has two different answers depending on if you sit on the left-hand or the right-hand side of [the industry]. On the right-hand side, it will actually converge into one category. Currently, it's two different paradigms: that data scientist-centric paradigm that Databricks represents and the business data analyst-centric paradigm that Snowflake represents. That's converging into one category.

On the left-hand side, though, and that's where I spend most of my time, I see a few new categories emerging. We have the broader what I call "document plus-plus" space; that's not an established term, but that's what I call it. This is where MongoDB and Couchbase live. But if you look at actual customer projects, then Redis Labs, DataStax, Cassandra, Couchbase, MongoDB, they all compete for the same slot in those architectures.

How many of these different moving parts do we really want in an application, right? With graph databases, it's like stock ticker symbols or sensor data, which you can't really store in a good way in other types of databases. Then there's newSQL, where a company like Cockroach Labs lives.

Those are the major categories. And in 2030, those are [still] going to be the major categories. And then underlying all of this, of course, is the relational database, which will be around forever. And big banks like UBS, Citi or JPMorgan are going to have strategic vendors for each and every one. It's not going to be more than that. Because ultimately you want to reduce complexity and don't want too many moving parts in your architecture.

How much share of the market can you get? You're trying to tackle a more specialized area, which would seem to eventually have some sort of cap on it?

There's always subcategories in massive markets. And the database market is the single biggest one in all of enterprise software. It's about $50 billion today. But it's going to be $100 billion, already in a few years, like 2024 or 2025 depending on who you ask. All of that growth is driven by these new segments that we're talking about. If you are the leader of one of these big, new segments, those are massive categories. They're way bigger than what the relational database was, for example, in the late '80s, early '90s. So it's very sizable.

From a technology standpoint, what is going to be the main impediment for, say, Databricks to serve as both the analytic and the operational engine within organizations?

No one is gonna be able to straddle both of those completely. It's too big and too complex. The way that you have to write a kernel that is good for the analytical data source versus one that is good for the operational data source is just completely different. And it's not one of those things where if you throw enough money at it, [it's possible]. It comes down to very clear and specific technical tradeoffs.

How do customers not get confused or overwhelmed by all of the different kinds of strategies that other vendors are also promoting?

Increased funding makes for a noisier environment. The flip side of this is: If you add up all the analytical and operational data stores, it looks very, very crowded. But really, if you look at the operational data stores, there's four or five that matter now. That's the big change in the last two, three years. There's a handful of us that have really truly achieved scale. And that makes it less confusing.

You're asking customers to add more complexity to their tech stack. When you think about the return on investment, what's going to be the benefit that outweighs that increased complexity?

It's very easy, because the world is just becoming increasingly connected. And if you're not capable of digitizing those connections, you're going to be left behind. If you're a bank and you use graph databases, you'll be able to capture fraud rings for your fraud detection. We frequently see banks getting a 5% uplift in the number of fraud cases that they can capture. If [your] competitors can capture fraud rings but [you] can't, ultimately, they're going to outcompete. It's coming back to the business value of being able to operate on top of connections.

What is going to stop the hyperscalers from cutting vendors like yourself out of the equation?

We get adopted through developers. The fact that we're the leader in terms of the developer community and data science community, that's huge. That's what makes MongoDB Atlas work. Specifically for enterprises, the fact that we are multicloud, that's massive.

Today, multicloud is an absolute requirement for many enterprise CIOs. And if there's one area of their IT architecture where they care the most, it's for the data. Having that managed by a multicloud offering so they're not beholden to one of the platforms, that's really important for them.

