10 Top Big Data Startups in 2024

Office with laptop on the desk showing data.

As businesses across industries increasingly rely on data to drive decisions and gain competitive advantages, these cutting-edge companies are developing solutions that promise to revolutionize how we handle big data. From advanced data integration platforms to artificial intelligence (AI)-powered analytics tools, the following startups are at the forefront of the big data revolution — each offering technologies that could shape the future of data management and analysis. Let’s explore some of the most promising big data startups that industry watchers and potential investors should keep on their radar.

Big Data Companies to Follow

Every year, new startups pull to the forefront of their industry through exciting innovation and industry-disrupting business models. We’ve rounded up the most exciting big data startups of 2024 that startup-lovers, investors, and aspiring entrepreneurs should follow. 

Disclaimer: With so many exciting startups launching and growing worldwide, we aren’t able to cover them all. Furthermore, the startups that are listed below are not officially ranked and are listed in no particular order.

1. Airbyte

Airbyte is an open-source data extraction platform that helps businesses test and verify existing data pipelines. The company is dedicated to challenging existing data extractions by other big data companies to improve data sets and generate better business outcomes. Currently, it is building commercial cloud and enterprise editions of its data engine that will have even more capabilities. 

2. Cherre

  • Location: New York, New York
  • Founder(s): L.D. Salmanson, Ben Hizak
  • Founded In: 2016
  • Funding: Series C, $130 Million
  • Investors Include: Navitas Capital, Glilot Capital Partners, Carthona Capital

Cherre is a real estate data and analytics platform that aggregates and connects disparate real estate data sources. The company’s AI-powered platform provides a single source of truth for property and market information, enabling real estate professionals to make faster and more informed decisions. Cherre’s solutions help clients in areas like investment management, underwriting, and risk assessment by providing comprehensive, real-time insights into the real estate market.

3. FourKites

  • Location: Chicago, Illinois
  • Founder(s): Matt Elenjickal
  • Founded In: 2014
  • Funding: Series D, $292 Million
  • Investors Include: Basil Capital, Fabrica Ventures, Pine Grove Holdings

FourKites is a supply chain visibility platform that provides logistic companies with real-time data to be able to overhaul their supply chains to meet and exceed customer demand and expectations. Companies that partner with FourKites save money and improve the customer experience.

4. Inocras

  • Location: San Diego, California 
  • Founder(s): Dr. Young Seok Ju, Dr. Jeong Seok Lee
  • Founded In: 2020 
  • Funding: Series B, $52.2 Million
  • Investors Include: Ignite Innovation, Signite Partners, Samsung Venture Investment

Inocras is working to improve healthcare at the genetic level utilizing big data to translate patient genomic data fast and accurately. The company uses both technology and science to aggregate genetic sequences to find patterns that improve patient care. Inocras’ main areas of focus are genomics-driven precision oncology, solutions for rare and orphan diseases, infectious and inflammatory disease models, and high-end genomics research techniques.

5. Domino

  • Location: San Francisco, California 
  • Founder(s): Nick Elprin, Christopher Yang, Matthew Granade
  • Founded In: 2013
  • Funding: Series F, $224 Million
  • Investors Include: Sequoia Capital, Snowflake Ventures, Great Hill Partners 

Domino, formerly known as Domino Data Labs, is an enterprise machine learning operations (MLOps) platform that helps data science teams accelerate the development and deployment of machine learning models. The company’s platform provides a collaborative environment for data scientists to build, validate, and deploy models at scale while also offering governance and reproducibility features. Domino aims to help organizations become model-driven by streamlining the entire data science lifecycle — from ideation to production.

6. Bigeye

  • Location: San Francisco, California
  • Founder(s): Kyle Kirwan, Egor Gryaznov
  • Founded In: 2019
  • Funding: Series B, $66 Million
  • Investors Include: Coatue Management, Sequoia Capital, Costanoa Ventures

Bigeye is a big data analytics company that helps companies clean up their data and find anomalies that may distort data accuracy. The company’s mission is to improve data veracity (i.e., how accurate or truthful data is). With better data mining techniques and cleaner data, companies are able to make wiser business decisions that produce better outcomes.

7. Kestra 

  • Location: Paris, France
  • Founder(s): Emmanuel Darras, Ludovic Dehon
  • Founded In: 2019
  • Funding: Series A, $15.1 Million
  • Investors Include: Alven Capital Partners, Clément Delangue, Antoine Balliet

Kestra is a data integration platform that helps businesses streamline their data workflows. The company offers cloud-native, code-based data orchestration solutions that enable companies to build, run, and manage data pipelines efficiently. Kestra’s platform is designed to handle complex data processes, making it easier for organizations to transform and move data between various sources and destinations.

8. Deepnote

  • Location: San Francisco, California
  • Founder(s): Jakub Jurových
  • Founded In: 2019
  • Funding: Series A, $23.8 Million
  • Investors Include: Tiger Global Management, Dig Ventures, Accel

Deepnote is a special communications platform for data scientists and analytics professionals. Deepnote is where the industry goes to collaborate and develop some of the world’s best innovations in big data management and advancement. Fittingly, it is based in San Francisco like most of the other big data companies on our list.

9. Sniffie

  • Location: Helsinki, Finland
  • Founder(s): Tomi Grönfors, Niko Naakka, Tuomas Uutela
  • Founded In: 2017
  • Funding: Seed, $1.9 Million
  • Investors Include: Superhero Capital

Sniffie is a big data company that specializes in utilizing artificial intelligence (AI) to follow competitor pricing, find their top-selling items, and more. It also acts as a software as a service (SaaS) to modify and change prices in real time for companies to be as profitable as possible. For companies in the ecommerce industry, Sniffie can be quite valuable in sniffing out competitor analytics.

10. YugabyteDB

  • Location: Sunnyvale, California
  • Founder(s): Kannan Muthukkaruppan, Karthik Ranganathan, Mikhail Bautin
  • Founded In: 2016
  • Funding: Series C, $289 Million
  • Investors Include: Lightspeed Venture Partners, Meritech Capital Partners, Wipro Limited 

YugabyteDB is a big data storage company operating an open-source SQL database. It primarily serves customers in cybersecurity, financials, as well as technology companies, specifically those companies creating the Internet of Things (IoT).