Data & Analytics
58 companies in this sector.
| # | Company | Description | Location | Status | Funding |
|---|---|---|---|---|---|
| 01 | Databricks | Data and AI company that provides a unified analytics platform built on Apache Spark, enabling data engineering, data science, and machine learning at scale. | San Francisco, US | Private | $20.0B |
| 02 | Scale AI | AI data infrastructure company that provides data labeling, model evaluation, and software to accelerate the development of AI applications for enterprises, government agencies, and AI labs. | San Francisco, US | Private | $15.9B |
| 03 | Nebius Group | AI cloud infrastructure company that builds full-stack infrastructure for AI, including large-scale GPU clusters, cloud platforms, and developer tools for AI builders. Formerly Yandex N.V., rebranded in 2024 after divesting Russian operations. | Amsterdam, Netherlands | Public | $3.7B |
| 04 | Palantir Technologies | Data analytics and software company building platforms for government intelligence, defense, and commercial enterprises, enabling organizations to integrate, manage, and analyze large datasets for operational decision-making. | Denver, US | Public | $2.9B |
| 05 | Celonis | Global leader in process mining and execution management, using AI to analyze enterprise business processes and identify optimization opportunities. Founded by three TU Munich students. | Munich, Germany | Private | $1.8B |
| 06 | Snowflake | Cloud-based data warehousing company providing a platform for data storage, processing, and analytics that runs on AWS, Azure, and Google Cloud, enabling organizations to consolidate data into a single source of truth. | Bozeman, US | Public | $1.6B |
| 07 | VAST Data | AI-powered data platform that unifies storage, database, and compute infrastructure for enterprise and AI workloads at scale, reaching $2B ARR and serving as critical data infrastructure for hyperscale AI training. | New York, United States | Private | $1.4B |
| 08 | Opendoor | Opendoor is a digital real estate platform that pioneered the iBuying model, enabling homeowners to sell their homes online with instant cash offers. The company uses data science and pricing algorithms to buy, renovate, and resell residential homes. | San Francisco, United States | Public | $1.3B |
| 09 | Freenome | Biotech company developing AI-powered multiomics blood tests for early cancer detection. Its platform analyzes cell-free DNA, methylation patterns, and protein biomarkers from routine blood draws to screen for cancers in their earliest stages. Lead product SimpleScreen targets FDA approval for colorectal cancer screening. | South San Francisco, California, United States | Private | $1.3B |
| 10 | ClickHouse | Open-source columnar database management system designed for real-time analytics and AI workloads. Originally developed at Yandex and open-sourced in 2016, it became an independent company in 2021. | San Francisco, US | Private | $1.1B |
| 11 | Samsara | Pioneer of the Connected Operations Cloud, combining IoT hardware (sensors, telematics devices, AI-powered dash cams) with cloud-based software to help organizations improve safety, efficiency, and sustainability. Serves tens of thousands of customers across transportation, logistics, construction, and manufacturing. Went public on the NYSE in December 2021. | San Francisco, United States | Public | $930M |
| 12 | Grafana Labs | Open-source observability company providing visualization, monitoring, and analytics tools for metrics, logs, and traces. Builds and maintains the widely used Grafana dashboard and a composable observability stack. | New York, United States | Private | $804M |
| 13 | Fivetran | Automated data movement platform that reliably moves data from 700+ sources including SaaS applications, databases, and files to data warehouses, data lakes, and other destinations. | Oakland, US | Private | $725M |
| 14 | Cribl | Data engine for IT and security teams that provides an observability pipeline platform, enabling enterprises to route, reduce, enrich, and transform streaming data from any source to any destination in real time. | San Francisco, United States | Private | $715M |
| 15 | Domo | Domo is a cloud-based business intelligence platform that integrates data from multiple sources to provide real-time insights and data visualization. Founded by Omniture co-founder Josh James, Domo helps organizations make data-driven decisions through customizable dashboards and collaborative analytics tools. | American Fork, United States | Public | $693M |
| 16 | Cockroach Labs | Developer of CockroachDB, a cloud-native distributed SQL database designed for global, scalable, and resilient applications that survive disasters and maintain consistency across multiple regions. | New York, United States | Private | $633M |
| 17 | Supabase | Open-source alternative to Firebase built on Postgres, providing developers with a backend platform including database, authentication, real-time subscriptions, storage, and edge functions. | Singapore, Singapore | Private | $496M |
| 18 | Confluent | Enterprise data streaming platform built on Apache Kafka. Founded by the creators of Kafka at LinkedIn, Confluent provides the infrastructure for real-time data pipelines and event-driven architectures. | Mountain View, United States | Acquired | $456M |
| 19 | Tomorrow.io | Weather intelligence and climate adaptation platform providing real-time forecasting, enterprise APIs, and its own constellation of weather radar satellites. Formerly ClimaCell. | Boston, US | Private | $422M |
| 20 | dbt Labs | dbt Labs is the company behind dbt (data build tool), an open-source analytics engineering framework that enables data teams to transform data in their warehouses using SQL. Originally founded as Fishtown Analytics, the company signed a definitive agreement to merge with Fivetran in October 2025 in an all-stock deal, creating a combined data infrastructure company approaching $600M in ARR. | Philadelphia, United States | Private | $416M |
| 21 | Starburst | Starburst is the commercial company behind Trino (formerly PrestoSQL), the open-source distributed SQL query engine. The platform provides a data lakehouse analytics layer that lets enterprises query data across any source without requiring data movement or migration. | Boston, United States | Private | $414M |
| 22 | Benchling | Benchling is a cloud-based life sciences R&D platform that helps biotech and pharmaceutical companies manage experiments, track data, and accelerate research and development. The platform replaces physical lab notebooks and spreadsheets with collaborative, AI-ready digital tools used by over 1,200 customers worldwide. | San Francisco, US | Private | $412M |
| 23 | Foursquare | Foursquare is a location technology platform that provides geospatial data and intelligence to help businesses make smarter decisions and create engaging customer experiences. Originally launched as a check-in app at SXSW in 2009, the company evolved into the industry-leading platform for spatial analytics, powering location services for apps like Uber, Spotify, Apple Maps, and Airbnb. | New York, United States | Private | $379M |
| 24 | Amplitude | Amplitude is a product analytics platform that helps companies understand user behavior to build better digital products. Originally incubated at Y Combinator (W12) as a voice-to-text app called Sonalight, the founders pivoted to analytics after realizing the internal tools they built were more valuable than the app itself. The company pioneered the product intelligence category and went public via direct listing on NASDAQ in September 2021. | San Francisco, US | Public | $336M |
| 25 | Alation | Alation is the pioneer of the data catalog category, providing an agentic data intelligence platform that helps enterprises discover, govern, and trust their data assets. The platform uses AI-powered search, automated documentation, and lineage tracking. Over 40% of the Fortune 100 are customers, and the company surpassed $100M in annual recurring revenue in 2022. | Redwood City, United States | Private | $315M |
| 26 | Actifio | Actifio pioneered copy data virtualization, reducing unnecessary duplication of enterprise data for backup, disaster recovery, and DevOps. The company's software enabled businesses to manage virtual copies of data across on-premises and cloud environments, serving over 3,700 customers before being acquired by Google in December 2020. | Waltham, Massachusetts, United States | Acquired | $312M |
| 27 | MongoDB | Provides a developer data platform built around its flagship document-oriented NoSQL database, offering cloud database services (MongoDB Atlas), enterprise server, and related tools for building modern applications at scale. | New York, United States | Public | $311M |
| 28 | Segment | Customer data infrastructure platform that collects, cleans, and routes customer data to hundreds of analytics, marketing, and data warehouse tools through a single API. Founded by four MIT students as part of Y Combinator S11. | San Francisco, US | Acquired | $282M |
| 29 | Mixpanel | Product analytics platform that helps teams understand user behavior by tracking event-based interactions across web and mobile applications. Customers include OpenAI, Netflix, Uber, and Pinterest. | San Francisco, US | Private | $277M |
| 30 | Optimizely | Digital experience optimization platform providing A/B testing, experimentation, and personalization tools for websites, mobile apps, and connected devices. Co-founded by two former Google product managers, one of whom worked on digital optimization for the Obama 2008 campaign. Acquired in 2020 by Episerver, which then adopted the Optimizely name. | San Francisco, US | Acquired | $251M |
| 31 | Heap | Heap is a digital insights platform that automatically captures every user interaction on websites and mobile apps, including clicks, taps, swipes, form submissions, and page views, without requiring manual tracking code. Unlike traditional analytics tools that require engineers to instrument events upfront, Heap retroactively analyzes user behavior from day one. The platform served over 10,000 companies including Twilio, Zendesk, and Liberty Mutual. Acquired by Contentsquare in December 2023. | San Francisco, US | Acquired | $205M |
| 32 | Labelbox | Data-centric AI platform providing tools for data labeling, annotation, and model training, enabling teams to build and improve machine learning and generative AI applications. The platform serves as the interface for human experts to create high-quality training data at scale, with a network of over one million domain experts through its Alignerr product. | San Francisco, United States | Private | $189M |
| 33 | Nominal | Nominal builds the unified, real-time test stack for physical systems. The platform helps engineering teams developing complex hardware (aircraft, satellites, autonomous vehicles, fusion energy systems, weapons programs) to test, validate, and monitor their systems continuously. | Los Angeles, US | Private | $183M |
| 34 | Elastic | Open-source search and analytics company behind Elasticsearch, Kibana, and the Elastic Stack. Provides enterprise search, observability, and security solutions used by thousands of organizations worldwide for log analysis, application monitoring, and threat detection. | San Francisco, United States | Public | $162M |
| 35 | Profound | AI search visibility platform helping businesses understand and optimize how they appear in AI-powered search results. Provides analytics and tools for brands to track their presence across AI assistants like ChatGPT, Claude, Perplexity, and Google AI Overviews. | New York, US | Private | $155M |
| 36 | ActionIQ | ActionIQ is an enterprise customer data platform (CDP) that enables large organizations to unify, activate, and orchestrate customer experiences across all channels without complex data movement. Built by the founders of Aster Data (acquired by Teradata for $325M), the platform uses a composable architecture that keeps data securely in place while making it accessible for marketing teams. ActionIQ served major enterprises including The New York Times, Shopify, American Eagle Outfitters, and Hertz before being acquired by Uniphore in December 2024. | New York, United States | Acquired | $145M |
| 37 | Pinecone | Pinecone is the leading vector database platform for building accurate, performant AI applications at scale. Founded by former AWS Director of Research Edo Liberty, the company provides a fully managed, serverless infrastructure that makes it easy to connect enterprise data with large language models and other AI systems. | New York, United States | Private | $138M |
| 38 | Apptio | Apptio is an enterprise technology business management software company that helps IT leaders manage the cost, quality, and value of IT services. The platform provides financial transparency into technology spending, enabling data-driven decisions about IT investments. Acquired by IBM in 2023 for $4.6 billion after being taken private by Vista Equity Partners in 2019. | Bellevue, United States | Acquired | $136M |
| 39 | Neon | Serverless Postgres platform that separates storage and compute, enabling autoscaling, database branching, and scale-to-zero for developers and AI agents. Acquired by Databricks for ~$1B in May 2025. | San Francisco, United States | Acquired | $126M |
| 40 | Braintrust | AI-native observability and evaluation platform that helps engineering and product teams evaluate, log, and monitor AI agents and large language model interactions in production. Built on a custom database optimized for massive AI trace data, the platform enables teams to run experiments against real datasets, compare prompts side-by-side, catch regressions in CI, and inspect every trace with real-time latency, cost, and quality metrics. | San Francisco, US | Private | $124M |
| 41 | MotherDuck | MotherDuck is a serverless cloud data warehouse built on the open-source DuckDB database. The platform combines the speed and simplicity of local analytics with the scalability of the cloud, enabling data teams to query data without managing infrastructure. | Seattle, United States | Private | $100M |
| 42 | Panorama Education | Panorama Education is a K-12 data analytics platform that helps school districts collect and act on survey data about social-emotional learning, school climate, and student outcomes. Trusted by 2,000+ districts serving 15 million students across all 50 states, the company provides research-backed surveys and analytics tools for educators. | Boston, United States | Private | $92M |
| 43 | Nansen | Nansen is a blockchain analytics platform that enriches on-chain data with millions of wallet labels across multiple blockchains. Crypto investors and institutions use Nansen to discover opportunities, perform due diligence, and defend portfolios with real-time dashboards and alerts, tracking over 244 million labeled wallets across 11+ blockchains. | Singapore, Singapore | Private | $88M |
| 44 | Alluxio | Alluxio is a data orchestration and AI acceleration platform that provides a unified data access layer between compute frameworks and storage systems. Originating from founder Haoyuan Li's PhD research at UC Berkeley's AMPLab, the platform enables high-performance data access across hybrid and multi-cloud environments. Alluxio powers workloads at nine of the world's ten largest internet companies. | San Mateo, United States | Private | $82M |
| 45 | Gather AI | Leader in Physical AI for logistics, deploying autonomous drones inside warehouses to scan and track inventory using AI-powered computer vision. Spun out of Carnegie Mellon University's Robotics Institute. | Pittsburgh, US | Private | $71M |
| 46 | Urban SDK | Urban SDK is a geospatial AI platform providing a 'System of Action' for local and state governments. The platform equips over 300 civic leaders across 40 states with actionable insights and automation for mission-critical decisions in public safety, transportation, infrastructure, and administration, operating at roughly ten cents on the dollar versus traditional approaches. | Jacksonville, Florida, United States | Private | $71M |
| 47 | AnyRoad | AnyRoad is an experience relationship management (ERM) platform that helps brands operate, measure, and optimize experiential marketing programs. The platform connects back-end processes like booking, ticketing, and payments with first-party data capture, consumer feedback, and analytics. AnyRoad serves major brands including Anheuser-Busch, Nike, and others seeking to quantify the ROI of in-person experiences. | San Francisco, United States | Private | $66M |
| 48 | Airware | Airware was a commercial drone operating system and platform company that provided hardware, software, and cloud services enabling enterprises to turn aerial data into actionable business intelligence. Based in San Francisco, the company raised over $118M from top-tier investors before shutting down in September 2018, with its assets acquired by French drone company Delair. | San Francisco, United States | Closed | $66M |
| 49 | Doxel | Doxel uses computer vision and deep learning to automate progress tracking on construction sites. By analyzing 360-degree video from job sites, the platform measures actual construction progress against plans and schedules, providing predictive analytics to help owners and general contractors eliminate cost overruns and schedule delays. | Redwood City, California, United States | Private | $57M |
| 50 | Nimble Way | Israeli web data infrastructure company providing an AI-powered platform for web data collection and delivery. Enables businesses to access structured web data at scale for market intelligence, pricing analytics, and competitive research. | Tel Aviv, Israel | Private | $47M |
| 51 | Validio | Enterprise data quality platform that uses machine learning to automatically detect, alert, and fix data quality issues. Helps companies ensure their data pipelines produce reliable data for AI models, analytics, and business-critical applications. | Stockholm, Sweden | Private | $47M |
| 52 | Coactive AI | AI-powered platform that helps enterprises unlock insights from unstructured image and video data. Coactive's multimodal application platform enables users to search, organize, analyze, and generate metadata from visual content without requiring pre-existing metadata or manual tagging. | San Jose, California, United States | Private | $44M |
| 53 | Cask | Cask developed CDAP, an open-source data application platform that served as an abstraction layer above Apache Hadoop, enabling developers to build large-scale analytics applications without deep Hadoop expertise. The platform provided enterprise-grade governance, portability, security, and scalability for big data workloads. Notable customers included AT&T, Cloudera, and Salesforce. | Palo Alto, United States | Acquired | $33M |
| 54 | 1touch.io | 1touch.io provides a sensitive data intelligence and orchestration platform that discovers, classifies, contextualizes, and enriches data across all datasets and environments. The platform gives enterprises a comprehensive, unified view of their information landscape for data privacy and security. Originally incubated in JVP's Cyber Labs in Israel. | New York, US | Acquired | $32M |
| 55 | Condor Software | AI-powered financial intelligence platform for the biopharma industry. Provides pharmaceutical companies with data-driven insights for commercial planning, market access, and competitive intelligence using machine learning and proprietary datasets. | San Diego, United States | Private | $24M |
| 56 | Halcyon | AI-powered energy intelligence platform built on the most comprehensive catalog of US energy regulatory data, spanning all 50 state public utility commissions, every ISO/RTO, and FERC. Helps utilities, developers, hyperscalers, and investors make faster decisions about energy infrastructure. | New York, United States | Private | $21M |
| 57 | Gorilla Technology | Global provider of AI-powered edge computing, video intelligence, IoT security, and cybersecurity solutions. Specializes in edge AI for real-time data processing, serving government institutions, telecom companies, and enterprises in Asia Pacific, the Middle East, and other regions. | London, UK | Public | $15M |
| 58 | 7Rivers | 7Rivers is a technology services company and certified Snowflake Elite Partner that helps enterprises harness data and AI for real business value. Services span data migration, Data Vault 2.0 implementation, generative AI solutions, data science, and managed services across insurance, banking, software, manufacturing, and healthcare. | Milwaukee, US | Private | $11M |