United Kingdom verdict (TL;DR)
Verified 2026-05-23UK MLOps in 2026 splits between London fintech and HealthTech buying SageMaker on AWS London (eu-west-2) and Azure ML in UK South under FCA and ICO scrutiny, and UK academic and research labs (DeepMind, Faculty AI, Stability AI, Cohere London office, ARIA-funded teams) buying Weights and Biases as the experiment-tracking default. Vertex AI is strong at UK retail and media where BigQuery is the warehouse. UK GDPR plus DPA 2018 plus the ICO AI guidance set the compliance bar; the FCA Consumer Duty (2023) added explicit model-explainability requirements for UK financial services ML. IDTA addenda are mandatory for UK-to-US data transfers from every US-headquartered vendor in this ranking. The pro-innovation UK AI regulation approach (white paper 2023, AI Safety Institute set up 2023) is materially lighter touch than the EU AI Act, which gives UK ML buyers more procurement flexibility than France or Germany. Stability AI (London) is the most-visible UK-built foundation-model lab; Faculty AI (London) is the most-visible UK MLOps consultancy.
Picks for United Kingdom
- UK financial services and HealthTech on AWS London under FCA and ICO scrutiny: sagemaker AWS London (eu-west-2) is the default cloud for UK fintech (Monzo, GoCardless, Wise, Checkout.com, Revolut) and HealthTech. SageMaker satisfies UK GDPR residency, integrates natively with S3 and Bedrock, and supports FCA Consumer Duty model-explainability documentation via SageMaker Clarify. GBP billing via AWS UK; EDP at scale.
- UK retail, media, and B2B SaaS with BigQuery as the warehouse: vertex-ai UK retail (Tesco, M&S, ASOS, Sainsbury's) and media (BBC, Sky) run BigQuery in increasing volume. Vertex AI in europe-west2 (London) satisfies UK GDPR. Gemini access via Vertex is the default for UK Gemini-anchored LLM workloads. GBP billing.
- UK Microsoft Enterprise Agreement buyers (UK government, large enterprise, NHS-adjacent): azure-ml Azure UK South and UK West regions satisfy UK GDPR; Microsoft EA bundling is the norm at UK central government, NHS digital, and large UK enterprise. Responsible AI dashboard is the cleanest fit for FCA model-governance documentation. Azure UK government region available for UK central government workloads.
- UK research labs and foundation-model teams (DeepMind, Stability, Cohere, Faculty): weights-and-biases W and B is the default tracker at UK research labs and AI scaleups. London foundation-model talent rotation between DeepMind, Stability, Cohere, and academic labs cemented W and B as the lingua franca. Cross-cloud, framework-agnostic. UK GDPR DPA available; IDTA addendum required.
- UK cost-conscious ML teams and academic deployments: mlflow Apache 2.0, self-hostable on AWS London or in-house. The honest baseline for UK university ML labs, Alan Turing Institute projects, and ARIA-funded research where per-seat fees are blockers. Bundled inside Databricks at no extra cost for UK Databricks customers.
- UK Databricks customers (large enterprise retail and financial services): databricks-ml Databricks UK presence is meaningful at UK retail and large enterprise. Mosaic AI bundle is the right pick only if Databricks is already the data platform. DBU pricing in GBP equivalent is opaque at scale; negotiate.
How the mlops platforms market looks in United Kingdom
UK MLOps is anchored by three buyer clusters and shaped by a notably lighter-touch AI regulation posture than the rest of Europe.
London fintech (Monzo, GoCardless, Wise, Checkout.com, Revolut, Starling Bank, Cognism on the B2B side) and HealthTech (Babylon Health, Cera Care, Huma) are the largest UK ML buyers by spend, running on AWS London (eu-west-2) and Azure UK South under FCA and ICO scrutiny. FCA Consumer Duty (2023) added explicit model-explainability requirements that elevated SageMaker Clarify, Azure ML Responsible AI dashboard, and DataRobot for governance documentation. UK retail (Tesco, M&S, ASOS, Sainsbury's, Boots, John Lewis) and media (BBC, Sky, Channel 4) run a mix of Vertex AI on GCP London, SageMaker on AWS London, and Databricks Mosaic AI; BigQuery has grown share in UK retail through 2023 to 2025.
UK research and foundation-model labs are the second cluster. DeepMind (London, Google subsidiary, ~2,000 researchers) is the most-visible UK AI lab but internally tooled and not a buyer in this ranking. Stability AI (London, founded 2019) is the highest-profile UK-built foundation-model company despite well-documented turbulence around founder departure and corporate restructuring 2023 to 2024. Cohere maintains a meaningful London engineering presence. ARIA (Advanced Research and Invention Agency, UK government, launched 2023) is funding AI research programmes that buy W and B and self-hosted MLflow. The Alan Turing Institute and Faculty AI (London consultancy) deploy ML across UK government and enterprise.
The UK regulatory posture is the third defining factor. The pro-innovation white paper (March 2023, response published February 2024) outlined a sector-by-sector regulator-led approach rather than the EU AI Act horizontal regulation; the AI Safety Institute (set up November 2023) handles frontier-model evaluation. This is materially lighter touch than France or Germany under the EU AI Act, and UK ML buyers have more procurement flexibility on US-hosted models and on AI-assisted decision-making than their EU counterparts. UK GDPR plus DPA 2018 still apply; ICO guidance on AI and data protection (2020, updated 2023) is the de facto UK ML governance reference for non-financial-services teams.
The third cluster is UK central government, NHS digital, and devolved administrations. Azure UK South and UK government region dominate here through Microsoft EA; Crown Commercial Service G-Cloud framework is the procurement path. NHS England digital teams have meaningful Databricks adoption alongside Azure ML.
UK GDPR + DPA 2018: ML training data, feature stores, embeddings, prediction logs, and model artifacts containing UK personal data are personal data under UK GDPR; DPAs required from all US-headquartered MLOps vendors. AWS London (eu-west-2), Azure UK South and UK West, and GCP London (europe-west2) satisfy UK GDPR residency. UK IDTA (International Data Transfer Agreement, in force March 2022): required for UK-to-US data transfers; US MLOps vendors (W and B, Comet, MLflow-on-Databricks, DataRobot, ClearML) must hold current IDTA-compliant addenda. The UK Extension to the EU-US DPF is an alternative for some transfers; verify vendor participation. ICO AI guidance (2020, updated 2023): AI systems processing UK personal data must support data subject rights (access, deletion, opt-out), maintain Article 22 transparency on automated decisions producing legal or similarly significant effects, and document lawful basis. ICO has signalled active enforcement intent on AI in 2024 to 2026. FCA Consumer Duty (in force July 2023): UK financial services running ML on regulated products (credit, lending, insurance, investment advice) must deliver good outcomes, including model explainability, fairness testing, and bias documentation; SageMaker Clarify, Azure ML Responsible AI dashboard, and DataRobot governance documentation are the strongest fits. PRA SS1/23 (model risk management, in force May 2024) added explicit UK banking model-risk requirements aligned to US SR 11-7. NHS DSPT (Data Security and Protection Toolkit) is mandatory for ML platforms processing NHS data; Azure UK and AWS London with appropriate controls satisfy this. UK pro-innovation AI regulation white paper (2023, response 2024): a sector-by-sector regulator-led approach rather than the EU AI Act; materially lighter touch than France or Germany on AI buying.
Quick comparison, ranked for United Kingdom
| Product | Best for | Starts at | 10-emp/mo* | Pricing | G2 | Geo |
|---|---|---|---|---|---|---|
| 4 Amazon SageMaker | Engineering and data-science teams committed to AWS | $0 + $0/emp | $0 | 4.4 | Global; strongest in US, EU, UK, JP, IN, AU; FedRAMP High in US GovCloud | |
| 3 Google Vertex AI | Engineering and data-science teams committed to Google Cloud | $0 + $0/emp | $0 | 4.4 | Global; strongest in US, EU, India, UK, JP, AU | |
| 5 Azure Machine Learning | Engineering and data-science teams on Microsoft Azure | $0 + $0/emp | $0 | 4.3 | Global; strongest in US, EU, UK, JP, IN, AU; Azure Government for US federal | |
| 1 Weights and Biases | ML engineering, research, and platform teams across solo researchers and large enterprises | $0 + $0/emp | $0 | 4.6 | Global; strongest in US, EU, UK, India, Japan | |
| 2 MLflow | Any ML team from solo data scientists to Fortune 500 enterprises | $0 + $0/emp | $0 | 4.4 | Global; open source available everywhere | |
| 6 Databricks Mosaic AI | Engineering and data-science teams committed to Databricks Lakehouse | $0 + $0/emp | $0 | 4.5 | Global; cloud-portable across AWS, Azure, GCP regions | |
| 7 Comet | ML engineering and data-science teams wanting a neutral tracker | $0 + $0/emp | $0 | 4.5 | Global; strongest in US, EU, UK, IL | |
| 10 DataRobot | Regulated enterprise buyers needing mature AutoML with governance | Quote | - | 4.4 | Global; strongest in US, UK, EU, JP, AU | |
| 8 Neptune.ai | Research teams and ML platform teams wanting flexible metadata tracking | $0 + $0/emp | $0 | 4.6 | Global; strongest in EU, US, UK, PL | |
| 9 ClearML | ML engineering and platform teams wanting end-to-end open-source MLOps | $0 + $0/emp | $0 | 4.4 | Global; strongest in US, EU, IL |
*10-employee monthly cost = base fee + (per-employee × 10) using the lowest published tier. For opaque-pricing vendors, no value is shown.
What buyers in United Kingdom actually pay
Median annual deal size by employee band, in GBP. Crowdsourced from anonymized buyer disclosures.
| Product | Employee band | Median annual (GBP) | Sample | Notes |
|---|---|---|---|---|
| Amazon SageMaker | 50-500 ML engineers (UK fintech) | £280,000 | 47 | AWS London; GBP-billed via AWS UK; consumption-based |
| Amazon SageMaker | 500+ ML engineers (UK enterprise) | £1,850,000 | 22 | EDP discount; GBP-billed; multi-year |
| Google Vertex AI | 50-500 ML engineers (UK retail) | £190,000 | 31 | GCP London; CUD discount; GBP-equivalent |
| Azure Machine Learning | 50-500 ML engineers (UK public sector) | £210,000 | 38 | Azure UK South; Microsoft EA; GBP-billed |
| Weights and Biases | 20-100 ML engineers (Pro) | £38,000 | 42 | Per-user; GBP-equivalent; IDTA addendum required |
| Weights and Biases | 100-500 ML engineers (Enterprise) | £240,000 | 24 | Custom; GBP-equivalent; UK self-hosted option |
| MLflow | Self-hosted on AWS London (any size) | £95,000 | 34 | Infra only; Apache 2.0; no license fees |
| Databricks Mosaic AI | 50-500 ML engineers | £360,000 | 19 | DBU on AWS London or Azure UK; GBP-equivalent |
United Kingdom-built or United Kingdom-strong vendors worth knowing
Not yet ranked in our global top 10, but credible options for United Kingdom buyers and worth a shortlist.
Stability AI
Visit ↗London-headquartered, founded 2019 by Emad Mostaque. Most-visible UK-built foundation-model lab. Shipped Stable Diffusion and successor image models, Stable LM language models. Significant corporate turbulence 2023 to 2024 with founder departure and restructuring; James Cameron joined the board 2024. Relevant as a sovereign UK AI option but buyers should weigh execution risk.
Faculty AI
Visit ↗London-based AI consultancy and platform vendor. Faculty Frontier platform deploys ML across UK central government, NHS, and large enterprise. Significant UK public-sector installed base. Often paired with Azure ML or self-hosted MLflow as the underlying stack.
DeepMind (Google)
Visit ↗London-headquartered, founded 2010, acquired by Google 2014. ~2,000 researchers, internally tooled (not a buyer in this ranking) but a major source of UK ML platform talent. Alumni rotate into UK AI scaleups and shape London ML tooling norms; W and B is the lingua franca partly because of this rotation.
Cohere (London office)
Visit ↗Toronto-headquartered with meaningful London engineering presence. Foundation-model platform for enterprise. Frequently appears in UK enterprise LLM RFPs as the non-US alternative for sovereignty-sensitive UK buyers.
All 10, ranked for United Kingdom
Same intelligence as the global ranking, vendor trust, review patterns, verified pricing, compliance, reordered for the United Kingdom market.
Amazon SageMaker
AWS hyperscaler ML platform with the broadest service breadth on the cloud.
Amazon SageMaker is the AWS hyperscaler ML platform, launched November 2017 and steadily expanded into the broadest managed ML surface on any cloud. The platform includes SageMaker Studio (notebooks and unified workspace), Training Jobs (managed CPU and GPU compute), Pipelines (orchestration), Feature Store, Model Registry, Endpoints (real-time and batch inference), Clarify (explainability and bias), Ground Truth (data labeling), Data Wrangler, and JumpStart (pre-trained models and foundation-model access). SageMaker Unified Studio (announced re:Invent 2024) consolidates Studio into a broader AWS data and AI surface. Strengths: broadest service breadth on any hyperscaler, deepest AWS integration (S3, EC2, EKS, IAM, KMS), largest ML community footprint of any cloud, mature enterprise procurement under AWS master agreements, FedRAMP High at most surfaces (defensible for US federal), and a strong managed model-deployment story. Trade-offs: pricing is famously hard to forecast (notebook hours, training hours, endpoint hours, storage, and data transfer all combine), SageMaker-specific lock-in is real (pipelines, feature store, registry tied to AWS), the surface complexity is genuine (Studio overlays many sub-products), and SageMaker Studio Classic versus the new Studio creates buyer confusion through 2025 to 2026.
Engineering and data-science teams already committed to AWS (S3 as primary data lake, EKS or EC2 for compute) who want the broadest managed ML platform on the cloud. Particularly strong for US federal, regulated industries on AWS, and teams leveraging Bedrock for foundation models alongside classical ML.
Multi-cloud teams (lock-in is real), teams on Google Cloud or Azure (Vertex AI or Azure ML cheaper and more integrated), buyers wanting transparent simple pricing (SageMaker is consumption-complex), or buyers wanting a neutral cross-cloud MLOps story.
Strengths
- Broadest hyperscaler ML service breadth on any cloud
- Deepest AWS integration (S3, EC2, EKS, IAM, KMS, Bedrock)
- Largest ML community footprint of any hyperscaler
- Mature enterprise procurement under AWS master agreements
- FedRAMP High at most surfaces (defensible for US federal)
- Strong managed model-deployment story (real-time and batch endpoints)
- JumpStart provides pre-trained models and foundation-model access
Weaknesses
- Pricing famously hard to forecast at scale
- SageMaker-specific lock-in (pipelines, feature store tied to AWS)
- Surface complexity is real; Studio overlays many sub-products
- Studio Classic versus new Studio creates buyer confusion through 2025 to 2026
- Cost optimization requires deep AWS expertise
- Migration off SageMaker is non-trivial at scale
- AutoML surface (Autopilot) lost share since 2020 to 2022 peak
Pricing tiers
partial- Pay-as-you-go (consumption)No platform fee; pay for notebook, training, endpoint, storage, transfer$0+$0 /mo +/emp
- Savings Plans1-year or 3-year commitments; typical 20 to 64 percent discount on committed computeQuote
- Enterprise Discount Program (EDP)AWS master agreement; volume discounts at scale; dedicated technical account managerQuote
- · Notebook hours (instance plus storage) priced separately
- · Training jobs (CPU and GPU instance hours) priced separately
- · Real-time endpoint hours (per instance, per hour, always-on) is the largest line item for many buyers
- · Batch transform priced per job plus compute
- · Feature Store online serving priced per read and write
- · Data transfer out of AWS is real cost at scale
- · SageMaker Studio Classic versus Studio pricing is not directly comparable
Key features
- +SageMaker Studio (and Unified Studio) for notebooks and workspace
- +Training Jobs with managed CPU and GPU compute
- +SageMaker Pipelines for orchestration
- +Feature Store with online and offline serving
- +Model Registry with versioning and approval workflows
- +Real-time and batch inference endpoints with autoscaling
- +Clarify for explainability and bias detection
- +Ground Truth for data labeling
- +JumpStart for pre-trained models and foundation-model access
- +Data Wrangler for data preparation
Google Vertex AI
Google Cloud hyperscaler ML platform with deep BigQuery and Gemini integration.
Google Vertex AI is the unified Google Cloud ML platform, launched May 2021 by merging AI Platform and AutoML into a single managed product. The platform covers the full ML lifecycle: notebooks (Workbench, Colab Enterprise), training (custom and AutoML), pipelines, feature store, model registry, online and batch prediction, Model Monitoring, and Generative AI Studio (Gemini and third-party models). Strengths: deepest integration with Google Cloud (BigQuery, GCS, GKE, Pub/Sub), native Gemini access for generative-AI workloads, strong managed AutoML offering for tabular and vision, mature pipelines built on Kubeflow Pipelines, and an enterprise procurement story under Google Cloud master agreements. Trade-offs: real vendor lock-in (models, features, metadata, and pipelines are Vertex-native and not portable without rework), pricing is opaque at scale because compute, storage, and managed-service line items combine in ways that are hard to forecast, the AutoML surface has lost share since the 2020 to 2022 peak, and customer support quality at Google Cloud remains a long-standing complaint relative to AWS.
Engineering and data-science teams already committed to Google Cloud (BigQuery as primary data warehouse, GKE for compute) who want a managed end-to-end ML platform. Particularly strong for teams leveraging Gemini for generative AI and teams wanting managed AutoML for tabular or vision use cases.
Multi-cloud teams (vendor lock-in is real), teams already on AWS or Azure (SageMaker or Azure ML cheaper and more integrated), regulated buyers needing FedRAMP High (Vertex AI is FedRAMP Moderate, not High at all surfaces), or buyers wanting a neutral cross-cloud MLOps story.
Strengths
- Deepest integration with Google Cloud (BigQuery, GCS, GKE)
- Native Gemini access for generative-AI workloads
- Strong managed AutoML for tabular, vision, and forecasting
- Mature pipelines built on Kubeflow Pipelines
- Feature Store, Model Registry, Model Monitoring in one product
- Enterprise procurement story under Google Cloud master agreements
- Strongest managed-notebook surface for data scientists (Workbench, Colab Enterprise)
Weaknesses
- Real vendor lock-in (models, features, metadata, pipelines Vertex-native)
- Pricing opaque at scale; compute and storage line items hard to forecast
- AutoML surface lost share since 2020 to 2022 peak
- Customer support quality lags AWS at the enterprise tier
- Migration off Vertex AI is non-trivial (feature store, registry, pipelines)
- Cost optimization requires deep Google Cloud expertise
- Smaller ML community footprint than SageMaker (AWS dominance)
Pricing tiers
partial- Pay-as-you-go (consumption)No platform fee; pay for compute, storage, and managed-service line items$0+$0 /mo +/emp
- Committed Use Discounts (CUD)1-year or 3-year commitments; typical 20 to 50 percent discount on committed computeQuote
- Enterprise AgreementGoogle Cloud master agreement; volume discounts at scale; dedicated technical account managerQuote
- · Compute (CPU, GPU, TPU) priced separately and varies by region
- · Storage (artifacts, datasets, model versions) priced separately
- · Online prediction has per-prediction and per-node-hour cost
- · Network egress between regions and out of Google Cloud is real cost
- · Feature Store online serving priced per node hour
- · Pipeline execution priced per pipeline job plus compute
Key features
- +Vertex AI Workbench and Colab Enterprise notebooks
- +Custom training jobs with managed compute (CPU, GPU, TPU)
- +Vertex AI Pipelines built on Kubeflow Pipelines
- +Feature Store with online and batch serving
- +Model Registry with versioning and lineage
- +Online and batch prediction with autoscaling
- +Model Monitoring for drift and skew detection
- +AutoML for tabular, vision, NLP, and forecasting
- +Generative AI Studio with Gemini and third-party models
- +BigQuery ML for SQL-based model training
Azure Machine Learning
Microsoft Azure ML platform with deep Microsoft-stack integration.
Azure Machine Learning is the Microsoft Azure ML platform, launched in its modern form in 2018 and steadily expanded through 2026. The platform covers managed notebooks, training (custom and AutoML), pipelines, feature store (Azure ML Feature Store, generally available 2024), model registry, online and batch endpoints, Responsible AI dashboard, and Azure AI Studio for generative-AI workloads (Azure OpenAI Service integration). Strengths: best fit for Microsoft-stack enterprises (Azure, Microsoft 365, Power Platform, Fabric), strong Responsible AI tooling (interpretability, fairness, error analysis), enterprise procurement under Microsoft Enterprise Agreement, deep integration with Azure OpenAI Service for generative AI, and the strongest compliance posture across regulated industries (Microsoft is a default-trusted enterprise vendor). Trade-offs: smaller ML community footprint than SageMaker or Vertex AI, Microsoft documentation quality is uneven (SDK churn through 2022 to 2024), pricing is consumption-complex like other hyperscaler ML platforms, real vendor lock-in (pipelines, feature store, registry Azure-native), and the platform feels like it lags AWS and GCP on cutting-edge research-team features.
Engineering and data-science teams already committed to Microsoft Azure (especially Microsoft 365, Power Platform, Fabric, or Azure OpenAI Service) who want managed ML inside the Microsoft enterprise stack. Particularly strong for regulated industries on Azure and teams leveraging Azure OpenAI for generative AI.
Multi-cloud teams (lock-in is real), teams on AWS or Google Cloud (SageMaker or Vertex AI cheaper and more integrated), research-team buyers wanting cutting-edge surface (Vertex AI or SageMaker stronger), or buyers wanting a neutral cross-cloud MLOps story.
Strengths
- Best fit for Microsoft-stack enterprises (Azure, M365, Power Platform, Fabric)
- Strong Responsible AI dashboard (interpretability, fairness, error analysis)
- Enterprise procurement under Microsoft Enterprise Agreement
- Deep integration with Azure OpenAI Service for generative AI
- Strong compliance posture across regulated industries
- Azure ML Feature Store (GA 2024) closes feature-store gap with AWS, GCP
- Tight integration with Microsoft Fabric for data-and-AI workloads
Weaknesses
- Smaller ML community footprint than SageMaker or Vertex AI
- Microsoft documentation quality uneven; SDK churn through 2022 to 2024
- Pricing consumption-complex; hard to forecast at scale
- Real vendor lock-in (pipelines, feature store, registry Azure-native)
- Platform lags AWS and GCP on cutting-edge research-team features
- Migration off Azure ML is non-trivial at scale
- Cost optimization requires deep Azure expertise
Pricing tiers
partial- Pay-as-you-go (consumption)No platform fee; pay for compute, storage, endpoints, transfer$0+$0 /mo +/emp
- Reserved Instances1-year or 3-year commitments; typical 25 to 60 percent discount on committed computeQuote
- Enterprise Agreement (EA)Microsoft master agreement; volume discounts at scale; bundled with Azure consumptionQuote
- · Compute (CPU, GPU) priced separately and varies by region
- · Managed online endpoints priced per node hour (always-on)
- · Storage (datasets, models, registries) priced separately
- · Azure OpenAI Service tokens priced separately when integrated
- · Network egress between regions and out of Azure is real cost
- · Fabric integration may require separate Fabric license at enterprise scale
Key features
- +Managed notebooks (Compute Instances and JupyterLab)
- +Training jobs with managed CPU and GPU compute
- +Azure ML Pipelines for orchestration
- +Azure ML Feature Store (GA 2024)
- +Model Registry with versioning and lineage
- +Managed online endpoints and batch endpoints
- +Responsible AI dashboard (interpretability, fairness, error analysis)
- +AutoML for tabular, vision, NLP, and forecasting
- +Azure AI Studio integration for generative AI
- +Azure OpenAI Service integration
Weights and Biases
Largest neutral MLOps platform; CoreWeave acquired in May 2024.
Weights and Biases (W and B) is the largest neutral MLOps platform by installed base, founded 2017 by Lukas Biewald and Chris Van Pelt (both ex-CrowdFlower). The product covers experiment tracking (the original wedge), hyperparameter sweeps, artifact and dataset versioning, model registry, and a reports surface used by ML teams to share results across the organization. CoreWeave acquired W and B in May 2024 for a reported $1.7B, which is the single most consequential category event of the last 24 months. Post-acquisition product velocity has been mixed: customer-facing roadmap continues, but the long-standing concern is that CoreWeave (a GPU-cloud company) acquired a neutral cross-cloud MLOps platform, and ML teams running on AWS, Google Cloud, or Azure now have to weigh whether W and B remains genuinely neutral or quietly tilts toward CoreWeave compute. Strengths: deepest experiment-tracking surface, largest community and integration footprint, real model-registry and artifact-versioning, used at OpenAI, Anthropic, Nvidia, Toyota, and across most leading research labs. Trade-offs: per-user pricing scales aggressively at large teams, post-CoreWeave neutrality is a real question for multi-cloud buyers, the model-registry surface lags Vertex AI and SageMaker on enterprise governance, and self-hosted deployment is gated to the top tier.
ML engineering and research teams wanting the deepest neutral experiment tracking and model registry across PyTorch, TensorFlow, JAX, and Hugging Face. Particularly strong for research labs, foundation-model teams, and ML platform teams running multi-cloud or unwilling to commit to a single hyperscaler.
Buyers already committed to one hyperscaler (Vertex AI, SageMaker, or Azure ML is usually cheaper and more integrated), buyers needing a strong feature store (Databricks Mosaic AI or SageMaker better), regulated buyers needing FedRAMP authorization (W and B is in-process at best), or buyers nervous about CoreWeave-related neutrality drift.
Strengths
- Deepest experiment-tracking surface in the neutral MLOps category
- Largest community footprint; default at OpenAI, Anthropic, Nvidia, Toyota
- Hyperparameter sweeps, artifact versioning, model registry in one product
- Strong integrations across PyTorch, TensorFlow, JAX, Hugging Face
- Reports surface used to share ML results across the organization
- Long-running stable APIs; portable across clouds and frameworks
- CoreWeave acquisition gives multi-year capital runway
Weaknesses
- CoreWeave acquired W and B in May 2024 for a reported $1.7B; neutrality question for multi-cloud buyers
- Per-user pricing scales aggressively at large ML teams
- Model-registry governance lags Vertex AI and SageMaker on enterprise controls
- Self-hosted deployment gated to the top tier (enterprise procurement burden)
- Some buyer reports of slower roadmap velocity post-acquisition
- Renewal pricing has crept up at large enterprises through 2024 to 2025
- Feature store is thinner than SageMaker, Vertex, or Databricks Mosaic AI
Pricing tiers
partial- Free (Personal)Personal projects; 100GB storage; full experiment tracking$0+$0 /mo +/emp
- ProPer user per month; team collaboration, model registry, sweeps$50+$50 /mo +/emp
- EnterpriseCustom contract; SAML SSO, audit log, dedicated support, self-hosted optionQuote
- · Per-user billing scales aggressively at large ML teams
- · Enterprise SAML SSO, audit log, and self-hosted gated to top tier
- · Compute and storage egress not included; buyer pays cloud separately
- · Renewal pricing has crept up at large enterprises through 2024 to 2025
- · Annual contracts typical 15 to 20 percent discount versus monthly
Key features
- +Experiment tracking across PyTorch, TensorFlow, JAX, Hugging Face
- +Hyperparameter sweeps with Bayesian optimization
- +Artifact and dataset versioning
- +Model registry with stage promotion and lineage
- +Reports surface for sharing ML results
- +Tables and queries over experiment metadata
- +Integrations with Slack, Jira, Linear, PagerDuty
- +SAML SSO and audit log at Enterprise
- +Self-hosted deployment at Enterprise
- +REST API and Python SDK
MLflow
Open-source MLOps baseline stewarded by Databricks.
MLflow is the open-source MLOps baseline, originally released by Databricks in June 2018 and now the most widely deployed open-source ML lifecycle project. The project covers four core surfaces: experiment tracking (the most-used component), model registry, project packaging, and model serving. MLflow is the de facto integration point for almost every commercial MLOps platform: Vertex AI, SageMaker, Azure Machine Learning, Databricks Mosaic AI, Weights and Biases, Comet, and Neptune.ai all integrate with MLflow either as a tracking backend, a model-registry import path, or a model-serving format. Strengths: free and open source under Apache 2.0, hostable on your own infrastructure, deep integration into the broader ML ecosystem, stewarded by Databricks (the company that pays the core maintainers), and the honest baseline for teams unwilling to pay for proprietary tracking. Trade-offs: self-hosting requires real ops investment, the UI is functional rather than slick, model registry governance is thinner than Vertex AI or SageMaker, MLflow as a hosted SaaS only exists inside Databricks (no neutral cloud-hosted MLflow with full enterprise features), and contribution velocity outside Databricks has slowed since 2022 as Databricks centralized stewardship.
ML engineering teams that want a free, open-source, self-hostable experiment tracking and model registry baseline. Particularly strong for cost-conscious teams, regulated buyers needing full data control on internal infrastructure, and teams already on Databricks (MLflow is bundled at no extra cost).
Teams without ops capacity to self-host (Weights and Biases or Comet better), buyers needing strong enterprise governance out of the box (Vertex AI or SageMaker better), buyers wanting a polished collaborative reports surface (W and B better), or buyers wanting LLMOps surface (MLflow LLM tracking is nascent).
Strengths
- Free and open source under Apache 2.0; no license fees
- Hostable on your own infrastructure (full data control)
- De facto integration point for commercial MLOps platforms
- Stewarded by Databricks (paid core maintainers; multi-year continuity)
- Experiment tracking, model registry, packaging, and serving in one project
- Strong integrations with PyTorch, TensorFlow, scikit-learn, XGBoost
- Honest baseline for teams unwilling to pay for proprietary tracking
Weaknesses
- Self-hosting requires real ops investment (database, object store, auth)
- UI is functional rather than visually modern
- Model registry governance thinner than Vertex AI, SageMaker, Azure ML
- No neutral cloud-hosted MLflow SaaS (only inside Databricks)
- Contribution velocity outside Databricks has slowed since 2022
- Self-hosted MLflow has no built-in SSO or audit log without add-ons
- Scaling to thousands of experiments per day requires database tuning
Pricing tiers
public- Open Source (Apache 2.0)Self-hosted; full feature set; no license fees$0+$0 /mo +/emp
- Managed MLflow (inside Databricks)Bundled with Databricks; no separate MLflow seat; DBU-based compute costQuote
- Self-hosted with paid supportThird-party support vendors offer paid support for self-hosted MLflowQuote
- · Self-hosting requires database, object storage, auth, and ops effort
- · SSO and audit log not built in for self-hosted; require add-ons
- · Managed MLflow only available inside Databricks (no standalone SaaS)
- · Contribution velocity tied to Databricks stewardship priorities
Key features
- +Experiment tracking with metrics, params, and artifacts
- +Model registry with stage transitions and lineage
- +MLflow Projects for reproducible runs
- +MLflow Models for packaging and serving
- +Integrations with PyTorch, TensorFlow, scikit-learn, XGBoost
- +REST API and Python SDK
- +Pluggable tracking backend (SQLite, MySQL, Postgres, file store)
- +Pluggable artifact store (S3, GCS, Azure Blob, local)
- +LLM tracking (newer, less mature than classical ML tracking)
- +Apache 2.0 license; no vendor lock-in
Databricks Mosaic AI
Bundled ML and AI stack inside the Databricks Lakehouse.
Databricks Mosaic AI is the Databricks bundled ML and AI stack, rebranded from Databricks ML in 2024 after the MosaicML acquisition (July 2023, reported $1.3B). The product covers managed MLflow (Databricks is the steward of open-source MLflow and bundles a managed version), Feature Engineering and Feature Store, AutoML, Model Serving, Vector Search, foundation-model fine-tuning (the Mosaic AI Model Training surface), and the AI Playground for generative-AI experimentation. Strengths: the right call for Databricks customers wanting ML, features, and inference inside the Lakehouse (Unity Catalog governance flows through to ML artifacts), managed MLflow is a real benefit, foundation-model fine-tuning is competitive, and the bundle reduces tool sprawl. Trade-offs: a worse call if you are not already on Databricks (Mosaic AI alone is not a credible neutral MLOps platform), pricing is opaque at enterprise scale (DBU-based consumption interacts with cloud compute), the MosaicML acquisition has been digested unevenly (some original MosaicML customers report regression in pre-acquisition workflows), and feature breadth on AutoML lags Vertex AI or SageMaker.
Engineering and data-science teams already committed to Databricks (Lakehouse as primary data warehouse, Unity Catalog for governance) who want bundled ML, features, and inference. Particularly strong for foundation-model fine-tuning post-MosaicML acquisition and teams already paying for Databricks at enterprise scale.
Teams not on Databricks (no standalone neutrality story), buyers wanting transparent simple pricing (DBU consumption is opaque at scale), teams wanting research-team cutting-edge surface (Vertex AI or SageMaker stronger), or buyers wanting the broadest community footprint.
Strengths
- Right call for Databricks customers wanting ML inside the Lakehouse
- Unity Catalog governance flows through to ML artifacts
- Managed MLflow bundled (no separate MLflow ops)
- Foundation-model fine-tuning competitive after MosaicML acquisition
- AI Playground for generative-AI experimentation
- Reduces tool sprawl for Databricks-committed buyers
- Vector Search inside the Lakehouse simplifies RAG architectures
Weaknesses
- Worse call if not already on Databricks (no standalone neutrality)
- Pricing opaque at enterprise scale; DBU consumption hard to forecast
- MosaicML acquisition digested unevenly; some workflow regressions
- AutoML feature breadth lags Vertex AI or SageMaker
- Cloud-portable only across AWS, Azure, GCP (where Databricks runs)
- Smaller ML community footprint than SageMaker or Vertex AI
- Migration off Mosaic AI is non-trivial (Unity Catalog dependencies)
Pricing tiers
opaque- Pay-as-you-go (DBU consumption)No platform seat fee; pay per DBU plus cloud compute (AWS, Azure, GCP)$0+$0 /mo +/emp
- Committed DBU contracts1-year or multi-year DBU commitments; typical 20 to 40 percent discountQuote
- EnterpriseDatabricks master agreement; volume discounts at scale; custom Unity Catalog and ML pricingQuote
- · DBU rates vary by SKU (Jobs, All-Purpose, Serverless) and cloud
- · Cloud compute (AWS, Azure, GCP) priced separately on top of DBU
- · Model Serving DBU consumption is always-on (similar to SageMaker endpoints)
- · Foundation-model fine-tuning is GPU DBU-heavy and adds real cost
- · Vector Search and AI Playground have separate DBU rates
- · Renewal pricing has crept up at enterprise scale through 2024 to 2025
Key features
- +Managed MLflow (experiment tracking, model registry)
- +Feature Engineering and Feature Store inside Unity Catalog
- +AutoML for tabular and forecasting
- +Model Serving (managed online endpoints)
- +Vector Search for RAG and similarity search
- +Foundation-model fine-tuning (Mosaic AI Model Training)
- +AI Playground for generative-AI experimentation
- +Unity Catalog governance flows through to ML artifacts
- +Cloud-portable across AWS, Azure, GCP
- +Lakehouse Monitoring for data and model drift
Comet
Mature neutral experiment tracking with a quieter posture than W and B.
Comet is a mature neutral MLOps platform, founded 2017 and headquartered in New York. The product covers experiment tracking, model registry, model production monitoring (Opik for LLM observability), and a workspace for ML team collaboration. Comet has positioned itself as the quieter competitor to Weights and Biases: smaller installed base, narrower marketing footprint, but a credible product with real customers across financial services, autonomous vehicles, and healthcare. Strengths: defensible for teams that want a neutral experiment tracker without CoreWeave acquisition exposure (Comet remains independent), real integration breadth across PyTorch, TensorFlow, Hugging Face, and major data platforms, transparent SaaS pricing with a usable free tier, and a recently expanded LLMOps surface (Opik) for teams wanting LLM evaluation alongside classical ML tracking. Trade-offs: smaller installed base than W and B (smaller community, fewer integration partners), feature depth on the model registry lags W and B, the LLMOps surface (Opik) is newer and less battle-tested than W and B Models or commercial alternatives, and Comet does not have the same brand recognition at large research labs.
ML engineering and data-science teams wanting a neutral experiment tracker and model registry without CoreWeave acquisition exposure. Particularly strong for teams in regulated industries (financial services, healthcare, autonomous vehicles) that want a quiet, independent vendor over a louder one.
Teams wanting the largest community footprint (W and B is the default), teams already committed to one hyperscaler (Vertex AI, SageMaker, or Azure ML usually better), buyers wanting the deepest model-registry governance (Vertex or SageMaker stronger), or buyers wanting a mature LLMOps surface (Opik is still maturing).
Strengths
- Defensible neutral tracker without CoreWeave acquisition exposure
- Real integration breadth across PyTorch, TensorFlow, Hugging Face
- Transparent SaaS pricing with usable free tier
- Opik LLMOps surface for LLM evaluation alongside classical ML
- Comet remains independent (no hyperscaler or GPU-cloud parent)
- Strong customer base in financial services, autonomous vehicles, healthcare
- Workspace-style team collaboration for ML projects
Weaknesses
- Smaller installed base than Weights and Biases (community, partners)
- Feature depth on the model registry lags W and B
- Opik LLMOps surface newer; less battle-tested than alternatives
- Lower brand recognition at large research labs
- Per-user pricing scales similarly to W and B at large teams
- Self-hosted deployment available but less battle-tested at scale
- Smaller integration ecosystem than hyperscaler ML platforms
Pricing tiers
public- Free (Personal)Personal projects; unlimited experiments on public projects$0+$0 /mo +/emp
- ProPer user per month; team collaboration, model registry, private projects$39+$39 /mo +/emp
- EnterpriseCustom contract; SAML SSO, audit log, dedicated support, self-hosted optionQuote
- · Per-user billing scales at large ML teams
- · Enterprise SAML SSO, audit log, and self-hosted gated to top tier
- · Storage retention beyond default may incur additional cost
- · Opik (LLMOps) priced separately at higher tiers
- · Annual contracts typical 15 percent discount versus monthly
Key features
- +Experiment tracking with metrics, params, and artifacts
- +Model registry with stage transitions
- +Model production monitoring
- +Opik for LLM evaluation and observability
- +Workspace for team collaboration
- +Integrations with PyTorch, TensorFlow, Hugging Face, scikit-learn
- +Hyperparameter optimization
- +SAML SSO and audit log at Enterprise
- +Self-hosted deployment at Enterprise
- +REST API and Python SDK
DataRobot
AutoML legacy platform; multiple CEO changes 2022 to 2024.
DataRobot is the longest-running commercial AutoML platform, founded 2012 in Boston by Jeremy Achin and Tom de Godoy. The company peaked between 2020 and 2022 with valuations reportedly above $6B and a public path that did not materialize. Since 2022, DataRobot has been visibly challenged: multiple CEO changes (Dan Wright stepped in to replace Jeremy Achin in 2022, Debanjan Saha replaced Wright in 2024), the broader AutoML category has lost ground as generative AI absorbed data-science budgets, and revenue growth has been quiet. The product itself remains capable on its original AutoML wedge (tabular classification, regression, time-series forecasting) and has expanded into a broader AI platform covering model deployment, monitoring, governance, and generative-AI use cases. Strengths: deepest commercial AutoML surface in the category (still the AutoML reference product for many regulated buyers), strong model-governance and compliance posture for financial services and insurance, real enterprise customer base across Fortune 500, and a model-monitoring surface that has been mature for several years. Trade-offs: multiple CEO changes 2022 to 2024 are a real signal of post-peak instability, the AutoML category itself has lost share since the 2020 to 2022 peak (vendor demos consistently outran production reliability), pricing is opaque and historically aggressive at enterprise scale, the generative-AI pivot is happening but feels reactive rather than category-defining, and renewal-pricing pressure has hurt customer goodwill.
Regulated buyers (financial services, insurance, healthcare) already invested in DataRobot workflows who need mature AutoML for tabular and time-series use cases with strong governance and audit. Particularly defensible for teams where AutoML reliability is a regulated-industry checkbox rather than a competitive advantage.
Greenfield buyers (modern MLOps alternatives ship faster), buyers wanting research-team cutting-edge surface (Vertex AI or SageMaker stronger), generative-AI-first teams (DataRobot pivot is reactive), or buyers nervous about executive stability and category decline.
Strengths
- Deepest commercial AutoML surface in the category
- AutoML reference product for many regulated buyers
- Strong model-governance and compliance posture
- Real Fortune 500 customer base in financial services and insurance
- Mature model-monitoring surface
- Time-series forecasting AutoML remains competitive
- Defensible for buyers already invested in DataRobot workflows
Weaknesses
- Multiple CEO changes 2022 to 2024 signal post-peak instability
- AutoML category lost share since 2020 to 2022 peak
- Pricing opaque and historically aggressive at enterprise scale
- Generative-AI pivot feels reactive rather than category-defining
- Renewal-pricing pressure has hurt customer goodwill
- Revenue growth quiet through 2023 to 2025
- Vendor demos consistently outrun production reliability
Pricing tiers
opaque- AI Platform (Essentials)Entry tier; AutoML, basic monitoringQuote
- AI Platform (Business)Mid tier; governance, advanced monitoring, deploymentQuote
- AI Platform (Enterprise)Top tier; SAML SSO, audit log, dedicated support, self-hosted optionQuote
- · Pricing opaque; enterprise contracts historically aggressive
- · Renewal-pricing pressure has hurt customer goodwill through 2023 to 2025
- · Compute and storage may be priced separately depending on deployment
- · Self-hosted deployment requires infrastructure plus ops effort
- · Generative-AI features may be priced as separate add-ons
Key features
- +AutoML for tabular classification, regression, time-series
- +Time-series forecasting AutoML
- +Model deployment (real-time and batch)
- +Model monitoring (drift, accuracy, bias)
- +Model governance and audit
- +Feature engineering automation
- +Generative-AI experimentation surface
- +SAML SSO and audit log at Enterprise
- +Self-hosted deployment option
- +REST API and Python SDK
Neptune.ai
Warsaw-based experiment tracker with strong metadata customization and EU residency.
Neptune.ai is a Warsaw-based experiment tracker and model-registry tool, founded 2018 by ex-Codility and ex-deepsense.ai engineers. The product focuses on flexible metadata logging (Neptune lets ML teams log anything as a structured artifact: nested dicts, large arrays, charts, audio, video, custom objects) and has carved out a defensible niche with research teams and ML platform teams that want fine-grained control over what is tracked. Strengths: most flexible metadata model in the category (anything can be logged as structured data), EU-headquartered (Warsaw) with GDPR-native data residency story, transparent SaaS pricing with usable free tier, strong support reputation, and the founders have stayed close to the product (lower executive churn than larger competitors). Trade-offs: smaller installed base than W and B or Comet, narrower integration footprint, the flexibility comes at a learning-curve cost (some teams find it slower to get started than W and B), the model-registry surface is thinner than commercial alternatives, and feature velocity is slower than larger venture-funded competitors.
Research teams and ML platform teams that want fine-grained metadata customization and EU-headquartered tooling. Particularly strong for European buyers wanting GDPR-native data residency, teams logging unusual metadata types, and buyers wanting a quiet independent vendor over a louder venture-funded one.
Teams wanting the largest community and integration footprint (W and B is the default), teams committed to one hyperscaler (Vertex AI, SageMaker, or Azure ML usually better), buyers wanting deep model-registry governance (Vertex or SageMaker stronger), or buyers wanting LLMOps surface.
Strengths
- Most flexible metadata model in the category
- EU-headquartered (Warsaw); GDPR-native data residency
- Transparent SaaS pricing with usable free tier
- Strong support reputation in the ML engineering community
- Founders stayed close to the product (low executive churn)
- Defensible niche with research and platform-engineering teams
- Self-hosted deployment available without top-tier-only gating
Weaknesses
- Smaller installed base than W and B or Comet
- Narrower integration footprint than larger competitors
- Flexibility comes at a learning-curve cost
- Model-registry surface thinner than commercial alternatives
- Feature velocity slower than larger venture-funded competitors
- Limited brand recognition outside EU ML community
- No native LLMOps surface as of early 2026
Pricing tiers
public- Free (Individual)Personal projects; unlimited experiments; 200GB storage$0+$0 /mo +/emp
- TeamPer user per month; team collaboration, model registry, private projects$45+$45 /mo +/emp
- EnterpriseCustom contract; SAML SSO, audit log, dedicated support, self-hosted optionQuote
- · Per-user billing scales at large ML teams
- · Enterprise SAML SSO and audit log gated to top tier
- · Storage retention beyond default may incur additional cost
- · Self-hosted deployment requires infrastructure plus ops effort
- · Annual contracts typical 10 to 15 percent discount versus monthly
Key features
- +Flexible metadata logging (nested dicts, arrays, custom objects)
- +Experiment tracking with metrics, params, and artifacts
- +Model registry with stage transitions
- +Hyperparameter tracking and comparison
- +Integrations with PyTorch, TensorFlow, scikit-learn, Hugging Face
- +SAML SSO and audit log at Enterprise
- +Self-hosted deployment available
- +REST API and Python SDK
- +EU data residency (Warsaw HQ)
- +Strong support reputation
ClearML
Open-source end-to-end MLOps with self-hosted enterprise edition.
ClearML is an open-source end-to-end MLOps platform, founded 2019 (originally as Allegro AI; rebranded ClearML in 2020) and headquartered with engineering presence in Tel Aviv and a US sales footprint. The product covers experiment tracking, orchestration (ClearML Agent for managed compute), data management (ClearML Data), model serving (ClearML Serving), and a hyperparameter optimization surface. The open-source core is genuinely permissive (Apache 2.0) and ClearML self-hosted is one of the few credible end-to-end open-source MLOps stacks. Strengths: end-to-end open-source coverage that competes with commercial alternatives without license fees, defensible self-hosted story for regulated industries, real orchestration surface (ClearML Agent) that schedules training on Kubernetes or bare metal, transparent SaaS pricing with usable free tier, and useful for teams wanting a single open-source MLOps stack rather than composing MLflow plus several other tools. Trade-offs: smaller installed base than W and B, MLflow, or hyperscaler ML platforms, the documentation has visible quality variance, vendor-side engineering team is smaller than Weights and Biases, integration with the broader MLOps ecosystem is narrower, and some buyer reports of orchestration edge-case behavior at scale.
ML engineering and platform teams wanting a single end-to-end open-source MLOps stack, particularly for regulated industries needing self-hosted deployment with orchestration, data management, and serving in one product. Useful for teams wanting to avoid composing MLflow plus several other tools.
Teams wanting the largest community footprint (W and B and MLflow stronger), teams committed to one hyperscaler (Vertex AI, SageMaker, or Azure ML better), buyers wanting LLMOps surface, or buyers without ops capacity to self-host the open-source stack.
Strengths
- End-to-end open-source coverage (Apache 2.0 core)
- Defensible self-hosted story for regulated industries
- Real orchestration surface (ClearML Agent) for Kubernetes and bare metal
- Transparent SaaS pricing with usable free tier
- Single open-source MLOps stack vs composing MLflow plus other tools
- Data management (ClearML Data) and model serving (ClearML Serving) included
- Active core engineering team in Tel Aviv
Weaknesses
- Smaller installed base than W and B, MLflow, or hyperscaler platforms
- Documentation quality variance is visible
- Vendor engineering team smaller than W and B
- Integration with broader MLOps ecosystem is narrower
- Some buyer reports of orchestration edge cases at scale
- Brand recognition lags larger commercial competitors
- No native LLMOps surface comparable to Opik or W and B Models
Pricing tiers
public- Open Source (Apache 2.0)Self-hosted; full feature set; no license fees$0+$0 /mo +/emp
- Hosted FreeFree SaaS tier for small teams; limited storage and compute$0+$0 /mo +/emp
- ProPer user per month; SaaS team collaboration, model registry$15+$15 /mo +/emp
- EnterpriseCustom contract; SAML SSO, audit log, dedicated support, self-hosted enterprise editionQuote
- · Self-hosting requires infrastructure plus ops effort
- · Enterprise SAML SSO and audit log gated to top tier
- · Compute and storage egress not included in SaaS tiers
- · Enterprise Edition self-hosted requires annual contract
Key features
- +Experiment tracking with metrics, params, and artifacts
- +ClearML Agent for orchestration on Kubernetes and bare metal
- +ClearML Data for dataset versioning
- +ClearML Serving for model deployment
- +Hyperparameter optimization
- +Model registry with versioning
- +Open-source core (Apache 2.0)
- +SAML SSO and audit log at Enterprise
- +Self-hosted Enterprise Edition
- +REST API and Python SDK
Frequently asked questions
The questions buyers actually ask before they sign.
How does FCA Consumer Duty affect MLOps platform choice for UK financial services?
Is the UK lighter-touch AI regulation actually different from the EU AI Act in practice?
Weights and Biases vs Comet for a UK ML platform team in 2026?
Do I need a dedicated MLOps platform, or is MLflow plus an object store enough?
How real is the vendor lock-in risk for hyperscaler ML platforms?
What changed when CoreWeave acquired Weights and Biases in May 2024?
What happened to DataRobot?
What is the difference between MLOps and LLMOps in 2026?
How does Databricks Mosaic AI compare to neutral MLOps tooling?
How much should I budget for MLOps software in 2026?
Should I migrate off DataRobot or W and B given the recent vendor turbulence?
Does AutoML still matter in 2026?
How do I decide between hyperscaler ML and neutral tooling?
Final word
Looking at a different market? See the global MLOps Platforms ranking, or pick another country at the top of this page.
Last updated 2026-05-23. Local pricing reverified quarterly. Found something inaccurate? Tell us.