MLflow: A Machine Learning Lifecycle Platform


MLflow is an open-source platform, purpose-built to assist machine learning practitioners and teams in handling the complexities of the machine learning process. MLflow focuses on the full lifecycle of machine learning projects, ensuring that each phase is manageable, traceable, and reproducible.


The core components of MLflow are:

  • Experiment Tracking 📝: A set of APIs to log models, params, and results in ML experiments and compare them using an interactive UI.
  • Model Packaging 📦: A standard format for packaging a model and its metadata, such as dependency versions, ensuring reliable deployment and strong reproducibility.
  • Model Registry 💾: A centralized model store, set of APIs, and UI to collaboratively manage the full lifecycle of MLflow Models.
  • Serving 🚀: Tools for seamless model deployment to batch and real-time scoring on platforms like Docker, Kubernetes, Azure ML, and AWS SageMaker.
  • Evaluation 📊: A suite of automated model evaluation tools, seamlessly integrated with experiment tracking to record model performance and visually compare results across multiple models.
  • Observability 🔍: Tracing integrations with various GenAI libraries and a Python SDK for manual instrumentation, offering a smoother debugging experience and supporting online monitoring.


Installation

To install the MLflow Python package, run the following command:

pip install mlflow

Alternatively, you can install MLflow from other package hosting platforms:

  • PyPI: mlflow, mlflow-skinny
  • conda-forge: mlflow, mlflow-skinny
  • CRAN: mlflow
  • Maven Central: mlflow-client, mlflow-parent, mlflow-scoring, mlflow-spark
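
If you only need a lightweight tracking client (for example, to log runs from a constrained environment), the lighter mlflow-skinny distribution, which omits the UI, server, and other optional dependencies, can be installed instead:

pip install mlflow-skinny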

Documentation 📘

Official documentation for MLflow can be found at https://mlflow.org/docs/latest/.

Running Anywhere 🌐

You can run MLflow in many different environments, including local development, Amazon SageMaker, AzureML, and Databricks. Please refer to this guidance for how to set up MLflow in your environment.
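
For example, to record runs on a remote tracking server instead of the local filesystem, point the client at the server's URI before logging. A minimal sketch (the server address and experiment name below are placeholders):

import mlflow

# Placeholder address: replace with your own MLflow tracking server
# or a managed endpoint such as Databricks.
mlflow.set_tracking_uri("http://my-tracking-server:5000")
mlflow.set_experiment("my-experiment")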

Usage

Experiment Tracking (Doc)

The following example trains a simple regression model with scikit-learn while enabling MLflow's autologging feature for experiment tracking.

import mlflow

from sklearn.model_selection import train_test_split
from sklearn.datasets import load_diabetes
from sklearn.ensemble import RandomForestRegressor

# Enable MLflow's automatic experiment tracking for scikit-learn
mlflow.sklearn.autolog()

# Load the training dataset
db = load_diabetes()
X_train, X_test, y_train, y_test = train_test_split(db.data, db.target)

rf = RandomForestRegressor(n_estimators=100, max_depth=6, max_features=3)
# MLflow triggers logging automatically upon model fitting
rf.fit(X_train, y_train)

Once the above code finishes, run the following command in a separate terminal and access the MLflow UI via the printed URL. An MLflow Run is created automatically, tracking the training dataset, hyperparameters, performance metrics, the trained model, dependencies, and more.

mlflow ui
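
You can also load the logged model back into Python for predictions. A minimal sketch (replace the <run-id> placeholder with the run ID shown in the UI; X_test comes from the training example above):

import mlflow

# Load the autologged scikit-learn model as a generic pyfunc model
model = mlflow.pyfunc.load_model("runs:/<run-id>/model")
print(model.predict(X_test))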

Serving Models (Doc)

You can deploy the logged model to a local inference server with a one-line command using the MLflow CLI. Visit the documentation for how to deploy models to other hosting platforms.

mlflow models serve --model-uri runs:/<run-id>/model
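
Once the server is running (on port 5000 by default), you can query its /invocations endpoint. A minimal sketch using the requests library, assuming the scikit-learn diabetes model trained in the tracking example above (the feature values are illustrative):

import requests

# Scoring payload for the MLflow inference server; one row of 10 diabetes features
payload = {"inputs": [[0.04, 0.05, 0.06, 0.02, -0.04, -0.03, -0.04, 0.0, 0.02, -0.02]]}
response = requests.post("http://127.0.0.1:5000/invocations", json=payload)
print(response.json())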

Evaluating Models (Doc)

The following example runs automatic evaluation for question-answering tasks with several built-in metrics.

import mlflow
import pandas as pd

# The evaluation set contains (1) input questions, (2) model outputs, (3) ground truth
df = pd.DataFrame(
    {
        "inputs": ["What is MLflow?", "What is Spark?"],
        "outputs": [
            "MLflow is an innovative fully self-driving airship powered by AI.",
            "Sparks is an American pop and rock duo formed in Los Angeles.",
        ],
        "ground_truth": [
            "MLflow is an open-source platform for managing the end-to-end machine learning (ML) "
            "lifecycle.",
            "Apache Spark is an open-source, distributed computing system designed for big data "
            "processing and analytics.",
        ],
    }
)
eval_dataset = mlflow.data.from_pandas(
    df, predictions="outputs", targets="ground_truth"
)

# Start an MLflow Run to record the evaluation results
with mlflow.start_run(run_name="evaluate_qa"):
    # Run automatic evaluation with a set of built-in metrics for question-answering models
    results = mlflow.evaluate(
        data=eval_dataset,
        model_type="question-answering",
    )

print(results.tables["eval_results_table"])
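
Aggregate scores are also available on the returned result object. A short sketch (the exact metric names depend on the built-in metrics that were computed):

# Aggregated metrics computed by mlflow.evaluate(), keyed by metric name
print(results.metrics)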

Observability (Doc)

MLflow Tracing provides LLM observability for various GenAI libraries such as OpenAI, LangChain, LlamaIndex, DSPy, AutoGen, and more. To enable auto-tracing, call mlflow.xyz.autolog() before running your models. Refer to the documentation for customization and manual instrumentation.

import mlflow
from openai import OpenAI

# Enable tracing for OpenAI
mlflow.openai.autolog()

# Query OpenAI LLM normally
response = OpenAI().chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "Hi!"}],
    temperature=0.1,
)

Then navigate to the "Traces" tab in the MLflow UI to find the trace record for the OpenAI query.
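
For code that is not covered by an automatic integration, you can create spans manually with the Python SDK. A minimal sketch using the mlflow.trace decorator (the function below is a hypothetical example):

import mlflow

# Hypothetical helper function; the decorator records a span for each call
@mlflow.trace
def summarize(text: str) -> str:
    return text[:100]

summarize("MLflow Tracing can capture custom Python functions as spans.")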

Community

  • For help or questions about MLflow usage (e.g. "how do I do X?") visit the docs or Stack Overflow.
  • Alternatively, you can ask your question to our AI-powered chat bot. Visit the doc website and click the "Ask AI" button at the bottom right to start chatting with the bot.
  • To report a bug, file a documentation issue, or submit a feature request, please open a GitHub issue.
  • For release announcements and other discussions, please subscribe to our mailing list (mlflow-users@googlegroups.com) or join us on Slack.

Contributing

We happily welcome contributions to MLflow! We are also seeking contributions to items on the MLflow Roadmap. Please see our contribution guide to learn more about contributing to MLflow.

Core Members

MLflow is currently maintained by the following core members with significant contributions from hundreds of exceptionally talented community members.