git for models — know what models you have deployed, where they run, what they depend on, and what changed.
📖 Documentation · Quickstart · Concepts · Governance
model-ledger is a model inventory for any organization with deployed models. It discovers models, heuristic rules, and ETL across your platforms, maps the dependency graph automatically, and records every change as an immutable event. Unlike registries tied to a single platform (MLflow, SageMaker, W&B), it spans all of them — as one connected graph — and it's built to be driven by AI agents through a native MCP server.
pip install model-ledgerEvery model is a DataNode with typed input and output ports. When an output port name
matches an input port name, connect() creates the dependency edge — no hand-wiring.
from model_ledger import Ledger, DataNode
ledger = Ledger()
ledger.add([
DataNode("segmentation", platform="etl", outputs=["customer_segments"]),
DataNode("fraud_scorer", platform="ml", inputs=["customer_segments"], outputs=["risk_scores"]),
DataNode("fraud_alerts", platform="alerting", inputs=["risk_scores"]),
])
ledger.connect()
ledger.trace("fraud_alerts")
# ['segmentation', 'fraud_scorer', 'fraud_alerts']Every mutation is recorded as an immutable Snapshot — an append-only event log that gives you full history and point-in-time reconstruction, because nothing is overwritten.
The MCP server is a first-class surface — point Claude (or any MCP agent) at it:
pip install "model-ledger[mcp]"
claude mcp add model-ledger -- model-ledger mcp --demoYou: if we deprecate
customer_features, what breaks?Claude: 3 models consume it directly, 2 more transitively.
Everything lives at block.github.io/model-ledger — and it can't drift, because the API reference is generated from source and every example runs in CI:
- Quickstart — install to your first dependency trace in 60 seconds
- Concepts — DataNode, Snapshot, and Composite, in three ideas
- Agents (MCP) — the eight-tool agent surface, with a worked transcript
- Connectors — discover from SQL, REST, GitHub, or your own platform
- Backends — in-memory, SQLite, JSON, Snowflake, or remote HTTP
- Governance — how the primitives map to SR 26‑2, the EU AI Act, and NIST
- API reference — generated from the source
The OSS core handles discovery, graph building, change tracking, storage, and the agent protocol. Your internal package provides the thin layer on top — connector configs, custom connectors for internal platforms, authentication, and compliance profiles. Thin config and credentials, not reimplemented logic.
See CONTRIBUTING.md. All commits require DCO sign-off.
Apache-2.0. See LICENSE.