AI workflow evaluation and experimentation framework.
Teams building AI-powered features need to iterate on prompts and models, measure the impact of each change, compare alternatives across different platforms, and track what worked. Today this is done through spreadsheets, ad-hoc scripts, and platform UIs with no version history.

Engram provides a structured experimentation loop: define what your workflow does, run it against labeled data, score the results, track experiments, and compare alternatives. Git is the version tracker, platforms are interchangeable, and cost is a first-class metric alongside quality.
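The loop above can be sketched in plain Python. This is an illustrative sketch only, not Engram's actual API: the names (`Example`, `Result`, `run_workflow`, `score`) are hypothetical, and the workflow is a stub in place of a real platform call.

```python
from dataclasses import dataclass

@dataclass
class Example:
    input: str      # what the workflow receives
    expected: str   # the labeled "gold" answer

@dataclass
class Result:
    output: str
    cost_usd: float  # cost tracked alongside quality

def run_workflow(example: Example) -> Result:
    # Stub standing in for a real LLM/platform call.
    return Result(output=example.input.upper(), cost_usd=0.0001)

def score(results: list[Result], dataset: list[Example]) -> dict[str, float]:
    # Deterministic metric: exact match against the labeled answer.
    correct = sum(r.output == e.expected for r, e in zip(results, dataset))
    return {
        "accuracy": correct / len(dataset),
        "total_cost_usd": sum(r.cost_usd for r in results),
    }

dataset = [Example("hello", "HELLO"), Example("world", "world")]
results = [run_workflow(e) for e in dataset]
metrics = score(results, dataset)
# One of the two outputs matches its label, so accuracy is 0.5.
```

Because the metric is deterministic, the same experiment re-run against the same dataset yields the same score, which is what makes run-to-run comparison meaningful.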
Requires Python 3.14+.
## Installation

```
uv add engram
```

## Quick start

```
engram init
engram eval <implementation> --dataset <dataset>
engram score <experiment-id> --save
engram baseline set <experiment-id>
engram compare <experiment-id> --prompts
engram baseline promote <experiment-id>
engram estimate <implementation> --dataset <dataset>
```

## Development

```
uv sync
uv run poe test
uv run poe coverage
uv run poe lint
uv run poe typecheck
```

## How Engram compares

Langfuse is an observability platform. It traces every LLM call in production, tracks latency and cost per user/session, and provides a dashboard for monitoring live systems. It answers: "what's happening in prod, and is it good?"
DeepEval is an evaluation library. It provides LLM-as-judge metrics (faithfulness, hallucination, toxicity, etc.) and integrates with pytest. It answers: "given these outputs, how good are they?"
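To make the contrast concrete, here is a conceptual sketch of what an LLM-as-judge metric like faithfulness does. This is not DeepEval's actual API; `judge` stands in for a call to a grading model, faked here with crude word overlap so the example runs without an API key.

```python
def judge(output: str, context: str) -> float:
    # A real judge would prompt a grading model ("is every claim in
    # OUTPUT supported by CONTEXT?") and parse its 0-1 answer.
    # Deterministic stand-in: fraction of output words found in context.
    out_words = {w.strip(".,").lower() for w in output.split()}
    ctx_words = {w.strip(".,").lower() for w in context.split()}
    return len(out_words & ctx_words) / len(out_words)

faithful = judge("Paris is the capital.", "France's capital is Paris.")
# Three of the four output words appear in the context: score 0.75.
```

The point of such metrics is that they grade outputs wherever they came from; they don't track which prompt, model, or platform produced them.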
Engram is an experimentation framework. It compares AI workflow implementations across platforms: sync configs, run evals against labeled datasets, score with deterministic metrics, track experiments in git, and diff what changed between any two runs. It answers: "which implementation is better, and what changed?"
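A minimal sketch of the comparison step, in the spirit of `engram compare`: given metric dicts from two tracked experiments, report per-metric deltas. The field names are illustrative assumptions, not Engram's actual schema.

```python
def compare(baseline: dict[str, float], candidate: dict[str, float]) -> dict[str, float]:
    # Positive delta means the candidate run improved on that metric.
    return {k: candidate[k] - baseline[k] for k in baseline}

baseline = {"accuracy": 0.82, "total_cost_usd": 1.40}
candidate = {"accuracy": 0.88, "total_cost_usd": 1.10}
delta = compare(baseline, candidate)
# Accuracy is up 0.06 and cost is down 0.30: the candidate beats the baseline.
```

With experiments committed to git, the same diff extends beyond metrics to the prompts and configs that produced them, which is what `--prompts` surfaces.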