Run LLM-generated Python safely from Go — no containers, no CGO, no subprocess.
A pure-Go wrapper around Pydantic's Monty Python interpreter, compiled to WebAssembly and loaded via wazero. Your Go agent writes Python code, monty-go executes it in a sandboxed WASM instance with sub-millisecond startup, and pauses whenever the code calls an external function so your Go code can handle it.
go get github.com/fugue-labs/monty-goLLMs work faster, cheaper, and more reliably when they write code instead of making sequential tool calls. Instead of:
Agent → tool_call("search", {query: "weather london"}) → result
Agent → tool_call("search", {query: "weather tokyo"}) → result
Agent → tool_call("compare", {a: result1, b: result2}) → result
The LLM writes:
london = search(query="weather london")
tokyo = search(query="weather tokyo")
compare(a=london, b=tokyo)One model call instead of three. The Python code calls your Go functions, Monty pauses at each call, your Go code executes it, and Monty resumes. No containers. No sandbox services. No exec(). Just a 2.9MB WASM binary embedded in your Go binary.
For motivation, see:
- Programmatic Tool Calling from Anthropic
- Code Execution with MCP from Anthropic
- Code Mode from Cloudflare
- Smol Agents from Hugging Face
package main
import (
"context"
"fmt"
"log"
montygo "github.com/fugue-labs/monty-go"
)
func main() {
runner, err := montygo.New()
if err != nil {
log.Fatal(err)
}
defer runner.Close()
result, err := runner.Execute(context.Background(),
"x * 2 + y",
map[string]any{"x": 10, "y": 5},
)
if err != nil {
log.Fatal(err)
}
fmt.Println(result) // 25
}The real power is external function calls. Monty pauses execution whenever Python code calls a function you've declared, your Go callback handles it, and Monty resumes with the return value:
result, err := runner.Execute(ctx,
`
london = get_weather("London")
tokyo = get_weather("Tokyo")
f"{london['city']}: {london['temp']}°C, {tokyo['city']}: {tokyo['temp']}°C"
`,
nil,
montygo.WithExternalFunc(func(ctx context.Context, call *montygo.FunctionCall) (any, error) {
city, _ := call.Args["city"].(string)
// Your real implementation here — HTTP call, database query, anything.
return map[string]any{"city": city, "temp": 22}, nil
}, montygo.Func("get_weather", "city")),
)
// result: "London: 22°C, Tokyo: 22°C"Multiple functions work the same way — register them all and dispatch by name:
result, err := runner.Execute(ctx, code, nil,
montygo.WithExternalFunc(func(ctx context.Context, call *montygo.FunctionCall) (any, error) {
switch call.Name {
case "search":
return doSearch(call.Args)
case "calculate":
return doCalculate(call.Args)
case "store":
return doStore(call.Args)
default:
return nil, fmt.Errorf("unknown function: %s", call.Name)
}
},
montygo.Func("search", "query"),
montygo.Func("calculate", "expression"),
montygo.Func("store", "key", "value"),
),
)Prevent runaway code with memory, time, allocation, and recursion limits:
result, err := runner.Execute(ctx, code, inputs,
montygo.WithLimits(montygo.Limits{
MaxDuration: 5 * time.Second,
MaxMemoryBytes: 10 * 1024 * 1024, // 10 MB
MaxAllocations: 100000,
MaxRecursionDepth: 100,
}),
)Infinite loops, memory bombs, and deep recursion all terminate cleanly with a *MontyError. Go's context.Context deadlines are also respected — cancel the context and the WASM instance stops.
Capture Python print() output:
var output strings.Builder
_, err := runner.Execute(ctx, `print("step 1 done")`, nil,
montygo.WithPrintFunc(func(s string) { output.WriteString(s) }),
)
fmt.Print(output.String()) // "step 1 done\n"Python filesystem and environment access routes through your Go callback:
result, err := runner.Execute(ctx,
`
from pathlib import Path
data = Path("/config/settings.json").read_text()
data
`,
nil,
montygo.WithOsCallFunc(func(ctx context.Context, call *montygo.OsCall) (any, error) {
switch call.Function {
case "Path.read_text":
path, _ := call.Args[0].(string)
return readFromYourStorage(path)
case "Path.exists":
path, _ := call.Args[0].(string)
return existsInYourStorage(path), nil
default:
return nil, fmt.Errorf("blocked: %s", call.Function)
}
}),
)No filesystem access happens unless your callback allows it.
monty-go is designed to power code-mode in Gollem, the production agent framework for Go. Instead of sequential tool calls, the LLM writes Python that calls your tools as functions — Monty executes it safely, and Gollem orchestrates the whole thing.
Here's what this looks like with Gollem:
import (
"github.com/fugue-labs/gollem"
"github.com/fugue-labs/gollem/provider/anthropic"
montygo "github.com/fugue-labs/monty-go"
)
// Your existing Gollem tools — search, calculate, store, whatever.
searchTool := gollem.FuncTool[SearchParams]("search", "Search the knowledge base", doSearch)
calcTool := gollem.FuncTool[CalcParams]("calculate", "Run calculations", doCalc)
// Create a code-mode tool that wraps your toolset with Monty.
// The LLM writes Python code, Monty executes it, external function calls
// route to your Go tools.
codeMode := NewCodeModeTool(runner, searchTool, calcTool)
agent := gollem.NewAgent[Analysis](anthropic.New(),
gollem.WithTools[Analysis](codeMode),
gollem.WithSystemPrompt[Analysis](`You have a code execution tool.
Write Python code to call the available functions. Available functions:
- search(query: str) -> dict: Search the knowledge base
- calculate(expression: str) -> float: Evaluate math expressions
Write code that calls these functions and returns the result.`),
)
result, _ := agent.Run(ctx, "Compare Q3 and Q4 revenue and calculate the growth rate")With one model call, the LLM writes:
q3 = search(query="Q3 revenue")
q4 = search(query="Q4 revenue")
growth = calculate(expression=f"({q4['revenue']} - {q3['revenue']}) / {q3['revenue']} * 100")
{"q3": q3, "q4": q4, "growth_rate": growth}Monty pauses three times (two searches, one calculation), your Go functions handle each one, and the final result flows back through Gollem's typed output pipeline. Three tool calls in one LLM round-trip.
Why Gollem + monty-go:
| Traditional tool calling | Code-mode with monty-go | |
|---|---|---|
| LLM calls | One per tool use | One for all tools |
| Latency | N × model round-trip | 1 × model round-trip + μs execution |
| Cost | N × input/output tokens | 1 × input/output tokens |
| Logic | LLM reasons step by step | LLM writes the logic once |
| Control flow | None (sequential only) | Loops, conditionals, variables |
| Error handling | LLM must react to each failure | try/except in Python |
| Security | ✅ (tools are Go functions) | ✅ (WASM sandbox + your callbacks) |
Gollem gives you compile-time type safety, structured output, guardrails, cost tracking, middleware, and multi-provider support. monty-go gives you secure embedded Python execution. Together, your agents do more work per model call.
github.com/fugue-labs/gollem — The production agent framework for Go.
┌─────────────────────────────────────────────────────────┐
│ Your Go Application │
│ │
│ runner, _ := montygo.New() │
│ result, _ := runner.Execute(ctx, code, inputs, opts) │
│ │ │
│ ▼ │
│ ┌──────────────────────────────────┐ │
│ │ wazero (pure Go WASM runtime) │ │
│ │ │ │
│ │ ┌───────────────────────────┐ │ │
│ │ │ monty.wasm (2.9 MB) │ │ ◄── go:embed │
│ │ │ Monty Python Interpreter │ │ │
│ │ │ compiled to wasm32-wasi │ │ │
│ │ └──────────┬────────────────┘ │ │
│ │ │ │ │
│ │ pause on external call │ │
│ │ │ │ │
│ └─────────────┼───────────────────┘ │
│ │ │
│ ▼ │
│ ExternalFunc callback ──► your Go code ──► resume │
│ OsCallFunc callback ──► your Go code ──► resume │
│ PrintFunc callback ──► your Go code │
└─────────────────────────────────────────────────────────┘
- No CGO. wazero is a pure-Go WebAssembly runtime.
- No subprocess. The WASM binary is embedded via
go:embedand compiled once at startup. - Fresh instance per call. Each
Execute()gets an isolated WASM instance. No state leaks between calls. - JSON at the boundary. All data crossing the Go↔WASM boundary is JSON. Go types map naturally:
int→float64,string→string,bool→bool,nil→None,[]any→list,map[string]any→dict.
// Create a reusable runner. Compiles the WASM module once.
runner, err := montygo.New()
defer runner.Close()
// Execute Python code with inputs and options.
result, err := runner.Execute(ctx, code, inputs, opts...)
// Options:
montygo.WithExternalFunc(fn, // register callable functions
montygo.Func("search", "query", "limit"), // with named parameters
montygo.Func("calculate", "expression"),
)
montygo.WithOsCallFunc(fn) // handle filesystem/env access
montygo.WithLimits(montygo.Limits{...}) // resource limits
montygo.WithPrintFunc(fn) // capture print output
// FunctionCall provides named args (positional mapped by param name):
call.Args["query"].(string) // access by parameter name
call.ArgsJSON() // pre-serialized JSON string| Python | Go (result) | Go (input) |
|---|---|---|
int |
float64 |
int, float64 |
float |
float64 |
float64 |
str |
string |
string |
bool |
bool |
bool |
None |
nil |
nil |
list, tuple |
[]any |
[]any |
dict |
map[string]any |
map[string]any |
set |
[]any |
— |
Python exceptions become *montygo.MontyError:
result, err := runner.Execute(ctx, "1 / 0", nil)
var me *montygo.MontyError
if errors.As(err, &me) {
fmt.Println(me.Message) // "Traceback... ZeroDivisionError: division by zero"
}Tracks upstream Monty v0.0.11.
- Arithmetic, string operations, f-strings, slicing
- Functions, lambdas, closures, generators
for/whileloops,if/elif/else,break/continuetry/except/finally/else,raise, exception hierarchy- List/dict/set comprehensions, dict/set view operators
range,len,sum,min,max,sorted,reversed,enumerate,zip,map,filter,all,any,getattrisinstance,type,int(),float(),str(),bool(),abs()print()withsepandendkwargs- PEP 448 generalized unpacking (
*args,**kwargsin calls, literals, etc.) - Nested and augmented subscript assignment (
a[i][j] = v,a[i] += 1) - Tuple comparison (
<,>,<=,>=) - Multi-module imports (
import a, b, c) - Stdlib modules:
math(all functions),re,datetime,json, andsys/typing/asynciosubsets import os,from pathlib import Path(routed through OsCallFunc)- Dataclass instances flow through external function calls (args, returns, and method calls surface with
method_call=true) - Resource limits: time, memory, allocations, recursion depth
- Class definitions (only dataclass instances via external I/O; upstream Monty flags class
defas "coming soon") matchstatements (coming soon upstream)- Context managers (
with ...) - Rest of stdlib and all third-party libraries
float('inf')/float('nan')(JSON serialization limitation in this bridge)
97 end-to-end tests covering every testable scenario from Monty's core test suite:
make testCovers: basic expressions, print variants, all exception types, data type round-tripping, external functions (args, kwargs, mixed, complex types, chaining, loops), input handling and scoping, resource limits (timeout, recursion, memory, allocations), OS calls, builtins, control flow, lambdas/closures, and execution isolation.
Requires Rust with wasm32-wasip1 target and Go 1.23+:
rustup target add wasm32-wasip1
make build # compiles Rust → WASM, copies to monty.wasm
make test # builds and runs Go testsmonty-go exists because of Monty, created by Samuel Colvin and the Pydantic team. Monty is a genuinely novel piece of engineering — a minimal, secure Python interpreter written from scratch in Rust, purpose-built for AI agents. The insight that LLMs should write code instead of making sequential tool calls, and that you need a safe interpreter (not a container) to execute it, is what makes code-mode possible.
Samuel and the Pydantic team have a track record of building foundational tools that the whole ecosystem builds on — Pydantic, Pydantic AI, Logfire, and now Monty. This project is a Go bridge to their work, and we're grateful they built it.
MIT