Hanzo Ingress

Cloud-native L7 reverse proxy and load balancer for Hanzo AI infrastructure. Kubernetes-native with automatic TLS, dynamic configuration, and zero-downtime reloads.

Overview

Hanzo Ingress is the front door for all Hanzo production traffic. It watches Kubernetes Ingress resources, automatically provisions TLS certificates via Let's Encrypt, and routes traffic to internal services -- including Hanzo Gateway for API endpoints and direct service routing for web applications.

Deployed on the hanzo-k8s cluster as the default IngressClass (hanzo), it handles all *.hanzo.ai traffic with 2 replicas in host-network mode for direct port 80/443 binding.

Features

Kubernetes-native -- watches Ingress resources, auto-configures routes
Automatic TLS -- Let's Encrypt certificate provisioning and renewal (wildcard support)
Dynamic configuration -- zero-restart config updates as Ingress resources change
HTTP/2, gRPC, WebSocket -- full protocol support for all backend types
Circuit breakers -- automatic failure isolation with configurable thresholds
Retry logic -- built-in retry with exponential backoff
Access logging -- JSON and Common Log Format output
Metrics export -- Prometheus, Datadog, StatsD, InfluxDB, OTLP
Web dashboard -- built-in UI for route visualization and health monitoring
REST API -- programmatic access to configuration and status
Single static binary -- no runtime dependencies, minimal attack surface
Host-network mode -- direct port binding for minimal latency

Container Images

Tag	Description
`ghcr.io/hanzoai/ingress:latest`	Stable release, production-ready
`ghcr.io/hanzoai/ingress:experimental-master`	Latest master build, unstable
`ghcr.io/hanzoai/ingress:vX.Y.Z`	Pinned release version

Quick Start

Docker

docker run -d \
  --name hanzo-ingress \
  -p 80:80 \
  -p 443:443 \
  -v /var/run/docker.sock:/var/run/docker.sock \
  ghcr.io/hanzoai/ingress:latest \
  --entrypoints.web.address=:80 \
  --entrypoints.websecure.address=:443 \
  --providers.docker=true \
  --ping=true

Kubernetes

# Apply all manifests (RBAC, IngressClass, Deployment, Service)
kubectl apply -f https://raw.githubusercontent.com/hanzoai/ingress/master/k8s/hanzo/rbac.yaml
kubectl apply -f https://raw.githubusercontent.com/hanzoai/ingress/master/k8s/hanzo/ingressclass.yaml
kubectl apply -f https://raw.githubusercontent.com/hanzoai/ingress/master/k8s/hanzo/deployment.yaml
kubectl apply -f https://raw.githubusercontent.com/hanzoai/ingress/master/k8s/hanzo/service.yaml

# Verify
kubectl -n hanzo get pods -l app=hanzo-ingress

Binary

Download a pre-built binary from GitHub Releases:

# Download latest release (Linux amd64)
curl -sL https://github.com/hanzoai/ingress/releases/latest/download/hanzo-ingress_linux_amd64.tar.gz | tar xz

# Run
./hanzo-ingress \
  --entrypoints.web.address=:80 \
  --entrypoints.websecure.address=:443 \
  --providers.kubernetesingress=true

Docker Compose

services:
  ingress:
    image: ghcr.io/hanzoai/ingress:latest
    ports:
      - "80:80"
      - "443:443"
    volumes:
      - /var/run/docker.sock:/var/run/docker.sock:ro
      - ./acme.json:/acme.json
    command:
      - "--entrypoints.web.address=:80"
      - "--entrypoints.websecure.address=:443"
      - "--providers.docker=true"
      - "--certificatesresolvers.letsencrypt.acme.httpchallenge.entrypoint=web"
      - "--certificatesresolvers.letsencrypt.acme.email=ops@hanzo.ai"
      - "--certificatesresolvers.letsencrypt.acme.storage=/acme.json"

Build from Source

make build
./hanzo-ingress --configFile=config.toml

Architecture

              Internet
                 |
        +--------+--------+
        | Cloudflare CDN  |
        | DNS, WAF, DDoS  |
        +--------+--------+
                 |
        +--------+--------+
        | Hanzo Ingress   |   L7 reverse proxy
        | (ports 80/443)  |   TLS termination
        | IngressClass:   |   Route matching
        |   "hanzo"       |   Load balancing
        +--+-+-+-+-+--+---+
           | | | | |  |
     +-----+ | | | |  +--------+
     |   +---+ | | +-----+     |
     |   |  +--+ +--+    |     |
     v   v  v       v    v     v
  +-----+-----+  +----+ +---+ +-------+  +-----+
  | Hanzo     |  | IAM| |KMS| | Cloud |  | PaaS|
  | Gateway   |  +----+ +---+ +-------+  +-----+
  | (API)     |
  +--+--+--+--+
     |  |  |
     v  v  v
  +------+------+------+
  |Engine|Search|  LLM |    Backend services
  +------+------+------+

Request Flow

DNS resolves *.hanzo.ai to Cloudflare
Cloudflare proxies to hanzo-k8s cluster LB (24.199.76.156)
Hanzo Ingress terminates TLS, matches host/path rules
Request forwarded to the matching backend service
For API traffic (api.hanzo.ai), Ingress routes to Hanzo Gateway for endpoint-level routing

Middleware Reference

Hanzo Ingress ships with a full suite of built-in middlewares that can be composed via annotations or configuration.

Middleware	Description
auth	Forward authentication, basic auth, digest auth
ratelimiter	Token-bucket rate limiting per client or route
circuitbreaker	Automatic failure isolation with configurable thresholds
retry	Automatic retry with exponential backoff
compress	Gzip/Brotli response compression
headers	Add, remove, or override request/response headers
ipallowlist	Restrict access by client IP CIDR ranges
buffering	Request/response buffering with size limits
inflightreq	Limit concurrent in-flight requests per source
redirect	HTTP-to-HTTPS and regex-based URL redirects
stripprefix	Remove path prefix before forwarding to backend
stripprefixregex	Remove path prefix by regex pattern
addprefix	Prepend a path prefix to the forwarded request
replacepath	Replace the entire request path
replacepathregex	Replace request path by regex pattern
chain	Compose multiple middlewares into a named pipeline
passtlsclientcert	Forward client TLS certificate info as headers
grpcweb	Translate gRPC-Web requests to native gRPC
contenttype	Auto-detect and set `Content-Type` headers
customerrors	Serve custom error pages by status code
recovery	Recover from panics and return 500 instead of crashing
forwardedheaders	Trust and propagate `X-Forwarded-*` headers
observability	Distributed tracing spans (OpenTelemetry, Jaeger, Zipkin)
metrics	Prometheus, Datadog, StatsD, InfluxDB, OTLP metrics
accesslog	Structured access logging (JSON, CLF)
capture	Capture request/response sizes for metrics
snicheck	Validate TLS SNI against allowed hostnames
tcp	TCP-level middlewares (IP allowlist, in-flight limit)

Middlewares are applied via Kubernetes Ingress annotations:

metadata:
  annotations:
    hanzo.ai/ingress-ratelimit-average: "100"
    hanzo.ai/ingress-ratelimit-burst: "200"

Or via TOML/YAML configuration for non-Kubernetes providers.

Kubernetes Deployment

IngressClass

Hanzo Ingress registers as the default IngressClass on the cluster:

apiVersion: networking.k8s.io/v1
kind: IngressClass
metadata:
  name: hanzo
  annotations:
    ingressclass.kubernetes.io/is-default-class: "true"
spec:
  controller: hanzo.ai/ingress-controller

Any Ingress resource without an explicit ingressClassName is automatically picked up.

K8s Manifests

k8s/hanzo/
  rbac.yaml             # ServiceAccount, ClusterRole, ClusterRoleBinding
  ingressclass.yaml     # IngressClass "hanzo" (default)
  deployment.yaml       # 2 replicas, hostNetwork, ports 80/443
  service.yaml          # LoadBalancer service

Creating Ingress Resources

Once deployed, create standard Kubernetes Ingress resources to route traffic:

apiVersion: networking.k8s.io/v1
kind: Ingress
metadata:
  name: my-service
  namespace: hanzo
  annotations:
    cert-manager.io/cluster-issuer: letsencrypt
spec:
  ingressClassName: hanzo
  tls:
  - hosts:
    - my-service.hanzo.ai
    secretName: my-service-tls
  rules:
  - host: my-service.hanzo.ai
    http:
      paths:
      - path: /
        pathType: Prefix
        backend:
          service:
            name: my-service
            port:
              number: 8080

Production Deployment

Hanzo Ingress runs on the hanzo-k8s DOKS cluster (24.199.76.156) as the sole ingress controller for all Hanzo services.

Property	Value
Image	`ghcr.io/hanzoai/ingress:latest`
Replicas	2
Namespace	`hanzo`
Network	hostNetwork (direct port binding)
Ports	80 (HTTP), 443 (HTTPS)
Service type	LoadBalancer
Health check	`GET /ping` on port 80
Liveness probe	HTTP `/ping`, 5s initial, 10s interval
Readiness probe	HTTP `/ping`, 3s initial, 5s interval
Resources	100m-1000m CPU, 128Mi-512Mi memory
Security	`NET_BIND_SERVICE` capability, all others dropped

Deploy / Update

kubectl --context do-sfo3-hanzo-k8s apply -f k8s/hanzo/

# Verify pods
kubectl --context do-sfo3-hanzo-k8s -n hanzo get pods -l app=hanzo-ingress

# Check service
kubectl --context do-sfo3-hanzo-k8s -n hanzo get svc hanzo-ingress

# View logs
kubectl --context do-sfo3-hanzo-k8s -n hanzo logs -l app=hanzo-ingress --tail=100 -f

Routed Domains (hanzo-k8s)

All domains below resolve through Cloudflare to this ingress instance:

Domain	Backend Service
`hanzo.ai`	hanzo-app
`api.hanzo.ai`, `llm.hanzo.ai`	Hanzo Gateway
`hanzo.id`, `lux.id`, `zoo.id`	IAM (Casdoor)
`kms.hanzo.ai`	KMS (Infisical)
`platform.hanzo.ai`	Platform (Dokploy)
`console.hanzo.ai`	Console
`cloud.hanzo.ai`	Cloud

Service Discovery

Hanzo Ingress supports multiple provider backends:

Provider	Description
Kubernetes Ingress	Watches `networking.k8s.io/v1` Ingress resources (primary)
Docker	Discovers containers via Docker socket labels
File	Static TOML/YAML configuration files
Consul	Service catalog integration
Etcd	Key-value store configuration
ECS	AWS ECS task discovery

Production runs exclusively with the Kubernetes Ingress provider.

Configuration

CLI Flags (Production)

./hanzo-ingress \
  --providers.kubernetesingress=true \
  --providers.kubernetesingress.ingressendpoint.publishedservice=hanzo/hanzo-ingress \
  --providers.kubernetesingress.allowemptyservices=true \
  --entrypoints.web.address=:80 \
  --entrypoints.websecure.address=:443 \
  --entrypoints.websecure.http.tls=true \
  --ping=true \
  --ping.entryPoint=web \
  --api.dashboard=false \
  --log.level=INFO \
  --accesslog=true

Configuration File

[entryPoints]
  [entryPoints.web]
    address = ":80"
  [entryPoints.websecure]
    address = ":443"
    [entryPoints.websecure.http.tls]

[providers]
  [providers.kubernetesIngress]
    [providers.kubernetesIngress.ingressEndpoint]
      publishedService = "hanzo/hanzo-ingress"

[ping]
  entryPoint = "web"

[log]
  level = "INFO"

[accessLog]

See the sample configuration files in the repository root for full examples.

Repository Structure

cmd/                    # Binary entry point
internal/               # Core routing, middleware, provider logic
pkg/                    # Public packages and configuration types
webui/                  # Built-in dashboard (React)
k8s/
  hanzo/                # Production K8s manifests
    rbac.yaml           # ServiceAccount + ClusterRole
    ingressclass.yaml   # IngressClass "hanzo" (default)
    deployment.yaml     # 2-replica Deployment
    service.yaml        # LoadBalancer Service
integration/            # Integration test suite
contrib/                # Community contributed configs
docs/                   # Extended documentation
Dockerfile              # Multi-stage build (Node webui + Go binary)
Makefile                # Build, test, Docker targets

PaaS Integration

Hanzo Ingress serves as the ingress layer for Hanzo Platform (PaaS). Applications deployed through the platform automatically get:

Ingress resource creation with proper host rules
TLS certificate provisioning
Load balancing across application replicas
Access logging and metrics

Documentation

Full documentation is available at docs.hanzo.ai/docs/services/ingress.

Related Projects

Hanzo Ingress is part of the Hanzo AI infrastructure stack:

Project	Role	Repository
Hanzo Ingress	L7 reverse proxy, TLS termination, load balancing	`hanzoai/ingress`
Hanzo Gateway	API gateway, rate limiting, endpoint routing	`hanzoai/gateway`
Hanzo Engine	GPU inference engine, model serving	`hanzoai/engine`
Hanzo Edge	On-device inference runtime (mobile, web, embedded)	`hanzoai/edge`

Internet -> Ingress (TLS/L7) -> Gateway (API routing) -> Engine (inference) / Cloud API / Services
                                                          Edge (on-device, client-side)

License

MIT -- see LICENSE.md.

Name		Name	Last commit message	Last commit date
Latest commit History 6,047 Commits
.github		.github
cmd		cmd
contrib		contrib
docs		docs
integration		integration
internal		internal
k8s		k8s
pkg		pkg
script		script
webui		webui
.dockerignore		.dockerignore
.gitattributes		.gitattributes
.gitignore		.gitignore
.go-version		.go-version
.golangci.yml		.golangci.yml
.goreleaser.yml.tmpl		.goreleaser.yml.tmpl
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
Dockerfile.bin		Dockerfile.bin
LICENSE.md		LICENSE.md
LLM.md		LLM.md
Makefile		Makefile
README.md		README.md
SECURITY.md		SECURITY.md
generate.go		generate.go
go.mod		go.mod
go.sum		go.sum
ingress.sample.toml		ingress.sample.toml
ingress.sample.yml		ingress.sample.yml

Folders and files

Latest commit

History

Repository files navigation

Hanzo Ingress

Overview

Features

Container Images

Quick Start

Docker

Kubernetes

Binary

Docker Compose

Build from Source

Architecture

Request Flow

Middleware Reference

Kubernetes Deployment

IngressClass

K8s Manifests

Creating Ingress Resources

Production Deployment

Deploy / Update

Routed Domains (hanzo-k8s)

Service Discovery

Configuration

CLI Flags (Production)

Configuration File

Repository Structure

PaaS Integration

Documentation

Related Projects

License

About

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages