Receipt Text Extractor (Ollama + Vision Model)

This Python script extracts structured text data from receipt images using Ollama and a multimodal LLM (default: llama3.2-vision).
It processes the image, sends it to the model, and returns strictly valid JSON with merchant, items, date, and totals.

Features

Image preprocessing: resizes large images to a max side of 1280px for efficiency.
Structured JSON output: always follows the same schema (merchant, items, totals, VAT, etc.).
Strict format: model is instructed to return only JSON (no explanations or extra text).
CLI usage: specify receipt image path and optional model.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.gitignore		.gitignore
README.md		README.md
ocr.py		ocr.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Receipt Text Extractor (Ollama + Vision Model)

Features

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Receipt Text Extractor (Ollama + Vision Model)

Features

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages