Add pdfmux to Data Extraction #21

Open
NameetP wants to merge 1 commit into py-pdf:main from NameetP:add-pdfmux
NameetP commented Mar 28, 2026

Adds pdfmux to the Data Extraction section.

pdfmux is a smart PDF-to-Markdown router that picks the best extractor per page, audits output quality, and re-extracts failures. Zero config, local-first, with an MCP server included.
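
For reviewers unfamiliar with the idea, the "pick an extractor per page, audit, re-extract" loop described above could look roughly like the sketch below. This is not pdfmux's code or API: `route_page`, `audit`, `PageResult`, `fast_text`, and `slow_ocr` are hypothetical names, the pages are plain strings standing in for real PDF pages, and the audit rule (a minimum length and no Unicode replacement characters) is an assumed toy heuristic.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Callable, Sequence

# Hypothetical type for illustration: takes raw page content, returns Markdown.
Extractor = Callable[[str], str]


@dataclass
class PageResult:
    page_number: int
    markdown: str
    extractor_name: str
    passed_audit: bool


def audit(markdown: str, min_chars: int = 20) -> bool:
    """Toy quality check: enough text and no Unicode replacement characters."""
    return len(markdown.strip()) >= min_chars and "\ufffd" not in markdown


def route_page(
    page_number: int,
    page: str,
    extractors: Sequence[tuple[str, Extractor]],
) -> PageResult:
    """Try extractors in order and keep the first output that passes the audit."""
    last = PageResult(page_number, "", "none", False)
    for name, extract in extractors:
        markdown = extract(page)
        last = PageResult(page_number, markdown, name, audit(markdown))
        if last.passed_audit:
            return last
    # No extractor passed: return the last attempt, flagged as failed.
    return last


def fast_text(page: str) -> str:
    # Stand-in for a cheap text-layer extractor that yields nothing on scans.
    return "" if page.startswith("SCANNED:") else page


def slow_ocr(page: str) -> str:
    # Stand-in for a slower OCR fallback that can read scanned pages.
    return page[len("SCANNED:"):] if page.startswith("SCANNED:") else page


EXTRACTORS = [("fast_text", fast_text), ("slow_ocr", slow_ocr)]

pages = [
    "A digital page with a usable text layer.",
    "SCANNED:An image-only page that needs the fallback.",
]
results = [route_page(n, p, EXTRACTORS) for n, p in enumerate(pages, start=1)]
for r in results:
    print(r.page_number, r.extractor_name, r.passed_audit)
```

Ordering the extractors from cheapest to most expensive means the slower fallback only runs on pages where the cheap path fails the audit. That ordering is one plausible design choice, not a description of how pdfmux itself selects extractors.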
