Independent Development Resources: Document Processing Series

| Name | Category | Remarks |
| --- | --- | --- |
| markitdown | markdown | Convert data from other formats to markdown documents |
| pdf-extract-api | pdf | Convert PDF to markdown via ollama |
| mihomo | VPN | Rumored to be a better open-source VPN tool than clash |
| pdf2htmlEX | pdf | PDF to HTML tool, only suitable for Linux installation, command-line operation |
| docling | Document Conversion | Convert PDF to markdown, JSON |
| textin | Document Conversion | Commercial service, paid, intelligent PDF parsing |
| caj2pdf-qt | Document | Batch convert CAJ |
| marker | Document | PDF to markdown with multi-language support |
| pdf2zh | pdf | Translate PDF documents into other languages, open-source |
| PDFMathTranslate | pdf | Format-preserving translation of scientific PDF papers: AI-based full bilingual translation of PDF documents |