Camelot
Document Loaders | Open Source | Verified
Python library that extracts tables from text-based PDF files into structured data formats. Offers two parsing methods, Lattice (for tables with ruled cell borders) and Stream (for tables whose columns are separated by whitespace), with configurable settings and built-in accuracy and whitespace metrics for quality control. Best suited for ETL pipelines and data analysis workflows that need reliable table extraction from PDF documents.
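A minimal sketch of how the parsing methods and quality metrics might be combined in a pipeline. It uses Camelot's `read_pdf` function and the per-table `parsing_report` dictionary. The helper names (`passes_quality`, `extract_tables`) and the threshold values are hypothetical and chosen only for illustration, not recommended defaults.

```python
def passes_quality(report, min_accuracy=90.0, max_whitespace=30.0):
    """Return True if a parsing report meets the (hypothetical) thresholds.

    `report` is a dict with "accuracy" and "whitespace" percentages,
    as found in a Camelot table's `parsing_report`.
    """
    return (
        report["accuracy"] >= min_accuracy
        and report["whitespace"] <= max_whitespace
    )


def extract_tables(pdf_path, flavor="lattice", pages="1"):
    """Extract tables from a text-based PDF and keep those that pass the check.

    flavor="lattice" targets tables with ruled lines;
    flavor="stream" targets whitespace-separated tables.
    """
    # Imported lazily so the quality helper can be used without Camelot installed.
    import camelot

    tables = camelot.read_pdf(pdf_path, flavor=flavor, pages=pages)
    # Each table exposes a pandas DataFrame (`df`) and a `parsing_report` dict.
    return [t.df for t in tables if passes_quality(t.parsing_report)]
```

Calling `extract_tables("report.pdf", flavor="stream")` (with a placeholder file name) would return a list of DataFrames for the tables on page 1 that pass the check. In practice the thresholds would need tuning per document set.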
API
No API
Price
Free ($0)
License: MIT