Discussion

Hello Product Hunt! I'm Vinayak, creator of Camelot.
There are many open-source (Tabula, pdf-table-extract) and closed-source (smallpdf, pdftables) tools to extract tables from PDFs. But they either give a nice output or fail miserably. There is no in between. This is not helpful since everything in the real world, including PDF table extraction, is fuzzy. This leads to the creation of ad-hoc table extraction scripts for each type of PDF table. We, at SocialCops, created Camelot to offer users complete control over table extraction. It is a Python library to extract tabular data from PDFs!
You can install it using conda or pip! Check out the installation instructions in the README: https://www.github.com/socialcop...
Great documentation is available here: https://camelot-py.readthedocs.i...
We would be really grateful if you could give us any feedback that can help us improve it! You can follow the development on GitHub.