The Sensible Code Company

We make products that turn messy data into valuable information. We work with economists, statisticians and data managers to help them improve their business operations using modern data science techniques and machine learning.

PDFTables.com – Accurately convert PDF tables to Excel, with an API for automation.
It has artificial eyes that “see” columns by their shape, so it’s really good at getting data out accurately. You can convert PDF to Excel (XLSX) online and download the results in your browser. Or, if you’re a coder, you can automate the process using the PDFTables web API. It’s a fast online SaaS with no queues, and we offer a free trial.
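
For coders, here is a minimal sketch of what automating a conversion over HTTP might look like, using only the Python standard library. The endpoint URL, the `key` and `format` query parameters, the `xlsx-single` format name and the `f` upload field are assumptions made for illustration, and `build_request` and `convert` are hypothetical helper names, so check the PDFTables API documentation before relying on any of them.

```python
import urllib.parse
import urllib.request
import uuid

# Assumed endpoint, for illustration only; confirm it in the API docs.
API_URL = "https://pdftables.com/api"


def build_request(pdf_bytes, api_key, fmt="xlsx-single", filename="input.pdf"):
    """Build (but do not send) a multipart POST request carrying one PDF.

    The parameter names "key" and "format" and the form field "f" are
    assumptions, not confirmed by this page.
    """
    boundary = uuid.uuid4().hex
    query = urllib.parse.urlencode({"key": api_key, "format": fmt})
    body = b"".join([
        b"--" + boundary.encode() + b"\r\n",
        b'Content-Disposition: form-data; name="f"; filename="'
        + filename.encode() + b'"\r\n',
        b"Content-Type: application/pdf\r\n\r\n",
        pdf_bytes,
        b"\r\n--" + boundary.encode() + b"--\r\n",
    ])
    return urllib.request.Request(
        API_URL + "?" + query,
        data=body,
        headers={"Content-Type": "multipart/form-data; boundary=" + boundary},
        method="POST",
    )


def convert(pdf_path, out_path, api_key):
    """Upload a PDF and save whatever the service returns to out_path."""
    with open(pdf_path, "rb") as fh:
        request = build_request(fh.read(), api_key)
    with urllib.request.urlopen(request) as response, open(out_path, "wb") as out:
        out.write(response.read())
```

Calling `convert("report.pdf", "report.xlsx", "YOUR_API_KEY")` would then send the file and write the response to disk. Splitting out `build_request` keeps the request construction inspectable without touching the network.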

DataBaker – Make gnarly spreadsheet content machine-readable.
DataBaker is a Python library that helps you wrangle complex spreadsheets into clean, normalised data tables. It is built to integrate with Jupyter and Pandas. It lets users iteratively build up intuitive recipes that describe the structure of a spreadsheet. These recipes translate the spreadsheet into flat tables of data that can be used by Pandas and other data analysis libraries, or saved as CSVs. It’s on GitHub, so take a look.
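
To give a feel for what “flat tables” means here, the sketch below flattens a small cross-tab grid into one record per observation using plain Python. It does not use DataBaker and does not reflect its API; the `flatten` function and the sample grid are hypothetical and exist only to illustrate the kind of transformation described above.

```python
import csv
import io


def flatten(grid):
    """Turn a cross-tab grid into flat records.

    Assumes the first row holds column labels and the first cell of each
    later row holds that row's label. Empty cells (None) are skipped.
    """
    header, *rows = grid
    column_labels = header[1:]
    records = []
    for row in rows:
        row_label, *values = row
        for column_label, value in zip(column_labels, values):
            if value is None:
                continue
            records.append(
                {"row": row_label, "column": column_label, "value": value}
            )
    return records


def to_csv(records):
    """Serialise the flat records as CSV text."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=["row", "column", "value"])
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()


# Made-up sample data: fruit sales by year, with one missing cell.
grid = [
    ["", "2019", "2020"],
    ["Apples", 10, 12],
    ["Pears", 7, None],
]
records = flatten(grid)
```

Each record now carries its own row and column labels, which is the shape that Pandas and CSV-based tools generally expect.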