Open Source Electronic Invoicing

Why Electronic Invoices?

Every year billions of invoices are exchanged. The majority is emailed in an unstructured PDF format, which means only humans can read and process it.

Invoice-x is a collection of open source applications and libraries aimed at automating the invoicing- and accounting workflow for businesses. Currently we offer tools to extract structured data from legacy PDF invoices, as well as embedding and editing structured XML-representations in hybrid invoices.

Use unstructured data in legacy invoices.

invoice2data is a Python library and command line tool to extract structured data from PDF invoices using regex templates and (optionally) OCR.