These docs are for v2.0. Click to read the latest docs for v3.0.

Document Splitting

Affinda's AI models are trained to assume a document only has a single instance of an invoice or other document type. However, often, a document will contain multiple invoices that need to be separated for the model to be effective in picking up all the relevant fields. This is particularly common for paper invoices that have been scanned together into a single file.  

Within the validation interface, we provide the option for users to 'Edit Pages'. This will bring up a new interface that will allow users to split the document into multiple parts, as well as delete irrelevant pages and rotate pages to be the right way up. 

What happens when I split an invoice?

If there are any edits made to the file, the AI model will re-parse the data to give the most accurate predictions. Any field validations made will be lost.

When a document is split into multiple components, new files are created in your account. These new files are created with a suffix added to the file name (e.g. [filename]_1, [filename]_2, etc..).

Within the API response of the original file, users will also be able to find the identifier of the new files created, so that they can then get the data from these newly created files. The PDF file of the documents is also included in the response so that new documents created can be added to your platform.