Document Splitting

Document Splitting is a powerful feature that allows you to separate a single file containing multiple documents into individual documents for further processing. This is especially useful when working with batch-scanned files.

Affinda’s intelligent AI models analyze your document to detect logical splitting points. These splitting points might be based on:

  • New document type detected.
  • Blank pages or separators.
  • Unique patterns in text, headers, or footers.
  • Clear non-continuation.
  • Predefined delimiters like barcodes.

Once split, each segment inherits the Workspace’s classification and extraction rules, simplifying downstream workflows.

Splitting Settings

Configure splitting settings at the Workspace level. Click on the toggle to (de)select automatic splitting of files and select the Splitter to use.

What happens to documents that are split?

If there are any edits made to the file, the AI model will re-parse the data to give the most accurate predictions. Any field validations made will be lost.

When a document is split into multiple components, new files are created in your account. These new files are created with a suffix added to the file name (e.g. [filename]_1, [filename]_2, etc..).

Within the API response of the original file, users will also be able to find the identifier of the new files created, so that they can then get the data from these newly created files. The PDF file of the documents is also included in the response so that new documents created can be added to your platform.

Editing the split

While the document splitter automatically splits a document, users will still have the option to manually split or combine documents again through the 'Edit Pages' pages within the document validation UI. This gives users full control over their documents, even after the initial split (see below for more information).

See Splitting & Page Editor for more information.