Detailed Tutorial for Validating Splitting
Splitting Settings

General Document Splitter
All customers will have access to Affinda’s General Document Splitter. This model has been designed to identify specific cues that indicate a new document, including:- Change in page numbering sequence (e.g. Page 1)
- Change in key party within the document (e.g. an invoice from a different supplier is identified)
- Change in key document identifier
General Document Splitter is not self learning
Custom splitter
While the General Document Splitter has been designed to work across most use cases, there will be some use cases that will need additional configuration. Three different types of custom splitters can be created:- LLM based
Design a prompt that details when a file should be split - Key word
Split when a specific keyword(s) is found on a page - Trained model
Train a new model using a number of representative documents to help identify when a new document occurs within a file
Reach out to the Affinda team if the General Document Splitter is not meeting your use case, and you want to discuss a custom splitting model.