The Affinda platform automates the splitting and classification of documents. However, user review may be required to validate the model outputs. This tutorial will walk you through how to validate the splitting and classification of your documents in the Affinda Platform.
Firstly, let’s get across what splitting and classification mean in your automated document workflow.
Splitting
Splitting in Affinda refers to the platform’s ability to identify individual documents in a multi-document file. Our models will split these into individual files. This allows them to be classified and sent to the correct extraction model.
Classification
Classification is Affinda’s ability to identify what type of document it has received. This is beneficial in two ways:
It gives you visibility on the type of documents you’ve received
It allows these documents to be sent to the correct workflow and extraction model
How to validate document splitting and classification
1
Open the document for review in the Affinda app
Start by opening the Document Validation Interface for a document you want to review.
2
Validate Splitting
You can see if your document has been split by the Affinda model in the document validation interface. Splitting is indicated by
The purple scissors icon will appear next to the document’s name on the right-hand panel
The “Edit Pages” button in the top right corner will be blue.
If your document has not had splitting applied (and does not need to be split), skip this step.To view the splitting, open the splitting interface by clicking the “Edit Pages” button (2).Here you can see the full file that was received. You can preview each page to get a closer look.To create a new split: click in between the pages that you want to split.To remove a split: click the “Remove Split” button and merge the documents.You can also delete unwanted pages using the bin icon or rotate misoriented pages in this view.Once you are happy with the changes you have made, click “Apply Changes”.Changing the splitting will trigger the relevant documents to be reparsed to refresh the extraction.
3
Validate Classification
In Edit Pages Interface:If you are already checking the splitting of documents, you can also check the classification in the Splitting Interface. You can see how the model has classified each document by the label next to the document’s filename. Change the classification by using the drop-down menu. Make sure you click “Apply Changes” to save changes.In Document Validation Interface:If you are checking individual documents in the Document Validation Interface, you can see the classification the model has predicted in the top right corner. You can change the classification by using the drop-down menu.Changing the classification of a document will reparse it as it gets sent to the correct model for extraction.
For Admins: If the appropriate classification isn’t available in your Workspace, select ‘Remove Classification’. This will allow Admins to either create a new Document Type or add an existing one to the Workspace and apply it.