Purpose

The Affinda platform automates the splitting and classification of documents. However, user review may be required to validate the model outputs. This tutorial will walk you through how to validate the splitting and classification of your documents in the Affinda Platform.
If you’re a new user of the Affinda platform, we recommend viewing the Getting Started with the Affinda Platform tutorial first to get familiar with our app.
This tutorial is aimed at any users who are reviewing the outputs from the Affinda platform. This may be for:
  1. Model improvement: Confirming documents when the data is correct will improve model performance over time by adding the documents to ‘model memory’
  2. Ongoing document processing: ‘Human in the loop’ review of documents processed as part of general processes

Definitions

Firstly, let’s get across what splitting and classification mean in your automated document workflow.

Splitting

Splitting in Affinda refers to the platform’s ability to identify individual documents in a multi-document file. Our models will split these into individual files. This allows them to be classified and sent to the correct extraction model.

Classification

Classification is Affinda’s ability to identify what type of document it has received. This is beneficial in two ways:
  • It gives you visibility on the type of documents you’ve received
  • It allows these documents to be sent to the correct workflow and extraction model

How to validate document splitting and classification

1

Open the document for review in the Affinda app

Start by opening the Document Validation Interface for a document you want to review.
2

Validate Splitting

How to see if a document has been splitHow to see if a document has been splitYou can see if your document has been split by the Affinda model in the document validation interface. Splitting is indicated by
  1. The purple scissors icon will appear next to the document’s name on the right-hand panel
  2. The “Edit Pages” button in the top right corner will be blue.
If your document has not had splitting applied (and does not need to be split), skip this step.To view the splitting, open the splitting interface by clicking the “Edit Pages” button (2).Splitting InterfaceSplitting InterfaceHere you can see the full file that was received. You can preview each page to get a closer look.To create a new split: click in between the pages that you want to split.To remove a split: click the “Remove Split” button and merge the documents.You can also delete unwanted pages using the bin icon or rotate misoriented pages in this view.Once you are happy with the changes you have made, click “Apply Changes”.Changing the splitting will trigger the relevant documents to be reparsed to refresh the extraction.
3

Validate Classification

In Edit Pages Interface:Changing the Classification in Edit PagesChanging the Classification in Edit PagesIf you are already checking the splitting of documents, you can also check the classification in the Splitting Interface. You can see how the model has classified each document by the label next to the document’s filename. Change the classification by using the drop-down menu. Make sure you click “Apply Changes” to save changes.In Document Validation Interface:Changing the Classification in Document Validation ViewChanging the Classification in Document Validation ViewIf you are checking individual documents in the Document Validation Interface, you can see the classification the model has predicted in the top right corner. You can change the classification by using the drop-down menu.Changing the classification of a document will reparse it as it gets sent to the correct model for extraction.
For Admins: If the appropriate classification isn’t available in your Workspace, select ‘Remove Classification’. This will allow Admins to either create a new Document Type or add an existing one to the Workspace and apply it.