Document Classification

Document classification is the process of automatically identifying and categorizing documents based on their content or metadata. In the Affinda platform, document classification ensures that documents are correctly categorized into their relevant type. This is a foundational step in document processing workflows, as accurate classification enables targeted and optimized downstream actions relating to data extraction, validation, and post-processing default settings.

Configuring Classification

Classification behaviour can be configured in the Workspace settings.


Auto Classification

When enabled, Affinda will classify and route the documents to the relevant Document Type when uploaded to a Workspace. If the document is uploaded to a specific Document Type, Affinda will not attempt to re-classify automatically.

If disabled, documents sent to a Workspace will remain unclassified until updated by a user.

Reject Documents

Workspaces can be configured so that Affinda automatically rejects documents that are not of the right type. These documents will be sent to the 'Rejected' list in the app, where they can be reviewed and moved to the appropriate Collection by a user.

Document rejection will occur when:

  • If a document is sent to a Workspace and no relevant Document Type is found within the Workspace
  • If a document is sent to a specific Document Type and the classification the model returns doesn't match the Document Type

Types of Classification Models

  1. Default
    A self-learning classifier that learns and adapts to documents added to Model Memory. The 'Default' classifier also uses signals from the Document Type name to aid with accurate classification so it is important that the name is representative of the type of documents that will be received.
  2. Fast Classifier
    A non-learning classifier that is suitable for Resume Parsing and Recruit AI customers. This model is optimised for performance on Resumes and Job Descriptions in particular and is the default classifier used for all customers with a Recruitment use case.

For customers with bespoke requirements that the above options do not meet, a custom classifier can be created. Get in touch with the Affinda team to learn more about this option.

Updating Classification

Users can update the classification of a document in both the document validation UI and document list view. Select the dropdown where the current classification is shown to either Remove Classification or change it to a different Document Type.

When the classification is changed, Affinda will re-parse the document with the corresponding extraction model.