Duplicate-removal settings are configured per workspace, in the Workflow Settings under ‘Pre-processing’. When the setting is enabled and Affinda identifies a duplicate, the document is automatically marked as Rejected. A user can still override this manually and place the document back into a Workspace if required.

Affinda uses Document Binary Match to detect and reject duplicates. Enabling Remove Duplicates does not reject documents that merely share field values (e.g. the same Invoice Number). To reject duplicates based on matching fields, users need to implement this logic on their side and can use our APIs to delete unwanted documents.
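
As a rough illustration of the client-side logic for field-based duplicates, the sketch below flags documents whose Invoice Number has already been seen and passes them to a delete callback. Everything named here is an assumption made for the example: the `id` and `invoice_number` dictionary keys, the function names, and the `delete_document` callback, which stands in for whatever API call you use to delete a document. It is not part of Affinda's API.

```python
from typing import Callable, Dict, Iterable, List, Set


def find_field_duplicates(documents: Iterable[Dict], field: str = "invoice_number") -> List[str]:
    """Return the ids of documents whose `field` value was already seen.

    The first document with a given value is kept; later ones are flagged.
    Documents with a missing or blank value are never flagged.
    """
    seen: Set[str] = set()
    duplicates: List[str] = []
    for doc in documents:
        value = doc.get(field)
        # Normalise so that "INV-001" and "inv-001 " compare as equal.
        key = "" if value is None else str(value).strip().lower()
        if not key:
            continue
        if key in seen:
            duplicates.append(doc["id"])
        else:
            seen.add(key)
    return duplicates


def remove_field_duplicates(
    documents: Iterable[Dict],
    delete_document: Callable[[str], None],
    field: str = "invoice_number",
) -> List[str]:
    """Call `delete_document` (hypothetical) for each flagged document id."""
    duplicates = find_field_duplicates(documents, field)
    for doc_id in duplicates:
        delete_document(doc_id)
    return duplicates


# Example with in-memory records; the shape of these records is assumed.
records = [
    {"id": "a", "invoice_number": "INV-001"},
    {"id": "b", "invoice_number": "inv-001 "},
    {"id": "c", "invoice_number": "INV-002"},
    {"id": "d"},
]
print(find_field_duplicates(records))  # ['b']
```

Keeping the first occurrence and flagging later ones is only one possible policy; you may prefer to keep the most recent upload, or to send flagged documents for review rather than deleting them.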

Different versions of the same document are not classified as duplicates.

Affinda inspects the document metadata to identify whether an exact duplicate of the document has previously been uploaded to the Workspace. Documents re-submitted as a new version are not classified as duplicates.
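
To give a feel for what an exact match means in practice, the sketch below compares files by a digest of their raw bytes. This is an illustration only, not a description of Affinda's implementation: the choice of SHA-256, the `BinaryDuplicateIndex` class, and its method names are all assumptions made for the example.

```python
import hashlib
from typing import Dict, Optional


def content_digest(data: bytes) -> str:
    """Return the SHA-256 hex digest of a file's raw bytes."""
    return hashlib.sha256(data).hexdigest()


class BinaryDuplicateIndex:
    """Remembers digests of files already seen (hypothetical helper)."""

    def __init__(self) -> None:
        self._seen: Dict[str, str] = {}

    def check_and_register(self, name: str, data: bytes) -> Optional[str]:
        """Return the name of an earlier byte-identical file, or None.

        A file that has not been seen before is registered under `name`.
        """
        digest = content_digest(data)
        original = self._seen.get(digest)
        if original is None:
            self._seen[digest] = name
        return original


index = BinaryDuplicateIndex()
print(index.check_and_register("invoice.pdf", b"%PDF-1.7 example bytes"))       # None
print(index.check_and_register("invoice-copy.pdf", b"%PDF-1.7 example bytes"))  # invoice.pdf
```

With a byte-level comparison like this, a re-scanned or re-exported copy of the same invoice produces different bytes and is not treated as a match, which is why duplicates that only share field values need separate handling.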