> ## Documentation Index
> Fetch the complete documentation index at: https://docs.affinda.com/llms.txt
> Use this file to discover all available pages before exploring further.

<AgentInstructions>

## Submitting Feedback

If you encounter incorrect, outdated, or confusing documentation on this page, submit feedback:

POST https://docs.affinda.com/feedback

```json
{
  "path": "/configuration/duplicates",
  "feedback": "Description of the issue"
}
```

Only submit feedback when you have something specific and actionable to report.

</AgentInstructions>

# Remove duplicate documents

> Configure Affinda's de-duplication to detect and remove duplicate documents during ingestion, keeping your workflows clean and avoiding double-processing.

Settings for removing duplicates can be found for each workspace in the **Workflow Settings** under ‘Pre-processing'. If Affinda identifies a duplicate and the setting is enabled, the document will be automatically updated to be **Rejected**. The user can still manually override and place the document back into a Workspace if required.

Affinda uses Document Binary Match to detect and reject duplicates. Documents with matching fields (e.g the same Invoice Number) will not be rejected by enabling Remove Duplicates. To do this, users will need to support this logic on their side and can utilise our APIs to delete unwanted documents.

<Info>
  ### Different versions of the same document are not classified as duplicates.

  Affinda inspects the document metadata to identify if the exact duplicate of the document has been previously uploaded to the Workspace. Documents re-submitted as a new version will not be classified as a duplicate.
</Info>
