Detailed guide for Schema Design
Schema Design
Click here for Affinda Academy tutorial
Basic Configuration
Field Name
The field name represents the label for the extracted data in the validation UI. It is user-defined and helps the model in extracting the field from documents; as such, it is important to have clear and relevant field names.Field Description
Users may optionally enhance the model’s predictions by providing additional context, such as how the data is typically labelled and where it appears on the page. The field description and Text Options serve different purposes. The field description guides the model on what to extract from the document — for example, “Extract the billing address, not the supplier’s address.” Text Options control how the extracted value is formatted in the output — for example, trimming whitespace, normalising dates, or stripping trailing characters. If the model is extracting the correct text but the format is wrong, use Text Options. If the model is extracting from the wrong place on the document, refine the field description. The field description controls what the model extracts from the document. If you instead need to control how the extracted value is formatted in the output (for example, trimming whitespace or normalising dates), use Text Transformations.Data Types
Your field’s data type determines how extracted values are processed and standardized. Different data types are available to ensure that structured and unstructured data is correctly categorized. The selected data type influences the structure of the data and the post-processing logic applied to extracted values, ensuring consistency and accuracy.Standard Data Types
Text, Numbers, Dates, Location, Phone Number, URLs
Structures
Groups and Tables
Picklists
Options and Data Source
Checkboxes
Labels and True/False
Image Fields
Signatures, Seals, and Headshots
Basic Field Settings
Enabling Fields
This setting determines whether the field should be predicted and visible in the extracted output. Users can toggle this option depending on whether they want the model to extract and display the field. Disabling a field instead of deleting it lets the model keep all previously validated annotations and automatically restores them when the field is re‑enabled.
Multiple Values
Enable multiple predictions only when a field can have multiple distinct values within a single document. Examples include:- Line item tables on an Invoice
- Parties or Signatories in a Legal Contract
- Transactions in a Bank Statement
