The page contains all information about the NextGen Resume Parser:

Getting started with Affinda Recruit

Affinda's Resume Parser is the flagship product within our Recruitment Product Suite. It accurately returns all relevant data found in candidate resumes in seconds

Key features of the Affinda Resume Parser

The current version of our Resume Parser, “NextGen”, was released in April 2024 and is our fourth iteration. This version benefits from:

Significantly improved accuracy
Expanded data coverage, supported by field relationships
Configurable taxonomies and data mapping to enhance downstream data quality

The previous version, our Legacy Resume Parser, remains available to existing customers. For customers who are migrating from legacy parser to NextGen Parser, please see HERE for more information.

Data extracted

Standard Fields

The full list of standard fields extracted by the Affinda Resume Parser is available here.

Custom Fields

Affinda's NextGen Parser offers customisable resume extraction tailored to specific industries or templates. We can add custom fields to meet unique requirements, such as "driver licence number" when processing drivers' resumes.

To explore whether custom fields suit your needs, please get in touch.

Taxonomy mapping

Affinda's NextGen Resume Parser uses pre-built taxonomies to standardize key fields like skills, making data more accurate and consistent across your systems. It works with resumes in any language by mapping skills to a single, shared framework. This eliminates the hassle of cleaning and normalizing data, making it easier to analyze, report, and integrate into other processes. Here's an overview:

Configurable Taxonomies

Skills

By default, the Resume Parser utilises the Lightcast taxonomy. To modify this setting, please get in contact with your Affinda account manager or submit a request via our contact form.

This Lightcast skills taxonomy works across all 50+ languages providing standardised data using a best-in-class taxonomy. By default, mapped skills are returned in English. We can return mapped values in the original resume's language. To enable this, please get in contact with your Affinda account manager or submit a request via our contact form.

Job titles

By default, the NextGen Resume Parser does not map job titles to any taxonomy and returns only the extracted raw string. To change this, please get in contact with your Affinda account manager or submit a request via our contact form.

Occupation Classification

All three Occupation Classifications are available in the API response automatically.

Custom taxonomies

Additional taxonomies that customers use internally that are not available as pre-configured options can also be added, ensuring that customers get the data in the format they need. Custom taxonomies can be defined for fields that already have pre-configured options (e.g. skills, job titles), but also any other text field (e.g. work organisations).

For example, this means customers may add their own internal skills taxonomy that is in place of Lightcast or ESCO, or they may wish to map against a defined list of universities that are relevant to the customer.

For more information on specifying a custom taxonomy, please get in touch with the Affinda Sales Team Member or submit a request via our contact form.

Using the Resume Parser

Parse documents via the Affinda application

Document upload

After creating your free trial, you should automatically see the Recruitment workspace, which includes all the recruitment tools for testing. If the workspace doesn’t appear, please reach out to your Affinda account manager or submit a request via our contact form.

Customers can upload documents to Affinda and review extraction results in three ways:

Drag and Drop: Simply drag and drop documents directly into the app.
Email Upload: Navigate to Workflow > Email Upload > Configure to find the unique email address for sending documents.
API Upload: Refer to the API integration details for setup and usage.

Understanding the Affinda document interface

Affinda's document interface provides a simple tool to visualise all of the outputs from the model. This means that customers can quickly assess the accuracy of the solution and all of the data that has been extracted. Customers can observe the raw values that have been extracted from the document, as well as the final 'parsed' values that have been formatted or mapped into standardised values that can be more easily used in downstream processing. To view more details, users can click on the document, select the export option next to "Confirm Document," and export it as a JSON file.

Managing Fields in the Resume Parser

If you require additional fields beyond those currently available in the Affinda application, please reach out to your Affinda account manager or submit a request through our contact form. Please note that users cannot manually add custom fields to the standard resume parser.

If you’ve accidentally removed a field or section in the Recruit products, please contact your account manager or submit a request via our contact form. We’ll help reset your parser and restore any missing elements.

Relevant API End-points

Document Upload via API: Documents can be uploaded to Affinda by sending a request to the POST documents endpoint. Each new document upload will consume one credit. Upon successful upload, the API response will include a unique document identifier (e.g., "document": "EfHgjcsD").
- The following parameters are required to successfully send a POST request to Affinda:
  - Workspace ID: Log in to your Affinda application and go to Workspace > Settings> Workflow>Configure Integrations to find your Workspace ID
  - Document Type ID: Open an already uploaded document in the app, then navigate to Configure Fields > Settings > Document Type ID. Copy the Document Type ID and paste it into the "documentType" field in your API request.
  📘
  Can’t find your Document Type ID? You might be using Affinda's Legacy Portal. Check out the Affinda's Legacy Portal Guidefor instructions on using Affinda APIs.
  - Bearer Token: Log in to your Affinda application and go to Workspace > Settings> Workflow>Configure Integrations to manage API keys
- Retrieve document after initial parsing: To retrieve the results of a previously processed document, clients can use the Get Document endpoint. Retrieving results from this endpoint does not consume any credits.
  
  There are a number of decisions for customers when uploading a document via API

Area	Description
Document	Either a file or URL needs to be included in the POST request. The following formats are supported: PDF, ZIP, DOC, DOCX,XLS, XLSX, ODT, RTF, TXT, HTML, PNG, JPG, TIFF, JPEG* - Volume Limits: There is no limit to the number of documents you can submit to the Affinda API. However, you will be limited to 30 documents per minute that will be processed by our high priority queue. If you would like to specify which queue to use, you can set the `lowPriority` parameter during document submission. Note that if you explicitly set `lowPriority` to `false`, and if you have exceeded the high priority queue rate limit, you will receive a 429 (Too Many Requests) response. - Page Limits: The default page limit for all customers is 10 pages. Settings may be adjusted to increase this limit on a case-by-case basis. Please get in contact with your Affinda account manager for details or submit a request via our contact form .
Workspace and Document ID	Specify Document Type ID instead of Workspace ID:Customers using our Resume or Job Description Parser typically know the type of document being submitted. Therefore, we recommend specifying both the relevant Document Type ID and Workspace ID when sending documents via the API. This avoids relying on automatic classification to route the document to the correct model, ensuring more accurate and efficient processing. Find Workspace ID: Log in to your Affinda application and go to Workspace > Settings> Workflow>Configure Integrations to find your Workspace ID: Find Document Type ID: Open an already uploaded document in the app, then navigate to Configure Fields > Settings > Document Type ID. Copy the Document Type ID and paste it into the "documentType" field in your API request.
Synchronous or Asynchronous Responses	Within the request, customers can set `wait`to true / false depending if the parsing response needs to be returned synchronously or not. For customers who are bulk uploading, it is recommended to set `wait`to false. - Synchronous - If "true" (default), will return a response only after processing has been completed. - Asynchronous - If "false", will return the document identifier and other metadata alongside an empty data object. The data object can be returned at a later date by either: - Polling GET endpoint until processing is complete; or - Setting up webhooks to get a notification that parsing has completed and then using the GET endpoint to pull the extracted data
Parsing Time	For customers who need real-time responses where seconds count, please set parameters to the following when submitting a document: - enableValidationTool: False - deleteAfterParse: True - compact:** True By setting these parameters, Affinda can bypass the need to save any data to our database, which eliminates unnecessary processing time and reduces the overall time taken to return results. However, note that this means that: - The document can not be viewed in the Affinda app(e.g. for 'human in the loop' validation) - The document is not retained in our system so responses can not be fetched at a later date - Field metadata is not returned, only the 'parsed' value

Retrieve document after initial parsing: To retrieve the results of a previously processed document, clients can use the Get Document endpoint. Retrieving results from this endpoint does not consume any credits.

Area	Description
identifier	Relates to the document unique identifier received via the response from the POST documents endpoint
Format	File format to be returned - options includes JSON, XML and hr-xml
compact	If "true", the response is compacted to only the annotations' parsed data. Annotations' meta data are excluded. If not specified, default is "false"

Patching and Updating Custom Fields: Some customers may require custom fields to be consistently set to a specific value for system compatibility. For detailed instructions, refer to the Patching and Updating Custom Fields guide

Information Security

Overview

Affinda’s parsers convert unstructured resumes and job descriptions into structured data, enabling faster, more informed decisions for HR and recruiters. Unlike generative AI services, Affinda is an extractive AI solution. Our proprietary technology extracts data from documents, reducing risks commonly associated with generative AI, such as copyright infringement and bias risks. We do not use large language models like GPT and ensure data privacy by not using client documents to train our models.

Furthermore, Affinda is ISO27001 certified, reflecting our commitment to rigorous, best-in-class information security practices and robust risk management protocols. All our products are designed with privacy, security, and compliance as core principles.

On Premise Deployment

Most customers benefit from Affinda's technology through our hosted solution. However, some may require a locally deployed solution for specific needs. To support these users, we’ve published an Affinda Self-Hosted Deployment Guide

If you’re interested in a local deployment, please contact your Affinda account manager for further details or submit a request via our contact form.

Document Lifecycle

The Affinda API provides customers full control over the lifecycle of their submitted documents.

Lifecycle options can be set per document and include:

Deletion
When a document is deleted, the document and all associated files are immediately removed from our servers. All access to the document will be lost. Document metadata, which may include file names but does not include the file content, may remain in Affinda’s database or backups of Affinda’s database for some time.

Expiration
A typical scenario when incorporating the API into a web app is to enable a customer’s end users to perform document parses on demand. In such cases, it is not necessary or desirable to store the result indefinitely. To facilitate this, the API allows a customer to specify an expiry time when they submit the document. When a document has an expiration set, it will be deleted automatically at the expiration date.

Offboarding

User Offboarding: When a user is removed from an organisation, they are automatically offboarded and will no longer have access to any data within the organisation
Delete Organisation: An organization can be deleted either via API or through the Affinda app by navigating to: Organization > Settings > "Delete [Organization Name]".

Getting started with Affinda Recruit

Key features of the Affinda Resume Parser

Data extracted

Standard Fields

Custom Fields

Taxonomy mapping

Configurable Taxonomies

Skills

Job titles

Occupation Classification

Custom taxonomies

Using the Resume Parser

Parse documents via the Affinda application

Document upload

Understanding the Affinda document interface

Managing Fields in the Resume Parser

Relevant API End-points

📘Can’t find your Document Type ID? You might be using Affinda's Legacy Portal. Check out the Affinda's Legacy Portal Guidefor instructions on using Affinda APIs.

Information Security

Overview

On Premise Deployment

Document Lifecycle

📘
Can’t find your Document Type ID? You might be using Affinda's Legacy Portal. Check out the Affinda's Legacy Portal Guidefor instructions on using Affinda APIs.