NextGen Resume Parser

What is the NextGen Resume Parser?

Affinda has developed the fourth version of our Resume Parser, “NextGen”, which incorporates a refreshed machine learning component. This new version provides:

  1. Significantly improved accuracy
  2. Expanded data coverage, supported by field relationships
  3. Configurable taxonomies and data mapping to enhance downstream data quality

Setting up the NextGen Resume Parser

To set up an account, please follow the instructions under Getting Started with Affinda. Under "Create a Workspace and Collection" - please select "Resumes NextGen".

Getting Started with the NextGen Resume Parser

Understanding the Affinda document interface

Affinda's document interface provides a simple tool to visualise all of the outputs from the model. This means that customers can quickly assess the accuracy of the solution and all of the data that has been extracted. Customers can observe the raw values that have been extracted from the document, as well as the final 'parsed' values that have been formatted or mapped into standardised values that can be more easily used in downstream processing.

Updating visible fields

By default, when you first create a new Collection with the NextGen Resume Parser, all fields will be visible. However, often customers will only care about a subset of fields that Affinda extracts from resumes. To enhance the testing process, customers can 'disable' certain fields in our document interface. These fields are then no longer visible in the user interface, so testing is restricted just to the fields that matter.

Note, disabled fields will still be present in the API response.

Additional features

📘

Adjust parameters to reduce parsing time

To optimise response times, include these parameters when submitting a document:

  • enableValidationTool: False
  • deleteAfterParse: True
  • compact: True

Taxonomy mappings

The NextGen Resume Parser has a range of pre-configured taxonomies that can be enabled to ensure data is standardised into a more usable format. Please see here for more information about these taxonomies and how to change them.

Additional data available in API response

While the Affinda document interface includes a wide range of data and provides a clear representation of model accuracy, additional data is available within the API response (either in JSON or XML format). This includes a wide range of metadata and both the raw and parsed values. Additional data about specific fields not visible in the UI is also present, including Standard Occupation Classifications, structured location information, and additional details on websites, phone numbers, and other contact information. See here for an example response.

We recommend customers familiarise themselves with all of the data present in the API response as part of testing.

🚧

NextGen Resume Parser not supported by API V2

Customers will need to upgrade to API V3 to take advantage of the new capabilities