Data Extracted by NextGen Resume Parser

The NextGen Resume Parser is the fourth and latest version of Affinda's Resume Parser. Its new schema offers wider data coverage and new field relationships and structures. As a result, customers using the Legacy Resume Parser will need to update the field mappings in their platform to use the new schema and the new capabilities of the NextGen Resume Parser.

The link below provides a mapping of the Legacy Resume Parser fields to the NextGen Resume Parser fields, with the option to use our compact view.

NextGen Resume Parser - Field Mappings.xlsx

Data Extracted

Personal Information

  • Candidate Name (Title, First, Middle and Last)
  • Date of Birth
  • Birthplace
  • Nationality
  • Headshot
  • Right to work (visa)

Contact Details

  • Email Address
  • Phone Number (Including country code, e.g., AU)
  • Website (Type, URL)
  • Location
  • Preferred Work Location
  • Willing to Relocate
  • Availability
  • Referees

Overview

  • Objective
  • Summary
  • Achievements
  • Associations
  • Hobby

Work Experience

  • Job Title
    • Mapped to default taxonomies (optional)
    • Standard Occupation Classifications provided
  • Organization
  • Location
  • Dates
  • Description
  • Type (e.g. full time, volunteer, intern)

Education

  • Accreditation
  • Level - incl. Certifications & Conferences
  • Organization
  • Location
  • Major
  • Minor
  • Dates
  • Grade - incl. Score & Unit

Projects

  • Title
  • Description
  • Dates
  • Organization
  • Type (e.g. Professional or Education)

Skills

  • Language - incl. Proficiency
  • Skills - mapped to multiple default taxonomies (optional)

Publications

  • Title
  • Dates
  • Journal Name

Patents

  • Patent Name

Schema

The following is an example 'compact' data object returned for a resume in the API response. A compact response returns only the 'parsed' values, without the 'raw' text or any additional metadata.

{
  "data": {
    "achievement": null,
    "association": null,
    "availability": null,
    "candidateName": {
      "candidateNameFamily": "Christensen",
      "candidateNameFirst": "Hugo"
    },
    "dateOfBirth": null,
    "education": [
      {
        "educationDateRange": "09/2001 - 7/2005",
        "educationLocation": {
          "formatted": "Melbourne VIC, Australia",
          "streetNumber": null,
          "street": null,
          "apartmentNumber": null,
          "city": "Melbourne",
          "postalCode": null,
          "state": "Victoria",
          "country": "Australia",
          "rawInput": "Melbourne, VIC",
          "countryCode": "AU",
          "latitude": -37.8136276,
          "longitude": 144.9630576,
          "poBox": null
        },
        "educationAccreditation": "BS",
        "educationOrganization": "Monash University",
        "educationMajor": [
          "Computer science and technology"
        ],
        "educationLevel": {
          "id": 7871,
          "label": "Bachelor",
          "value": "Bachelor"
        }
      },
      {
        "educationAccreditation": "Project Management Professional (PMP)",
        "educationOrganization": "PMI",
        "educationDateRange": "June 2017",
        "educationLevel": {
          "id": 8313,
          "label": "Course/Certificate",
          "value": "Course/Certificate"
        }
      },
      {
        "educationAccreditation": "Certified ScrumMaster (CSM)",
        "educationOrganization": "Scrum Alliance",
        "educationDateRange": "June 2017",
        "educationLevel": {
          "id": 8313,
          "label": "Course/Certificate",
          "value": "Course/Certificate"
        }
      },
      {
        "educationOrganization": "Ulead System Company",
        "educationAccreditation": "C+ +, and MFC",
        "educationDateRange": "2005",
        "educationLevel": {
          "id": 8313,
          "label": "Course/Certificate",
          "value": "Course/Certificate"
        }
      },
      {
        "educationAccreditation": "IBM Portal and WCM training",
        "educationOrganization": "IBM",
        "educationDateRange": "201",
        "educationLevel": {
          "id": 8313,
          "label": "Course/Certificate",
          "value": "Course/Certificate"
        }
      },
      {
        "educationAccreditation": "Platform Architecture and Design training program",
        "educationOrganization": "BOM Design University",
        "educationDateRange": "201",
        "educationLevel": {
          "id": 8313,
          "label": "Course/Certificate",
          "value": "Course/Certificate"
        }
      },
      {
        "educationAccreditation": "Interfunctional Communication Skills",
        "educationOrganization": "51Job",
        "educationDateRange": "201",
        "educationLevel": {
          "id": 8313,
          "label": "Course/Certificate",
          "value": "Course/Certificate"
        }
      },
      {
        "educationAccreditation": "New Ways of Leading",
        "educationOrganization": "BOM Vendor",
        "educationDateRange": "2016",
        "educationLevel": {
          "id": 8313,
          "label": "Course/Certificate",
          "value": "Course/Certificate"
        }
      },
      {
        "educationOrganization": "BOM Vendor",
        "educationAccreditation": "PMS – Managing Poor performance and unacceptable behavior",
        "educationDateRange": "201",
        "educationLevel": {
          "id": 8313,
          "label": "Course/Certificate",
          "value": "Course/Certificate"
        }
      }
    ],
    "email": [
      "[email protected]"
    ],
    "headshot": null,
    "hobby": null,
    "language": null,
    "location": null,
    "nationality": null,
    "objective": null,
    "patent": null,
    "phoneNumber": [
      "0458 023 928"
    ],
    "preferredWorkLocation": null,
    "project": null,
    "publication": null,
    "referee": null,
    "rightToWork": null,
    "skill": [
      "AngularJS",
      "React",
      "Application",
      "RESTful",
      "programming",
      "Object",
      "Analysis",
      "SDLC",
      "logical",
      "programing skills.",
      "Self-Motivated",
      "Self-Starter",
      "Team",
      "supervision",
      "Communication skills",
      "CDN)",
      "MS SQL Server",
      "bootstrap",
      "website analytics",
      "SEO",
      "AWS",
      "MS access",
      "Data layer O/R Mapping (Subsonic",
      "LLBLGen",
      "Linq2Entities)",
      "In-memory data cache (Redis)",
      "Crystal",
      "Organizational skills",
      "Time management skills.",
      "Programming",
      "C#",
      ".NET Framework",
      "JavaScript",
      "C+ +",
      "MFC",
      "Delphi 5",
      "VB 6.0",
      "Web Technologies",
      "HTML",
      "JavaScript/AJAX",
      "Bootstrap",
      "AngularJS",
      "Website performance tuning",
      "Website analytics",
      "A/B test",
      "SEO",
      "Web application configurations (IIS",
      "domain redirect",
      "load balance",
      "SQL Performance tuning",
      "Complicated Stored procedure",
      "function",
      "Active Reports",
      "SQL Server Reporting Service",
      "Crystal Reports",
      "ASP.NET",
      "ADO.NET",
      "Web Services",
      "Windows Services",
      "WCF",
      "Visual Studio",
      "2003/2005/2008/2010/2015",
      "Version Control Tools",
      "SVN",
      "GIT",
      "TFS",
      "Visual Source Safe (VSS)",
      "Web Content Manager (WCM)",
      "AEM",
      "Visio",
      "Visual Paradigm",
      "Power Designer",
      "Bugzilla",
      "RTC",
      "JIRA",
      "Confluence",
      "ASP.NET",
      "GIT;",
      "Website performance tuning",
      "CDN",
      "Load balance",
      "SEO"
    ],
    "summary": {},
    "totalYearsExperience": null,
    "website": null,
    "willingToRelocate": null,
    "workExperience": [
      {
        "workExperienceDateRange": "01/2014 – Now",
        "workExperienceOrganization": "Bank of Melbourne",
        "jobTitle": "Technical Leader, Online Solution team",
        "jobDescription": {},
        "workExperienceType": {
          "id": 7874,
          "label": "Full time",
          "value": "Full time"
        }
      },
      {
        "workExperienceDateRange": "11/2011 – 12/2013",
        "workExperienceOrganization": "PEPPERMAYO",
        "jobTitle": "Sr. eCommence Website Developer",
        "jobDescription": {},
        "workExperienceType": {
          "id": 7874,
          "label": "Full time",
          "value": "Full time"
        }
      },
      {
        "workExperienceDateRange": "01/2008 – 08/2010",
        "workExperienceOrganization": "Accelx Software Solutions",
        "jobTitle": "Sr. .NET Developer, EMC Journaling",
        "jobDescription": {},
        "workExperienceType": {
          "id": 7874,
          "label": "Full time",
          "value": "Full time"
        }
      }
    ],
    "rawText": "Resume Hugo Christensen Tel: 0458 023 928 Email: [email protected] SUMMARY: ➢ Technical leader currently with BOM Company (Bank of Melbourne). Responsible for leading the offshore team to maintain and enhance the website of BOM.com. Including the responsive design with bootstrap, rapid online forms and loan calculators, interest rates, website analytics and SEO ➢ Microsoft Technology Professional with 1 1 years of hands-on experience in analysis, designing, coding and testing of web applications ➢ Experienced in developing large scale e-commerce application with ASP. NET MVC, social network application with AWS, and other web applications with ASP.NET/SQL Server technologies ➢ Experienced in developing financial open systems such as credit card and merchants management system, post transaction systems for generation audit reports and customer letters ➢ Experienced in data modeling, database design, programming of stored procedures, functions, triggers, and cursors using Transact-SQL. Experienced in migration business application from Oracle to SQL Server and knowledge of MongoDB ➢ Experienced in web-based reporting with SQL Server Reporting Service, Active Reports and Crystal Report ➢ Experienced in performance tuning of large scale e-commerce website from both front-end and backend with different technologies, including but not limited to reduce http requests, combine and minify multiple CSS/JS files, CSS sprite, lazy load images, partial page caching, memory cache with Redis, preloaded data for page rendering, and leverage of CDN ➢ Experienced in JavaScript programming skills such as JQuery, AJAX, Bootstrap. Knowledge of AngularJS and React Native ➢ Experienced in Windows Service Application and RESTful Services with C# .Net Framework. Experienced in developing batch jobs of file generation and importing across different bank systems communication ➢ Experienced in n-tier enterprise web application development. 
Strong knowledge with Object Oriented programming (OOPS), Object Oriented Analysis and Design (OOAD). Knowledge of .NET best practices and design patterns, such as SOLID principles, design patterns like Factory, Adapter, Proxy, Observer, and IoC etc. ➢ Experienced in production support with strong technical research capability, including problem analysis and urgent incident fix ➢ Experienced in software development process with both agile (scrum) and waterfall SDLC including requirements analysis, design, implementation, testing and deployment of large scale web application ➢ Excellent analytical, logical and programing skills. Self-Motivated, Self-Starter, Team Player can work with minimal supervision with effective Communication skills and Organizational skills combined with attention and Time management skills. TECHNI CAL SKI LLS: ➢ Programming Languages: C# and .NET Framework, JavaScript, C+ + and MFC, Delphi 5, VB 6.0 ➢ Web Technologies: HTML, JavaScript/AJAX, Bootstrap, AngularJS, Website performance tuning, Website analytics, A/B test, SEO, Web application configurations (IIS, domain redirect, load balance, CDN) ➢ Database Technologies: MS SQL Server, MS access, Data layer O/R Mapping (Subsonic, LLBLGen, Linq2Entities), In-memory data cache (Redis), SQL Performance tuning, Complicated Stored procedure and function ➢ Reporting Tools: Active Reports, SQL Server Reporting Service, Crystal Reports ➢ .NET Technologies: ASP.NET, ADO.NET, Web Services, Windows Services, WCF, Visual Studio 2003/2005/2008/2010/2015 ➢ Version Control Tools: SVN, GIT, TFS, Visual Source Safe (VSS) ➢ Other Packages/Tools: Web Content Manager (WCM), AEM, Visio, Visual Paradigm, Power Designer, Bugzilla, RTC, JIRA, Confluence PROFESSI ONAL EXPERI ENCE: Bank of Melbourne 01/2014 – Now Technical Leader, Online Solution team Description: The team maintains and enhances multiple web sites and applications in BOM. 
Including BOM.com website maintenance and enhancement, SG EPATs web application, China Inc. SharePoint intranet site, and the other systems like Rapid Online Forms backend, eAdmin Interest Rates. Responsibilities: Lead developer team to design and implement the business requirements, to cooperate with the onshore teams in Singapore and Melbourne. Responsible for communication, coordination and management of the offshore team in Chengdu. Lead the technical transition from WCM to AEM (Adobe Experience Manager), responsive design with bootstrap, and site analytics. Technologies: IBM WCM; AEM; Website analytics; Test & Target; SEO; Responsive design, bootstrap; MS.Net, C#, SQL Server, SSRS; Radware, Akamai, AWS; change management and production support. PEPPERMAYO 11/2011 – 12/2013 Sr. eCommence Website Developer, Peppermayo.com Description: PEPPERMAYO started in 2007 as a specialty fashion e-retailer, and has become a leading apparel brand in Australia. PEPPERMAYO has more than tens of millions registered users; and yearly sales was around 3 billion in 2012. Responsibilities: Implemented web application according to business requirements. Improved performance from the client side with different technologies; Include lazy load, CSS Sprite, http requests reduction, combine minify and compress CSS/JavaScript. Implemented the controller layer of the web application with MVC architecture. Encapsulated interfaces and data access layer. Used different technologies to improve the server side performance including output cache, Redis, and preloaded data for page rendering. After performance tuning from both client side and server side, the server side response time (TP999) was less than 700ms; client side average load time was less than 2300ms. We had cut down 37% servers (Web + DB) and network traffic decreased by 41% . Technologies: C# .NET Framework; ASP.NET MVC; GIT; Website performance tuning, CDN, Load balance, SEO Accelx Software Solutions 01/2008 – 08/2010 Sr. 
.NET Developer, EMC Journaling Description: EMC Journaling is a user-transparent addition of EMC solutions. It saves content in EMC DCTM instead of MS SQL Server; and access \"native\" in SharePoint. Responsibilities: Implemented Edrsadm (a command-line backup tool) for Journaling Database. Implemented CRSS (content recovery staging system). Implement templates for various admin configuration pages. Technologies: C# .NET Framework; SQL Server 2005; Perforce; Fisheye ; JIRA; EMC DCTM; SharePoint 2007 EDUCATI ON: BS Monash University – Melbourne, VIC 09/2001 - 7/2005 Major: Computer science and technology CERTI FI CATES: Certificate Name Month/Year Authority Project Management Professional (PMP) June 2017 PMI Certified ScrumMaster (CSM) June 2017 Scrum Alliance TRAI NI NGS: Training Name Year Trainer C+ +, and MFC 2005 Ulead System Company IBM Portal and WCM training 201 4 IBM Platform Architecture and Design training program 201 4 BOM Design University Interfunctional Communication Skills 201 5 51Job New Ways of Leading 2016 BOM Vendor PMS – Managing Poor performance and unacceptable behavior 201 7 BOM Vendor"
  },
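
As a rough illustration of consuming the compact object above, the following Python sketch reads a few values from `data`. The key names come from the example response; the helper name `summarize_compact` and the de-duplication rule are assumptions made for this sketch, not part of the Affinda API. Fields with no extracted value arrive as null, so the code falls back to empty values, and because the example `skill` list contains repeats (for instance "SEO"), it removes duplicates case-insensitively.

```python
from typing import Any


def summarize_compact(data: dict[str, Any]) -> dict[str, Any]:
    """Pull a few commonly used values out of a compact `data` object."""
    name = data.get("candidateName") or {}
    full_name = " ".join(
        part
        for part in (name.get("candidateNameFirst"), name.get("candidateNameFamily"))
        if part
    )

    # Fields can be null when nothing was found, so fall back to [].
    emails = data.get("email") or []
    education = data.get("education") or []
    skills = data.get("skill") or []

    # De-duplicate skills case-insensitively while keeping first-seen order.
    seen: set[str] = set()
    unique_skills = []
    for skill in skills:
        key = skill.strip().casefold()
        if key and key not in seen:
            seen.add(key)
            unique_skills.append(skill.strip())

    return {
        "name": full_name,
        "emails": emails,
        "organizations": [e.get("educationOrganization") for e in education],
        "skills": unique_skills,
    }


# Trimmed-down sample using keys from the example response.
sample = {
    "candidateName": {"candidateNameFamily": "Christensen", "candidateNameFirst": "Hugo"},
    "email": None,
    "education": [{"educationOrganization": "Monash University"}],
    "skill": ["SEO", "bootstrap", "Bootstrap", "SEO"],
}
summary = summarize_compact(sample)
print(summary["name"])    # Hugo Christensen
print(summary["skills"])  # ['SEO', 'bootstrap']
```

Whether case-insensitive de-duplication is appropriate depends on how the skills are used downstream; mapping skills to a taxonomy, where enabled, may make it unnecessary.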

Additional Metadata

Field Level Metadata

| Metadata | Description |
| --- | --- |
| id | Identifier associated with the specific data point |
| rectangle(s) | x/y coordinates for the rectangular bounding box containing the data |
| pageIndex | The page that the data is found on |
| raw | Raw data extracted before any processing and formatting |
| confidence | Overall confidence that indicates the likelihood the data extracted is correct. This considers both classification and text extraction confidence scores |
| classificationConfidence | A value that indicates the confidence that the model has that the data returned is correct |
| textExtractionConfidence | A value that indicates the confidence that the text extracted from the document is correct (relevant for scanned documents) |
| isVerified | Indicates whether the data has been validated, either by a human using our validation tool or through auto-validation rules |
| isClientVerified | Indicates whether the data has been validated by a human |
| isAutoVerified | Indicates whether the data has been auto-validated |
| dataPoint | A unique identifier associated with that data field |
| contentType | Type of data. Options include text, date, date-time, enum, location, float, and decimal |
| parsed | Parsed data extracted after post-processing steps, including reformatting or mapping to a defined taxonomy |
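
One possible use of the field-level metadata listed above is to decide which extracted values should be sent for human review. The Python sketch below assumes that, in a non-compact response, each field is an object carrying keys such as `raw`, `parsed`, `confidence` and `isVerified`, and that `confidence` is a number between 0 and 1. The exact nesting, the helper name `needs_review`, and the 0.8 threshold are all assumptions for illustration only.

```python
from typing import Any, Optional


def needs_review(field: Optional[dict[str, Any]], threshold: float = 0.8) -> bool:
    """Return True when a field should be checked by a person.

    Assumes `field` is a dict carrying the field-level metadata keys;
    the 0.8 threshold is an arbitrary illustrative value.
    """
    if field is None:  # nothing extracted, nothing to review
        return False
    if field.get("isVerified"):  # already validated by a human or a rule
        return False
    confidence = field.get("confidence")
    # Treat a missing confidence as low confidence.
    return confidence is None or confidence < threshold


# Hypothetical field objects, shaped only for this example.
fields = {
    "jobTitle": {"raw": "Sr. .NET Developer", "parsed": "Sr. .NET Developer",
                 "confidence": 0.97, "isVerified": False},
    "dateOfBirth": {"raw": "1 1", "parsed": None,
                    "confidence": 0.41, "isVerified": False},
    "nationality": None,
}
to_review = [name for name, field in fields.items() if needs_review(field)]
print(to_review)  # ['dateOfBirth']
```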

Document Level Metadata

| Metadata | Description |
| --- | --- |
| identifier | A unique identifier associated with the document. Can be specified on upload, or else will be randomly generated by Affinda |
| fileName | An optional filename of the file |
| ready | If true, the document has finished processing. Particularly useful if an endpoint request specified wait=False: when polling, use this variable to determine when to stop |
| readyDt | The date-time when the document was ready |
| failed | If true, some exception was raised during processing. Check the 'error' field of the main return object |
| expiryTime | The date/time in ISO-8601 format when the document will be automatically deleted. Defaults to no expiry |
| language | The document's language |
| pdf | The URL to the document's PDF (if the uploaded document is not already a PDF, it is converted to PDF as part of the parsing process) |
| parentDocument.identifier | If this document is part of a split document, this attribute points to the original document that this document is split from |
| childDocuments.identifier | If this document has been split into a number of child documents, this attribute points to those child documents |
| pages | The number of pages in the document |
| isOcrd | Boolean indicating whether the document has had OCR applied to extract text (if false, the data was extracted from an existing text layer on the document) |
| ocrConfidence | Overall confidence in the accuracy of text extracted from the document by OCR |
| reviewUrl | A signed URL, valid for 60 minutes, that can be used to review and validate the data extracted by the model |
| collection | The Collection that the document is within |
| extractor | The Extractor that is associated with the Collection. An Extractor is an AI model used to extract data from documents |
| workspace | The Workspace that the Collection and document are within |
| archivedDt | The date-time when the document was archived |
| isArchived | Boolean to show if the document has been archived |
| confirmedDt | The date-time when the document was confirmed |
| isConfirmed | Boolean to show if the document has been confirmed |
| rejectedDt | The date-time when the document was rejected |
| isRejected | Boolean to show if the document has been rejected |
| createdDt | The date-time when the document was created in Affinda |
| errorCode | If document processing fails, the error code returned |
| errorDetail | If document processing fails, details of the error identified |
| file | URL to view the file |
| tags | Tags applied to documents to enable filtering and searching |
| confirmedBy | Details of the user that last confirmed the document |
| sourceEmail | If the document is created via email ingestion, this field stores the email file's URL |
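
The `ready` and `failed` flags described above lend themselves to a simple polling loop when a document is uploaded without waiting for processing to finish. The Python sketch below is a minimal illustration under stated assumptions: `fetch_document` is a hypothetical caller-supplied function that returns the current document-level metadata as a dict (it stands in for whatever API call or client method you use), and the interval and attempt limits are arbitrary.

```python
import time
from typing import Any, Callable


def wait_until_ready(
    fetch_document: Callable[[], dict[str, Any]],
    interval_seconds: float = 2.0,
    max_attempts: int = 30,
) -> dict[str, Any]:
    """Poll until the document metadata reports `ready` or `failed`.

    `fetch_document` is a caller-supplied function (hypothetical here) that
    returns the current document-level metadata as a dict.
    """
    for attempt in range(max_attempts):
        meta = fetch_document()
        if meta.get("failed"):
            raise RuntimeError(
                f"Processing failed: {meta.get('errorCode')} {meta.get('errorDetail')}"
            )
        if meta.get("ready"):
            return meta
        if attempt < max_attempts - 1:
            time.sleep(interval_seconds)  # wait before asking again
    raise TimeoutError("Document was not ready after polling")


# Stand-in for a real API call: not ready twice, then ready.
responses = iter([
    {"ready": False, "failed": False},
    {"ready": False, "failed": False},
    {"ready": True, "failed": False, "pages": 3},
])
result = wait_until_ready(lambda: next(responses), interval_seconds=0.01)
print(result["pages"])  # 3
```

Passing the fetch step in as a function keeps the loop independent of any particular HTTP client or SDK, and makes it easy to test with canned responses as shown.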