Skip to main content

Document Collection & Data Extraction

Accelerated Document Processing with AI-Powered OCR

Carolina Navarro avatar
Written by Carolina Navarro
Updated over a week ago

With our OCR capabilities you can analyze all types of official and employment-related documents. You can process documents in real time, get quick results, and enhance your team’s data management with a reliable mechanism to analyze your candidate’s data; at any stage: prescreening, latter stages, onboarding, and offboarding.


How it works

Talkpush uses Optical-Character-Recognition (OCR) technology powered by AI to read through different document types (ID cards, resumes, licenses, certificates, etc.). It is a specialized document-processing model rather than a general-purpose AI. You can incorporate OCR in your questionnaires to ask for various documentation. Once the candidates have uploaded their documents, the system extracts key data points and confirms the document’s authenticity by cross-verifying all expected elements (such as ID number, keywords, and format). This multi-point validation ensures high accuracy and significantly reduces false positives. With OCR, you are guaranteed to:

  • Collect the right documents at the right time

  • Eliminate the manual task of inspecting documents manually

  • Get accurate results effortlessly

  • Reject documents that do not clearly meet all the recognition criteria. It does not make assumptions or fill in missing data.

How to activate it

Reach out to your Account Manager to discuss the pricing options. Once your plan has been approved, you’ll see the ‘Documents’ section under Templates.


We're currently working to expand our library of supported documents, if the document is not on the list below, discuss its feasibility with your Account Manager.

Supported Documents

Recruitment OCR | Document Collection


How to set it up

To access the feature go to Templates > Documents

Add a new document

  1. Click the '+ Add Document' button, these are the available configurations:

  2. Add your details to the specific document, here's a breakdown:

    1. Official Name of the Document (e.g. Drivers License, Passport). Required Field. 70 characters maximum.

    2. Country of Issue: Select the country that issues the document. Required Field.

    3. Permissions: Select who can access the document.

      • Full Access: Anyone with access to the Campaign wherein the document is located can view and download the documents.

      • Restricted: Only 'Onboarding Admins' can view and download.

    4. Document Recognition: This toggle enables the OCR capabilities, once you toggle it on, you will be able to edit the 'Real-Time Recognition Feedback' & 'Identity Verification' options.

    5. Real-time Recognition Feedback: This feature can provide real-time feedback to candidates on their document uploads, ensuring they know if their submissions have been successfully recognized or need adjustments.

      • When this feature is toggled on:

        Candidates receive instant notifications within the Landing Page as soon as they upload a document: Recognized & Not Recognized.

        • If recognized, the candidate can proceed without any further action.

        • If unrecognized or requires re-uploading, the candidate is promptly informed, allowing them to correct the issue immediately.

        • If a document lacks essential information, it is automatically rejected, reducing the chances of incorrect matches.

        Candidates are allowed to resubmit documents that are not recognized up to three times. After three tries, candidates can still submit the file, but it will be marked as 'not recognized.'

      • When this feature is toggled off:

        • The document will still be uploaded in the same channel, but candidates will not receive instant feedback on the status.

        • Instead, the document's status will be processed without immediate notification to the candidate, potentially delaying necessary corrections.

    6. Identity Verification: Enabling this feature validates if the name of the document belongs to the candidate. If Identity Verification is enabled then it will show as configured in the document list.


      Documents are:

      • Verified: The document type matches what was requested and the name matches the candidate's name.

      • Not Verified: The name does not match the candidate's name or was not readable.

    7. Data Extraction: This feature allows you to extract up to 10 data points from the documents which are then converted into Candidate Attributes within Talkpush. This data can be exported, and used for automation & filtering.

      The system will extract the exact data it found (e.g., name, ID number) and note any missing elements.


      Note: If the Candidate Attribute has existing data associated with it, it will be overwritten by the new data extracted from the document.

      • Status Display: Each Extracted Candidate Attribute will show its status next to it in the Candidate Profile, they include:

        • Recognized

        • Unrecognized

        • Failed

        • Processing

Navigating the Documents Tab

Document Search

Find a document by typing its name or the associated country.


Note: For existing documents (before July 2024), the country field may be empty.

Edit/Delete Documents

Click the three-dot menu (⋮) to edit the Document's settings or Delete them from the list.

Document Status Visibility

You can see the processing statuses directly from the Candidate Profile.

Collecting Documents via Chat/Email

Some candidates prefer to complete their process via Chat (WhatsApp or Messenger). On other occasions, you might need to request a specific document via email. In such cases, where the Landing Page is not a completion option, you can use our tokens <mail attachment> or <chat attachment> to collect the documents.

Steps to Request Documents:

  1. Compose Your Message: Send a message to the candidate asking for the required document, and include the <mail attachment> token for emails or the <chat attachment> token for Messenger & WhatsApp.

  2. Choose Document Type: Decide whether the attachment should be stored in the candidate's Attachments section or the Documents Tab. If you choose Documents, specify the document type to ensure proper categorization.

  3. Select Channel and Send: Pick your preferred communication channel (email or chat) and click Send.

  4. Saving the Document

    • For email requests, the candidate sends the document to the provided email address, and it is saved in the specified document template.

    • For chat requests, the document is sent via chat and saved accordingly, without the need for an email address.​

Note: If you can’t manually upload a document to a candidate's profile, no document template may be created in the Templates tab. Adding a new template with the name of the desired document should solve this problem.


Q: What file format can the candidates use to upload Documents?

A: Most file formats are supported, including: Mp3, Mp4, PDF, JPEG, PNG, Docx, Doc, Excel, CSV, Txt, PPT

Q: Who can view uploaded Documents?

A: Owners and users with campaign access can view and download documents

Q: Can I download all documents for a single candidate at once?
A: If you have Onboarding Admin Access, you can download all the documents for a single candidate profile. Just click Download, and the documents will be available as a zipped file.

Q: Am I able to upload more than one file to a candidates document field?

A: You can manually do this by clicking “Upload” under the Document section. The limit is 5 files per one document type. Furthermore you can ask candidates to upload multiple documents to one document type using the <Mail attachment> and <Chat attachment> tokens.




Did this answer your question?