Introduction

The vOCR is a robust Object Character Recognition (OCR) API, enhanced with fine-tuned vLLMs, designed to efficiently extract data from various document types in a consistent structure.

vOCR shines at the following:

  • KYC Automation: Streamlines the verification process by extracting Personally Identifiable Information (PII) from documents to verify customer identity.
  • Financial Data Extraction: Automates the extraction of financial data from statements, reports, and other financial documents for analysis and reporting.
  • Healthcare Records Management: Extracts patient information, medical history, and treatment details from healthcare records for efficient data management and analysis.
  • Classification Engines: Facilitates accurate categorization and organization of extracted data.

Let’s see the vOCR in action by building a data extractor.

Initial requirements

  • Setup a JigsawStack account (if you don’t have an account already)
  • Get your JigsawStack API key from here.

Request

JavaScript
import { JigsawStack } from "jigsawstack";

const jigsawstack = JigsawStack({
apiKey: "your-api-key",
});

const result = await jigsawstack.vision.vocr({
  prompt: ["Title", "Description"],
  url: "https://res.cloudinary.com/dev-ninja/image/upload/v1712567895/webhooks_qaaslj.png"
});

Response

{
"success": true,
"context": {
  "Title": [
    "Webhook Trigger"
  ],
  "Description": [
    "Select events that will trigger the webhook will listen to. At least one event must be selected.",
    "Success logs",
    "Error logs",
    "Endpoint",
    "Create"
  ]
},
"width": 2880,
"height": 1800,
"tags": [
  "text",
  "screenshot",
  "software",
  "multimedia software",
  "graphics software"
],
"has_text": true,
"sections": [
  {
    "text": "",
    "lines": []
  }
]
}
Find more information on vOCR API here