Overview

The vOCR is a robust Object Character Recognition (OCR) API, enhanced with fine-tuned vLLMs, designed to efficiently extract data from various document types in a consistent structure. vOCR shines at the following:
  • KYC Automation: Streamlines the verification process by extracting Personally Identifiable Information (PII) from documents to verify customer identity.
  • Financial Data Extraction: Automates the extraction of financial data from statements, reports, and other financial documents for analysis and reporting.
  • Healthcare Records Management: Extracts patient information, medical history, and treatment details from healthcare records for efficient data management and analysis.
  • Classification Engines: Facilitates accurate categorization and organization of extracted data.

API Endpoint

POST /v1/vocr

Quick Start

JavaScript
import { JigsawStack } from "jigsawstack";

const jigsaw = JigsawStack({ apiKey: "your-api-key" });

const response = await jigsaw.vision.vocr({
  "prompt": [
        "total_price",
        "tax"
  ],
  "url": "https://jigsawstack.com/preview/vocr-example.jpg"
})

Response Example

{
  "success": true,
  "context": {
        "total_price": [
              "144.02"
        ],
        "tax": [
              "4.58"
        ]
  },
  "width": 720,
  "height": 960,
  "tags": [
        "text",
        "paper",
        "receipt"
  ],
  "has_text": true,
  "sections": [...],
  "_usage": {
        "input_tokens": 23,
        "output_tokens": 11203,
        "inference_time_tokens": 3082,
        "total_tokens": 14308
  }
}
Find more information on vOCR API here