Introduction
The vOCR is a robust Object Character Recognition (OCR) API, enhanced with fine-tuned vLLMs, designed to efficiently extract data from various document types in a consistent structure.
vOCR shines at the following:
- KYC Automation: Streamlines the verification process by extracting Personally Identifiable Information (PII) from documents to verify customer identity.
- Financial Data Extraction: Automates the extraction of financial data from statements, reports, and other financial documents for analysis and reporting.
- Healthcare Records Management: Extracts patient information, medical history, and treatment details from healthcare records for efficient data management and analysis.
- Classification Engines: Facilitates accurate categorization and organization of extracted data.
Let’s see the vOCR in action by building a data extractor.
Initial requirements
- Setup a JigsawStack account (if you don’t have an account already)
- Get your JigsawStack API key from here.
Request
import { JigsawStack } from "jigsawstack";
const jigsawstack = JigsawStack({
apiKey: "your-api-key",
});
const result = await jigsawstack.vision.vocr({
prompt: ["Title", "Description"],
url: "https://res.cloudinary.com/dev-ninja/image/upload/v1712567895/webhooks_qaaslj.png"
});
Response
{
"success": true,
"context": {
"Title": [
"Webhook Trigger"
],
"Description": [
"Select events that will trigger the webhook will listen to. At least one event must be selected.",
"Success logs",
"Error logs",
"Endpoint",
"Create"
]
},
"width": 2880,
"height": 1800,
"tags": [
"text",
"screenshot",
"software",
"multimedia software",
"graphics software"
],
"has_text": true,
"sections": [
{
"text": "",
"lines": []
}
]
}
Find more information on vOCR API
hereResponses are generated using AI and may contain mistakes.