vOCR API

OCR + AI = Magic

Extract data from any document type in a consistent structure with fine-tuned vLLMs for the highest accuracy

  • 80+ languages

  • Structured data extraction to JSON

  • Query images with any question

  • Line and word level bounding boxes

  • Works with noisy and low quality images

  • AI image tagging and classification

  • High accuracy with fine-tuned LLMs

AI optical character recognition (OCR)

Mix the power of OCR AI with fine-tuned LLMs to extract and accurately correct text from images and documents

Structured data

Get structured data in JSON format categorized by line and word level, including pixel coordinates for each word to form bounding boxes
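As an illustration of working with word-level coordinates, here is a minimal sketch that merges the boxes of the words on one line into a single enclosing box. The `Box` and `Word` shapes, their field names, and the sample values are assumptions made for this example; they are not the documented vOCR response schema.

```typescript
// Hypothetical shapes: field names are illustrative, not the documented vOCR schema.
interface Box {
  x: number; // left edge in pixels
  y: number; // top edge in pixels
  width: number;
  height: number;
}

interface Word {
  text: string;
  box: Box;
}

// Merge the boxes of all words on a line into one enclosing box.
function lineBoundingBox(words: Word[]): Box | null {
  if (words.length === 0) return null;
  const left = Math.min(...words.map((w) => w.box.x));
  const top = Math.min(...words.map((w) => w.box.y));
  const right = Math.max(...words.map((w) => w.box.x + w.box.width));
  const bottom = Math.max(...words.map((w) => w.box.y + w.box.height));
  return { x: left, y: top, width: right - left, height: bottom - top };
}

// Made-up sample data for two words on the same line.
const sampleLine: Word[] = [
  { text: "Invoice", box: { x: 10, y: 20, width: 60, height: 12 } },
  { text: "#1042", box: { x: 75, y: 18, width: 40, height: 14 } },
];

console.log(lineBoundingBox(sampleLine));
// { x: 10, y: 18, width: 105, height: 14 }
```

The same approach works in reverse: with a line-level box in hand, you can crop or highlight that region of the source image.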

File & document types

File support including PDF, PNG and JPEG, for passports, invoices, complex images and more

Blazing fast speed

Run millions of images in seconds with the latest AI models and GPUs globally distributed for low latency

Tagging

Get AI tagging and classification for your images to understand their content and context

Secure and private

Keep your data secure and private with end-to-end encryption and containerized AI models for processing data

Integrate vOCR on any platform

Easy-to-use REST APIs that work out of the box in every language and framework, with fully managed caching, logging and authentication

import { JigsawStack } from "jigsawstack";

const jigsaw = JigsawStack({
    apiKey: "sk39wo393.....32ncsmw9339RNj3"
});

// Run vOCR on an image hosted at a URL
const response = await jigsaw.vision.vocr({ url: "https://storage.com/complex_legal_image.jpeg" });

$ npm i jigsawstack
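When processing more than one image, a small helper can run the calls concurrently and keep one failed image from aborting the rest. The sketch below is an assumption-laden illustration: `VocrClient`, `VocrOutcome` and `vocrMany` are hypothetical names, and only the `vision.vocr({ url })` call is taken from the snippet above. The response is typed as `unknown` because its schema is not described here.

```typescript
// Only `vision.vocr({ url })` comes from the snippet above; everything else is illustrative.
interface VocrClient {
  vision: { vocr(params: { url: string }): Promise<unknown> };
}

type VocrOutcome =
  | { url: string; ok: true; data: unknown }
  | { url: string; ok: false; error: string };

// Run vOCR over several image URLs concurrently.
// Each call catches its own error, so one failure does not reject the whole batch.
async function vocrMany(client: VocrClient, urls: string[]): Promise<VocrOutcome[]> {
  return Promise.all(
    urls.map(async (url): Promise<VocrOutcome> => {
      try {
        const data = await client.vision.vocr({ url });
        return { url, ok: true, data };
      } catch (err) {
        const error = err instanceof Error ? err.message : String(err);
        return { url, ok: false, error };
      }
    })
  );
}
```

Assuming the SDK client is structurally compatible with `VocrClient`, it could be passed in directly, for example `await vocrMany(jigsaw, urls)`. For very large batches you would likely also want to cap concurrency, which this sketch does not do.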

What can you build with JigsawStack vOCR?

5 ways our customers use JigsawStack to build vOCR-powered applications

KYC automation

Automate your KYC process by securely extracting text from documents to verify customer identity

Fraud detection

Detect fraudulent activity by analyzing risk factors in documents and images using AI tagging and classification

Accessibility

Increase accessibility seamlessly by accurately extracting text from images without the need for manual transcription

Build document solutions

Build AI-powered document solutions that extract text from documents and images across a variety of layouts

Healthcare or legal

Safely extract information from sensitive documents and images with end-to-end encryption for compliance and digital transformation

Join the community of AI Engineers shipping faster with JigsawStack 🧩

First-class Developer Experience (DX)

Striking the right balance between code and dashboard

Logging and analytics on all APIs

Access real-time logs and analytics on all your APIs. Debug errors and track users, location maps, sessions, countries, IPs and 30+ other data points

API key security control

Fine-grained control over API keys. Whitelist domains with flexible wildcard support, set expiration dates and limit access to specific APIs, with unlimited keys
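To illustrate what wildcard domain whitelisting can look like, here is a minimal sketch of a matcher. The pattern syntax (a `*` standing for one or more subdomain labels) and the function names are assumptions for this example; it does not describe how JigsawStack evaluates its whitelist.

```typescript
// Hypothetical matcher: "*" stands for one or more dot-separated hostname labels.
function matchesDomain(pattern: string, hostname: string): boolean {
  // Escape regex metacharacters first, then expand each "*" into a label group.
  const escaped = pattern
    .toLowerCase()
    .replace(/[.+?^${}()|[\]\\]/g, "\\$&")
    .replace(/\*/g, "[a-z0-9-]+(?:\\.[a-z0-9-]+)*");
  // Anchor both ends so a pattern cannot match only part of the hostname.
  return new RegExp(`^${escaped}$`).test(hostname.toLowerCase());
}

// A hostname is allowed if it matches at least one whitelist pattern.
function isAllowed(whitelist: string[], hostname: string): boolean {
  return whitelist.some((pattern) => matchesDomain(pattern, hostname));
}

const whitelist = ["example.com", "*.example.com"];
console.log(isAllowed(whitelist, "app.example.com")); // true
console.log(isAllowed(whitelist, "example.com.evil.io")); // false
```

Anchoring the pattern is the important design choice here: without it, a lookalike host such as `example.com.evil.io` would pass a naive substring check.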

Fully typed SDKs

The best docs are the kind that you don't need. Fully typed SDKs with auto-completion and self-explanatory params

Team and project management

Manage multiple projects and teams with access control. Invite unlimited team members and assign roles

Globally distributed APIs in 99+ locations, without the hassle

JigsawStack APIs are built from the ground up on the edge network

Blazing fast

99.5% uptime, with API latency as low as 200ms globally

Simple scalable pricing

Scale up and down as you need with usage-based pricing, without worrying about costs from abuse

Consistency

Consistent request and response structure across all API services for predictable use

Up to date

Consistent training for all JigsawStack models to ensure the latest technology is always available without breaking changes

The missing piece to your tech stack