Small models for your data extraction pipeline

Specialized data and extraction models for OCR, object detection, web search, and more

npm i jigsawstack

AI Scraper

vOCR

Object Detection

Web Search

Speech to Text

AI Web Scraper Preview

Try more advanced controls ->

Sign up for free to run AI Web Scraper preview

...

Trusted by builders at

Small models, big results

vOCR

Extract text from images and documents

Object Detection

Detect any object in real life or GUIs

AI Web Scraper

Scrape by prompting

Web Search

Web search for AI

Classification

Classify anything you want

Speech to Text

Transcribe audio/video to text in seconds

JigsawStack's AI models are organized into three key categories. Data Extraction, Transformation, and Validation.

View all models ->

Integrate in seconds on any platform

Install with coding agent

JavaScript

Python

PHP

Ruby

Go

Java

Swift

Dart

Kotlin

C#

cURL

...
npm install jigsawstack

Community of AI Engineers shipping faster with us

Features for every developer

Structured data

All models have been trained from the ground up to response in a consistent structure on every run

Automatic scale

Serverlessly run BILLIONS of models concurrently in less than 200ms and only pay for what you use

Purpose-Built Models

Purpose-built models trained for specific tasks, delivering state-of-the-art quality and performance

Easy integration

Fully typed SDKs, clear documentation, and copy-pastable code snippets for seamless integration into any codebase

Observability

Real-time logs and analytics. Debug errors, track users, location maps, sessions, countries, IPs and 30+ data points

Secure & Private

Secure and private instance for your data. Fine grained access control on API keys.

Global first models

Multilingual

Global support for over 160+ languages across all models

Global training datasets

We collect training data from all around the world to ensure our models are as accurate no matter the locality or niche context

Distributed GPUs

90+ global GPUs to ensure the fastest inference times all the time

Smart cache

Automatic smart caching to lower cost and improve latency

The missing piece to your tech stack