Image to markdown API

Image OCR API

Turn document images into clean markdown instead of disconnected text fragments.

Image-heavy workflows break simple parsers fast. LeapOCR helps teams convert screenshots, scanned pages, receipts, and photo captures into readable markdown that keeps sections, lists, and tables usable.

Why teams use this

Support screenshots, document photos, scans, and mixed image uploads.

Return markdown that is easier to review, quote, and feed into downstream workflows.

Add instructions and bounding boxes only when the image actually needs more control.


Image extraction request

Use markdown when a person or another model still needs a readable version of the page.

Image to markdown request

  {
    "url": "https://example.com/document-photo.jpg",
    "file_name": "document-photo.jpg",
    "format": "markdown",
    "instructions": "Keep headings and tables intact. Normalize dates."
  }
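As a minimal sketch, a client could assemble the same request body in Python before sending it. The field names `url`, `file_name`, and `format` come from the sample request; the helper name `build_markdown_request` and the choice to derive `file_name` from the URL path are assumptions made for illustration. The endpoint and authentication are not shown on this page, so the sketch stops at building and printing the JSON body.

```python
import json
import posixpath
from urllib.parse import urlparse


def build_markdown_request(url: str) -> dict:
    """Build a request body shaped like the sample request (hypothetical helper)."""
    # Assumption: file_name can simply mirror the last path segment of the URL.
    file_name = posixpath.basename(urlparse(url).path)
    return {"url": url, "file_name": file_name, "format": "markdown"}


body = build_markdown_request("https://example.com/document-photo.jpg")
print(json.dumps(body, indent=2))
```

Keeping the body as a plain dictionary makes it easy to add `instructions` later without touching the code that sends the request.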

Why it works

Why image workflows need dedicated OCR

The useful question is whether the output still reads like the source document after OCR, not only whether text was detected.

Readable output

Markdown is easier to review than raw OCR text

Headings, lists, and tables stay intact, which makes screenshots and scans more usable in review workflows.

Messy capture

Phone photos and screenshots share one path

Use the same OCR layer for camera captures, screenshots, and image-heavy document uploads.

Control

Add instructions only when the image needs help

Keep the request surface simple and layer in cleanup or normalization rules only where they matter.

What you control

What teams usually care about

Image-to-markdown workflows are mostly about readability, cleanup, and preserving document structure.

markdown

Readable output

Return a clean document view

Markdown is useful when the next consumer needs to read, review, or pass the page into another model.

images

Input types

Support screenshots, scans, and photos

The same workflow can handle screenshots, mobile captures, scanned pages, and mixed image uploads.

instructions

Optional

Normalize labels or clean noisy sections

Use instructions to translate, condense, or standardize parts of the page without changing the main extraction flow.

bbox

Optional

Attach page geometry when review tools need it

Bounding boxes stay optional so simple image-to-markdown flows do not carry extra output they do not need.
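One way to keep the request surface simple is to start from a base body and add the optional controls only when a caller asks for them. The sketch below is illustrative: `instructions` appears in the sample request on this page, but the `bbox` key and its boolean value are assumptions, as is the helper name `with_options`. Check the API docs for the real parameter shape.

```python
from typing import Optional


def with_options(base: dict, instructions: Optional[str] = None,
                 include_bbox: bool = False) -> dict:
    """Return a copy of the base body with optional controls added on request."""
    body = dict(base)  # copy so the base body is never mutated
    if instructions:
        body["instructions"] = instructions
    if include_bbox:
        # Assumption: bounding boxes are enabled with a boolean "bbox" flag.
        body["bbox"] = True
    return body


base = {"file_name": "scan.png", "format": "markdown"}
simple = with_options(base)
reviewed = with_options(base, instructions="Normalize dates.", include_bbox=True)
```

Here `simple` carries no extra keys, while `reviewed` adds both controls for a review tool that needs geometry.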

Examples

Two common image-to-markdown flows

Most teams either need a readable page for review or a markdown layer that can feed another system cleanly.

Screenshot OCR

Convert a dashboard screenshot into structured markdown

Useful when teams need an extracted view of a visual page without manually copying sections into notes or tickets.

The output stays readable.

Sections survive extraction better than a flat blob.

Useful for review and QA workflows.

Markdown excerpt

  # Shipment exception dashboard
  ## Open exceptions
  - Missing POD: 14
  - Carrier disputes: 6
  - Delayed invoices: 3
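Because headings and list items survive extraction, a downstream step can read the markdown with a few lines of code. The following sketch assumes the output looks like the excerpt here, with `label: number` bullets under headings; the function name `parse_counts` is hypothetical and is not part of LeapOCR.

```python
import re

SAMPLE = """# Shipment exception dashboard
## Open exceptions
- Missing POD: 14
- Carrier disputes: 6
- Delayed invoices: 3
"""


def parse_counts(markdown: str) -> dict:
    """Map each heading to the 'label: number' bullets listed under it."""
    sections = {}
    current = None
    for raw in markdown.splitlines():
        line = raw.strip()
        heading = re.match(r"^#{1,6}\s+(.*)$", line)
        if heading:
            current = heading.group(1)
            sections[current] = {}
            continue
        item = re.match(r"^-\s+(.+?):\s*(\d+)$", line)
        if item and current is not None:
            sections[current][item.group(1)] = int(item.group(2))
    return sections


counts = parse_counts(SAMPLE)
print(counts["Open exceptions"])
```

A flat text blob would need layout guessing to recover the same grouping; with markdown, the heading markers carry it.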

Scanned page OCR

Turn a scanned page into markdown for downstream review

Useful when a scanned image still needs to stay readable for reviewers or internal workflows.

Preserves headings and lists.

Supports scans and lower-quality images.

Keeps a human-readable page representation.

Request


  {
    "file_name": "scan.png",
    "format": "markdown",
    "model": "standard-v1"
  }

FAQ

Questions teams ask before wiring this up

Straight answers for teams evaluating how this workflow fits into production.

Can LeapOCR convert screenshots and photos into markdown?

Yes. The workflow supports screenshots, scans, and mobile image captures, with markdown returned as a readable output format.

When should I use image-to-markdown instead of structured output?

Use markdown when the next consumer is a reviewer, analyst, or LLM that benefits from a readable page representation.

Can I still add bounding boxes to image workflows?

Yes. Bounding boxes remain optional when geometry matters for overlays, review tools, or human-in-the-loop processes.

Ready to test

Test real screenshots and scans on an image-to-markdown OCR API

Run an actual screenshot or phone capture and see whether the markdown still reads like the original page after OCR.
