Skip to main content

Data Extraction APIs

Welcome to the DumplingAI API reference. Our focus is helping you pull structured data out of the real world: websites, PDFs, screenshots, videos, audio, and social feeds, fast. What you can extract
  • Web & app pages: crawl full sites or targeted URLs, run JS, and return cleaned HTML/markdown (/crawl, /scrape).
  • Documents & PDFs: convert, merge, or read metadata; turn files into text or embeddings (/doc-to-text, /convert-to-pdf, /merge-pdfs, /read-pdf-metadata).
  • Media to text: transcribe and summarize YouTube/TikTok/audio/video; capture frames and OCR images (/get-youtube-transcript, /get-tiktok-transcript, /extract-audio, /extract-video, /extract-image).
  • Social & places data: pull profiles, posts, reviews, search results, or screenshots from major networks (/search-youtube, /get-linkedin-profile, /search-places, /screenshot).

Authentication

All API requests require authentication using an API key. To use the API, you need to include your API key in the Authorization header of your requests.
Authorization: Bearer YOUR_API_KEY
If you don’t have an API key yet, sign in to the DumplingAI dashboard and generate one from the API Keys section.

Base URL

All extraction endpoints share the same host:
https://app.dumplingai.com

Rate limits & versioning

Requests consume credits tied to your plan; exceeding limits returns 429 Too Many Requests. The current version is v1; include it in URLs where specified in individual endpoint docs.

Next steps

Pick the extractor you need from the sidebar (web, document, media, or social). Each page includes required params, sample requests, and example responses tailored to that data type.