Data Extraction APIs
Welcome to the DumplingAI API reference. Our focus is helping you pull structured data out of the real world: websites, PDFs, screenshots, videos, audio, and social feeds, fast. What you can extract- Web & app pages: crawl full sites or targeted URLs, run JS, and return cleaned HTML/markdown (
/crawl,/scrape). - Documents & PDFs: convert, merge, or read metadata; turn files into text or embeddings (
/doc-to-text,/convert-to-pdf,/merge-pdfs,/read-pdf-metadata). - Media to text: transcribe and summarize YouTube/TikTok/audio/video; capture frames and OCR images (
/get-youtube-transcript,/get-tiktok-transcript,/extract-audio,/extract-video,/extract-image). - Social & places data: pull profiles, posts, reviews, search results, or screenshots from major networks (
/search-youtube,/get-linkedin-profile,/search-places,/screenshot).
Authentication
All API requests require authentication using an API key. To use the API, you need to include your API key in the Authorization header of your requests.If you don’t have an API key yet, sign in to the DumplingAI
dashboard and generate one from the API Keys
section.
Base URL
All extraction endpoints share the same host:Rate limits & versioning
Requests consume credits tied to your plan; exceeding limits returns429 Too Many Requests. The current version is v1; include it in URLs where specified in individual endpoint docs.