Crawl website
Web Scraping
Crawl Website
Crawl a site or sitemap and return captured pages with metadata.
POST
Crawl website
Documentation Index
Fetch the complete documentation index at: https://docs.dumplingai.com/llms.txt
Use this file to discover all available pages before exploring further.
Description
This endpoint crawls a website and returns structured content from multiple pages.Endpoint
Headers
- Content-Type:
application/json - Authorization: Bearer
<API_KEY>(required)
Request Body
Responses
Success (200)
Example Request
Notes
- Uses 10 credits per crawled page
- Uses anti-bot measures and stealth crawling techniques
- Limit is the max number of pages to crawl
- Depth refers to the distance between the base URL path and sub paths
Rate Limiting
Rate limit headers (X-RateLimit-Limit and X-RateLimit-Remaining) are included in the response.Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Body
application/json
Parameters controlling a crawl job.
Root URL to crawl.
Maximum crawl depth.
Required range:
x >= 1Maximum number of pages to fetch.
Required range:
x >= 1Output format for the crawled pages.
Available options:
markdown, text, raw Optional request source identifier.