Doc to Text

Description

This endpoint converts PDF or DOCX documents into plain text. It supports input via URL or base64-encoded file content.

Endpoint

POST https://app.dumplingai.com/api/v1/doc-to-text

Headers

Content-Type: application/json
Authorization: Bearer <API_KEY> (required)

Request Body

{
  "inputMethod": "string", // Required. Either "url" or "base64".
  "file": "string", // Required. URL or base64-encoded file content.
  "pages": "string" // Optional. Pages to process, e.g. "1, 2, 3-7" (see Notes).
}
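
As an illustration of the two input methods, the sketch below builds the request body for either a URL or a local file's bytes. The helper name `build_payload` and the convention of passing bytes for base64 input are hypothetical, and it assumes the API accepts standard base64 without a data-URI prefix, which this page does not state.

```python
import base64
import json


def build_payload(source, pages=None):
    """Build the JSON body for the doc-to-text endpoint.

    `source` is either a URL string or raw file bytes (a convention chosen
    for this sketch); bytes are base64-encoded before sending.
    """
    if isinstance(source, (bytes, bytearray)):
        payload = {
            "inputMethod": "base64",
            # Assumption: plain standard base64, no "data:" prefix.
            "file": base64.b64encode(bytes(source)).decode("ascii"),
        }
    else:
        payload = {"inputMethod": "url", "file": source}
    if pages is not None:
        # The API expects "pages" as a string, e.g. "1, 2, 3-7".
        payload["pages"] = str(pages)
    return payload


# Serialise a base64 payload for a (fake) in-memory document.
body = json.dumps(build_payload(b"%PDF-1.4 example bytes", pages="1"))
```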

Responses

Success (200)

Returns the extracted text from the document.

{
  "text": "string" // Extracted text content
}

Response headers:

X-RateLimit-Limit: The rate limit for the user.
X-RateLimit-Remaining: The number of requests the user has left.

Bad Request (400)

Returned if the request is invalid or the file format is unsupported.

{
  "error": "Error message describing the issue"
}

Internal Server Error (500)

Returned if an error occurs during document processing.

{
  "error": "Error processing document"
}
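
The three response shapes above could be handled on the client roughly as follows. This is a minimal sketch: `DocToTextError` and `extract_text` are hypothetical names, and it assumes any non-200 response may carry an `error` field, which this page confirms only for 400 and 500.

```python
import json


class DocToTextError(Exception):
    """Raised when the API reports a failure (hypothetical helper type)."""

    def __init__(self, status, message):
        super().__init__(f"{status}: {message}")
        self.status = status
        self.message = message


def extract_text(status, raw_body):
    """Return the extracted text for a 200 response, else raise DocToTextError."""
    try:
        data = json.loads(raw_body)
    except ValueError:
        # The body was not valid JSON; fall back to an empty object.
        data = {}
    if not isinstance(data, dict):
        data = {}
    if status == 200 and "text" in data:
        return data["text"]
    # 400 and 500 responses carry an "error" message per the docs above.
    raise DocToTextError(status, data.get("error", "unknown error"))
```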

Example Request

curl -X POST https://app.dumplingai.com/api/v1/doc-to-text \
-H "Content-Type: application/json" \
-H "Authorization: Bearer YOUR_API_KEY" \
-d '{
  "inputMethod": "url",
  "file": "https://example.com/sample.pdf"
}'
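
For readers calling the endpoint from Python, the same request might look like the sketch below, using only the standard library. The helper names `make_request` and `doc_to_text` and the 60-second timeout are assumptions made for illustration, not part of the API.

```python
import json
import urllib.request

API_URL = "https://app.dumplingai.com/api/v1/doc-to-text"


def make_request(api_key, payload):
    """Build (but do not send) the POST request for the doc-to-text endpoint."""
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def doc_to_text(api_key, payload, timeout=60):
    """Send the request and return the "text" field of a successful response."""
    request = make_request(api_key, payload)
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        return json.loads(resp.read())["text"]
```

Calling `doc_to_text("YOUR_API_KEY", {"inputMethod": "url", "file": "https://example.com/sample.pdf"})` would send the request; `urllib.request.urlopen` raises `urllib.error.HTTPError` for 400 and 500 responses.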

Example Response

{
  "text": "This is the extracted text content from the document..."
}

Notes

Supported file formats: PDF and DOCX.
Maximum file size may be limited (refer to your plan details).
If using the URL method, ensure the file is publicly accessible.
This endpoint uses 2 credits per request.
The file type is automatically detected based on the file content.
The “pages” field allows you to specify which pages to process:
- Use comma-separated values or ranges (e.g., “1, 2-” or “1, 2, 3-7”).
- The first page index is 1.
- Use “!” before a number to count pages from the end (e.g., “!1” for the last page).
- If not specified, all pages will be processed by default.
- The value must be a string.
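
To make the page syntax concrete, here is a client-side sketch that expands a "pages" string into explicit page numbers for a document of known length. It is one interpretation of the rules above, not the server's implementation: it assumes ranges are inclusive and that an open-ended range such as "2-" runs to the last page. The name `resolve_pages` is hypothetical.

```python
def resolve_pages(spec, page_count):
    """Expand a "pages" string into a sorted list of 1-based page numbers."""

    def to_index(token):
        token = token.strip()
        if token.startswith("!"):
            # "!n" counts from the end: "!1" is the last page.
            return page_count - int(token[1:]) + 1
        return int(token)

    selected = set()
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue
        if "-" in part:
            start, _, end = part.partition("-")
            first = to_index(start)
            # Assumption: an open-ended range such as "2-" runs to the end.
            last = to_index(end) if end.strip() else page_count
            # Assumption: ranges include both endpoints.
            selected.update(range(first, last + 1))
        else:
            selected.add(to_index(part))
    # Drop anything outside the document.
    return sorted(p for p in selected if 1 <= p <= page_count)
```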

Rate Limiting

Rate limit headers (X-RateLimit-Limit and X-RateLimit-Remaining) are included in the response to indicate the user’s current rate limit status.
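
A client could read these headers along the following lines. This is a sketch under assumptions: the header values are taken to be integers, and the `should_pause` threshold is an arbitrary illustrative choice; neither is specified on this page.

```python
def rate_limit_status(headers):
    """Return (limit, remaining) as ints, or None for a missing/invalid header."""
    # HTTP header names are case-insensitive, so normalise before lookup.
    lowered = {name.lower(): value for name, value in headers.items()}

    def as_int(name):
        try:
            return int(lowered[name])
        except (KeyError, ValueError, TypeError):
            return None

    return as_int("x-ratelimit-limit"), as_int("x-ratelimit-remaining")


def should_pause(headers, threshold=1):
    """Suggest pausing when the remaining quota is below `threshold`."""
    _, remaining = rate_limit_status(headers)
    return remaining is not None and remaining < threshold
```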
