Endpoints
Extract Image
Extract Image API Documentation
Description
This endpoint extracts structured data from image files based on a user-defined prompt. It supports input via URL or base64-encoded image content and uses vision-capable Large Language Models (LLMs) to interpret and extract relevant information from the images.
Endpoint
POST /api/v1/extract-image
Headers
- Content-Type:
application/json
- Authorization: Bearer
<API_KEY>
(required) - Request-Source:
string
(optional)
Request Body
{
"inputMethod": "string", // Required. Either "url" or "base64".
"images": ["string"], // Required. Array of URLs or base64-encoded image contents.
"prompt": "string", // Required. The prompt describing the data to extract.
"jsonMode": boolean // Optional. Whether to return the result in JSON format. Default: false.
}
Responses
Success (200)
Returns the extracted data based on the provided prompt, along with additional information.
{
"results": "string", // Extracted data based on the prompt
"prompt": "string", // The original prompt used for extraction
"imageCount": number, // Number of images processed
"creditUsage": number // Total credits used for this request
}
- Content-Type: application/json
- X-RateLimit-Limit: The rate limit for the user.
- X-RateLimit-Remaining: The remaining number of requests for the user.
Bad Request (400)
Returned if the request is invalid or the total file size exceeds the limit.
{
"error": "Error message describing the issue"
}
Unauthorized (401)
Returned if the API key is invalid or missing.
{
"error": "Invalid or missing Authorization header"
}
Internal Server Error (500)
Returned if there’s an error during the image extraction process.
{
"error": "Failed to extract image data: [error details]"
}
Example Request
curl -X POST https://app.dumplingai.com/api/v1/extract-image \
-H "Content-Type: application/json" \
-H "Authorization: Bearer YOUR_API_KEY" \
-H "Request-Source: API" \
-d '{
"inputMethod": "url",
"images": ["https://example.com/sample-image.jpg"],
"prompt": "Describe the main elements in this image.",
"jsonMode": false
}'
Notes
- The maximum total file size for all images combined is 100MB.
- The maximum file size for a single image is 20MB.
- The maximum number of images that can be processed in a single request is 3000.
- The maximum output is 8,192 tokens.
- Credit usage:
- Base cost: 10 credits
- Additional 1 credit per image processed
- The total credit usage is returned in the response as
creditUsage
. - If using the URL method, ensure the image is publicly accessible.
- The
jsonMode
parameter determines whether the output is formatted as JSON (true) or plain text (false). - Supported image formats: PNG, JPEG, WebP, HEIC, HEIF. Other formats will be automatically converted to PNG.
- Temporary files are created during processing and are deleted after use.
- You can get a list of supported image formats by calling:
GET /api/v1/extract-image
Rate Limiting
Rate limit headers (X-RateLimit-Limit
and X-RateLimit-Remaining
) are included in the response to indicate the user’s current rate limit status.
Error Handling
- If the required parameters (
images
orprompt
) are missing, a 400 Bad Request error is returned. - If the total file size exceeds 100MB, a 400 Bad Request error is returned.
- If there’s an error during extraction, a 500 Internal Server Error is returned with details about the failure.
Security and Privacy
- Uploaded images are temporarily stored and then deleted after processing.
- The endpoint uses the Gemini 1.5 Flash model for image analysis and data extraction.