- Full page content: Get raw HTML for custom processing
- Readable text: Convert pages to clean markdown format
- Visual records: Capture screenshots for monitoring or archival
- Link extraction: Get all URLs from the page for further crawling
- Response headers: Inspect HTTP headers returned by the server
You can combine multiple formats in a single request. All specified formats
will be included in the response.
Parameters
formats
formats
Choose which types of content you want in your response. You can request multiple formats at once, and each will be included in the result.Available formats:
html- The full HTML source code of the page- Best for: Custom parsing, preserving exact page structure, accessing all DOM elements
markdown- Clean, readable markdown version of the page- Best for: Content analysis, LLM processing, human-readable output
screenshot- Full-page screenshot as base64-encoded PNG- Best for: Visual verification, monitoring, archival, debugging
- Note: Automatically enables rendering (VX8/VX10 driver)
headers- HTTP response headers returned by the server- Best for: Inspecting content type, caching policies, redirects, server metadata
- Returns: Key-value object under
data.headers
links- All URLs found on the page- Best for: Discovering pages to crawl, mapping site structure, finding external references
- Returns: Array of URL strings under
data.links
Usage
HTML format
Request one format type - html (default). Best for:- Custom HTML parsing
- Preserving exact page structure
- Accessing all DOM elements and attributes
- Archival purposes
Markdown format
Convert the page to clean, readable markdown. Best for:- Clean text extraction
- Content analysis
- LLM processing
- Human-readable output:
Screenshot format
Capture a full-page screenshot as a base64-encoded PNG image. Best for:- Visual verification and monitoring
- Page archival and documentation
- Comparing page changes over time
Headers format
Return the HTTP response headers from the server. Best for:- Inspecting
Content-Type,Cache-Control, and redirect behavior - Debugging server responses and middleware
- Verifying geo-targeted responses return the expected locale or region
Links format
Extract all URLs found in the page HTML. Best for:- Discovering new pages to crawl or extract
- Mapping internal site structure
- Finding outbound links and external references
Multiple formats
Combine multiple formats to get different data representations:When combining screenshot with other formats, rendering is automatically
enabled for all formats in the request.
Example response
When formats are specified, all requested data is included in the response. The response includes:- data.html: Raw HTML if requested
- data.markdown: Converted markdown if requested
- data.screenshot: Base64-encoded PNG if requested
- data.links: Array of extracted URLs if requested
- data.headers: HTTP response headers key-value object if requested
- data.parsing: Structured data if parsing was used
- metadata: Execution details and formats included:
Best practices
Format selection
Choose formats based on your needs:- Use
htmlwhen you need full DOM access - Use
markdownfor clean text and content analysis - Use
screenshotfor visual verification - Use
linksfor discovering URLs to crawl - Use
headersfor inspecting server metadata, content type, or caching behavior
Performance considerations
- Each format adds processing time
- Screenshots require rendering and are slower
- HTML and markdown are faster to generate
- Request only needed formats for optimal performance