Skip to main content

Overview

Extract retrieves and parses content from any URL, giving you clean structured data instead of raw HTML. Point it at a webpage, specify what you want, and get back exactly the information you need. Think of it as a smart web reader that understands page structure and extracts precisely what you ask for.
  • For most users, we recommend starting with Web Search Agents - pre-built extractors maintained by Nimble for popular sites.
  • Need a site that’s not in the gallery?Create a custom agent in minutes using natural language - no coding required.

How it works

1

You provide a URL and extraction rules

Give Extract the webpage URL and define what data you want using CSS selectors or parsing schemas
2

Extract fetches and renders the page

  • Loads the webpage with full JavaScript rendering
  • Handles cookies, headers, and authentication
  • Bypasses anti-bot protections with stealth mode
  • Renders dynamic content completely
3

Extracts your specified data

  • Follows your exact selectors for precise extraction
  • Parses content into clean, structured format
  • Returns only the data you requested
4

Returns structured results

Get clean JSON with your extracted data, ready to use in your application

When to use Extract

Specific Data Points

Extract exact fields like prices, titles, descriptions from product pages or listings

Dynamic Content

Handle JavaScript-heavy sites that require full page rendering

Protected Sites

Bypass anti-bot protections with stealth mode and residential proxies

Custom Scraping

Build tailored extraction logic for any website structure

Common use cases

E-commerce data extraction Scrape product details, prices, reviews, and availability from online stores. Real estate listings Extract property details, pricing, descriptions, and agent contact information. Job board scraping Collect job titles, descriptions, requirements, and application details. News and article extraction Pull headlines, authors, publish dates, and full article content.

Extract vs. other tools

What you needUse this
Data from popular sites (Amazon, Google, etc.)Public Agent - maintained by Nimble, zero setup
Data from sites not in the gallery (non-expert)Custom Agent - create with natural language
Data from specific URLs (expert users)Extract - CSS selectors & advanced features
Data from entire websiteCrawl
Search web + extract content from resultsSearch
URLs with context for AI planningMap

Why use Extract

  • Flexible - Full control on page interactions and parsing schemas
  • Robust - Handles JavaScript rendering and anti-bot protection
  • Precise - Extract exactly what you need, nothing more
  • Reliable - Built-in retry logic and error handling
  • Cost-efficient - Optimized for high-volume production workloads

Example

Input: Product page URL
{
  	"url": "https://www.amazon.com/s?k=wireless+headphones",
  	"render": false,
  	"country":"US"
}
Output: Full HTML page content
{
    "url": "https://www.amazon.com/s?k=wireless+headphones",
    "task_id": "b1fa7943-cba5-4ec2-a88c-4d2d6799c794",
    "status": "success",
	"data": {
		"html": "...",
		"headers": {}
	},
    "metadata": {
        "query_time": "2026-02-08T22:00:36.132Z",
        "query_duration": 1877,
        "response_parameters": {
            "input_url": "https://www.amazon.com/s?k=wireless+headphones"
        },
        "driver": "vx6"
    },
    "status_code": 200
}

Key features

Full JavaScript rendering Execute JavaScript and wait for dynamic content to load completely. Stealth mode Bypass anti-bot protections with advanced fingerprint randomization. Browser actions Automate interactions like clicking, scrolling, and form filling. Geo-targeting Access content as it appears in different countries with residential proxies.

Next steps

Extract Usage

See all parameters, code examples, and advanced features