Create Crawl

curl --request POST \
  --url https://sdk.nimbleway.com/v1/crawl \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '
{
  "url": "https://example.com",
  "name": "my-crawler-1",
  "sitemap": "include",
  "limit": 100,
  "max_discovery_depth": 3
}
'

{
  "crawl_id": "85b21e98-69c5-4aa5-9d25-261a55bbf0db",
  "name": "my-crawler-1",
  "url": "https://www.amazon.com/dp",
  "status": "queued",
  "account_name": "ACCOUNT_NAME",
  "created_at": "2026-02-11T13:55:45.512Z",
  "updated_at": "2026-02-11T13:55:45.512Z",
  "completed_at": null,
  "crawl_options": {
    "sitemap": "include",
    "crawl_entire_domain": false,
    "limit": 10,
    "max_discovery_depth": 5,
    "ignore_query_parameters": false,
    "allow_external_links": false,
    "allow_subdomains": false
  },
  "extract_options": null
}

POST

crawl

Create Crawl

curl --request POST \
  --url https://sdk.nimbleway.com/v1/crawl \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '
{
  "url": "https://example.com",
  "name": "my-crawler-1",
  "sitemap": "include",
  "limit": 100,
  "max_discovery_depth": 3
}
'

{
  "crawl_id": "85b21e98-69c5-4aa5-9d25-261a55bbf0db",
  "name": "my-crawler-1",
  "url": "https://www.amazon.com/dp",
  "status": "queued",
  "account_name": "ACCOUNT_NAME",
  "created_at": "2026-02-11T13:55:45.512Z",
  "updated_at": "2026-02-11T13:55:45.512Z",
  "completed_at": null,
  "crawl_options": {
    "sitemap": "include",
    "crawl_entire_domain": false,
    "limit": 10,
    "max_discovery_depth": 5,
    "ignore_query_parameters": false,
    "allow_external_links": false,
    "allow_subdomains": false
  },
  "extract_options": null
}

Authorizations

Authorization

string

header

required

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.

Body

application/json

url

string<uri>

required

Url to crawl

Example:

"https://example.com"

name

string

Name of the crawl

Example:

"my-crawler-1"

sitemap

enum<string>

Sitemap and other methods will be used together to find URLs

Available options:

skip,

include,

only

Example:

"include"

crawl_entire_domain

boolean

Allows the crawler to follow internal links to sibling or parent URLs, not just child paths

Example:

false

limit

integer

Maximum number of pages to crawl

Required range: 1 <= x <= 10000

Example:

100

max_discovery_depth

integer

Maximum depth to crawl based on discovery order

Required range: 1 <= x <= 20

Example:

3

exclude_paths

string[]

URL pathname regex patterns that exclude matching URLs from the crawl

Example:

["/exclude-this-path", "/and-this-path"]

include_paths

string[]

URL pathname regex patterns that include matching URLs in the crawl

Example:

["/include-this-path", "/and-this-path"]

ignore_query_parameters

boolean

Do not re-scrape the same path with different (or none) query parameters

Example:

false

allow_external_links

boolean

Allows the crawler to follow links to external websites

Example:

false

allow_subdomains

boolean

Allows the crawler to follow links to subdomains of the main domain

Example:

false

callback

Webhook configuration for receiving crawl results

Show child attributes

extract_options

object

Request body model for the /extract endpoint

Show child attributes

Response

Successful Response - Crawl Task Created

url

string<uri>

required

Starting URL for the crawl

crawl_options

object

required

Crawl configuration settings applied to this task

Show child attributes

crawl_id

string<uuid>

required

Unique identifier for the crawl task

status

enum<string>

required

Current status of the crawl task

Available options:

queued,

running,

completed,

failed

created_at

string<date-time>

required

Timestamp when the crawl task was created

name

string | null

Optional name for the crawl task

account_name

string | null

The user's account name

extract_options

object

Optional extraction configuration for each crawled page

completed_at

string<date-time> | null

Timestamp when the crawl was completed (null if not completed)

updated_at

string<date-time> | null

Timestamp when the crawl was updateded

Map

List Crawls

⌘I

Agent API

Extract API

Search API

Map API

Crawl API

Tasks API

Create Crawl

Authorizations

Body

Response