Overview
The langchain-nimble package provides production-grade LangChain integrations for the Nimble web data platform. Built on the official nimble_python SDK, it enables developers to build RAG applications and AI agents that can search, extract, crawl, map, and retrieve structured data from anywhere on the web.
The package includes a unified toolkit, five tools, and two retrievers:
NimbleToolkit: a single entry point to configure and enable all tools.
Tools (for AI Agents):
- NimbleSearchTool: Web search with three depth levels (lite, fast, deep) and optional AI-generated answers
- NimbleExtractTool: Single-URL content extraction that returns clean markdown
- NimbleCrawlTool: Multi-page crawl jobs that discover and extract content across a domain (up to 10,000 pages)
- NimbleMapTool: URL discovery across a website by combining sitemap parsing and link crawling (up to 100,000 URLs)
- NimbleAgentTools: Pre-built web agents that extract structured data from specific page types (products, jobs, reviews)
Retrievers:
- NimbleSearchRetriever: Web search with depth levels, LLM answers, and filtering
- NimbleExtractRetriever: Direct URL content extraction
Key Features
- Three search depth levels: lite for quick metadata, fast for balanced results, deep for comprehensive extraction
- Unified toolkit: Configure all tools with a single API key and opt in to crawl, map, and agent capabilities
- Smart filtering: Domain and date filtering, topic-based routing
- Multiple parsing formats: Plain text, markdown (default), or simplified HTML
- Full async support: Both sync and async operations via the nimble_python SDK
- Graceful error handling: API errors are returned as messages to the agent instead of crashing the workflow
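Returning errors as messages rather than raising them is a general pattern for agent tools. The sketch below illustrates the idea only; `safe_tool_call` and `flaky_search` are hypothetical names made up for this example and are not part of langchain-nimble, whose internal implementation may differ.

```python
from typing import Callable


def safe_tool_call(fn: Callable[..., str], *args, **kwargs) -> str:
    """Run a tool function; on failure, return the error as text for the agent."""
    try:
        return fn(*args, **kwargs)
    except Exception as exc:  # broad on purpose: the agent should see any failure
        return f"Error: {type(exc).__name__}: {exc}"


def flaky_search(query: str) -> str:
    # Hypothetical stand-in for a tool that calls a remote API.
    if not query.strip():
        raise ValueError("query must not be empty")
    return f"results for {query!r}"


print(safe_tool_call(flaky_search, "langchain"))  # results for 'langchain'
print(safe_tool_call(flaky_search, ""))  # Error: ValueError: query must not be empty
```

Because the failure comes back as an ordinary string, the model can read it and decide to retry with different input instead of the whole run aborting.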
Quick Start
Installation
Setup
Get your API credentials from Nimble’s dashboard (free trial available) and set them as an environment variable.
Build an AI Agent with the Nimble Toolkit
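A minimal sketch of wiring the credentials into the toolkit might look like the following. Only the `NimbleToolkit` class name comes from this page. The `langchain_nimble` module name, the `NIMBLE_API_KEY` variable name, and the `api_key` keyword are assumptions, and `get_tools()` is assumed from the usual LangChain toolkit convention, so check the package reference before relying on any of them.

```python
import os


def require_api_key(env_var: str = "NIMBLE_API_KEY") -> str:
    """Return the API key from the environment or fail with a clear message.

    The default variable name is an assumption, not confirmed by this page.
    """
    key = os.environ.get(env_var, "").strip()
    if not key:
        raise RuntimeError(f"Set {env_var} before creating the toolkit")
    return key


def build_tools():
    """Create the toolkit and return its tools (assumed API, see the note above)."""
    # Imported lazily so this module loads even when the package is not installed.
    from langchain_nimble import NimbleToolkit  # module name is assumed

    toolkit = NimbleToolkit(api_key=require_api_key())  # `api_key` kwarg is assumed
    return toolkit.get_tools()  # assumed from the LangChain toolkit convention
```

The list returned by `build_tools()` would then be passed to whichever agent constructor the application already uses.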
Tools
Crawl
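The crawl tool in this section polls the job on the agent's behalf. To make that concrete, here is a minimal sketch of a generic submit-then-poll loop. The `wait_for_job` helper, the status dictionary shape, and the `fake_status` job are all hypothetical and do not reflect the nimble_python API.

```python
import time
from typing import Callable


def wait_for_job(get_status: Callable[[], dict], interval: float = 1.0,
                 timeout: float = 60.0, sleep=time.sleep) -> dict:
    """Poll `get_status` until the job completes, fails, or the timeout passes."""
    deadline = time.monotonic() + timeout
    while True:
        status = get_status()
        if status["state"] == "completed":
            return status
        if status["state"] == "failed":
            raise RuntimeError(f"crawl job failed: {status.get('error', 'unknown')}")
        if time.monotonic() >= deadline:
            raise TimeoutError("crawl job did not finish in time")
        sleep(interval)


# Hypothetical fake job: reports "running" twice, then "completed".
_states = iter(["running", "running", "completed"])


def fake_status() -> dict:
    return {"state": next(_states), "pages": ["https://example.com/"]}


result = wait_for_job(fake_status, interval=0.0)
print(result["state"])  # completed
```

Hiding this loop inside the tool means the agent makes one call and receives the finished result, rather than spending model turns on checking job status.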
Submit multi-page crawl jobs that automatically discover and extract content across an entire domain. The tool handles polling internally, so the agent just gets results back.
Map
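URL discovery of the kind the map tool performs merges two sources: entries listed in a sitemap and links found while crawling. The sketch below shows one simple way to combine them using only the standard library. The helper names and the sample sitemap are made up for illustration, and the service's own logic is not documented here.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def urls_from_sitemap(xml_text: str) -> list[str]:
    """Return the <loc> entries of a sitemap document, in document order."""
    root = ET.fromstring(xml_text)
    return [loc.text.strip() for loc in root.findall(".//sm:loc", SITEMAP_NS) if loc.text]


def merge_urls(sitemap_urls: list[str], crawled_links: list[str], limit: int) -> list[str]:
    """Combine both sources, drop duplicates (first occurrence wins), cap at `limit`."""
    merged = list(dict.fromkeys(sitemap_urls + crawled_links))
    return merged[:limit]


SITEMAP = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://example.com/</loc></url>
  <url><loc>https://example.com/docs</loc></url>
</urlset>"""

found = urls_from_sitemap(SITEMAP)
crawled = ["https://example.com/docs", "https://example.com/blog"]
print(merge_urls(found, crawled, limit=10))
```

Sitemaps often miss pages and crawls often miss unlinked ones, which is why combining the two tends to give a fuller picture of a site than either source alone.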
Discover URLs on a website by combining sitemap parsing and link crawling. Useful for understanding a site’s structure before targeted extraction.
Agent Tools
Access pre-built web agents that extract structured data from specific page types (product pages, job listings, reviews). Three tools (List → Get → Run) let the LLM agent browse available extractors, inspect their schemas, and run them.
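The List → Get → Run flow can be pictured with a small in-memory stand-in. Everything in this sketch is hypothetical (the registry, the agent names, the schema shape, and the function names), and it only mirrors the three steps described above rather than the real tool interfaces.

```python
from typing import Any

# Hypothetical registry standing in for the remote catalogue of pre-built agents.
AGENTS: dict[str, dict[str, Any]] = {
    "product_page": {
        "description": "Extract structured data from a product page",
        "required": ["url"],
        "run": lambda params: {"url": params["url"], "type": "product"},
    },
    "job_listing": {
        "description": "Extract structured data from a job listing",
        "required": ["url"],
        "run": lambda params: {"url": params["url"], "type": "job"},
    },
}


def list_agents() -> list[str]:
    """Step 1 (List): names of the available extractors."""
    return sorted(AGENTS)


def get_agent(name: str) -> dict[str, Any]:
    """Step 2 (Get): the description and input schema of one extractor."""
    agent = AGENTS[name]
    return {"description": agent["description"], "required": agent["required"]}


def run_agent(name: str, params: dict[str, Any]) -> dict[str, Any]:
    """Step 3 (Run): validate inputs against the schema, then execute."""
    missing = [key for key in get_agent(name)["required"] if key not in params]
    if missing:
        raise ValueError(f"missing parameters: {missing}")
    return AGENTS[name]["run"](params)


print(list_agents())
print(get_agent("product_page"))
print(run_agent("product_page", {"url": "https://example.com/p/1"}))
```

Splitting the flow into three calls lets the model discover what exists and what inputs are needed before it commits to running an extractor.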