
Commit 88ba53a

VinciGit00 and claude committed
docs: align CLI and MCP pages with v2 releases
CLI (just-scrape#13): - scrape: document 8 formats, multi-format via comma-separated -f, and new --html-mode / --scrolls / --prompt / --schema flags - search: document --location-geo-code, --time-range, --format - crawl: document -f / --format - Add the Fetch Modes enum table (auto|fast|js|direct+stealth|js+stealth) that replaces the legacy --stealth boolean MCP server (scrapegraph-mcp#16): - Replace the stale v1 tool list with the v2 surface: markdownify, smartscraper, searchscraper, scrape (formats[]), smartcrawler_* (markdown default), crawl_stop/resume, monitor_* lifecycle, credits, sgai_history - Note removal of sitemap and agentic_scrapper - Document SCRAPEGRAPH_API_BASE_URL override and v2 auth headers Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
1 parent e9876d6 commit 88ba53a

2 files changed

Lines changed: 124 additions & 36 deletions


services/cli.mdx

Lines changed: 36 additions & 3 deletions
````diff
@@ -96,22 +96,42 @@ just-scrape search <query>
 just-scrape search <query> --num-results <n>
 just-scrape search <query> -p <prompt>
 just-scrape search <query> --schema <json>
+just-scrape search <query> --format markdown # or html
+just-scrape search <query> --location-geo-code <iso> # e.g. us, it, gb
+just-scrape search <query> --time-range past_week # past_hour|past_24_hours|past_week|past_month|past_year
 just-scrape search <query> --headers <json>
 ```
 
 ### Scrape
 
-Scrape content from a URL in various formats: markdown (default), html, screenshot, or branding. [Full docs →](/api-reference/scrape)
+Scrape a URL into one or more of 8 output formats. Multi-format is supported via comma-separated `-f`. [Full docs →](/api-reference/scrape)
 
 ```bash
-just-scrape scrape <url>
+just-scrape scrape <url> # markdown (default)
 just-scrape scrape <url> -f html
 just-scrape scrape <url> -f screenshot
-just-scrape scrape <url> -f branding
+just-scrape scrape <url> -f markdown,links,images # multi-format
+just-scrape scrape <url> -f json -p "Extract the title" # json format requires --prompt
+just-scrape scrape <url> -f json -p <prompt> --schema <json>
+just-scrape scrape <url> --html-mode reader # normal | reader | prune
+just-scrape scrape <url> --scrolls 3
 just-scrape scrape <url> -m direct+stealth
 just-scrape scrape <url> --country <iso>
 ```
 
+#### Formats
+
+| Format | Description |
+|---|---|
+| `markdown` | Clean markdown conversion (default). Respects `--html-mode`. |
+| `html` | Raw / processed HTML. Respects `--html-mode`. |
+| `screenshot` | Page screenshot (PNG). |
+| `branding` | Extracted brand assets (logos, colors, fonts). |
+| `links` | All links on the page. |
+| `images` | All images on the page. |
+| `summary` | AI-generated page summary. |
+| `json` | Structured JSON via `--prompt` (+ optional `--schema`). |
+
 ### Markdownify
 
 Convert any webpage to clean markdown (convenience wrapper for `scrape --format markdown`). [Full docs →](/api-reference/scrape)
@@ -132,9 +152,22 @@ just-scrape crawl <url> --max-pages <n>
 just-scrape crawl <url> --max-depth <n>
 just-scrape crawl <url> --max-links-per-page <n>
 just-scrape crawl <url> --allow-external
+just-scrape crawl <url> -f markdown # or html, json, etc.
 just-scrape crawl <url> -m direct+stealth
 ```
 
+### Fetch Modes
+
+Use `-m / --mode` on `extract`, `search`, `scrape`, `markdownify`, and `crawl` to choose how pages are fetched. The legacy `--stealth` boolean is replaced by the mode enum below.
+
+| Mode | Description |
+|---|---|
+| `auto` | Automatic selection (default) |
+| `fast` | Fastest, no JS rendering |
+| `js` | Full JS rendering |
+| `direct+stealth` | Direct fetch with anti-bot bypass |
+| `js+stealth` | JS rendering with anti-bot bypass |
+
 ### History
 
 Browse request history for any service.
````
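The multi-format `-f` flag documented above takes a comma-separated list drawn from the 8 formats in the table. A minimal sketch of how such a value might be validated, assuming only the format names from the docs; the `parse_formats` helper is illustrative, not just-scrape's actual code:

```python
# The 8 output formats listed in the CLI's Formats table.
VALID_FORMATS = {
    "markdown", "html", "screenshot", "branding",
    "links", "images", "summary", "json",
}

def parse_formats(flag_value: str) -> list[str]:
    """Split a comma-separated -f value and reject unknown formats."""
    formats = [f.strip() for f in flag_value.split(",") if f.strip()]
    unknown = [f for f in formats if f not in VALID_FORMATS]
    if unknown:
        raise ValueError(f"unknown format(s): {', '.join(unknown)}")
    return formats

print(parse_formats("markdown,links,images"))  # → ['markdown', 'links', 'images']
```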

services/mcp-server.mdx

Lines changed: 88 additions & 33 deletions
````diff
@@ -22,11 +22,16 @@ A production‑ready Model Context Protocol (MCP) server that connects LLMs to t
 
 ## Key Features
 
-- 8 tools covering markdown conversion, AI extraction, search, crawling, sitemap, and agentic flows
+- Full v2 API coverage: extract, search, scrape, crawl (+ stop/resume), monitor lifecycle, credits, and history
+- Uses the v2 API base URL (`https://api.scrapegraphai.com/api/v2`) with `Authorization: Bearer` + `SGAI-APIKEY` headers
 - Remote HTTP MCP endpoint and local Python server support
 - Works with Cursor, Claude Desktop, and any MCP‑compatible client
 - Robust error handling, timeouts, and production‑tested reliability
 
+<Warning>
+The MCP server is now on **v2** (`scrapegraph-mcp@2.0.0`). The v1 tools `sitemap`, `agentic_scrapper`, `markdownify_status`, and `smartscraper_status` have been removed. See [scrapegraph-mcp#16](https://github.com/ScrapeGraphAI/scrapegraph-mcp/pull/16) for the migration details.
+</Warning>
+
 ## Get Your API Key
 
 Create an account and copy your API key from the [ScrapeGraph Dashboard](https://scrapegraphai.com/dashboard).
````
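The v2 auth scheme noted in the Key Features (a Bearer token plus an `SGAI-APIKEY` header) and the `SCRAPEGRAPH_API_BASE_URL` override can be sketched as follows; `build_session_config` is an illustrative helper, not part of the server's API:

```python
import os

# Default v2 base URL from the docs; SCRAPEGRAPH_API_BASE_URL overrides it.
DEFAULT_BASE_URL = "https://api.scrapegraphai.com/api/v2"

def build_session_config(api_key: str) -> dict:
    """Resolve the API base URL (env override wins) and the v2 auth headers."""
    base_url = os.environ.get("SCRAPEGRAPH_API_BASE_URL", DEFAULT_BASE_URL)
    return {
        "base_url": base_url,
        "headers": {
            # v2 carries the key in both headers, per the feature list above
            "Authorization": f"Bearer {api_key}",
            "SGAI-APIKEY": api_key,
        },
    }
```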
````diff
@@ -164,19 +169,30 @@ python -m scrapegraph_mcp.server
 
 ---
 
+## Configuration
+
+The server reads the ScrapeGraph API key from `SGAI_API_KEY` (local) or the `X-API-Key` header (remote). Optional environment overrides:
+
+| Variable | Description | Default |
+|---|---|---|
+| `SGAI_API_KEY` | ScrapeGraph API key ||
+| `SCRAPEGRAPH_API_BASE_URL` | Override the v2 API base URL | `https://api.scrapegraphai.com/api/v2` |
+
 ## Available Tools
 
-The server exposes 8 enterprise‑ready tools:
+The server exposes the full v2 API surface.
 
-### 1. markdownify
+### Content tools
+
+#### markdownify
 Convert a webpage to clean markdown.
 
 ```python
 markdownify(website_url: str)
 ```
 
-### 2. smartscraper
-AI‑powered extraction with optional infinite scrolls.
+#### smartscraper
+AI‑powered extraction (v2 `/extract`) with optional infinite scrolls.
 
 ```python
 smartscraper(
@@ -186,8 +202,8 @@ smartscraper(
 )
 ```
 
-### 3. searchscraper
-Search the web and extract structured results.
+#### searchscraper
+Search the web and extract structured results (v2 `/search`).
 
 ```python
 searchscraper(
@@ -197,53 +213,92 @@ searchscraper(
 )
 ```
 
-### 4. scrape
-Fetch raw HTML from a URL.
+#### scrape
+Fetch a URL using the v2 `/scrape` endpoint with configurable formats.
 
 ```python
-scrape(website_url: str)
+scrape(
+    website_url: str,
+    formats: list | None = None,  # e.g. [{"type": "markdown", "mode": "normal"}]
+    content_type: str | None = None,
+)
 ```
 
-### 5. sitemap
-Discover a site’s URLs and structure.
+### Crawl tools
 
-```python
-sitemap(website_url: str)
-```
-
-### 6. smartcrawler_initiate
-Start an async multi‑page crawl (AI or markdown mode).
+#### smartcrawler_initiate
+Start a multi‑page crawl. `extraction_mode` defaults to `markdown` in v2 (also supports `html`).
 
 ```python
 smartcrawler_initiate(
     url: str,
-    prompt: str | None = None,
-    extraction_mode: str = "ai",
-    depth: int | None = None,
+    extraction_mode: str = "markdown",  # markdown | html
+    max_depth: int | None = None,
     max_pages: int | None = None,
-    same_domain_only: bool | None = None
+    max_links_per_page: int | None = None,
+    allow_external: bool | None = None,
+    include_patterns: list[str] | None = None,
+    exclude_patterns: list[str] | None = None,
 )
 ```
 
-### 7. smartcrawler_fetch_results
-Poll results using the returned request_id.
+#### smartcrawler_fetch_results
+Poll status / results for a crawl.
 
 ```python
 smartcrawler_fetch_results(request_id: str)
 ```
 
-### 8. agentic_scrapper
-Agentic, multi‑step workflows with optional schema and session persistence.
+#### crawl_stop
+Stop a running crawl.
 
 ```python
-agentic_scrapper(
+crawl_stop(request_id: str)
+```
+
+#### crawl_resume
+Resume a paused / stopped crawl.
+
+```python
+crawl_resume(request_id: str)
+```
+
+### Monitor tools
+
+Replace v1 "scheduled jobs". All monitor operations sit under a single namespace.
+
+```python
+monitor_create(
     url: str,
-    user_prompt: str | None = None,
-    output_schema: dict | None = None,
-    steps: list | None = None,
-    ai_extraction: bool | None = None,
-    persistent_session: bool | None = None,
-    timeout_seconds: float | None = None
+    interval: str,  # 5-field cron expression
+    name: str | None = None,
+    formats: list | None = None,
+    webhook_url: str | None = None,
+)
+monitor_list()
+monitor_get(monitor_id: str)
+monitor_pause(monitor_id: str)
+monitor_resume(monitor_id: str)
+monitor_delete(monitor_id: str)
+```
+
+### Account tools
+
+#### credits
+Get the remaining credit balance.
+
+```python
+credits()
+```
+
+#### sgai_history
+Browse paginated request history, optionally filtered by service.
+
+```python
+sgai_history(
+    service: str | None = None,  # scrape | extract | search | monitor | crawl
+    page: int | None = None,
+    limit: int | None = None,
 )
 ```
````
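`monitor_create` takes its `interval` as a 5-field cron expression (minute, hour, day-of-month, month, day-of-week). A quick field-count sanity check before submitting could look like this; the helper is illustrative only, not part of the MCP server:

```python
def is_five_field_cron(expr: str) -> bool:
    """True if expr has exactly the 5 whitespace-separated cron fields."""
    return len(expr.split()) == 5

# e.g. run every 6 hours:
print(is_five_field_cron("0 */6 * * *"))    # → True
# a 6-field (seconds-resolution) expression would be rejected:
print(is_five_field_cron("0 0 */6 * * *"))  # → False
```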
