
Commit 69b4202

feat: add knowledge base

1 parent 295c6c0 · commit 69b4202

11 files changed: 944 additions & 1 deletion

docs.json (20 additions & 1 deletion)

```diff
@@ -50,7 +50,17 @@
       "services/smartcrawler",
       "services/sitemap",
       "services/agenticscraper",
-      "services/cli",
+      {
+        "group": "CLI",
+        "icon": "terminal",
+        "pages": [
+          "services/cli/introduction",
+          "services/cli/commands",
+          "services/cli/json-mode",
+          "services/cli/ai-agent-skill",
+          "services/cli/examples"
+        ]
+      },
       {
         "group": "MCP Server",
         "icon": "/logo/mcp.svg",
@@ -125,6 +135,15 @@
         "knowledge-base/introduction"
       ]
     },
+    {
+      "group": "CLI",
+      "pages": [
+        "knowledge-base/cli/getting-started",
+        "knowledge-base/cli/json-mode",
+        "knowledge-base/cli/ai-agent-skill",
+        "knowledge-base/cli/command-examples"
+      ]
+    },
     {
       "group": "AI Tools",
       "pages": [
```

knowledge-base/cli/ai-agent-skill.mdx (66 additions & 0 deletions)

---
title: Using just-scrape as a coding agent skill
description: 'Give AI coding agents access to web scraping through the just-scrape skill'
---

`just-scrape` can be installed as a **skill** for AI coding agents via [Vercel's skills.sh](https://skills.sh). This lets agents like Claude, Cursor, and others call ScrapeGraphAI commands directly during a coding session.

## Install the skill

```bash
bunx skills add https://github.com/ScrapeGraphAI/just-scrape
```

Browse the skill page: [skills.sh/scrapegraphai/just-scrape/just-scrape](https://skills.sh/scrapegraphai/just-scrape/just-scrape)

## What this enables

Once installed, your coding agent can:

- Scrape a website to gather data needed for a task
- Convert documentation pages to markdown for context
- Search the web and extract structured results
- Check your credit balance mid-session
- Browse request history
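
For the last two, here is a minimal sketch (assuming `credits` accepts the same `--json` flag the other commands do):

```bash
# Check remaining credits (assumes credits supports --json like other commands)
just-scrape credits --json

# List recent Smart Scraper requests
just-scrape history smartscraper --json
```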

## How agents use it

Agents invoke the skill in `--json` mode so output is clean and token-efficient:

```bash
just-scrape smart-scraper https://api.example.com/docs \
  -p "Extract all endpoint names, methods, and descriptions" \
  --json
```

```bash
just-scrape search-scraper "latest release notes for react-query" \
  --num-results 3 --json
```

## Manual setup with Cursor

If you are using Cursor without the skills.sh integration, configure `just-scrape` via the [MCP Server](/services/mcp-server/cursor) for the best experience.

Alternatively, add a script to your project that Cursor can call:

```bash
#!/bin/bash
# .cursor/scrape.sh
just-scrape smart-scraper "$1" -p "$2" --json
```

Then tell Cursor: *"Run `.cursor/scrape.sh <url> <prompt>` to scrape a page."*

## Tips

- Set `SGAI_API_KEY` in your shell profile so the skill picks it up automatically across all agent sessions.
- Use `--json` every time; agents don't need spinners or banners.
- Pass `--schema` with a JSON schema to get typed, predictable output that agents can parse reliably:

```bash
just-scrape smart-scraper https://example.com \
  -p "Extract company info" \
  --schema '{"type":"object","properties":{"name":{"type":"string"},"founded":{"type":"number"},"employees":{"type":"string"}}}' \
  --json
```

knowledge-base/cli/command-examples.mdx (168 additions & 0 deletions)

---
title: CLI command examples
description: 'Practical examples for every just-scrape command'
---

## Smart Scraper

Extract structured data from any URL using AI.

```bash
# Basic extraction
just-scrape smart-scraper https://news.ycombinator.com \
  -p "Extract the top 10 story titles and their URLs"

# Enforce a strict output schema
just-scrape smart-scraper https://news.example.com \
  -p "Get all article headlines and dates" \
  --schema '{"type":"object","properties":{"articles":{"type":"array","items":{"type":"object","properties":{"title":{"type":"string"},"date":{"type":"string"}}}}}}'

# Scroll to load more content, then extract
just-scrape smart-scraper https://store.example.com/shoes \
  -p "Extract all product names, prices, and ratings" \
  --scrolls 5

# Bypass anti-bot protection (costs +4 credits)
just-scrape smart-scraper https://app.example.com/dashboard \
  -p "Extract user stats" \
  --stealth

# Pass cookies and custom headers
just-scrape smart-scraper https://example.com/protected \
  -p "Extract the protected content" \
  --cookies '{"session": "abc123"}' \
  --headers '{"X-Custom-Header": "value"}'
```

## Search Scraper

Search the web and extract structured data from results.

```bash
# Research across multiple sources
just-scrape search-scraper "What are the best Python web frameworks in 2025?" \
  --num-results 10

# Get raw markdown only (2 credits instead of 10)
just-scrape search-scraper "React vs Vue comparison" \
  --no-extraction --num-results 5

# Structured output with schema
just-scrape search-scraper "Top 5 cloud providers pricing" \
  --schema '{"type":"object","properties":{"providers":{"type":"array","items":{"type":"object","properties":{"name":{"type":"string"},"free_tier":{"type":"string"}}}}}}'
```

## Markdownify

Convert any webpage to clean markdown.

```bash
# Convert a blog post
just-scrape markdownify https://blog.example.com/my-article

# Save to file
just-scrape markdownify https://docs.example.com/api \
  --json | jq -r '.result' > api-docs.md

# Bypass Cloudflare or anti-bot protections
just-scrape markdownify https://protected.example.com --stealth
```

## Crawl

Crawl multiple pages and extract data from each.

```bash
# Crawl a docs site and collect code examples
just-scrape crawl https://docs.example.com \
  -p "Extract all code snippets with their language" \
  --max-pages 20 --depth 3

# Crawl only blog pages
just-scrape crawl https://example.com \
  -p "Extract article titles and summaries" \
  --rules '{"include_paths":["/blog/*"],"same_domain":true}' \
  --max-pages 50

# Get raw markdown from all pages (no AI extraction, cheaper)
just-scrape crawl https://example.com \
  --no-extraction --max-pages 10
```

## Scrape

Get raw HTML from a URL.

```bash
# Basic HTML fetch
just-scrape scrape https://example.com

# Geo-targeted request with anti-bot bypass
just-scrape scrape https://store.example.com \
  --stealth --country-code DE

# Extract branding info (logos, colors, fonts)
just-scrape scrape https://example.com --branding
```

## Sitemap

Get all URLs from a website's sitemap.

```bash
# List all pages
just-scrape sitemap https://example.com

# Pipe URLs to another command
just-scrape sitemap https://example.com --json | jq -r '.urls[]'
```
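
The URL list composes with the other commands. A sketch that converts the first five sitemap entries to local markdown files, reusing the `.urls` and `.result` fields from the `--json` examples above:

```bash
# Convert the first 5 sitemap URLs to markdown files
just-scrape sitemap https://example.com --json \
  | jq -r '.urls[:5][]' \
  | while read -r url; do
      just-scrape markdownify "$url" --json \
        | jq -r '.result' > "$(basename "$url").md"
    done
```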

## Agentic Scraper

Browser automation with AI: log in, click, navigate, and fill forms.

```bash
# Log in and extract dashboard data
just-scrape agentic-scraper https://app.example.com/login \
  -s "Fill email with user@test.com,Fill password with secret,Click Sign In" \
  --ai-extraction -p "Extract all dashboard metrics"

# Navigate a multi-step form
just-scrape agentic-scraper https://example.com/wizard \
  -s "Click Next,Select Premium plan,Fill name with John,Click Submit"

# Persist browser session across runs
just-scrape agentic-scraper https://app.example.com \
  -s "Click Settings" --use-session
```

## Generate Schema

Generate a JSON schema from a natural-language description.

```bash
# Generate a schema
just-scrape generate-schema "E-commerce product with name, price, ratings, and reviews array"

# Refine an existing schema
just-scrape generate-schema "Add an availability field" \
  --existing-schema '{"type":"object","properties":{"name":{"type":"string"},"price":{"type":"number"}}}'
```

## History

Browse request history interactively or export it.

```bash
# Interactive history browser (arrow keys to navigate)
just-scrape history smartscraper

# Fetch a specific request by ID
just-scrape history smartscraper abc123-def456-7890

# Export last 100 crawl jobs as JSON
just-scrape history crawl --json --page-size 100 \
  | jq '.requests[] | {id: .request_id, status}'
```

Supported services for `history`: `markdownify`, `smartscraper`, `searchscraper`, `scrape`, `crawl`, `agentic-scraper`, `sitemap`

knowledge-base/cli/getting-started.mdx (83 additions & 0 deletions)

---
title: Getting started with just-scrape
description: 'Install and configure the just-scrape CLI in minutes'
---

`just-scrape` is the official command-line interface for ScrapeGraphAI. It gives you AI-powered web scraping, data extraction, search, and crawling directly from your terminal.

## Installation

<CodeGroup>

```bash npm
npm install -g just-scrape
```

```bash pnpm
pnpm add -g just-scrape
```

```bash yarn
yarn global add just-scrape
```

```bash bun
bun add -g just-scrape
```

```bash npx (no install)
npx just-scrape --help
```

```bash bunx (no install)
bunx just-scrape --help
```

</CodeGroup>

Package: [just-scrape](https://www.npmjs.com/package/just-scrape) on npm | [GitHub](https://github.com/ScrapeGraphAI/just-scrape)

## Setting up your API key

The CLI needs a ScrapeGraphAI API key. Get one from the [dashboard](https://dashboard.scrapegraphai.com). The CLI checks for it in this order:

1. **Environment variable**: `export SGAI_API_KEY="sgai-..."`
2. **`.env` file**: `SGAI_API_KEY=sgai-...` in the project root
3. **Config file**: `~/.scrapegraphai/config.json`
4. **Interactive prompt**: the CLI will ask for the key and save it automatically

On a new machine, the easiest approach is simply to run any command: the CLI will prompt you for the key and save it to `~/.scrapegraphai/config.json`, so you never need to set it again.
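
If you prefer to set the key explicitly (method 1), a one-time shell-profile setup looks like this (zsh shown; use `~/.bashrc` for bash):

```bash
# Persist the API key for all future sessions
echo 'export SGAI_API_KEY="sgai-..."' >> ~/.zshrc
source ~/.zshrc
```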

## Environment variables

| Variable | Description | Default |
|---|---|---|
| `SGAI_API_KEY` | ScrapeGraphAI API key | (none) |
| `JUST_SCRAPE_API_URL` | Override the API base URL | `https://api.scrapegraphai.com/v1` |
| `JUST_SCRAPE_TIMEOUT_S` | Request/polling timeout in seconds | `120` |
| `JUST_SCRAPE_DEBUG` | Set to `1` to enable debug logging to stderr | `0` |
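
For example, a one-off run with a longer timeout and debug logging, using only the variables above:

```bash
# 5-minute timeout, debug output on stderr, applied to a single invocation
JUST_SCRAPE_TIMEOUT_S=300 JUST_SCRAPE_DEBUG=1 \
  just-scrape crawl https://docs.example.com --no-extraction --max-pages 10
```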

## Verify your setup

Run a quick health check to confirm the key is valid:

```bash
just-scrape validate
```

Check your credit balance:

```bash
just-scrape credits
```

## Your first scrape

```bash
just-scrape smart-scraper https://news.ycombinator.com \
  -p "Extract the top 5 story titles and their URLs"
```
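
The same scrape in machine-readable form, using the `--json` flag covered in the [JSON mode](/services/cli/json-mode) guide:

```bash
just-scrape smart-scraper https://news.ycombinator.com \
  -p "Extract the top 5 story titles and their URLs" \
  --json
```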

<Note>
See the [full CLI reference](/services/cli) for all commands and options.
</Note>
