@@ -132,10 +132,13 @@ Search the web and extract structured data from results (replaces `search-scrape
132132### Usage
133133
134134``` bash
135- just-scrape search < query> # AI-powered web search
136- just-scrape search < query> --num-results < n> # Sources to scrape (1-20, default 3)
137- just-scrape search < query> -p < prompt> # Extraction prompt for results
138- just-scrape search < query> --schema < json> # Enforce output schema
135+ just-scrape search < query> # AI-powered web search
136+ just-scrape search < query> --num-results < n> # Sources to scrape (1-20, default 3)
137+ just-scrape search < query> -p < prompt> # Extraction prompt for results
138+ just-scrape search < query> --schema < json> # Enforce output schema
139+ just-scrape search < query> --location-geo-code < code> # Geo-target search (e.g. 'us', 'de', 'jp-tk')
140+ just-scrape search < query> --time-range < range> # past_hour | past_24_hours | past_week | past_month | past_year
141+ just-scrape search < query> --format < markdown| html> # Result format (default markdown)
139142just-scrape search < query> --headers < json>
140143```
141144
@@ -145,40 +148,53 @@ just-scrape search <query> --headers <json>
145148# Research a topic across multiple sources
146149just-scrape search " What are the best Python web frameworks in 2025?" --num-results 10
147150
151+ # Recent news only, scoped to Germany
152+ just-scrape search " EU AI act latest news" --time-range past_week --location-geo-code de
153+
148154# Structured output with schema
149155just-scrape search " Top 5 cloud providers pricing" \
150156 --schema ' {"type":"object","properties":{"providers":{"type":"array","items":{"type":"object","properties":{"name":{"type":"string"},"free_tier":{"type":"string"}}}}}}'
151157```
152158
153159## Scrape
154160
155- Scrape content from a URL in various formats: markdown (default), html, screenshot, or branding. [ docs] ( https://docs.scrapegraphai.com/api-reference/scrape )
161+ Scrape content from a URL in one or more formats. The v2 API supports ** 8 formats ** : ` markdown ` , ` html ` , ` screenshot ` , ` branding ` , ` links ` , ` images ` , ` summary ` , ` json ` . [ docs] ( https://docs.scrapegraphai.com/api-reference/scrape )
156162
157163### Usage
158164
159165``` bash
160- just-scrape scrape < url> # Markdown (default)
161- just-scrape scrape < url> -f html # Raw HTML
162- just-scrape scrape < url> -f screenshot # Screenshot
163- just-scrape scrape < url> -f branding # Extract branding info
164- just-scrape scrape < url> -m direct+stealth # Anti-bot bypass
165- just-scrape scrape < url> --country < iso> # Geo-targeting
166+ just-scrape scrape < url> # Markdown (default)
167+ just-scrape scrape < url> -f html # Raw HTML
168+ just-scrape scrape < url> -f screenshot # Page screenshot
169+ just-scrape scrape < url> -f branding # Branding (logos, colors, fonts)
170+ just-scrape scrape < url> -f links # Extracted links
171+ just-scrape scrape < url> -f images # Extracted images
172+ just-scrape scrape < url> -f summary # AI-generated page summary
173+ just-scrape scrape < url> -f json -p < prompt> # Structured JSON via prompt
174+ just-scrape scrape < url> -f markdown,links,images # Multi-format (comma-separated)
175+ just-scrape scrape < url> --html-mode reader # normal (default), reader, or prune
176+ just-scrape scrape < url> --scrolls < n> # Infinite scroll (0-100)
177+ just-scrape scrape < url> -m direct+stealth # Anti-bot bypass
178+ just-scrape scrape < url> --country < iso> # Geo-targeting
166179```
167180
168181### Examples
169182
170183``` bash
171- # Get markdown of a page
184+ # Markdown of a page
172185just-scrape scrape https://example.com
173186
174- # Get raw HTML
175- just-scrape scrape https://example.com -f html
187+ # Raw HTML with reader-mode extraction
188+ just-scrape scrape https://blog.example.com -f html --html-mode reader
189+
190+ # Multi-format: markdown + links + images in a single call
191+ just-scrape scrape https://example.com -f markdown,links,images
192+
193+ # Structured JSON output with a prompt
194+ just-scrape scrape https://store.example.com -f json -p " Extract product name and price"
176195
177196# Scrape with anti-bot bypass and geo-targeting
178197just-scrape scrape https://store.example.com -m direct+stealth --country DE
179-
180- # Extract branding info (logos, colors, fonts)
181- just-scrape scrape https://example.com -f branding
182198```
183199
184200## Markdownify
@@ -218,6 +234,7 @@ just-scrape crawl <url> --max-pages <n> # Max pages (default 50)
218234just-scrape crawl < url> --max-depth < n> # Crawl depth (default 2)
219235just-scrape crawl < url> --max-links-per-page < n> # Links per page (default 10)
220236just-scrape crawl < url> --allow-external # Allow external domains
237+ just-scrape crawl < url> -f html # Page format (default markdown)
221238just-scrape crawl < url> -m direct+stealth # Anti-bot bypass
222239```
223240
@@ -283,8 +300,9 @@ Commands have been renamed to match the v2 API:
283300| ` smart-scraper ` | ` extract ` | Renamed |
284301| ` search-scraper ` | ` search ` | Renamed |
285302| ` markdownify ` | ` markdownify ` | Now wraps ` scrape --format markdown ` |
286- | ` scrape ` | ` scrape ` | Gains ` --format ` flag (markdown, html, screenshot, branding) |
287- | ` crawl ` | ` crawl ` | New options: ` --max-depth ` , ` --max-links-per-page ` , ` --allow-external ` |
303+ | ` scrape ` | ` scrape ` | Gains ` --format ` (markdown, html, screenshot, branding, links, images, summary, json), multi-format via comma, ` --html-mode ` , ` --scrolls ` , ` --prompt ` , ` --schema ` |
304+ | ` crawl ` | ` crawl ` | New options: ` --max-depth ` , ` --max-links-per-page ` , ` --allow-external ` , ` --format ` |
305+ | ` search ` | ` search ` | New options: ` --location-geo-code ` , ` --time-range ` , ` --format ` |
288306| ` --stealth ` flag | ` --mode direct+stealth ` | Fetch mode enum replaces boolean (` auto ` , ` fast ` , ` js ` , ` direct+stealth ` , ` js+stealth ` ) |
289307| ` agentic-scraper ` | — | Removed from API |
290308| ` generate-schema ` | — | Removed from API |
0 commit comments