apify
diff --git a/docs/01_introduction/quick-start.mdx b/docs/01_introduction/quick-start.mdx
Lines changed: 17 additions & 9 deletions
diff --git a/docs/03_guides/01_beautifulsoup_httpx.mdx b/docs/03_guides/01_beautifulsoup_httpx.mdx
Lines changed: 7 additions & 3 deletions
diff --git a/docs/03_guides/02_parsel_impit.mdx b/docs/03_guides/02_parsel_impit.mdx
Lines changed: 7 additions & 3 deletions
diff --git a/docs/03_guides/03_playwright.mdx b/docs/03_guides/03_playwright.mdx
Lines changed: 8 additions & 4 deletions
diff --git a/docs/03_guides/04_selenium.mdx b/docs/03_guides/04_selenium.mdx
Lines changed: 10 additions & 4 deletions
diff --git a/docs/03_guides/05_crawlee.mdx b/docs/03_guides/05_crawlee.mdx
Lines changed: 6 additions & 2 deletions
diff --git a/docs/03_guides/06_scrapy.mdx b/docs/03_guides/06_scrapy.mdx
Lines changed: 2 additions & 2 deletions
diff --git a/docs/03_guides/07_running_webserver.mdx b/docs/03_guides/12_running_webserver.mdx
docs/03_guides/07_running_webserver.mdx renamed to docs/03_guides/12_running_webserver.mdx
Lines changed: 26 additions & 2 deletions
@@ -67,7 +67,7 @@ The Actor's source code is in the `src` folder. This folder contains two importa
             {MainExample}
         </CodeBlock>
     </TabItem>
-    <TabItem value="__main__.py" label="__main.py__">
+    <TabItem value="__main__.py" label="__main__.py">
         <CodeBlock className="language-python">
             {UnderscoreMainExample}
         </CodeBlock>
@@ -97,12 +97,20 @@ To learn more about the features of the Apify SDK and how to use them, check out
 
 ### Guides
 
-To see how you can integrate the Apify SDK with popular web scraping libraries, check out our guides:
+To see how you can integrate the Apify SDK with popular scraping libraries and frameworks, check out these guides:
 
-- [BeautifulSoup with HTTPX](../guides/beautifulsoup-httpx)
-- [Parsel with Impit](../guides/parsel-impit)
-- [Playwright](../guides/playwright)
-- [Selenium](../guides/selenium)
-- [Crawlee](../guides/crawlee)
-- [Scrapy](../guides/scrapy)
-- [Running webserver](../guides/running-webserver)
+- [Scraping with BeautifulSoup and HTTPX](../guides/beautifulsoup-httpx)
+- [Scraping with Parsel and Impit](../guides/parsel-impit)
+- [Browser automation with Playwright](../guides/playwright)
+- [Browser automation with Selenium](../guides/selenium)
+- [Building crawlers with Crawlee](../guides/crawlee)
+- [Building crawlers with Scrapy](../guides/scrapy)
+- [Adaptive scraping with Scrapling](../guides/scrapling)
+- [LLM-ready scraping with Crawl4AI](../guides/crawl4ai)
+- [Browser AI agents with Browser Use](../guides/browser-use)
+
+For other aspects of Actor development, explore these guides:
+
+- [Project management with uv](../guides/uv)
+- [Input validation with Pydantic](../guides/input-validation)
+- [Running a web server](../guides/running-webserver)
@@ -1,14 +1,14 @@
 ---
 id: beautifulsoup-httpx
-title: Use BeautifulSoup with HTTPX
+title: Scraping with BeautifulSoup and HTTPX
 description: Build an Apify Actor that scrapes web pages using BeautifulSoup and HTTPX.
 ---
 
 import RunnableCodeBlock from '@site/src/components/RunnableCodeBlock';
 
 import BeautifulSoupHttpxExample from '!!raw-loader!roa-loader!./code/01_beautifulsoup_httpx.py';
 
-In this guide, you'll learn how to use the [BeautifulSoup](https://www.crummy.com/software/BeautifulSoup/) library with the [HTTPX](https://www.python-httpx.org/) library in your Apify Actors.
+In this guide, you'll learn how to scrape web pages with the [BeautifulSoup](https://www.crummy.com/software/BeautifulSoup/) and [HTTPX](https://www.python-httpx.org/) libraries in your Apify Actors.
 
 ## Introduction
 
@@ -20,12 +20,16 @@ To create an Actor which uses those libraries, start from the [BeautifulSoup & P
 
 ## Example Actor
 
-Below is a simple Actor that recursively scrapes titles from all linked websites, up to a specified maximum depth, starting from URLs provided in the Actor input. It uses [HTTPX](https://www.python-httpx.org/) for fetching pages and [BeautifulSoup](https://www.crummy.com/software/BeautifulSoup/) for parsing their content to extract titles and links to other pages.
+Below is a simple Actor that recursively scrapes data from linked pages on the same site, up to a specified maximum depth, starting from URLs provided in the Actor input. It uses [HTTPX](https://www.python-httpx.org/) for fetching pages through [Apify Proxy](https://docs.apify.com/platform/proxy) and [BeautifulSoup](https://www.crummy.com/software/BeautifulSoup/) for parsing their content to extract the title, headings, and links to other pages.
 
 <RunnableCodeBlock className="language-python" language="python">
     {BeautifulSoupHttpxExample}
 </RunnableCodeBlock>
 
+## Using Apify Proxy
+
+Running on the Apify platform gives your scraper access to [Apify Proxy](https://docs.apify.com/platform/proxy), which rotates IP addresses to avoid rate limiting and blocking. The example creates a proxy configuration with `Actor.create_proxy_configuration` and fetches a fresh proxy URL for every request, so each page goes through a different IP. A new HTTPX client is created per request to apply that URL. To select specific proxy groups or a country, pass the relevant arguments to `Actor.create_proxy_configuration`. For more details, see the [Proxy management](../concepts/proxy-management) guide.
+
 ## Conclusion
 
 In this guide, you learned how to use [BeautifulSoup](https://www.crummy.com/software/BeautifulSoup/) with [HTTPX](https://www.python-httpx.org/) in your Apify Actors. By combining these libraries, you can efficiently extract data from HTML or XML files, making it easy to build web scraping tasks in Python. See the [Actor templates](https://apify.com/templates/categories/python) to get started with your own scraping tasks. If you have questions or need assistance, feel free to reach out on our [GitHub](https://github.com/apify/apify-sdk-python) or join our [Discord community](https://discord.com/invite/jyEM2PRvMU). Happy scraping!
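
The updated guide describes a crawl that stays on the same site and stops at a maximum depth. The example file itself is not part of this diff, so the following is only a minimal standard-library sketch of that traversal logic, not the guide's actual code. The names `is_same_site`, `crawl`, `get_links`, and the in-memory `PAGES` mapping are hypothetical; a real Actor would fetch pages with HTTPX and parse them with BeautifulSoup instead.

```python
from collections import deque
from urllib.parse import urljoin, urlparse


def is_same_site(url: str, start_url: str) -> bool:
    """Return True when both URLs share a host (a deliberately simple notion of 'same site')."""
    return urlparse(url).netloc == urlparse(start_url).netloc


def crawl(start_url: str, max_depth: int, get_links) -> list[tuple[str, int]]:
    """Breadth-first traversal that stays on the start URL's host.

    `get_links` is a hypothetical callable returning the raw href values found
    on a page; in a real Actor it would fetch and parse the page.
    """
    visited = {start_url}
    queue = deque([(start_url, 0)])
    order = []

    while queue:
        url, depth = queue.popleft()
        order.append((url, depth))

        if depth >= max_depth:
            continue  # Do not follow links beyond the maximum depth.

        for href in get_links(url):
            link = urljoin(url, href)  # Resolve relative links against the page URL.
            if link not in visited and is_same_site(link, start_url):
                visited.add(link)
                queue.append((link, depth + 1))

    return order


# A tiny in-memory "website" standing in for real HTTP responses.
PAGES = {
    'https://example.com/': ['/a', 'https://other.example.org/x'],
    'https://example.com/a': ['/b', '/'],
    'https://example.com/b': ['/c'],
}

result = crawl('https://example.com/', max_depth=2, get_links=lambda url: PAGES.get(url, []))
```

With `max_depth=2`, the page at `/c` is never visited because it is only linked from a page already at the depth limit, and the link to the other host is skipped.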
 
@@ -1,14 +1,14 @@
 ---
 id: parsel-impit
-title: Use Parsel with Impit
+title: Scraping with Parsel and Impit
 description: Build an Apify Actor that scrapes web pages using Parsel selectors and the Impit HTTP client.
 ---
 
 import RunnableCodeBlock from '@site/src/components/RunnableCodeBlock';
 
 import ParselImpitExample from '!!raw-loader!roa-loader!./code/02_parsel_impit.py';
 
-In this guide, you'll learn how to combine the [Parsel](https://github.com/scrapy/parsel) and [Impit](https://github.com/apify/impit) libraries when building Apify Actors.
+In this guide, you'll learn how to scrape web pages with the [Parsel](https://github.com/scrapy/parsel) and [Impit](https://github.com/apify/impit) libraries in your Apify Actors.
 
 ## Introduction
 
@@ -18,12 +18,16 @@ In this guide, you'll learn how to combine the [Parsel](https://github.com/scrap
 
 ## Example Actor
 
-The following example shows a simple Actor that recursively scrapes titles from linked pages, up to a user-defined maximum depth. It uses [Impit](https://github.com/apify/impit) to fetch pages and [Parsel](https://github.com/scrapy/parsel) to extract titles and discover new links.
+The following example shows a simple Actor that recursively scrapes data from linked pages on the same site, up to a user-defined maximum depth. It uses [Impit](https://github.com/apify/impit) to fetch pages through [Apify Proxy](https://docs.apify.com/platform/proxy) and [Parsel](https://github.com/scrapy/parsel) to extract the title, headings, and links.
 
 <RunnableCodeBlock className="language-python" language="python">
     {ParselImpitExample}
 </RunnableCodeBlock>
 
+## Using Apify Proxy
+
+Running on the Apify platform gives your scraper access to [Apify Proxy](https://docs.apify.com/platform/proxy), which rotates IP addresses to avoid rate limiting and blocking. The example creates a proxy configuration with `Actor.create_proxy_configuration` and fetches a fresh proxy URL for every request, so each page goes through a different IP. A new Impit client is created per request to apply that URL. To select specific proxy groups or a country, pass the relevant arguments to `Actor.create_proxy_configuration`. For more details, see the [Proxy management](../concepts/proxy-management) guide.
+
 ## Conclusion
 
 In this guide, you learned how to use [Parsel](https://github.com/scrapy/parsel) with [Impit](https://github.com/apify/impit) in your Apify Actors. By combining these libraries, you get a powerful and efficient solution for web scraping: [Parsel](https://github.com/scrapy/parsel) provides excellent CSS selector and XPath support for data extraction, while [Impit](https://github.com/apify/impit) offers a fast and simple HTTP client built by Apify. This combination makes it easy to build scalable web scraping tasks in Python. See the [Actor templates](https://apify.com/templates/categories/python) to get started with your own scraping tasks. If you have questions or need assistance, feel free to reach out on our [GitHub](https://github.com/apify/apify-sdk-python) or join our [Discord community](https://discord.com/invite/jyEM2PRvMU). Happy scraping!
 
@@ -1,6 +1,6 @@
 ---
 id: playwright
-title: Use Playwright
+title: Browser automation with Playwright
 description: Build an Apify Actor that scrapes dynamic web pages using Playwright browser automation.
 ---
 
@@ -11,7 +11,7 @@ import RunnableCodeBlock from '@site/src/components/RunnableCodeBlock';
 
 import PlaywrightExample from '!!raw-loader!roa-loader!./code/03_playwright.py';
 
-In this guide, you'll learn how to use [Playwright](https://playwright.dev) for web scraping in your Apify Actors.
+In this guide, you'll learn how to use [Playwright](https://playwright.dev) for browser automation and web scraping in your Apify Actors.
 
 ## Introduction
 
@@ -48,14 +48,18 @@ playwright install --with-deps`
 
 ## Example Actor
 
-This is a simple Actor that recursively scrapes titles from all linked websites, up to a maximum depth, starting from URLs in the Actor input.
+This is a simple Actor that recursively scrapes data from linked pages on the same site, up to a maximum depth, starting from URLs in the Actor input.
 
-It uses Playwright to open the pages in an automated Chrome browser, and to extract the title and anchor elements after the pages load.
+It uses Playwright to open the pages in an automated Chrome browser, and to extract the title, headings, and links after the pages load.
 
 <RunnableCodeBlock className="language-python" language="python">
     {PlaywrightExample}
 </RunnableCodeBlock>
 
+## Using Apify Proxy
+
+Running on the Apify platform gives your scraper access to [Apify Proxy](https://docs.apify.com/platform/proxy), which rotates IP addresses to avoid rate limiting and blocking. The example creates a proxy configuration with `Actor.create_proxy_configuration` and launches the browser through it. Playwright applies the proxy at the browser level, so the whole run shares a single proxy URL rather than rotating per request; the `to_playwright_proxy` helper splits that URL into the `server`, `username`, and `password` fields Playwright expects. To select specific proxy groups or a country, pass the relevant arguments to `Actor.create_proxy_configuration`. For more details, see the [Proxy management](../concepts/proxy-management) guide.
+
 ## Conclusion
 
 In this guide you learned how to create Actors that use Playwright to scrape websites. Playwright is a powerful tool that can be used to manage browser instances and scrape websites that require JavaScript execution. See the [Actor templates](https://apify.com/templates/categories/python) to get started with your own scraping tasks. If you have questions or need assistance, feel free to reach out on our [GitHub](https://github.com/apify/apify-sdk-python) or join our [Discord community](https://discord.com/invite/jyEM2PRvMU). Happy scraping!
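
The new "Using Apify Proxy" section mentions a `to_playwright_proxy` helper that splits a proxy URL into the `server`, `username`, and `password` fields. Its implementation lives in the example file, which this diff does not show, so the version below is a guess at what such a helper could look like using only `urllib.parse`. The proxy URL is a placeholder.

```python
from urllib.parse import unquote, urlparse


def to_playwright_proxy(proxy_url: str) -> dict[str, str]:
    """Split a proxy URL into the fields of Playwright's `proxy` launch option.

    Sketch only: the helper in the guide's example may differ.
    """
    parsed = urlparse(proxy_url)

    server = f'{parsed.scheme}://{parsed.hostname}'
    if parsed.port:
        server += f':{parsed.port}'

    proxy = {'server': server}
    # Credentials in a URL are percent-encoded, so decode them before use.
    if parsed.username:
        proxy['username'] = unquote(parsed.username)
    if parsed.password:
        proxy['password'] = unquote(parsed.password)
    return proxy


# Placeholder URL for illustration; a real one would come from the proxy configuration.
example = to_playwright_proxy('http://auto:p%40ss@proxy.example.com:8000')
```

The resulting dictionary has the shape Playwright accepts as the `proxy` argument when launching a browser.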
 
@@ -1,14 +1,14 @@
 ---
 id: selenium
-title: Use Selenium
+title: Browser automation with Selenium
 description: Build an Apify Actor that scrapes dynamic web pages using Selenium WebDriver.
 ---
 
 import RunnableCodeBlock from '@site/src/components/RunnableCodeBlock';
 
 import SeleniumExample from '!!raw-loader!roa-loader!./code/04_selenium.py';
 
-In this guide, you'll learn how to use [Selenium](https://www.selenium.dev/) for web scraping in your Apify Actors.
+In this guide, you'll learn how to use [Selenium](https://www.selenium.dev/) for browser automation and web scraping in your Apify Actors.
 
 ## Introduction
 
@@ -32,14 +32,20 @@ Refer to the [Selenium documentation](https://www.selenium.dev/documentation/web
 
 ## Example Actor
 
-This is a simple Actor that recursively scrapes titles from all linked websites, up to a maximum depth, starting from URLs in the Actor input.
+This is a simple Actor that recursively scrapes data from linked pages on the same site, up to a maximum depth, starting from URLs in the Actor input.
 
-It uses Selenium ChromeDriver to open the pages in an automated Chrome browser, and to extract the title and anchor elements after the pages load.
+It uses Selenium ChromeDriver to open the pages in an automated Chrome browser, and to extract the title, headings, and links after the pages load.
 
 <RunnableCodeBlock className="language-python" language="python">
     {SeleniumExample}
 </RunnableCodeBlock>
 
+## Using Apify Proxy
+
+Running on the Apify platform gives your scraper access to [Apify Proxy](https://docs.apify.com/platform/proxy), which rotates IP addresses to avoid rate limiting and blocking. The example creates a proxy configuration with `Actor.create_proxy_configuration` and routes the browser through it for the whole run.
+
+Chrome ignores the credentials passed in the `--proxy-server` flag, so an authenticated proxy such as Apify Proxy has to be configured from inside a small extension. The `proxy_auth_extension` helper builds one at runtime: its service worker sets the proxy server and answers the browser's authentication challenge with the username and password. Note that the new headless mode (`--headless=new`) is required for Chrome to load the extension. To select specific proxy groups or a country, pass the relevant arguments to `Actor.create_proxy_configuration`. For more details, see the [Proxy management](../concepts/proxy-management) guide.
+
 ## Conclusion
 
 In this guide you learned how to use Selenium for web scraping in Apify Actors. You can now create your own Actors that use Selenium to scrape dynamic websites and interact with web pages just like a human would. See the [Actor templates](https://apify.com/templates/categories/python) to get started with your own scraping tasks. If you have questions or need assistance, feel free to reach out on our [GitHub](https://github.com/apify/apify-sdk-python) or join our [Discord community](https://discord.com/invite/jyEM2PRvMU). Happy scraping!
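
The Selenium section describes a `proxy_auth_extension` helper whose service worker sets the proxy server and answers the authentication challenge. The helper's code is not included in this diff, so the following is a rough sketch of how such an extension could be generated, assuming a Manifest V3 extension that uses `chrome.proxy.settings.set` and `chrome.webRequest.onAuthRequired`. The function body, file layout, and proxy URL are assumptions, and loading the extension into Chrome is left out.

```python
import json
import tempfile
from pathlib import Path
from urllib.parse import unquote, urlparse

# Manifest V3 needs the `webRequestAuthProvider` permission to supply credentials.
MANIFEST = {
    'manifest_version': 3,
    'name': 'Proxy auth (sketch)',
    'version': '1.0',
    'permissions': ['proxy', 'webRequest', 'webRequestAuthProvider'],
    'host_permissions': ['<all_urls>'],
    'background': {'service_worker': 'background.js'},
}

SERVICE_WORKER = """\
const proxy = __PROXY__;

// Route all traffic through the fixed proxy server.
chrome.proxy.settings.set({
  value: {
    mode: 'fixed_servers',
    rules: {singleProxy: {scheme: proxy.scheme, host: proxy.host, port: proxy.port}},
  },
  scope: 'regular',
});

// Answer the proxy's authentication challenge with the stored credentials.
chrome.webRequest.onAuthRequired.addListener(
  (details, callback) => callback({
    authCredentials: {username: proxy.username, password: proxy.password},
  }),
  {urls: ['<all_urls>']},
  ['asyncBlocking'],
);
"""


def proxy_auth_extension(proxy_url: str) -> Path:
    """Write an unpacked extension for the given proxy URL and return its directory."""
    parsed = urlparse(proxy_url)
    proxy = {
        'scheme': parsed.scheme,
        'host': parsed.hostname,
        'port': parsed.port,
        'username': unquote(parsed.username or ''),
        'password': unquote(parsed.password or ''),
    }

    extension_dir = Path(tempfile.mkdtemp(prefix='proxy_auth_extension_'))
    (extension_dir / 'manifest.json').write_text(json.dumps(MANIFEST, indent=2))
    # json.dumps yields a valid JavaScript object literal for these plain values.
    script = SERVICE_WORKER.replace('__PROXY__', json.dumps(proxy))
    (extension_dir / 'background.js').write_text(script)
    return extension_dir


# Placeholder URL for illustration only.
extension_dir = proxy_auth_extension('http://user:secret@proxy.example.com:8000')
```

Generating the files at runtime keeps the credentials out of the repository, since they are only known once the proxy configuration has produced a URL.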
 
@@ -1,6 +1,6 @@
 ---
 id: crawlee
-title: Use Crawlee
+title: Building crawlers with Crawlee
 description: Build Apify Actors using Crawlee's BeautifulSoupCrawler, ParselCrawler, or PlaywrightCrawler.
 ---
 
@@ -10,7 +10,7 @@ import CrawleeBeautifulSoupExample from '!!raw-loader!roa-loader!./code/05_crawl
 import CrawleeParselExample from '!!raw-loader!roa-loader!./code/05_crawlee_parsel.py';
 import CrawleePlaywrightExample from '!!raw-loader!roa-loader!./code/05_crawlee_playwright.py';
 
-In this guide, you'll learn how to use the [Crawlee](https://crawlee.dev/python) library in your Apify Actors.
+In this guide, you'll learn how to build web crawlers with the [Crawlee](https://crawlee.dev/python) library in your Apify Actors.
 
 ## Introduction
 
@@ -42,6 +42,10 @@ The [`PlaywrightCrawler`](https://crawlee.dev/python/api/class/PlaywrightCrawler
     {CrawleePlaywrightExample}
 </RunnableCodeBlock>
 
+## Using Apify Proxy
+
+All three crawlers above route their requests through [Apify Proxy](https://docs.apify.com/platform/proxy), which rotates IP addresses to avoid rate limiting and blocking. `Actor.create_proxy_configuration` returns a Crawlee-compatible proxy configuration, which is passed to the crawler as `proxy_configuration`; Crawlee then rotates the proxy IP for every request on its own. Because the configuration is only available inside the running Actor, the crawler is created in `main` and the request handler is registered on a standalone [`Router`](https://crawlee.dev/python/api/class/Router) up front. To select specific proxy groups or a country, pass the relevant arguments to `Actor.create_proxy_configuration`. For more details, see the [Proxy management](../concepts/proxy-management) guide.
+
 ## Conclusion
 
 In this guide, you learned how to use the [Crawlee](https://crawlee.dev/python) library in your Apify Actors. By using the [`BeautifulSoupCrawler`](https://crawlee.dev/python/api/class/BeautifulSoupCrawler), [`ParselCrawler`](https://crawlee.dev/python/api/class/ParselCrawler), and [`PlaywrightCrawler`](https://crawlee.dev/python/api/class/PlaywrightCrawler) crawlers, you can efficiently scrape static or dynamic web pages, making it easy to build web scraping tasks in Python. See the [Actor templates](https://apify.com/templates/categories/python) to get started with your own scraping tasks. If you have questions or need assistance, feel free to reach out on our [GitHub](https://github.com/apify/apify-sdk-python) or join our [Discord community](https://discord.com/invite/jyEM2PRvMU). Happy scraping!
 
@@ -1,6 +1,6 @@
 ---
 id: scrapy
-title: Use Scrapy
+title: Building crawlers with Scrapy
 description: Convert Scrapy spiders into Apify Actors with platform storage and proxy integration.
 ---
 
@@ -15,7 +15,7 @@ import ItemsExample from '!!raw-loader!./code/scrapy_project/src/items.py';
 import SpidersExample from '!!raw-loader!./code/scrapy_project/src/spiders/title.py';
 import SettingsExample from '!!raw-loader!./code/scrapy_project/src/settings.py';
 
-In this guide, you'll learn how to use the [Scrapy](https://scrapy.org/) framework in your Apify Actors.
+In this guide, you'll learn how to build web crawlers with the [Scrapy](https://scrapy.org/) framework in your Apify Actors.
 
 ## Introduction
 
 
@@ -1,12 +1,13 @@
 ---
 id: running-webserver
-title: Run a web server
+title: Running a web server
 description: Run an HTTP server inside your Actor for monitoring or serving content during execution.
 ---
 
 import RunnableCodeBlock from '@site/src/components/RunnableCodeBlock';
 
-import WebserverExample from '!!raw-loader!roa-loader!./code/07_webserver.py';
+import WebserverExample from '!!raw-loader!roa-loader!./code/12_webserver.py';
+import WebserverFastApiExample from '!!raw-loader!roa-loader!./code/12_webserver_fastapi.py';
 
 In this guide, you'll learn how to run a web server inside your Apify Actor. This is useful for monitoring Actor progress, creating custom APIs, or serving content during the Actor run.
 
@@ -30,6 +31,29 @@ The following example shows how to start a simple web server in your Actor, whic
     {WebserverExample}
 </RunnableCodeBlock>
 
+## Using FastAPI
+
+The example above relies only on Python's standard library, which keeps it dependency-free but leaves you handling requests by hand. For anything beyond a single endpoint, a web framework such as [FastAPI](https://fastapi.tiangolo.com/) is a better fit - it gives you routing, request parsing, and automatic JSON responses, and is served by an ASGI server like [uvicorn](https://www.uvicorn.org/).
+
+Install both, for example by adding them to your `requirements.txt`:
+
+```text
+fastapi
+uvicorn[standard]
+```
+
+The following Actor serves the same processed-items counter as before, but through a FastAPI endpoint. The key difference is that uvicorn runs inside the Actor's event loop as a background task, bound to `Actor.configuration.web_server_port` so the platform routes the container URL to it:
+
+<RunnableCodeBlock className="language-python" language="python">
+    {WebserverFastApiExample}
+</RunnableCodeBlock>
+
+A few things worth pointing out:
+
+- `uvicorn.Server(...).serve()` is a coroutine, so it runs as an `asyncio` task alongside the Actor's own work instead of blocking it. Setting `server.should_exit = True` triggers a graceful shutdown once the work is done.
+- The server binds to `0.0.0.0` (all interfaces) rather than `localhost`, so it's reachable through the container URL, not only from inside the container.
+- The same pattern powers an [Actor Standby](#actor-standby) service - swap the one-off work loop for an Actor that just keeps serving requests.
+
 ## Actor Standby
 
 The example above runs a web server for the duration of a single Actor run. With [Actor Standby](https://docs.apify.com/platform/actors/development/programming-interface/standby), you can instead expose your Actor as an always-ready HTTP API: the platform keeps the Actor running in the background and routes incoming HTTP requests to the web server inside it, spinning up additional instances as the load grows.
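
The FastAPI notes above rest on one pattern: the server runs as an `asyncio` task alongside the Actor's work and is shut down once the work is done. The sketch below illustrates only that pattern, and it deliberately swaps in the standard library's `asyncio.start_server` for FastAPI and uvicorn so that it runs without dependencies. Task cancellation stands in for `server.should_exit = True`, and port `0` (any free port) stands in for `Actor.configuration.web_server_port`. All function names here are hypothetical.

```python
import asyncio
import contextlib
import json

processed_items = 0


async def handle_request(reader: asyncio.StreamReader, writer: asyncio.StreamWriter) -> None:
    """Reply to any HTTP request with the current counter as JSON."""
    # Consume the request line and headers; this sketch ignores their contents.
    while (await reader.readline()) not in (b'\r\n', b''):
        pass

    body = json.dumps({'processed_items': processed_items}).encode()
    headers = (
        'HTTP/1.1 200 OK\r\n'
        'Content-Type: application/json\r\n'
        f'Content-Length: {len(body)}\r\n'
        'Connection: close\r\n\r\n'
    )
    writer.write(headers.encode() + body)
    await writer.drain()
    writer.close()
    await writer.wait_closed()


async def fetch_status(port: int) -> dict:
    """Tiny HTTP client used to show that the server answers while work is running."""
    reader, writer = await asyncio.open_connection('127.0.0.1', port)
    writer.write(b'GET / HTTP/1.1\r\nHost: localhost\r\n\r\n')
    await writer.drain()
    response = await reader.read()  # Read until the server closes the connection.
    writer.close()
    await writer.wait_closed()
    return json.loads(response.split(b'\r\n\r\n', 1)[1])


async def main() -> dict:
    global processed_items

    # Bind to all interfaces, as the guide recommends for the container URL.
    server = await asyncio.start_server(handle_request, '0.0.0.0', 0)
    port = server.sockets[0].getsockname()[1]
    server_task = asyncio.create_task(server.serve_forever())

    # Simulated Actor work, running concurrently with the server task.
    for _ in range(3):
        await asyncio.sleep(0.01)
        processed_items += 1

    status = await fetch_status(port)

    # Work is done: stop the server task, which also closes the server.
    server_task.cancel()
    with contextlib.suppress(asyncio.CancelledError):
        await server_task
    return status


status = asyncio.run(main())
```

Because the server is just another task on the event loop, the work loop never blocks on it, and shutting it down is a matter of ending that task when the work finishes.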