Skip to content

feat: Add FirecrawlCrawler#2859

Merged
bogdankostic merged 12 commits intomainfrom
firecrawl-integration
Feb 24, 2026
Merged

feat: Add FirecrawlCrawler#2859
bogdankostic merged 12 commits intomainfrom
firecrawl-integration

Conversation

@bogdankostic
Copy link
Copy Markdown
Contributor

@bogdankostic bogdankostic commented Feb 19, 2026

Related Issues

Proposed Changes:

Added new FirecrawlFetcher component under haystack_integrations.components.fetchers.firecrawl.
implementing both sync (run) and async (run_async) crawling flows with Firecrawl clients.

How did you test it?

I added unit tests and integration tests.

Notes for the reviewer

Firecrawl also provides websearch functionality. We can add that in a separate PR by introducing a FirecrawlWebSearch component, analogous to the already exisitng SearchApiWebSearch and SerperDevWebSearch components.

Checklist

@github-actions github-actions Bot added the type:documentation Improvements or additions to documentation label Feb 19, 2026
@bogdankostic bogdankostic marked this pull request as ready for review February 19, 2026 17:03
@bogdankostic bogdankostic requested a review from a team as a code owner February 19, 2026 17:03
@bogdankostic bogdankostic requested review from anakin87 and removed request for a team February 19, 2026 17:03
@bogdankostic bogdankostic changed the title feat: Add FirecrawlCrawler feat: Add FirecrawlFetcher Feb 19, 2026
Copy link
Copy Markdown
Member

@anakin87 anakin87 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice work.

I left some initial comments and also added a task list (copied from our new integration template) in https://github.com/deepset-ai/haystack-private/issues/89#issuecomment-3932711305.

Comment thread .github/workflows/firecrawl.yml Outdated
Comment thread integrations/firecrawl/README.md Outdated
Comment thread integrations/firecrawl/README.md
Comment thread integrations/firecrawl/pyproject.toml Outdated
Comment thread integrations/firecrawl/tests/test_firecrawl_fetcher.py Outdated
Comment thread integrations/firecrawl/tests/test_firecrawl_fetcher.py Outdated
@bogdankostic bogdankostic changed the title feat: Add FirecrawlFetcher feat: Add FirecrawlCrawler Feb 20, 2026
Comment thread .github/workflows/firecrawl.yml Outdated
@anakin87
Copy link
Copy Markdown
Member

Please add the integration to the Inventory section of this repo README

Comment thread integrations/firecrawl/tests/test_firecrawl_crawler.py Outdated
Comment thread README.md Outdated
Copy link
Copy Markdown
Member

@anakin87 anakin87 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good!

@bogdankostic bogdankostic merged commit 3f8b97b into main Feb 24, 2026
30 checks passed
@bogdankostic bogdankostic deleted the firecrawl-integration branch February 24, 2026 14:11
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

integration:firecrawl topic:CI type:documentation Improvements or additions to documentation

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants