Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
101 commits
Select commit Hold shift + click to select a range
4045f04
Merge pull request #994 from ScrapeGraphAI/pre/beta
VinciGit00 Jun 26, 2025
ad6845d
ci(release): 1.60.0 [skip ci]
semantic-release-bot Jun 26, 2025
2dc6b9b
feat: update doc
VinciGit00 Jul 3, 2025
132823b
ci(release): 1.61.0 [skip ci]
semantic-release-bot Jul 3, 2025
4cc1fc5
Fix critical schema transformation bugs and improve logging
Mirza-Samad-Ahmed-Baig Jul 25, 2025
c2abb9f
docs: removed duplicated line
alecontuu Aug 4, 2025
7fe566b
Merge pull request #1004 from alecontuu/readme_fix
VinciGit00 Aug 8, 2025
72b43b3
Update README.md
VinciGit00 Aug 13, 2025
e65da4d
Merge pull request #1001 from Mirza-Samad-Ahmed-Baig/fix-schema-trans…
VinciGit00 Aug 13, 2025
c07b3c0
feat: update pr
VinciGit00 Aug 13, 2025
5f8dbd3
ci(release): 1.62.0 [skip ci]
semantic-release-bot Aug 13, 2025
739b05a
doc: 1$ banner
lurenss Oct 4, 2025
79db9b9
feat: update model tokens
VinciGit00 Oct 22, 2025
7346c26
ci(release): 1.63.0 [skip ci]
semantic-release-bot Oct 22, 2025
8f0433c
fix: url redirect
lurenss Oct 24, 2025
365761a
ci(release): 1.63.1 [skip ci]
semantic-release-bot Oct 24, 2025
e81a4ed
feat: Add configurable timeout to FetchNode
Xyerophyte Nov 1, 2025
eeffa33
Merge pull request #1020 from Xyerophyte/feature/add-timeout-configur…
VinciGit00 Nov 6, 2025
914b85c
Empty commit
VinciGit00 Nov 6, 2025
93b3c5d
ci(release): 1.64.0 [skip ci]
semantic-release-bot Nov 6, 2025
32d5636
Remove downloads badge from README
VinciGit00 Nov 17, 2025
5516ec6
docs: update korean readme
PzaThief Nov 19, 2025
3dc6484
Add download badge and linting badges to README
VinciGit00 Nov 21, 2025
0e12bac
Merge pull request #1022 from PzaThief/main
VinciGit00 Nov 26, 2025
6d13212
Initial plan
Copilot Nov 26, 2025
9439fe5
Fix langchain import issues blocking tests
Copilot Nov 26, 2025
323f26a
Add comprehensive timeout feature documentation
Copilot Nov 26, 2025
34e1308
chore: apply semantic commit format as requested
Copilot Nov 26, 2025
2920d8b
docs: add guide for applying semantic commit format
Copilot Nov 26, 2025
dcd4f9c
docs: update semantic commit guide to use feat(timeout)
Copilot Nov 26, 2025
e887eaf
Merge pull request #1024 from ScrapeGraphAI/copilot/add-timeout-to-fe…
VinciGit00 Nov 26, 2025
cea8e6b
Initial plan
Copilot Nov 26, 2025
cddf497
Add NVIDIA LLM integration support
Copilot Nov 26, 2025
f23072c
Fix linting issues - remove unused imports and whitespace
Copilot Nov 26, 2025
8cf81c9
Add documentation explaining __new__ usage in Nvidia class
Copilot Nov 26, 2025
cfa2fae
Merge pull request #1026 from ScrapeGraphAI/copilot/add-nvidia-integr…
VinciGit00 Nov 30, 2025
6c5f7bb
fix: add null check for document.body when reading scrollHeight
denis-ershov Dec 3, 2025
82fa6fd
Add review comment for PR #1027
VinciGit00 Dec 3, 2025
42c366b
Merge pull request #1027 from denis-ershov/patch-1
VinciGit00 Dec 3, 2025
518d1b0
ci(release): 1.64.1 [skip ci]
semantic-release-bot Dec 3, 2025
4553412
Initial plan
Copilot Dec 3, 2025
7cb49e4
Fix whitespace formatting errors (W291, W292, W293)
Copilot Dec 3, 2025
6deac76
Apply black and isort formatting to modified files
Copilot Dec 3, 2025
2fc5187
Initial plan
Copilot Dec 3, 2025
ced0373
Fix E402 errors in smart_scraper_graph.py by moving imports to top
Copilot Dec 3, 2025
5bf7198
Merge pull request #1028 from ScrapeGraphAI/copilot/fix-whitespace-fo…
VinciGit00 Dec 3, 2025
ece2bb4
Merge pull request #1029 from ScrapeGraphAI/copilot/fix-e402-import-i…
VinciGit00 Dec 4, 2025
c582303
fix: trigger build
VinciGit00 Dec 4, 2025
f038ca1
ci(release): 1.64.2 [skip ci]
semantic-release-bot Dec 4, 2025
e230856
added posthog proxy
mdehsan873 Dec 8, 2025
1f7e533
Merge pull request #1031 from ScrapeGraphAI/posthog-proxy-implement
VinciGit00 Dec 8, 2025
5f07858
feat: empty commit
VinciGit00 Dec 8, 2025
d121e35
ci(release): 1.65.0 [skip ci]
semantic-release-bot Dec 8, 2025
2cd3c8c
feat: add openai gpt 5.2
VinciGit00 Dec 13, 2025
b70afdf
Merge branch 'main' of https://github.com/ScrapeGraphAI/Scrapegraph-ai
VinciGit00 Dec 13, 2025
212f5df
ci(release): 1.66.0 [skip ci]
semantic-release-bot Dec 13, 2025
e81db73
Updates dependencies
jesse-peters Dec 19, 2025
da112db
feat: add benchmark
VinciGit00 Dec 19, 2025
85b0a2c
ci(release): 1.67.0 [skip ci]
semantic-release-bot Dec 19, 2025
54b77f0
Merge pull request #1032 from jesse-peters/Updates-dependencies
VinciGit00 Dec 23, 2025
484e6d7
feat: update of the dependencies
VinciGit00 Dec 23, 2025
118b92a
ci(release): 1.68.0 [skip ci]
semantic-release-bot Dec 23, 2025
e6c6060
feat: add new banner
VinciGit00 Dec 24, 2025
1eb77e4
Merge branch 'main' of https://github.com/ScrapeGraphAI/Scrapegraph-ai
VinciGit00 Dec 24, 2025
4e6e06a
ci(release): 1.69.0 [skip ci]
semantic-release-bot Dec 24, 2025
2b711b4
Update README.md
VinciGit00 Dec 24, 2025
0849e29
Merge branch 'main' of https://github.com/ScrapeGraphAI/Scrapegraph-ai
VinciGit00 Dec 24, 2025
2ef8826
Update README.md
VinciGit00 Dec 26, 2025
ab0da22
feat: add tests
VinciGit00 Jan 3, 2026
f3dbbca
ci(release): 1.70.0 [skip ci]
semantic-release-bot Jan 3, 2026
621d3a5
fix: update langchain imports for v1.0+ compatibility
majiayu000 Jan 4, 2026
ebd909a
fix: use 'content' instead of 'context' in generate_answer_node_k_level
majiayu000 Jan 4, 2026
5ca0e95
Merge pull request #1033 from majiayu000/fix/langchain-v1-compatibility
VinciGit00 Jan 5, 2026
2673c26
feat: add langchain v1.0
VinciGit00 Jan 5, 2026
95e430b
ci(release): 1.71.0 [skip ci]
semantic-release-bot Jan 5, 2026
6ea2cbf
Add format key to LLM configuration, solve bug.
adrienpacifico Jan 7, 2026
fb90295
Merge pull request #1034 from adrienpacifico/patch-1
VinciGit00 Jan 8, 2026
909a0c9
add new readme
VinciGit00 Jan 8, 2026
f315f3a
feat: add new tests
VinciGit00 Jan 20, 2026
d3041bb
ci(release): 1.72.0 [skip ci]
semantic-release-bot Jan 20, 2026
9c24ecc
feat: update model tokens
VinciGit00 Jan 30, 2026
7dc1956
ci(release): 1.73.0 [skip ci]
semantic-release-bot Jan 30, 2026
518945d
use custom api for tracing
Vikrant-Khedkar Feb 16, 2026
b17b154
fix: handle list content in telemetry event validation
Vikrant-Khedkar Feb 16, 2026
96dc59c
remove client side validation to save cpu usage for user
Vikrant-Khedkar Feb 16, 2026
abfa8f1
Merge pull request #1038 from Vikrant-Khedkar/feat/custom-tracing-end…
VinciGit00 Feb 16, 2026
c6fef1f
ci(release): 1.73.1 [skip ci]
semantic-release-bot Feb 16, 2026
e439021
Add initial test file
ramanathnk Feb 23, 2026
09fa945
Merge pull request #1042 from ramanathnk/agentic-bench-recigcdpkcukczsgs
VinciGit00 Feb 24, 2026
6a2f8ec
feat: add MiniMax as a supported LLM provider
octo-patch Mar 14, 2026
d83ec57
Merge pull request #1046 from octo-patch/feature/add-minimax-provider
VinciGit00 Mar 15, 2026
a264b41
ci(release): 1.74.0 [skip ci]
semantic-release-bot Mar 15, 2026
1c21e5a
Update README.md
VinciGit00 Mar 17, 2026
f47be50
feat: upgrade MiniMax default model to M2.7
octo-patch Mar 18, 2026
b17e76b
Merge pull request #1047 from octo-patch/feature/upgrade-minimax-m27
VinciGit00 Mar 18, 2026
cf9b87e
ci(release): 1.75.0 [skip ci]
semantic-release-bot Mar 18, 2026
1d9551a
fix: replace print() statements with proper logging across codebase
Vikrant-Khedkar Mar 24, 2026
7208bff
Merge pull request #1053 from Vikrant-Khedkar/fix/replace-print-with-…
VinciGit00 Mar 24, 2026
ed8685d
ci(release): 1.75.1 [skip ci]
semantic-release-bot Mar 24, 2026
403080a
ci: reduce GitHub Actions costs by ~85% on PRs
VinciGit00 Mar 24, 2026
7b5733d
Merge pull request #1054 from ScrapeGraphAI/ci/reduce-actions-costs
VinciGit00 Mar 24, 2026
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
41 changes: 0 additions & 41 deletions .github/workflows/code-quality.yml

This file was deleted.

213 changes: 213 additions & 0 deletions .github/workflows/test-suite.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,213 @@
name: Test Suite

on:
push:
branches: [main, pre/beta, dev]
pull_request:
branches: [main, pre/beta]
workflow_dispatch:

jobs:
unit-tests:
name: Unit Tests (Python ${{ matrix.python-version }})
runs-on: ${{ matrix.os }}

strategy:
fail-fast: false
matrix:
os: ${{ github.event_name == 'pull_request' && fromJSON('["ubuntu-latest"]') || fromJSON('["ubuntu-latest", "macos-latest", "windows-latest"]') }}
python-version: ['3.10', '3.12']

steps:
- name: Checkout code
uses: actions/checkout@v4

- name: Set up Python ${{ matrix.python-version }}
uses: actions/setup-python@v5
with:
python-version: ${{ matrix.python-version }}

- name: Install uv
uses: astral-sh/setup-uv@v4

- name: Install dependencies
run: |
uv sync

- name: Install Playwright browsers
run: |
uv run playwright install chromium

- name: Run unit tests
run: |
uv run pytest tests/ -m "unit or not integration" --cov --cov-report=xml --cov-report=term

- name: Upload coverage to Codecov
uses: codecov/codecov-action@v4
with:
file: ./coverage.xml
flags: unittests
name: codecov-${{ matrix.os }}-py${{ matrix.python-version }}
token: ${{ secrets.CODECOV_TOKEN }}
if: matrix.os == 'ubuntu-latest' && matrix.python-version == '3.12'

integration-tests:

Check warning

Code scanning / CodeQL

Workflow does not contain permissions Medium test

Actions job or workflow does not limit the permissions of the GITHUB_TOKEN. Consider setting an explicit permissions block, using the following as a minimal starting point: {contents: read}
Comment on lines +12 to +54
name: Integration Tests
runs-on: ubuntu-latest

strategy:
fail-fast: false
matrix:
test-group: [smart-scraper, multi-graph, file-formats]

steps:
- name: Checkout code
uses: actions/checkout@v4

- name: Set up Python
uses: actions/setup-python@v5
with:
python-version: '3.11'

- name: Install uv
uses: astral-sh/setup-uv@v4

- name: Install dependencies
run: |
uv sync

- name: Install Playwright browsers
run: |
uv run playwright install chromium

- name: Run integration tests
env:
OPENAI_APIKEY: ${{ secrets.OPENAI_APIKEY }}
ANTHROPIC_APIKEY: ${{ secrets.ANTHROPIC_APIKEY }}
GROQ_APIKEY: ${{ secrets.GROQ_APIKEY }}
run: |
uv run pytest tests/integration/ -m integration --integration -v

- name: Upload test results
uses: actions/upload-artifact@v4
if: always()
with:
name: integration-test-results-${{ matrix.test-group }}
path: |
htmlcov/
benchmark_results/

benchmark-tests:

Check warning

Code scanning / CodeQL

Workflow does not contain permissions Medium test

Actions job or workflow does not limit the permissions of the GITHUB_TOKEN. Consider setting an explicit permissions block, using the following as a minimal starting point: {contents: read}
Comment on lines +55 to +100
name: Performance Benchmarks
runs-on: ubuntu-latest
if: github.event_name == 'push' && github.ref == 'refs/heads/main'

steps:
- name: Checkout code
uses: actions/checkout@v4

- name: Set up Python
uses: actions/setup-python@v5
with:
python-version: '3.11'

- name: Install uv
uses: astral-sh/setup-uv@v4

- name: Install dependencies
run: |
uv sync

- name: Run performance benchmarks
env:
OPENAI_APIKEY: ${{ secrets.OPENAI_APIKEY }}
run: |
uv run pytest tests/ -m benchmark --benchmark -v

- name: Upload benchmark results
uses: actions/upload-artifact@v4
with:
name: benchmark-results
path: benchmark_results/

- name: Compare with baseline
if: github.event_name == 'pull_request'
run: |
# Download baseline from main branch
# Compare and comment on PR if regression detected
echo "Benchmark comparison would run here"

code-quality:

Check warning

Code scanning / CodeQL

Workflow does not contain permissions Medium test

Actions job or workflow does not limit the permissions of the GITHUB_TOKEN. Consider setting an explicit permissions block, using the following as a minimal starting point: {contents: read}
Comment on lines +101 to +140
name: Code Quality Checks
runs-on: ubuntu-latest
if: github.event_name == 'push'

steps:
- name: Checkout code
uses: actions/checkout@v4

- name: Set up Python
uses: actions/setup-python@v5
with:
python-version: '3.11'

- name: Install uv
uses: astral-sh/setup-uv@v4

- name: Install dependencies
run: |
uv sync

- name: Run Ruff linting
run: |
uv run ruff check scrapegraphai/ tests/

- name: Run Black formatting check
run: |
uv run black --check scrapegraphai/ tests/

- name: Run isort check
run: |
uv run isort --check-only scrapegraphai/ tests/

- name: Run type checking with mypy
run: |
uv run mypy scrapegraphai/
continue-on-error: true

test-coverage-report:

Check warning

Code scanning / CodeQL

Workflow does not contain permissions Medium test

Actions job or workflow does not limit the permissions of the GITHUB_TOKEN. Consider setting an explicit permissions block, using the following as a minimal starting point: {contents: read}
Comment on lines +141 to +178
name: Test Coverage Report
needs: [unit-tests, integration-tests]
runs-on: ubuntu-latest
if: always()

steps:
- name: Checkout code
uses: actions/checkout@v4

- name: Download coverage artifacts
uses: actions/download-artifact@v4

- name: Generate coverage report
run: |
echo "Coverage report generation would run here"

- name: Comment coverage on PR
if: github.event_name == 'pull_request'
uses: py-cov-action/python-coverage-comment-action@v3
with:
GITHUB_TOKEN: ${{ github.token }}

test-summary:

Check warning

Code scanning / CodeQL

Workflow does not contain permissions Medium test

Actions job or workflow does not limit the permissions of the GITHUB_TOKEN. Consider setting an explicit permissions block, using the following as a minimal starting point: {contents: read}
Comment on lines +179 to +201
name: Test Summary
needs: [unit-tests, integration-tests, code-quality]
runs-on: ubuntu-latest
if: always()

steps:
- name: Check test results
run: |
echo "All test jobs completed"
echo "Unit tests: ${{ needs.unit-tests.result }}"
echo "Integration tests: ${{ needs.integration-tests.result }}"
echo "Code quality: ${{ needs.code-quality.result }}"

Check warning

Code scanning / CodeQL

Workflow does not contain permissions Medium test

Actions job or workflow does not limit the permissions of the GITHUB_TOKEN. Consider setting an explicit permissions block, using the following as a minimal starting point: {}
Comment on lines +202 to +213
11 changes: 11 additions & 0 deletions .semantic-commits-applied
Original file line number Diff line number Diff line change
@@ -0,0 +1,11 @@
This file marks that commits have been rewritten to follow Conventional Commits format.

Original commits:
- 9439fe5: Fix langchain import issues blocking tests
- 323f26a: Add comprehensive timeout feature documentation

Rewritten as:
- 8c9cb8b: fix(imports): update deprecated langchain imports to langchain_core
- 4c764bc: docs(timeout): add comprehensive timeout configuration guide

These follow the semantic-release convention configured in .releaserc.yml
Loading
Loading