allignement by VinciGit00 · Pull Request #1059 · ScrapeGraphAI/Scrapegraph-ai

VinciGit00 · 2026-04-07T06:29:21Z

No description provided.

Pre/beta

## [1.60.0](v1.59.0...v1.60.0) (2025-06-26) ### Features * update the readme ([939e170](939e170)) ### CI * **release:** 1.60.0-beta.1 [skip ci] ([9fb5f7c](9fb5f7c))

## [1.61.0](v1.60.0...v1.61.0) (2025-07-03) ### Features * update doc ([2dc6b9b](2dc6b9b))

- Fixed typo in docstring (trasfrom -> transforms) - Added comprehensive error handling for missing schema keys - Added fallback values for malformed array items and missing references - Improved logging in SmartScraperGraph (replaced print with logger) - Added proper validation for pydantic schema structure These fixes prevent KeyError exceptions and improve production reliability.

docs: removed duplicated line

…form-bugs Fix critical schema transformation bugs and improve logging

## [1.62.0](v1.61.0...v1.62.0) (2025-08-13) ### Features * update pr ([c07b3c0](c07b3c0)) ### Docs * removed duplicated line ([c2abb9f](c2abb9f))

## [1.63.0](v1.62.0...v1.63.0) (2025-10-22) ### Features * update model tokens ([79db9b9](79db9b9))

## [1.63.1](v1.63.0...v1.63.1) (2025-10-24) ### Bug Fixes * url redirect ([8f0433c](8f0433c))

- Add timeout parameter to FetchNode (default: 30 seconds) - Apply timeout to requests.get() calls to prevent indefinite hangs - Implement timeout for PDF parsing using ThreadPoolExecutor - Propagate timeout to ChromiumLoader via loader_kwargs - Add comprehensive unit tests for timeout functionality - Fully backward compatible (timeout can be disabled with None) Fixes issue with requests.get() and PDF parsing blocking indefinitely on slow/unresponsive servers or large documents. Usage: node_config={'timeout': 30} # Custom timeout node_config={'timeout': None} # Disable timeout node_config={} # Use default 30s timeout

…ation feat: Add configurable timeout to FetchNode

## [1.64.0](v1.63.1...v1.64.0) (2025-11-06) ### Features * Add configurable timeout to FetchNode ([e81a4ed](e81a4ed))

docs: update korean readme

Co-authored-by: VinciGit00 <88108002+VinciGit00@users.noreply.github.com>

@VinciGit00

Rewrote commit history to follow Conventional Commits format for semantic-release: - fix(imports): langchain imports update - docs(timeout): timeout configuration guide Addresses feedback from @VinciGit00 to use semantic release commit format. Co-authored-by: VinciGit00 <88108002+VinciGit00@users.noreply.github.com>

Add SEMANTIC_COMMITS.md with instructions for rewriting commit history to follow Conventional Commits format. Includes the exact commit messages needed and steps for manual rebase. The commits need to be rewritten as: - fix(imports): for the langchain import fixes - docs(timeout): for the timeout documentation Automated tools cannot force-push, so maintainer needs to apply manually. Co-authored-by: VinciGit00 <88108002+VinciGit00@users.noreply.github.com>

@VinciGit00

Update SEMANTIC_COMMITS.md to use feat(timeout) instead of docs(timeout) as requested. The timeout feature documentation exposes user-facing functionality and warrants a feature-level semantic version bump. Changed commit 323f26a recommendation from: - docs(timeout): add comprehensive timeout configuration guide To: - feat(timeout): add configurable timeout support for FetchNode Addresses feedback from @VinciGit00. Co-authored-by: VinciGit00 <88108002+VinciGit00@users.noreply.github.com>

## [1.73.0](v1.72.0...v1.73.0) (2026-01-30) ### Features * update model tokens ([9c24ecc](9c24ecc))

…point use custom api for tracing

## [1.73.1](v1.73.0...v1.73.1) (2026-02-16) ### Bug Fixes * handle list content in telemetry event validation ([b17b154](b17b154))

MiniMax provides an OpenAI-compatible API, making integration straightforward. This adds: - MiniMax model wrapper class (OpenAI-compatible) - Model token mappings for MiniMax-M1, M2, and M2.5 models - Provider routing in abstract_graph factory - README update listing MiniMax as a supported provider

## [1.74.0](v1.73.1...v1.74.0) (2026-03-15) ### Features * add MiniMax as a supported LLM provider ([6a2f8ec](6a2f8ec))

- Add MiniMax-M2.7 and MiniMax-M2.7-highspeed to model list - Set MiniMax-M2.7 as default model (first in list) - Keep all previous models as alternatives - Add unit tests for MiniMax model configuration

feat: upgrade MiniMax default model to M2.7

## [1.75.0](v1.74.0...v1.75.0) (2026-03-18) ### Features * upgrade MiniMax default model to M2.7 ([f47be50](f47be50))

Library code should never write directly to stdout. Migrated all 13 print() calls to use the existing get_logger() infrastructure with appropriate log levels (debug/info/warning).

…logging fix: replace print() statements with proper logging across codebase

## [1.75.1](v1.75.0...v1.75.1) (2026-03-24) ### Bug Fixes * replace print() statements with proper logging across codebase ([1d9551a](1d9551a))

- Limit OS matrix to ubuntu-only on PRs (macOS/Windows on push only) - Reduce Python matrix from 3.10/3.11/3.12 to 3.10/3.12 - Run benchmarks only on push to main - Run code-quality checks only on push events - Remove Playwright install from benchmark job - Delete duplicate code-quality.yml workflow Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

ci: reduce GitHub Actions costs by ~85% on PRs

+    name: Unit Tests (Python ${{ matrix.python-version }})
+    runs-on: ${{ matrix.os }}
+
+    strategy:
+      fail-fast: false
+      matrix:
+        os: ${{ github.event_name == 'pull_request' && fromJSON('["ubuntu-latest"]') || fromJSON('["ubuntu-latest", "macos-latest", "windows-latest"]') }}
+        python-version: ['3.10', '3.12']
+
+    steps:
+      - name: Checkout code
+        uses: actions/checkout@v4
+
+      - name: Set up Python ${{ matrix.python-version }}
+        uses: actions/setup-python@v5
+        with:
+          python-version: ${{ matrix.python-version }}
+
+      - name: Install uv
+        uses: astral-sh/setup-uv@v4
+
+      - name: Install dependencies
+        run: |
+          uv sync
+
+      - name: Install Playwright browsers
+        run: |
+          uv run playwright install chromium
+
+      - name: Run unit tests
+        run: |
+          uv run pytest tests/ -m "unit or not integration" --cov --cov-report=xml --cov-report=term
+
+      - name: Upload coverage to Codecov
+        uses: codecov/codecov-action@v4
+        with:
+          file: ./coverage.xml
+          flags: unittests
+          name: codecov-${{ matrix.os }}-py${{ matrix.python-version }}
+          token: ${{ secrets.CODECOV_TOKEN }}
+        if: matrix.os == 'ubuntu-latest' && matrix.python-version == '3.12'
+
+  integration-tests:


+    name: Integration Tests
+    runs-on: ubuntu-latest
+
+    strategy:
+      fail-fast: false
+      matrix:
+        test-group: [smart-scraper, multi-graph, file-formats]
+
+    steps:
+      - name: Checkout code
+        uses: actions/checkout@v4
+
+      - name: Set up Python
+        uses: actions/setup-python@v5
+        with:
+          python-version: '3.11'
+
+      - name: Install uv
+        uses: astral-sh/setup-uv@v4
+
+      - name: Install dependencies
+        run: |
+          uv sync
+
+      - name: Install Playwright browsers
+        run: |
+          uv run playwright install chromium
+
+      - name: Run integration tests
+        env:
+          OPENAI_APIKEY: ${{ secrets.OPENAI_APIKEY }}
+          ANTHROPIC_APIKEY: ${{ secrets.ANTHROPIC_APIKEY }}
+          GROQ_APIKEY: ${{ secrets.GROQ_APIKEY }}
+        run: |
+          uv run pytest tests/integration/ -m integration --integration -v
+
+      - name: Upload test results
+        uses: actions/upload-artifact@v4
+        if: always()
+        with:
+          name: integration-test-results-${{ matrix.test-group }}
+          path: |
+            htmlcov/
+            benchmark_results/
+
+  benchmark-tests:


+    name: Performance Benchmarks
+    runs-on: ubuntu-latest
+    if: github.event_name == 'push' && github.ref == 'refs/heads/main'
+
+    steps:
+      - name: Checkout code
+        uses: actions/checkout@v4
+
+      - name: Set up Python
+        uses: actions/setup-python@v5
+        with:
+          python-version: '3.11'
+
+      - name: Install uv
+        uses: astral-sh/setup-uv@v4
+
+      - name: Install dependencies
+        run: |
+          uv sync
+
+      - name: Run performance benchmarks
+        env:
+          OPENAI_APIKEY: ${{ secrets.OPENAI_APIKEY }}
+        run: |
+          uv run pytest tests/ -m benchmark --benchmark -v
+
+      - name: Upload benchmark results
+        uses: actions/upload-artifact@v4
+        with:
+          name: benchmark-results
+          path: benchmark_results/
+
+      - name: Compare with baseline
+        if: github.event_name == 'pull_request'
+        run: |
+          # Download baseline from main branch
+          # Compare and comment on PR if regression detected
+          echo "Benchmark comparison would run here"
+
+  code-quality:


+    name: Code Quality Checks
+    runs-on: ubuntu-latest
+    if: github.event_name == 'push'
+
+    steps:
+      - name: Checkout code
+        uses: actions/checkout@v4
+
+      - name: Set up Python
+        uses: actions/setup-python@v5
+        with:
+          python-version: '3.11'
+
+      - name: Install uv
+        uses: astral-sh/setup-uv@v4
+
+      - name: Install dependencies
+        run: |
+          uv sync
+
+      - name: Run Ruff linting
+        run: |
+          uv run ruff check scrapegraphai/ tests/
+
+      - name: Run Black formatting check
+        run: |
+          uv run black --check scrapegraphai/ tests/
+
+      - name: Run isort check
+        run: |
+          uv run isort --check-only scrapegraphai/ tests/
+
+      - name: Run type checking with mypy
+        run: |
+          uv run mypy scrapegraphai/
+        continue-on-error: true
+
+  test-coverage-report:


+    name: Test Coverage Report
+    needs: [unit-tests, integration-tests]
+    runs-on: ubuntu-latest
+    if: always()
+
+    steps:
+      - name: Checkout code
+        uses: actions/checkout@v4
+
+      - name: Download coverage artifacts
+        uses: actions/download-artifact@v4
+
+      - name: Generate coverage report
+        run: |
+          echo "Coverage report generation would run here"
+
+      - name: Comment coverage on PR
+        if: github.event_name == 'pull_request'
+        uses: py-cov-action/python-coverage-comment-action@v3
+        with:
+          GITHUB_TOKEN: ${{ github.token }}
+
+  test-summary:


+    name: Test Summary
+    needs: [unit-tests, integration-tests, code-quality]
+    runs-on: ubuntu-latest
+    if: always()
+
+    steps:
+      - name: Check test results
+        run: |
+          echo "All test jobs completed"
+          echo "Unit tests: ${{ needs.unit-tests.result }}"
+          echo "Integration tests: ${{ needs.integration-tests.result }}"
+          echo "Code quality: ${{ needs.code-quality.result }}"


github-actions · 2026-04-07T06:33:59Z

🎉 This PR is included in version 1.76.0-beta.1 🎉

The release is available on:

v1.76.0-beta.1
GitHub release

Your semantic-release bot 📦🚀

github-actions · 2026-04-19T08:04:42Z

🎉 This PR is included in version 2.1.0-beta.1 🎉

The release is available on:

v2.1.0-beta.1
GitHub release

Your semantic-release bot 📦🚀

github-actions · 2026-05-16T19:39:08Z

🎉 This PR is included in version 2.2.0-beta.1 🎉

The release is available on:

v2.2.0-beta.1
GitHub release

Your semantic-release bot 📦🚀

VinciGit00 and others added 30 commits June 26, 2025 20:35

Merge pull request #994 from ScrapeGraphAI/pre/beta

4045f04

Pre/beta

ci(release): 1.60.0 [skip ci]

ad6845d

## [1.60.0](v1.59.0...v1.60.0) (2025-06-26) ### Features * update the readme ([939e170](939e170)) ### CI * **release:** 1.60.0-beta.1 [skip ci] ([9fb5f7c](9fb5f7c))

feat: update doc

2dc6b9b

ci(release): 1.61.0 [skip ci]

132823b

## [1.61.0](v1.60.0...v1.61.0) (2025-07-03) ### Features * update doc ([2dc6b9b](2dc6b9b))

docs: removed duplicated line

c2abb9f

Merge pull request #1004 from alecontuu/readme_fix

7fe566b

docs: removed duplicated line

Update README.md

72b43b3

Merge pull request #1001 from Mirza-Samad-Ahmed-Baig/fix-schema-trans…

e65da4d

…form-bugs Fix critical schema transformation bugs and improve logging

feat: update pr

c07b3c0

ci(release): 1.62.0 [skip ci]

5f8dbd3

## [1.62.0](v1.61.0...v1.62.0) (2025-08-13) ### Features * update pr ([c07b3c0](c07b3c0)) ### Docs * removed duplicated line ([c2abb9f](c2abb9f))

doc: 1$ banner

739b05a

feat: update model tokens

79db9b9

ci(release): 1.63.0 [skip ci]

7346c26

## [1.63.0](v1.62.0...v1.63.0) (2025-10-22) ### Features * update model tokens ([79db9b9](79db9b9))

fix: url redirect

8f0433c

ci(release): 1.63.1 [skip ci]

365761a

## [1.63.1](v1.63.0...v1.63.1) (2025-10-24) ### Bug Fixes * url redirect ([8f0433c](8f0433c))

Merge pull request #1020 from Xyerophyte/feature/add-timeout-configur…

eeffa33

…ation feat: Add configurable timeout to FetchNode

Empty commit

914b85c

ci(release): 1.64.0 [skip ci]

93b3c5d

## [1.64.0](v1.63.1...v1.64.0) (2025-11-06) ### Features * Add configurable timeout to FetchNode ([e81a4ed](e81a4ed))

Remove downloads badge from README

32d5636

docs: update korean readme

5516ec6

Add download badge and linting badges to README

3dc6484

Merge pull request #1022 from PzaThief/main

0e12bac

docs: update korean readme

Initial plan

6d13212

Fix langchain import issues blocking tests

9439fe5

Co-authored-by: VinciGit00 <88108002+VinciGit00@users.noreply.github.com>

Add comprehensive timeout feature documentation

323f26a

Co-authored-by: VinciGit00 <88108002+VinciGit00@users.noreply.github.com>

VinciGit00 and others added 21 commits January 30, 2026 16:45

feat: update model tokens

9c24ecc

ci(release): 1.73.0 [skip ci]

7dc1956

## [1.73.0](v1.72.0...v1.73.0) (2026-01-30) ### Features * update model tokens ([9c24ecc](9c24ecc))

use custom api for tracing

518945d

fix: handle list content in telemetry event validation

b17b154

remove client side validation to save cpu usage for user

96dc59c

Merge pull request #1038 from Vikrant-Khedkar/feat/custom-tracing-end…

abfa8f1

…point use custom api for tracing

ci(release): 1.73.1 [skip ci]

c6fef1f

## [1.73.1](v1.73.0...v1.73.1) (2026-02-16) ### Bug Fixes * handle list content in telemetry event validation ([b17b154](b17b154))

Add initial test file

e439021

Merge pull request #1042 from ramanathnk/agentic-bench-recigcdpkcukczsgs

09fa945

Merge pull request #1046 from octo-patch/feature/add-minimax-provider

d83ec57

ci(release): 1.74.0 [skip ci]

a264b41

## [1.74.0](v1.73.1...v1.74.0) (2026-03-15) ### Features * add MiniMax as a supported LLM provider ([6a2f8ec](6a2f8ec))

Update README.md

1c21e5a

feat: upgrade MiniMax default model to M2.7

f47be50

- Add MiniMax-M2.7 and MiniMax-M2.7-highspeed to model list - Set MiniMax-M2.7 as default model (first in list) - Keep all previous models as alternatives - Add unit tests for MiniMax model configuration

Merge pull request #1047 from octo-patch/feature/upgrade-minimax-m27

b17e76b

feat: upgrade MiniMax default model to M2.7

ci(release): 1.75.0 [skip ci]

cf9b87e

## [1.75.0](v1.74.0...v1.75.0) (2026-03-18) ### Features * upgrade MiniMax default model to M2.7 ([f47be50](f47be50))

fix: replace print() statements with proper logging across codebase

1d9551a

Library code should never write directly to stdout. Migrated all 13 print() calls to use the existing get_logger() infrastructure with appropriate log levels (debug/info/warning).

Merge pull request #1053 from Vikrant-Khedkar/fix/replace-print-with-…

7208bff

…logging fix: replace print() statements with proper logging across codebase

ci(release): 1.75.1 [skip ci]

ed8685d

## [1.75.1](v1.75.0...v1.75.1) (2026-03-24) ### Bug Fixes * replace print() statements with proper logging across codebase ([1d9551a](1d9551a))

Merge pull request #1054 from ScrapeGraphAI/ci/reduce-actions-costs

7b5733d

ci: reduce GitHub Actions costs by ~85% on PRs

dosubot Bot added the size:XXL This PR changes 1000+ lines, ignoring generated files. label Apr 7, 2026

github-advanced-security AI found potential problems Apr 7, 2026

View reviewed changes

dosubot Bot added documentation Improvements or additions to documentation refactor refactoring of folders labels Apr 7, 2026

VinciGit00 merged commit 9300319 into pre/beta Apr 7, 2026
22 of 29 checks passed

github-actions Bot added the released on @dev label Apr 7, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

allignement#1059

allignement#1059
VinciGit00 merged 101 commits into
pre/betafrom
main

VinciGit00 commented Apr 7, 2026

Uh oh!

Uh oh!

github-actions Bot commented Apr 7, 2026

Uh oh!

github-actions Bot commented Apr 19, 2026

Uh oh!

github-actions Bot commented May 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

17 participants

Uh oh!

Uh oh!

Conversation

VinciGit00 commented Apr 7, 2026

Uh oh!

Uh oh!

github-actions Bot commented Apr 7, 2026

Uh oh!

github-actions Bot commented Apr 19, 2026

Uh oh!

github-actions Bot commented May 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

17 participants