Name	Name	Last commit message	Last commit date
parent directory ..
README.md	README.md
config.yaml	config.yaml
evaluator.py	evaluator.py
initial_program.py	initial_program.py
requirements.txt	requirements.txt

Web Scraper Evolution with optillm

This example demonstrates how to use optillm with OpenEvolve to leverage test-time compute techniques for improved code evolution accuracy. We'll evolve a web scraper that extracts structured data from documentation pages, showcasing two key optillm features:

readurls plugin: Automatically fetches webpage content when URLs are mentioned in prompts
Inference optimization: Uses techniques like Mixture of Agents (MoA) to improve response accuracy

Why optillm?

Traditional LLM usage in code evolution has limitations:

LLMs may not have knowledge of the latest library documentation
Single LLM calls can produce inconsistent or incorrect code
No ability to dynamically fetch relevant documentation during evolution

optillm solves these problems by:

Dynamic Documentation Fetching: The readurls plugin automatically fetches and includes webpage content when URLs are detected in prompts
Test-Time Compute: Techniques like MoA generate multiple responses and synthesize the best solution
Flexible Routing: Can route requests to different models based on requirements

Problem Description

We're evolving a web scraper that extracts API documentation from Python library documentation pages. The scraper needs to:

Parse HTML documentation pages
Extract function signatures, descriptions, and parameters
Structure the data in a consistent format
Handle various documentation formats

This is an ideal problem for optillm because:

The LLM benefits from seeing actual documentation HTML structure
Accuracy is crucial for correct parsing
Different documentation sites have different formats

Architecture

┌─────────────────┐     ┌─────────────────┐     ┌─────────────────┐
│   OpenEvolve    │────▶│     optillm     │────▶│   Local LLM     │
│                 │     │  (proxy:8000)   │     │  (Qwen-0.5B)    │
└─────────────────┘     └─────────────────┘     └─────────────────┘
                               │
                               ├── readurls plugin
                               │   (fetches web content)
                               │
                               └── MoA optimization
                                   (improves accuracy)

Setup Instructions

1. Install and Configure optillm

# Clone optillm
git clone https://github.com/codelion/optillm.git
cd optillm

# Install dependencies
pip install -r requirements.txt

# Start optillm proxy with local inference server (in a separate terminal)
export OPTILLM_API_KEY=optillm
python optillm.py --port 8000

optillm will now be running on http://localhost:8000 with its built-in local inference server.

Note for Non-Mac Users: This example uses Qwen/Qwen3-1.7B-MLX-bf16 which is optimized for Apple Silicon (M1/M2/M3 chips). If you're not using a Mac, you should:

For NVIDIA GPUs: Use a CUDA-compatible model like:
- Qwen/Qwen2.5-32B-Instruct (best quality, high VRAM)
- Qwen/Qwen2.5-14B-Instruct (good balance)
- meta-llama/Llama-3.1-8B-Instruct (efficient option)
- Qwen/Qwen2.5-7B-Instruct (lower VRAM)
For CPU-only: Use a smaller model like:
- Qwen/Qwen2.5-7B-Instruct (7B parameters)
- meta-llama/Llama-3.2-3B-Instruct (3B parameters)
- Qwen/Qwen2.5-3B-Instruct (3B parameters)

Update the config: Replace the model names in config.yaml with your chosen model:

models:
  - name: "readurls-your-chosen-model"
    weight: 0.9
  - name: "moa&readurls-your-chosen-model"
    weight: 0.1

2. Install Web Scraping Dependencies

# Install required Python packages for the example
pip install -r examples/web_scraper_optillm/requirements.txt

3. Run the Evolution

# From the openevolve root directory
export OPENAI_API_KEY=optillm
python openevolve-run.py examples/web_scraper_optillm/initial_program.py \
    examples/web_scraper_optillm/evaluator.py \
    --config examples/web_scraper_optillm/config.yaml \
    --iterations 100

The configuration demonstrates both optillm capabilities:

Primary model (90%): readurls-Qwen/Qwen3-1.7B-MLX-bf16 - fetches URLs mentioned in prompts
Secondary model (10%): moa&readurls-Qwen/Qwen3-1.7B-MLX-bf16 - uses Mixture of Agents for improved accuracy

How It Works

1. readurls Plugin

When the evolution prompt contains URLs (e.g., "Parse the documentation at https://docs.python.org/3/library/json.html"), the readurls plugin:

Detects the URL in the prompt
Fetches the webpage content
Extracts text and table data
Appends it to the prompt as context

This ensures the LLM has access to the latest documentation structure when generating code.

2. Mixture of Agents (MoA)

The MoA technique improves accuracy by:

Generating 3 different solutions to the problem
Having each "agent" critique all solutions
Synthesizing a final, improved solution based on the critiques

This is particularly valuable for complex parsing logic where multiple approaches might be valid.

3. Evolution Process

Initial Program: A basic BeautifulSoup scraper that extracts simple text
Evaluator: Tests the scraper against real documentation pages, checking:
- Correct extraction of function names
- Accurate parameter parsing
- Proper handling of edge cases
Evolution: The LLM improves the scraper by:
- Fetching actual documentation HTML (via readurls)
- Generating multiple parsing strategies (via MoA)
- Learning from evaluation feedback

Actual Evolution Results

Based on our evolution run, here's what we achieved:

Performance Metrics

Initial Score: 0.6864 (72.2% accuracy, 32.5% completeness)
Final Score: 0.7458 (83.3% accuracy, 37.5% completeness)
Improvement: +8.6% overall performance (+11.1% accuracy)
Time to Best: Found optimal solution by iteration 3 (within 10 minutes)

Key Evolution Improvements

Initial Program (Basic approach):

# Simple code block parsing
code_blocks = soup.find_all('code')
for block in code_blocks:
    text = block.get_text(strip=True)
    if '(' in text and ')' in text:
        # Extract function info

Evolved Program (Sophisticated multi-strategy parsing):

# 1. Code blocks
code_blocks = soup.find_all('code')
# 2. Headers (h3)
h3_blocks = soup.find_all('h3')
# 3. Documentation signatures
dt_blocks = soup.find_all('dt', class_='sig')
# 4. Table-based documentation (NEW!)
table_blocks = soup.find_all('table')
for block in table_blocks:
    rows = block.find_all('tr')
    for row in rows:
        cells = row.find_all('td')
        if len(cells) >= 2:
            signature = cells[0].get_text(strip=True)
            description = cells[1].get_text(strip=True)
            # Extract structured function data

What optillm Contributed

Early Discovery: Found best solution by iteration 3, suggesting enhanced reasoning helped quickly identify effective parsing strategies
Table Parsing Innovation: The evolved program added sophisticated table parsing logic that wasn't in the initial version
Robust Architecture: Multiple fallback strategies ensure the scraper works across different documentation formats

Monitoring Progress

Watch the evolution progress and see how optillm enhances the process:

# View optillm logs (in the terminal running optillm)
# You'll see:
# - URLs being fetched by readurls
# - Multiple completions generated by MoA
# - Final synthesized responses

# View OpenEvolve logs
tail -f examples/web_scraper_optillm/openevolve_output/evolution.log

Results Analysis

After 100 iterations of evolution, here's what we achieved:

Quantitative Results

Accuracy: 72.2% → 83.3% (+11.1% improvement)
Completeness: 32.5% → 37.5% (+5% improvement)
Robustness: 100% (maintained - no parsing errors)
Combined Score: 0.6864 → 0.7458 (+8.6% improvement)

Qualitative Improvements

Multi-Strategy Parsing: Added table-based extraction for broader documentation format support
Robust Function Detection: Improved pattern matching for function signatures
Better Parameter Extraction: Enhanced parameter parsing from various HTML structures
Error Resilience: Maintained 100% robustness with no parsing failures

Evolution Pattern

Early Success: Best solution found by iteration 3 (within 10 minutes)
Plateau Effect: Algorithm maintained optimal score from iteration 3-90
Island Migration: MAP-Elites explored alternatives but local optimum was strong

Compare the evolution:

# View the final evolved program
cat examples/web_scraper_optillm/openevolve_output/best/best_program.py

# Compare initial vs final
diff examples/web_scraper_optillm/initial_program.py \
     examples/web_scraper_optillm/openevolve_output/best/best_program.py

Key Insights from This Run

optillm Enhanced Early Discovery: The best solution was found by iteration 3, suggesting optillm's test-time compute (MoA) and documentation access (readurls) helped quickly identify effective parsing strategies.
Smaller Models Can Excel: The 1.7B Qwen model with optillm achieved significant improvements (+8.6%), proving that test-time compute can make smaller models highly effective.
Local Optimization Works: Fast inference times (<100ms after initial) show that local models with optillm provide both efficiency and quality.
Pattern: Quick Discovery, Then Plateau: Evolution found a strong local optimum quickly. This suggests the current test cases were well-solved by the table parsing innovation.
optillm Plugin Value: The evolved program's sophisticated multi-strategy approach (especially table parsing) likely benefited from optillm's enhanced reasoning capabilities.

Available optillm Plugins and Techniques

optillm offers many plugins and optimization techniques. Here are the most useful for code evolution:

Core Plugins

readurls: Automatically fetches web content when URLs are detected in prompts
executecode: Runs code and includes output in the response (great for validation)

Optimization Techniques

moa (Mixture of Agents): Generates multiple responses, critiques them, and synthesizes the best
cot_reflection: Uses chain-of-thought reasoning with self-reflection
rstar: Advanced reasoning technique for complex problems
bon (Best of N): Generates N responses and selects the best one
z3_solver: Uses Z3 theorem prover for logical reasoning
rto (Round Trip Optimization): Optimizes responses through iterative refinement

Combining Techniques

You can chain multiple techniques using &:

llm:
  models:
    # Use chain-of-thought + readurls for primary model
    - name: "cot_reflection&readurls-Qwen/Qwen3-1.7B-MLX-bf16"
      weight: 0.7
    # Use MoA + code execution for secondary validation
    - name: "moa&executecode-Qwen/Qwen3-1.7B-MLX-bf16"
      weight: 0.3

Recommended Combinations for Code Evolution

For Documentation-Heavy Tasks: cot_reflection&readurls
For Complex Logic: moa&executecode
For Mathematical Problems: cot_reflection&z3_solver
For Validation-Critical Code: bon&executecode

Troubleshooting

optillm not responding: Ensure it's running on port 8000 with OPTILLM_API_KEY=optillm
Model not found: Make sure optillm's local inference server is working (check optillm logs)
Slow evolution: MoA generates multiple completions, so it's slower but more accurate

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

README.md

Web Scraper Evolution with optillm

Why optillm?

Problem Description

Architecture

Setup Instructions

1. Install and Configure optillm

2. Install Web Scraping Dependencies

3. Run the Evolution

How It Works

1. readurls Plugin

2. Mixture of Agents (MoA)

3. Evolution Process

Actual Evolution Results

Performance Metrics

Key Evolution Improvements

What optillm Contributed

Monitoring Progress

Results Analysis

Quantitative Results

Qualitative Improvements

Evolution Pattern

Key Insights from This Run

Available optillm Plugins and Techniques

Core Plugins

Optimization Techniques

Combining Techniques

Recommended Combinations for Code Evolution

Troubleshooting

Further Reading

Uh oh!

FilesExpand file tree

web_scraper_optillm

Directory actions

More options

Directory actions

More options

Latest commit

History

web_scraper_optillm

Folders and files

parent directory

README.md

Web Scraper Evolution with optillm

Why optillm?

Problem Description

Architecture

Setup Instructions

1. Install and Configure optillm

2. Install Web Scraping Dependencies

3. Run the Evolution

How It Works

1. readurls Plugin

2. Mixture of Agents (MoA)

3. Evolution Process

Actual Evolution Results

Performance Metrics

Key Evolution Improvements

What optillm Contributed

Monitoring Progress

Results Analysis

Quantitative Results

Qualitative Improvements

Evolution Pattern

Key Insights from This Run

Available optillm Plugins and Techniques

Core Plugins

Optimization Techniques

Combining Techniques

Recommended Combinations for Code Evolution

Troubleshooting

Further Reading