Skip to content

Support base_workload=all to run all workload types #61

@maryamtahhan

Description

@maryamtahhan

Enhancement Request

Currently, the llm-benchmark-concurrent-load.yml playbook requires specifying a single base_workload parameter:

ansible-playbook -i inventory/hosts.yml \
  llm-benchmark-concurrent-load.yml \
  -e "test_model=meta-llama/Llama-3.2-1B-Instruct" \
  -e "base_workload=chat" \
  -e "requested_cores=16"

Proposed Enhancement

Support base_workload=all to automatically loop over all available workload types:

ansible-playbook -i inventory/hosts.yml \
  llm-benchmark-concurrent-load.yml \
  -e "test_model=meta-llama/Llama-3.2-1B-Instruct" \
  -e "base_workload=all" \
  -e "requested_cores=16"

This would automatically run all workloads:

  • chat - Chat workload (512:256)
  • rag - RAG workload (4096:512)
  • code - Code generation (512:4096)
  • summarization - Summarization (1024:256)
  • short_codegen - Short code generation

Benefits

  1. Comprehensive testing: Easy way to run all workload types for a model
  2. Reduced manual intervention: No need to run the playbook multiple times
  3. Better coverage: Ensures all workload types are tested consistently

Implementation Notes

The playbook would need to:

  1. Detect when base_workload=all is specified
  2. Loop over all available workload types
  3. Run the 3-phase testing for each workload
  4. Collect and organize results by workload type

Related file: automation/test-execution/ansible/llm-benchmark-concurrent-load.yml

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions