Skip to content

test(formatters): query_clarification times out on CPU during full test run #1295

@planetf1

Description

@planetf1

Summary

test/formatters/granite/test_intrinsics_formatters.py::test_run_transformers[query_clarification] hangs for 180+ seconds during torch.nn.Linear.forward (stuck in F.linear()) when running on CPU/MPS without GPU.

Context

Discovered during full test run for #1294 (aLoRA input_ids fix). This is a separate pre-existing issue.

Details

  • Model: ibm-granite/granite-4.1-3b
  • Context: 4 conversation turns + 12 retrieved documents (~2000+ tokens)
  • Generation: max_completion_tokens: 512
  • Hardware: Apple Silicon (MPS), no CUDA

This is a performance/resource limitation, not a code bug. The test is already gated on CI via gh_run == 1 xfail, but it blocks local CI-aware runs.

Proposed Fix

Add a @pytest.mark.slow marker or a CPU-specific skip to prevent it from blocking local test runs, or increase the timeout for this specific test case.

Metadata

Metadata

Assignees

Labels

area/adapter-functionsGranite adapter functions: framework and adaptiers including RAG, Guardian, CorebugSomething isn't workingtesting

Type

No type
No fields configured for issues without a type.

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions