Skip to content

Commit f61d7c1

Browse files
authored
Add adapter_quant field to LoraConfig
Differential Revision: D109642451 Pull Request resolved: #20503
1 parent fa5d85a commit f61d7c1

1 file changed

Lines changed: 3 additions & 0 deletions

File tree

extension/llm/export/config/llm_config.py

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -96,6 +96,9 @@ class LoraConfig:
9696
lora_rank: int = 0
9797
lora_alpha: int = 0
9898
target_modules: List[str] = field(default_factory=list)
99+
# Per-adapter quantization/precision: "int8" | "fp16" | "fp32" | None.
100+
# Overrides the global --lora_precision flag for this adapter only.
101+
adapter_quant: Optional[str] = None
99102

100103

101104
@dataclass

0 commit comments

Comments
 (0)