Qualcomm AI Engine Direct - Decouple quantization and compile graphs for faster VLM/LLM PTQ#19220
Open
DannyYuyang-quic wants to merge 1 commit intopytorch:mainfrom
Open
Qualcomm AI Engine Direct - Decouple quantization and compile graphs for faster VLM/LLM PTQ#19220DannyYuyang-quic wants to merge 1 commit intopytorch:mainfrom
DannyYuyang-quic wants to merge 1 commit intopytorch:mainfrom