Skip to content

Commit 8960c01

Browse files
committed
update README.md
Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai>
1 parent 698bc95 commit 8960c01

1 file changed

Lines changed: 18 additions & 17 deletions

File tree

README.md

Lines changed: 18 additions & 17 deletions
Original file line numberDiff line numberDiff line change
@@ -21,6 +21,7 @@
2121

2222
## Latest News
2323

24+
* 05/13/2026 7.1.0-dev `main`: ✨ Added `minicpm_v_4_6` model support
2425
* 05/07/2026 7.1.0-dev `main`: ✨ Added `GLM-4.5V`, `GLM-4.6V`, `Zamba` and `Zamba2` model support
2526
* 04/29/2026 7.1.0-dev `main`: ✨ Added PoolSideAI `Laguna` model support for fused Laguna MoE checkpoints. Added `ERNIE 4.5 VL MoE`, `Ling-2.6-flash` and NVIDIA `Nemotron 3 Nano Omni` model support.
2627
* 04/28/2026 [7.0.0](https://github.com/ModelCloud/GPTQModel/releases/tag/v7.0.0): 🚀 Added Huawei Ascend NPU support through native torch kernels for GPTQ, AWQ, ParoQuant, GGUF, QQQ, and EXL3. Added `internvl_chat`, `gemma3n`, `GLM-OCR`, `GLM-ASR`, and `falcon_mamba` model support.
@@ -243,23 +244,23 @@ Selected public references where teams or companies explicitly mention GPT-QMode
243244
<img src=https://github.com/user-attachments/assets/c1b89394-f8f6-44e5-9949-bef15a124723 width="51%"> <img src=https://github.com/user-attachments/assets/23901236-10c5-4435-ac2f-06cf2e097f1e width="47%">
244245

245246
## Model Support
246-
| Model | | | | | | | | | |
247-
|--------------------------|---|---------------------------------|---|------------------|---|-------------------|---|-------------------------|---|
248-
| Apertus || EXAONE 3/4 || Dots1 || Mistral3 || Qwen 2/3/3.5 (Next/MoE) ||
249-
| Baichuan || Falcon (H1 / Mamba) || InternLM 1/2/2.5 || Mixtral || Qwen 2/2.5/3 VL ||
250-
| Bloom || FastVLM || Kimi K2 || MobileLLM || Qwen 2.5/3 Omni ||
251-
| ChatGLM || Gemma 1-4 / 3n || Klear || MOSS || RefinedWeb ||
252-
| CodeGen || GPTBigCode || LING/RING || MPT || StableLM ||
253-
| Cohere 1-2 || GPT-Neo / NeoX || Llama 1-3.3 || Nemotron H / Omni || StarCoder2 ||
254-
| DBRX Converted || GPT-2 || Llama 3.2 VL || Nemotron Ultra || TeleChat2 ||
255-
| Deci || GPT-J || Llama 4 || OPT || Trinity ||
256-
| DeepSeek-V2/V3/R1 || GPT-OSS || LongCat Flash || OLMo2 / LLaDA2 || Yi ||
257-
| DeepSeek-V2-Lite || Granite / Granite MoE || LongLLaMA || Ovis 1.6/2 || Seed-OSS ||
258-
| Dream || GRIN-MoE || Instella || Phi 1-4 || Voxtral ||
259-
| ERNIE 4.5 / MoE / VL MoE || GLM 4/4V/4.5V/4.6V/5/5.1/OCR/ASR || GLM4 MoE / Lite / 4.5V MoE || MiniCPM 3/O/V || PanGu-α ||
260-
| XVERSE || Brumby || Hymba || Mistral || Qwen 1/2/3/3.5 ||
261-
| MiniMax M2 || AfMoE || Bailing-MoE || LFM2-MoE || Marin ||
262-
| InternVL Chat || Laguna || Zamba / Zamba2 || | | | |
247+
| Model | | | | | | | | | |
248+
|--------------------------|---|---------------------------------|---|------------------|---|---------------------|---|-------------------------|---|
249+
| Apertus || EXAONE 3/4 || Dots1 || Mistral3 || Qwen 2/3/3.5 (Next/MoE) ||
250+
| Baichuan || Falcon (H1 / Mamba) || InternLM 1/2/2.5 || Mixtral || Qwen 2/2.5/3 VL ||
251+
| Bloom || FastVLM || Kimi K2 || MobileLLM || Qwen 2.5/3 Omni ||
252+
| ChatGLM || Gemma 1-4 / 3n || Klear || MOSS || RefinedWeb ||
253+
| CodeGen || GPTBigCode || LING/RING || MPT || StableLM ||
254+
| Cohere 1-2 || GPT-Neo / NeoX || Llama 1-3.3 || Nemotron H / Omni || StarCoder2 ||
255+
| DBRX Converted || GPT-2 || Llama 3.2 VL || Nemotron Ultra || TeleChat2 ||
256+
| Deci || GPT-J || Llama 4 || OPT || Trinity ||
257+
| DeepSeek-V2/V3/R1 || GPT-OSS || LongCat Flash || OLMo2 / LLaDA2 || Yi ||
258+
| DeepSeek-V2-Lite || Granite / Granite MoE || LongLLaMA || Ovis 1.6/2 || Seed-OSS ||
259+
| Dream || GRIN-MoE || Instella || Phi 1-4 || Voxtral ||
260+
| ERNIE 4.5 / MoE / VL MoE || GLM 4/4V/4.5V/4.6V/5/5.1/OCR/ASR || GLM4 MoE / Lite / 4.5V MoE || MiniCPM 3/O/V/V 4_6 || PanGu-α ||
261+
| XVERSE || Brumby || Hymba || Mistral || Qwen 1/2/3/3.5 ||
262+
| MiniMax M2 || AfMoE || Bailing-MoE || LFM2-MoE || Marin ||
263+
| InternVL Chat || Laguna || Zamba / Zamba2 || | | | |
263264

264265
Prism Bonsai GGUF checkpoints are supported for inference only through GPT-QModel's native GGUF path and internal GGUF runtime. Bonsai checkpoints load through the normal model path or repo argument and do not require the external `gguf` package. Prism model quantization is not included.
265266

0 commit comments

Comments
 (0)