feat: GPTQModel Migration #102
chichun-charlie-liu merged 15 commits into foundation-model-stack:main
Conversation
Signed-off-by: Thara Palanivel <130496890+tharapalanivel@users.noreply.github.com>
fix: Update gh-action-pypi-publish version
Signed-off-by: Thara Palanivel <130496890+tharapalanivel@users.noreply.github.com>
Everything looks fine. The only question I have is whether our unit tests have enough coverage. Could you please confirm that our GPTQ example can still run successfully?
Signed-off-by: chichun-charlie-liu <57839396+chichun-charlie-liu@users.noreply.github.com>
chichun-charlie-liu left a comment
Included all the feedback from Bayo, except the OoM verifications.
Signed-off-by: chichun-charlie-liu <57839396+chichun-charlie-liu@users.noreply.github.com>
```diff
 dev = ["pre-commit>=3.0.4,<5.0"]
 fp8 = ["llmcompressor"]
-gptq = ["auto_gptq>0.4.2", "optimum>=1.15.0"]
+gptq = ["Cython", "gptqmodel>=1.7.3"]
```
Should we be including exllama and exllamav2 here as well? Seems like they are "required" for gptq to work.
exllama and exllamav2 require a GPU for installation and are indeed needed to run GPTQ modules on GPU, but not to run GPTQ on CPU or AIU via our add-ons for FMS. Since we must support users running FMS-MO in an environment without GPUs, we can't add these two packages to our requirements.
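A minimal sketch of how code can degrade gracefully on GPU-less machines instead of declaring the kernels as hard dependencies; the extension module name is assumed from the commit message in this PR, not verified against gptqmodel:

```python
import torch

def exllama_available() -> bool:
    # The exllama kernels only install (and only make sense) on GPU machines,
    # so probe for them at runtime rather than requiring them in pyproject.
    if not torch.cuda.is_available():
        return False
    try:
        import gptqmodel_exllama_kernels  # noqa: F401  # module name assumed
    except ImportError:
        return False
    return True
```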
Previously these two exllama kernel packages came from auto_gptq (see autogptq's setup.py; they are not declared as dependencies but are embedded in the auto_gptq package installation), so we didn't need to install them separately. The new gptqmodel package has renamed the embedded packages, however, so we still need to update our code accordingly. (Done and pushed to this PR.)
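A sketch of what the rename handling might look like; `gptqmodel_exllama_kernels` appears in this PR's commit message, while the old embedded name `exllama_kernels` from auto_gptq is an assumption:

```python
try:
    # new name embedded in the gptqmodel wheels (per this PR's commit message)
    import gptqmodel_exllama_kernels as exllama_kernels
except ImportError:
    # fall back to the name embedded by the old auto_gptq installation (assumed)
    import exllama_kernels  # type: ignore[no-redef]
```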
…tqmodel_exllama_kernels`
chichun-charlie-liu merged commit 9fef5b2 into foundation-model-stack:main
Description of the change
Related issue number
How to verify the PR
Was the PR tested