Skip to content

feat: GPTQModel Migration#102

Merged
chichun-charlie-liu merged 15 commits intofoundation-model-stack:mainfrom
tharapalanivel:gptq_model
Jun 4, 2025
Merged

feat: GPTQModel Migration#102
chichun-charlie-liu merged 15 commits intofoundation-model-stack:mainfrom
tharapalanivel:gptq_model

Conversation

@tharapalanivel
Copy link
Copy Markdown
Collaborator

@tharapalanivel tharapalanivel commented Apr 25, 2025

Description of the change

Related issue number

How to verify the PR

Was the PR tested

  • I have added >=1 unit test(s) for every new method I have added.
  • I have ensured all unit tests pass

Signed-off-by: Thara Palanivel <130496890+tharapalanivel@users.noreply.github.com>
Signed-off-by: Thara Palanivel <130496890+tharapalanivel@users.noreply.github.com>
Signed-off-by: Thara Palanivel <130496890+tharapalanivel@users.noreply.github.com>
Signed-off-by: Thara Palanivel <130496890+tharapalanivel@users.noreply.github.com>
Signed-off-by: Thara Palanivel <130496890+tharapalanivel@users.noreply.github.com>
fix: Update gh-action-pypi-publish version
Signed-off-by: Thara Palanivel <130496890+tharapalanivel@users.noreply.github.com>
@github-actions github-actions Bot added the feat label Apr 25, 2025
Comment thread .spellcheck-en-custom.txt
@chichun-charlie-liu
Copy link
Copy Markdown
Collaborator

chichun-charlie-liu commented May 6, 2025

Everythign looks fine. The only question I have is that I'm not sure if our unit tests have enough coverage. Could you please confirm that our GPTQ example can still run successfully?

Signed-off-by: chichun-charlie-liu <57839396+chichun-charlie-liu@users.noreply.github.com>
Signed-off-by: chichun-charlie-liu <57839396+chichun-charlie-liu@users.noreply.github.com>
Signed-off-by: chichun-charlie-liu <57839396+chichun-charlie-liu@users.noreply.github.com>
Copy link
Copy Markdown
Collaborator

@chichun-charlie-liu chichun-charlie-liu left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

include all the feedback from Bayo, except OoM verifications.

Signed-off-by: chichun-charlie-liu <57839396+chichun-charlie-liu@users.noreply.github.com>
@chichun-charlie-liu chichun-charlie-liu marked this pull request as ready for review June 2, 2025 15:00
Comment thread pyproject.toml
dev = ["pre-commit>=3.0.4,<5.0"]
fp8 = ["llmcompressor"]
gptq = ["auto_gptq>0.4.2", "optimum>=1.15.0"]
gptq = ["Cython", "gptqmodel>=1.7.3"]
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should we be including exllama and exllamav2 here as well? Seems like they are "required" for gptq to work.

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

exllama and exllamav2 require GPU for installation and are indeed needed to run GPTQ modules on GPU, but not to run GPTQ on CPU or AIU via our addons for FMS. As we must support the option for users to run FMS-MO in an environment without GPUs, we can't add these two packages to our requirements.

Copy link
Copy Markdown
Collaborator

@chichun-charlie-liu chichun-charlie-liu Jun 2, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

previously these 2 exllama_kernel packages come from auto_gptq (see setup.py from autogptq, they are not "dependencies" but embedded in the auto_gptq package installation ). We didn't need to install them separately, but the new gptqmodel package indeed has renamed the embedded packages, so we still need to update our code accordingly. (done and pushed to this PR)

Signed-off-by: cliu-us <cliu@us.ibm.com>
Signed-off-by: cliu-us <cliu@us.ibm.com>
@chichun-charlie-liu chichun-charlie-liu merged commit 9fef5b2 into foundation-model-stack:main Jun 4, 2025
12 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Migrate from autoGPTQ to GPTQModel

4 participants