fix:Add tiny granite vision model and update ReadMe for vision model support. by Abhishek-TAMU · Pull Request #533 · foundation-model-stack/fms-hf-tuning

Abhishek-TAMU · 2025-04-18T21:45:05Z

Description of the change

Fix for supporting vision model tuning.

Changes:

Removal of use_cache from AutoModelForVision2Seq.from_pretrained as vision model of type (MllamaForConditionalGeneration, LlavaForConditionalGeneration and LlavaNextForConditionalGeneration) doesn't support it.
ReadMe update to include LoRA tuning support of vision models along with Full fine tuning.
Added tiny granite vision model along with image dataset file.

Related issue number

Issue: https://github.ibm.com/ai-foundation/watson-fm-stack-tracker/issues/1610

How to verify the PR

Was the PR tested

I have added >=1 unit test(s) for every new method I have added.
I have ensured all unit tests pass

Signed-off-by: Abhishek <maurya.abhishek@ibm.com>

github-actions · 2025-04-18T21:45:17Z

Thanks for making a pull request! 😃
One of the maintainers will review and advise on the next steps.

Abhishek-TAMU · 2025-04-18T22:03:00Z

@dushyantbehl Addition of tiny vision models and image dataset file in this PR would also be used in further v2 work in writing end-to-end unit test case as currently I couldn't write e2e test case as we just accept PIL format image as input and in v2 plan to accept PNG and JPEG image format.

dushyantbehl · 2025-04-22T13:34:50Z

+### Constants used for model path
+PREDEFINED_MODEL_PATH = os.path.join(os.path.dirname(__file__))
+LLAMA_VISION_MODEL_NAME = os.path.join(PREDEFINED_MODEL_PATH, "tiny_llama_vision_model")
+GRANITE_VISION_MODEL_NAME = os.path.join(


Suggested change

GRANITE_VISION_MODEL_NAME = os.path.join(

TINY_GRANITE_VISION_MODEL_NAME = os.path.join(

similarly for llama to tiny llama

dushyantbehl

minor change

dushyantbehl · 2025-04-22T13:35:14Z

+### Constants used for model path
+PREDEFINED_MODEL_PATH = os.path.join(os.path.dirname(__file__))
+LLAMA_VISION_MODEL_NAME = os.path.join(PREDEFINED_MODEL_PATH, "tiny_llama_vision_model")
+GRANITE_VISION_MODEL_NAME = os.path.join(


similarly for llama to tiny llama

Signed-off-by: Abhishek <maurya.abhishek@ibm.com>

dushyantbehl

LGTM! Thanks @Abhishek-TAMU

@willmj up to you to merge post the tests pass.

Signed-off-by: Abhishek <maurya.abhishek@ibm.com>

…support. (foundation-model-stack#533) * Update readme and fix Signed-off-by: Abhishek <maurya.abhishek@ibm.com> * Assign use_cache for vision model Signed-off-by: Abhishek <maurya.abhishek@ibm.com> * tiny granite vision model and dataset with doc change Signed-off-by: Abhishek <maurya.abhishek@ibm.com> * fix * Change GRANITE_VISION_MODEL_NAME to TINY_GRANITE_VISION_MODEL_NAME Signed-off-by: Abhishek <maurya.abhishek@ibm.com> * fmt/lint fix Signed-off-by: Abhishek <maurya.abhishek@ibm.com> --------- Signed-off-by: Abhishek <maurya.abhishek@ibm.com>

Abhishek-TAMU added 4 commits April 17, 2025 16:59

Update readme and fix

bed91b6

Signed-off-by: Abhishek <maurya.abhishek@ibm.com>

Assign use_cache for vision model

6485865

Signed-off-by: Abhishek <maurya.abhishek@ibm.com>

tiny granite vision model and dataset with doc change

5651f3a

Signed-off-by: Abhishek <maurya.abhishek@ibm.com>

Merge remote-tracking branch 'upstream/main' into fix_vision_pr

2dea162

Abhishek-TAMU requested review from aluu317, anhuong, dushyantbehl, fabianlim and kmehant as code owners April 18, 2025 21:45

github-actions Bot added the fix label Apr 18, 2025

fix

843ac85

Merge remote-tracking branch 'upstream/main' into fix_vision_pr

e00703e

dushyantbehl reviewed Apr 22, 2025

View reviewed changes

Change GRANITE_VISION_MODEL_NAME to TINY_GRANITE_VISION_MODEL_NAME

bc56615

Signed-off-by: Abhishek <maurya.abhishek@ibm.com>

dushyantbehl previously approved these changes Apr 22, 2025

View reviewed changes

fmt/lint fix

798d61d

Signed-off-by: Abhishek <maurya.abhishek@ibm.com>

Abhishek-TAMU dismissed dushyantbehl’s stale review via 798d61d April 22, 2025 13:57

willmj approved these changes Apr 22, 2025

View reviewed changes

willmj merged commit b826c18 into foundation-model-stack:main Apr 22, 2025
9 checks passed

Abhishek-TAMU deleted the fix_vision_pr branch April 22, 2025 20:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix:Add tiny granite vision model and update ReadMe for vision model support.#533

fix:Add tiny granite vision model and update ReadMe for vision model support.#533
willmj merged 8 commits into
foundation-model-stack:mainfrom
Abhishek-TAMU:fix_vision_pr

Abhishek-TAMU commented Apr 18, 2025 •

edited

Loading

Uh oh!

github-actions Bot commented Apr 18, 2025

Uh oh!

Abhishek-TAMU commented Apr 18, 2025 •

edited

Loading

Uh oh!

dushyantbehl Apr 22, 2025

Uh oh!

dushyantbehl Apr 22, 2025

Uh oh!

dushyantbehl left a comment

Uh oh!

dushyantbehl Apr 22, 2025

Uh oh!

dushyantbehl left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	GRANITE_VISION_MODEL_NAME = os.path.join(
	TINY_GRANITE_VISION_MODEL_NAME = os.path.join(

Conversation

Abhishek-TAMU commented Apr 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description of the change

Related issue number

How to verify the PR

Was the PR tested

Uh oh!

github-actions Bot commented Apr 18, 2025

Uh oh!

Abhishek-TAMU commented Apr 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dushyantbehl Apr 22, 2025

Choose a reason for hiding this comment

Uh oh!

dushyantbehl Apr 22, 2025

Choose a reason for hiding this comment

Uh oh!

dushyantbehl left a comment

Choose a reason for hiding this comment

Uh oh!

dushyantbehl Apr 22, 2025

Choose a reason for hiding this comment

Uh oh!

dushyantbehl left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Abhishek-TAMU commented Apr 18, 2025 •

edited

Loading

Abhishek-TAMU commented Apr 18, 2025 •

edited

Loading