@@ -337,20 +337,20 @@ Model Name & Size | Model Architecture | Full Finetuning | Low Rank Adaptation
337337[ Granite 3.0 8B] ( https://huggingface.co/ibm-granite/granite-3.0-8b-base ) | GraniteForCausalLM | ✅* | ✅* | ✔️ |
338338[ GraniteMoE 1B] ( https://huggingface.co/ibm-granite/granite-3.0-1b-a400m-base ) | GraniteMoeForCausalLM | ✅ | ✅** | ? |
339339[ GraniteMoE 3B] ( https://huggingface.co/ibm-granite/granite-3.0-3b-a800m-base ) | GraniteMoeForCausalLM | ✅ | ✅** | ? |
340- [ Granite 3B Code Base ] ( https://huggingface.co/ibm-granite/granite-3b-code-base-2k ) | LlamaForCausalLM | ✅ | ✔️ | ✔️ |
341- [ Granite 8B Code Base ] ( https://huggingface.co/ibm-granite/granite-8b-code-base-4k ) | LlamaForCausalLM | ✅ | ✅ | ✅ |
340+ [ Granite 3B Code] ( https://huggingface.co/ibm-granite/granite-3b-code-base-2k ) | LlamaForCausalLM | ✅ | ✔️ | ✔️ |
341+ [ Granite 8B Code] ( https://huggingface.co/ibm-granite/granite-8b-code-base-4k ) | LlamaForCausalLM | ✅ | ✅ | ✅ |
342342Granite 13B | GPTBigCodeForCausalLM | ✅ | ✅ | ✔️ |
343343Granite 20B | GPTBigCodeForCausalLM | ✅ | ✔️ | ✔️ |
344- [ Granite 34B Code Instruct ] ( https://huggingface.co/ibm-granite/granite-34b-code-instruct-8k ) | GPTBigCodeForCausalLM | 🚫 | ✅ | ✅ |
345- [ Llama3.1-8B] ( https://huggingface.co/meta-llama/Llama-3.1-8B ) | LLaMA 3.1 | ✅*** | ✔️ | ✔️ |
346- [ Llama3.1-70B] ( https://huggingface.co/meta-llama/Llama-3.1-70B ) (same architecture as llama3) | LLaMA 3.1 | 🚫 - same as Llama3-70B | ✔️ | ✔️ |
347- [ Llama3.1-405B] ( https://huggingface.co/meta-llama/Llama-3.1-405B ) | LLaMA 3.1 | 🚫 | 🚫 | ✅ |
348- [ Llama3-8B] ( https://huggingface.co/meta-llama/Meta-Llama-3-8B ) | LLaMA 3 | ✅ | ✅ | ✔️ |
349- [ Llama3-70B] ( https://huggingface.co/meta-llama/Meta-Llama-3-70B ) | LLaMA 3 | 🚫 | ✅ | ✅ |
344+ [ Granite 34B Code] ( https://huggingface.co/ibm-granite/granite-34b-code-instruct-8k ) | GPTBigCodeForCausalLM | 🚫 | ✅ | ✅ |
345+ [ Llama3.1-8B] ( https://huggingface.co/meta-llama/Llama-3.1-8B ) | LlamaForCausalLM | ✅*** | ✔️ | ✔️ |
346+ [ Llama3.1-70B] ( https://huggingface.co/meta-llama/Llama-3.1-70B ) (same architecture as llama3) | LlamaForCausalLM | 🚫 - same as Llama3-70B | ✔️ | ✔️ |
347+ [ Llama3.1-405B] ( https://huggingface.co/meta-llama/Llama-3.1-405B ) | LlamaForCausalLM | 🚫 | 🚫 | ✅ |
348+ [ Llama3-8B] ( https://huggingface.co/meta-llama/Meta-Llama-3-8B ) | LlamaForCausalLM | ✅ | ✅ | ✔️ |
349+ [ Llama3-70B] ( https://huggingface.co/meta-llama/Meta-Llama-3-70B ) | LlamaForCausalLM | 🚫 | ✅ | ✅ |
350350aLLaM-13b | LlamaForCausalLM | ✅ | ✅ | ✅ |
351- [ Mixtral 8x7B] ( https://huggingface.co/mistralai/Mixtral-8x7B-v0.1 ) | Mixtral | ✅ | ✅ | ✅ |
352- [ Mistral-7b] ( https://huggingface.co/mistralai/Mistral-7B-v0.1 ) | Mistral | ✅ | ✅ | ✅ |
353- Mistral large | Mistral | 🚫 | 🚫 | 🚫 |
351+ [ Mixtral 8x7B] ( https://huggingface.co/mistralai/Mixtral-8x7B-v0.1 ) | MixtralForCausalLM | ✅ | ✅ | ✅ |
352+ [ Mistral-7b] ( https://huggingface.co/mistralai/Mistral-7B-v0.1 ) | MistralForCausalLM | ✅ | ✅ | ✅ |
353+ Mistral large | MistralForCausalLM | 🚫 | 🚫 | 🚫 |
354354
355355(* ) - Supported with ` fms-hf-tuning ` v2.4.0 or later.
356356
0 commit comments