Optimized Inference Deployment
Clone the repository
git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp
Build the project
make
Download the SmolLM2-1.7B-Instruct-GGUF model
curl -L -O https://huggingface.co/HuggingFaceTB/SmolLM2-1.7B-Instruct-GGUF/resolve/main/smollm2-1.7b-instruct.Q4_K_M.gguf`
I have tried the code on my mac but got error, I think the doc is not updated

Optimized Inference Deployment
Clone the repository
git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp
Build the project
make
Download the SmolLM2-1.7B-Instruct-GGUF model
curl -L -O https://huggingface.co/HuggingFaceTB/SmolLM2-1.7B-Instruct-GGUF/resolve/main/smollm2-1.7b-instruct.Q4_K_M.gguf`
I have tried the code on my mac but got error, I think the doc is not updated