LLM Course: Chapter 2 (Inference Deployment Not Working) #1250

@apking2000

Description

Optimized Inference Deployment

Clone the repository

git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp

Build the project

make

Download the SmolLM2-1.7B-Instruct-GGUF model

curl -L -O https://huggingface.co/HuggingFaceTB/SmolLM2-1.7B-Instruct-GGUF/resolve/main/smollm2-1.7b-instruct.Q4_K_M.gguf
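
If the download step is in doubt, one quick sanity check is that the file starts with the 4-byte "GGUF" magic, since a bad URL can leave an HTML error page saved under the .gguf name. A minimal sketch, where is_gguf is a hypothetical helper name and not part of llama.cpp:

```shell
# Sketch: check that a downloaded file starts with the 4-byte "GGUF" magic.
# `is_gguf` is a hypothetical helper name, not a llama.cpp tool.
is_gguf() {
  # Read only the first 4 bytes; a missing file yields an empty string.
  magic=$(head -c 4 "$1" 2>/dev/null)
  [ "$magic" = "GGUF" ]
}
```

Usage would be something like: is_gguf smollm2-1.7b-instruct.Q4_K_M.gguf && echo "looks like a GGUF file". This only checks the header, not that the download is complete.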

I tried these commands on my Mac but got an error. I think the docs are out of date.
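
A possible cause, assuming the error in the attached image comes from the build step: newer llama.cpp checkouts are built with CMake rather than a bare make. A minimal sketch of a replacement build step, assuming cmake is installed; build_llama is a hypothetical helper name, not something llama.cpp provides:

```shell
# Sketch: build a llama.cpp checkout with CMake instead of a bare `make`.
# `build_llama` is a hypothetical helper name; pass the checkout directory.
build_llama() {
  repo_dir="${1:-.}"
  # Fail early with a readable message if this is not a CMake project.
  if [ ! -f "$repo_dir/CMakeLists.txt" ]; then
    echo "no CMakeLists.txt found in $repo_dir" >&2
    return 1
  fi
  # Configure into <repo>/build, then compile in Release mode.
  cmake -S "$repo_dir" -B "$repo_dir/build" &&
    cmake --build "$repo_dir/build" --config Release
}
```

From inside the llama.cpp directory this would be called as build_llama . and, if the build succeeds, the binaries should end up under build/bin rather than in the repository root.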

Image
