Add support for using local NVMe for model storage

### Component

Helm Chart

### Desired use case or feature

Right now there is no way to choose how the model cache is handled. If you choose Hugging Face as the model source, the model is stored in an ephemeral `emptyDir` volume by default:

https://github.com/llm-d/llm-d-deployer/blob/c9e16e91d264ff719d4e9885fbe5e1b239eb87a1/charts/llm-d/templates/modelservice/presets/basic-gpu-with-nixl-preset.yaml#L234-L237

The only other option is to use a PVC.

---

I believe we can offer users the option to store the model on the host's local NVMe disks when it is downloaded from Hugging Face.

### Proposed solution

Allow the user to specify a `hostPath` or another volume type when the model is downloaded from Hugging Face.
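
As a rough sketch of what this could look like, assuming a hypothetical `modelCache` values key (this key and its fields do not exist in the chart today, and the NVMe mount path is only an example), the preset could render a `hostPath` volume in place of the current `emptyDir`:

```yaml
# Hypothetical values.yaml fragment. The key names are illustrative only
# and are not existing chart options.
modelCache:
  type: hostPath                    # assumed choices: emptyDir | hostPath | pvc
  hostPath:
    path: /mnt/nvme/model-cache     # assumed mount point of the local NVMe disk
    type: DirectoryOrCreate         # standard Kubernetes hostPath type

# Volume the preset might then render instead of `emptyDir: {}`:
volumes:
  - name: model-cache
    hostPath:
      path: /mnt/nvme/model-cache
      type: DirectoryOrCreate
```

Keeping `emptyDir` as the default for `type` would leave existing deployments unchanged.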

### Alternatives

_No response_

### Additional context or screenshots

_No response_

Snippet referenced by the link above (`basic-gpu-with-nixl-preset.yaml`, lines 234-237):

	{{ `{{ if .HFModelName }}` }}
	- name: model-cache
	  emptyDir: {}
	{{ `{{ end }}` }}
