Name and Version
llama-server --version
ggml_cuda_init: found 1 CUDA devices (Total VRAM: 8150 MiB):
Device 0: NVIDIA GeForce RTX 5060 Laptop GPU, compute capability 12.0, VMM: yes, VRAM: 8150 MiB
version: 8616 (ced5734)
built with MSVC 19.50.35728.0 for Windows AMD64
Operating systems
Windows
Which llama.cpp modules do you know to be affected?
llama-server
Command line
llama-server --grammar-file grammar.gbnf -m qwen_qwen3.5-0.8b-q8_0.gguf
Problem description & steps to reproduce
The server ignores the grammar file passed via the command-line flag, but honors API requests that include a "grammar" field.
Possibly caused by commit
5e54d51
which removed defaults.sampling.grammar from the initialization process (the field now default-initializes to an empty string), so the server appears to apply a grammar only when the "grammar" field is sent through the API.
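A minimal way to contrast the two code paths (a sketch, assuming the default server port 8080 and the /completion endpoint; the grammar below is a made-up example, not the one from my setup):

```shell
# Create a trivial GBNF grammar that only allows "yes" or "no".
cat > grammar.gbnf <<'EOF'
root ::= "yes" | "no"
EOF

# Path 1: pass the grammar file on the command line.
# Expected: output constrained to "yes"/"no". Observed: grammar is ignored.
llama-server --grammar-file grammar.gbnf -m qwen_qwen3.5-0.8b-q8_0.gguf

# Path 2: send the grammar inline in the API request instead.
# Observed: the grammar IS honored on this path.
curl http://localhost:8080/completion -d '{
  "prompt": "Is water wet? Answer: ",
  "grammar": "root ::= \"yes\" | \"no\"",
  "n_predict": 4
}'
```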
First Bad Commit
No response
Relevant log output
No response