This tool runs llama.cpp in a remote Docker container under different parameter settings and collects benchmark results from the logs. It is designed to measure how runtime parameters affect model performance, especially for speculative decoding.
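As a rough illustration of the log-collection step, the sketch below extracts throughput figures from timing lines in captured logs. It is not this tool's actual implementation. The line shape in the regular expression is an assumption modeled on the timing summaries that llama.cpp prints at the end of a run, and the exact prefix and wording vary between llama.cpp versions. The names `TIMING_RE`, `parse_timings`, and `compare_runs`, and the run labels in the usage note, are hypothetical.

```python
import re

# ASSUMPTION: timing lines look roughly like
#   "... prompt eval time = 200.00 ms / 20 tokens (...)"
#   "...        eval time = 2000.00 ms / 100 runs (...)"
# Adjust the pattern to match the logs your llama.cpp build produces.
TIMING_RE = re.compile(
    r"(?P<label>prompt eval|eval) time\s*=\s*(?P<ms>[\d.]+) ms"
    r"\s*/\s*(?P<n>\d+) (?:tokens|runs)"
)


def parse_timings(log_text):
    """Return {label: tokens_per_second} for each timing line found."""
    results = {}
    for match in TIMING_RE.finditer(log_text):
        ms = float(match.group("ms"))
        count = int(match.group("n"))
        if ms > 0:
            # tokens per second = tokens / (milliseconds / 1000)
            results[match.group("label")] = count * 1000.0 / ms
    return results


def compare_runs(logs_by_config):
    """Map each (hypothetical) configuration name to its parsed timings."""
    return {name: parse_timings(text) for name, text in logs_by_config.items()}
```

Calling `compare_runs({"baseline": log_a, "speculative": log_b})` with two captured log strings returns one throughput dictionary per configuration, which can then be compared side by side.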