Skip to content

Commit 160894e

Browse files
committed
doc: fix attn typo in readme
1 parent b2eccc2 commit 160894e

1 file changed

Lines changed: 1 addition & 1 deletion

File tree

README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -81,7 +81,7 @@
8181

8282
- 使用paged attention, flash attention后端,cuda graph等功能:
8383
```bash
84-
CUDA_VISIBLE_DEVICES=0,1,2,3 python python/infinilm/server/inference_server.py --device nvidia --model=/models/9G7B_MHA/ --enable-paged-attn --attn=flash-atten --enable-graph
84+
CUDA_VISIBLE_DEVICES=0,1,2,3 python python/infinilm/server/inference_server.py --device nvidia --model=/models/9G7B_MHA/ --enable-paged-attn --attn=flash-attn --enable-graph
8585
```
8686

8787
- 测试推理服务性能:

0 commit comments

Comments
 (0)