ModelEngine-Group
diff --git a/‎docs/source/getting-started/quickstart_vllm.md‎
Lines changed: 1 addition & 1 deletion b/‎docs/source/getting-started/quickstart_vllm.md‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎docs/source/getting-started/quickstart_vllm_ascend.md‎
Lines changed: 1 addition & 1 deletion b/‎docs/source/getting-started/quickstart_vllm_ascend.md‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎docs/source/user-guide/prefix-cache/pipeline_store.md‎
Lines changed: 25 additions & 4 deletions b/‎docs/source/user-guide/prefix-cache/pipeline_store.md‎
Lines changed: 25 additions & 4 deletions
@@ -133,7 +133,7 @@ For quick start, just follow the guide below to launch your own inference experi
 
 ### Feature 1:  Prefix Caching
 
-You may directly edit the example file at `unified-cache-management/examples/ucm_config_example.yaml`. For more please refer to [Prefix Cache with NFS Store](../user-guide/prefix-cache/nfs_store.md) document.
+You may directly edit the example file at `unified-cache-management/examples/ucm_config_example.yaml`. For more please refer to [Prefix Cache with NFS Store](../user-guide/prefix-cache/nfs_store.md) and [Prefix Cache with Pipeline Store](../user-guide/prefix-cache/pipeline_store.md) document.
 
 ⚠️ Make sure to replace `/mnt/test` with your actual storage directory. 
 
 
@@ -107,7 +107,7 @@ For quick start, just follow the guide below to launch your own inference experi
 
 ### Feature 1:  Prefix Caching
 
-You may directly edit the example file at `unified-cache-management/examples/ucm_config_example.yaml`. For more please refer to [Prefix Cache with NFS Store](../user-guide/prefix-cache/nfs_store.md) document.
+You may directly edit the example file at `unified-cache-management/examples/ucm_config_example.yaml`. For more please refer to [Prefix Cache with NFS Store](../user-guide/prefix-cache/nfs_store.md) and [Prefix Cache with Pipeline Store](../user-guide/prefix-cache/pipeline_store.md) document.
 
 ⚠️ Make sure to replace `/mnt/test` with your actual storage directory. 
 
 
@@ -102,7 +102,22 @@ load_only_first_rank: false
   Whether to enable direct I/O.
 
 * **stream_number** *(optional, default: 8)*  
-  Number of concurrent streams used for data transfer.
+  Number of threads used for data transfer between the Host and Storage.
+
+* **buffer_number** *(optional, default: 16384)*  
+  The number of dram pinned buffers for data transfer between the Device and Host.
+  In the vast majority of cases, the default value of 16384 is already sufficient.  
+  You can also check the vLLM startup logs, where you’ll see a line like  
+  ```
+  vllm cache_config_info with initialization after num_gpu_blocks is: xxx
+  ```
+  As a rule of thumb, set `buffer_number` **>=** the reported `num_gpu_blocks` for better performance.  
+  If you are using the **Layerwise Connector**, you could set  
+  ```
+  buffer_number = num_gpu_blocks × num_layers
+  ```
+  But as said before, the default value of 16384 is already enough in most cases.
+
 
 * **waiting_queue_depth** *(optional, default: 1024)*  
   Depth of the waiting queue for transfer tasks.  
@@ -113,9 +128,6 @@ load_only_first_rank: false
 * **timeout_ms** *(optional, default: 30000)*  
   Timeout in milliseconds for external interfaces.
 
-* **buffer_size** *(optional, default: 64GB)*  
-  Amount of dram pinned memory used by a single worker process.
-
 ### Must-be-Set Parameters
 
 * **load_only_first_rank** (must be `false`):  
@@ -146,6 +158,15 @@ vllm serve Qwen/Qwen2.5-14B-Instruct \
     "kv_connector_extra_config": {"UCM_CONFIG_FILE": "/vllm-workspace/unified-cache-management/examples/ucm_config_example.yaml"}
 }'
 ```
+You can also use the Layerwise Connector by adding `"use_layerwise": true` to the `kv_connector_extra_config`.
+for example:
+
+```bash
+"kv_connector_extra_config": {
+  "use_layerwise": true,
+  "UCM_CONFIG_FILE": "/home/qiuyuhao1/unified-cache-management/examples/ucm_config_example.yaml"
+}
+```
 
 **⚠️ Make sure to replace `"/vllm-workspace/unified-cache-management/examples/ucm_config_example.yaml"` with your actual config file path.**