Commit c1eef36

bump to latest
1 parent a34d70e commit c1eef36

2 files changed: +4 -4 lines changed

README.md

Lines changed: 4 additions & 4 deletions
@@ -187,9 +187,9 @@ Here is the detailed architecture of `Ask Question/Doubt` component:
 
 $ bash pot_benchmark_qa.sh
 
-Please note that, above shell script can perform optimization using IPEX to get Pytorch-(IPEX)-FP32 model
-or It can perform optimization/quantization using Intel® Neural Compressor to get Static-QAT-INT8,
-Static-Smooth-QAT-INT8 models. Moreover, you can choose the backend as `default` or `ipex` for INT8 models.
+# Please note that, above shell script can perform optimization using IPEX to get Pytorch-(IPEX)-FP32 model
+# or It can perform optimization/quantization using Intel® Neural Compressor to get Static-QAT-INT8,
+# Static-Smooth-QAT-INT8 models. Moreover, you can choose the backend as `default` or `ipex` for INT8 models.
 
 ```

@@ -255,7 +255,7 @@ recent Generative AI LLMs (Large Language models) (open access models like LLaMA
 
 Here for performance gain, we can use INT8 quantized model optimized using Intel® Neural Compressor (Few options are like https://huggingface.co/decapoda-research/llama-7b-hf-int8 etc.)
 
-Please Note that for fun 😄, we also provide usage of Azure OpenAI Cognitive Service to use models like GPT3 paid subscription API. You just need to provide `azure_deployment_name`, set `llm_name` as `hf_pipeline` in the below configuration and then add `<your_key>`
+Please Note that for fun 😄, we also provide usage of Azure OpenAI Cognitive Service to use models like GPT3 paid subscription API. You just need to provide `azure_deployment_name`, set `llm_name` as `azure_gpt3` in the below configuration and then add `<your_key>`
 
 ```python

assets/.DS_Store

-2 KB
Binary file not shown.
