Skip to content

Commit 35c5cf5

Browse files
CharleneHu-42joshuayaopre-commit-ci[bot]yinghu5
authored
Refine README with highlighted examples and updated support info (#1006)
Signed-off-by: CharleneHu-42 <yabai.hu@intel.com> Co-authored-by: Yi Yao <yi.a.yao@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Ying Hu <ying.hu@intel.com>
1 parent 63b789a commit 35c5cf5

2 files changed

Lines changed: 52 additions & 15 deletions

File tree

README.md

Lines changed: 16 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -12,6 +12,22 @@ GenAIExamples are designed to give developers an easy entry into generative AI,
1212

1313
[GenAIEval](https://github.com/opea-project/GenAIEval) measures service performance metrics such as throughput, latency, and accuracy for GenAIExamples. This feature helps users compare performance across various hardware configurations easily.
1414

15+
## Use Cases
16+
17+
Below are some highlighted GenAI use cases across various application scenarios:
18+
19+
| Scenario | Use Case |
20+
| ---------------------------- | ------------------------------------------------------------------------------------------------------------------------------------- |
21+
| <b>Question Answering<b/> | [ChatQnA](ChatQnA) ✨: Chatbot with Retrieval Augmented Generation (RAG). <br/> [VisualQnA](VisualQnA) ✨: Visual Question-answering. |
22+
| <b>Image Generation<b/> | [Text2Image](Text2Image) ✨: Text-to-image generation. |
23+
| <b>Content Summarization<b/> | [DocSum](DocSum): Document Summarization Application. |
24+
| <b>FAQ Generation<b/> | [FaqGen](FaqGen): Frequently asked questions (FAQs) generation from your documents, legal texts, customer queries etc. |
25+
| <b>Code Generation<b/> | [CodeGen](CodeGen): Gen-AI Powered Code Generator. |
26+
| <b>Information Retrieval<b/> | [DocIndexRetriever](DocIndexRetriever): Document Retrieval with Retrieval Augmented Generation (RAG). |
27+
| <b>Fine-tuning<b/> | [InstructionTuning](InstructionTuning): Application of Instruction Tuning. |
28+
29+
For the full list of the available use cases and their supported deployment type, please refer [here](#deploy-examples).
30+
1531
## Documentation
1632

1733
The GenAIExamples [documentation](https://opea-project.github.io/latest/examples/index.html) contains a comprehensive guide on all available examples including architecture, deployment guides, and more. Information on GenAIComps, GenAIInfra, and GenAIEval can also be found there.

supported_examples.md

Lines changed: 36 additions & 15 deletions
Original file line numberDiff line numberDiff line change
@@ -105,9 +105,9 @@ This document introduces the supported examples of GenAIExamples. The supported
105105

106106
[VisualQnA](./VisualQnA/README.md) is an example of chatbot for question and answering based on the images.
107107

108-
| LVM | HW | Description |
109-
| --------------------------------------------------------------------------------------------- | ------ | ----------- |
110-
| [llava-hf/llava-v1.6-mistral-7b-hf](https://huggingface.co/llava-hf/llava-v1.6-mistral-7b-hf) | Gaudi2 | Chatbot |
108+
| LVM | HW | Description |
109+
| --------------------------------------------------------------------------------------------- | ----------- | ----------- |
110+
| [llava-hf/llava-v1.6-mistral-7b-hf](https://huggingface.co/llava-hf/llava-v1.6-mistral-7b-hf) | Xeon/Gaudi2 | Chatbot |
111111

112112
### VideoQnA
113113

@@ -122,43 +122,43 @@ By default, the embedding and LVM models are set to a default value as listed be
122122

123123
### RerankFinetuning
124124

125-
Rerank model finetuning example is for training rerank model on a dataset for improving its capability on specific field.
125+
[Rerank model finetuning](./RerankFinetuning/README.md) example is for training rerank model on a dataset for improving its capability on specific field.
126126

127127
By default, the base model is set to a default value as listed below:
128128

129-
| Service | Base Model | HW | Description |
130-
| ----------------- | ------------------------------------------------------------------------- | ---- | ------------------------------- |
131-
| Rerank Finetuning | [BAAI/bge-reranker-large](https://huggingface.co/BAAI/bge-reranker-large) | Xeon | Rerank model finetuning service |
129+
| Service | Base Model | HW | Description |
130+
| ----------------- | ------------------------------------------------------------------------- | ----------- | ------------------------------- |
131+
| Rerank Finetuning | [BAAI/bge-reranker-large](https://huggingface.co/BAAI/bge-reranker-large) | Xeon/Gaudi2 | Rerank model finetuning service |
132132

133133
### InstructionTuning
134134

135-
The Instruction Tuning example is designed to further train large language models (LLMs) on a dataset consisting of (instruction, output) pairs using supervised learning. This process bridges the gap between the LLM's original objective of next-word prediction and the user’s objective of having the model follow human instructions accurately. By leveraging Instruction Tuning, this example enhances the LLM's ability to better understand and execute specific tasks, improving the model's alignment with user instructions and its overall performance.
135+
The [Instruction Tuning](./InstructionTuning/README.md) example is designed to further train large language models (LLMs) on a dataset consisting of (instruction, output) pairs using supervised learning. This process bridges the gap between the LLM's original objective of next-word prediction and the user’s objective of having the model follow human instructions accurately. By leveraging Instruction Tuning, this example enhances the LLM's ability to better understand and execute specific tasks, improving the model's alignment with user instructions and its overall performance.
136136

137137
By default, the base model is set to a default value as listed below:
138138

139-
| Service | Base Model | HW | Description |
140-
| ----------------- | ------------------------------------------------------------------------------------- | ---------- | ------------------------------------ |
141-
| InstructionTuning | [meta-llama/Llama-2-7b-chat-hf](https://huggingface.co/meta-llama/Llama-2-7b-chat-hf) | Xeon/Gaudi | LLM model Instruction Tuning service |
139+
| Service | Base Model | HW | Description |
140+
| ----------------- | ------------------------------------------------------------------------------------- | ----------- | ------------------------------------ |
141+
| InstructionTuning | [meta-llama/Llama-2-7b-chat-hf](https://huggingface.co/meta-llama/Llama-2-7b-chat-hf) | Xeon/Gaudi2 | LLM model Instruction Tuning service |
142142

143143
### DocIndexRetriever
144144

145-
The DocRetriever example demonstrates how to match user queries with free-text records using various retrieval methods. It plays a key role in Retrieval-Augmented Generation (RAG) systems by dynamically fetching relevant information from external sources, ensuring responses are factual and up-to-date. Powered by vector databases, DocRetriever enables efficient, semantic retrieval by storing data as vectors and quickly identifying the most relevant documents based on similarity.
145+
The [DocRetriever](./DocIndexRetriever/README.md) example demonstrates how to match user queries with free-text records using various retrieval methods. It plays a key role in Retrieval-Augmented Generation (RAG) systems by dynamically fetching relevant information from external sources, ensuring responses are factual and up-to-date. Powered by vector databases, DocRetriever enables efficient, semantic retrieval by storing data as vectors and quickly identifying the most relevant documents based on similarity.
146146

147147
| Framework | Embedding | Vector Database | Serving | HW | Description |
148148
| ------------------------------------------------------------------------------ | --------------------------------------------------- | -------------------------- | --------------------------------------------------------------- | ----------- | -------------------------- |
149149
| [LangChain](https://www.langchain.com)/[LlamaIndex](https://www.llamaindex.ai) | [BGE-Base](https://huggingface.co/BAAI/bge-base-en) | [Redis](https://redis.io/) | [TEI](https://github.com/huggingface/text-embeddings-inference) | Xeon/Gaudi2 | Document Retrieval service |
150150

151151
### AgentQnA
152152

153-
The AgentQnA example demonstrates a hierarchical, multi-agent system designed for question-answering tasks. A supervisor agent interacts directly with the user, delegating tasks to a worker agent and utilizing various tools to gather information and generate answers. The worker agent primarily uses a retrieval tool to respond to the supervisor's queries. Additionally, the supervisor can access other tools, such as APIs to query knowledge graphs, SQL databases, or external knowledge bases, to enhance the accuracy and relevance of its responses.
153+
The [AgentQnA](./AgentQnA/README.md) example demonstrates a hierarchical, multi-agent system designed for question-answering tasks. A supervisor agent interacts directly with the user, delegating tasks to a worker agent and utilizing various tools to gather information and generate answers. The worker agent primarily uses a retrieval tool to respond to the supervisor's queries. Additionally, the supervisor can access other tools, such as APIs to query knowledge graphs, SQL databases, or external knowledge bases, to enhance the accuracy and relevance of its responses.
154154

155155
Worker agent uses open-source websearch tool (duckduckgo), agents use OpenAI GPT-4o-mini as llm backend.
156156

157157
> **_NOTE:_** This example is in active development. The code structure of these use cases are subject to change.
158158
159159
### AudioQnA
160160

161-
The AudioQnA example demonstrates the integration of Generative AI (GenAI) models for performing question-answering (QnA) on audio files, with the added functionality of Text-to-Speech (TTS) for generating spoken responses. The example showcases how to convert audio input to text using Automatic Speech Recognition (ASR), generate answers to user queries using a language model, and then convert those answers back to speech using Text-to-Speech (TTS).
161+
The [AudioQnA](./AudioQnA/README.md) example demonstrates the integration of Generative AI (GenAI) models for performing question-answering (QnA) on audio files, with the added functionality of Text-to-Speech (TTS) for generating spoken responses. The example showcases how to convert audio input to text using Automatic Speech Recognition (ASR), generate answers to user queries using a language model, and then convert those answers back to speech using Text-to-Speech (TTS).
162162

163163
<table>
164164
<tr>
@@ -179,7 +179,7 @@ The AudioQnA example demonstrates the integration of Generative AI (GenAI) model
179179

180180
### FaqGen
181181

182-
FAQ Generation Application leverages the power of large language models (LLMs) to revolutionize the way you interact with and comprehend complex textual data. By harnessing cutting-edge natural language processing techniques, our application can automatically generate comprehensive and natural-sounding frequently asked questions (FAQs) from your documents, legal texts, customer queries, and other sources. In this example use case, we utilize LangChain to implement FAQ Generation and facilitate LLM inference using Text Generation Inference on Intel Xeon and Gaudi2 processors.
182+
[FAQ Generation](./FaqGen/README.md) application leverages the power of large language models (LLMs) to revolutionize the way you interact with and comprehend complex textual data. By harnessing cutting-edge natural language processing techniques, our application can automatically generate comprehensive and natural-sounding frequently asked questions (FAQs) from your documents, legal texts, customer queries, and other sources. In this example use case, we utilize LangChain to implement FAQ Generation and facilitate LLM inference using Text Generation Inference on Intel Xeon and Gaudi2 processors.
183183
| Framework | LLM | Serving | HW | Description |
184184
| ------------------------------------------------------------------------------ | ----------------------------------------------------------------- | --------------------------------------------------------------- | ----------- | ----------- |
185185
| [LangChain](https://www.langchain.com)/[LlamaIndex](https://www.llamaindex.ai) | [Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) | [TGI](https://github.com/huggingface/text-generation-inference) | Xeon/Gaudi2 | Chatbot |
@@ -199,3 +199,24 @@ FAQ Generation Application leverages the power of large language models (LLMs) t
199199
### ProductivitySuite
200200

201201
[Productivity Suite](./ProductivitySuite/README.md) streamlines your workflow to boost productivity. It leverages the power of OPEA microservices to deliver a comprehensive suite of features tailored to meet the diverse needs of modern enterprises.
202+
203+
### DBQnA
204+
205+
[DBQnA](./DBQnA/README.md) converts your natural language query into an SQL query, automatically executes the generated query on the database and delivers real-time query results.
206+
| Framework | LLM | Database | HW | Description |
207+
|----------------------------------------|-------------------------------------------------------------------------------------------------|-------------------------------------------|------|----------------------------|
208+
| [LangChain](https://www.langchain.com) | [mistralai/Mistral-7B-Instruct-v0.3](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.3) | [PostgresDB](https://www.postgresql.org/) | Xeon | Natural language SQL agent |
209+
210+
### Text2Image
211+
212+
[Text2Image](./Text2Image/README.md) generates image based on your provided text.
213+
| Framework | LDM | HW | Description |
214+
|----------------------------------------|--------------------------------------------------------------------------------------------------------|-------------|-------------|
215+
| [LangChain](https://www.langchain.com) | [stabilityai/stable-diffusion](https://huggingface.co/stabilityai/stable-diffusion-3-medium-diffusers) | Xeon/Gaudi2 | Text2Image |
216+
217+
### AvatarChatbot
218+
219+
[AvatarChatbot](./AvatarChatbot/README.md) example is a chatbot with a visual character that provides users dynamic, engaging interactions, by leveraging multiple generative AI components including LLM, ASR (Audio-Speech-Recognition), and TTS (Text-To-Speech).
220+
| LLM | ASR | TTS | Animation | HW | Description |
221+
|-------------------------------------------------------------------------------|---------------------------------------------------------------------|---------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------------|-------------|----------------------------|
222+
| [Intel/neural-chat-7b-v3-3](https://huggingface.co/Intel/neural-chat-7b-v3-3) | [openai/whisper-small](https://huggingface.co/openai/whisper-small) | [microsoft/SpeechT5](https://huggingface.co/microsoft/speecht5_tts) | [Rudrabha/Wav2Lip](https://github.com/Rudrabha/Wav2Lip) <br> [TencentARC/GFPGAN](https://github.com/TencentARC/GFPGAN) | Xeon/Gaudi2 | Interactive chatbot Avatar |

0 commit comments

Comments
 (0)