You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+18-18Lines changed: 18 additions & 18 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,4 +1,4 @@
1
-
##Semantic Search with Couchbase Vector Store and LLM Integration
1
+
# Semantic Search with Couchbase Vector Store and LLM Integration
2
2
3
3
This repository demonstrates how to build a powerful semantic search engine using Couchbase as the backend database, combined with various AI-powered embedding and language model providers such as OpenAI, Azure OpenAI, Anthropic (Claude), Cohere, Hugging Face, Jina AI, Mistral AI, and Voyage AI.
4
4
@@ -8,45 +8,45 @@ Each example provides two distinct approaches:
8
8
9
9
Semantic search goes beyond simple keyword matching by understanding the context and meaning behind the words in a query, making it essential for applications that require intelligent information retrieval.
10
10
11
-
###Features
11
+
## Features
12
12
13
13
-**Multiple Embedding Models**: Support for embeddings from OpenAI, Azure OpenAI, Anthropic (Claude), Cohere, Hugging Face, Jina AI, Mistral AI, and Voyage AI.
Use the provided `{model}_index.json` index definition file in each model's `search_based/` directory to create a new vector search index in your Couchbase cluster.
38
38
39
-
#####For Hyperscale Vector Index, Composite Vector Index Examples:
39
+
#### For Hyperscale Vector Index, Composite Vector Index Examples:
40
40
No additional setup required. Hyperscale and Composite Vector Indexes will be created in each model's example.
41
41
42
-
####3. Run the notebook file
42
+
### 3. Run the notebook file
43
43
44
44
You can either run the notebook file on [Google Colab](https://colab.research.google.com/) or run it on your system by setting up the Python environment.
45
45
46
46
47
-
###Components
47
+
## Components
48
48
49
-
####1. Multiple Embedding Models
49
+
### 1. Multiple Embedding Models
50
50
51
51
The system supports embeddings from various AI providers:
52
52
@@ -59,23 +59,23 @@ The system supports embeddings from various AI providers:
59
59
* Mistral AI
60
60
* Voyage AI
61
61
62
-
####2. Couchbase Vector Store
62
+
### 2. Couchbase Vector Store
63
63
64
64
Couchbase is used to store document embeddings and metadata. The index structure allows for efficient retrieval across different embedding types.
65
65
66
-
####3. Retrieval-Augmented Generation (RAG)
66
+
### 3. Retrieval-Augmented Generation (RAG)
67
67
68
68
The RAG pipeline integrates with language models like GPT-4 to generate contextually relevant answers based on retrieved documents.
69
69
70
-
####4. Semantic Search
70
+
### 4. Semantic Search
71
71
72
-
Each notebook implements a semantic search function that performs similarity searche using the appropriate embedding type and retrieves the top-k most similar documents.
72
+
Each notebook implements a semantic search function that performs similarity search using the appropriate embedding type and retrieves the top-k most similar documents.
73
73
74
-
####5. Caching
74
+
### 5. Caching
75
75
76
76
The system implements caching functionality using `CouchbaseCache` to improve performance for repeated queries.
77
77
78
-
###Couchbase Search Vector Index
78
+
## Couchbase Search Vector Index
79
79
80
80
For Search Vector Index examples, you'll need to create a vector search index using the provided JSON configuration files. For more information on creating a vector search index, please follow the [instructions](https://docs.couchbase.com/cloud/vector-search/create-vector-search-index-ui.html). The following is an example for Azure OpenAI Model.
0 commit comments