You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: integrations/amazon_s3_vectors/src/haystack_integrations/components/retrievers/amazon_s3_vectors/embedding_retriever.py
+4-2Lines changed: 4 additions & 2 deletions
Original file line number
Diff line number
Diff line change
@@ -113,9 +113,11 @@ def run(
113
113
114
114
:param query_embedding: Embedding of the query.
115
115
:param filters: Filters applied to the retrieved Documents. The way runtime filters are applied depends on
116
-
the ``filter_policy`` chosen at retriever initialization.
117
-
:param top_k: Maximum number of Documents to return.
116
+
the ``filter_policy`` chosen at retriever initialization. Filters are applied server-side during
117
+
the vector search.
118
+
:param top_k: Maximum number of Documents to return. S3 Vectors caps this at 100.
118
119
:returns: A dictionary with key ``"documents"`` containing the retrieved Documents.
Copy file name to clipboardExpand all lines: integrations/amazon_s3_vectors/src/haystack_integrations/document_stores/amazon_s3_vectors/document_store.py
+19-6Lines changed: 19 additions & 6 deletions
Original file line number
Diff line number
Diff line change
@@ -48,10 +48,19 @@ class S3VectorsDocumentStore:
48
48
"""
49
49
A Document Store using [Amazon S3 Vectors](https://aws.amazon.com/s3/features/vectors/).
50
50
51
-
Amazon S3 Vectors provides native vector storage and similarity search within Amazon S3.
51
+
Amazon S3 Vectors provides serverless vector storage and similarity search within Amazon S3.
52
52
This document store stores Haystack `Document` objects as vectors with associated metadata
53
53
in an S3 vector bucket and index.
54
54
55
+
**Service limits:**
56
+
57
+
- Maximum ``top_k``: 100 results per query
58
+
- Maximum vector dimension: 4,096
59
+
- Metadata per vector: 40 KB total, 2 KB filterable
60
+
- All documents must have embeddings (``float32`` only)
61
+
- Distance metrics: ``cosine`` or ``euclidean`` (set at index creation, immutable)
62
+
- ``filter_documents()`` is client-side — prefer ``S3VectorsEmbeddingRetriever`` with filters
63
+
55
64
Usage example:
56
65
```python
57
66
from haystack_integrations.document_stores.amazon_s3_vectors import S3VectorsDocumentStore
0 commit comments