.vscode
Wikipedia
.gitignore
README.md
diskann-quickstart-azure-sql-improvements.sql
diskann-quickstart-azure-sql.sql
diskann-quickstart-sql-server-2025.sql

Approximate Nearest Neighbor Search

Quickstart

The quickstart is the simplest way to get started with DiskANN in SQL Server. It requires no external resources and is a good way to get familiar with the DiskANN syntax and capabilities. Once you are comfortable with the quickstart, you can explore the Wikipedia sample in this folder, which provides a complete end-to-end example.

Wikipedia Sample Dataset

SQL Server 2025 introduces a new VECTOR_SEARCH function that allows you to perform approximate nearest neighbor search using the DiskANN algorithm. This function is designed to work with vector columns in SQL Server, enabling efficient similarity search on high-dimensional data.

The samples in this folder demonstrate how to use the VECTOR_SEARCH function with DiskANN. The samples include:

Creating a table with a vector column, importing data from a CSV file, and inserting data into the table.
Creating an approximate vector index on the table using the CREATE VECTOR INDEX statement.
Performing approximate nearest neighbor search using the VECTOR_SEARCH function.
Performing hybrid search using the VECTOR_SEARCH function along with full-text search.
Semantic reranking with a Cohere rerank model, called through the sp_invoke_external_rest_endpoint stored procedure. For more details on semantic reranking, refer to the Semantic Reranking Sample.
Using half-precision floating-point values to store embeddings, for a more compact representation of vectors.
Using the Vectorizer to generate embeddings for text data.
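
As a rough illustration of the first three steps above, the following T-SQL sketch creates a table with a vector column, builds a DiskANN index on it, and runs an approximate nearest neighbor query. It is a minimal sketch rather than the content of the sample scripts: the table name dbo.articles, its columns, the index name, and the 3-dimensional vectors are hypothetical and chosen only to keep the example short. It also assumes that preview features must be enabled first and that the table already contains enough rows for the index to be built; check the quickstart scripts in this folder for the exact requirements of your version.

```sql
-- Assumption: vector indexes and VECTOR_SEARCH are preview features
-- that must be enabled per database.
ALTER DATABASE SCOPED CONFIGURATION SET PREVIEW_FEATURES = ON;
GO

-- Hypothetical table: an integer clustered primary key plus a vector column.
-- Real embeddings usually have hundreds or thousands of dimensions.
CREATE TABLE dbo.articles
(
    id        INT           NOT NULL PRIMARY KEY CLUSTERED,
    title     NVARCHAR(200) NOT NULL,
    embedding VECTOR(3)     NOT NULL
);
GO

-- (Load rows into dbo.articles here before creating the index.)

-- Build the approximate (DiskANN) vector index using the cosine metric.
CREATE VECTOR INDEX vec_idx_articles
ON dbo.articles (embedding)
WITH (METRIC = 'cosine', TYPE = 'diskann');
GO

-- Query vector; in practice this is the embedding of the search text.
DECLARE @qv VECTOR(3) = CAST('[0.10, 0.20, 0.30]' AS VECTOR(3));

-- Return the 5 approximate nearest neighbors, closest first.
SELECT t.id, t.title, s.distance
FROM VECTOR_SEARCH(
        TABLE      = dbo.articles AS t,
        COLUMN     = embedding,
        SIMILAR_TO = @qv,
        METRIC     = 'cosine',
        TOP_N      = 5
     ) AS s
ORDER BY s.distance;
```

The metric passed to VECTOR_SEARCH matches the one used when the index was created, since the index is built for a specific distance metric.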

Vectorizer

To quickly generate embeddings for existing text data, you can use the Vectorizer, which is available as a sample open-source project: azure-sql-db-vectorizer

End-To-End sample

A full end-to-end sample using Streamlit is available here: https://github.com/Azure-Samples/azure-sql-diskann
