RAG Core Libraries

This directory contains the core libraries of the STACKIT RAG template. These libraries provide comprehensive document extraction capabilities, including support for files (PDF, DOCX, XML, EPUB), web sources via sitemaps, and Confluence pages. The directory consists of the following Python packages:

1. RAG Core API
2. Admin API lib
3. Extractor API lib
4. RAG Core lib
- 4.1 Requirements
- 4.2 Retry decorator (exponential backoff)

With the exception of the RAG Core lib, all of these packages contain an API definition and are easy to adjust for your specific use case. Each package defines its replaceable parts (1.3 Replaceable parts, 2.3 Replaceable parts, 3.3 Replaceable parts) and their expected types, and offers a brief description.

ⓘ INFO: If you replace parts, keep the name of the component; otherwise the replacement logic will not work.

This directory also contains a Dockerfile that is used to ensure proper linting and testing of the library packages.

For an example of how to use the packages, please consult the RAG Template project or the individual service implementations in the services directory.

1. RAG Core API

The rag-core-api contains a default implementation of a RAG. For a default use case, no adjustments should be required.

The following endpoints are provided by the backend:

/chat/{session_id}: The endpoint for chatting.
/evaluate: Will start the evaluation of the RAG using the provided question-answer pairs.
/information_pieces/remove: Endpoint to remove documents from the vector database.
/information_pieces/upload: Endpoint to upload documents into the vector database. These documents need to have been parsed. For simplicity, a LangChain Document-like format is used.

1.1 Requirements

All required Python libraries can be found in the pyproject.toml file. In addition to Python libraries, the following system packages are required:

build-essential
make

1.2 Endpoints

`/chat/{session_id}`

This endpoint is used for chatting.

`/evaluate`

Starts the evaluation of the RAG using the provided question-answer pairs. The file containing the dataset can be set by changing the RAGAS_DATASET_FILENAME environment variable; the default is test_data.json. This path can be either an absolute path or a path relative to the current working directory. By default, OpenAI is used for the evaluation. To use the same LLM class for the evaluation as for the chat, set the environment variable RAGA_USE_OPENAI to false and set the RAGAS_MODEL environment variable to the model name of your choice.
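
As a rough illustration of the path rule above, the following sketch resolves the dataset file as the text describes. Only RAGAS_DATASET_FILENAME and its default test_data.json come from this README; the helper name `resolve_dataset_path` is hypothetical, and the actual service may resolve the path differently.

```python
import os
from pathlib import Path
from typing import Mapping, Optional


def resolve_dataset_path(env: Optional[Mapping[str, str]] = None) -> Path:
    """Return the evaluation dataset path (hypothetical helper).

    An absolute value is used as-is; a relative value is interpreted
    relative to the current working directory.
    """
    env = os.environ if env is None else env
    raw = env.get("RAGAS_DATASET_FILENAME", "test_data.json")
    path = Path(raw)
    return path if path.is_absolute() else Path.cwd() / path
```

Passing the environment as a mapping keeps the helper easy to test without mutating the real process environment.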

📝 NOTE: Due to quality problems with open-source LLMs, it is recommended to use OpenAI for the evaluation.

`/information_pieces/remove`

Endpoint to remove documents from the vector database.

`/information_pieces/upload`

Endpoint to upload documents into the vector database. These documents need to have been parsed. For simplicity, a LangChain Document-like format is used. Uploaded documents are required to contain the following metadata:

document_url that points to a download link to the source document.
All documents of the type IMAGE require the content of the image encoded in base64 in the base64_image key.
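
To make the metadata requirements concrete, here is a minimal sketch that builds one IMAGE item. Only the `document_url` and `base64_image` keys are taken from this README; the surrounding layout (`page_content`, `metadata`, `type`) is an assumption modeled on LangChain's Document and may not match the actual request schema, so check the API definition before relying on it.

```python
import base64


def build_image_piece(image_bytes: bytes, document_url: str, caption: str) -> dict:
    """Build one upload item in a LangChain-Document-like shape (assumed layout)."""
    return {
        # Assumption: the text payload lives under "page_content".
        "page_content": caption,
        "metadata": {
            # Required for every uploaded document (per this README).
            "document_url": document_url,
            # Assumption: the content type is carried in a "type" key.
            "type": "IMAGE",
            # Required for IMAGE documents (per this README).
            "base64_image": base64.b64encode(image_bytes).decode("ascii"),
        },
    }


piece = build_image_piece(b"\x89PNG...", "https://example.com/files/report.pdf", "Figure 1")
```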

1.3 Replaceable parts

Name	Type	Default	Notes
embedder	`rag_core_lib.impl.embeddings.embedder.Embedder`	Depends on your settings. Can be `rag_core_lib.impl.embeddings.langchain_community_embedder.LangchainCommunityEmbedder` or `rag_core_lib.impl.embeddings.stackit_embedder.StackitEmbedder`	Selected by `rag_core_lib.impl.settings.embedder_class_type_settings.EmbedderClassTypeSettings.embedder_type`.
vector_database	`rag_core_api.vector_databases.vector_database.VectorDatabase`	`rag_core_api.impl.vector_databases.qdrant_database.QdrantDatabase`
reranker	`rag_core_api.reranking.reranker.Reranker`	`rag_core_api.impl.reranking.flashrank_reranker.FlashrankReranker`	Used in the composed_retriever
composed_retriever	`rag_core_api.retriever.retriever.Retriever`	`rag_core_api.impl.retriever.composite_retriever.CompositeRetriever`	Handles retrieval, re-ranking, etc.
large_language_model	`langchain_core.language_models.chat_models.BaseChatModel`	Provided via `rag_core_lib.impl.llms.llm_factory.chat_model_provider`: `langchain_openai.ChatOpenAI` or `langchain_ollama.ChatOllama`	The LLM used for all LLM tasks. The default depends on `rag_core_lib.impl.settings.rag_class_types_settings.RAGClassTypeSettings.llm_type`. A fake model is used in tests.
prompt	`str`	`rag_core_api.prompt_templates.answer_generation_prompt.ANSWER_GENERATION_PROMPT`	The prompt used for answering the question.
rephrasing_prompt	`str`	`rag_core_api.prompt_templates.question_rephrasing_prompt.QUESTION_REPHRASING_PROMPT`	The prompt used for rephrasing the question. The rephrased question and the original question are both used for retrieval of the documents.
language_detection_prompt	`str`	`rag_core_api.prompt_templates.language_detection_prompt.LANGUAGE_DETECTION_PROMPT`	Prompt for detecting input language. Enforces structured JSON output `{ "language": "<iso639-1>" }` and defaults to `en` when uncertain.
langfuse_manager	`rag_core_lib.impl.langfuse_manager.langfuse_manager.LangfuseManager`	`rag_core_lib.impl.langfuse_manager.langfuse_manager.LangfuseManager`	Retrieves additional settings, as well as the prompt from langfuse if available.
answer_generation_chain	`rag_core_lib.runnables.AsyncRunnable[rag_core_api.impl.graph.graph_state.graph_state.AnswerGraphState, str]`	`rag_core_api.impl.answer_generation_chains.answer_generation_chain.AnswerGenerationChain`	LangChain chain used for answering the question. Is part of the chat_graph.
rephrasing_chain	`rag_core_lib.runnables.AsyncRunnable[rag_core_api.impl.graph.graph_state.graph_state.AnswerGraphState, str]`	`rag_core_api.impl.answer_generation_chains.rephrasing_chain.RephrasingChain`	LangChain chain used for rephrasing the question. Is part of the chat_graph.
language_detection_chain	`rag_core_lib.runnables.AsyncRunnable[rag_core_api.impl.graph.graph_state.graph_state.AnswerGraphState, str]`	`rag_core_api.impl.answer_generation_chains.language_detection_chain.LanguageDetectionChain`	Detects the language of the question and returns an ISO 639-1 code (e.g., `en`, `de`). Uses structured-output guidance and robust parsing with fallback to `en`. Part of the chat_graph.
chat_graph	`rag_core_api.graph.graph_base.GraphBase`	`rag_core_api.impl.graph.chat_graph.DefaultChatGraph`	Langgraph graph that contains the entire logic for question answering.
traced_chat_graph	`rag_core_lib.runnables.AsyncRunnable[Any, Any]`	`rag_core_lib.impl.tracers.langfuse_traced_runnable.LangfuseTracedRunnable`	Wraps around the chat_graph and adds Langfuse tracing.
evaluator	`rag_core_api.impl.evaluator.langfuse_ragas_evaluator.LangfuseRagasEvaluator`	`rag_core_api.impl.evaluator.langfuse_ragas_evaluator.LangfuseRagasEvaluator`	The evaluator used in the evaluate endpoint.
chat_endpoint	`rag_core_api.api_endpoints.chat.Chat`	`rag_core_api.impl.api_endpoints.default_chat.DefaultChat`	Implementation of the chat endpoint. The default implementation just calls the traced_chat_graph.
ragas_llm	`langchain_core.language_models.chat_models.BaseChatModel`	`langchain_openai.ChatOpenAI` or `langchain_ollama.ChatOllama`	The LLM used for the ragas evaluation.
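
The language_detection_chain row above mentions robust parsing with a fallback to `en`. The sketch below shows one way such parsing could look; it is not the library's implementation, and the function name `parse_language` is hypothetical. It only assumes the documented output shape `{ "language": "<iso639-1>" }`.

```python
import json
import re

_ISO_639_1 = re.compile(r"[a-z]{2}")


def parse_language(raw: str, default: str = "en") -> str:
    """Extract an ISO 639-1 code from a model reply, falling back to `default`."""
    # Take the first {...} span so surrounding chatter or code fences are tolerated.
    match = re.search(r"\{.*?\}", raw, flags=re.DOTALL)
    if match is None:
        return default
    try:
        payload = json.loads(match.group(0))
    except json.JSONDecodeError:
        return default
    if not isinstance(payload, dict):
        return default
    code = str(payload.get("language", "")).strip().lower()
    # Accept only two lowercase letters; anything else counts as "uncertain".
    return code if _ISO_639_1.fullmatch(code) else default
```

Validating the shape of the code rather than checking it against a full language list keeps the sketch short; a stricter variant could compare against a known set of ISO 639-1 codes.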

1.4 Embedder retry behavior

The default STACKIT embedder implementation (StackitEmbedder) uses the shared retry decorator with exponential backoff from the rag-core-lib.

Decorator: rag_core_lib.impl.utils.retry_decorator.retry_with_backoff
Base settings (fallback): RetryDecoratorSettings
Per-embedder overrides: StackitEmbedderSettings

How it resolves settings

Each retry-related field in StackitEmbedderSettings is optional. When a field is provided (not None), it overrides the corresponding value from RetryDecoratorSettings.
When a field is not provided (None), the embedder falls back to the value from RetryDecoratorSettings.
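
The override rule can be sketched in a few lines. The two dataclasses below are simplified stand-ins for RetryDecoratorSettings and StackitEmbedderSettings, not the real classes, and `resolve` is a hypothetical helper; the default values mirror those listed in section 4.2.

```python
from dataclasses import dataclass, fields
from typing import Optional


@dataclass(frozen=True)
class BaseRetrySettings:
    """Stand-in for RetryDecoratorSettings (subset of fields)."""

    max_retries: int = 5
    retry_base_delay: float = 0.5
    retry_max_delay: float = 600.0


@dataclass(frozen=True)
class EmbedderOverrides:
    """Stand-in for the optional retry fields of StackitEmbedderSettings."""

    max_retries: Optional[int] = None
    retry_base_delay: Optional[float] = None
    retry_max_delay: Optional[float] = None


def resolve(base: BaseRetrySettings, overrides: EmbedderOverrides) -> BaseRetrySettings:
    """Use each override that is not None; otherwise keep the base value."""
    merged = {}
    for field in fields(base):
        override = getattr(overrides, field.name)
        merged[field.name] = getattr(base, field.name) if override is None else override
    return BaseRetrySettings(**merged)


effective = resolve(BaseRetrySettings(), EmbedderOverrides(max_retries=2))
```

Comparing against None rather than testing truthiness matters here: an explicit override of 0 should still win over the fallback.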

Configuring via environment variables

Embedder-specific (prefix STACKIT_EMBEDDER_):
- STACKIT_EMBEDDER_MAX_RETRIES
- STACKIT_EMBEDDER_RETRY_BASE_DELAY
- STACKIT_EMBEDDER_RETRY_MAX_DELAY
- STACKIT_EMBEDDER_BACKOFF_FACTOR
- STACKIT_EMBEDDER_ATTEMPT_CAP
- STACKIT_EMBEDDER_JITTER_MIN
- STACKIT_EMBEDDER_JITTER_MAX
Global fallback (prefix RETRY_DECORATOR_): see section 4.2 for all keys and defaults.
Helm chart: set the same keys under backend.envs.stackitEmbedder in infrastructure/rag/values.yaml.

2. Admin API Lib

The Admin API Library contains all required components for file management capabilities for RAG systems, handling all document lifecycle operations. It also includes a default dependency_container, that is pre-configured and should fit most use-cases.

The following endpoints are provided by the admin-api-lib:

/delete_document/{identification}: Deletes the file from storage (if applicable) and vector database. The identification can be retrieved from the /all_documents_status endpoint.
/document_reference/{identification}: Returns the document.
/all_documents_status: Returns the identification and status of all available sources.
/upload_file: Endpoint to upload files.
/upload_source: Endpoint to upload non-file sources.

2.1 Requirements

All required Python libraries can be found in the pyproject.toml file. In addition to Python libraries, the following system packages are required:

build-essential
make

2.2 Endpoints

`/delete_document/{identification}`

Will delete the document from the connected storage system and will send a request to the backend to delete all related Documents from the vector database.

`/document_reference/{identification}`

Will return the source document stored in the connected storage system.

ⓘ INFO: Confluence pages are not stored in the connected storage system. They are only stored in the vector database and can't be retrieved using this endpoint.

`/all_documents_status`

Will return a list of all sources for the chat and their current status.

`/upload_file`

Files can be uploaded here. This endpoint will process the document in the background and will extract information using the document-extractor. The extracted information will be summarized using an LLM. The summary, as well as the unrefined extracted document, will be uploaded to the rag-core-api.

`/upload_source`

Loads all the content from an arbitrary non-file source using the document-extractor. The type of the source needs to correspond to an extractor in the document-extractor. Supported types include confluence for Confluence pages and sitemap for web content via XML sitemaps. The extracted information will be summarized using an LLM. The summary, as well as the unrefined extracted document, will be uploaded to the rag-core-api. A timeout is configured for this operation; it defaults to 3600 seconds (1 hour) and can be adjusted through values in the Helm chart.

2.3 Replaceable parts

Name	Type	Default	Notes
file_service	`admin_api_lib.file_services.file_service.FileService`	`admin_api_lib.impl.file_services.s3_service.S3Service`	Handles operations on the connected storage.
large_language_model	`langchain_core.language_models.chat_models.BaseChatModel`	Provided via `rag_core_lib.impl.llms.llm_factory.chat_model_provider`: `langchain_openai.ChatOpenAI` or `langchain_ollama.ChatOllama`	The LLM used for all LLM tasks. The default depends on `rag_core_lib.impl.settings.rag_class_types_settings.RAGClassTypeSettings.llm_type`.
semantic_chunker_embeddings	`admin_api_lib.chunker.chunker.Chunker`	Depends on your settings. Can be `rag_core_lib.impl.embeddings.langchain_community_embedder.LangchainCommunityEmbedder` or `rag_core_lib.impl.embeddings.stackit_embedder.StackitEmbedder`	Selected by `rag_core_lib.impl.settings.embedder_class_type_settings.EmbedderClassTypeSettings.embedder_type`. Can be `recursive` or `semantic`.
key_value_store	`admin_api_lib.impl.key_db.file_status_key_value_store.FileStatusKeyValueStore`	`admin_api_lib.impl.key_db.file_status_key_value_store.FileStatusKeyValueStore`	Is used for storing the available sources and their current state.
chunker	`admin_api_lib.chunker.chunker.Chunker`	`admin_api_lib.impl.chunker.text_chunker.TextChunker` or `admin_api_lib.impl.chunker.semantic_text_chunker.SemanticTextChunker`	Splits documents into chunks. Select implementation via `CHUNKER_CLASS_TYPE_CHUNKER_TYPE` (`recursive` or `semantic`).
document_extractor	`admin_api_lib.extractor_api_client.openapi_client.api.extractor_api.ExtractorApi`	`admin_api_lib.extractor_api_client.openapi_client.api.extractor_api.ExtractorApi`	Needs to be replaced if adjustments to the `extractor-api` are made.
rag_api	`admin_api_lib.rag_backend_client.openapi_client.api.rag_api.RagApi`	`admin_api_lib.rag_backend_client.openapi_client.api.rag_api.RagApi`	Needs to be replaced if changes to the `/information_pieces/remove` or `/information_pieces/upload` endpoints of the `rag-core-api` are made.
summarizer_prompt	`str`	`admin_api_lib.prompt_templates.summarize_prompt.SUMMARIZE_PROMPT`	The prompt used for the summarization.
langfuse_manager	`rag_core_lib.impl.langfuse_manager.langfuse_manager.LangfuseManager`	`rag_core_lib.impl.langfuse_manager.langfuse_manager.LangfuseManager`	Retrieves additional settings, as well as the prompt from langfuse if available.
summarizer	`admin_api_lib.summarizer.summarizer.Summarizer`	`admin_api_lib.impl.summarizer.langchain_summarizer.LangchainSummarizer`	Creates the summaries. Uses the shared retry decorator with optional per-summarizer overrides (see 2.5).
untraced_information_enhancer	`admin_api_lib.information_enhancer.information_enhancer.InformationEnhancer`	`admin_api_lib.impl.information_enhancer.general_enhancer.GeneralEnhancer`	Uses the summarizer to enhance the extracted documents.
information_enhancer	`rag_core_lib.runnables.AsyncRunnable[Any, Any]`	`rag_core_lib.impl.tracers.langfuse_traced_runnable.LangfuseTracedRunnable`	Wraps around the untraced_information_enhancer and adds Langfuse tracing.
document_deleter	`admin_api_lib.api_endpoints.document_deleter.DocumentDeleter`	`admin_api_lib.impl.api_endpoints.default_document_deleter.DefaultDocumentDeleter`	Handles deletion of sources.
documents_status_retriever	`admin_api_lib.api_endpoints.documents_status_retriever.DocumentsStatusRetriever`	`admin_api_lib.impl.api_endpoints.default_documents_status_retriever.DefaultDocumentsStatusRetriever`	Handles return of source status.
source_uploader	`admin_api_lib.api_endpoints.source_uploader.SourceUploader`	`admin_api_lib.impl.api_endpoints.default_source_uploader.DefaultSourceUploader`	Handles data loading and extraction from various non-file sources.
document_reference_retriever	`admin_api_lib.api_endpoints.document_reference_retriever.DocumentReferenceRetriever`	`admin_api_lib.impl.api_endpoints.default_document_reference_retriever.DefaultDocumentReferenceRetriever`	Handles return of files from connected storage.
file_uploader	`admin_api_lib.api_endpoints.file_uploader.FileUploader`	`admin_api_lib.impl.api_endpoints.default_file_uploader.DefaultFileUploader`	Handles upload and extraction of files.

2.4 Chunker configuration (multiple chunkers)

The default dependency container now exposes two chunking strategies which can be chosen by ChunkerClassTypeSettings:

recursive (default) wraps LangChain's RecursiveCharacterTextSplitter.
semantic wraps LangChain's SemanticChunker, and considers minimum/maximum chunk size with nltk/RecursiveCharacterTextSplitter.

You can switch between them and fine-tune their behaviour through environment variables:

Setting	Description	Default
`CHUNKER_MAX_SIZE`	Maximum character count per recursive chunk.	`1000`
`CHUNKER_OVERLAP`	Character overlap between recursive chunks.	`100`
`CHUNKER_BREAKPOINT_THRESHOLD_TYPE`	Breakpoint heuristic (`percentile`, `standard_deviation`, `interquartile`).	`percentile`
`CHUNKER_BREAKPOINT_THRESHOLD_AMOUNT`	Threshold associated with the selected heuristic.	`95.0`
`CHUNKER_BUFFER_SIZE`	Context buffer that is kept on both sides of a semantic breakpoint.	`1`
`CHUNKER_MIN_SIZE`	Minimum size for semantic chunks.	`200`

📌 The recursive chunker only uses the CHUNKER_MAX_SIZE and CHUNKER_OVERLAP knobs. The remaining keys are ignored unless CHUNKER_CLASS_TYPE_CHUNKER_TYPE=semantic.

Behavior details

Recursive chunker enforces CHUNKER_MAX_SIZE and CHUNKER_OVERLAP only.
Semantic chunker uses embeddings to detect semantic breakpoints and can also enforce min/max sizes:
- Oversized chunks are re-split with RecursiveCharacterTextSplitter (auto-provisioned when max > min).
- Trailing undersized chunks are rebalanced using sentence-aware splitting (NLTK Punkt when available, regex fallback otherwise) to avoid tiny tails while respecting [min, max].
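
The size handling described above can be sketched as follows. This is a simplified, character-based stand-in: the real chunker is described as using RecursiveCharacterTextSplitter and sentence-aware splitting, neither of which is reproduced here. The function name `enforce_sizes` is hypothetical, and the sketch assumes `max_size >= 2 * min_size` (true for the documented defaults 1000 and 200) so that a rebalanced pair stays within [min, max].

```python
from typing import List


def enforce_sizes(chunks: List[str], min_size: int, max_size: int) -> List[str]:
    """Re-split oversized chunks and rebalance an undersized trailing chunk."""
    sized: List[str] = []
    for chunk in chunks:
        # Oversized chunks are cut into windows of at most max_size characters.
        for start in range(0, len(chunk), max_size):
            sized.append(chunk[start:start + max_size])

    # A tiny tail is merged into its predecessor, or the pair is split evenly
    # when the merged text would exceed max_size.
    if len(sized) >= 2 and len(sized[-1]) < min_size:
        tail = sized.pop()
        previous = sized.pop()
        combined = previous + tail
        if len(combined) <= max_size:
            sized.append(combined)
        else:
            middle = len(combined) // 2
            sized.extend([combined[:middle], combined[middle:]])
    return sized


balanced = enforce_sizes(["a" * 1000, "b" * 50], min_size=200, max_size=1000)
```

In this example the 50-character tail cannot be merged into the full 1000-character chunk, so the two are rebalanced into two chunks of 525 characters each.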

Embeddings backend for semantic chunking

When CHUNKER_CLASS_TYPE_CHUNKER_TYPE is set to semantic, the dependency container selects embeddings using EmbedderClassTypeSettings. Configure the backend via:

EMBEDDER_CLASS_TYPE_EMBEDDER_TYPE: choose one of stackit, ollama.

Backend-specific options:

STACKIT embeddings (production default)
- STACKIT_EMBEDDER_MODEL
- STACKIT_EMBEDDER_BASE_URL
- STACKIT_EMBEDDER_API_KEY (required)
- Optional retry overrides: STACKIT_EMBEDDER_MAX_RETRIES, STACKIT_EMBEDDER_RETRY_BASE_DELAY, STACKIT_EMBEDDER_RETRY_MAX_DELAY, STACKIT_EMBEDDER_BACKOFF_FACTOR, STACKIT_EMBEDDER_ATTEMPT_CAP, STACKIT_EMBEDDER_JITTER_MIN, STACKIT_EMBEDDER_JITTER_MAX
Ollama embeddings (self-hosted)
- OLLAMA_EMBEDDER_MODEL
- OLLAMA_EMBEDDER_BASE_URL

In the Helm chart set CHUNKER_* keys under adminBackend.envs.chunker. The admin deployment reuses the embedder config maps from the backend release, so adjust backend.envs.embedderClassTypes, backend.envs.stackitEmbedder, backend.envs.ollamaEmbedder, or backend.envs.fakeEmbedder accordingly when switching embeddings for semantic chunking.

2.5 Summarizer retry behavior

The default summarizer implementation (LangchainSummarizer) now uses the shared retry decorator with exponential backoff from the rag-core-lib.

Decorator: rag_core_lib.impl.utils.retry_decorator.retry_with_backoff
Base settings (fallback): RetryDecoratorSettings
Per-summarizer overrides: SummarizerSettings

How it resolves settings

Each field in SummarizerSettings is optional. When a field is provided (not None), it overrides the corresponding value from RetryDecoratorSettings.
When a field is not provided (None), the summarizer falls back to the value from RetryDecoratorSettings.

Configuring via environment variables

Summarizer-specific (prefix SUMMARIZER_):
- SUMMARIZER_MAX_RETRIES
- SUMMARIZER_RETRY_BASE_DELAY
- SUMMARIZER_RETRY_MAX_DELAY
- SUMMARIZER_BACKOFF_FACTOR
- SUMMARIZER_ATTEMPT_CAP
- SUMMARIZER_JITTER_MIN
- SUMMARIZER_JITTER_MAX
Global fallback (prefix RETRY_DECORATOR_): see section 4.2 for all keys and defaults.
Helm chart: set the same keys under adminBackend.envs.summarizer in infrastructure/rag/values.yaml.

3. Extractor API Lib

The Extractor Library contains components that provide document parsing capabilities for various file formats and web sources. It supports extracting content from PDF, DOCX, and XML files, as well as web pages via sitemaps and Confluence pages. It also includes a default dependency_container that is pre-configured and is a good starting point for most use cases. This API should not be exposed by ingress and should only be used internally.

The following endpoints are provided by the extractor-api-lib:

/extract_from_file: This endpoint extracts the information from files.
/extract_from_source: This endpoint extracts the information from a non-file source.

3.1 Requirements

All required Python libraries can be found in the pyproject.toml file. In addition to Python libraries, the following system packages are required:

build-essential
make
ffmpeg
poppler-utils
tesseract-ocr
tesseract-ocr-deu
tesseract-ocr-eng

3.2 Endpoints

`/extract_from_file`

This endpoint will extract the information from PDF, PPTX, Word, and XML files. It will load the files from the connected storage. The following types of information will be extracted:

TEXT: plain text
TABLE: data in tabular form found in the document

`/extract_from_source`

This endpoint will extract data from a non-file source. The type of information that is extracted will vary depending on the source. Supported sources include confluence for Confluence pages and sitemap for web pages via XML sitemaps. The following types of information can be extracted:

TEXT: plain text
TABLE: data in tabular form found in the document
IMAGE: image found in the document

For Confluence sources, provide the instance url and API token and include either a space_key or a cql filter (empty values are ignored). Optional flags such as include_attachments, keep_markdown_format, and keep_newlines mirror the parameters supported by LangChain's ConfluenceLoader.
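
The rule above (URL and token required, plus either space_key or cql, with empty values ignored) could be checked client-side roughly like this. The key names `url` and `token` are assumptions for illustration, since the README only names `space_key` and `cql` explicitly, and `validate_confluence_params` is a hypothetical helper rather than part of the library.

```python
from typing import Dict, Optional


def validate_confluence_params(params: Dict[str, Optional[str]]) -> Dict[str, str]:
    """Drop empty values, then require url, token, and space_key or cql."""
    # Empty or whitespace-only values are treated as if they were not provided.
    cleaned = {
        key: value.strip()
        for key, value in params.items()
        if value is not None and value.strip()
    }
    missing = [key for key in ("url", "token") if key not in cleaned]
    if missing:
        raise ValueError(f"missing required parameter(s): {', '.join(missing)}")
    if "space_key" not in cleaned and "cql" not in cleaned:
        raise ValueError("provide either space_key or cql")
    return cleaned


params = validate_confluence_params(
    {"url": "https://example.atlassian.net", "token": "secret", "space_key": "DOCS", "cql": ""}
)
```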

For sitemap sources, additional parameters can be provided, e.g.:

web_path: The URL of the XML sitemap to crawl
filter_urls: JSON array of URL patterns to filter pages (optional)
header_template: JSON object for custom HTTP headers (optional)
continue_on_failure: Whether to skip pages that fail to load instead of aborting the crawl (optional, default: true)

Technically, all parameters of the SitemapLoader from LangChain can be provided.

The HTML parsing logic can be tuned via the SITEMAP_PARSER environment variable (default: docusaurus; options: docusaurus, astro, generic). For Helm deployments, set extractor.envs.sitemap.SITEMAP_PARSER in infrastructure/rag/values.yaml. You can also override the parser per upload by passing a sitemap_parser key/value pair (same options) in the /upload_source request (available as a dropdown in the admin frontend).
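
As an illustration of the sitemap parameters, the sketch below assembles them as key/value pairs. The exact request shape of /upload_source is not shown in this README, so the list of `{"key": ..., "value": ...}` objects with string values is an assumption, as is the helper name `sitemap_key_values`; the parameter names and parser options are the ones listed above.

```python
import json
from typing import Dict, List

ALLOWED_PARSERS = {"docusaurus", "astro", "generic"}


def sitemap_key_values(
    web_path: str,
    filter_urls: List[str],
    header_template: Dict[str, str],
    sitemap_parser: str = "docusaurus",
) -> List[Dict[str, str]]:
    """Assemble sitemap parameters as string key/value pairs (assumed shape)."""
    if sitemap_parser not in ALLOWED_PARSERS:
        raise ValueError(f"unknown sitemap parser: {sitemap_parser}")
    return [
        {"key": "web_path", "value": web_path},
        # filter_urls is documented as a JSON array, header_template as a JSON object.
        {"key": "filter_urls", "value": json.dumps(filter_urls)},
        {"key": "header_template", "value": json.dumps(header_template)},
        {"key": "continue_on_failure", "value": "true"},
        {"key": "sitemap_parser", "value": sitemap_parser},
    ]


pairs = sitemap_key_values(
    "https://example.com/sitemap.xml",
    ["https://example.com/docs/.*"],
    {"User-Agent": "rag-crawler"},
    sitemap_parser="generic",
)
```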

3.3 Replaceable parts

Name	Type	Default	Notes
file_service	`extractor_api_lib.file_services.file_service.FileService`	`extractor_api_lib.impl.file_services.s3_service.S3Service`	Handles operations on the connected storage.
database_converter	`extractor_api_lib.table_converter.dataframe_converter.DataframeConverter`	`extractor_api_lib.impl.table_converter.dataframe2markdown.DataFrame2Markdown`	Converts the extracted table from pandas.DataFrame to markdown. If you want the table to have another format, this would need to be adjusted.
pdf_extractor	`extractor_api_lib.extractors.information_file_extractor.InformationFileExtractor`	`extractor_api_lib.impl.extractors.file_extractors.pdf_extractor.PDFExtractor`	Extractor used for extracting information from PDF documents.
ms_docs_extractor	`extractor_api_lib.extractors.information_file_extractor.InformationFileExtractor`	`extractor_api_lib.impl.extractors.file_extractors.ms_docs_extractor.MSDocsExtractor`	Extractor used for extracting information from Microsoft Documents like *.docx, etc.
xml_extractor	`extractor_api_lib.extractors.information_file_extractor.InformationFileExtractor`	`extractor_api_lib.impl.extractors.file_extractors.xml_extractor.XMLExtractor`	Extractor used for extracting content from XML documents.
epub_extractor	`extractor_api_lib.extractors.information_file_extractor.InformationFileExtractor`	`extractor_api_lib.impl.extractors.file_extractors.epub_extractor.EPUBExtractor`	Extractor used for extracting content from EPUB documents.
file_extractors	`dependency_injector.providers.List[extractor_api_lib.extractors.information_file_extractor.InformationFileExtractor]`	`dependency_injector.providers.List(pdf_extractor, ms_docs_extractor, xml_extractor)`	List of all available file extractors. If you add a new type of file extractor you would have to add it to this list.
intern2external	`extractor_api_lib.impl.mapper.internal2external_information_piece.Internal2ExternalInformationPiece`	`extractor_api_lib.impl.mapper.internal2external_information_piece.Internal2ExternalInformationPiece`	Maps internal information pieces to external information pieces, converting between internal and external content types.
confluence_document2information_piece	`extractor_api_lib.mapper.source_langchain_document2information_piece.SourceLangchainDocument2InformationPiece`	`extractor_api_lib.impl.mapper.confluence_langchain_document2information_piece.ConfluenceLangchainDocument2InformationPiece`	Maps LangChain documents from Confluence to information pieces with Confluence-specific metadata handling.
sitemap_document2information_piece	`extractor_api_lib.mapper.source_langchain_document2information_piece.SourceLangchainDocument2InformationPiece`	`extractor_api_lib.impl.mapper.sitemap_document2information_piece.SitemapLangchainDocument2InformationPiece`	Maps LangChain documents from sitemap sources to information pieces with sitemap-specific metadata handling.
general_file_extractor	`extractor_api_lib.api_endpoints.file_extractor.FileExtractor`	`extractor_api_lib.impl.api_endpoints.general_file_extractor.GeneralFileExtractor`	Combines multiple file extractors and decides which one to use for the given file format.
confluence_extractor	`extractor_api_lib.extractors.information_extractor.InformationExtractor`	`extractor_api_lib.impl.extractors.confluence_extractor.ConfluenceExtractor`	Implementation of an extractor for the source `confluence`.
sitemap_extractor	`extractor_api_lib.extractors.information_extractor.InformationExtractor`	`extractor_api_lib.impl.extractors.sitemap_extractor.SitemapExtractor`	Implementation of an extractor for the source `sitemap`. Supports XML sitemap crawling with configurable parameters including URL filtering, custom headers, and crawling depth. Uses LangChain's SitemapLoader with support for custom parsing and meta functions via dependency injection.
sitemap_parsing_function	`dependency_injector.providers.Factory[Callable]`	`extractor_api_lib.impl.utils.sitemap_extractor_utils.custom_sitemap_parser_function`	Custom parsing function for sitemap content extraction. Used by the sitemap extractor to parse HTML content from web pages. Can be replaced to customize how web page content is processed and extracted.
sitemap_meta_function	`dependency_injector.providers.Factory[Callable]`	`extractor_api_lib.impl.utils.sitemap_extractor_utils.custom_sitemap_metadata_parser_function`	Custom meta function for sitemap content processing. Used by the sitemap extractor to extract metadata from web pages. Can be replaced to customize how metadata is extracted and structured from web content.
source_extractor	`extractor_api_lib.api_endpoints.source_extractor.SourceExtractor`	`extractor_api_lib.impl.api_endpoints.general_source_extractor.GeneralSourceExtractor`	Implementation of the `/extract_from_source` endpoint. Selects the correct extractor for the source and handles the available extractors for confluence and sitemap sources.

4. RAG Core Lib

The rag-core-lib contains components of the rag-core-api that are also useful for other services and are therefore packaged in a way that makes them easy to reuse. Examples of included components:

tracing for LangChain chains using Langfuse
settings for multiple LLMs and Langfuse
factory for LLMs
ContentType enum of the Documents.
...

4.1 Requirements

All required Python libraries can be found in the pyproject.toml file. In addition to Python libraries, the following system packages are required:

build-essential
make

4.2 Retry decorator (exponential backoff)

The rag-core-lib provides a reusable retry decorator with exponential backoff and rate‑limit awareness for both sync and async functions.

Module: rag_core_lib.impl.utils.retry_decorator.retry_with_backoff
Settings: rag_core_lib.impl.settings.retry_decorator_settings.RetryDecoratorSettings
Works with: synchronous and asynchronous callables
Rate-limit aware: optionally inspects HTTP status 429 and headers like x-ratelimit-reset-requests / x-ratelimit-reset-tokens

Usage example

```python
from rag_core_lib.impl.utils.retry_decorator import retry_with_backoff
from rag_core_lib.impl.settings.retry_decorator_settings import RetryDecoratorSettings

# Configure via code (env vars also supported, see below)
settings = RetryDecoratorSettings(
    max_retries=3,
    retry_base_delay=0.2,
)

@retry_with_backoff(settings=settings)
def fetch_something():
    return "ok"

@retry_with_backoff(settings=settings)
async def fetch_async_something():
    return "ok"
```

Configuration

Environment variables (prefix RETRY_DECORATOR_):
- RETRY_DECORATOR_MAX_RETRIES (default: 5)
- RETRY_DECORATOR_RETRY_BASE_DELAY (default: 0.5)
- RETRY_DECORATOR_RETRY_MAX_DELAY (default: 600)
- RETRY_DECORATOR_BACKOFF_FACTOR (default: 2)
- RETRY_DECORATOR_ATTEMPT_CAP (default: 6)
- RETRY_DECORATOR_JITTER_MIN (default: 0.05)
- RETRY_DECORATOR_JITTER_MAX (default: 0.25)
Helm chart (shared values): set the same keys under shared.envs.retryDecorator in infrastructure/rag/values.yaml to apply cluster‑wide defaults for backend/admin services.
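
To give a feel for how these values might interact, here is a sketch of a capped exponential backoff with additive jitter. The formula is an assumption: the README lists the settings but not how the decorator combines them, and `backoff_delay` is a hypothetical function. The defaults are the ones documented above.

```python
import random


def backoff_delay(
    attempt: int,
    base_delay: float = 0.5,
    backoff_factor: float = 2.0,
    attempt_cap: int = 6,
    max_delay: float = 600.0,
    jitter_min: float = 0.05,
    jitter_max: float = 0.25,
) -> float:
    """Seconds to wait before retry `attempt` (0-based) under the assumed formula."""
    # Assumption: the exponent stops growing once attempt_cap is reached.
    exponent = min(attempt, attempt_cap)
    # Assumption: the exponential delay is clamped to max_delay before jitter is added.
    delay = min(base_delay * backoff_factor ** exponent, max_delay)
    return delay + random.uniform(jitter_min, jitter_max)
```

Under these assumptions and the default values, the delay before jitter grows as 0.5, 1, 2, 4, 8, 16, 32 seconds and then stays at 32 seconds, because the exponent is capped at 6.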

Advanced

Customize which exceptions trigger retries via exceptions and rate_limit_exceptions parameters of retry_with_backoff().
Header‑based wait: When rate‑limited, the decorator will honor reset headers if present and add jitter.
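
For readers curious how one decorator can serve both sync and async callables while retrying only selected exception types, the following is a minimal sketch. It is not the rag-core-lib implementation: `simple_retry` and its parameters are hypothetical, `max_retries` is assumed to count retries after the first attempt, and rate-limit header handling and jitter are left out.

```python
import asyncio
import functools
import inspect
import time
from typing import Callable, Tuple, Type


def simple_retry(
    max_retries: int = 3,
    delay: float = 0.0,
    exceptions: Tuple[Type[BaseException], ...] = (Exception,),
) -> Callable:
    """Retry sync or async callables on the given exception types (illustrative)."""

    def decorator(func: Callable) -> Callable:
        if inspect.iscoroutinefunction(func):

            @functools.wraps(func)
            async def async_wrapper(*args, **kwargs):
                for attempt in range(max_retries + 1):
                    try:
                        return await func(*args, **kwargs)
                    except exceptions:
                        if attempt == max_retries:
                            raise  # retries exhausted, surface the last error
                        await asyncio.sleep(delay * 2 ** attempt)

            return async_wrapper

        @functools.wraps(func)
        def sync_wrapper(*args, **kwargs):
            for attempt in range(max_retries + 1):
                try:
                    return func(*args, **kwargs)
                except exceptions:
                    if attempt == max_retries:
                        raise  # retries exhausted, surface the last error
                    time.sleep(delay * 2 ** attempt)

        return sync_wrapper

    return decorator


calls = {"n": 0}


@simple_retry(max_retries=2, exceptions=(ConnectionError,))
def flaky() -> str:
    """Fail twice with a retryable error, then succeed."""
    calls["n"] += 1
    if calls["n"] < 3:
        raise ConnectionError("temporary failure")
    return "ok"
```

Exceptions that are not listed in `exceptions` propagate immediately without a retry, which is the behavior you would typically want for programming errors.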

For more examples, see tests in ./rag-core-lib/tests/retry_decorator_test.py.
