Separating infinity server capability from reranking capability #505

@noamgershi

Feature request

Hi,
I would like to suggest splitting this repo into its reranking feature and its FastAPI server feature, with the reranking feature published as a separate PyPI package (other than infinity_emb).

I would like a PyPI package that contains only the reranking/embedding/etc. features, without the FastAPI server.
That is, only the contents of engine.py: I would create the AsyncEmbeddingEngine myself and call its rerank() method directly, without infinity_server.py being present at all.
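
For illustration, here is a minimal sketch of the usage described above. It assumes the `AsyncEmbeddingEngine.from_args(EngineArgs(...))` constructor and the `async with engine` / `await engine.rerank(query=..., docs=...)` pattern from the infinity_emb README. The helper names `rerank_without_server`, `build_engine`, and `demo`, as well as the model name, are hypothetical choices for this example, and the exact shape of the value returned by `rerank()` may differ between versions.

```python
import asyncio
from typing import Any, Sequence


async def rerank_without_server(engine: Any, query: str, docs: Sequence[str]) -> Any:
    """Call rerank() on an already-constructed engine; no FastAPI app is created."""
    # Entering the context starts the engine, leaving it stops the engine.
    async with engine:
        # Assumption: rerank() returns a (ranking, usage) pair, as in the README.
        ranking, _usage = await engine.rerank(query=query, docs=list(docs))
    return ranking


def build_engine(model_name: str) -> Any:
    """Hypothetical helper: build the engine only, without touching the server module."""
    # Imported lazily so the helper above can be exercised with a stub engine.
    from infinity_emb import AsyncEmbeddingEngine, EngineArgs

    return AsyncEmbeddingEngine.from_args(EngineArgs(model_name_or_path=model_name))


async def demo() -> None:
    # Illustrative model name; any reranker supported by the library should work.
    engine = build_engine("mixedbread-ai/mxbai-rerank-xsmall-v1")
    docs = ["A snake.", "A programming language."]
    ranking = await rerank_without_server(engine, "What is Python?", docs)
    print(ranking)
```

Running `asyncio.run(demo())` would exercise this path. With the current packaging, this still installs whatever infinity_emb depends on, which is why a separate package is being requested.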

Motivation

Basically, I don't want features such as sending StartupTelemetry to PostHog at all, and I also want control over how the server is created and which parameters it uses.

Your contribution

Not currently
