feat: support multiple text embedding providers #76
Conversation
Thanks for the new addition! A couple of comments.
packages/backend/pyproject.toml (outdated)
In _projection_for_texts, we have a caching mechanism that computes a hash from the parameters and uses that hash as the cache filename. It seems we are not taking all text projector parameters into account (e.g., the text projector type, dimensions). Could you update the code to include the text projector type (as a string) and its args in the hasher, and increment the "version" number?
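For illustration, a minimal sketch of the kind of versioned cache key described above; the function name _projection_cache_key, the CACHE_VERSION constant, and the parameter names are hypothetical, not the project's actual API:

```python
# Hypothetical sketch of a versioned cache key that folds in the projector
# type and its args; _projection_cache_key and CACHE_VERSION are made-up names.
import hashlib
import json

CACHE_VERSION = 2  # incremented so entries hashed without projector info are not reused


def _projection_cache_key(texts, projector_type: str, projector_args: dict) -> str:
    hasher = hashlib.sha256()
    hasher.update(str(CACHE_VERSION).encode())
    hasher.update(projector_type.encode())  # e.g. "litellm" or "sentence_transformers"
    # Serialize args deterministically so identical configs always hash the same.
    hasher.update(json.dumps(projector_args, sort_keys=True).encode())
    for text in texts:
        hasher.update(text.encode())
    return hasher.hexdigest()
```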
Of course. Thanks for pointing it out. Addressed this in 823a82b.
Summary
Adds support for multiple text embedding providers through LiteLLM integration, enabling users to leverage API-based models (OpenAI, Cohere, Azure, Ollama, etc.) alongside the existing local SentenceTransformers approach.
The compute_text_projection() API remains 100% backward compatible; new parameters are optional.
Changes
- Added a --text-projector flag to route between litellm and sentence_transformers, and exposed --api-key, --api-base, --dimensions, and --sync flags for LiteLLM-specific configuration
- Introduced a TextProjectorCallback type and provider-specific implementations (_project_text_with_sentence_transformers, _project_text_with_litellm) so that we can keep benefiting from the existing caching approach regardless of the model used to compute the projections; see the sketch below
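As a rough illustration of how such a callback type and routing could look (the signatures, parameter names, and the get_text_projector helper are assumptions for this sketch, not the PR's actual code):

```python
# Illustrative sketch only: the callback type alias and the routing helper
# get_text_projector are assumptions; the PR's actual signatures may differ.
from typing import Callable, Optional, Sequence

import numpy as np

# A text projector maps a batch of texts to an (n_texts, n_dims) embedding matrix.
TextProjectorCallback = Callable[[Sequence[str]], np.ndarray]


def _project_text_with_sentence_transformers(model_name: str) -> TextProjectorCallback:
    from sentence_transformers import SentenceTransformer

    model = SentenceTransformer(model_name)
    return lambda texts: model.encode(list(texts))


def _project_text_with_litellm(
    model_name: str,
    api_key: Optional[str] = None,
    api_base: Optional[str] = None,
    dimensions: Optional[int] = None,
) -> TextProjectorCallback:
    import litellm

    def project(texts: Sequence[str]) -> np.ndarray:
        response = litellm.embedding(
            model=model_name,
            input=list(texts),
            api_key=api_key,
            api_base=api_base,
            dimensions=dimensions,
        )
        return np.array([item["embedding"] for item in response.data])

    return project


def get_text_projector(text_projector: str, model_name: str, **litellm_args) -> TextProjectorCallback:
    # Route on the --text-projector value; the caching layer only sees the
    # callback, so it works the same regardless of which provider computed
    # the embeddings.
    if text_projector == "litellm":
        return _project_text_with_litellm(model_name, **litellm_args)
    return _project_text_with_sentence_transformers(model_name)
```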
Testing
Verified in packages/backend/examples/notebook.ipynb with:
- a local Ollama model (nomic-embed-text)
- a hosted OpenAI model (text-embedding-3-small)
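For reference, a hedged sketch of how these two tested configurations map onto LiteLLM's embedding call (model identifiers follow LiteLLM's provider-prefix convention; the notebook presumably goes through compute_text_projection() rather than calling LiteLLM directly):

```python
# Sketch of the two tested setups expressed as direct LiteLLM calls.
import litellm

# Local Ollama model (assumes an Ollama server at its default address).
local = litellm.embedding(
    model="ollama/nomic-embed-text",
    input=["hello world"],
    api_base="http://localhost:11434",
)

# Hosted OpenAI model (reads OPENAI_API_KEY from the environment if api_key is omitted).
hosted = litellm.embedding(
    model="text-embedding-3-small",
    input=["hello world"],
    dimensions=256,  # optional output truncation supported by text-embedding-3 models
)
```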