Hello, we're Minish!

About us

We're an open-source lab, with a focus on Natural Language Processing. Minish is currently maintained by @pringled. The lab was originally founded by @pringled and @stephantul.

We believe that if you make models fast enough, you unlock new possibilities.

Using our models and packages, you can:

  • Embed the entire English Wikipedia in 5 minutes
  • Classify tens of thousands of documents per second on a CPU
  • Approximately deduplicate extremely large datasets in minutes
  • Build the fastest RAG application in the world
  • Easily evaluate which ANN algorithm works best for your data

Our projects:

  • model2vec: tiny static embedding models with state-of-the-art performance.
  • potion: the best small models in the world. 100-500x faster than a sentence-transformer, and almost as good.
  • vicinity: consistent interfaces to many approximate nearest neighbor algorithms.
  • semhash: lightning-fast, highly accurate semantic deduplication and filtering for your text datasets.
  • model2vec-rs: a Rust port of model2vec.
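
To illustrate the idea behind static embeddings and semantic deduplication, here is a minimal toy sketch in plain Python. It is not the model2vec or semhash API: the vocabulary `TOY_VECTORS`, the functions `embed`, `cosine` and `deduplicate`, and the `0.9` threshold are all hypothetical, chosen only for illustration. It also uses an exact brute-force comparison rather than an approximate nearest neighbor index.

```python
import math

# Hypothetical toy vocabulary: every token maps to one fixed (static) vector.
TOY_VECTORS = {
    "fast": [1.0, 0.0, 0.0],
    "quick": [0.9, 0.1, 0.0],
    "model": [0.0, 1.0, 0.0],
    "models": [0.0, 0.9, 0.1],
    "slow": [-1.0, 0.0, 0.0],
    "cat": [0.0, 0.0, 1.0],
}
DIM = 3


def embed(text):
    """Mean-pool the static vectors of known tokens; unknown tokens are skipped."""
    vectors = [TOY_VECTORS[t] for t in text.lower().split() if t in TOY_VECTORS]
    if not vectors:
        return [0.0] * DIM
    return [sum(column) / len(vectors) for column in zip(*vectors)]


def cosine(a, b):
    """Cosine similarity; defined as 0.0 when either vector is all zeros."""
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return sum(x * y for x, y in zip(a, b)) / (norm_a * norm_b)


def deduplicate(texts, threshold=0.9):
    """Greedy brute-force dedup: keep a text only if it is below the
    similarity threshold against every text kept so far."""
    kept, kept_vectors = [], []
    for text in texts:
        vector = embed(text)
        if all(cosine(vector, other) < threshold for other in kept_vectors):
            kept.append(text)
            kept_vectors.append(vector)
    return kept


print(deduplicate(["fast model", "quick models", "slow cat"]))
```

Because embedding is only a table lookup followed by an average, no neural network forward pass is needed at inference time, which is where the speed of static models comes from. The pairwise comparison in this sketch scales quadratically, so a real pipeline would replace it with an approximate nearest neighbor index.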

Pinned repositories:

  1. model2vec: Fast State-of-the-Art Static Embeddings (Python, 1.9k stars, 103 forks)
  2. semhash: Fast Semantic Text Deduplication & Filtering (Python, 815 stars, 50 forks)
  3. vicinity: Lightweight Nearest Neighbors with Flexible Backends (Python, 311 stars, 10 forks)
  4. tokenlearn: Pre-train Static Word Embeddings (Python, 87 stars, 8 forks)
  5. model2vec-rs: Official Rust Implementation of Model2Vec (Rust, 138 stars, 12 forks)
