carsonpo/haystackdb

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
benches		benches
src		src
tests		tests
.dockerignore		.dockerignore
.gitignore		.gitignore
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md

Repository files navigation

HaystackDB

Minimal but performant Vector DB

Features

Binary embeddings by default (soon int8 reranking)
JSON filtering for queries
Scalable, distributed architecture for use with multi replica deployments
Durable (WAL), persistent data, mem mapped for fast access in the client

Benchmarks

On a MacBook with an M2, 1024 dimension, binary quantized.

FAISS is using a flat index, so brute force, but it's in memory. Haystack is storing the data on disk, and also brute forces.

TLDR is Haystack is ~10x faster despite being stored on disk.

100,000 Vectors
Haystack — 3.44ms
FAISS — 29.67ms
500,000 Vectors
Haystack — 11.98ms
FAISS - 146.50ms
1,000,000 Vectors
Haystack — 22.65ms
FAISS — 293.91ms

Roadmap

Quickstart Guide
Quality benchmarks (this is in progress)
Int8 reranking
~~(削除) Better queries with more than simple equality (削除ここまで)~~ (this is done now)
Full text search
~~(削除) Better insertion performance with batch B+Tree insertion (削除ここまで)~~ (could probably be further improved, but good for now)
~~(削除) Point in time backups/rollback (削除ここまで)~~
- currently this is destructive (ie you cannot return forward after you go backwards), so a nondestructive version is next on the todo list.
Cursor based pagination
Schema migrations
Vector Kmeans clustering with centroid similarity for improved search perf

About

No description, website, or topics provided.

Releases

No releases published

Packages

No packages published

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

License

Uh oh!

carsonpo/haystackdb

Folders and files

Latest commit

History

Repository files navigation

HaystackDB

Features

Benchmarks

Roadmap

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages

Uh oh!

Languages

License

carsonpo/haystackdb

Folders and files

Latest commit

History

Repository files navigation

HaystackDB

Features

Benchmarks

Roadmap

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages