ruml

The goal of this project is to implement a tiny, inference-only library for running ML models. I want it to be something along the lines of ggml and tinygrad.

The idea is to support different optimized backends (a rough sketch of the backend abstraction follows this list):

  • Accelerate
  • AVX
  • OpenBLAS
  • cuBLAS (not sure about this one yet)
  • naive CPU-only (fallback)
  • etc.
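
To make the backend list concrete, here is one way the dispatch could be structured in Rust. This is only a sketch under my own assumptions: the `Backend` trait, the `NaiveCpu` struct, and the op signatures are hypothetical names and do not exist in the repo yet.

```rust
/// Hypothetical backend abstraction: each optimized backend (AVX,
/// OpenBLAS, ...) would implement this trait over raw f32 buffers.
pub trait Backend {
    /// Element-wise add of two equally-sized buffers into `out`.
    fn add(&self, a: &[f32], b: &[f32], out: &mut [f32]);
    /// Row-major matmul: (m x k) * (k x n) -> (m x n).
    fn matmul(&self, a: &[f32], b: &[f32], out: &mut [f32], m: usize, k: usize, n: usize);
}

/// The naive CPU fallback: plain loops, no SIMD, no BLAS.
pub struct NaiveCpu;

impl Backend for NaiveCpu {
    fn add(&self, a: &[f32], b: &[f32], out: &mut [f32]) {
        for ((o, x), y) in out.iter_mut().zip(a).zip(b) {
            *o = x + y;
        }
    }

    fn matmul(&self, a: &[f32], b: &[f32], out: &mut [f32], m: usize, k: usize, n: usize) {
        for i in 0..m {
            for j in 0..n {
                let mut acc = 0.0f32;
                for p in 0..k {
                    acc += a[i * k + p] * b[p * n + j];
                }
                out[i * n + j] = acc;
            }
        }
    }
}
```

An OpenBLAS or AVX backend would then implement the same trait, and the library could pick one at build time via Cargo features or at runtime by probing the CPU.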

The roadmap right now is more or less like this:

  • implement a minimal tensor type with support for broadcasting and dynamic shapes (see the shape sketch after this list)
  • implement a CPU-only backend and write tests for the different ops
  • implement the other backends
  • support fp16, int8, and quantization (a symmetric int8 sketch also follows)
  • build a demo of the library running LLaMA or a similar model
  • would also like this to work on vision models like Segment Anything, ResNet, etc.
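
For the first roadmap item, here is a minimal sketch of what a dynamically-shaped tensor with NumPy-style broadcasting could look like. `Tensor` and `broadcast_shape` are hypothetical names, not code from this repo.

```rust
/// Minimal tensor: flat f32 storage plus a dynamic shape.
#[derive(Debug, Clone)]
pub struct Tensor {
    pub data: Vec<f32>,
    pub shape: Vec<usize>, // dynamic rank, e.g. [2, 3] or [4, 1, 5]
}

/// NumPy-style broadcast of two shapes: align from the trailing
/// dimension; each pair of dims must match or one of them must be 1.
pub fn broadcast_shape(a: &[usize], b: &[usize]) -> Option<Vec<usize>> {
    let rank = a.len().max(b.len());
    let mut out = vec![0; rank];
    for i in 0..rank {
        let da = if i < a.len() { a[a.len() - 1 - i] } else { 1 };
        let db = if i < b.len() { b[b.len() - 1 - i] } else { 1 };
        out[rank - 1 - i] = match (da, db) {
            (x, y) if x == y => x,
            (1, y) => y,
            (x, 1) => x,
            _ => return None, // incompatible shapes
        };
    }
    Some(out)
}
```

For example, `broadcast_shape(&[4, 1, 5], &[3, 1])` yields `Some(vec![4, 3, 5])`: the shapes are aligned from the trailing dimension, and any dimension of 1 stretches to match the other.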
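
And for the quantization item, a sketch of symmetric per-block int8 quantization, the kind of scheme ggml-style libraries commonly use: one f32 scale per block, with values mapped so the largest magnitude lands on 127. Again, these names are hypothetical.

```rust
/// Hypothetical symmetric int8 block: one f32 scale per block of values.
pub struct QuantizedBlock {
    pub scale: f32,
    pub values: Vec<i8>,
}

pub fn quantize_i8(xs: &[f32]) -> QuantizedBlock {
    // The scale maps the largest-magnitude value in the block to 127.
    let max_abs = xs.iter().fold(0.0f32, |m, &x| m.max(x.abs()));
    let scale = if max_abs == 0.0 { 1.0 } else { max_abs / 127.0 };
    let values: Vec<i8> = xs.iter().map(|&x| (x / scale).round() as i8).collect();
    QuantizedBlock { scale, values }
}

pub fn dequantize_i8(q: &QuantizedBlock) -> Vec<f32> {
    // Lossy inverse: recover approximate f32 values from int8 + scale.
    q.values.iter().map(|&v| v as f32 * q.scale).collect()
}
```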
