MathieuDubart / Llama-cpp-API Public

Notifications You must be signed in to change notification settings
Fork 0
Star 0

A Python API for Llama.cpp, allowing you to fetch routes from devices using Tailscale

MathieuDubart/Llama-cpp-API

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md
server.py		server.py

Repository files navigation

Llama.cpp API

Prerequisites

Setup Tailscale on your devices
Git clone G. Gerganov's Llama.cpp from his Github repository
Download a .gguf model (mistral-7b-instruct-v0.2.Q4_K_M.gguf is recommended)
Place it inside ./models

Starting project

Git clone this repository inside Llama.cpp one's git clone git@github.com:MathieuDubart/Llama-cpp-api.git
Open server.py and change model path to match with your model name
Run python3 server.py in root directory
You can now access your API routes on every linked to Tailscale device, with your hosting device's Tailscale IP (Port 5000)) (e.g: 100.x.x.x:5000/generate)

About

A Python API for Llama.cpp, allowing you to fetch routes from devices using Tailscale

Releases

No releases published

Packages

No packages published

Languages

Python 100.0%

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MathieuDubart/Llama-cpp-API

Folders and files

Latest commit

History

Repository files navigation

Llama.cpp API

Prerequisites

Starting project

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages

Languages

MathieuDubart/Llama-cpp-API

Folders and files

Latest commit

History

Repository files navigation

Llama.cpp API

Prerequisites

Starting project

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages