Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

esumerfd/ai-models

Folders and files

NameName
Last commit message
Last commit date

Latest commit

History

35 Commits

Repository files navigation

AI Models

AI Models Banner

A collection of language model experiments built from scratch — each one exploring a different technique, architecture, or dataset.

The goal is hands-on understanding: no pre-built pipelines, just raw implementation, training, and deployment.

All content is suspect — this repo is created by someone who knows nothing... yet.


Models

GPT-style causal transformer built from scratch with PyTorch, trained on the STACK construction estimating platform support knowledge base. Implements BPE tokenization, multi-head self-attention, and causal language modelling with a ~5.8M parameter model targeting Raspberry Pi deployment via GGUF/Ollama.

Tokenization

Data Preparation

Architecture


Deployment Target

All models are built to run on a Raspberry Pi cluster using the Raspberry Pi AI HAT+ 2 (Hailo-10H, 10 TOPS).

About

A mono-repo of a number of experiments on how to create language models.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

Contributors

AltStyle によって変換されたページ (->オリジナル) /