Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

This repo is a tiny language model mimicking LLMs. I would like to create LLM-like model from scratch. To some degree LLM could also be abbreviated to little language model. :D

License

Notifications You must be signed in to change notification settings

BreezeShane/LittleLanguageModel

Folders and files

NameName
Last commit message
Last commit date

Latest commit

History

9 Commits

Repository files navigation

LittleLanguageModel

This repo is a tiny language model mimicking LLMs. I would like to create LLM-like model from scratch. To some degree LLM could also be abbreviated to little language model. :D

Todolist

  • Collect Data
  • Clean & Augment Data
  • Make Tokenizer
  • Make Model Based on Transformer
  • Pretrain Model
  • LLM Alignment

Optional

  • Transfer to downstream tasks.
  • SFT Training
  • Distribute Data Parallel
  • Distill model
  • RAG
  • Agent

Recommend to read

  1. LLM数据工程3——数据收集魔法:获取顶级训练数据的方法
  2. 掌握 LLM 技术:数据预处理

About

This repo is a tiny language model mimicking LLMs. I would like to create LLM-like model from scratch. To some degree LLM could also be abbreviated to little language model. :D

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

AltStyle によって変換されたページ (->オリジナル) /