Mission: Create an open source finetuning dataset that allows anyone to train AI models to engage in "test-time compute" or chain-of-thought reasoning.
For starters, OpenAI's GPT-4o finetuning needs only 50 to 100 samples to get started. We can demonstrate success simply by improving on what the base model does. For instance, if base GPT-4o scores 25% on a given benchmark and a simple finetuning job boosts that to 32%, that is a solid proof of concept. Our goal, then, is to produce a statistically significant lift in performance.
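To check whether a lift like 25% → 32% is actually statistically significant (and not benchmark noise), a standard two-proportion z-test works. Below is a minimal sketch using only the Python standard library; the benchmark size of 1,000 questions is a hypothetical assumption for illustration.

```python
from math import sqrt, erfc

def two_proportion_z_test(x1: int, n1: int, x2: int, n2: int):
    """Two-sided z-test for a difference between two accuracy rates.

    x1/n1: correct/total for the base model
    x2/n2: correct/total for the finetuned model
    """
    p1, p2 = x1 / n1, x2 / n2
    # Pooled proportion under the null hypothesis (no real difference)
    p_pool = (x1 + x2) / (n1 + n2)
    se = sqrt(p_pool * (1 - p_pool) * (1 / n1 + 1 / n2))
    z = (p2 - p1) / se
    p_value = erfc(abs(z) / sqrt(2))  # two-sided p-value from the normal CDF
    return z, p_value

# Hypothetical 1,000-question benchmark: base 25% vs finetuned 32%
z, p = two_proportion_z_test(250, 1000, 320, 1000)
```

On a benchmark of that size, a 7-point lift is comfortably significant; on a much smaller benchmark (say 100 questions) the same lift could easily be noise, which is worth keeping in mind when picking an evaluation set.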
- We need high-quality queries that would be realistic from a user's perspective. These can be synthesized easily enough.
- We need high-quality chains of thought (reasoning) as the answers. Ideally they would arrive at correct answers, but most importantly we need to train the model to use reasoning.
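The two ingredients above pair naturally in OpenAI's chat-format finetuning JSONL, where each line is a query plus a reasoning-first assistant reply. Below is a minimal sketch of writing one such record; the query and chain of thought shown are hypothetical placeholders, not real dataset content.

```python
import json

# Hypothetical query/reasoning pair; a real dataset would synthesize
# 50-100+ of these, each with a step-by-step chain of thought.
sample = {
    "messages": [
        {
            "role": "user",
            "content": "A train travels 120 miles in 2 hours. "
                       "At that speed, how far does it go in 5 hours?",
        },
        {
            "role": "assistant",
            "content": "Let me reason step by step. "
                       "First, speed = 120 miles / 2 hours = 60 mph. "
                       "Then, distance = 60 mph * 5 hours = 300 miles. "
                       "Answer: 300 miles.",
        },
    ]
}

# One JSON object per line is the format OpenAI's finetuning API expects.
with open("cot_dataset.jsonl", "w") as f:
    f.write(json.dumps(sample) + "\n")
```

Keeping the reasoning in the assistant message itself (rather than a separate field) is what teaches the model to emit a chain of thought before its final answer.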