This repository is dedicated to my AI/ML learning journey and practice.
Here, Iβll be pushing code, experiments, notes, and resources related to Artificial Intelligence, Deep Learning, and Large Language Models (LLMs).
- π¬ Practice code for AI/ML/Deep Learning
- π Notes on Transformer architecture & modern NLP techniques
- π§© Implementations of key AI concepts (Tokenization, Embeddings, Attention, etc.)
- ποΈ Experiments with state-of-the-art LLMs
Here are some of the most important LLMs you can explore:
- GPT-4 (OpenAI)
- Claude 3 (Anthropic)
- Gemini (Google DeepMind)
- LLaMA 3 (Meta AI)
- Mistral (Mistral AI)
- Falcon (TII)
- Cohere Command R
- Jurassic-2 (AI21 Labs)
- Mixtral (Mistral AI, MoE)
- DeepSeek LLM
The field of LLMs is rapidly evolving. Some of the latest research and engineering trends include:
-
Reasoning (RLVR, GRPO)
- RLVR (Reinforcement Learning via Verifiable Rewards): ensures models get rewarded for provably correct reasoning steps.
- GRPO (Generalized Reinforcement Policy Optimization): an advanced RL method for improving structured reasoning.
-
MoEs (Mixture of Experts)
- Models with multiple expert subnetworks, where only a subset activates per query.
- Enables efficiency and scalability.
- Example: Mixtral of Experts.
-
Tool Use
- LLMs can call external tools (APIs, calculators, databases, code execution).
- Moves LLMs from static text generators to active AI agents.
-
Multi-Head Latent Attention
- Extends multi-head attention to focus on latent (hidden) representations.
- Enhances reasoning depth and compression of long contexts.
Hareesh Bhittam
Full-stack developer exploring AI & LLMs.