Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Hareesh108/ai

Folders and files

NameName
Last commit message
Last commit date

Latest commit

History

13 Commits

Repository files navigation

🧠 AI – Experiments & Practice

This repository is dedicated to my AI/ML learning journey and practice.
Here, I’ll be pushing code, experiments, notes, and resources related to Artificial Intelligence, Deep Learning, and Large Language Models (LLMs).


πŸš€ What’s Inside?

  • πŸ”¬ Practice code for AI/ML/Deep Learning
  • πŸ“š Notes on Transformer architecture & modern NLP techniques
  • 🧩 Implementations of key AI concepts (Tokenization, Embeddings, Attention, etc.)
  • πŸ—οΈ Experiments with state-of-the-art LLMs

πŸ”— Top 10 Popular LLMs

Here are some of the most important LLMs you can explore:

  1. GPT-4 (OpenAI)
  2. Claude 3 (Anthropic)
  3. Gemini (Google DeepMind)
  4. LLaMA 3 (Meta AI)
  5. Mistral (Mistral AI)
  6. Falcon (TII)
  7. Cohere Command R
  8. Jurassic-2 (AI21 Labs)
  9. Mixtral (Mistral AI, MoE)
  10. DeepSeek LLM

πŸ”₯ Current Trends in LLMs

The field of LLMs is rapidly evolving. Some of the latest research and engineering trends include:

  1. Reasoning (RLVR, GRPO)

    • RLVR (Reinforcement Learning via Verifiable Rewards): ensures models get rewarded for provably correct reasoning steps.
    • GRPO (Generalized Reinforcement Policy Optimization): an advanced RL method for improving structured reasoning.
  2. MoEs (Mixture of Experts)

    • Models with multiple expert subnetworks, where only a subset activates per query.
    • Enables efficiency and scalability.
    • Example: Mixtral of Experts.
  3. Tool Use

    • LLMs can call external tools (APIs, calculators, databases, code execution).
    • Moves LLMs from static text generators to active AI agents.
  4. Multi-Head Latent Attention

    • Extends multi-head attention to focus on latent (hidden) representations.
    • Enhances reasoning depth and compression of long contexts.

πŸ“š References


πŸ› οΈ Author

Hareesh Bhittam
Full-stack developer exploring AI & LLMs.

About

πŸš€ This repository is dedicated to my AI/ML learning journey and practice.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

Contributors

AltStyle γ«γ‚ˆγ£γ¦ε€‰ζ›γ•γ‚ŒγŸγƒšγƒΌγ‚Έ (->γ‚ͺγƒͺγ‚ΈγƒŠγƒ«) /