FunctionGemma released, a model tuned for function calling! Learn more

Gemma Scope

Analyze Gemma 3 with Gemma Scope 2

Gemma Scope 2 is a comprehensive, open suite of interpretability tools designed for the Gemma 3 model collection. This tool allows you to examine the behavior of individual layers. It allows researchers to analyze complex language model behaviors and debug emergent behaviors such as jailbreaks or hallucinations.

This toolkit acts as a microscope for the model, providing Sparse Autoencoders (SAEs) and Transcoders trained on every layer of the Gemma 3 family.

Looking for the previous version?
The original Gemma Scope (for Gemma 2) remains available for researchers working with the Gemma 2 family of models.

  • Model behavior evaluation

    Use SAEs and Transcoders to analyze complex internal behaviors and multi-step algorithms in Gemma 3.
  • Chatbot safety & debugging

    Analyze specific chat behaviors, refusal mechanisms, and chain-of-thought faithfulness to build safer AI agents.

Learn more

Read the Gemma Scope 2 Blog

Read about the new architecture, training data, and capabilities of Gemma Scope 2.

Download from Hugging Face

Access the weights, code, and documentation for the Gemma 3 interpretability suite.

Run in Colab

Try the interactive tutorial to visualize features and modify model behavior.

Previous Version (Scope Gemma for Gemma 2)

Access the blog and resources for the original Gemma Scope for Gemma 2.

Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.

Last updated 2025年12月19日 UTC.