Gemma Scope 2 is a comprehensive, open suite of interpretability tools designed for the
Gemma 3 model collection. This tool allows you to examine the behavior of
individual layers. It allows researchers to analyze complex language model behaviors and
debug emergent behaviors such as jailbreaks or hallucinations.
This toolkit acts as a microscope for the model, providing Sparse Autoencoders (SAEs)
and Transcoders trained on every layer of the Gemma 3 family.
Looking for the previous version?
The original Gemma Scope (for Gemma 2)
remains available for researchers working with the Gemma 2 family of models.
biotech
Model behavior evaluation
Use SAEs and Transcoders to analyze complex internal behaviors and multi-step algorithms in Gemma 3.
tune
Chatbot safety & debugging
Analyze specific chat behaviors, refusal mechanisms, and chain-of-thought faithfulness to build safer AI agents.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Missing the information I need","missingTheInformationINeed","thumb-down"],["Too complicated / too many steps","tooComplicatedTooManySteps","thumb-down"],["Out of date","outOfDate","thumb-down"],["Samples / code issue","samplesCodeIssue","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025年12月19日 UTC."],[],[]]