A Three-Phase Factual Recall Circuit in Gemma-2B and Gemma-12B-IT
Researchers identify a three-phase mechanism for factual recall in Google's Gemma-2B and Gemma-12B-IT models using activation patching, revealing how facts are stored and retrieved across transformer layers.

Activation patching reveals how facts are stored, routed, and read out across transformer layers, and why the residual stream does most of the work
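
For readers unfamiliar with activation patching, the idea can be sketched on a toy model. This is a minimal illustration, not the article's code and not Gemma: the three hand-written "layers", the two-position residual stream, and every name below (`run`, `patching_effects`, `LAYERS`, `CLEAN`, `CORRUPTED`) are hypothetical. The sketch runs a clean and a corrupted input, then overwrites one (layer, position) entry of the corrupted residual stream with its clean value and measures how much of the clean output is recovered.

```python
from typing import Callable, List, Optional, Tuple

Stream = List[float]                 # one scalar per token position (toy residual stream)
Layer = Callable[[Stream], Stream]   # a layer returns an update that is ADDED to the stream
Patch = Tuple[int, int, float]       # (layer index, position, value to write)


def run(layers: List[Layer], tokens: Stream,
        patch: Optional[Patch] = None) -> Tuple[Stream, List[Stream]]:
    """Run the toy model; return the final stream and the stream after each layer."""
    stream = list(tokens)
    cache: List[Stream] = []
    for i, layer in enumerate(layers):
        update = layer(stream)
        stream = [s + u for s, u in zip(stream, update)]  # residual connection
        if patch is not None and patch[0] == i:
            _, pos, value = patch
            stream[pos] = value  # overwrite one activation with the patched value
        cache.append(list(stream))
    return stream, cache


def patching_effects(layers: List[Layer], clean: Stream, corrupted: Stream,
                     metric: Callable[[Stream], float]) -> List[List[float]]:
    """Fraction of the clean-vs-corrupted metric gap restored by each patch."""
    clean_out, clean_cache = run(layers, clean)
    corrupted_out, _ = run(layers, corrupted)
    target, baseline = metric(clean_out), metric(corrupted_out)
    effects: List[List[float]] = []
    for i in range(len(layers)):
        row = []
        for pos in range(len(clean)):
            patched_out, _ = run(layers, corrupted, (i, pos, clean_cache[i][pos]))
            row.append((metric(patched_out) - baseline) / (target - baseline))
        effects.append(row)
    return effects


# Hypothetical layers, loosely mimicking "store, route, read out" (an assumption):
LAYERS: List[Layer] = [
    lambda s: [0.5 * s[0], 0.0],  # enrich the subject position (position 0)
    lambda s: [0.0, s[0]],        # copy subject information to the last position
    lambda s: [0.0, 2.0 * s[1]],  # amplify the answer at the last position
]
CLEAN: Stream = [2.0, 0.0]        # stands in for the prompt with the correct subject
CORRUPTED: Stream = [1.0, 0.0]    # stands in for the prompt with a swapped subject

# The metric reads the last position, as a logit difference would in a real model.
EFFECTS = patching_effects(LAYERS, CLEAN, CORRUPTED, metric=lambda s: s[1])
for layer_index, row in enumerate(EFFECTS):
    print(layer_index, row)
# 0 [1.0, 0.0]
# 1 [0.0, 1.0]
# 2 [0.0, 1.0]
```

In this toy, patching the subject position restores the output only at the first layer, and patching the last position restores it from the second layer onward, which is the kind of layer-by-position pattern a patching sweep is meant to expose. In a real model the same loop would typically be implemented with forward hooks on the residual stream rather than a hand-rolled `run` function.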
The post A Three-Phase Factual Recall Circuit in Gemma-2B and Gemma-12B-IT appeared first on Towards Data Science.
Summary and analysis generated by AI (mistral). Always verify against the original sources.