Logit Lens — What would the model predict at each layer?
Each cell shows the model's top prediction if it stopped thinking at that layer. Color intensity = confidence in the final answer.
Last-Token Evolution
How the prediction for the last token position changes as information flows through layers.