Progress measures for grokking via mechanistic interpretability

Mentions

Yo Shavit @yonashav · Mar 13, 2023

From Twitter

If this interests you, you should read this paper. IMO the most in-depth proof that after sufficient training, NNs learn simple, parsimonious, highly-general rules, even in the absence of overwhelming data.

Paper May 2, 2023

Progress measures for grokking via mechanistic interpretability

by Neel Nanda (at ICLR)

Recommended by 1 person

1 mention