Darker green = more grokking. Grey = no data yet.
Median training steps until test accuracy > 97%. Only shown for N values with at least one grokked model.
| N | Sacrifices | Grokked | Grok % | Avg Train Time (s) | Median Steps to Grok |
|---|---|---|---|---|---|
| 5 | 3 | 0 | 0.0 | 62.23 | 10000.0 |
| 10 | 3 | 0 | 0.0 | 74.91 | 10000.0 |
| 15 | 3 | 0 | 0.0 | 85.72 | 10000.0 |
| 20 | 3 | 0 | 0.0 | 108.34 | 10000.0 |
| 25 | 3 | 0 | 0.0 | 122.66 | 10000.0 |
| 30 | 3 | 0 | 0.0 | 144.80 | 10000.0 |
| 35 | 2 | 0 | 0.0 | 164.88 | 10000.0 |
| 40 | 2 | 0 | 0.0 | 197.88 | 10000.0 |
| 45 | 2 | 0 | 0.0 | 224.43 | 10000.0 |
| 50 | 2 | 0 | 0.0 | 252.55 | 10000.0 |
| 55 | 2 | 0 | 0.0 | 290.71 | 10000.0 |
| 60 | 2 | 0 | 0.0 | 323.02 | 10000.0 |