13 week Mech Interp Study Journal

Oct 28, 2023 Andrew Chen

Foreword

Mech Interp is super cool! I’m gonna set aside 13 weeks to study it. In these 13 weeks, I plan to master basic ML math, master transformers, read a couple mech interp papers in depth, and hopefully carry out some small-scaled, original mech interp exploration.

Note, this is a direct mirror of my personal log in my Obsidian vault, and some formatting might not be appropriate (I will format everything at the end).

Timeline

Target: 10/28/2023 to 1/26/2023 (end of my winter break)

Undated Day 1

Read some basic definitions of Linear Algebra Done Right.
Undated Day 2
Finished Linear Algebra Done Right Chapter 1. Re-learnt subspaces, sums of subspaces, direct sums.
Looking ahead, to solidfy my linear algebra for MI, I should master Chapter 1-3, 5-6, and 7d. More importantly, I need to spend some time everyday in finding connections of the niche linear algebra techniques are used for ML.
I found one such connection today (querying ChatGPT): Gradients should be initialized with orthonormal matrices in order to preserve the magnitude of gradients.

Day 1

Read Linear Algebra Done Right Chapter 2 up until the last definition. I think basis and theorems regarding bases drilled in the definition of linear independence and span.
Tomorrow, I just want to finish up the definition, do a few problems. Then, I want to skim some high-level material on why bases matters for machine learning; in particular, I hear a lot about “orthonormal” bases in ML, when and why do they come up?
Why bases matter for machine learning - high level view. 📅 2023-10-28 ✅ 2023-10-28

Day 2

Yesterday i assigned the “why bases matters” task, I have a high-level answer: To address the curse of dimensionality and other dimension related issues. Keywords are privileged, non-privileged bases, PCA, SVD.
- Resources: https://harrisonpim.com/blog/privileged-vs-non-privileged-bases-in-machine-learning

Day 3

Skipped

Day 4

Did 15 exercises for [[Chapter 2]]. I didn’t try too hard, I looked at the solution if I can’t solve in 5 mins.

Day 5

Skipped

Day 6

Skipped

Day 7 (11-03-2023)

Wrote MLP forward stuff.

Watched this great video by WhyML on Skip connection and residual blocks.

Review Batch Normaliztaion, Optimizers, Regularization, Gradient Clipping, Weight Constraints, Weight Normalization, Layer Normalization 📅 2023-11-04 ✅ 2023-11-09
What is Skip Connections and Residual Blocks? 📅 2023-11-03 ✅ 2023-11-04
Understand the gradients that are taken. 📅 2023-11-12 ✅ 2023-11-15

Day 8 (11-04-2023)

Studied backpropagation. I used multiple resources: The original sparse encoder article, some random article by ML-dawn, and Brilliant Wiki. But, when I revisited 3b1b’s video, that’s the one that made it all click. Tomorrow, I plan to rewatch the video and take a lot of notes.

Day 9 (11-05-2023)

Rewatched 3b1b’s video and I think all the formulae individually makes sense. Tomorrow, I’ll try to

Derive the gradients from scratch
If I get them right, give a shot at drawing derivative diagrams for a MLP
Implement backprop in MLP.

Day 10-18 (11-06-2023 to 11-14-2023)

Skipped. Not gonna lie, played a little too much minesweeper.

Day 19

Deleted my online minesweeper account. Started 2 weeks ago and sunk 35 hours into it, no good, those hours will now be redirected to mech interp.
Watched Neel Nanda’s MI and Math talk and annotated it. The slides are here
40 mins into ML street talk episode with Neel Nanda. Take aways:
- A biologist’s getting hands dirty view might be more appropriate than a mathematician’s view. A good quality of a mech interp researcher is someone who can handle a surprise.

My goals this week is to start understanding transformers well, read this paper, and get a sense of the other techniques I gotta study (dimensionality reduction, Fourier transform even?).

Idea: Analyze transformers that implement simple regexes / text manipulation patterns?

Day 20

Read the Attention blog post, it is a prior of the Illustrated Transformers guide.

Some knowledge tasks.

Understand: What is the Curse of Dimensionality? 📅 2023-11-17
Master PCA 📅 2023-11-19

Inspired by Neel’s talks yesterday, I will run a mech interp notebook this week to get a feel of the tools.

Run, completely, one mech interp notebook 📅 2023-11-17

Foreword

Timeline

Undated Day 1

Undated Day 2

Day 1

Day 2

Day 3

Day 4

Day 5

Day 6

Day 7 (11-03-2023)

Day 8 (11-04-2023)

Day 9 (11-05-2023)

Day 10-18 (11-06-2023 to 11-14-2023)

Day 19

Day 20