ayushtambde.com

Matrix Orthogonalization Improves Memory in Recurrent Models

at2005 · 62 points · 10 comments · 8 hours ago

Comments

phkahler · 32 minutes ago

If it can be made orthogonal, can you go a step further and diagonalize it? The storage and performance improvement from that would be huge.

BirbSingularity · 7 hours ago

I can't help but think of orthogonal frequency-division multiplexing and its use in encoding data on multiple carrier frequencies, and it makes me wonder what other parallels with digital transmission technology we will discover in cross-domain stuff like this.

harveyrook · 51 minutes ago

Now I’m wondering: what is the eigenspace of an LLM? If I take a set of LLMs with the same number of parameters, what are the eigenvectors? Do they have different personalities?

mv_d5339e31 · 4 hours ago

[deleted]