Dot Products and Projections - The Math Behind Attention
A deep engineering dive into dot products, orthogonality, vector projection, Gram-Schmidt orthogonalization, and least squares - the mathematical heart of the transformer attention mechanism.
A deep engineering dive into dot products, orthogonality, vector projection, Gram-Schmidt orthogonalization, and least squares - the mathematical heart of the transformer attention mechanism.
A deep engineering dive into eigenvalues, eigenvectors, and eigendecomposition - the mathematics behind PCA, PageRank, spectral clustering, and graph neural networks.
A complete module map showing how vectors, matrices, eigenvalues, SVD, and tensors connect to every major ML algorithm - from attention to PCA to backpropagation.
A complete engineering reference for NumPy linear algebra - np.linalg module, solving systems, decompositions, performance tips, numerical stability, and PyTorch torch.linalg equivalents.
A deep engineering dive into linear maps, kernel (null space), image (column space), rank-nullity theorem, and change of basis - the geometry behind every neural network layer.
A deep engineering dive into matrix multiplication, transpose, inverse, rank, and determinant - and how they power attention mechanisms, backpropagation, and neural network layers.
A deep engineering dive into L1, L2, L∞, Frobenius, and nuclear norms - and how the geometry of different norms determines which model weights go to zero in Lasso vs Ridge regularization.
Engineering guide to condition numbers, ill-conditioned matrices, LU/QR/Cholesky factorizations, why you should never invert a matrix, and the numerical stability of neural network backpropagation.
A complete derivation of Principal Component Analysis from first principles - covariance matrices, eigendecomposition, explained variance, and PCA via SVD - the way sklearn actually does it.
A deep engineering dive into Singular Value Decomposition, LU, QR, and Cholesky decompositions - the mathematical tools behind recommender systems, image compression, and numerically stable ML computations.
A deep engineering dive into tensors as generalizations of matrices - shapes, axes, contractions, Einstein summation, broadcasting, and vectorization - the computational substrate of every deep learning model.
A deep engineering dive into vectors, vector spaces, norms, and inner products - the mathematical foundation behind embeddings, cosine similarity, and KNN in ML systems.