Neural Architecture Hub
My notes on deep learning — written to make the math click. Every page pairs derivations with an interactive visualisation so you build intuition before you formalise it.
Deep Neural Networks
7 articles
The building blocks — convolutional layers, optimisation landscapes, and positional representations that underpin vision and language models.
- Understanding Convolutional Neural Networks. A deep dive into the architecture and mathematics of CNNs — the backbone of modern computer vision.
- Dropout: Regularization by Noise. How randomly silencing neurons during training prevents overfitting — with interactive visualizations of masks, rate effects, and the inverted-dropout trick (sketched in code after this list).
- Tokens to Embeddings — Giving Numbers Meaning. How integer token IDs become dense learned vectors that carry semantic meaning — the embedding layer explained from lookup table to positional encoding.
- Understanding Optimizers in Deep Learning. A comprehensive guide to gradient descent variants and optimization algorithms used in training deep neural networks.
- Positional Embeddings in Transformers. A deep dive into absolute, relative, sine/cosine, and rotary positional embeddings — with interactive playgrounds.
- RNN, LSTM & GRU: Sequence Modeling. How recurrent networks learn from sequences — the hidden state, vanishing gradients, LSTM's memory cell, and GRU's streamlined gating — with interactive visualizations of each architecture.
- Tokenization — How Language Models Read Text. A deep dive into subword tokenization, Byte-Pair Encoding, and vocabulary lookup — the first stage of every language model.
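The inverted-dropout trick from the dropout article is compact enough to sketch here: drop units during training and rescale the survivors, so inference needs no correction. A minimal NumPy sketch (the function name and shapes are mine, for illustration only):

```python
import numpy as np

def inverted_dropout(x, rate, rng, training=True):
    # Training: zero each unit with probability `rate`, then scale the
    # survivors by 1/(1 - rate) so the expected activation is unchanged.
    # Inference: the layer is the identity -- no test-time rescaling.
    if not training or rate == 0.0:
        return x
    keep = 1.0 - rate
    mask = rng.random(x.shape) < keep  # Bernoulli keep-mask
    return x * mask / keep

rng = np.random.default_rng(0)
h = rng.standard_normal((2, 6))        # a small batch of activations
print(inverted_dropout(h, rate=0.5, rng=rng).round(2))
```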
Generative Models
5 articles
The math behind models that create — from next-token prediction to diffusion processes that synthesise images from noise.
- Autoregressive Models. How modern language models generate sequences one token at a time — from the chain rule of probability to sampling strategies and causal attention (see the sampling sketch after this list).
- Diffusion Models. How gradually adding and then reversing noise teaches a neural network to generate data — from the forward Markov chain to classifier-free guidance and latent diffusion.
- Generative Adversarial Networks. How two neural networks locked in competition learn to generate data indistinguishable from the real thing — from the minimax game to Wasserstein distance and StyleGAN.
- Normalizing Flows. How invertible neural networks learn exact probability distributions — from the change-of-variables formula to modern coupling layers and continuous flows.
- Variational Autoencoders. How VAEs learn structured latent spaces by combining neural networks with Bayesian inference — from the ELBO to the reparameterisation trick and beyond.
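The chain-rule factorisation behind the autoregressive article, p(x) = ∏ₜ p(xₜ | x₍₍ₜ₎₎), reduces generation to a loop: predict a distribution over the next token, sample, append, repeat. A minimal sketch with a toy bigram table standing in for a real network (all names and the toy model are illustrative, not any article's actual code):

```python
import numpy as np

def sample_sequence(logits_fn, vocab_size, length, rng, temperature=1.0):
    # Sample one token at a time from p(x) = prod_t p(x_t | x_<t).
    # `logits_fn` maps a prefix (list of token ids) to next-token logits.
    seq = []
    for _ in range(length):
        logits = logits_fn(seq) / temperature
        probs = np.exp(logits - logits.max())   # numerically stable softmax
        probs /= probs.sum()
        seq.append(int(rng.choice(vocab_size, p=probs)))
    return seq

# Toy "model": random bigram weights, conditioning only on the last token.
rng = np.random.default_rng(0)
V = 5
W = rng.standard_normal((V, V))
logits_fn = lambda prefix: W[prefix[-1]] if prefix else np.zeros(V)
print(sample_sequence(logits_fn, V, length=10, rng=rng))
```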
Work in progress — more topics will be added as I work through them.