Preliminaries
Linear Neural Networks for Regression
Linear Neural Networks for Classification
Multilayer Perceptrons
Builders' Guide
Convolutional Neural Networks
Modern Convolutional Neural Networks
- 1.Deep Convolutional Neural Networks (AlexNet)PTTFJAXMX
- 2.Networks Using Blocks (VGG)PTTFJAXMX
- 3.Network in Network (NiN)PTTFJAXMX
- 4.Multi-Branch Networks (GoogLeNet)PTTFJAXMX
- 5.Batch NormalizationPTTFJAXMX
- 6.Residual Networks (ResNet) and ResNeXtPTTFJAXMX
- 7.Densely Connected Networks (DenseNet)PTTFJAXMX
- 8.Designing Convolutional Network ArchitecturesPTTFJAXMX
Recurrent Neural Networks
Modern Recurrent Neural Networks
- 1.Long Short-Term Memory (LSTM)PTTFJAXMX
- 2.Gated Recurrent Units (GRU)PTTFJAXMX
- 3.Deep Recurrent Neural NetworksPTTFJAXMX
- 4.Bidirectional Recurrent Neural NetworksPTTFJAXMX
- 5.Machine Translation and the DatasetPTTFJAXMX
- 6.The Encoder--Decoder ArchitecturePTTFJAXMX
- 7.Sequence-to-Sequence Learning for Machine TranslationPTTFJAXMX
Attention Mechanisms and Transformers
- 1.Queries, Keys, and ValuesPTTFJAXMX
- 2.Attention Pooling by SimilarityPTTFJAXMX
- 3.Attention Scoring FunctionsPTTFJAXMX
- 4.The Bahdanau Attention MechanismPTTFJAXMX
- 5.Multi-Head AttentionPTTFJAXMX
- 6.Self-Attention and Positional EncodingPTTFJAXMX
- 7.The Transformer ArchitecturePTTFJAXMX
- 8.Transformers for VisionPTTFJAXMX
Optimization Algorithms
- 1.Optimization and Deep LearningPTTFJAXMX
- 2.ConvexityPTTFJAXMX
- 3.Gradient DescentPTTFJAXMX
- 4.Stochastic Gradient DescentPTTFJAXMX
- 5.Minibatch Stochastic Gradient DescentPTTFJAXMX
- 6.MomentumPTTFJAXMX
- 7.AdagradPTTFJAXMX
- 8.RMSPropPTTFJAXMX
- 9.AdadeltaPTTFJAXMX
- 10.AdamPTTFJAXMX
- 11.Learning Rate SchedulingPTTFJAXMX
Computational Performance
Computer Vision
- 1.Image AugmentationPTTFJAXMX
- 2.Fine-TuningPTTFJAXMX
- 3.Object Detection and Bounding BoxesPTTFJAXMX
- 4.Anchor BoxesPTTFJAXMX
- 5.Multiscale Object DetectionPTTFJAXMX
- 6.The Object Detection DatasetPTTFJAXMX
- 7.Single Shot Multibox DetectionPTTFJAXMX
- 8.Region-based CNNs (R-CNNs)PTTFJAXMX
- 9.Semantic Segmentation and the DatasetPTTFJAXMX
- 10.Transposed ConvolutionPTTFJAXMX
- 11.Fully Convolutional NetworksPTTFJAXMX
- 12.Neural Style TransferPTTFJAXMX
- 13.Image Classification (CIFAR-10) on KagglePTTFJAXMX
- 14.Dog Breed Identification (ImageNet Dogs) on KagglePTTFJAXMX
Natural Language Processing: Pretraining
- 1.The Dataset for Pretraining Word EmbeddingsPTTFJAXMX
- 2.Pretraining word2vecPTTFJAXMX
- 3.Subword EmbeddingPTTFJAXMX
- 4.Word Similarity and AnalogyPTTFJAXMX
- 5.Bidirectional Encoder Representations from Transformers (BERT)PTTFJAXMX
- 6.The Dataset for Pretraining BERTPTTFJAXMX
- 7.Pretraining BERTPTTFJAXMX
Natural Language Processing: Applications
- 1.Sentiment Analysis and the DatasetPTTFJAXMX
- 2.Sentiment Analysis: Using Recurrent Neural NetworksPTTFJAXMX
- 3.Sentiment Analysis: Using Convolutional Neural NetworksPTTFJAXMX
- 4.Natural Language Inference and the DatasetPTTFJAXMX
- 5.Natural Language Inference: Using AttentionPTTFJAXMX
- 6.Natural Language Inference: Fine-Tuning BERTPTTFJAXMX
Reinforcement Learning
Gaussian Processes
Hyperparameter Optimization
Generative Adversarial Networks
Recommender Systems
- 1.The MovieLens DatasetPTTFJAXMX
- 2.Matrix FactorizationPTTFJAXMX
- 3.AutoRec: Rating Prediction with AutoencodersPTTFJAXMX
- 4.Personalized Ranking for Recommender SystemsPTTFJAXMX
- 5.Neural Collaborative Filtering for Personalized RankingPTTFJAXMX
- 6.Sequence-Aware Recommender SystemsPTTFJAXMX
- 7.Feature-Rich Recommender SystemsPTTFJAXMX
- 8.Factorization MachinesPTTFJAXMX
- 9.Deep Factorization MachinesPTTFJAXMX
Appendix: Mathematics for Deep Learning
- 1.Geometry and Linear Algebraic OperationsPTTFJAXMX
- 2.EigendecompositionsPTTFJAXMX
- 3.Single Variable CalculusPTTFJAXMX
- 4.Multivariable CalculusPTTFJAXMX
- 5.Integral CalculusPTTFJAXMX
- 6.Random VariablesPTTFJAXMX
- 7.Maximum LikelihoodPTTFJAXMX
- 8.DistributionsPTTFJAXMX
- 9.Naive BayesPTTFJAXMX
- 10.StatisticsPTTFJAXMX
- 11.Information TheoryPTTFJAXMX