Advanced Deep Learning

This site is dedicated to the simplest video tutorials on Advance Topics of Deep Learning. I have used my knowledge and experience to prepare these tutorials. All feedback and suggestions are welcome (email me at nirajrkumar@gmail.com or nirajrkumar@yahoo.com).

As, scientific development is an endless process, so I will keep updating it. Clicking on the link will drive you to the YouTube page for related content. Or You can use the link: https://www.youtube.com/c/DrNirajRKumar

Basics of Transformer Architecture and Transformer-to-RNN/T2RNN.

Content:

Part-1
Explains the Transformer Architecture in Details
Positional Encoding
Multi-Head Attention
Part-2.
Explains the Advancements in Transformer Architecture (T2RNN)

Video Link:

Part-1: Transformer Architecture in Details (Direct Link: https://www.youtube.com/watch?v=JMeYGYANEqU)
Part-2: Transformer to RNN (T2RNN) (Direct Link: https://www.youtube.com/watch?v=UHgy2faOD_M&t=417s)

2. How Transformers Support Autoregressive Language Model based LLMs

Content:

Detailed mathematical details about Autoregressive language models and Transformer architecture.

Video Link: https://www.youtube.com/watch?v=KNoW9E-TDU8&t=801s

3. XLNet.

Content:

Part-1
BERT Vs XLNet,
Overview of XLNet,
Autoregressive Language Modeling
Part-2
Permutation Language Modeling for XLNet,
Merits and Demerits of Permutation Language Modeling
Part-3
Masked Attention for XLNet,
Two Stream Self Attention for XLNet,
Final Working Overview of XLNet

Video Link:

XLNet Made Easy Part-1 (Direct Link: https://www.youtube.com/watch?v=1yPT-aAD_a0&t=104s)
XLNet Made Easy Part-2 (Direct Link: https://www.youtube.com/watch?v=HnlVO5n3mtY&t=2s)
XLNet Made Easy Part-3 (Direct Link: https://www.youtube.com/watch?v=o7zbeGb2nZQ&t=29s)

4. Scalability of the Transformer Architecture.

Content:

Part-1 Contains.
Paper: “Transformer Quality in Linear Time”
Gated Linear Unit
Gated Attention Unit
Mixed Chunk Attention
Relative Position Bias
Squared RELU.

Video Link:

Deep-Learning: How to improve the Scalability of The Transformer Architecture Part-1 (Direct Link: https://www.youtube.com/watch?v=eQgwHrtCI_s&t=1766s)

5. Transfer Learning.

Content:

Part-1:
Overview of Transfer Learning
Different types of Transfer Learning
Part-2:
Multi-Task Learning with sample code.

Video Link:

Transfer Learning Part-1 (Direct Link: https://www.youtube.com/watch?v=IS22-AoinGQ)
Transfer Learning Part-2 (Multi-Task Learning). (Direct Link: https://www.youtube.com/watch?v=lFKbZ66KjPs)

6. Multivariate Time Series Forecasting Using Deep Learning.

Content:

Part-1:
Different Types of Multivariate Time Series Forecasting Strategies.
Multivariate Multi-Step Multi-Output Time series Forecasting
- Strategy to prepare dataset.
- How to write code?
Part-2:
Multivariate Single-Step Multi-Output Time series Forecasting
- Strategy to prepare dataset.
- How to write code?
Strategy for the Future Enhancements.

Video Link:

Multivariate Time Series Forecasting Using Deep Learning [Part-1] (Direct Link: https://youtu.be/xaQpLz6QkVQ)
Multivariate Time Series Forecasting Using Deep Learning [Part-2]. (Direct Link: https://youtu.be/DLzaG4SW4pM)

7. Deep Clustering (A Self-Supervised Deep Learning Algorithm).

Content:

Part-1:
Basics of Self-Supervised Algorithm
Basics of Deep Clustering
Part-2:
Details of Deep Clustering
Details of Cost Functions used in the Deep Clustering Algorithms.

Video Link:

Deep Clustering- Part-1 (A Self-Supervised Deep Learning Algorithm) (Direct Link: https://youtu.be/j9KmEpaLers)
Deep Clustering- Part-2 (A Self-Supervised Deep Learning Algorithm). (Direct Link: https://youtu.be/Ca0r0ZbeHxM)

8. Forced/Guided Learning in Deep Learning.

Content:

Part-1:
Teacher Forcing
Exposure Bias
Part-2:
Scheduled Sampling
Exposure Bias.

Video Link:

Forced/Guided Learning in Deep Learning Part-1 (Direct Link: https://youtu.be/FsidD3Tb1as)
Forced/Guided Learning in Deep Learning Part-2. (Direct Link: https://youtu.be/Gz_TxwqppKg)

9. Internal Covariate Shift.

Content:

Part-1:
Basics of Internal Covariate Shift
Basics of Network Whitening
Requirement of Normalization Techniques – e.g. Batch Normalization.
Part-2:
Batch Normalization
Differentiability of ‘Batch Normalization’
Discussion on Merits and Demerits of ‘Batch Normalization’

Video Link:

Internal Covariate Shift – Part-1 (with Batch Normalization) (Direct Link: https://www.youtube.com/watch?v=VSM9ZXXS0BQ)
Internal Covariate Shift and Batch Normalization– Part-2. (Direct Link: https://www.youtube.com/watch?v=nbDHgsyhkio)

10. MAMBA Explained: The Next Gen Sequence Model for Deep Learning State Space Gates and More.

Contains:
- What is MAMBA? - Understand the motivation and theory behind Maximum-Memory Attention with Multiplicative Bias Architecture.
- Core Building Blocks: - See how Causal 1D Convolution (local mixing), State Evolution (SSM core), and Multiplicative Bias (gating) fit together.
- Deep Learning Architecture: Explore how MAMBA layers are stacked to create powerful, scalable models for text, time series, and beyond.
- Complete Example: Follow a step-by-step walk-through with a small matrix example, seeing how each input is transformed as it passes through the model.
Video Link-1: MAMBA Explained Part-1: The Next Gen Sequence Model for Deep Learning State Space Gates and More
Video Link-2: MAMBA Explained Part-2: The Next-Gen Sequence Model for Deep Learning—State Space, Gates & More

Page updated

Google Sites

Report abuse

Advanced Deep Learning

Basics of Transformer Architecture and Transformer-to-RNN/T2RNN.

Content:

Video Link:

2. How Transformers Support Autoregressive Language Model based LLMs

Content:

Video Link: https://www.youtube.com/watch?v=KNoW9E-TDU8&t=801s

3. XLNet.

Content:

Video Link:

4. Scalability of the Transformer Architecture.

Content:

Video Link:

5. Transfer Learning.

Content:

Video Link:

6. Multivariate Time Series Forecasting Using Deep Learning.

Content:

Video Link:

7. Deep Clustering (A Self-Supervised Deep Learning Algorithm).

Content:

Video Link:

8. Forced/Guided Learning in Deep Learning.

Content:

Video Link:

9. Internal Covariate Shift.

Content:

Video Link:

10. MAMBA Explained: The Next Gen Sequence Model for Deep Learning State Space Gates and More.