Generative AI Part 2

Introduces the theoretical foundations and advanced concepts of neural networks, generative models, transformers, and large language models. Students will explore how these AI systems create new data, process information, and learn through feedback, while analyzing their applications across various fields. The course emphasizes key principles in model building, optimization, and real-world generative AI use cases.

Recommended experience
Intermediate level
Understanding of linear algebra.
Details to know
April 2026
33 assignments
There are 7 modules in this course
In this module, you will explore Transformer-based models in natural language processing. You will study pretraining approaches such as BERT and GPT, the mathematics of pretraining word embeddings, and various optimization and scaling strategies critical to effective language modeling.
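The pretraining approaches surveyed in this module all reduce to cross-entropy over tokens. As a minimal illustration (not course code), here is the next-token prediction loss that GPT-style language-model pretraining minimizes, sketched in numpy with an invented toy vocabulary:

```python
import numpy as np

def next_token_loss(logits, targets):
    """Average cross-entropy of predicting each next token.

    logits:  (T, V) unnormalized scores, one row per position.
    targets: (T,)   index of the true next token at each position.
    """
    # log-softmax with the usual max-subtraction for numerical stability
    shifted = logits - logits.max(axis=1, keepdims=True)
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    return -log_probs[np.arange(len(targets)), targets].mean()

# Toy example: 3 positions, vocabulary of 4 tokens (values are made up).
logits = np.array([[2.0, 0.1, 0.1, 0.1],
                   [0.1, 3.0, 0.1, 0.1],
                   [0.1, 0.1, 0.1, 2.5]])
targets = np.array([0, 1, 3])
loss = next_token_loss(logits, targets)
```

BERT-style masked-language-model pretraining uses the same cross-entropy, applied only at masked positions instead of every position.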
What's included
5 videos•20 readings•3 assignments
5 videos•Total 28 minutes
- Pre-Training•4 minutes
- BERT & Tuning•9 minutes
- GPT and RAG•5 minutes
- Prompt Engineering•6 minutes
- Scaling Law & Transfer Learning•4 minutes
20 readings•Total 217 minutes
- Course Introduction•1 minute
- Meet Your Faculty•1 minute
- Syllabus - Generative AI Part 2•10 minutes
- Recommended Prior Knowledge•100 minutes
- Academic Integrity•1 minute
- Module Overview•2 minutes
- Transformers for NLP•8 minutes
- Pre-Trained Word Embeddings•3 minutes
- Pre-Training Whole Models•3 minutes
- Reconstructing the Input•3 minutes
- Pre-Training Through Language Modeling•8 minutes
- Fine-Tuning BERT•15 minutes
- Fine-Tuning In-Depth•15 minutes
- Pre-Training Decoders•5 minutes
- Generative Pretrained Transformer•10 minutes
- Scaling Laws•8 minutes
- Scaling Efficiency•7 minutes
- Pre-Training Encoder/Decoders•7 minutes
- Span Corruption•7 minutes
- Module Wrap-Up•3 minutes
3 assignments•Total 9 minutes
- Module 8- Assess Your Learning 1•3 minutes
- Module 8- Assess Your Learning 2•3 minutes
- Module 8- Assess Your Learning 3•3 minutes
This module investigates deep latent variable models, focusing on variational autoencoders (VAEs) and related probabilistic methods. You will analyze the mathematics behind sampling strategies, evidence lower bound (ELBO), variational inference, reparameterization tricks, and amortized inference, developing an advanced toolkit for probabilistic generative modeling.
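Two of the building blocks named above can be sketched in a few lines; this is a hedged numpy illustration (toy values, not course code) of the analytic KL term that appears in the ELBO for a diagonal-Gaussian posterior against a standard-normal prior, and of the reparameterization z = mu + sigma * eps that makes sampling differentiable:

```python
import numpy as np

rng = np.random.default_rng(0)

def kl_diag_gaussian(mu, log_var):
    """KL( N(mu, diag(exp(log_var))) || N(0, I) ), the analytic KL term of the ELBO."""
    return 0.5 * np.sum(np.exp(log_var) + mu**2 - 1.0 - log_var)

def reparameterize(mu, log_var, rng):
    """z = mu + sigma * eps with eps ~ N(0, I); gradients can flow through mu and sigma."""
    eps = rng.standard_normal(mu.shape)
    return mu + np.exp(0.5 * log_var) * eps

# Toy posterior parameters for a 2-dimensional latent.
mu = np.array([0.5, -0.3])
log_var = np.array([0.0, -1.0])
kl = kl_diag_gaussian(mu, log_var)
z = reparameterize(mu, log_var, rng)
```

The full ELBO adds an expected reconstruction term (estimated from samples of z) to the negative of this KL; the KL vanishes exactly when the posterior equals the prior.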
What's included
6 videos•14 readings•3 assignments
6 videos•Total 54 minutes
- Probability, Density, Mass Function•10 minutes
- VAE Introduction•8 minutes
- Sampling & Monte Carlo Optimization•11 minutes
- Evidence Lower Bound (ELBO) Part 1•8 minutes
- Evidence Lower Bound (ELBO) Part 2•5 minutes
- Variational Autoencoders in Depth•12 minutes
14 readings•Total 112 minutes
- Module Overview•2 minutes
- Deep Latent Variable Models•8 minutes
- Mixture of Gaussians•10 minutes
- Variational Autoencoder (VAE)•10 minutes
- Discrete and Continuous Space•8 minutes
- Naïve Monte Carlo•5 minutes
- Importance Sampling•8 minutes
- ELBO Deep Dive•8 minutes
- Return to Variational Autoencoders•15 minutes
- Variational Approximation•10 minutes
- Variational Autoencoder Continued•10 minutes
- Reparameterization Trick•10 minutes
- Amortization in VAE•5 minutes
- Module Wrap-Up•3 minutes
3 assignments•Total 9 minutes
- Module 9- Assess Your Learning 1•3 minutes
- Module 9- Assess Your Learning 2•3 minutes
- Module 9- Assess Your Learning 3•3 minutes
In this module, you'll explore normalizing flows as precise tools for modeling complex probability distributions through invertible neural networks. You'll examine the mathematical underpinnings, including determinants, geometry, invertibility constraints, and specific flow architectures such as Real-NVP and autoregressive models. You'll also investigate practical applications and the synthesis of complex densities using normalizing flows.
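The change-of-variables formula at the heart of the module can be checked in one dimension. As a minimal sketch (the affine flow and its parameters are invented for illustration), the density of x = a·z + b under a standard-normal base is recovered from log p_x(x) = log p_z(z) + log|dz/dx|, and it matches the known N(b, a²) density:

```python
import numpy as np

def base_log_prob(z):
    """Standard normal log-density, the base distribution of the flow."""
    return -0.5 * z**2 - 0.5 * np.log(2 * np.pi)

def affine_flow_log_prob(x, a, b):
    """log p_x(x) for x = a*z + b via change of variables:
    log p_x(x) = log p_z(z) + log|dz/dx|, with z = (x - b)/a and dz/dx = 1/a."""
    z = (x - b) / a
    return base_log_prob(z) + np.log(abs(1.0 / a))

# The flow x = 2z + 1 pushes N(0, 1) to N(1, 4); compare against that density directly.
x = 3.0
lp_flow = affine_flow_log_prob(x, a=2.0, b=1.0)
lp_true = -0.5 * ((x - 1.0) / 2.0) ** 2 - np.log(2.0) - 0.5 * np.log(2 * np.pi)
```

Deep flows such as Real-NVP compose many such invertible maps, so the scalar |dz/dx| becomes a Jacobian determinant that the architecture keeps cheap to evaluate (e.g., triangular).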
What's included
8 videos•25 readings•4 assignments
8 videos•Total 33 minutes
- Normalizing Flow Part 1•4 minutes
- 1D Introduction•4 minutes
- Change of Variables Explained•3 minutes
- Introduction to Forward and Inverse Mapping•4 minutes
- 2D Example: Deep Neural Network•4 minutes
- Linear Flows•6 minutes
- Elementwise & Other Types of Flows•7 minutes
- Summary of Normalizing Flows•1 minute
25 readings•Total 124 minutes
- Module Overview•2 minutes
- Introduction to Normalizing Flow•10 minutes
- 1D Normalizing Flow•2 minutes
- Measuring Probability•12 minutes
- Change of Variables Formula•5 minutes
- Geometry Info•5 minutes
- Determinants and Volumes•2 minutes
- Forward and Inverse Mapping•2 minutes
- Learning•1 minute
- General Use Case•12 minutes
- Forward Mapping With a Deep Neural Network•5 minutes
- Training Objective for Normalizing Flows•5 minutes
- Flow Model Requirements•3 minutes
- Triangular Jacobian•1 minute
- Overview and Linear Flows•3 minutes
- Elementwise Flows•5 minutes
- Coupling Flows•5 minutes
- Introduction to NICE•5 minutes
- Real-NVP: Non-Volume Preserving Extension of NICE•7 minutes
- Interpolation in Latent Space With Real-NVP•3 minutes
- Autoregressive Flows•3 minutes
- Continuous Autoregressive Models as Flow Models•5 minutes
- Inverse Autoregressive Flows•8 minutes
- Applications of Normalizing Flows•10 minutes
- Module Wrap-Up•3 minutes
4 assignments•Total 12 minutes
- Module 10- Assess Your Learning 1•3 minutes
- Module 10- Assess Your Learning 2•3 minutes
- Module 10- Assess Your Learning 3•3 minutes
- Module 10- Assess Your Learning 4•3 minutes
This module provides a deep exploration of Generative Adversarial Networks (GANs), focusing on their formulation as likelihood-free generative models. You'll analyze GAN training dynamics, including optimization challenges, mode collapse, and divergence minimization strategies. The module also covers advanced GAN variants such as f-GAN and Wasserstein GAN (WGAN).
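The adversarial objective discussed above can be written down concretely. This is a hedged numpy sketch (the logit values are made up) of the discriminator's binary cross-entropy loss and the non-saturating generator loss commonly used in place of the raw minimax objective:

```python
import numpy as np

def sigmoid(t):
    return 1.0 / (1.0 + np.exp(-t))

def discriminator_loss(d_real_logits, d_fake_logits):
    """-E[log D(x)] - E[log(1 - D(G(z)))]: push real scores up, fake scores down."""
    return (-np.log(sigmoid(d_real_logits)).mean()
            - np.log(1.0 - sigmoid(d_fake_logits)).mean())

def generator_loss_nonsaturating(d_fake_logits):
    """-E[log D(G(z))], the non-saturating variant that keeps gradients alive
    early in training when the discriminator easily rejects generated samples."""
    return -np.log(sigmoid(d_fake_logits)).mean()

# Made-up logits: the discriminator currently favors real samples over fakes.
d_real = np.array([2.0, 1.5, 3.0])
d_fake = np.array([-1.0, -2.0, -0.5])
d_loss = discriminator_loss(d_real, d_fake)
g_loss = generator_loss_nonsaturating(d_fake)
```

f-GAN and WGAN replace this binary cross-entropy with variational bounds on other f-divergences and with the Wasserstein distance, respectively, which is where the training-stability analysis in this module leads.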
What's included
29 readings•5 assignments
29 readings•Total 121 minutes
- Module Overview•2 minutes
- Refresher•5 minutes
- Towards Likelihood-Free Learning•6 minutes
- Likelihood-Free Learning•5 minutes
- Generative Modeling and Two-Sample Tests•3 minutes
- Discrimination as a Signal•4 minutes
- Overview•4 minutes
- Generator vs. Discriminator Diagram•5 minutes
- Training Objective for Discriminator•4 minutes
- Interpretation•5 minutes
- Loss Functions•5 minutes
- Training Algorithm•3 minutes
- Key Observations•2 minutes
- Alternating Optimization in GANs•2 minutes
- Examples•4 minutes
- Introduction•1 minute
- Optimization Challenges in GANs•2 minutes
- Mode Collapse•5 minutes
- Beyond KL and Jensen-Shannon Divergence•2 minutes
- f-divergences•2 minutes
- What is Lower Semicontinuity?•4 minutes
- Examples of f-divergences and Training•5 minutes
- Toward Variational Divergence Minimization•10 minutes
- f-GAN Variational Divergence Minimization•5 minutes
- Wasserstein (Earth Mover) Distance•5 minutes
- Discrete Distributions•8 minutes
- Wasserstein Distance for Continuous Distributions•5 minutes
- Inferring Latent Representations in GANs•5 minutes
- Module Wrap-Up•3 minutes
5 assignments•Total 15 minutes
- Module 11- Assess Your Learning 1•3 minutes
- Module 11- Assess Your Learning 2•3 minutes
- Module 11- Assess Your Learning 3•3 minutes
- Module 11- Assess Your Learning 4•3 minutes
- Module 11- Assess Your Learning 5•3 minutes
In this module, you will explore energy-based generative models and score-based modeling frameworks from a mathematical and implementation perspective. You'll dive deeply into the details of training via score functions, contrastive divergence, and various forms of score matching including denoising techniques, highlighting their theoretical and practical implications.
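The score function central to this module is the gradient of the log-density. As a minimal sketch (toy values, not course code), here is the closed-form score of a Gaussian, and the regression target that denoising score matching trains a network toward after perturbing clean data with Gaussian noise:

```python
import numpy as np

def gaussian_score(x, mu, sigma):
    """Score of N(mu, sigma^2): grad_x log p(x) = -(x - mu) / sigma^2."""
    return -(x - mu) / sigma**2

def dsm_targets(x_clean, sigma, rng):
    """Denoising score matching: perturb x with Gaussian noise and regress the
    score network toward grad log q(x_noisy | x_clean) = -(x_noisy - x_clean) / sigma^2."""
    noise = sigma * rng.standard_normal(x_clean.shape)
    x_noisy = x_clean + noise
    target = -(x_noisy - x_clean) / sigma**2
    return x_noisy, target

rng = np.random.default_rng(0)
x_noisy, target = dsm_targets(np.zeros(5), sigma=0.1, rng=rng)
```

Unlike contrastive divergence, this objective never needs the intractable normalization constant: the score of an energy-based model exp(-E(x))/Z is just -grad E(x), because Z drops out of the gradient of the log.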
What's included
34 readings•5 assignments
34 readings•Total 175 minutes
- Module Overview•2 minutes
- Background•3 minutes
- Parameterizing Probability Distribution: Definition•3 minutes
- Parameterizing Probability Distributions: Solution•7 minutes
- Energy-Based Models•5 minutes
- Pros and Cons of Energy-Based Models•2 minutes
- Examples•5 minutes
- Examples Continued•5 minutes
- Computing the Normalization Constant•5 minutes
- Introduction•2 minutes
- Contrastive Divergence Algorithm•8 minutes
- Sampling in Energy-Based Models•5 minutes
- Score Function•8 minutes
- Score Matching•8 minutes
- Score-Based Models Introduction•2 minutes
- Background•3 minutes
- Denoising Score Matching Part 1: Introduction•6 minutes
- Denoising Score Matching Part 2: Defining the Objective•4 minutes
- Denoising Score Matching Part 3: Gradient Expansion•8 minutes
- Gradient Derivation•6 minutes
- Intuition•5 minutes
- Why Denoising Works in Score Matching•3 minutes
- Comparison Between NSM and DSM•2 minutes
- Tweedie Formula•4 minutes
- Overview of Sliced Score Matching (SSM)•8 minutes
- Data Generation with Score-Based Models•8 minutes
- Pitfalls With Score-Based Models•8 minutes
- Solution to Pitfalls•8 minutes
- Introduction to NCSBM•5 minutes
- Annealed Langevin Dynamics•8 minutes
- Training Noise Conditional Score Networks•3 minutes
- Choosing Noise Scales•5 minutes
- Choosing the Weighting Function•8 minutes
- Module Wrap-Up•3 minutes
5 assignments•Total 15 minutes
- Module 12- Assess Your Learning 1•3 minutes
- Module 12- Assess Your Learning 2•3 minutes
- Module 12- Assess Your Learning 3•3 minutes
- Module 12- Assess Your Learning 4•3 minutes
- Module 12- Assess Your Learning 5•3 minutes
You'll delve deeply into diffusion models, understanding them mathematically as stochastic processes and connecting them explicitly to score-based models. The module examines forward and reverse diffusion processes, training objectives, SDEs, predictor-corrector methods, and latent diffusion architectures, providing robust foundations for modern generative modeling.
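The forward diffusion process admits a closed-form kernel, which is what makes training tractable. This is a hedged sketch (the linear variance schedule is a common illustrative choice, not necessarily the one used in the course) of sampling x_t directly from q(x_t | x_0) = N(sqrt(ᾱ_t)·x_0, (1 − ᾱ_t)·I):

```python
import numpy as np

rng = np.random.default_rng(0)

# Linear variance schedule beta_1..beta_T (values are an illustrative choice).
T = 1000
betas = np.linspace(1e-4, 2e-2, T)
alphas = 1.0 - betas
alpha_bar = np.cumprod(alphas)  # abar_t = prod_{s<=t} (1 - beta_s)

def q_sample(x0, t, rng):
    """Sample x_t ~ q(x_t | x_0) = N(sqrt(abar_t) x_0, (1 - abar_t) I) in one shot,
    without simulating the t intermediate noising steps."""
    eps = rng.standard_normal(x0.shape)
    return np.sqrt(alpha_bar[t]) * x0 + np.sqrt(1.0 - alpha_bar[t]) * eps

x0 = np.ones(4)
x_early = q_sample(x0, t=0, rng=rng)     # nearly the clean signal
x_late = q_sample(x0, t=T - 1, rng=rng)  # nearly pure noise
```

Because ᾱ_t decays toward zero, x_T is approximately standard normal; the reverse (decoder) process learned in this module denoises step by step from that prior, and taking the number of noise levels to infinity yields the SDE view covered later.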
What's included
41 readings•6 assignments
41 readings•Total 201 minutes
- Module Overview•2 minutes
- Introduction•4 minutes
- Model Families Continued•8 minutes
- Definition•2 minutes
- Diffusion Process•5 minutes
- Distribution of Each Term•5 minutes
- Diffusion Kernel•15 minutes
- Marginal Distributions•4 minutes
- Conditional Distribution•7 minutes
- Backward Diffusion Process-Decoder•15 minutes
- Encoder / Decoder•4 minutes
- Loss Function•2 minutes
- Gaussian Distribution and Its Mean•2 minutes
- Diffusion Models as Score-Based Models•4 minutes
- Decoder Parameterization•8 minutes
- Loss Function•5 minutes
- Training and Inference•5 minutes
- U-Net Architecture•7 minutes
- Infinite Noise Levels Score-Based Modeling•5 minutes
- Perturbing Data With Stochastic Processes•3 minutes
- Stochastic Differential Equations (SDEs)•5 minutes
- Types of SDEs and Noise Evolution•5 minutes
- Reverse Stochastic Process•7 minutes
- Role of the Score Function•3 minutes
- Time-Dependent Score-Based Model•2 minutes
- Training Objective•2 minutes
- Reverse-Time SDE•3 minutes
- Euler-Maruyama Approximation and Summary•5 minutes
- Where Does the Time Step Come From?•5 minutes
- Step-by-Step Sampling: Euler-Maruyama Method•5 minutes
- Predictor-Corrector Sampling Methods•5 minutes
- Combined Predictor-Corrector Sampling•3 minutes
- Probability Flow ODE•5 minutes
- Likelihood Computation•6 minutes
- Practical Considerations and Conclusion•5 minutes
- Intro to Latent Diffusion Models•2 minutes
- Conditional Generation•5 minutes
- Improving Image Quality•3 minutes
- Control the Generation Process•7 minutes
- Examples•3 minutes
- Module Wrap-Up•3 minutes
6 assignments•Total 18 minutes
- Module 13- Assess Your Learning 1•3 minutes
- Module 13- Assess Your Learning 2•3 minutes
- Module 13- Assess Your Learning 3•3 minutes
- Module 13- Assess Your Learning 4•3 minutes
- Module 13- Assess Your Learning 5•3 minutes
- Module 13- Assess Your Learning 6•3 minutes
In this module, you'll study annealed importance sampling (AIS) methods for estimating complex probability distributions, with rigorous mathematical treatment. You will analyze the AIS procedure step by step, including intermediate distributions and normalization constants, and apply these techniques to probabilistic models. To wrap up the course, you will also assess the evolution of generative models and how they are evaluated.
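The AIS procedure can be demonstrated end to end on a case with a known answer. This is a minimal sketch (the target, proposal, and Metropolis step size are all invented for illustration) that anneals from N(0, 1) to an unnormalized N(0, 4) through a geometric bridge of intermediate distributions, estimating the log-ratio of normalizing constants, whose true value is log 2:

```python
import numpy as np

rng = np.random.default_rng(0)

# Unnormalized target: N(0, SIGMA^2) without its constant; proposal: N(0, 1).
SIGMA = 2.0
def log_f_target(x):   return -0.5 * x**2 / SIGMA**2
def log_f_proposal(x): return -0.5 * x**2

def ais_log_ratio(n_particles=4000, n_temps=20, rng=rng):
    """Estimate log(Z_target / Z_proposal) with annealed importance sampling:
    geometric bridge f_t = f_prop^(1-b) * f_targ^b, one MH step per temperature."""
    betas = np.linspace(0.0, 1.0, n_temps + 1)
    x = rng.standard_normal(n_particles)  # exact draws from the proposal (beta = 0)
    log_w = np.zeros(n_particles)
    for b0, b1 in zip(betas[:-1], betas[1:]):
        # importance-weight increment between adjacent bridge distributions
        log_w += (b1 - b0) * (log_f_target(x) - log_f_proposal(x))
        # one random-walk Metropolis step targeting the bridge at b1
        def log_bridge(y):
            return (1 - b1) * log_f_proposal(y) + b1 * log_f_target(y)
        prop = x + 0.5 * rng.standard_normal(n_particles)
        accept = np.log(rng.random(n_particles)) < log_bridge(prop) - log_bridge(x)
        x = np.where(accept, prop, x)
    # log-mean-exp of the weights, stabilized by subtracting the max
    m = log_w.max()
    return m + np.log(np.exp(log_w - m).mean())

est = ais_log_ratio()
# True ratio: Z_target / Z_prop = SIGMA, so the log-ratio is log 2 ~ 0.693.
```

Because both endpoints are Gaussian, the annealed weights stay well behaved and the estimate lands close to log 2; plain importance sampling (a single temperature) degrades much faster as the two distributions move apart.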
What's included
40 readings•7 assignments
40 readings•Total 136 minutes
- Module Overview•2 minutes
- Overview of AIS•5 minutes
- Example: AIS With a Gaussian Distribution•3 minutes
- Intermediate Step (t = 1)•5 minutes
- Intermediate Step (t = 2)•5 minutes
- Final Steps (t = 8)•5 minutes
- Setup•5 minutes
- Step-by-Step Solution for t = 1•5 minutes
- Applications and Takeaways•2 minutes
- Normalization of Probability Density Functions•2 minutes
- Examples of Normalizing Constants•2 minutes
- Steps to Normalize p(z)•3 minutes
- Wrapping Up Probability Distributions•2 minutes
- Model Family Recap•5 minutes
- Model Families Continued•5 minutes
- Distances of Probability Distributions•5 minutes
- Evaluating Generative Models•1 minute
- What is the Task That You Care About?•1 minute
- Evaluation•7 minutes
- Kernel Density Estimation (KDE)•7 minutes
- Latent Variables & Sample Quality•5 minutes
- HYPE: Human Eye Perceptual Evaluation•3 minutes
- Inception Scores•3 minutes
- Sharpness•3 minutes
- Diversity•2 minutes
- Inception Scores Finalized•2 minutes
- Relationship Between Inception Score and KL Divergence•7 minutes
- Fréchet Inception Distance (FID)•2 minutes
- Kernel Inception Distance (KID)•2 minutes
- FID vs. KID•1 minute
- Evaluating Sample Quality for Text-to-Image Models•5 minutes
- Evaluating Latent Representations•1 minute
- Clustering•3 minutes
- Lossy Compression or Reconstruction•1 minute
- Disentanglement•3 minutes
- Beta-VAE•3 minutes
- Solving Tasks Through Prompting•4 minutes
- Holistic Evaluation of Language Models (HELM)•5 minutes
- Module Wrap-Up•3 minutes
- Congratulations!•1 minute
7 assignments•Total 21 minutes
- Module 14- Assess Your Learning 1•3 minutes
- Module 14- Assess Your Learning 2•3 minutes
- Module 14- Assess Your Learning 3•3 minutes
- Module 14- Assess Your Learning 4•3 minutes
- Module 14- Assess Your Learning 5•3 minutes
- Module 14- Assess Your Learning 6•3 minutes
- Module 14- Assess Your Learning 7•3 minutes
Instructor

Offered by
Founded in 1898, Northeastern is a global research university with a distinctive, experience-driven approach to education and discovery. The university is a leader in experiential learning, powered by the world’s most far-reaching cooperative education program. The spirit of collaboration guides a use-inspired research enterprise focused on solving global challenges in health, security, and sustainability.
Frequently asked questions
To access the course materials and assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in the course. You can try a Free Trial instead, or apply for Financial Aid. The course may also offer a 'Full Course, No Certificate' option, which lets you see all course materials, submit required assessments, and get a final grade, but means you will not be able to purchase a Certificate experience.
When you purchase a Certificate you get access to all course materials, including graded assignments. Upon completing the course, your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Yes. In select learning programs, you can apply for financial aid or a scholarship if you can't afford the enrollment fee. If financial aid or a scholarship is available for your learning program selection, you'll find a link to apply on the description page.