

\( \text{Agenda of this Lecture:}\)
- Where does data come from?
- What is the structure of a data? What is a distribution?
- Types of distributions
- Approximating a distribution
- Why Gaussian Distribution is ubiquitous?
- Bayes rule and Marginalization
- Images as data points
- Interpolation for data generation
- The power of Sampling
- The Gaussian Distribution
- Properties of Gaussian Distribution
- Variational AutoEncoders
- Quick intro to Diffusion Models

Let's say you are given a bunch of data points:




Neural Networks have two components:
- Feature Extractor Module
- Task specific head
You can experiment with simple neural networks at Tensorflow Playground
Usually extracted features are of
lower dimension than data (x)




A simple example of a Neural Network


but, what does it mean when two images are closer to each other?




Let's do a small experiment...

We have very powerful discriminator models:
- E.g., Image classification models
What about generative models?
Given a label (e.g., "cat"), can we
generate a data point (image)?

Line Fit

Where does the data come from?

Where does the data come from?
Random Experiment
and
Random Variable

Heads, Tails, Tails, Heads, Heads ......
Guess the random Experiment that gives:

Heads, Tails, Tails, Heads, Heads ......
Guess the random Experiment that gives:
Flipping a coin - of course

Heads
Tails
What can we expect about the outcome?


In probability theory, the law of large numbers is a mathematical law that states that the average of the results obtained from a large number of independent random samples converges to the true value.
Let \(X_1,X_2,\dots\) be independent and identically distributed random variables with finite mean \(\mu=\mathbb{E}[X_1]\).
Define the sample average:
\( \mu \) be the true mean. Then according to the law:

\( \text{class 0}\)
\( \text{class 1}\)
Decision Boundary

Where does the data come from?

Interpolation for data generation
Can I interpolate between data points?
Basic idea behind morphing images, style mixing, data augmentation


What is the Data Distribution?

What is a Probability Distribution?
A probability distribution describes how the probability mass (discrete) or probability density (continuous) is assigned to different possible outcomes of a random variable.
For a discrete variable X:
For a continuous variable X with PDF \( p(x) \):

What is a Probability Distribution?
- \( p_\theta(x)\) : probability density or mass function parameterized by \( \theta \).
- \( f_\theta(x) \) : energy function or negative log probability
- \( Z_\theta \) : partition function (normalization constant) that ensures the total probability integrates or sums to 1.



Mean - \( \mu \)
Variance - \( \sigma^2 \)


Mean - \( \mu \)
Variance - \( \sigma^2 \)
All of these denote Gaussian distributions
A sample from the above distribution:

Multivariate Gaussian



Bayes Rule:
Bayes’ Theorem relates conditional probabilities in both directions
Prior
Evidence
Likelihood
Posterior

But, what about images?
Where does your sample come from?




But, what about images?
Where does your sample come from?



unknown
Where does your sample come from?



unknown

But, what about images?
Images are multidimensional vectors
but, what does it mean when two images are closer to each other?





\( \mu \) and \( \sigma\) can be multi -dimensional

The power of sampling.....




Data reconstruction using VAEs













Source:
https://lilianweng.github.io/posts/2024-04-12-diffusion-video/


Guest Lecture: CSE 574
By Naresh Kumar Devulapally
Guest Lecture: CSE 574
Invited Guest Lecture - CSE 555 Pattern Recognition - Spring 2025
- 125