Lecture 14: Information theory
Institute for Theoretical Physics, Heidelberg University
Entropy
Measure of uncertainty, or lack of predictability, associated with a random variable drawn from a given distribution.
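Consider a discrete random variable $X$ with distribution $p$ over $K$ states. Its entropy is
$$H(X) = -\sum_{k=1}^{K} p(X=k) \log_2 p(X=k),$$
measured in bits when the logarithm is taken in base 2.
Consider a binary random variable, $X \in \{0, 1\}$, with $p(X=1) = \theta$ and $p(X=0) = 1 - \theta$. Its entropy is
$$H(X) = -\left[\theta \log_2 \theta + (1-\theta) \log_2 (1-\theta)\right].$$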
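As a quick numerical illustration of the two cases above, here is a minimal sketch in Python. The function names `entropy` and `binary_entropy` and the example distributions are chosen for illustration only, and the base-2 logarithm (bits) is an assumption about the intended units.

```python
import math

def entropy(probs):
    """Shannon entropy in bits of a discrete distribution given as probabilities."""
    # Terms with p = 0 are skipped, following the convention 0 * log 0 = 0.
    return -sum(p * math.log2(p) for p in probs if p > 0)

def binary_entropy(theta):
    """Entropy in bits of a binary variable with p(X = 1) = theta."""
    return entropy([theta, 1.0 - theta])

uniform = entropy([0.25, 0.25, 0.25, 0.25])  # maximal uncertainty over 4 states
peaked = entropy([1.0, 0.0, 0.0, 0.0])       # outcome is certain
fair_coin = binary_entropy(0.5)              # most unpredictable binary variable
biased_coin = binary_entropy(0.9)            # more predictable, lower entropy

print(uniform, peaked, fair_coin, biased_coin)
```

The uniform distribution over 4 states gives 2 bits and the fair coin gives 1 bit, while a deterministic outcome gives 0 bits.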
Cross entropy
The cross entropy between distributions $p$ and $q$ is
$$H_{ce}(p, q) = -\sum_{k=1}^{K} p_k \log_2 q_k.$$
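The following sketch computes a cross entropy for two small distributions. The distributions `p` and `q` are hypothetical, and the code assumes $q_k > 0$ wherever $p_k > 0$ (otherwise the cross entropy is infinite and `math.log2` would raise an error).

```python
import math

def cross_entropy(p, q):
    """Cross entropy in bits: -sum_k p_k log2 q_k."""
    # Assumes q_k > 0 wherever p_k > 0; terms with p_k = 0 are skipped.
    return -sum(pk * math.log2(qk) for pk, qk in zip(p, q) if pk > 0)

p = [0.5, 0.25, 0.25]   # hypothetical "true" distribution
q = [1 / 3, 1 / 3, 1 / 3]  # hypothetical model distribution

h_pp = cross_entropy(p, p)  # equals the entropy of p
h_pq = cross_entropy(p, q)  # larger, because q differs from p

print(h_pp, h_pq)
```

The cross entropy of `p` with itself reduces to the entropy of `p`, and using a mismatched `q` can only increase it.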
Joint entropy
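The joint entropy of two random variables $X$ and $Y$ is
$$H(X, Y) = -\sum_{x, y} p(x, y) \log_2 p(x, y).$$
Example: for $n \in \{1, \ldots, 8\}$, the two binary rows below are consistent with $X(n) = 1$ if $n$ is even and $Y(n) = 1$ if $n$ is prime.
$n$ | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 |
---|---|---|---|---|---|---|---|---|
$X$ | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 |
$Y$ | 0 | 1 | 1 | 0 | 1 | 0 | 1 | 0 |
The joint distribution, with each $n$ equally likely, is
$p(X, Y)$ | $Y = 0$ | $Y = 1$ |
---|---|---|
$X = 0$ | 1/8 | 3/8 |
$X = 1$ | 3/8 | 1/8 |
So the joint entropy yields
$$H(X, Y) = -\left[\tfrac{1}{8} \log_2 \tfrac{1}{8} + \tfrac{3}{8} \log_2 \tfrac{3}{8} + \tfrac{3}{8} \log_2 \tfrac{3}{8} + \tfrac{1}{8} \log_2 \tfrac{1}{8}\right] \approx 1.81 \text{ bits}.$$
On the other hand, the marginal probabilities are uniform, $p(X=0) = p(X=1) = p(Y=0) = p(Y=1) = 0.5$, so $H(X) = H(Y) = 1$ bit and $H(X, Y) \approx 1.81 < H(X) + H(Y) = 2$ bits.
Lower bound on $H(X, Y)$: $H(X, Y) \geq \max\{H(X), H(Y)\} \geq 0$.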
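The numbers in this example can be checked with a short Python sketch. It takes the two binary rows of the table as given and assumes the eight columns are equally likely; the helper name `entropy_from_samples` is illustrative.

```python
import math
from collections import Counter

# The two binary rows of the table, one entry per column 1..8.
x = [0, 1, 0, 1, 0, 1, 0, 1]
y = [0, 1, 1, 0, 1, 0, 1, 0]

def entropy_from_samples(values):
    """Entropy in bits of the empirical distribution of equally weighted values."""
    counts = Counter(values)
    n = len(values)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())

h_x = entropy_from_samples(x)                 # marginal entropy of the first row
h_y = entropy_from_samples(y)                 # marginal entropy of the second row
h_xy = entropy_from_samples(list(zip(x, y)))  # joint entropy of the pairs

print(h_x, h_y, round(h_xy, 2))  # joint entropy is about 1.81 bits
```

The joint entropy lies between $\max\{H(X), H(Y)\} = 1$ bit and $H(X) + H(Y) = 2$ bits, as expected for dependent variables.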
Conditional entropy
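The conditional entropy of $Y$ given $X$ is the uncertainty that remains in $Y$ once $X$ is known, averaged over the values of $X$:
$$H(Y | X) = \sum_{x} p(x) \, H(Y | X = x) = H(X, Y) - H(X).$$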
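A minimal sketch of a conditional entropy calculation, using the joint distribution implied by the two binary rows of the table above (eight equally weighted columns). The dictionary layout and function name are illustrative choices.

```python
import math

# Joint distribution p(x, y) obtained by counting the (x, y) pairs in the table.
joint = {(0, 0): 1 / 8, (0, 1): 3 / 8, (1, 0): 3 / 8, (1, 1): 1 / 8}

def conditional_entropy(joint):
    """H(Y|X) in bits, computed as -sum_{x,y} p(x,y) log2 p(y|x)."""
    # Marginal p(x), obtained by summing the joint over y.
    p_x = {}
    for (xv, _), p in joint.items():
        p_x[xv] = p_x.get(xv, 0.0) + p
    # p(y|x) = p(x,y) / p(x); zero-probability pairs contribute nothing.
    return -sum(p * math.log2(p / p_x[xv]) for (xv, _), p in joint.items() if p > 0)

h_y_given_x = conditional_entropy(joint)
print(round(h_y_given_x, 2))  # about 0.81 bits
```

The result matches the difference between the joint entropy (about 1.81 bits) and the marginal entropy of the first variable (1 bit).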
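KL divergence
Definition
Given two distributions $p$ and $q$, the Kullback-Leibler (KL) divergence $D_{KL}(p \| q)$ measures how dissimilar $q$ is from $p$.
For discrete distributions,
$$D_{KL}(p \| q) = \sum_{k=1}^{K} p_k \log \frac{p_k}{q_k}.$$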
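For discrete distributions, the KL divergence can be evaluated directly. The sketch below uses hypothetical distributions `p` and `q`, works in bits, and assumes $q_k > 0$ wherever $p_k > 0$.

```python
import math

def kl_divergence(p, q):
    """KL divergence D_KL(p || q) in bits for discrete distributions."""
    # Assumes q_k > 0 wherever p_k > 0; terms with p_k = 0 are skipped.
    return sum(pk * math.log2(pk / qk) for pk, qk in zip(p, q) if pk > 0)

p = [0.5, 0.25, 0.25]      # hypothetical distribution
q = [1 / 3, 1 / 3, 1 / 3]  # hypothetical distribution

kl_pq = kl_divergence(p, q)
kl_qp = kl_divergence(q, p)
kl_pp = kl_divergence(p, p)

print(kl_pq, kl_qp, kl_pp)
```

The divergence is zero when the two distributions coincide, positive otherwise, and in general not symmetric in its arguments, which is why it is not a distance in the metric sense.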
Scalar case:
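Objective: Find the distribution $q$ that is as close as possible to $p$, as measured by the KL divergence:
$$q^* = \arg\min_q D_{KL}(p \| q) = \arg\min_q \left[\int p(x) \log p(x) \, dx - \int p(x) \log q(x) \, dx\right].$$
Suppose that $p$ is the empirical distribution of the $N$ training points,
$$p_D(x) = \frac{1}{N} \sum_{n=1}^{N} \delta(x - x_n).$$
Replace $p$ with $p_D$:
$$D_{KL}(p_D \| q) = -\frac{1}{N} \sum_{n=1}^{N} \log q(x_n) + \text{const},$$
where the constant does not depend on $q$.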
Minimizing KL divergence to the empirical distribution is equivalent to maximizing likelihood.
Likelihood-based training puts too much weight on the training set, which can lead to generalization issues.
Approaches: smooth the empirical distribution, use data augmentation, etc.
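The equivalence between minimizing the KL divergence to the empirical distribution and maximizing likelihood can be checked numerically. The sketch below is an illustration under stated assumptions: the coin-flip data are hypothetical, the model is a Bernoulli distribution with parameter `theta`, and a simple grid search stands in for a proper optimizer.

```python
import math

data = [1, 0, 1, 1, 0, 1, 1, 1]  # hypothetical binary observations
n = len(data)
p_emp = [data.count(0) / n, data.count(1) / n]  # empirical distribution

def kl_to_empirical(theta):
    """D_KL(p_emp || q_theta) in nats for a Bernoulli model q_theta."""
    q = [1.0 - theta, theta]
    return sum(pk * math.log(pk / qk) for pk, qk in zip(p_emp, q) if pk > 0)

def avg_neg_log_lik(theta):
    """Average negative log-likelihood of the data under q_theta."""
    return -sum(math.log(theta if x == 1 else 1.0 - theta) for x in data) / n

# Grid over the open interval (0, 1) to avoid log(0).
grid = [i / 1000 for i in range(1, 1000)]
theta_kl = min(grid, key=kl_to_empirical)
theta_ml = min(grid, key=avg_neg_log_lik)

print(theta_kl, theta_ml)  # both equal the fraction of ones in the data
```

The two objectives differ only by the entropy of the empirical distribution, which does not depend on `theta`, so they share the same minimizer.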
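Mutual information
Measures the dependence of two random variables, $X$ and $Y$.
Definition
For discrete random variables $X$ and $Y$,
$$I(X; Y) = D_{KL}\big(p(x, y) \,\|\, p(x) p(y)\big) = \sum_{x} \sum_{y} p(x, y) \log \frac{p(x, y)}{p(x) p(y)}.$$
Rewrite as a function of entropies:
$$I(X; Y) = H(X) - H(X | Y) = H(Y) - H(Y | X) = H(X) + H(Y) - H(X, Y).$$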
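As a sketch of how the two forms agree, the code below computes the dependence between the two binary rows of the earlier table, once from the joint and marginal probabilities and once from entropies. The joint distribution assumes the eight table columns are equally likely; all function names are illustrative.

```python
import math

# Joint distribution p(x, y) obtained by counting the (x, y) pairs in the table.
joint = {(0, 0): 1 / 8, (0, 1): 3 / 8, (1, 0): 3 / 8, (1, 1): 1 / 8}

def marginals(joint):
    """Return the marginal distributions p(x) and p(y) as dictionaries."""
    p_x, p_y = {}, {}
    for (xv, yv), p in joint.items():
        p_x[xv] = p_x.get(xv, 0.0) + p
        p_y[yv] = p_y.get(yv, 0.0) + p
    return p_x, p_y

def entropy(probs):
    """Shannon entropy in bits."""
    return -sum(p * math.log2(p) for p in probs if p > 0)

def mutual_information(joint):
    """Sum over (x, y) of p(x,y) log2 [p(x,y) / (p(x) p(y))], in bits."""
    p_x, p_y = marginals(joint)
    return sum(p * math.log2(p / (p_x[xv] * p_y[yv]))
               for (xv, yv), p in joint.items() if p > 0)

p_x, p_y = marginals(joint)
mi = mutual_information(joint)
mi_from_entropies = (entropy(p_x.values()) + entropy(p_y.values())
                     - entropy(joint.values()))

print(round(mi, 3), round(mi_from_entropies, 3))  # both about 0.189 bits
```

A value close to zero indicates weak dependence; it would be exactly zero if the joint distribution factorized into the product of its marginals.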