Loading

BIOSC 1540: L10A (Atomistic insight)

aalexmmaldonado

This is a live streamed presentation. You will automatically follow the presenter and see the slide they're currently on.

Computational Biology

(BIOSC 1540)

Mar 18, 2025

Lecture 10A

Atomistic insight

Foundations

Announcements

  • Quiz 03 is today
  • Quiz 04 (our last one) will be on Apr 8
  • Remember that your lowest quiz is dropped

Assignments

  • P02B is due Mar 28
  • P02C is due Mar 28
  • P03A is due Apr 4

Quizzes

  • The final exam is on Monday, Apr 28, at 4:00 pm in 244 Cathedral of Learning
  • The exam is cumulative and optional
  • Will replace any quiz lower than your final exam grade

Final exam

After today, you should have a better understanding of

Quiz 03

Please put away all materials as we distribute the quiz

Fill out the cover page, and do not start yet

Sit with an empty seat between you and your neighbors for the quiz

Quiz ends around 9:51 am

When you are finished, please hold on to your quiz and feel free to doodle or write anything on the last page

After today, you should have a better understanding of

Molecular ensembles and their relevance

Macrostates

Physics at the molecular level is statistical

Number of Particles: Biological systems contain billions of atoms interacting simultaneously

Thermal Motion: Atoms and molecules are in constant motion due to thermal energy

Uncertainty and Variability: Exact positions and velocities of particles are inherently uncertain

Observable properties are averages of atomistic behaviors

Microscopic level: Individual atoms and molecules

Macroscopic level: Bulk properties  from collective behavior

Atomistic systems are stochastic, measurable properties are computed as averages

Statistical mechanics uses statistical methods to relate microscopic properties to macroscopic observables

Changing any one of these values changes the macrostate

A Macrostate Describes the Overall Condition of a System

A macrostate is defined by macroscopic variables such as temperature, pressure, volume, and number of particles.

Example: Methanol and water

Composition: 70% methanol and 30% water by mass

Temperature: 25 C

Pressure: 1.01325 bar

Volume: 100 mL

It provides a coarse-grained system description, ignoring the specific details of individual particles.

Macrostates Capture What We Can Measure

Instead, we use macroscopic variables like density, energy, and composition, which summarize the system’s overall state.

Example: The pressure in a tire depends on the average behavior of gas molecules, not the exact motion of each one.

We cannot measure each molecule's exact position and velocity in a system.

Changing a Macrostate Can Change a System’s Properties

Example: Supercooled water can remain liquid below 0°C, but a small disturbance changes its macrostate to solid ice.

When a macrostate changes, the system may undergo phase transitions or shifts in observable properties.

Some macrostates are stable, while others are metastable (temporarily stable before changing).

Molecular example: Protein simulations

In structural biology, our molecular ensembles are normally defined with temperature, pressure, and chemical species

Chemical species are our proteins, solvent molecules, ions, etc.

Environmental factors such as pH will influence our chemical species

After today, you should have a better understanding of

Molecular ensembles and their relevance

Microstates

Molecular properties emerge from atomic interactions, but we cannot measure individual atoms directly

Biological and chemical properties arise from atomic-scale interactions like hydrogen bonding, electrostatic forces, and conformational changes.

Experimental techniques measure averages over many molecules, but they do not provide direct access to individual atomic motions.

Computational methods, such as molecular simulations, allow us to track how atoms move and interact over time.

Ensembles Provide a Statistical View of Molecular Behavior

A system at a given temperature, pressure, and volume can exist in many possible microscopic configurations.

Each configuration (microstate) represents a unique arrangement of atomic positions and velocities.

By sampling an ensemble of microstates, we can determine probability distributions of molecular properties.

A microstate is a single arrangement of atoms and their velocities within a macrostate

Every microstate is one specific realization of atomic positions and momenta.

The system constantly moves between different microstates due to thermal motion and molecular interactions.

Example: A protein-ligand complex exists in many conformations—some tightly bound, others loosely interacting.

We can use ensembles to study the strength and length of a His148 hydrogen bond to the anionic chromophore

His148 stabilizes the anionic chromophore through hydrogen bonding, which influences fluorescence properties.

The hydrogen bond length fluctuates over time as atoms move between different microstates.

By sampling an ensemble of molecular simulations, we determine the mean hydrogen bond length and energy.

To measure the hydrogen bond, we must average over many microstates to get a meaningful result

Our macrostate: roGFP2 in water, with 150 mM NaCl at 300 K and 1 atm

A single microstate may show a short or long bond length, but this does not represent the overall behavior.

A properly sampled ensemble gives the average bond length and the distribution of bond fluctuations.

Here is the MD trajectory

with a mean of 3.155 Å

Accurate results require statistical sampling across many microstates

Observing one molecular snapshot is like looking at one frame of a movie—it does not capture the full dynamics.

By simulating thousands of microstates, we capture how the hydrogen bond length varies over time.

After today, you should have a better understanding of

Role of enthalpy and entropy in molecular interactions

Selective binding to a protein is governed by thermodynamics (and kinetics)

Binding occurs when a compound/ligand interacts specifically with a protein

Protein

Ligand

Binding

Protein-

ligand

We can model this as a reversible protein-ligand binding

P + L \leftrightharpoons PL

Binding affinity is determined by the Gibbs free energy change

The change in free energy when a ligand binds to a protein

\Delta G_{bind} = G_{PL} - G_{P} - G_{L}

Determines binding process spontaneity

Gibbs free energy combines enthalpy and entropy

\Delta G_{bind} = \Delta H_{bind} - T\Delta S_{bind}
\Delta H_{bind}
\Delta S_{bind}

Entropy

Enthalpy

Accounts for energetic interactions

How much conformational flexibility changes

Note: Simulations capture free energy directly instead of treating enthalpy and entropy separately

After today, you should have a better understanding of

Noncovalent interactions and enthalpy

Enthalpy accounts for noncovalent interactions

Noncovalent interactions: Electrostatics, hydrogen bonds, dipoles, π-π stacking, etc.

Ensemble differences in noncovalent interactions provide binding enthalpy

\Delta H_{bind} = \left\langle H_{PL} \right\rangle - \left\langle H_{P} \right\rangle - \left\langle H_{L} \right\rangle
\left\langle \cdots \right\rangle

Ensemble average

(We assume no covalent bond breaking)

Chemical interactions are determined by fluctuating electron densities

Our noncovalent interactions conceptual framework:

3. Regions of increased electron density are associated with higher partial negative charges

4. Electrons are mobile and can be perturbed by external interactions

1. Coulomb's law describes the interactions between charges

Molecular interactions are governed by their electron densities (Hohenberg-Kohn theorem)

This is rather difficult, so we often use conceptual frameworks to explain trends (e.g., hybridization and resonance)

2. Molecular geometry uniquely specifies an electron density

Electrostatic forces govern interactions between charged and polar regions

Charged molecules have a net imbalance between

  • Positive charges in their nuclei
  • Negative charges from their electrons

This leads to net electrostatic attractions or repulsions between different atoms or molecules

Arginine

Glycine

~5 to 20 kcal/mol per interaction

Long-Range Interaction: Can attract ligands to the binding site from a distance

Anchor Points: Often serves as key anchoring interactions in the binding site

Role in binding

Hydrogen bonds are a type of electrostatic interactions

Attraction between a (donor) hydrogen atom covalently bonded to an electronegative atom and another (acceptor) electronegative atom with a lone pair

  • Common donors: O-H, N-H groups
  • Common acceptors: O and N atoms with lone pairs

~2 to 7 kcal/mol per hydrogen bond

Strongest when the hydrogen, donor, and acceptor atoms are colinear

Specificity​: Precise orientation of the ligand

Stabilization​: Moderately strong interactions

Role in binding

Dynamic​: Allows for adaptability of ligands

Uneven electron distribution creates partial charges and dipoles

Electronegativity differences lead to unequal distribution of electron density

Unequal distribution results in regions or partial positive or partial negative charges

Consistent electron density spatial variation results in permanent dipoles

~0.01 to 1 kcal/mol per interaction

Directional binding: Highly directional, ensuring that the ligand aligns correctly

Flexibility: Can accommodate slight conformational changes

Role in binding

Van der Waals forces are weak, non-directional interactions

Dispersion: Electrons in molecules are constantly moving, leading to temporary uneven distributions that induce dipoles in neighboring molecules

~0.4 to 4 kcal/mol per interaction

Complementary fit​: Maximizes surface contact

Flexibility: Allows small conformational changes

Role in binding

Induction: The electric field of a polar molecule distorts the electron cloud of a nonpolar molecule, creating a temporary dipole

π-π interactions involve stacking of aromatic rings

Noncovalent interactions between aromatic rings due to overlap of π-electron clouds

~1 to 15 kcal/mol per interaction

Edge-to-face

Displaced

Face-to-face

Orientation​: Proper positioning of aromatics

Selectivity: Recognition of ligands

Role in binding

After today, you should have a better understanding of

Entropy as molecular flexibility and dynamics

Entropy accounts for microstate diversity of a single system state

One of Alex's esoteric points: "Entropy is disorder," is a massive oversimplification that breaks down in actual practice

Entropy is formally defined as 

S = k_B \ln \Omega

is the total number of microstates available to the system without changing the system state

\Omega

Entropy is "energy dispersion"

Higher entropy implies greater microstate diversity

"System state" can be arbitrarily defined and compared as

  • Unbound ligand vs. bound ligand
  • Unfolded protein vs. folded protein
  • Liquid water at 300 K vs. 500 K

Grid-based protein-ligand binding

Suppose I have a system with

  • Protein receptor
  • Ligands positioned on a grid

My macrostate (number and identity of particles, temperature, and pressure) remain constant

How many ways can I rearrange the ligands without binding to the receptor?

L
N

Number of ligands

Number of sites

\Omega = \frac{N!}{L! \left( N - L \right)!}

Number of ways to choose L grid sites out of N is the binomial coefficient

Grid-based protein-ligand binding

\Omega = \frac{N!}{(L-1)! \left( N - L + 1 \right)!}

What if one ligand binds to the receptor?

How does entropy change?

Increase

No change

Decrease

It depends on our ligand concentration!

How to interpret this: Pick a number of ligands and move to the right (L - 1), does entropy go up or down?

Entropic contributions involve accounting for the number of accessible microstates/configurations for protein and ligand

Before the next class, you should

Lecture 10B:

Atomistic insights -
Methodology

Lecture 10A:

Atomistic insights -
Foundations

Today

Thursday

Experiments measure the weighted mean of microstates

p (x_i)

Remember: Multiple microstates (i.e., configurations) can have the same distance

We measure the ensemble probability of observing a microstate with value

x_i
\left\langle x \right\rangle = \frac{\sum p (x_i) x_i}{\sum p(x_i)}

Expected value of ensemble is computed by weighted mean

Note: Our denominator will always be 1 because we are not using actual partition function

\left\langle x \right\rangle =

2.946 Å

The partition function is the ensemble sum of energy-weighted microstates

Note: To make our lives easier, we assume each microstate has the same energy

Energy

State

L \varepsilon_{sol}
(L - 1) \varepsilon_{sol} + \varepsilon_{b}

Multiplicity

\frac{N!}{L!(N - L)!} \approx \frac{N^L}{L!}
\frac{N!}{(L-1) !(N - L + 1)!} \\ \approx \frac{N^{L - 1}}{(L - 1)!}

Weight

Z_B = \frac{N^{L - 1}}{(L - 1)!} e^{- \beta \left[ (L - 1) \varepsilon_{sol} + \varepsilon_{b} \right]}
Z_A = \frac{N^L}{L!} e^{-\beta \left[ L \varepsilon_{sol} \right]}

The Stirling approximation

Z = Z_A + Z_B

Energy

              of each microstate. (In our model, this is based on number of solvated and bound ligands)

The               of this system state in our macrostate ensemble

weight

Total partition function

Multiplicity

, or the the number microstates