Day 9 part 2

Protein structure

🧬 Molecular Dynamics (MD) Simulation — Theoretical Summary

⚙️ 1. Newton’s Equations and Motion of Atoms

Molecular dynamics simulations are based on classical Newtonian mechanics.

  • Each atom has:
    • Position (x)
    • Velocity (v) → derivative of position
    • Acceleration (a) → derivative of velocity
  • Motion is described by Force = mass × acceleration

This means MD simulation becomes a huge system of ordinary differential equations because:

  • For M atoms → 3M position coordinates + 3M velocity coordinates
  • So the system exists in a 6M-dimensional phase space

👉 Because this system is extremely complex, an analytical (exact) solution is impossible. Instead, MD uses numerical integration — moving atoms step-by-step in time.


⏱️ 2. Time Step — The Most Critical Parameter

MD simulations advance in small discrete time steps (Δt).

Typical values:

  • 🟢 ~2 femtoseconds → standard protein simulations
  • 🟡 ~4 femtoseconds → very long or membrane simulations

Why is time step important?

If the time step is:

❌ Too small

  • Simulation becomes very slow
  • You may never reach biologically relevant timescales (µs or ms)

❌ Too large

  • You may miss important events like collisions
  • System can become numerically unstable
  • Atoms may “jump” outside the simulation box → simulation “explodes”

Thus, MD requires a trade-off between accuracy and computational speed.


🐸 3. Leapfrog Integration Algorithm

To propagate motion efficiently, MD often uses the leapfrog algorithm.

Concept:

  • Velocities and positions are calculated at offset time points
  • Values “jump over” each other like a frog hopping

Advantages:

  • Time-symmetric
  • Numerically stable
  • Accurate for long simulations

This makes it one of the most widely used integration methods in MD.


🌡️ 4. Atoms Are Always Moving (Even at Low Temperature)

Important physical concept:

  • Atoms are never static
  • Even near 0 K → quantum and thermal motion still exist

Thus:

  • Proteins do not sit permanently in minimum energy
  • They constantly fluctuate around it

Over long simulation time, atomic configurations follow the:

📊 Boltzmann Distribution

This distribution describes:

The probability of a conformation as a function of its potential energy.

  • Low-energy conformations → frequent
  • High-energy conformations → rare

If MD samples all conformations, we can calculate thermodynamic properties identical to experiments.

But reaching full sampling requires very long simulations.


🎰 5. MD vs Monte Carlo Sampling

Monte Carlo (MC)

  • Samples conformations randomly
  • Immediately shows distribution
  • Good for thermodynamic properties

Molecular Dynamics (MD)

  • Follows realistic time evolution
  • Captures:
    • Fast motions
    • Dynamic pathways
    • Mechanistic transitions

Thus:

  • MC → better statistics
  • MD → better physical realism and motion information

🔋 6. Energy Conservation and Thermostats

Total system energy:

E_ = E_ + E_

However:

  • Real atomic collisions are not perfectly elastic
  • Energy is gradually lost

Therefore MD uses:

🛁 Heat bath / thermostat

Functions:

  • Adds energy if system cools
  • Removes energy if system overheats
  • Maintains constant temperature

Energy curves therefore fluctuate around a mean value, not perfectly flat.


🚀 7. Equilibration Phase

At simulation start:

  • All atoms may have identical velocities
  • This is physically unrealistic

During equilibration:

  • Velocities redistribute according to mass
  • System relaxes to correct temperature distribution
  • Artifacts from starting conditions disappear

Only after equilibration → production simulation begins.


💧 8. Solvent Representation

Explicit Solvent

  • Real water molecules simulated
  • Most accurate
  • Very computationally expensive
  • Can introduce artifacts (water clustering, ion shells)

Implicit Solvent

  • No water molecules
  • Environment treated mathematically like water

Advantages:

  • Faster
  • Fewer atoms

Disadvantages:

  • Loss of specific hydrogen-bond interactions
  • Reduced realism near protein surface

📦 9. Periodic Boundary Conditions (PBC)

To avoid atoms leaving the system:

  • Simulation box is surrounded by copies of itself

When atom exits one side:

  • It re-enters from opposite side

Benefits:

  • Constant particle number
  • No artificial wall effects
  • Mimics infinite bulk environment

⚡ 10. Computational Limits of MD

Challenges:

  • Very small time steps required
  • Structural changes may occur on ms scale
  • Huge number of steps needed

Solutions:

  • GPU acceleration
  • Parallel computing
  • Algorithm improvements

📚 11. Force Fields — Semi-Empirical Models

Force fields are:

  • Parameter tables describing atomic interactions
  • Based on:
    • Physics equations
    • Experimental data

Thus MD is semi-empirical.

⚠️ Must use scientific judgement:

  • Simulations may produce physically unrealistic minima
  • Example: alkane chain passing through benzene ring (artifact)

Always check: 👉 Does the simulation result make chemical sense?


❌ 12. What MD Cannot Do

Classical MD limitations:

  • Cannot form or break covalent bonds
  • Cannot change protonation states dynamically
  • Cannot simulate enzyme reactions directly

Reason:

  • Electronic structure is fixed at simulation start
  • Born-Oppenheimer approximation assumed

To model reactions → need QM/MM methods (quantum mechanics + molecular mechanics).


🖥️ 13. Software and Acceleration

Common MD packages:

  • GROMACS
  • AMBER
  • CHARMM

Visualization:

  • VMD (Visual Molecular Dynamics)

Speed improvements:

  • Parallelization
  • GPU (CUDA cores especially important)

⭐ Key Conceptual Takeaway

Molecular dynamics simulations:

  • Provide time-resolved molecular motion
  • Sample conformational landscapes
  • Require careful parameter choices
  • Balance accuracy vs computational feasibility
  • Must always be interpreted with chemical and physical intuition

Quiz

Score: 0/30 (0%)