Day 10 part 2

Protein structure

🧬 Molecular Docking — Full Conceptual Overview


🧠 What is Molecular Docking?

Molecular docking is a computational method used to predict how a molecule (ligand) binds to a protein (target).

  • Goal:
    • Find binding position (pose)
    • Estimate binding strength (affinity)
    • Predict stability of the complex

📌 Key idea:

“Which ligand fits best into the protein, and how strongly?”


⚠️ Important Limitation (Often Misunderstood)

Docking is NOT designed for large molecules (e.g., polymers).

Why?

  • Only a small region (active site) is considered
  • Large molecules extend outside → unrealistic scoring

✔️ Correct approach:

  • Break polymers into small fragments before docking

🔑 Core Concept: Lock-and-Key vs Induced Fit

🔒 Rigid docking (Lock-and-key)

  • Protein = fixed
  • Ligand = fixed
  • Works like LEGO blocks

➡️ If geometry doesn’t match → no binding


🔄 Flexible docking (Induced fit)

  • Ligand (and sometimes protein) can change shape
  • More realistic but computationally heavier

🔄 Sampling Conformations (Your Question 1)

✔️ Your understanding is correct.

What happens:

  1. Place ligand in active site
  2. Rotate flexible bonds step-by-step
  3. Generate many conformations
  4. Evaluate each one

➡️ “Which conformation interacts best?”

📌 Important nuance:

  • Rotation ignores physical barriers (not realistic movement)
  • It just tests possibilities

🧮 Scoring Function → Ranking

Each conformation gets a score based on:

  • van der Waals interactions
  • electrostatics
  • solvation energy

➡️ Final goal:

Lowest energy = best binding

✔️ Yes, you are right: 👉 We aim for minimum energy


🧬 Example: HIV Protease (Your Question 2)

HIV protease

Why it’s important:

  • Target for anti-HIV drugs
  • Inhibitors block enzyme → virus cannot mature

Key problem:

HIV mutates very fast

➡️ Implication:

  • Binding site changes → drugs stop working
  • Docking must account for mutations

✔️ Insight: Docking is used to design new inhibitors faster than mutations evolve


🧬 Protein–Protein Docking (Your Question 3)

What it means:

  • Predict how two proteins interact

Applications:

  • Protein complexes
  • Signaling pathways
  • Enzyme regulation

Difference vs small molecule docking:

  • Larger interfaces
  • More complex geometry
  • More flexibility

🧪 Reproducing Binding Mode (Your Question 4)

What this means:

You already have:

  • Experimental structure (X-ray / NMR)

Goal:

  • Check if docking can reproduce the same binding pose

📌 Why important:

  • Validates docking method
  • Confirms scoring function reliability

✔️ If docking ≈ crystal structure → method is trustworthy


🧩 Fragment-Based Docking (Your Question 5)

✔️ Your understanding is correct.

Process:

  1. Break ligand into fragments
  2. Dock fragments individually
  3. Recombine best fragments

➡️ Build optimized molecule step-by-step

📌 Advantage:

  • Explores chemical space efficiently

⚙️ Docking Workflow (Corrected & Expanded)


1️⃣ Protein & Ligand Selection

  • Need:
    • Protein structure (PDB)
    • Ligand structure (mol2, etc.)

2️⃣ Protein Preparation (Your Question)

Why protonate?

X-ray structures do not include hydrogens

➡️ But hydrogen atoms are essential for:

  • Hydrogen bonding
  • Electrostatics

So you must: ✔️ Add hydrogens ✔️ Assign correct protonation states


⚠️ Critical Issue: Histidine (Your Question)

Histidine

Histidine can be:

  • Protonated at Nδ1
  • Protonated at Nε2
  • Both (charged)
  • None

Why this matters:

  • Wrong protonation → wrong hydrogen bonds
  • Can completely ruin docking results

✔️ Key rule:

Protonation must match local environment of active site


Additional Preparation Steps

  • Remove crystal waters
  • Define active site residues

📦 Binding Box (Docking Box)

What is it?

A 3D region where docking occurs

Trade-off:

  • Too small → miss interactions
  • Too large → inefficient & noisy

✔️ Best:

Small but includes all active residues


🎯 Docking Types


🔍 Blind Docking (Your Question)

✔️ Your description is correct.

  • Dock ligand across entire protein surface
  • Many simulations → map binding hotspots

➡️ Output:

  • Clusters of preferred binding sites

🎯 Focused Docking

  • Restrict docking to known active site

✔️ More accurate, faster


🔄 Reverse Docking (Your Question)

✔️ Correct idea.

Instead of:

  • Many ligands → 1 protein

You do:

  • 1 ligand → many proteins

➡️ Find:

Which protein binds best?


🔁 Redocking (Your Question)

What it means:

Take a ligand from:

  • Known crystal structure

Then:

  • Remove it
  • Dock it again

➡️ Compare predicted vs experimental pose

✔️ Purpose:

  • Evaluate docking accuracy ("docking power")

🔬 Docking + MD Simulation

Docking gives:

  • Fast prediction

But:

  • Not always realistic

So: ➡️ Run Molecular Dynamics (MD)

Check:

  • Does ligand stay bound?
  • Or diffuse away?

✔️ Key insight:

Good docking ≠ stable complex


⚖️ Energy Principle (Core Concept)

✔️ You are correct:

Systems aim for minimum energy

Docking searches:

  • Many conformations
  • Finds lowest-energy state

⚠️ Pros & Cons

✅ Advantages

  • Fast
  • Good for screening large libraries
  • Low computational cost

❌ Limitations

  • Accuracy depends on:
    • Protonation
    • Flexibility assumptions
  • Can give false positives
  • Needs post-validation (MD)

🔄 Docking vs MD (Important Distinction)

Docking:

  • Samples conformations artificially
  • Ignores energy barriers

MD:

  • Follows real physics
  • Includes time evolution

✔️ Key difference:

Docking jumps between states, MD simulates transitions


🧠 Final Takeaways

  • Docking = pose prediction + scoring
  • Best pose = lowest energy
  • Preparation (especially protonation!) is critical
  • Flexible docking is more realistic but harder
  • Always validate with MD or experiment

Quiz

Score: 0/0 (NaN%)