Lesson 10 PPT

Protein structure

๐Ÿงฌ Molecular Simulation & Protein Structure Prediction

๐Ÿ”ฌ NMR vs X-ray Crystallography

  • NMR (Nuclear Magnetic Resonance)
    • Produces an ensemble of structures (multiple conformations)
    • Reflects protein flexibility in solution
    • Useful for studying dynamic proteins
  • X-ray crystallography
    • Produces a single high-resolution structure
    • Requires crystallization (non-physiological)
    • Often misses flexible regions

๐Ÿ‘‰ Key takeaway: NMR = dynamic + multiple conformations X-ray = static + high resolution


โš™๏ธ Protein Folding Problem

  • Proteins fold into structures that minimize free energy
  • The folding problem = searching for global minimum energy

โš ๏ธ Problem:

  • Huge search space (ฯˆ/ฯ† backbone angles + side chains)
  • Number of conformations grows exponentially

๐Ÿ‘‰ This is why pure computational folding is hard


๐Ÿง  Protein Structure Prediction Methods

1. Ab initio (First principles)

  • Uses physics only
  • โŒ Too computationally expensive (not practical for large proteins)

2. Comparative Modeling

  • Uses known structures as templates

3. Homology Modeling

  • Based on sequence similarity
  • If sequences are similar โ†’ structures are similar

4. Protein Threading

  • Fits sequence into known structural folds, even with low similarity

๐Ÿงต Protein Threading

๐Ÿ’ก Concept

  • Place sequence onto known structure
  • Evaluate how well it fits (energy scoring)

Why it works:

  • Proteins adopt limited number of folds
  • ~10 folds explain ~50% of structures

๐Ÿ‘‰ Instead of searching infinite possibilities โ†’ reuse known folds


โš™๏ธ Threading Components

  • Structural template database
  • Energy/scoring function
  • Alignment algorithm
  • Reliability assessment

๐Ÿ“Š Scoring (Punctuation) Functions

Key factors:

  • Solvation potentials (buried vs exposed residues)
  • Contact potentials
  • Secondary structure agreement
  • Accessibility predictions

๐Ÿ”ฅ Contact Potential (Important!)

Uses Boltzmann principle:

  • Favorable contacts occur more frequently
  • Energy is derived from observed frequencies

๐Ÿ‘‰ Translation:

  • Common interactions โ†’ energetically favorable

๐Ÿงฌ Sequence Profiles + Secondary Structure

  • Combines:
    • Evolutionary info (profiles)
    • Predicted secondary structure

๐Ÿ‘‰ Improves accuracy significantly


๐Ÿ“Š Post-processing & Evaluation

  • Filter bad models
  • Combine additional data
  • Benchmark using CASP experiments

๐Ÿค– AlphaFold (Deep Learning Revolution)

๐Ÿง  Overview

  • Uses deep learning + evolutionary data
  • Predicts 3D structure from sequence

๐Ÿงฌ Key Components

1. Input

  • Amino acid sequence

2. MSA (Multiple Sequence Alignment)

  • Detects:
    • conserved residues
    • co-evolution (residues interacting)

๐Ÿ‘‰ If two residues mutate together โ†’ likely interact


3. Evoformer (Core engine)

  • Transformer-based
  • Learns:
    • long-range interactions
    • structural constraints

4. Structure Module

  • Converts predictions into 3D coordinates
  • Uses Invariant Point Attention (IPA)

๐Ÿ‘‰ Important:

  • Handles spatial geometry properly

5. Output

  • Full atomic model
  • Confidence metrics:
    • pLDDT โ†’ per residue confidence
    • PAE โ†’ domain relationship uncertainty

๐Ÿงช Molecular Docking

๐Ÿ”‘ What is Docking?

  • Predicts how a ligand binds to a protein

โš ๏ธ Important:

  • Does NOT directly predict bioactivity

๐Ÿ”„ Docking Theories

  • Lock-and-key โ†’ rigid fit
  • Induced fit โ†’ protein adapts

โš™๏ธ Two Main Steps

1. Sampling

  • Try many ligand conformations

2. Scoring

  • Rank based on binding energy

๐Ÿงฌ Types of Docking

  • Proteinโ€“protein
  • Proteinโ€“ligand
  • Proteinโ€“nucleotide

๐ŸŽฏ Applications

  • Reproduce known binding modes
  • Predict binding of known ligands
  • Estimate binding affinities
  • Virtual screening (drug discovery)

โš™๏ธ Algorithms

  • Use:
    • conformational search methods
    • scoring functions

๐Ÿ” Docking Workflow

Typical steps:

  1. Prepare protein + ligand
  2. Define binding site
  3. Generate conformations
  4. Score poses
  5. Rank results

๐Ÿงช Validation Methods

๐Ÿ” Redocking

  • Dock ligand back into known structure
  • Tests accuracy

๐Ÿ”„ Cross-docking

  • Dock ligand into different structures
  • Tests robustness

๐Ÿ“ˆ ROC Curve (Image slide explanation)

  • Measures model performance
  • AUC (Area Under Curve):
    • 1.0 = perfect
    • 0.5 = random

๐Ÿ‘‰ Used in virtual screening


๐Ÿ”ฌ Binding Energy Example

  • Example: -10.10 kcal/mol
  • More negative = stronger binding

๐Ÿงพ Docking Score Table

  • Compares different ligand poses
  • Used to select best candidate

โš ๏ธ Types of Docking (Advanced)

  • Covalent docking โ†’ ligand forms bond
  • Blind docking โ†’ unknown binding site
  • Reverse docking โ†’ one ligand vs many proteins

โš–๏ธ Pros & Cons of Docking

โœ… Pros

  • Fast screening
  • Cost-effective
  • Useful for drug discovery

โŒ Cons

  • Scoring functions imperfect
  • Protein flexibility limited
  • Does not guarantee biological activity

๐Ÿง  Big Picture Summary

  • Protein structure prediction evolved from:
    • โŒ physics-based โ†’ too slow
    • โœ… template-based โ†’ useful
    • ๐Ÿš€ AI-based (AlphaFold) โ†’ breakthrough
  • Docking:
    • Predicts binding mode, not activity
    • Depends heavily on sampling + scoring quality

๐Ÿ”ฅ Key Concept Connections (Important for Exams)

  • Threading vs Homology modeling
    • Homology โ†’ sequence similarity
    • Threading โ†’ structure similarity
  • AlphaFold vs Docking
    • AlphaFold โ†’ structure prediction
    • Docking โ†’ interaction prediction
  • Energy principle
    • Folding โ†’ minimize energy
    • Docking โ†’ optimize binding energy

Quiz

Score: 0/30 (0%)