Lecture 7/8 Ex Paper 1 Wlodawer

Protein structure

Here is a fun, structured, and educational deep summary of the review article:

** – Protein crystallography for non-crystallographers (Wlodawer et al., 2008)


🧬 Protein Crystallography for Non-Crystallographers β€” Detailed Summary

This review is essentially a survival guide for biologists / structural users who download PDB structures and want to interpret them correctly β€” without over-interpreting them.

It explains:

  • How crystal structures are determined
  • What electron density actually means
  • How to judge structure quality
  • Common mistakes and red flags
  • How to interpret biological conclusions safely

🌍 Why Protein Crystal Structures Matter

  • The Protein Data Bank (PDB) contains tens of thousands of macromolecular structures.
  • Most are solved by X-ray crystallography.
  • Structural information is crucial for:
    • Understanding biological mechanisms
    • Drug design
    • Protein engineering

BUT:

πŸ‘‰ Beautiful molecular graphics can be misleading. πŸ‘‰ Coordinates are interpretations of experimental data, not direct observations.

The paper’s core message:

Users must learn to critically evaluate structures β€” not just trust them.


πŸ”¬ How a Crystal Structure Is Determined (Conceptual Workflow)

1️⃣ X-ray diffraction

  • X-rays scatter off electrons in atoms.
  • A crystal behaves like a 3D diffraction grating.
  • The geometry of diffraction spots depends on:
    • Crystal lattice
    • Wavelength
  • The intensity depends on atom positions.

Important insight:

Each reflection depends on the positions of all atoms, so the entire structure must be modeled.


2️⃣ Phase problem

  • Diffraction gives amplitudes but not phases.
  • Phases must be obtained via methods like:
    • Molecular replacement
    • Heavy atom methods
  • Initial phases β†’ approximate electron density map

3️⃣ Model building + refinement

Refinement adjusts:

  • Atomic coordinates (x,y,z)
  • B-factor (atomic mobility / disorder)
  • Occupancy

Optimization aims to:

  • Minimize difference between observed and calculated diffraction.

⚠️ Huge complexity:

  • Even a 20 kDa protein β‰ˆ 6000 parameters to refine.

Hence:

  • Refinement uses stereochemical restraints (chemical knowledge).

πŸ—ΊοΈ Electron Density Maps β€” The Real Experimental Result

Important conceptual point:

The electron density map is the experiment. The atomic model is interpretation.


Types of maps

🟦 (Fobs, Ο†calc) map

β†’ Approximate structure density.

πŸŸ₯ Difference map (Fobs βˆ’ Fcalc)

Shows:

  • Positive peaks β†’ missing atoms
  • Negative peaks β†’ wrongly modeled atoms

🟩 2Fobs βˆ’ Fcalc map

Most commonly used:

  • Combines both for visualization.

Noise and contour levels

  • Maps contain noise due to experimental errors.
  • Typical contour levels:
    • ~1Οƒ for main map
    • Β±3Οƒ for difference map

Using lower levels can show noise instead of real structure.


πŸ“ Resolution β€” The Most Important Parameter

Resolution determines:

Level of detail visible in the structure.

ResolutionWhat you see
~6 Γ…Overall shape
~3 Γ…Secondary structure
~2 Γ…Side chains + waters
~1.2 Γ…Atomic resolution
<1 Γ…Hydrogen atoms

Higher resolution β†’ more reflections β†’ better model.

At atomic resolution:

  • Individual peaks separate
  • Atom types distinguishable
  • Unusual conformations detectable.

πŸ’§ Disorder and Solvent

Protein crystals are:

πŸ‘‰ ~50% solvent on average.

Two disorder types:

  • Static disorder β€” multiple conformations
  • Dynamic disorder β€” atomic motion

Consequences:

  • Density smearing
  • Poor modeling in flexible regions
  • Missing density in low-resolution maps

Water modeling depends strongly on resolution.


🧾 What to Check in a PDB File (Practical Guide)

Always inspect:

πŸ“Š Data quality indicators

  • Resolution
  • Completeness
  • I/Οƒ(I) (signal-to-noise)
  • Rmerge

🧱 Model quality indicators

  • R-factor
  • Rfree
  • Geometry deviations
  • Ramachandran plot

PDB headers may be:

  • incomplete
  • contradictory
  • even erroneous.

πŸ“‰ R-factor and Rfree β€” Structure Quality Metrics

πŸ”΅ R-factor

Measures agreement between:

  • Observed diffraction
  • Model diffraction

Guidelines:

  • Good: <20%
  • Suspicious: ~30%
  • Excellent: <10%

🟠 Rfree

Calculated on unused reflections.

Purpose:

  • Detect over-fitting.

Warning signs:

  • Rfree βˆ’ R > 7% β†’ possible over-interpretation.

πŸ“ Geometry Validation

Ramachandran plot

Shows backbone torsion angles.

  • 90% residues in favored regions β†’ good structure

  • Many outliers β†’ major issues

Peptide planarity and bond rmsd also critical.


⚠️ Common Problems in Published Structures

1️⃣ Fabrication or manipulation (rare)

Example:

  • Diffraction pattern misidentified as different protein.

2️⃣ Honest errors

Examples:

  • Chain traced backwards
  • Secondary structure misinterpreted
  • Over-interpretation of low-resolution maps

But:

πŸ‘‰ Community validation usually corrects mistakes.


3️⃣ Over-interpretation

Typical cases:

  • Too many water molecules at low resolution
  • Incorrect metal assignment
  • Hydrogen atoms modeled without evidence
  • Catalytic mechanisms inferred beyond data support

Example insight:

  • Magnesium vs calcium assignment must match coordination distances.

🧠 Biological Interpretation Caveat

Even a perfect crystal structure may represent:

  • Inactive state
  • Artifact of crystallization
  • Non-physiological conformation

Thus:

Structure β‰  mechanism automatically.

Multiple structures + biochemical data are essential.


πŸ§ͺ Final Practical Checklist (From the Paper’s Philosophy)

When reading a structural paper:

βœ… Check resolution vs level of claimed detail βœ… Inspect number of waters / metals βœ… Look at R and Rfree βœ… Examine Ramachandran statistics βœ… Check B-factors and occupancies βœ… Ensure conclusions match data quality βœ… Apply biochemical common sense


⭐ Big Take-Home Message

Protein crystallography is powerful but interpretative.

  • Most structures are reliable.
  • But critical evaluation is necessary.
  • Built-in validation metrics make crystallography uniquely self-correcting.

Quiz

Score: 0/30 (0%)