Lecture 7 Video 13

Protein structure

🧬 Lecture Summary — Building Atomic Models from Electron Density

🌊 From Diffraction Pattern → Electron Density → Atomic Model

After collecting diffraction data and performing Fourier synthesis, the experimental result is:

➡️ A 3D electron density map of the unit cell

This map is the experimental picture of where electrons (and therefore atoms) are located, although it also depends on the phases used to compute it. It is not yet a structure.

👉 The scientist must now interpret the density and build an atomic model inside it.
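
For reference, the Fourier synthesis mentioned above is conventionally written as below. This is the standard textbook form, not a formula taken from the lecture itself. The symbols are assumptions of this note: V is the unit-cell volume, |F(hkl)| is the structure-factor amplitude derived from the measured intensities, and α(hkl) is its phase, which is not measured directly and has to be obtained separately.

```latex
\rho(x, y, z) = \frac{1}{V} \sum_{h} \sum_{k} \sum_{l}
    |F(hkl)| \, e^{i \alpha(hkl)} \, e^{-2 \pi i (hx + ky + lz)}
```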

🎨 Different Ways to Represent Protein Structures (Models)

A “model” can be shown in many formats — each emphasizes different biological or physical insights.

🌀 Cartoon (Ribbon) Representation

Shows secondary structure elements
Helps visualize helices, sheets, topology
Often combined with ball-and-stick residues for important sites

🌍 Surface Representation

Displays domain organization
Shows pockets, interfaces, ligand accessibility
Useful for functional interpretation

🧱 Ball-and-Stick in Electron Density

Shows how atoms fit into the experimental density
Used during model validation

📚 Ensemble Models (Typical for NMR)

Many overlaid models
Shows flexibility and dynamic regions
Regions where the models spread apart = more mobile

🥚 Atomic Displacement Ellipsoids (Anisotropic B-factors)

Each atom drawn as an ellipsoid
Shape indicates direction and magnitude of motion
Small ellipsoids → rigid region
Large ellipsoids → flexible region
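
The link between an ellipsoid's shape and atomic motion can be sketched numerically. The example below assumes the anisotropic displacement parameters are given as a symmetric 3×3 tensor U in Å² (Cartesian axes) and uses the standard relation B = 8π²⟨u²⟩. The function name and the example values are hypothetical, chosen only for illustration.

```python
import numpy as np

def ellipsoid_from_u(u_matrix):
    """Return (rms_amplitudes, principal_axes, b_eq) for a symmetric 3x3 U tensor in Å^2."""
    u = np.asarray(u_matrix, dtype=float)
    # eigh is meant for symmetric matrices; eigenvalues are returned in ascending order
    eigenvalues, eigenvectors = np.linalg.eigh(u)
    # Each eigenvalue is a mean-square displacement along one principal axis
    rms_amplitudes = np.sqrt(eigenvalues)
    # Equivalent isotropic value: average of the three mean-square displacements
    u_eq = np.trace(u) / 3.0
    b_eq = 8.0 * np.pi ** 2 * u_eq
    return rms_amplitudes, eigenvectors, b_eq

# Hypothetical atom that moves mostly along z (made-up numbers)
u_example = [[0.01, 0.00, 0.00],
             [0.00, 0.02, 0.00],
             [0.00, 0.00, 0.06]]
rms, axes, b_eq = ellipsoid_from_u(u_example)
print("RMS amplitudes (Å):", np.round(rms, 3))
print("Equivalent B-factor (Å^2):", round(b_eq, 2))
```

The longest principal axis gives the direction of largest motion, and a large equivalent B-factor corresponds to the "large ellipsoid → flexible region" reading above.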

🧱 How Do We Actually Build the Model?

🦴 Step 1 — Build a Skeleton (Historical Method)

Skeleton = simplified representation of continuous density
Helps trace C-alpha backbone path
First used in the 1970s

Goal: ➡️ Identify how the polypeptide chain winds through the density

🪄 Step 2 — Pattern Builder (Baton Method)

A baton tool is manually placed in the density:

Connect Cα → next Cα → next Cα
Creates a C-alpha trace
Quickly reveals:
- α-helices
- β-strands
- turns

Once the backbone is traced → side chains are added.
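
A Cα trace can be sanity-checked with simple geometry: consecutive Cα atoms joined by a trans peptide bond sit about 3.8 Å apart. The sketch below flags steps that deviate from that distance. The function name, the tolerance and the example coordinates are hypothetical. Note that rare cis peptides (about 2.9 Å) would also be flagged.

```python
import math

CA_CA_TRANS = 3.8  # Å, typical distance between consecutive Cα atoms (trans peptide)

def check_ca_trace(ca_coords, tolerance=0.3):
    """Return (index, distance) for every step i -> i+1 that deviates from 3.8 Å."""
    suspicious = []
    for i in range(len(ca_coords) - 1):
        d = math.dist(ca_coords[i], ca_coords[i + 1])
        if abs(d - CA_CA_TRANS) > tolerance:
            suspicious.append((i, round(d, 2)))
    return suspicious

# Hypothetical trace: the last step is too long to be a single residue
trace = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0), (7.6, 5.5, 0.0)]
print(check_ca_trace(trace))  # [(2, 5.5)]
```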

🧩 Recognizing Secondary Structure from Density

When the backbone trace is known:

You can identify:

🌀 α-helix
📄 β-sheet (parallel / antiparallel)
🔁 turns

These structural motifs have distinct geometric patterns.
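
One such geometric pattern is the distance between Cα(i) and Cα(i+3): about 5 Å in an α-helix and about 10 Å in an extended β-strand. The sketch below builds idealized traces and classifies them by that distance. The helix parameters (radius 2.3 Å, rise 1.5 Å, 100° per residue) are standard approximate values; the zigzag strand model and the cut-offs are simplifying assumptions for illustration, not a validated assignment method.

```python
import math

def ideal_helix(n, radius=2.3, rise=1.5, turn_deg=100.0):
    """Cα positions of an idealized α-helix running along z."""
    coords = []
    for i in range(n):
        angle = math.radians(i * turn_deg)
        coords.append((radius * math.cos(angle), radius * math.sin(angle), i * rise))
    return coords

def ideal_strand(n, rise=3.3, offset=0.9):
    """Cα positions of a simplified zigzag β-strand running along z."""
    return [(offset if i % 2 else -offset, 0.0, i * rise) for i in range(n)]

def classify_by_i_i3(ca_coords, helix_max=6.5, strand_min=8.5):
    """Label each window i..i+3 from its Cα(i)-Cα(i+3) distance (illustrative cut-offs)."""
    labels = []
    for i in range(len(ca_coords) - 3):
        d = math.dist(ca_coords[i], ca_coords[i + 3])
        if d <= helix_max:
            labels.append("helix")
        elif d >= strand_min:
            labels.append("strand")
        else:
            labels.append("other")
    return labels

print(classify_by_i_i3(ideal_helix(8)))
print(classify_by_i_i3(ideal_strand(8)))
```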

⚗️ Side Chain Chemistry Matters

The final model must make:

Physical sense
Chemical sense
Biological sense

Interactions to consider:

Hydrogen bonds
Hydrophobic packing
Polar interactions

Incorrect chemistry = incorrect structure.

🔎 Recognizing Directionality in Density

Peptide bonds produce characteristic density features:

Carbonyl “bumps”
Planar peptide geometry
Side chains emerge from Cα

These features allow you to determine:

➡️ N-terminus → C-terminus direction of the chain.

🧬 Identifying Specific Amino Acids in Density

Some residues are especially diagnostic:

⭐ Very characteristic residues

Glycine → no side chain
Proline → side chain ring closes back onto the backbone nitrogen
Methionine → strong density from the sulfur atom
Aromatics → large rings (Phe, Tyr, Trp)

Scientists often:

Use primary sequence knowledge
Search for distinctive density patterns (example: two adjacent tryptophans)
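
Combining the sequence with distinctive density starts with knowing where the distinctive residues sit in the sequence. A minimal sketch, using a made-up sequence and a hypothetical helper name:

```python
def find_motif(sequence, motif):
    """Return 1-based start positions of every (possibly overlapping) occurrence of motif."""
    positions = []
    start = sequence.find(motif)
    while start != -1:
        positions.append(start + 1)  # convert 0-based index to residue number
        start = sequence.find(motif, start + 1)
    return positions

sequence = "MKWWLAGWWP"  # hypothetical sequence, one-letter codes
print(find_motif(sequence, "WW"))  # [3, 8]
```

Each reported position is a candidate anchor: if a pair of large ring-shaped densities is found next to each other in the map, it can be tested against these sequence positions.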

⚡ Using Heavy Atoms and Anomalous Scatterers

Helpful tricks:

Mercury binds cysteine → reveals cysteine positions
Selenium-methionine labeling → shows methionine sites
Sulfur anomalous maps → locate cysteines and methionines

These provide anchor points for building the model.

🔮 Secondary Structure Prediction Helps!

Before model building, researchers often:

Predict helices/sheets computationally
Compare prediction with observed Cα trace
Helps assign sequence register correctly

📊 Resolution — The Key Quality Indicator

Resolution determines how detailed the density is.

🟥 ~4 Å (Low resolution)

Mostly featureless
Only fold / chain path visible

🟧 ~3 Å

Some side chains visible

🟩 ~2 Å

Hydrogen bonding can be inferred from well-defined atom positions
Waters / ions visible
Good model quality

🟦 ~1 Å (Atomic resolution)

Individual atoms visible
“Full chemistry” interpretation possible
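
Why resolution limits detail can be illustrated with a toy one-dimensional Fourier synthesis: two atoms 1.5 Å apart (roughly a C–C bond) are rebuilt from Fourier terms cut off at a chosen resolution. This is a deliberately simplified sketch. It assumes point atoms with no thermal motion, perfect phases and a hypothetical 20 Å cell, so it is more optimistic than a real map.

```python
import numpy as np

def density_1d(x, atom_positions, cell_length, d_min):
    """1D Fourier synthesis of point atoms using only terms with spacing >= d_min."""
    x = np.atleast_1d(np.asarray(x, dtype=float))
    atoms = np.asarray(atom_positions, dtype=float)
    h_max = int(np.floor(cell_length / d_min))  # highest index inside the resolution limit
    h = np.arange(-h_max, h_max + 1)
    # Structure factors of unit point scatterers
    structure_factors = np.exp(2j * np.pi * np.outer(h, atoms) / cell_length).sum(axis=1)
    # Sum the waves back into real space
    waves = np.exp(-2j * np.pi * np.outer(h, x) / cell_length)
    return (structure_factors[:, None] * waves).sum(axis=0).real / cell_length

def atoms_resolved(separation, d_min, cell_length=20.0):
    """True if the density at an atom is higher than at the midpoint between the two atoms."""
    center = cell_length / 2.0
    atoms = [center - separation / 2.0, center + separation / 2.0]
    rho_atom, rho_mid = density_1d([atoms[0], center], atoms, cell_length, d_min)
    return bool(rho_atom > rho_mid)

for d_min in (1.0, 3.0):
    print(f"{d_min} Å data: atoms 1.5 Å apart resolved? {atoms_resolved(1.5, d_min)}")
```

With 1 Å data the two atoms appear as separate peaks. With 3 Å data they merge into a single blob, which matches the progression listed above.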

🔁 Rotamer Libraries — Fixing Side Chains

Side chains adopt preferred conformations.

Rotamer libraries:

Statistical database of allowed conformations
Weighted by observed frequency
Helps quickly fit density

Alternative:

Manually drag atoms into density
Then refine geometry computationally.
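
The idea of a frequency-weighted rotamer library can be sketched as "try the most common conformation first and keep the first one that fits". In the example below, the labels m, t and p stand for χ1 near −60°, 180° and +60°. The angles and frequencies are made-up round numbers, not values from a published library, and the single-angle fit test is a stand-in for a real density-fit score.

```python
# Illustrative chi1 rotamers for a generic side chain (made-up frequencies)
ROTAMERS = [
    {"name": "t", "chi1": 175.0, "frequency": 0.70},
    {"name": "m", "chi1": -60.0, "frequency": 0.25},
    {"name": "p", "chi1": 60.0, "frequency": 0.05},
]

def angle_difference(a, b):
    """Smallest absolute difference between two angles in degrees (0 to 180)."""
    return abs((a - b + 180.0) % 360.0 - 180.0)

def pick_rotamer(observed_chi1, rotamers=ROTAMERS, tolerance=30.0):
    """Return the most frequent rotamer within tolerance of the observed chi1, or None."""
    for rotamer in sorted(rotamers, key=lambda r: r["frequency"], reverse=True):
        if angle_difference(observed_chi1, rotamer["chi1"]) <= tolerance:
            return rotamer
    return None

print(pick_rotamer(-170.0)["name"])  # t
print(pick_rotamer(-70.0)["name"])   # m
print(pick_rotamer(120.0))           # None
```

A result of None corresponds to the alternative described above: no library conformation fits, so the side chain is placed manually and then refined.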

🧮 Difference Fourier Maps — Extremely Powerful Tools

These maps show what is missing or wrong in the model.

Fo − Fc map

Positive density → something missing
Negative density → something incorrectly modeled

Typical color convention:

Green → add atoms
Red → remove atoms

Applications:

Place water molecules
Identify mutations
Locate metal binding sites
Detect conformational changes
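
The sign convention of the Fo − Fc map can be reproduced in a toy one-dimensional example. The "true" structure has atoms at 2, 5 and 8 Å, while the model misses the atom at 5 Å and wrongly contains one at 12 Å. This sketch makes a strong simplifying assumption: the observed structure factors are treated as fully known complex numbers. A real Fo − Fc map combines measured amplitudes with phases calculated from the model, so real peaks are weaker and noisier. All names and coordinates are hypothetical.

```python
import numpy as np

def structure_factors(atom_positions, cell_length, h):
    """Structure factors of unit point atoms in a 1D cell."""
    atoms = np.asarray(atom_positions, dtype=float)
    return np.exp(2j * np.pi * np.outer(h, atoms) / cell_length).sum(axis=1)

def difference_density(x, true_atoms, model_atoms, cell_length=20.0, d_min=1.0):
    """Toy Fo - Fc synthesis evaluated at positions x (observed phases assumed known)."""
    x = np.atleast_1d(np.asarray(x, dtype=float))
    h_max = int(np.floor(cell_length / d_min))
    h = np.arange(-h_max, h_max + 1)
    f_obs = structure_factors(true_atoms, cell_length, h)    # stands in for the experiment
    f_calc = structure_factors(model_atoms, cell_length, h)  # calculated from the model
    waves = np.exp(-2j * np.pi * np.outer(h, x) / cell_length)
    return ((f_obs - f_calc)[:, None] * waves).sum(axis=0).real / cell_length

true_atoms = [2.0, 5.0, 8.0]
model_atoms = [2.0, 8.0, 12.0]  # atom at 5 Å missing, spurious atom at 12 Å
at_missing, at_extra, at_correct = difference_density([5.0, 12.0, 2.0], true_atoms, model_atoms)
print("missing atom site:", "positive (green)" if at_missing > 0 else "not positive")
print("spurious atom site:", "negative (red)" if at_extra < 0 else "not negative")
print("correctly modeled site is flat:", abs(at_correct) < 1e-6)
```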

❌ Common Model Building Errors (Very Important for Exams)

From worst → mildest:

🚨 Severe

Completely wrong fold
Backbone traced in wrong density
Secondary structures connected incorrectly

⚠️ Moderate

Wrong chain direction
Out-of-register sequence placement

🙂 Minor

Flipped peptide plane
Wrong side chain rotamer

Even PDB structures can contain errors → always be critical.

🤖 Automated Model Building (Modern Practice)

Examples:

RESOLVE (Phenix)
ARP/wARP

👍 Advantages

Fast
Objective
Can build 50–90% of the model

👎 Limitations

May fail in difficult regions
Hard to define molecular boundaries (model parts may not be connected across unit cell boundaries)
Trouble with ligands / nucleic acids / modifications

Manual finishing is still essential.

🧠 Big Picture — What This Lecture Wants You to Understand

Protein structure determination is not:

❌ “Software gives structure automatically”

It is:

✅ A scientific interpretation process

You must:

Understand density features
Know chemistry of residues
Use Fourier maps intelligently
Validate stereochemistry
Be skeptical of models

Quiz


Q0. What is the immediate experimental result obtained after Fourier synthesis of diffraction data?

An atomic coordinate file

A three-dimensional electron density map

A rotamer library

A Patterson vector list

Q1. Which model representation best highlights secondary structure elements like α-helices and β-sheets?

Surface representation

Cartoon (ribbon) representation

Ellipsoid representation

Difference Fourier map

Q2. What information is primarily conveyed by anisotropic atomic displacement ellipsoids?

Electrostatic potential

Hydrogen bond strength

Direction and magnitude of atomic motion

Residue hydrophobicity

Q3. What is the main purpose of building a skeleton during early model building?

To calculate structure factors

To predict ligand binding

To trace continuous density corresponding to the backbone

To determine rotamer frequencies

Q4. What does the baton method help determine quickly in an electron density map?

Hydrogen atom positions

Cα backbone trace and secondary structure

Water molecule occupancy

Crystal symmetry operations

Q5. Which amino acid is easiest to recognize in electron density because it lacks a side chain?

Proline

Methionine

Glycine

Valine

Q6. Why are aromatic residues particularly useful during model building?

They always bind metals

They produce large and distinctive electron density features

They are always surface exposed

They have no rotamers

Q7. What is the primary interpretation of positive peaks in an Fo−Fc difference map?

Atoms modeled too strongly

Missing features that need to be added

Incorrect symmetry

Overfitting during refinement

Q8. At approximately which resolution do individual atoms begin to appear as separate spheres in density?

4 Å

3 Å

2 Å

1 Å

Q9. What is the main benefit of using rotamer libraries in model building?

They calculate phases

They provide statistically preferred side-chain conformations

They improve crystal growth

They detect anomalous scatterers

Q10. Which map is especially useful for locating anomalous scatterers such as selenium or rubidium?

Isomorphous Patterson map

Anomalous difference Fourier map

Composite omit map

Sigma-A map

Q11. What structural feature helps determine N→C directionality in density maps?

Crystal packing contacts

Carbonyl density bumps in peptide bonds

Metal ion coordination

Unit cell dimensions

Q12. Which type of error involves assigning residues shifted relative to the true sequence position?

Rotamer error

Peptide flip

Out-of-register error

Domain swap

Q13. What major limitation can automated model building programs have?

They cannot interpret density at all

They always produce wrong stereochemistry

They may fail to connect model parts across unit cell boundaries

They only work at atomic resolution

Q14. Why is secondary structure prediction useful before model building?

It replaces refinement

It predicts crystal symmetry

It helps align predicted helices/sheets with backbone density

It determines B-factors

Q15. Electron density maps represent the distribution of electrons in the crystal unit cell.

True

False

Q16. Surface representations are best for visualizing atomic flexibility via B-factors.

True

False

Q17. An ensemble of overlaid structures can indicate flexible regions of a protein.

True

False

Q18. At low resolution (~4 Å), individual side chains are usually clearly visible.

True

False

Q19. Methionine residues can be located using anomalous maps if selenium substitution is used.

True

False

Q20. Negative peaks in difference maps suggest atoms were modeled where density is absent.

True

False

Q21. Hydrophobic residues are typically found on protein surfaces exposed to solvent.

True

False

Q22. Peptide plane flipping is considered a mild model building error compared to wrong fold tracing.

True

False

Q23. Automated model building can often construct 50–90% of a protein model.

True

False

Q24. Difference maps can be used to locate bound ions or ligands.

True

False

Q25. Rotamer libraries describe all theoretically possible conformations equally.

True

False

Q26. Secondary structure elements are easier to recognize once the Cα trace is determined.

True

False

Q27. Replacing potassium with rubidium can help identify ion binding sites through anomalous signal.

True

False

Q28. Even structures deposited in databases may contain modeling errors.

True

False

Q29. At atomic (~1 Å) resolution, hydrogen bonding networks and water molecules become invisible.

True

False