Lecture 4 PPT

Protein structure

ChatGPT conversation

🧲 PART I – Introduction

Slide 1 – Title

Protein NMR Spectroscopy II Reinhard Wimmer – Aalborg University Focus: how NMR data becomes 3D structure.

Slide 2 – Protein Structure Determination

Key themes:

What NMR data is structurally relevant?
NMR vs X-ray crystallography

The Escher artwork symbolizes structural ambiguity — multiple possible interpretations depending on perspective.

Slide 3 – NMR Investigation Workflow

Protein NMR structure determination follows this pipeline:

Sample preparation
Optimization
Resonance assignment (huge NMR time investment)
Collect NOEs, couplings, etc.
Structure calculation (huge computer time)
Structure known → study:
- Function
- Dynamics
- Mechanism

Important message: 📌 Data collection is long. Computation is long. Interpretation is iterative.

Slide 4 – What Structural Information Can NMR Give?

Three fundamental types:

1️⃣ Distances

From NOEs
From PREs
From H-bonds

→ Define secondary, tertiary, quaternary structure

2️⃣ Dihedral angles

Mainly backbone (φ, ψ)
From scalar couplings and chemical shifts

→ Define local conformation

3️⃣ Relative orientations

From residual dipolar couplings (RDCs)

→ Define global fold and domain orientation

Important distinction:

Distances = local + global
Angles = mostly local
Orientations = mostly global

Slides 5–7 – The Puzzle Analogy 🧩

Solving NMR structures is like:

A puzzle
With missing pieces
And extra pieces

Meaning:

Some restraints are ambiguous.
Some regions lack data.
Some peaks are overlapped.
There are multiple possible conformations.

Structure determination = constraint satisfaction under uncertainty.

Slides 8–10 – NMR vs X-ray Crystallography

Advantages of NMR:

No crystal required
Solution conditions (closer to physiological)
Easy to change buffer, add ligands
Study dynamics
Study folding/unfolding

Disadvantages:

Size limitation (~ <20–30 kDa typically)
Time consuming
Requires isotope labeling (¹⁵N, ¹³C)
Expensive instrumentation

Size distribution slide:

NMR → small proteins
X-ray → broader size range
EM → very large complexes

Clear methodological niches:

NMR = small + dynamic
X-ray = high resolution static
EM = large complexes

Slides 11–12 – Workflow Comparison

NMR:

Expression
Isotope labeling
Sample optimization
Data collection
Structure calculation

X-ray:

Expression
Crystallization
Heavy atom derivative
Data collection
Phasing
Refinement

X-ray easier once crystal obtained. NMR allows more functional studies.

Slide 13 – When Should You Choose NMR?

Use NMR when:

No crystal available
Interested in dynamics
Studying ligand binding
Studying mechanism
Studying pKa
Folding pathways

Slide 14 – Take-home Messages

There are three main structural data types:

Distances
Dihedral angles
Orientations

Distances are often the most important.

🧪 PART II – Distances as Structural Information

Slides 15–16 – Distances Overview

Same classification repeated:

NOEs
H-bonds
PREs

Distances define structure extremely well.

Slide 17 – Distances from NOESY

Each cross peak = one distance constraint.

But problem: 👉 WHO IS WHO?

This requires:

Full resonance assignment

Without assignment, NOE peaks are meaningless.

Slide 18 – Why Distances Are Useful

Because structure = spatial arrangement.

If you know enough pairwise distances → geometry is constrained.

Slides 19–20 – Distance Network

Example:

68 amino acids
993 NOE distances
- 201 intraresidual
- 277 sequential
- 218 medium range
- 297 long range

Long-range NOEs are most important for defining tertiary structure.

Slide 21 – Types of NOESY Experiments

2D NOESY (all protons)
3D ¹⁵N NOESY
3D ¹³C aliphatic
3D ¹³C aromatic
4D NOESY

Higher dimensions = better resolution.

Slide 22 – Automated NOE Assignment

Programs:

CANDID
ATNOS
FLYA

They:

Take peak list
Take assignments
Assign NOEs + calculate structure simultaneously

Requires near-complete resonance assignment.

Slide 23 – Distances from NOEs 📏

Key equation:

V = rac{k}{r^6}

Thus:

r = left(rac{k}{V} ight)^{1/6}

But practically:

Motion
Spin diffusion
Overlap

So instead of exact distance: 👉 Use upper distance limits

Empirically often behaves like:

V le rac{k}{r^4}

Important: NOEs are converted to distance restraints, not exact values.

Slide 24 – NOEs from Secondary Structure

Characteristic patterns:

α-helix:

i → i+3
i → i+4
HN–HN
HN–Hα

β-sheet:

Inter-strand NOEs
HN–HN
Hα–Hα across strands

These patterns help identify secondary structure.

Slide 25 – Sequence Plot

Combines:

NOE patterns
Chemical shifts
Couplings

To map:

α-helices
β-strands

Slide 26 – Hydrogen Bonds

Sometimes detectable via:

Scalar couplings across H-bonds

But: ⚠️ H-bonds should NOT be added unless strong evidence exists.

Slides 27–29 – PREs (Paramagnetic Relaxation Enhancement) 🧲

Insert spin label (unpaired electron).

Effect:

Increases relaxation rate (R2)
Distance dependent (~1/r⁶)

Use:

Engineer single Cys mutant
Attach spin label
Measure signal attenuation
Compare oxidized vs reduced label

PREs give long-range distance information (up to ~25 Å).

Very powerful for domain orientation.

Slide 30 – Distance Take-Home

Three sources:

NOEs
PREs
H-bonds

NOEs:

Main information source
Converted to upper distance limits
Long-range NOEs define tertiary structure

📐 PART III – Dihedral Angles & Orientation

Slide 31 – Introduction

Angles and orientation add additional constraints.

Slide 32 – Chemical Shifts & Secondary Structure

Cα and Cβ shifts differ in:

α-helix
β-sheet

Secondary chemical shift: Observed − random coil value

Patterns:

Helix → positive Cα shift
Sheet → negative Cα shift

Slide 33 – TALOS

Programs:

TALOS
TALOS+
TALOS-N

Use chemical shifts to predict:

Machine learning + database comparison.

Provides torsion angle restraints.

Slides 34–35 – J-Coupling (Scalar Coupling)

Electron-mediated, through-bond interaction.

Karplus relationship:

J = Acos^2( heta) + Bcos( heta) + C

J depends on dihedral angle.

Common example: ³J(HN–Hα)

Large J (~8 Hz) → β-sheet Small J (~3–4 Hz) → α-helix

Thus J-couplings provide dihedral angle constraints.

Slides 36–38 – Measuring Orientations in Anisotropic Media

Normally:

Molecules tumble isotropically
Dipolar couplings average to zero

If weakly aligned:

Residual dipolar couplings (RDCs) remain

How to align?

Bicelles
Bacteriophages
Liquid crystals
Polyacrylamide gels

RDCs give:

Orientation of bond vectors (N-H, C-H)
Relative domain orientation

Extremely valuable for multi-domain proteins.

🧮 PART IV – Structure Calculation & Validation

Slide 41 – Structure Calculation Software

Programs:

CYANA
CNS
X-PLOR

Input:

Distance constraints
Angle constraints
RDCs
All other restraints

Procedure: Iterative:

Calculate
Evaluate
Adjust
Recalculate

Slide 42 – Target Function

Structure calculation minimizes deviation between:

Measured restraints
Calculated geometry

Many conformers generated.

Select best-fitting ensemble.

Slide 43 – Evaluation Metrics

1️⃣ Target function (restraint violations)

2️⃣ RMSD (precision of ensemble)

3️⃣ Ramachandran statistics

Slides 45–46 – RMSD

RMSD = root mean square deviation between structures.

Lower RMSD → tighter ensemble → higher precision.

But: Depends on:

Superposition region
Structured vs flexible regions

You can artificially lower RMSD by fitting only secondary structure.

Slide 47 – Method Dependence

NMR, X-ray, modeling can give different structures.

Solution vs crystal packing effects.

Slides 48–49 – Validation

Check:

Ramachandran plot
Side chain rotamers
Bond geometry
van der Waals contacts
H-bond geometry

Use:

PROCHECK
ProQ
WhatIf
ProseSS

Use independent data (RDCs, unusual shifts) for validation.

Slide 50 – Ramachandran Plot

Allowed φ/ψ regions:

α-helix
β-sheet
Left-handed helix

Low percentage in disallowed region = good structure.

Slide 51 – Example Structure Statistics

Example publication statistics:

1576 NOEs
20 conformers
Target function ~4.6
Max violation 0.14 Å
72.8% favored Ramachandran
RMSD backbone ~1.19 Å
RMSD secondary structure ~0.67 Å

Important: RMSD of structured core is more meaningful than full-length RMSD.

🎯 Overall Big Picture

Protein NMR structure determination integrates:

Distances (NOEs, PREs)

→ Define fold

Angles (J-coupling, chemical shifts)

→ Define local conformation

Orientations (RDCs)

→ Define global orientation

Then:

Structure calculation → Minimize violations → Select best ensemble → Validate geometry → Report RMSD + statistics

NMR structures are ensembles, not single static models.

Quiz

Score: 0/30 (0%)

Q0. Which three main types of structural information can protein NMR provide according to the presentation?

Distances, dihedral angles, and relative orientations

Chemical shifts, peak intensities, and linewidths

Spin states, relaxation rates, and isotopes

pKa values, folding rates, and catalytic constants

Q1. In NMR structure determination, which type of NOE is most critical for defining tertiary structure?

Intraresidual NOEs

Sequential NOEs (i to i+1)

Medium-range NOEs (2≤Δres≤4)

Long-range NOEs (Δres>4)

Q2. The theoretical relationship between NOESY cross-peak volume (V) and interproton distance (r) is best described by:

V ∝ r^6

V ∝ 1/r^6

V ∝ 1/r^2

V ∝ r

Q3. In practice, NOE-derived distances are usually treated as:

Exact distances

Lower distance limits

Upper distance limits

Orientation constraints only

Q4. Which experiment helps resolve NOE overlap by adding heteronuclear resolution?

1D proton spectrum

2D COSY

3D 15N-edited NOESY

1D 13C spectrum

Q5. Paramagnetic relaxation enhancement (PRE) distance information arises from:

Through-bond scalar coupling

Electron-mediated dipolar interactions from an unpaired electron

Hydrogen bond detection

Chemical shift anisotropy only

Q6. Which structural feature can be predicted from chemical shifts using programs like TALOS?

Exact atomic coordinates

Backbone dihedral angles (φ/ψ)

Side chain methyl distances

Hydrogen bond lengths

Q7. J-couplings depend primarily on:

Interatomic distance only

Magnetic field strength only

Dihedral angle (Karplus relationship)

Molecular weight

Q8. Residual dipolar couplings (RDCs) require:

Completely isotropic tumbling

Full crystallization

Weak molecular alignment in anisotropic media

Spin labeling with radicals only

Q9. Which of the following is NOT typically used to weakly align proteins for RDC measurements?

Lipid bicelles

Bacteriophages

Polyacrylamide gels

High salt buffer only

Q10. Structure calculation programs mentioned include:

CYANA, CNS, X-PLOR

BLAST, ClustalW, MUSCLE

PyMOL, Chimera, VMD

Gaussian, ORCA, NAMD

Q11. The target function in NMR structure calculation represents:

The protein’s biological activity

The deviation between experimental restraints and calculated structure

The molecular weight

The number of NOEs

Q12. RMSD in an NMR ensemble primarily reflects:

Accuracy compared to crystal structure

Precision within the ensemble

Protein stability

Ligand binding affinity

Q13. Which validation metric evaluates backbone φ/ψ angle distribution?

Karplus plot

Ramachandran plot

Stern-Volmer plot

Van't Hoff plot

Q14. Compared to X-ray crystallography, NMR is especially advantageous for:

Very large macromolecular complexes

Studying molecular dynamics in solution

Achieving sub-angstrom resolution routinely

Avoiding isotope labeling

Q15. Distances derived from NOEs can define both local and global structure.

True

False

Q16. Long-range NOEs connect residues that are close in sequence.

True

False

Q17. Hydrogen bonds should be added as distance restraints even without experimental evidence.

True

False

Q18. PRE effects decay approximately with a 1/r^6 dependence.

True

False

Q19. Chemical shifts can provide information about secondary structure.

True

False

Q20. J-couplings are mediated through space rather than through bonds.

True

False

Q21. Residual dipolar couplings give information about bond vector orientation relative to an alignment tensor.

True

False

Q22. A lower RMSD always guarantees that the structure is correct.

True

False

Q23. Structure calculation is typically an iterative procedure.

True

False

Q24. NOESY cross peaks are meaningless without resonance assignment.

True

False

Q25. NMR structures are typically represented as single static conformations.

True

False

Q26. Anisotropic media are required to observe RDCs.

True

False

Q27. The Karplus equation relates J-coupling values to dihedral angles.

True

False

Q28. X-ray crystallography generally has no size limitations compared to NMR.

True

False

Q29. Ramachandran plot statistics are used to validate protein structures.

True

False