Protein structure determination by NMR is essentially a constrained optimization problem: We want to find 3D structures that satisfy all experimental restraints while remaining chemically realistic.
It happens in two major phases:
Let’s break it down.
The goal is to create a bundle of structures that all satisfy the experimental restraints (NOEs, dihedral angles, RDCs, etc.).
Historically, different computational strategies were developed:
Instead of working in Cartesian coordinates, this method works in distance space.
Programs: early implementations before more modern methods.
Key idea: Bond lengths and bond angles barely fluctuate → fix them.
Only allow:
Result:
Programs:
This became the dominant approach.
Why?
Simple minimization gets stuck in local minima.
Simulated annealing:
Modern example: CYANA
From page 3 (Figure 4.9 in your file):
Covalent structure:
Instead of full Lennard–Jones potential:
Example values (Table 4.3):
Disulfide bonds enforced manually via restraints.
The target function acts like the potential energy.
It penalizes:
If all restraints satisfied → V = 0
Distance penalty usually quadratic:
f(d,b) = (d-b)^2
Only applied when violated.
From page 4 in your file:
The cooling process:
Calculation repeated 50–150 times.
Typically:
Rule of thumb: Final bundle ≈ 25% of generated conformers.
If fold is well-defined:
Major bottleneck: manual NOESY assignment 😵
Problem:
Solution: integrate assignment into structure calculation.
This mimics how experts do it manually — but automated.
Solution:
Automation is improving, especially with RDC integration.
Structure generation uses simplified force fields.
Result:
Refinement step:
Huge improvement since mid-1990s:
From page 6:
Why heat?
Restraints active during entire simulation:
Programs:
If refinement introduces new violations: → likely incorrect assignments → must re-examine restraints
A structure is a model, not truth.
Validation asks:
Commonly reported:
🚨 Consistent violations (>75% conformers) are serious.
Measures overall fit to distance restraints.
Lower RMS → better agreement.
Inspired by crystallography.
Compare:
Full relaxation matrix used.
Modified R-factor reduces bias from short-distance interactions.
Program: RFAC.
Instead of intensity:
Ask:
RPF gives:
Very powerful validation metric.
Rdip factor:
Cross-validation (Rdip free) possible but computationally heavy.
Because NMR restraints are sparse, geometry depends strongly on force field.
Validation uses reference databases.
Z = rac{x - ext{mean}}{ ext{std}}
Interpretation:
Usually tightly constrained.
Most important backbone validation tool.
Categories:
High-quality structures: → Almost all residues in favored regions.
Programs:
Side chains prefer:
Compare to PDB statistics.
Programs:
Nonbonded atoms too close.
Should not occur.
Detected by:
Important but hard in NMR:
Validation:
Find conformations minimizing target function.
Escapes local minima.
Explicit solvent is critical.
Must check: