(From Lecture 4 – Video 4)
This lecture walks through the heart of NMR structure determination:
This is where experimental data becomes a 3D protein model.
Several dedicated programs exist:
They use different algorithms but follow the same core logic.
The primary input is:
Each cross peak must be:
Modern programs can sometimes:
NOE intensity → converted to distance constraint.
The software:
Example:
NOE intensity corresponds to max distance of 3 Å → The two hydrogens must be ≤ 3 Å apart.
Besides distances, you may include:
All constraints are fed into the algorithm.
A protein has enormous conformational freedom.
Each residue:
For 100 residues: → 200 backbone degrees of freedom → Plus side chains
The algorithm performs:
Starting from a random conformation, the structure is adjusted to:
Minimize deviation between structure and experimental constraints
This is a fitting problem in a massive multidimensional energy landscape.
You might find a local minimum, not the global one.
Solution:
👉 Start from many random structures 👉 Minimize each independently
Typically:
This gives you an ensemble.
You never report one structure.
Instead:
You report the best 20 (or 30) structures.
Why?
Because:
The ensemble shows:
After calculation, you evaluate:
A violation = structure does not satisfy constraint.
Example:
Possible reasons:
Violations must be investigated.
Sum of violation penalties = target function.
But beware:
Low target function ≠ correct structure It could also mean:
Measures:
How well the ensemble structures overlap
Low RMSD:
High RMSD:
Early stage:
After refinement:
But loops & termini often remain flexible.
RMSD depends on which atoms you superimpose.
Same 20 structures:
Same data — different numbers.
Conclusion:
RMSD can be manipulated by choice of alignment region.
Interpret carefully.
Calmodulin has:
When superimposing:
Meaning:
✔ Each lobe is well defined ✖ Their relative orientation is not
Conclusion:
The lobes are flexible relative to each other in solution.
This is structural information.
Two typical visualizations:
Shows all 20 conformers overlaid. → Reveals precision & flexibility.
Shows secondary structure arrangement. → Easier to interpret → Less scientifically informative than ensemble
Different methods give different results:
NMR uniquely captures flexibility.
Structural validation is challenging.
Check against known protein geometry:
⚠️ But much of this was already used in refinement.
Shows distribution of ϕ/ψ angles.
Categories:
You should have:
Typical table contains:
If you have data not used in structure calculation:
These are strong validation tools.
For NMR structures:
They identify structural inconsistencies.
You repeatedly refine and re-check.
Single models are misleading.
Depends on alignment region.
They often indicate input errors.
NMR captures solution dynamics.
Internal checks can be circular.
NMR structure determination is not:
“The software gives me the structure.”
It is:
An iterative fitting process in a massive conformational space, constrained by experimental data, evaluated statistically, and interpreted biologically.
The ensemble is not a weakness — it is the strength of NMR.
It shows what the protein really looks like in solution:
That concludes the structure calculation and validation process from Lecture 4 Video 4.