Lecture 3 Book Chapter 3.1.1

Protein structure

Chapter 3.1.1 — Why NMR Structures? 🧲🧬 (detailed summary)

Big picture: what makes NMR special?

NMR is valuable in structural biology because it gives both structure and dynamics in conditions that often resemble a protein’s physiological (solution) environment. Unlike a single “best” structure, an NMR structure is typically an ensemble of conformers that all satisfy the experimental restraints—this naturally aligns with the fact that proteins can be dynamic rather than rigid.

A quick historical anchor (why the method mattered early on)

The first NMR solution structure of a biological macromolecule was solved in 1984 (Wüthrich and coworkers) for a 57–amino acid proteinase inhibitor IIA from bull seminal plasma.
NMR’s value was strongly reinforced when the solution NMR structure of rat metallothionein-2 showed that an available crystal structure needed revision—a key early example of NMR revealing that “crystal ≠ solution reality” in some cases.

Where NMR sits among the “big three” structure methods 🧩

The chapter frames structural biology around three main experimental approaches (highlighted in the pie chart on page 2, Fig. 3.1.1):

X-ray crystallography
Solution NMR
Cryo-electron microscopy (cryo-EM)

Key takeaways:

X-ray and NMR are the two main atomic-level methods in routine use for biomolecules (in crystals vs solution).
Cryo-EM is increasingly important for very large assemblies (ribosome, microtubules, viruses), and methodological advances have pushed its “routine” applicability down to around ~200 kDa.
Cryo-EM often has lower resolution than X-ray, but may capture more biologically relevant conformations because it avoids crystal lattice constraints.
Methods can be combined: atomic structures (X-ray or NMR) of components can be docked into cryo-EM maps to build pseudo-atomic models and identify contact interfaces.

Figure 3.1.2 (page 2) gives a key “size preference” message via the bar chart:

NMR is preferentially used for low molecular weight systems
Cryo-EM is preferentially used for very large assemblies

Practical/technical differences: what each method “needs” and what it “sees”

Hydrogens:

NMR restraints (like distance restraints) often explicitly involve hydrogen positions, so H atoms are typically included in NMR PDB coordinate files.
X-ray structures usually don’t include hydrogens in coordinate files because diffraction is dominated by heavier atoms and hydrogens are not directly seen in typical data.

Sample requirements (surprisingly similar): For both NMR and X-ray you generally need protein samples that are:

homogeneous
stable for days/weeks
soluble enough
not irreversibly aggregated at high concentration

Even if you meet these, you still can’t be sure you’ll get:

crystals that diffract well, or
NMR spectra good enough for structure determination

NMR as a screening tool for crystallography 🧪➡️🧊

Because broad requirements overlap, NMR has been used to screen protein samples to identify ones more likely to crystallize well.

1D ¹H NMR screening idea

The chapter notes that ¹H 1D spectra can categorize proteins into groups that:

crystallize with similar success rates,
but differ in the quality of diffraction the crystals provide.

Better diffraction tends to come from proteins whose ¹H spectra show:

higher chemical shift dispersion (suggesting a defined tertiary structure),
narrower linewidths (arguing against conformational exchange/equilibria).

HSQC as the workhorse screen (and why)

A central concept here is the ¹H–¹⁵N HSQC spectrum:

It gives a 2D map where (ideally) you see one peak per backbone amide (except prolines).
¹⁵N labeling is relatively affordable, making HSQC-based screening feasible.
HSQC can guide rational construct design (domain boundaries, trimming flexible ends/loops) before crystallization trials, improving chances of crystallization success.

Key point: NMR spectral quality ≠ guaranteed crystallography success ⚠️

The chapter is explicit that correlation exists, but it is not stringent.

Two example findings:

In one study (159 proteins), only about ~20% of high-resolution X-ray structures came from samples with excellent/good HSQC spectra.
Surprisingly, ~33% of proteins with lower-quality but still “promising” HSQC spectra could still yield successful X-ray structures.

Another study (263 proteins) found:

only 21/263 (~8%) were amenable to both NMR and X-ray based on HSQC + optimized crystallization trials, but using both methods in parallel increased the total “solvable” targets to 107/263 (~41%), split roughly as:
43 NMR-only
43 X-ray-only (and the remainder being the overlap/other outcomes described in the text).

Interpretation (as given in the text): strong complementarity. Some proteins are simply better suited to one method than the other even if they look “good” by shared criteria (solubility/monodispersion).

Why would a protein be “NMR-friendly” but “crystal-hostile”?

A major reason proposed: highly flexible or unstructured regions.

These regions may not wreck NMR spectra as badly,
but can be extremely adverse to crystallization (crystals often require a consistent, well-packed arrangement).

Construct redesign can sometimes help:

If unstructured parts are at termini, you may remove them.
If they’re internal (e.g., a long loop between secondary structure elements), redesign may be difficult or impossible.

How to “read” HSQC spectra as a folding/quality report 📈

Figure 3.1.3 (page 4) is central here: it shows three HSQC patterns that correspond to different folding states.

A simple quantitative metric

You can evaluate HSQC quality by:

counting the number of well-detectable (intense, well-resolved) backbone amide peaks,
comparing that count to the number of non-Pro residues in the sequence.

This gives an estimate of how many residues are in folded regions.

What different features mean

Weak minor peaks and/or broad peaks collapsed near the center indicate regions lacking a defined 3D structure.
More peaks than expected can indicate multiple conformations that interconvert slowly on the chemical shift timescale (so you see separate sets rather than one averaged peak).

The three HSQC “fingerprints” (Fig. 3.1.3)

(a) Fully folded protein: sharp, intense peaks; wide chemical shift dispersion; number of backbone amide peaks matches expectation.
(b) Partially unfolded: reduced dispersion (especially in ¹H); intense peaks become broad and cluster around ~8.5 ppm (random-coil-like); peaks outside are weak.
(c) Unfolded: almost all backbone amide resonances collapse into a narrow band around ~8.3 ppm; even side-chain NH₂ pairs (Gln/Asn) lose resolution as unfolding increases.

No “scientific” reason to always prefer X-ray or NMR 🧠

The authors argue there is no a priori scientific reason to prefer one technique for a protein target. The choice is often practical/feasibility-based, and the methods are complementary.

Practical considerations mentioned:

NMR structure determination has historically been less automated than X-ray pipelines (though automation exists and is discussed elsewhere in the book).
NMR generally requires isotopic labeling, and the needed labeling strategy depends on protein size.

A big conceptual advantage: solution relevance and conformational selection 🌊🧬

A distinct advantage of NMR emphasized here:

X-ray structures can capture a conformation that is not the dominant one in solution, which is often closer to physiological conditions for soluble proteins.

Example given:

HPr (histidine-containing phosphocarrier protein) from Enterococcus faecalis: an X-ray structure showed a strained conformation of an important active-loop residue (proposed to be catalytically important), but NMR later indicated this conformation is at most a minority in solution.

Proposed explanation:

If multiple states exist in solution, crystallization may select one—the most “crystal-prone” conformation (a form of conformational selection imposed by crystallization conditions).

Second illustrative case:

Matrix metalloproteinase-12: different crystalline environments “freeze” different conformations of flexible regions, while NMR in solution reveals conformational equilibria across timescales.

Core limitation of solution NMR: size 🐘 vs 🧲

The chapter reiterates a key limitation:

De novo solution NMR structure determination is largely confined to proteins below ~25 kDa.

Why (physical + practical):

Slower tumbling for larger proteins increases linewidths, potentially making signals too broad to detect.
More residues create more peaks → spectral overlap increases → assignments become difficult.

This is described as an active frontier: pushing the size limit is a major goal in NMR methodology.

Consequences for method choice:

X-ray is typically the technique of choice for large proteins/complexes at atomic resolution.
Even larger or non-crystallizable systems can be studied by cryo-EM.
NMR/X-ray can be combined with cryo-EM by supplying atomic structures for components to interpret the lower-resolution overall map.

Solid-state NMR: a way around the “tumbling” size limit 🧱🧲

A key point near the end of 3.1.1:

The size limit due to slow tumbling does not apply to solid-state NMR, because molecules are not freely rotating in solution—they’re effectively fixed in orientation.

The chapter notes major recent efforts to develop solid-state NMR methods for various sample types:

soluble proteins in microcrystalline form,
protein fibrils,
membrane proteins in lipid bilayers, etc.

Important nuance:

In favorable cases, membrane proteins can still be studied by solution NMR if reconstituted in detergent micelles.

Why microcrystals matter:

Microcrystals can be easier to produce than the high-quality single crystals needed for X-ray diffraction, motivating solid-state NMR as an alternative route for larger proteins.

But also:

Solid-state NMR is still in heavy methodological development, without fully standardized protocols yet (as described here).

Where solid-state NMR has been especially successful:

Protein fibrils: solid-state NMR can uniquely provide detailed atomic-level packing information.
Membrane proteins: a long-standing target for solid-state NMR (decades), and often difficult to crystallize.

Historical and modern examples mentioned:

Early successes: bacteriorhodopsin and gramicidin as cases where solid-state NMR enabled structural study when other methods struggled.
Functional conformational cycling: retinal proteins (e.g., rhodopsin) undergoing light-activated state changes in both protein and chromophore.
M2 proton channel: studied for structure, function, and ligand binding; solid-state NMR has uniquely enabled investigation of ligand binding (drugs/toxins) to functional membrane proteins.

Tiny “stick it in memory” recap 🧠✨

NMR shines because it links structure + dynamics in solution (often more physiological).
NMR vs X-ray is often about complementarity, not superiority.
HSQC spectra are a powerful way to judge folding state, conformational heterogeneity, and sometimes crystallization prospects—but correlation is imperfect.
Solution NMR size limit (~25 kDa) comes from tumbling/linewidth and overlap/assignment issues; solid-state NMR can bypass tumbling limits and is key for fibrils/membranes.

Quiz

Score: 0/30 (0%)

Q0. What was a major conceptual innovation introduced by the first NMR solution structures of proteins?

They demonstrated that proteins are completely rigid in solution

They provided a single definitive structure identical to crystal structures

They introduced the idea of structural ensembles reflecting protein dynamics

They eliminated the need for X-ray crystallography

Q1. Why was the NMR structure of rat metallothionein-2 historically important?

It was the first protein structure ever solved

It showed that X-ray crystallography is unreliable

It revealed that an existing crystal structure needed revision

It demonstrated that NMR can only be used for very small peptides

Q2. Which molecular weight range is most suitable for de novo solution NMR structure determination?

Below ~25 kDa

25–100 kDa

Above 200 kDa

Any size, provided isotopic labeling is used

Q3. Why are hydrogen atoms typically included in NMR-derived PDB structures but not in X-ray structures?

NMR requires hydrogen bonding networks for refinement

X-ray diffraction cannot detect hydrogen atoms at all

NMR restraints often explicitly involve hydrogen positions

Hydrogens destabilize crystal lattices

Q4. What key information does a 1H–15N HSQC spectrum provide?

Side-chain rotamer populations only

One peak per backbone amide (except proline)

Direct interatomic distances

Complete atomic coordinates

Q5. Which HSQC feature is most indicative of a well-folded protein?

Peaks clustered tightly around 8.3 ppm

Broad peaks with low intensity

Wide chemical shift dispersion and sharp peaks

More peaks than non-proline residues

Q6. What does an excess number of HSQC peaks compared to the number of non-proline residues suggest?

Experimental noise

Fast conformational averaging

Slow interconversion between multiple conformations

Incorrect isotope labeling

Q7. Why can flexible or disordered regions hinder crystallization but still allow NMR analysis?

They increase protein solubility

They reduce the need for isotope labeling

They prevent formation of a regular crystal lattice

They enhance diffraction quality

Q8. Which statement best describes the complementarity of NMR and X-ray crystallography?

NMR always provides more accurate structures

X-ray is preferred for all soluble proteins

Each method can succeed where the other fails

Both methods work on exactly the same targets

Q9. Why might an X-ray structure capture a conformation that is rare in solution?

X-ray radiation damages proteins

Crystallization can select a specific conformational state

Proteins refold during diffraction

Hydrogens are missing from X-ray models

Q10. What is a major advantage of cryo-EM compared to X-ray crystallography?

Higher atomic resolution for small proteins

No need for large protein complexes

Avoidance of crystal lattice constraints

Requirement for isotopic labeling

Q11. Why does increasing protein size broaden NMR signals in solution?

Larger proteins contain more hydrogens

Slower molecular tumbling increases transverse relaxation

Chemical shifts become temperature-dependent

Isotope labeling becomes inefficient

Q12. How can NMR be used to improve crystallization success?

By directly growing crystals in the NMR tube

By identifying rigid domains and flexible regions via spectra

By replacing X-ray diffraction entirely

By increasing protein concentration indefinitely

Q13. What fundamental limitation of solution NMR does solid-state NMR overcome?

Requirement for isotope labeling

Dependence on molecular tumbling

Need for homogeneous samples

Sensitivity to magnetic field strength

Q14. Which type of biological system has been particularly well-suited for solid-state NMR studies?

Small globular enzymes

Protein fibrils and membrane proteins

Highly soluble monomers only

DNA duplexes exclusively

Q15. NMR solution structures are typically represented as ensembles rather than single conformations.

True

False

Q16. The first NMR protein structure was solved before any X-ray protein structures existed.

True

False

Q17. Cryo-EM is particularly well suited for very large macromolecular assemblies.

True

False

Q18. Hydrogen atoms are usually visible directly in X-ray diffraction data.

True

False

Q19. A narrow cluster of HSQC peaks near 8.3 ppm is characteristic of an unfolded protein.

True

False

Q20. A high-quality HSQC spectrum guarantees successful X-ray crystallization.

True

False

Q21. Flexible regions in proteins generally favor crystal lattice formation.

True

False

Q22. Both NMR and X-ray crystallography require homogeneous and stable protein samples.

True

False

Q23. Conformational selection during crystallization can lead to structures that differ from dominant solution states.

True

False

Q24. The main size limitation in solution NMR arises from slower molecular tumbling.

True

False

Q25. Solid-state NMR completely eliminates the need for specialized sample preparation.

True

False

Q26. Membrane proteins can sometimes be studied by solution NMR when embedded in detergent micelles.

True

False

Q27. Cryo-EM structures are always determined at lower resolution than X-ray structures.

True

False

Q28. Protein fibrils are particularly well suited to study by solid-state NMR.

True

False

Q29. There is a fundamental scientific reason to always prefer X-ray crystallography over NMR.

True

False