This lecture explains the final stages of X-ray crystallographic structure determination β how we improve, validate, and judge the quality of a protein model before publication.
Think of this stage as:
π§© You already built a rough protein model β now you polish, test, and verify if it truly matches the experimental data.
After building an initial model, refinement aims to:
β Minimize the difference between:
This is the central refinement goal.
Refinement is not blind fitting β we use known stereochemistry constraints:
These help guide the model toward physically realistic conformations.
Initial models often get stuck in local minima.
To escape this:
(Molecular-dynamics style refinement)
Goal β reach global minimum = best model.
These control model complexity vs data amount.
Reduce number of parameters.
Example:
Why?
π Low-resolution data β fewer reflections π Too many parameters β overfitting
Constraints prevent over-parameterization.
Allow flexibility but within allowed ranges:
Model can move β but not unrealistically.
Describes atomic mobility / disorder.
High B-factors cause:
β‘ Faster fall-off of scattering β‘ Poor high-resolution density visibility
Especially important for:
Measures mismatch between data and model.
R = rac{sum |F_ - F_|}{sum F_}
Refinement aims to reduce R continuously.
Super important concept β
Procedure:
Interpretation:
| Situation | Meaning |
|---|---|
| Rwork β and Rfree β | Model improving |
| Rwork β but Rfree β | β Overfitting noise |
| Rfree β 68% | Random model |
Difference between Rwork and Rfree β 5% is typical.
Plots Ο (phi) vs Ο (psi) torsion angles.
Regions:
π΄ Allowed π‘ Additional allowed π¨ Generously allowed βͺ Disallowed
Good model:
Exception:
π Catalytic residues may appear strained but real β always check electron density.
Measures how well model density matches observed density.
Good value:
RSCC > 0.9
Low RSCC + High B-factor β poorly defined region Typical example: flexible loops or incorrectly modeled ligands.
Ligands often:
Reasons:
Contour level matters:
Typical parameters:
ext{Redundancy} = rac{ ext{Measured reflections}}{ ext{Unique reflections}}
Higher redundancy β better precision.
How much of reciprocal space was measured.
Otherwise resolution claim is unreliable.
Agreement between symmetry-related reflections.
Rule of thumb:
Average B-factor for crystal.
Again shows disorder limits resolution.
Quality indicator of geometry.
Typical targets:
At low resolution β strong restraints β artificially small RMSD At high resolution β restraints can be loosened.
Structural waters may still appear even at lower resolution.
Protein crystallography workflow ends with:
1οΈβ£ Build model 2οΈβ£ Refine model (fit data + chemistry) 3οΈβ£ Validate model (statistics + geometry + density)
Only after passing all checks β structure is considered reliable.
This lecture essentially teaches:
𧬠A protein structure is not just βsolvedβ β it must be statistically and chemically proven correct.