Tom Goddard
March 29, 2022
SBGrid webinar: Cryo-electron Microscopy of Membrane Proteins from Sample to Structure
We try to find an initial atomic model for the human TACAN dimer structure using the AlphaFold database at the EBI and ChimeraX. The map and an atomic model were published in August 2021. The protein had no known homologous structures in the Protein Data Bank and is thought to be either a mechanosensitive ion channel involved in pain sensation or a lipid metabolism enzyme.
Cryo-EM structures of human TMEM120A and TMEM120B. Ke M, Yu Y, Zhao C, Lai S, Su Q, Yuan W, Yang L, Deng D, Wu K, Zeng W, Geng J, Wu J, Yan Z. Cell Discov. 2021 Aug 31;7(1):77. doi: 10.1038/s41421-021-00319-5. PMID: 34465718.
The AlphaFold database has about 1 million predicted structures (January 2022) including all human genes, all genes from 20 model system organisms, all SwissProt curated sequences, and sequences related to anti-microbial resistance and neglected tropical diseases.
>sp|Q9BXJ8|TACAN_HUMAN Ion channel TACAN OS=Homo sapiens OX=9606 GN=TMEM120A PE=1 SV=1 MQPPPPGPLGDCLRDWEDLQQDFQNIQETHRLYRLKLEELTKLQNNCTSSITRQKKRLQE LALALKKCKPSLPAEAEGAAQELENQMKERQGLFFDMEAYLPKKNGLYLSLVLGNVNVTL LSKQAKFAYKDEYEKFKLYLTIILILISFTCRFLLNSRVTDAAFNFLLVWYYCTLTIRES ILINNGSRIKGWWVFHHYVSTFLSGVMLTWPDGLMYQKFRNQFLSFSMYQSFVQFLQYYY QSGCLYRLRALGERHTMDLTVEGFQSWMWRGLTFLLPFLFFGHFWQLFNALTLFNLAQDP QCKEWQVLMCGFPFLLLFLGNFFTTLRVVHHKFHSQRHGSKKD
EMDB map 30495, 3.4 Angstrom resolution (fetched with the ChimeraX command open 30495 from emdb).
ChimeraX AlphaFold tool (menu Tools / Structure Prediction): enter the UniProt sequence name TACAN_HUMAN, then press the Fetch button.
AlphaFold EBI database model fit into the map (smoothed with volume gaussian #1 sdev 2).
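The fetch and smoothing steps in the captions above can also be collected into a ChimeraX command script. The sketch below is a small Python helper that only assembles the command text. The `open ... from emdb` and `volume gaussian #1 sdev 2` lines are taken from the captions. The `alphafold fetch` line is assumed to be the command-line equivalent of the Fetch button, and the function name and the model number #1 for the map (a fresh session) are assumptions made for illustration. Placing the model in the density is left to the interactive tools.

```python
def tacan_commands(emdb_id="30495", uniprot_id="TACAN_HUMAN", sdev=2):
    """Return ChimeraX command lines for fetching and smoothing the map.

    Assumes a fresh ChimeraX session, so the EMDB map opens as model #1.
    """
    return [
        # Fetch the cryo-EM map from the EMDB (command from the caption).
        f"open {emdb_id} from emdb",
        # Assumption: command-line equivalent of the AlphaFold panel Fetch button.
        f"alphafold fetch {uniprot_id}",
        # Smooth the map to make fitting easier (command from the caption).
        f"volume gaussian #1 sdev {sdev}",
    ]


if __name__ == "__main__":
    # Print the commands, e.g. to paste into a .cxc command file.
    print("\n".join(tacan_commands()))
```

Saving the printed lines to a file with the .cxc suffix and opening it in ChimeraX should replay the steps, provided the model numbering matches your session.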
The long intracellular alpha helix at the bottom can be moved as a rigid unit with the ChimeraX move atoms mouse mode so that it fits the density better, which improves the initial model. The atomic model can then be refined in the map to correct side chain positions, e.g. with the ChimeraX ISOLDE tool.
The Search button finds the closest sequences in the AlphaFold database. This is useful for seeing different conformations when there is no close match.
AlphaFold database BLAST search results for TACAN_HUMAN. | Four best sequence matches in AlphaFold Database.
The AlphaFold database only has predictions of single proteins. To predict the TACAN dimer, paste two copies of the sequence separated by a comma and press the Predict button. This runs AlphaFold on free Google Colab servers. You will be asked to sign in to your Google account (the same account used for Google email, drive and calendar). A security warning will say that the ChimeraX AlphaFold code being run is not from Google; click Run Anyway.
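Pasting two copies of a 343-residue sequence by hand is error prone, so the input text can be built programmatically. The following is a minimal sketch; the function name `multimer_input` is hypothetical, and it assumes the sequence is given as one-letter amino acid codes, possibly broken up by spaces or line breaks as in the FASTA record above.

```python
def multimer_input(sequence, copies=2):
    """Return `copies` copies of a one-letter sequence joined by commas."""
    if copies < 1:
        raise ValueError("copies must be at least 1")
    # Drop spaces and line breaks left over from FASTA formatting.
    cleaned = "".join(sequence.split()).upper()
    if not cleaned.isalpha():
        raise ValueError("sequence should contain only amino acid letters")
    return ",".join([cleaned] * copies)


if __name__ == "__main__":
    # Short fragment from the start of TACAN_HUMAN, used only as a demo.
    print(multimer_input("MQPPPPGPLG DCLRDWEDLQ"))
```

The resulting text is what gets pasted into the sequence field before pressing Predict.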
AlphaFold dimer (blue). Solved cryoEM structure 7cxr (red).
C-alpha RMSD 4 Angstroms.
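For reference, a C-alpha RMSD is the square root of the mean squared distance between paired C-alpha atoms. The sketch below shows only that arithmetic. It assumes the two structures have already been superimposed and that the coordinate lists are paired residue by residue; the function name `ca_rmsd` is hypothetical, and this is not necessarily how the value above was computed.

```python
import math


def ca_rmsd(coords_a, coords_b):
    """RMSD between two equal-length lists of (x, y, z) C-alpha positions.

    Assumes the structures are already superimposed and the lists are
    paired residue by residue. Units follow the input (Angstroms for PDB).
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("need two non-empty coordinate lists of equal length")
    total = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        # Squared distance between the paired atoms.
        total += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(total / len(coords_a))


if __name__ == "__main__":
    model = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
    reference = [(0.0, 0.0, 4.0), (1.0, 0.0, 4.0)]
    print(ca_rmsd(model, reference))
```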
~/Downloads/ChimeraX/AlphaFold/prediction_3
How do you judge if an AlphaFold model is correct without a known structure?
The Predicted Aligned Error (PAE) is shown with the Error Plot button on the ChimeraX AlphaFold panel. The Color PAE Domains button clusters residues that have low PAE between them to define domains. Different domains may be misaligned relative to one another.
pLDDT per-residue confidence coloring. | Residue vs residue PAE values. | Domains with high PAE between them may be incorrectly packed.
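To make the idea of PAE domains concrete, here is a simplified sketch that groups residues by linking every pair whose PAE is below a cutoff. This is plain single-linkage clustering with a union-find structure, not the algorithm ChimeraX uses for Color PAE Domains. The function name, the 5 Angstrom cutoff and the minimum domain size are assumptions chosen for illustration.

```python
def pae_domains(pae, cutoff=5.0, min_size=2):
    """Group residue indices into domains by linking low-PAE pairs.

    pae is a square list of lists; pae[i][j] is the predicted aligned
    error (Angstroms) for residue j when aligned on residue i.
    Returns a list of domains, each a sorted list of 0-based indices.
    """
    n = len(pae)
    parent = list(range(n))

    def find(i):
        # Follow parent links to the cluster root, compressing the path.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(n):
        for j in range(i + 1, n):
            # PAE is not symmetric, so require both directions to be low.
            if max(pae[i][j], pae[j][i]) <= cutoff:
                root_i, root_j = find(i), find(j)
                if root_i != root_j:
                    parent[root_j] = root_i

    groups = {}
    for i in range(n):
        groups.setdefault(find(i), []).append(i)
    domains = [g for g in groups.values() if len(g) >= min_size]
    return sorted(domains, key=lambda g: g[0])


if __name__ == "__main__":
    # Toy 4-residue matrix: residues 0-1 and 2-3 form two separate domains.
    toy = [[0, 2, 20, 20], [2, 0, 20, 20], [20, 20, 0, 3], [20, 20, 3, 0]]
    print(pae_domains(toy))
```

Single linkage tends to merge domains joined by even one confident contact, so real tools usually prefer graph-based community detection; the sketch is meant only to show how a PAE matrix leads to a domain assignment.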
With a modern GPU that is not available on Colab (Nvidia RTX 3090 with 24 Gbytes of memory, or A40 with 48 Gbytes) you can predict structures of 2000 or 3000 amino acids.