Tom Goddard
January 12, 2022, updated February 5, 2024
The table shows ColabFold 1.5.5 is about 2 to 10 times faster than AlphaFold 2.3.2.
AlphaFold version 2.1.1 has limitations in the protein or complex size that can be predicted limited mostly by available GPU memory. Also predictions of large structures can take more than 10 hours. Here are examples to gauge what is possible.
AlphaFold 2.1.2 switched OpenMM minimization to use GPU while AlphaFold 2.1.1 used CPU. This makes runs faster. Times below are for 2.1.1 unless stated otherwise.
How many amino acids can a protein or complex contain before AlphaFold runs out of memory? And how often does it make wrong predictions of large complexes? Here are examples of runs on large structures.
PDB | Proteins / unique | Total amino acids | GPU memory (GB) | Nvidia GPU | Time (hours) | Computer | Notes |
---|---|---|---|---|---|---|---|
CNAG_01613 CNAG_06362 | 2 / 2 | 4678 | ? | ? | 0.2 | Google AF3 server | AlphaFold 3. Completed, 5 predictions in 12 minutes. All are C-shaped instead of straight. Homologs of UNC79 and UNC80 in PDB 7sx3. |
CNAG_01613 CNAG_06362 | 2 / 2 | 4678 | 48 | A40 | 65 | UCSF cluster | ColabFold 1.5.5. Completed, two plausible folds. Homologs of UNC79 and UNC80 in PDB 7sx3 |
CNAG_01613 CNAG_06362 | 2 / 2 | 4678 | 48 | A40 | 103 | UCSF cluster | AlphaFold 2.3.2. Completed, two plausible folds. Homologs of UNC79 and UNC80 in PDB 7sx3 |
7KIY | 3 / 3 | 3780 | 48 | A40 | 16.5 | UCSF cluster | Colabfold 1.5.5. Completed, wrong packing, and monomers have lots of self-clashes. Image. |
6IG9 | 1 | 3744 | 48 | A40 | 35 | UCSF cluster | Completed, correct fold, used PDB templates. Image. |
3GHG + 8T2V | 5/5 | 3204 | 48 | A40 | 7.5 | UCSF cluster1 | ColabFold 1.5.5. Fibrin and integrin each predicted correctly but no binding identified. Image. |
6YEJ | 1 | 3199 | 48 | A40 | 28 | UCSF cluster1 | Each of 2 domains predicted correctly, but relative position wrong. Image. |
7JGD | 1 | 2660 | 48 | A40 | 13 | UCSF cluster | Completed. Misaligned domains. Image. |
7LHW | 1 | 2527 | 48 | A40 | 17 | UCSF cluster | Completed, roughly correct with many shifted domain positions. Image. |
6UM1 | 1 | 2499 | 24 | 3090 | 1 | Desktop PC | Failed in jax CUDA_ERROR_ILLEGAL_ADDRESS, probably out of GPU memory. Log. |
8DIT | 3 / 3 | 2498 | 48 | A40 | 3 | UCSF cluster | ColabFold 1.5.5. Completed. Trimer not packed correctly but domains of proteins are mostly correct. |
8DIT | 3 / 3 | 2498 | 24 | 3090 | 12 | Desktop PC | AlphaFold 2.3.1. Completed. Trimer not packed correctly but domains of proteins are mostly correct. |
8DIT | 3 / 3 | 2498 | 48 | A40 | 16 | UCSF cluster | AlphaFold 2.3.1. Completed. Trimer not packed correctly but domains of proteins are mostly correct. |
7M1P | 1 | 2273 | 48 | A40 | 7.5 | UCSF cluster | AlphaFold 2.1.2 and reduced databases which does not use hhblits. Mostly correct prediction, 1A RMSD for 1900 atoms, N terminal domains shifted 10A. Image. |
7M1P | 1 | 2273 | 48 | A40 | 3.7 | UCSF cluster | Failed making sequence alignment in hhblits "ERROR: did not find 569 match states in sequence 1 of tr|A0A1H9CGJ3|A0A1H9CGJ3_9FIRM". AlphaFold 2.1.1. Log. |
7M1P | 1 | 2273 | 24 | 3090 | 4.5 | Desktop PC | Failed making sequence alignment. Log. |
7ALP | 1 | 2084 | 24 | 3090 | 8 | Desktop PC2 | Completed. Misaligned domains. Image. |
7BAN | 1 | 1932 | 24 | 3090 | 1.3 | Desktop PC | Failed making sequence alignment in hhblits. Log. |
7P3T | 6 / 1 | 1812 | 48 | A40 | 12.5 | UCSF cluster | Correct prediction, mostly 1A rmsd. Image. |
8T2V | 2 / 2 | 1770 | 48 | A40 | 1 | UCSF cluster | ColabFold 1.5.5. Correct prediction. Image. |
6X6U | 4 / 2 | 1592 | 24 | 3090 | 3.5 | Desktop PC | Correct prediction, mostly 1A rmsd. Image. |
7vku | 4 / 4 | 1527 | 24 | 3090 | 1.5 | Desktop PC | Alphafold 2.3.1 using local ColabFold installation, no templates, no minimization. Part correct 0.9 A rmsd, beta barrels not joined. |
7vku | 4 / 4 | 1527 | 16 | T4 | 4 | Google Colab | Failed after computing 2 models. Alphafold 2.3.1 using ColabFold web site, no templates, no minimization. Said "session crashed after using all available RAM". Also said "Session timed out". Run with ColabPro. |
7S62 | 1 | 1441 | 16 | P100 | 2 | Google Colab | Failed, out of GPU memory allocating 19 GB. Colab resource monitor shows GPU (not CPU) memory is exhausted. 5000 aligned sequences. Log. |
7Q5Z | 1 | 1436 | 16 | P100 | 1 | Google Colab | Failed using ColabFold after predicting two models. CUDA_ERROR_ILLEGAL_ADDRESS. Log. |
3GHG | 3 / 3 | 1434 | 48 | A40 | 0.7 | UCSF cluster | Colabfold 1.5.5. Correct fold. Image. |
7E5N | 4 / 1 | 1292 | 48 | A40 | 4 | UCSF cluster | Completed with AlphaFold 2.1.2. Wrong packing. Image. |
7E5N | 4 / 1 | 1292 | 48 | A40 | 11 | UCSF cluster | Completed with AlphaFold 2.1.1. Wrong packing. Image. |
7E5N | 4 / 1 | 1292 | 24 | 3090 | 2.5 | Desktop PC | Completed with AlphaFold 2.1.2. Wrong packing. Image. |
7E5N | 4 / 1 | 1292 | 24 | 3090 | 3.5 | Desktop PC | Failed predicting model 2 with OpenMM error writing PDB "The number of positions must match the number of atoms". AlphaFold 2.1.1. Identical error on second try. Log. |
7QFP | 1 | 1269 | 16 | P100 | 2 | Google Colab | Completed with ColabFold with some wrong domain positions. Image. |
6XMP | 1 | 1239 | 16 | P100 | 5 | Google Colab | Failed. Only 1 of 5 models completed, others gave out of memory. One completed model ran out of memory in energy minimization. 6000 aligned sequences. |
7KTT | 1 | 1142 | 16 | P100 | 3 | Google Colab | Failed. Only 2 of 5 models completed, others gave out of memory allocating 13 GB. One completed model ran out of memory in energy minimization. 1500 aligned sequences. Log. Image. |
6Z03 | 2 / 1 | 1078 | 16 | P100 | 5 | Google Colab3 | Completed with wrong complex, proteins on top of each other. Image. |
5N5F | 10 / 1 | 980 | 16 | P100 | 2.5 | Google Colab | Correct prediction, 0.4 A rmsd. Image. |
6Z1J | 3 / 3 | 824 | 24 | 3090 | 1.5 | Desktop PC | Correct prediction, mostly 0.5 A rmsd. Image. |
6Z1J | 3 / 3 | 824 | 48 | A40 | 2.5 | UCSF cluster | Correct prediction, mostly 0.5 A rmsd. |
6Z1J | 3 / 3 | 824 | 16 | P100 | 5 | Google Colab | Correct prediction, mostly 0.5 A rmsd. |
7fc7 | 1 | 96 | 24 | 3090 | 0.02 | Desktop PC | Alphafold 2.3.1 using local ColabFold installation, no templates, no minimization. Correct prediction, mostly 0.8 A rmsd. |
7fc7 | 1 | 96 | 16 | T4 | 0.08 | Google Colab | Alphafold 2.2.2. ColabFold run on Google Colab through ChimeraX, no templates, no minimization. Correct prediction, mostly 0.8 A rmsd. 1 minute to install ColabFold on Google Colab and 4 minutes to run 5 models. |
7fc7 | 1 | 96 | 48 | A40 | 0.02 | UCSF cluster | ColabFold 1.5.5, no templates, no minimization. Correct prediction, mostly 0.7 A rmsd. Sequence alignment done in advance with --msa-only option. |
7fc7 | 1 | 96 | 48 | A40 | 0.8 | UCSF cluster | Alphafold 2.3.1. Correct prediction, mostly 0.7 A rmsd. |
AlphaFold can be run without a GPU using the AlphaFold Docker --use_gpu=False option or the run_alphafold.py script --gpu_devices=-1 option. Here are some tests with small proteins comparing runs without GPU to with GPU.
PDB | Proteins / unique | Total amino acids | GPU memory (GB) | Nvidia GPU | Time (hours) | Computer | Notes |
---|---|---|---|---|---|---|---|
6Z1J | 3 / 3 | 824 | No GPU | 17 | UCSF cluster | Correct prediction. | |
6Z1J | 3 / 3 | 824 | 48 | A40 | 2.5 | UCSF cluster | Correct prediction, mostly 0.5 A rmsd. |
7B76 | 1 | 125 | No GPU | 2.7 | UCSF cluster | Wrong fold. Image. | |
7B76 | 1 | 125 | 48 | A40 | 0.5 | UCSF cluster | Wrong fold. |
7B76 | 1 | 125 | No GPU | 1 | Desktop PC | Wrong fold. | |
7B76 | 1 | 125 | 24 | 3090 | 0.3 | Desktop PC | Wrong fold. |