(Well, the Science and Art parts, anyway! This is originally a year-old story, republished by Cosmos today. I scouted around for some sort of an update, but didn’t find one. I still thought this is interesting, and at least now we know another area in which A.I. might be applied. I think that’s good to know, since A.I. does make mistakes, as noted below.)
The artificial intelligence program AlphaFold is proving to be a gamechanger for biological research, Imma Perfetto reports. This article was originally published in the Cosmos Print Magazine, September 2024.
October 11, 2025 Imma Perfetto

This artwork of an origami bird holds AlphaFold 3 predictions of a complex of two proteins (ScpA and ScpB) in its beak. The protein complex is important during cell division in bacteria. Top: ScpA is cyan and ScpB is green. Bottom: Confidence measures, where dark blue is very high confidence, light blue is confident, yellow is low confidence, and orange is very low confidence in the structural prediction. Credit: AlphaFold 3, Katie Michie.
A protein is made from of a chain of amino acids strung together like beads on a necklace. This chain spontaneously folds, like origami, into intricate pleats, folds, and loops through interactions between its amino acids. The resulting unique 3D structure largely determines its vital function within the lifeform. Solving the structure allows biologists to better understand how the protein works and design experiments to affect and modify it.
The smallest known protein, TAL, influences development of the fruit fly Drosophila melanogaster and has just 11 amino acids. The largest, Titin, is found in human muscle cells and is made up of roughly 35,000.
Proteins are far too tiny to inspect under a regular microscope. For decades researchers used complex experimental techniques, such as X-ray crystallography, nuclear magnetic resonance (NMR) spectroscopy, and cryogenic electron microscopy (cryo-EM) to solve their structures. Itโs painstaking, time-consuming work that takes specialised skill and sometimes hundreds of thousands of dollars. And, as Kate Michie can attest, success is not always guaranteed.
โI spent four years trying to solve the crystal structure of a complex of two human proteins and got scooped. You know, I got nothing out of four years. I worked really hard at it, and it was a really difficult project. AlphaFold can calculate those in a few hours,โ says Michie, who is chief scientist of the Structural Biology Facility at the Mark Wainwright Analytical Centre, of the University of New South Wales Sydney.
On 8 May 2024 Nature dropped a paper introducing the third and latest iteration of the artificial intelligence (AI) system AlphaFold, which predicts the 3D structure of proteins from their amino acid sequences. Google DeepMind and Isomorphic Labs, both subsidiaries of Alphabet, co-developed the new model. They say AlphaFold 3 (AF3) is โa revolutionary model that can predict the structure and interactions of all lifeโs molecules with unprecedented accuracyโ. But, while AF3 has generated significant interest since its release, it has simultaneously sparked criticism among those in the scientific community.
Letโs take a closer look at how AI is changing the world of structural biology.
A revolution in protein structure
AF3โs predecessor, AlphaFold 2, was released as open source code in July 2021 and immediately changed the game in structural biology.
โI contacted the high-performance computation people and said, โwe really need to get this piece of code runningโ. And then I asked my colleague, โDo you have any structures that you never submitted to the Protein Data Bank?โโ says Michie.
The Protein Data Bank (PDB) is the global archive of all the experimentally solved structures for large biological molecules. As of June 2024, its estimated to include more than 220,000 proteins, which sounds like a lot until you consider the number of proteins we know of exceeds 200 million.
โMy colleague sent me a sequence of a small protein he never submitted to the PDB, I ran it, and I just sent him the result. His email response to me was: โMy mind is blown!โ And he said, โI immediately thought someone else must have solved the structure.โโ
But they hadnโt, AF2 had accurately predicted the 3D structure of the protein from its amino acid sequence alone. What had taken years to describe experimentally had been done in just a few hours.
AF2 is a deep learning algorithm. In the world of AI that means it simulates the neural networks found in human brains. First, it takes the protein sequence of interest and searches several databases for similar proteins. By comparing these sequences, it can identify areas of similarity and difference to understand how the protein has changed across evolution.
For instance, if two amino acids are in close contact in 3D space then a mutation in one will usually be accompanied by a mutation in the other (to conserve the structure of the protein). But if they are far apart then they tend to evolve independently from each other. Using this to work out the relative positions of the amino acids, AF2 then takes its training on PDB structural data and iteratively constructs a 3D model of the proteinโs structure with relatively high accuracy.
Scientists can take advantage of that predicted structure to accelerate their science by doing smarter, more strategic experiments in the laboratory right off the bat. โIโve done work with some scientists working with immune complexes, and the models coming out of AlphaFold enable them to really trim down the number of animal experiments they do,โ says Michie. โSo instead of making say 20 CRISPR mice, they only might make two.โ

Crystal clues
An accurate AlphaFold structure can also be the crucial missing piece of the puzzle that allows researchers to experimentally solve the structure using X-ray crystallography.
โOne of my other colleagues is virologist and heโd been working on a protein that had eluded structural elucidation for 20โ30 years. It was from the worldโs first known retrovirus,โ says Michie.
โThe trick of crystallography is you need to know two components of the maths to solve them,โ she continues. The diffraction data provided by X-ray crystallography gives you one of those components, but you donโt have the other: the phase.
Traditional methods of obtaining phase information had proved unsuccessful, until Michie suggested using AlphaFold instead.
โImmediately the structure came out. AlphaFold helped him get the crystals but then actually enabled him to phase the structure. It told us that the Alpha Fold model was very good, but it also fixed up this problem in structural biology.โ
To Michie, AlphaFold represents a massive step forward: โitโs genuinely the biggest scientific advance in my careerโ.
โThe Alpha Fold model was very good, but it also fixed up this problem in structural biology.โ
Predicting the structures of lifeโs molecules
Proteins donโt exist in a vacuum. They move around, bind to and modify each other, and even form large, complicated complexes.
Peter Czabotar, joint head of the Structural Biology Division at WEHI, the oldest medical research institute in Australia, says one of the early limitations of AF2 was you could only ever get structural predictions of one protein, alone. โOften what youโre interested in is how different proteins will interact with each other. For example, we work on proteins that are involved with cell death and the interactions between those proteins will dictate whether a cell will live or die.โ
The gap has since been bridged by other research groups adapting and building upon AF2โs open source code, and with the AlphaFold-Multimer extension in October 2021.
The newest version, AF3, extends upon this capability by predicting interactions of multiple proteins, and nucleic acids (DNA and RNA). It can predict the impact of ions and post-translational modifications โ the addition of chemical groups to amino acids โ on these molecular systems too. AF3 can also be used to predict how a selection of small molecules called ligands bind to proteins, though this is restricted to ligands that have high-quality experimental data available in the PDB.
โBut where the real power is, something that we do a lot of, is in the drug discovery world,โ says Czabotar. โAnd it is extremely powerful for that, potentially, but they havenโt enabled that in the way that itโs released. Weโve done drug discovery against cell death proteins, for example. I canโt take one of the drugs that weโve worked with and see how it interacts with my target protein, I can only use the [ligands] that theyโve enabled us to use.โ
That capability to predict the structure of novel drug molecules interacting with target proteins seems to be restricted to Isomorphic Labs, which was launched in 2021 to pursue commercial drug discovery.
AF3 uses a very different approach for this new suit of predictions: generative AI. After processing the sequence inputs, it assembles its predictions using a diffusion network, the likes of which power AI image generators. According to Isomorphic Labsโ website: โthe diffusion process starts with a cloud of atoms, and over many steps converges on its final, most accurate molecular structureโ. Diffusion has been applied to protein structure prediction before, for example, in the seminal RoseTTAFold diffusion (RFdiffusion) by the Baker Laboratory at the Institute for Protein Design, the University of Washington.
But generative AI is not without its limitations. AF3 will occasionally produce structures with overlapping atoms (this is physically impossible) or replace a detail of the structure with its mirror image (chemically impossible). As a generative model, it is also prone to hallucinations in which it invents plausible-looking structures โ particularly in disordered regions of the protein that lack a stable 3D structure โ similarly to how a text to image AI struggles to create realistic-looking hands. In-built confidence measures help to identify when AF3 isnโt so sure about its structural prediction, but ultimately it takes a scientist with understanding of the underlying structural biology to come along and identify whatโs gone wrong, and why.
โItโs very, very powerful. But it doesnโt exclude the need to necessarily confirm things experimentally. Whether that is by solving structures themselves or by, for example, testing the structures in some way in an experiment,โ says Czabotar.
Concerns about code
In a major departure from AF2, access to the newest iteration of AlphaFold is limited to a web server and for non-commercial research only. โWe have various structure-based drug discovery projects and some of them are purely academic, as students, PhDs and honours projects. But we also have had commercial partnerships, because thatโs a way to push your discoveries into a clinical setting,โ says Czabotar. โSo generally, anything that is going to make an impact is done by an academic lab in a commercial partnership. Now, I guess it puts us in a bit of an awkward situation. Even if we could look at our compounds bound to the target [protein], thereโs some projects where we wonโt be able to do it because, you know, weโve ticked a box.โ
AF3โs accompanying Nature paper was also published without the source code, but with a โpseudocodeโ instead โ a detailed description of what the code can do and how it works. This prompted an open letter to the Editors of Nature, published 16 May and endorsed by more than 1,000 scientists as of June.
The letter raised concerns that โthe absence of available code compromises peer reviewโ and that the pseudocode released would โrequire months of effort to turn into workable code that approximates the performance, wasting valuable time and resourcesโ. Access to the web server was also initially capped at 10 predictions per day, which the letter stated, โrestricts the scientific communityโs capacity to verify the broad claims of the findings or apply the predictions on a large scaleโ.
The sentiments appear to have hit home. Shortly after the letterโs release, DeepMindโs Vice President of research, Pushmeet Kohli announced via X that they would double the daily job limit to 20 and are โworking on releasing the AF3 model (incl weights) for academic use โฆ within 6 monthsโ.
On 22 May Nature responded in an editorial, stating its reasoning for publishing the paper without code: โthe private sector funds most global research and development, and many of the results of such work are not published in peer-reviewed journals. We at Nature think itโs important that journals engage with the private sector and work with its scientists so they can submit their research for peer review and publication.โ
In the meantime, other researchers wonโt be sitting idly by until the code release at the end of 2024. Already, multiple teams are racing to develop their own open source versions of AlphaFold 3, without any strings attached.








