Perturbation models could also accelerate hypothesis generation and testing, sparing researchers the need for massive high-throughput screens. This would allow them to focus on validating computationally generated hits. “I think we’re going to see a shift towards simulating first, doing experiments later,” says Lundberg. Because these simulations evolve to capture more detail about the cellular environment and its surroundings, they could help to reduce reliance on animal models for drug development and testing, yielding human-centric predictions that reduce the risk of toxicity and failure in clinical trials.

But researchers will also need to overcome the well-known limitations of generative AI. Chatbots routinely fabricate ‘facts’ and even actively mislead users, and image-generation algorithms are prone to hallucinatory flights of fancy. Xing cautions that early-generation models will fundamentally be simulations of biology rather than true replicas of cellular reality. Accordingly, he and others in the field favour early release and public testing of models, and the data sets used to train them, so that users can uncover the models’ strengths and limitations. “Once our virtual cell is there, we are going to make it public for people to play with,” says Xing. “It may embarrass us to have very bad results, but I think that’s a journey that we have to work through.”