Another option is to increase data sharing so that researchers do not repeat the same observations or store several copies of a data set unnecessarily. This is an area in which domain specialists can be invaluable, says Glaves, because they can point scientists to the relevant resources. “If you have a data set that is an improvement, a higher resolution or better quality than a previous data set”, domain specialists can guide researchers on whether the older versions can be discarded, she says.

Data collection and storage, says Glaves, are quintessentially twenty-first-century problems because of the vast quantities of data that can now be created. Some researchers hope that they will be saved from having to weed out old data sets by technological leaps such as quantum-computing technologies and data centres that do not require as much energy to run. But those involved in data curation warn against keeping data for the sake of it.

It is not just about having a lot of data, says Briney. “It’s about being able to get your hands on it, understand it and then reuse it.”