A method for clustering billions of unidentified tandem mass spectra from shotgun proteomics experiments offers new ways of storing, organizing and analyzing proteomics data, with potential benefits ...