Data ECCB 2016 main conference

HT09 – Searching molecular structure databases with tandem mass spectra using CSI:FingerID


Theater (plenary hall) September 6, 2016 10:20 am - 10:40 am

Bookmark and Share


Highlight talk – Theme: Data

Abstract

Speaker: Juho Rousu Identification of small molecules from biological samples remains a major bottleneck in understanding the inner working of organisms and their ecosystems. In metabolomics, vast majority of the molecules in tandem mass spectra remain unknown. In recent years, major advances have been gained through adopting machine learning models trained on large databases of reference spectra. I will present CSI:FingerID (http://www.csi-fingerid.org), a software that identifies small molecules from their tandem mass spectra. CSI:FingerID is currently the most accurate methods for identification of metabolite from metabolomics samples, providing 150% more correct identifications than the closest competing method. In the presentation, I will give an overview of the technology underlying CSI:FingerID, combining multiple kernel learning, fragmentation tree models and arrays of support vector machines. I will also give a peek to the recent developments that allow three orders of magnitude faster training and also provide improved identification accuracy.

Authors

Kai Dührkop, Friedrich-Schiller-University Jena, Germany
Huibin Shen, Aalto University, Finland
Marvin Meusel, Friedrich-Schiller-University Jena, Germany
Céline Brouard, Aalto university, Finland
Sebastian Böcker, Friedrich Schiller University Jena, Germany
Juho Rousu, Aalto University, Finland

Source of publication

2015, Proceedings of the National Academy of Sciences, vol 112 no 41, pages 12580-12585.