Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/32829
Title: PRiSM: A prototype for exhaustive, restriction-free database searching for mass spectrometry-based identification
Authors: VAN HOUTVEN, Joris 
Boonen, Kurt
Geert Baggerman, |
Askenazi, Manor
Laukens, Kris
HOOYBERGHS, Jef 
VALKENBORG, Dirk 
Issue Date: 2020
Publisher: WILEY
Source: RCM. Rapid communications in mass spectrometry, (Art N° e8962)
Status: Early view
Abstract: Rationale: The current methods for identifying peptides in mass spectral product ion data still struggle to do so for the majority of spectra. Based on the experimental setup and other assumptions, such methods restrict the search space to speed up computations, but at the cost of creating blind spots. The proteomics community would greatly benefit from a method that is capable of covering the entire search space without using any restrictions, thus establishing a baseline for identification. Methods: We conceived the "mass pattern paradigm" (MPP) that enables the creation of such an identification method, and we implemented it into a prototype database search engine "PRiSM" (PRotein-Spectrum Matching). We then assessed its operational characteristics by applying it to publicly available high-precision mass spectra of low and high identification difficulty. We used those characteristics to gain theoretical insights into trade-offs between sensitivity and speed when trying to establish a baseline for identification. Results: Of 100 low difficulty spectra, PRiSM and SEQUEST agree on 84 identifications (of which 75 are statistically significant). Of 15 of 100 spectra not identified in a previous study (using SEQUEST), 13 are considered reliable after visual inspection and represent 3 proteins (out of 9 in total) not detected previously. Conclusions: Despite leaving noise intact, the simple PRiSM prototype can make statistically reliable identifications, while controlling the false discovery rate by fitting a null distribution. It also identifies some spectra previously unidentifiable in an "extremely open" SEQUEST search, paving the way to establishing a baseline for identification in proteomics.
Keywords: Tool;Algorithm;Spectra
Document URI: http://hdl.handle.net/1942/32829
ISSN: 0951-4198
e-ISSN: 1097-0231
DOI: 10.1002/rcm.8962
ISI #: WOS:000589528500001
Rights: 2020 John Wiley & Sons, Ltd
Category: A1
Type: Journal Contribution
Validations: ecoom 2021
Appears in Collections:Research publications

Files in This Item:
File Description SizeFormat 
PRiSM_RCM.pdf
  Restricted Access
Published version5.78 MBAdobe PDFView/Open    Request a copy
PRiSM_SuppInfo_reviewed_submitted.pdfPeer-reviewed author version356.32 kBAdobe PDFView/Open
Show full item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.