Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/34605
Title: A Compositional Model to Predict the Aggregated Isotope Distribution for Average DNA and RNA Oligonucleotides
Authors: AGTEN, Annelies 
PROSTKO, Piotr 
GEUBBELMANS, Melvin 
De Vijlder, Thomas
VALKENBORG, Dirk 
Liu, Youzhong
Issue Date: 2021
Publisher: MDPI
Source: Metabolites, 11 (6) (Art N° 400)
Abstract: Structural modifications of DNA and RNA molecules play a pivotal role in epigenetic and posttranscriptional regulation. To characterise these modifications, more and more MS and MS/MS- based tools for the analysis of nucleic acids are being developed. To identify an oligonucleotide in a mass spectrum, it is useful to compare the obtained isotope pattern of the molecule of interest to the one that is theoretically expected based on its elemental composition. However, this is not straightforward when the identity of the molecule under investigation is unknown. Here, we present a modelling approach for the prediction of the aggregated isotope distribution of an average DNA or RNA molecule when a particular (monoisotopic) mass is available. For this purpose, a theoretical database of all possible DNA/RNA oligonucleotides up to a mass of 25 kDa is created, and the aggregated isotope distribution for the entire database of oligonucleotides is generated using the BRAIN algorithm. Since this isotope information is compositional in nature, the modelling method is based on the additive log-ratio analysis of Aitchison. As a result, a univariate weighted polynomial regression model of order 10 is fitted to predict the first 20 isotope peaks for DNA and RNA molecules. The performance of the prediction model is assessed by using a mean squared error approach and a modified Pearson's chi(2) goodness-of-fit measure on experimental data. Our analysis has indicated that the variability in spectral accuracy contributed more to the errors than the approximation of the theoretical isotope distribution by our proposed average DNA/RNA model. The prediction model is implemented as an online tool. An R function can be downloaded to incorporate the method in custom analysis workflows to process mass spectral data.
Notes: Valkenborg, D (corresponding author), UHasselt Hasselt Univ, Data Sci Inst, Agoralaan 1, BE-3590 Diepenbeek, Belgium.; Valkenborg, D (corresponding author), Interuniv Inst Biostat & Stat Bioinformat I BioSt, Agoralaan 1, BE-3590 Diepenbeek, Belgium.
annelies.agten@uhasselt.be; piotr.prostko@uhasselt.be;
melvin.geubbelmans@student.uhasselt.be; YLiu186@its.jnj.com;
TDEVIJLD@its.jnj.com; dirk.valkenborg@uhasselt.be
Keywords: DNA; RNA; oligonucleotide; prediction; isotope distribution; mass;spectrometry; software
Document URI: http://hdl.handle.net/1942/34605
e-ISSN: 2218-1989
DOI: 10.3390/metabo11060400
ISI #: WOS:000666674500001
Category: A1
Type: Journal Contribution
Validations: ecoom 2022
Appears in Collections:Research publications

Files in This Item:
File Description SizeFormat 
metabolites-11-00400-v2.pdfPublished version3.74 MBAdobe PDFView/Open
Show full item record

WEB OF SCIENCETM
Citations

2
checked on Apr 24, 2024

Page view(s)

56
checked on Sep 7, 2022

Download(s)

26
checked on Sep 7, 2022

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.