Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/35299
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorVALKENBORG, Dirk
dc.contributor.advisorVAN HYFTE, Dirk
dc.contributor.authorAnkunda, Violet
dc.date.accessioned2021-09-13T13:06:28Z-
dc.date.available2021-09-13T13:06:28Z-
dc.date.issued2021
dc.identifier.urihttp://hdl.handle.net/1942/35299-
dc.description.abstractSequence alignment is the process of comparing different sequences by searching for a series of individual characters. Current state-of-the-art biological sequence alignment algorithms such as BLAST relies on heuristics and dynamical programming based on probabilistic models. Moreover, these algorithms perform analysis within a so-called query window defined as the most similar region to that of the query sequence, with a risk of missing homologies outside that window which may possibly remain relevant. The main purpose of this Master Thesis project is to evaluate the performance of BLAST, retrieve homologous sequences of given queries from a set of well known protein sequences, as well as evaluating how different metrics can be used for performance. Lastly, is to use this framework to compare BLAST with other heuristic- based biological alignment algorithms. In this project, the Protein Data Bank (PDB) data was used as the target sequence. The Structural Classification of Proteins-extended which was used to generate the query set with 100 sequences classifies proteins based on similarities of their structures and amino acid sequences. Receiver Operating Characteristic (ROC) and precision-recall curves were plotted for various results to compare BLAST results of different varying parameters. To assess the overall performance, area under the curve was calculated for each of the graphs. The results indicated a marginal difference between the performance of BLAST using default parameters and modifying the parameters.
dc.format.mimetypeApplication/pdf
dc.languageen
dc.publishertUL
dc.titleAssessing performance of heuristic-based biological alignment algorithms
dc.typeTheses and Dissertations
local.bibliographicCitation.jcatT2
dc.description.notesMaster of Statistics and Data Science-Bioinformatics
local.type.specifiedMaster thesis
item.fullcitationAnkunda, Violet (2021) Assessing performance of heuristic-based biological alignment algorithms.-
item.accessRightsOpen Access-
item.fulltextWith Fulltext-
item.contributorAnkunda, Violet-
Appears in Collections:Master theses
Files in This Item:
File Description SizeFormat 
c580a16f-f5e9-4ab7-b18a-a1bc2af4830a.pdf2.76 MBAdobe PDFView/Open
Show simple item record

Page view(s)

118
checked on Nov 7, 2023

Download(s)

60
checked on Nov 7, 2023

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.