A. Kanavos, I. Livieris, Ph. Mylonas, S. Sioutas, G. Vonitsanos
Apache Spark Implementations for String Patterns in DNA Sequences
GeNeDis 2018, Toronto, Canada, 25-28 October 2018
ABSTRACT
The amount of available numerical data grows at an extraordinary rate from day to day. This is particularly true of DNA sequences produced by new high-throughput Next Generation Sequencing (NGS) technologies. In this paper, we run experiments with Apache Spark on sequences obtained from the National Center for Biotechnology Information (NCBI). We address three of the most widely studied string problems: Longest Common Prefix (LCP), Longest Common Substring (LCS), and Longest Common Subsequence (LCSub).
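The three problems named in the abstract can be illustrated on a pair of short DNA strings. The following is a minimal single-machine sketch in plain Python, not the Spark implementation from the paper; the function names and the example sequences are assumptions chosen for illustration only.

```python
def longest_common_prefix(a: str, b: str) -> str:
    """Return the longest string that both a and b start with."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return a[:n]


def longest_common_substring(a: str, b: str) -> str:
    """Return one longest contiguous run of characters found in both a and b."""
    best_len, best_end = 0, 0
    # prev[j] = length of the longest common suffix of a[:i-1] and b[:j]
    prev = [0] * (len(b) + 1)
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            if a[i - 1] == b[j - 1]:
                curr[j] = prev[j - 1] + 1
                if curr[j] > best_len:
                    best_len, best_end = curr[j], i
        prev = curr
    return a[best_end - best_len:best_end]


def longest_common_subsequence_length(a: str, b: str) -> int:
    """Return the length of the longest (not necessarily contiguous) common subsequence."""
    # prev[j] = LCSub length of a[:i-1] and b[:j]
    prev = [0] * (len(b) + 1)
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            if a[i - 1] == b[j - 1]:
                curr[j] = prev[j - 1] + 1
            else:
                curr[j] = max(prev[j], curr[j - 1])
        prev = curr
    return prev[len(b)]


if __name__ == "__main__":
    # Hypothetical toy sequences, not taken from NCBI.
    s1, s2 = "ACGTACGT", "ACGGTACG"
    print(longest_common_prefix(s1, s2))
    print(longest_common_substring(s1, s2))
    print(longest_common_subsequence_length(s1, s2))
```

Both dynamic-programming routines take time proportional to the product of the two sequence lengths, which gives a sense of why a distributed framework becomes attractive for long sequences.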
25 October 2018