Conference Paper

Evaluation of distributed DNA representations on the classification of conserved non-coding elements

MPS-Authors
Athanasouli, M.
Department Integrative Evolutionary Biology, Max Planck Institute for Developmental Biology, Max Planck Society

Citation

Gialitsis, N., Giannakopoulos, G., & Athanasouli, M. (2020). Evaluation of distributed DNA representations on the classification of conserved non-coding elements. In C. Spyropoulos, I. Varlamis, I. Androutsopoulos, & P. Malakasiotis (Eds.), SETN 2020: 11th Hellenic Conference on Artificial Intelligence (pp. 41-47). New York, NY, USA: Association for Computing Machinery.


Cite as: https://hdl.handle.net/21.11116/0000-000D-68D3-0
Abstract
The representation of DNA sequences has been a topic of discussion for many years. Given the usefulness of embedding-based representations in Natural Language Processing (NLP), there have been efforts to transfer such paradigms to DNA and related problems. In this paper, we study different DNA representations on the well-studied problem of classifying Conserved Non-coding Elements (CNEs), trying to understand how well existing representations exploit context, both local (nearby) context and long-distance interactions in genomic sequences. To this end, we apply a number of methods, including probabilistic models (LDA) and hybrid probabilistic-neural models (lda2vec), to appropriate datasets, compare the results to pre-existing methods, and discuss the findings to better understand the value and challenges of different representations in this domain.