Evaluation of Convolutionary Neural Networks Modeling of DNA Sequences using Ordinal versus one-hot Encoding Method

Chieng, Allen Hoon Choong and Lee, Nung Kion (2017) Evaluation of Convolutionary Neural Networks Modeling of DNA Sequences using Ordinal versus one-hot Encoding Method. International Conference On Computer And Drone Applications (ICONDA) 2017. ISSN 978-1-5386-0765-7 (ISBN) (In Press)

[img]
Preview
PDF
Evaluation of Convolutionary Neural Networks (abstract).pdf

Download (46kB) | Preview
Official URL: https://www.biorxiv.org/content/early/2017/10/25/1...

Abstract

Convolutionary neural network (CNN) is a popular choice for supervised DNA motif prediction due to its excellent performances. To employ CNN, the input DNA sequences are required to be encoded as numerical values and represented as either vectors or multi-dimensional matrices. This paper evaluates a simple and more compact ordinal encoding method versus the popular one-hot encoding for DNA sequences. We compare the performances of both encoding methods using three sets of datasets enriched with DNA motifs. We found that the ordinal encoding performs comparable to the one-hot method but with significant reduction in training time. In addition, the one-hot encoding performances are rather consistent across various datasets but would require suitable CNN configuration to perform well. The ordinal encoding with matrix representation performs best in some of the evaluated datasets. This study implies that the performances of CNN for DNA motif discovery depends on the suitable design of the sequence encoding and representation. The good performances of the ordinal encoding method demonstrates that there are still rooms for improvement for the one-hot encoding method.

Item Type: Article
Uncontrolled Keywords: DNA sequence encoding, convolutionary neural networks, motif discovery, research, Universiti Malaysia Sarawak, unimas, university, universiti, Borneo, Malaysia, Sarawak, Kuching, Samarahan, ipta, education
Subjects: Q Science > Q Science (General)
Divisions: Academic Faculties, Institutes and Centres > Faculty of Cognitive Sciences and Human Development
Faculties, Institutes, Centres > Faculty of Cognitive Sciences and Human Development
Academic Faculties, Institutes and Centres > Faculty of Cognitive Sciences and Human Development
Depositing User: Lee
Date Deposited: 03 Jan 2018 03:59
Last Modified: 01 Aug 2019 02:35
URI: http://ir.unimas.my/id/eprint/18960

Actions (For repository members only: login required)

View Item View Item