GMFR-CNN: An Integration of Gapped Motif Feature Representation and Deep Learning Approach for Enhancer Prediction

Yu, Shiong Wong and Nung, Kion Lee and Norshafarina, Omar (2016) GMFR-CNN: An Integration of Gapped Motif Feature Representation and Deep Learning Approach for Enhancer Prediction. CSBio '16 Proceedings of the 7th International Conference on Computational Systems-Biology and Bioinformatics. ISSN ISBN: 978-1-4503-4794-5

[img]
Preview
PDF
GMFR-CNN An Integration of Gapped Motif (abstract).pdf

Download (232kB) | Preview
Official URL: http://dl.acm.org/citation.cfm?id=3029380

Abstract

Unravelling gene expression has become a critical procedure in bioinformatics world today and required continuous efforts to form a complete picture of enhancers. Enhancers are explicit patterns of gene expression that bound by activators to stimulate transcription. It could reside in upstream or downstream thousands of base pairs away without any fixed position. Therefore, the identification task of enhancers is extremely challenging. The inclusion of gaps in motif identification improved the overall accuracy and sensitivity, however, this feature is not fully utilised in deep learning method yet. Deep learning, is a powerful machine learning technique that has been actively used in image recognition and this technique has begun to shed light in bioinformatics. The expressiveness of deep learning enables higher feature learning from lower level ones. As a result, an integration of gapped motif feature representation (GMFR) and deep learning approach called deep convolutional neural networks (CNNs) is introduced to improve the predictive power of enhancer prediction. We called this method as GMFR-CNN. Comparative studies indicate that GMFR-CNN outperforms the other deep learning and gapped k-mer SVM tools with average 98% prediction accuracy. Breakthrough in deep learning technique certainly improves the performance in the near future.

Item Type: Article
Uncontrolled Keywords: Convolution neural network, enhancer motifs, gapped motif feature representation, research, Universiti Malaysia Sarawak, unimas, university, universiti, Borneo, Malaysia, Sarawak, Kuching, Samarahan, ipta, education
Subjects: T Technology > T Technology (General)
Divisions: Academic Faculties, Institutes and Centres > Faculty of Cognitive Sciences and Human Development
Faculties, Institutes, Centres > Faculty of Cognitive Sciences and Human Development
Academic Faculties, Institutes and Centres > Faculty of Cognitive Sciences and Human Development
Depositing User: Karen Kornalius
Date Deposited: 30 Mar 2017 06:36
Last Modified: 30 Mar 2017 06:36
URI: http://ir.unimas.my/id/eprint/15731

Actions (For repository members only: login required)

View Item View Item