Data Mining for Building Neural Protein Sequence Classification Systems with Improved Performance

Wang, Dianhui and Lee, Nung Kion and Dillon, Tharam S. (2003) Data Mining for Building Neural Protein Sequence Classification Systems with Improved Performance. In: Neural Networks, 2003. Proceedings of the International Joint Conference on, 20-24 July 2003.

Data Mining_abstract.pdf

Download (106kB) | Preview
Official URL:


Traditionally, two protein sequences are classified into the same class if their feature patterns have high homology. These feature patterns were originally extracted by sequence alignment algorithms, which measure similarity between an unseen protein sequence and identified protein sequences. Neural network approaches, while reasonably accurate at classification, give no information ahout the relationship between the unseen case and the classified items that is useful to biologist. In contrast, in this paper we use a generalized radial basis function (GRBF) neural network architecture 'that generates fuzzy classification rules that could he used for further knowledge discovery. Our proposed techniques were evaluated using protein sequences with ten classes of super-families downloaded from a public domain database, and the results compared favorably with other standard machine learning techniques.

Item Type: Proceeding (Paper)
Uncontrolled Keywords: Data mining, neural protein sequence, generalized radial basis function (GRBF), unimas, university, universiti, Borneo, Malaysia, Sarawak, Kuching, Samarahan, ipta, education, research, Universiti Malaysia Sarawak
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
T Technology > T Technology (General)
Divisions: Academic Faculties, Institutes and Centres > Faculty of Cognitive Sciences and Human Development
Depositing User: Karen Kornalius
Date Deposited: 12 May 2016 04:32
Last Modified: 12 May 2016 04:32

Actions (For repository members only: login required)

View Item View Item