Classification of Protein Sequences using the Growing Self-Organizing Map

Protein sequence analysis is an important task in bioinformatics. The classification of protein sequences into groups is beneficial for further analysis of the structures and roles of a particular group of protein in biological process. It also allows an unknown or newly found sequence to be identif...

Full description

Saved in:
Bibliographic Details
Main Author: Ahmad, N.
Format: Conference or Workshop Item
Language:English
Published: 2008
Subjects:
Online Access:http://eprints.utem.edu.my/id/eprint/90/1/Norashikin__iciafs2008.pdf
http://eprints.utem.edu.my/id/eprint/90/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Protein sequence analysis is an important task in bioinformatics. The classification of protein sequences into groups is beneficial for further analysis of the structures and roles of a particular group of protein in biological process. It also allows an unknown or newly found sequence to be identified by comparing it with protein groups that have already been studied. In this paper, we present the use of growing self-organizing map (GSOM), an extended version of the self-organizing map (SOM) in classifying protein sequences. With its dynamic structure, GSOM facilitates the discovery of knowledge in a more natural way. This study focuses on two aspects; analysis of the effect of spread factor parameter in the GSOM to the node growth and the identification of grouping and subgrouping under different level of abstractions by using the spread factor.