Improved H3K27ac Histone Mark Prediction using K-mer Proximity Feature

Prediction of gene regulatory elements-enhancers is computationally challenging because features associated with them are ill-understood. Several histone marks are known to be associated with enhancers locations and have been successfully used to predict multiple thousands of enhancers approximate l...

Full description

Saved in:
Bibliographic Details
Main Authors: Fong, Pui Kwan, Lee, Nung Kion
Format: Conference or Workshop Item
Language:English
Published: IEEE 2015
Subjects:
Online Access:http://ir.unimas.my/id/eprint/11936/1/Improved%20H3K27ac_abstract.pdf
http://ir.unimas.my/id/eprint/11936/
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7349830
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Prediction of gene regulatory elements-enhancers is computationally challenging because features associated with them are ill-understood. Several histone marks are known to be associated with enhancers locations and have been successfully used to predict multiple thousands of enhancers approximate locations. The k-mer (a short continuous nucleotides of length k) is one of the most commonly engineered features from histone sequences for machine learning task. However, usually large kmer (i.e. 5 ≤ k ≤ 7) feature set is needed to perform well and no domain knowledge is used. In this study we proposed the kmer proximity feature which is domain dependent to represent the H3K27ac histone enrichment in DNA sequences. This feature represents the spatial content of DNA sequences. We compare the performances of using the proximity and the k-mer feature for H3K27ac marks prediction and results indicate that the proposed feature gives higher prediction accuracy rates. These findings supported that the proximity feature is a more distinguishing feature of DNA sequences with histone modification enrichment.