Abstract
We used data mining to identify LINE-1 (L1) characteristics associated with gene expression in bladder cancer. The data were collected from L1Base and GSE3167. A memory-efficient data structure, the FP-Tree, was employed to enumerate all frequent item sets. The frequent item sets were then used to produce rules for predicting “down regulation” and “not down.” Each rule was assigned a p-value by means of a Chi-square test. No statistically significant rules for “down” were found; in contrast, 692 rules for “not down” were significant, with odds ratios ranging from 1.68 to 1.98. All the significant rules involved only 20 characteristics. From these rules we were able to infer the L1 characteristics that down-regulate genes: the number of L1 elements in host genes, full-length intactness, the number of CpG islands, a conserved 5′UTR, and a mutated ORF2.
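The pipeline described above (enumerate frequent item sets, turn each into a rule for a target class, then score the rule with a Chi-square test and an odds ratio) can be illustrated with a minimal sketch. This is not the authors' implementation: it uses brute-force enumeration as a stand-in for FP-Tree mining, an uncorrected Chi-square test on a 2×2 table with one degree of freedom, and hypothetical function names, item names, and class labels chosen only for illustration.

```python
import math
from itertools import combinations


def frequent_itemsets(transactions, min_support, max_size=2):
    """Enumerate item sets whose support is at least min_support.

    Brute force over all combinations; a simple stand-in for FP-Tree
    mining, which finds the same sets more efficiently.
    """
    n = len(transactions)
    items = sorted(set().union(*transactions))
    result = {}
    for k in range(1, max_size + 1):
        for combo in combinations(items, k):
            candidate = frozenset(combo)
            count = sum(1 for t in transactions if candidate <= t)
            if count / n >= min_support:
                result[candidate] = count
    return result


def rule_stats(transactions, labels, itemset, target):
    """Score the rule 'itemset -> target' on a 2x2 contingency table.

    Returns (chi2, p_value, odds_ratio). The p-value assumes one degree
    of freedom and no continuity correction (an assumption of this sketch).
    """
    a = b = c = d = 0
    for t, y in zip(transactions, labels):
        has_items = itemset <= t
        is_target = (y == target)
        if has_items and is_target:
            a += 1          # rule fires, class matches
        elif has_items:
            b += 1          # rule fires, class differs
        elif is_target:
            c += 1          # rule silent, class matches
        else:
            d += 1          # rule silent, class differs
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    chi2 = n * (a * d - b * c) ** 2 / denom if denom else 0.0
    # For 1 degree of freedom, P(X > chi2) = erfc(sqrt(chi2 / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2))
    odds_ratio = (a * d) / (b * c) if b * c else math.inf
    return chi2, p_value, odds_ratio


# Hypothetical toy data: each gene is a set of L1 characteristics.
genes = ([frozenset({"intact_L1"})] * 40) + ([frozenset({"mutated_ORF2"})] * 40)
classes = (["not_down"] * 30 + ["down"] * 10) + (["not_down"] * 10 + ["down"] * 30)

for itemset in frequent_itemsets(genes, min_support=0.4):
    chi2, p, odds = rule_stats(genes, classes, itemset, "not_down")
    print(sorted(itemset), round(chi2, 2), p, odds)
```

In a real analysis many rules are tested at once, so the significance threshold would need to account for that; how the paper handled this is not stated in the abstract.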
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pratanwanich, N., Mutirangura, A., Aporntewan, C. (2010). Mining LINE-1 Characteristics That Mediate Gene Expression. In: Chan, J.H., Ong, YS., Cho, SB. (eds) Computational Systems-Biology and Bioinformatics. CSBio 2010. Communications in Computer and Information Science, vol 115. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16750-8_8
DOI: https://doi.org/10.1007/978-3-642-16750-8_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16749-2
Online ISBN: 978-3-642-16750-8
eBook Packages: Computer Science (R0)