Abstract
This paper proposes a new modification of the rule induction method for describing gene groups with Gene Ontology (GO) terms, based on the FP-growth algorithm. The modification takes advantage of the hierarchical structure of the GO graph, a specific property of a single prefix-path FP-tree, and the fact that rules generated for description purposes do not include in their premise two GO terms that are in a parent-child relation. The proposed algorithm was implemented and tested on two different expression datasets. The time performance of the old and new approaches is compared, together with the descriptions obtained by the two methods. The results show that the new method generates rules faster, while the number of rules and the coverage are similar in both approaches.
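The premise constraint described in the abstract can be illustrated with a minimal sketch. It is not the chapter's implementation: the toy GO fragment, the term identifiers, and the function names are hypothetical, the parent-child restriction is assumed to extend to all ancestors, and the filtering is done by plain enumeration rather than inside FP-growth itself. The sketch relies only on the known property that, for a single prefix-path FP-tree, every combination of the terms on the path is a frequent pattern, and it skips combinations that contain two hierarchically related terms.

```python
from itertools import combinations

# Hypothetical toy GO fragment: each term maps to its direct parents.
PARENTS = {
    "GO:A": set(),
    "GO:B": {"GO:A"},
    "GO:C": {"GO:A"},
    "GO:D": {"GO:B"},
}


def ancestors(term, parents=PARENTS):
    """Return every ancestor of `term` by walking parent links upward."""
    seen = set()
    stack = list(parents.get(term, ()))
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(parents.get(current, ()))
    return seen


def related(a, b, parents=PARENTS):
    """True if one term is an ancestor of the other (assumed reading of the constraint)."""
    return a in ancestors(b, parents) or b in ancestors(a, parents)


def premises_from_single_path(path, parents=PARENTS):
    """Enumerate term combinations from a single prefix path,
    skipping any combination with two hierarchically related terms."""
    result = []
    for size in range(1, len(path) + 1):
        for combo in combinations(path, size):
            if not any(related(a, b, parents) for a, b in combinations(combo, 2)):
                result.append(combo)
    return result


candidates = premises_from_single_path(["GO:A", "GO:B", "GO:C", "GO:D"])
```

In this toy hierarchy only `GO:B` with `GO:C` and `GO:C` with `GO:D` are unrelated pairs, so most multi-term combinations are discarded. Rejecting such combinations early, instead of generating and then filtering them, is the kind of saving the abstract attributes to the modified algorithm.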
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Gruca, A. (2014). Improvement of FP-Growth Algorithm for Mining Description-Oriented Rules. In: Gruca, D., Czachórski, T., Kozielski, S. (eds) Man-Machine Interactions 3. Advances in Intelligent Systems and Computing, vol 242. Springer, Cham. https://doi.org/10.1007/978-3-319-02309-0_19
DOI: https://doi.org/10.1007/978-3-319-02309-0_19
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-02308-3
Online ISBN: 978-3-319-02309-0
eBook Packages: Engineering, Engineering (R0)