Abstract
The optimal directed acyclic graph search problem constitutes searching for a DAG with a minimum score, where the score of a DAG is defined on its structure. This problem is known to be NP-hard, and the state-of-the-art algorithm requires exponential time and space. It is thus not feasible to solve large instances using a single processor. Some parallel algorithms have therefore been developed to solve larger instances. A recently proposed parallel algorithm can solve an instance of 33 vertices, and this is the largest solved size reported thus far. In the study presented in this paper, we developed a novel parallel algorithm designed specifically to operate on a parallel computer with a torus network. Our algorithm crucially exploits the torus network structure, thereby obtaining good scalability. Through computational experiments, we confirmed that a run of our proposed method using up to 20,736 cores showed a parallelization efficiency of 0.94 as compared to a 1296-core run. Finally, we successfully computed an optimal DAG structure for an instance of 36 vertices, which is the largest solved size reported in the literature.
R. Suda—This work was supported by JSPS KAKENHI Grant Number 15K20965. The computational resource of Fujitsu FX10 was awarded by the “Large-scale HPC Challenge” Project, Information Technology Center, the University of Tokyo. This research was conducted using the Fujitsu PRIMEHPC FX10 System (Oakleaf-FX) at the Information Technology Center, the University of Tokyo.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Fujitsu. http://www.fujitsu.com/global/. Accessed 01 11 2015
Information Technology Center, the University of Tokyo. http://www.cc.u-tokyo.ac.jp/. Accessed 01 11 2015
Chaiken, R., Jenkins, B., Larson, P.A., Ramsey, B., Shakib, D., Weaver, S., Zhou, J.: SCOPE: easy and efficient parallel processing of massive data sets. Proc. VLDB Endow. 1(2), 1265–1276 (2008)
Cheng, J., Bell, D.A., Liu, W.: Learning belief networks from data: an information theory based approach. In: Proceedings of the Sixth International Conference on Information and Knowledge Management CIKM 1997, NY, USA, pp. 325–331. ACM, New York (1997)
Chickering, D.M., Geiger, D., Heckerman, D.: Learning Bayesian networks is NP-Hard. Technical report, Citeseer (1994)
Friedman, N., Goldszmidt, M.: Learning Bayesian networks with local structure. In: Jordan, M.I. (ed.) Learning in Graphical Models, vol. 89, pp. 421–459. Springer, Netherlands (1998)
Friedman, N., Linial, M., Nachman, I., Pe’er, D.: Using Bayesian networks to analyze expression data. J. Comput. Biol. 7(3–4), 601–620 (2000)
Heckerman, D., Geiger, D., Chickering, D.M.: Learning Bayesian networks: the combination of knowledge and statistical data. Mach. Learn. 20(3), 197–243 (1995)
Imoto, S., Goto, T., Miyano, S.: Estimation of genetic networks and functional structures between genes by using Bayesian networks and nonparametric regression. In: Pacific symposium on Biocomputing, vol. 7, pp. 175–186. World Scientific (2002)
Italiano, G.F.: Finding paths and deleting edges in directed acyclic graphs. Inf. Process. Lett. 28(1), 5–11 (1988)
Kramer, R., Gupta, R., Soffa, M.L.: The combining DAG: a technique for parallel data flow analysis. IEEE Trans. Parallel Distrib. Syst. 5(8), 805–813 (1994)
Lecca, P.: Methods of biological network inference for reverse engineering cancer chemoresistance mechanisms. Drug Discov. Today 19(2), 151–163 (2014). http://www.sciencedirect.com/science/article/pii/S1359644613003930, system Biology
Nikolova, O., Zola, J., Aluru, S.: Parallel globally optimal structure learning of Bayesian networks. J. Parallel Distrib. Comput. 73(8), 1039–1048 (2013)
Ott, S., Imoto, S., Miyano, S.: Finding optimal models for small gene networks. In: Pacific Symposium on Biocomputing. vol. 9, pp. 557–567. World Scientific (2004)
Tamada, Y., Imoto, S., Miyano, S.: Parallel algorithm for learning optimal Bayesian network structure. J. Mach. Learn. Res. 12, 2437–2459 (2011)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Honda, H., Tamada, Y., Suda, R. (2016). Efficient Parallel Algorithm for Optimal DAG Structure Search on Parallel Computer with Torus Network. In: Carretero, J., Garcia-Blas, J., Ko, R., Mueller, P., Nakano, K. (eds) Algorithms and Architectures for Parallel Processing. ICA3PP 2016. Lecture Notes in Computer Science(), vol 10048. Springer, Cham. https://doi.org/10.1007/978-3-319-49583-5_37
Download citation
DOI: https://doi.org/10.1007/978-3-319-49583-5_37
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-49582-8
Online ISBN: 978-3-319-49583-5
eBook Packages: Computer ScienceComputer Science (R0)