Abstract
The critical cross-section identification problem (CCIP) presents a significant and highly challenging issue in power grid analysis, aiming to identify a partition of the graph into two disjoint cuts that maximize the total weight of the cut. Traditionally, critical cross-sections have been determined through manual experience or mechanistic analysis, and effective intelligent methods to address these issues are lacking. Therefore, we propose a deep reinforcement learning framework based on a greedy approach (DEER) to solve the CCIP problem. Initially, proven to be NP-hard, a greedy vertex merging approach is proposed that enables the acquisition of all CCIP solutions through vertex merging. To prevent the greedy algorithm from converging to local optima, a deep reinforcement learning (DRL) framework combined with vertex marking is proposed to simulate the Markov decision process of vertex merging. Through training the DRL model, repetitive searches for vertex marking can be effectively avoided. Furthermore, the greedy algorithm can be augmented with genetic algorithms to address CCIP. Extensive experiments demonstrate the effectiveness of the proposed methods in addressing CCIP.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Zhou, X., Zhang, Z.: Opinion maximization in social networks via leader selection[C]. In: Proceedings of the ACM Web Conference, pp. 133–142 (2023)
Zhou, H., Sun, F.: Topology path search method of active distribution network based on undirected graph[J]. Guangdong Electric Power 35(11), 64–71 (2022)
Eliasof, M., Haber, E., Treister, E.: Feature transportation improves graph neural networks. In: Proceedings of the AAAI Conference on Artificial Intelligence , vol. 38 no. 11, pp. 11874–11882 (2024)
Grinsztajn, N., Furelos-Blanco, D., Surana, S., et al.: Winner takes it all: training performant RL populations for combinatorial optimization[J]. Adv. Neural Inform. Process. Syst. 36 (2024)
Voigt, B.F.: Der Handlungsreisende, wie er sein soll und was er zu thun hat, um Aufträge zu erhalten und eines glücklichen Erfolgs in seinen Geschäften gewiss zu sein[J], pp. 69–72. Ilmenau, Commis-Voageur (1831)
Garey, M.R., Johnson, D.S.: Computers and intractability[M]. San Francisco: freeman (1979)
Miller, R.E., Muller, D.E.: A problem of maximum consistent subsets[R]. IBM Research Report RC-240, JT Watson Research Center, Yorktown Heights, NY, (1960)
Wang, T., Li, Y., Gu, X., et al.: Identification of the key transmission sections considering optimization of geographical partition boundary for power grids [J]. Trans. China Electrotech. Society 29(04), 220–228+245 (2014)
Zhang, X., Grijalva, S.: Decentralized total transfer capability evaluation using domain decomposition methods[J]. IEEE Trans. Power Syst. 31(5), 3349–3357 (2015)
Min, L., Abur, A.: Total transfer capability computation for multi-area power systems. IEEE Trans. Power Syst. 21(3), 1141–1147 (2006)
Yan, Y., Zhou, Q., He, H., et al.: Subarea division and transmission sections search method based on complex network theory. Electric Power Construct. 38(6), 100–107 (2017)
Papadimitriou, C., Steiglitz, K.: Combinatorial optimization: algorithms and complexity[M]. Courier Corporation (1998)
Lawler, E., Wood, D.: Branch-and-bound methods: a survey. Oper. Res. 14(4), 699–719 (1966)
Bertsekas, D.P.: Dynamic Programming and Optimal Control. Athena Scientific, Belmont, MA (1995)
Sniedovich, M.: Dynamic Programming: Foundations and Principles (Second edition). CRC Press, Boca Raton, FL (2010)
Williamson, D., Shmoys, D.: The Design of Approximation Algorithms. Cambridge University Press, Cambridge (2011)
Vazirani, V.: Approximation Algorithms. Springer, Berlin, Heidelberg (2003)
Hochba, D.: Approximation algorithms for NP-hard problems. ACM SIGACT News 28(2), 40–52 (1997)
Chen, X., Tian, Y.: Learning to perform local rewriting for combinatorial optimization. In: Proceedings of the 33rd Conference on Neural Information Processing Systems. Vancouver, Canada: Curran Associates, Inc., pp. 6278–6289 (2019)
Yolcu, E., Poczos, B.: Learning local search heuristics for Boolean satisfiability. In: Proceedings of the 33rd Conference on Neural Information Processing Systems. Vancouver, Canada: Curran Associates, Inc., pp. 7992–8003 (2019)
Gao, L., Chen, M., Chen, Q., et al.: Learn to design the heuristics for vehicle routing problem. arXiv preprint arXiv: 2002.08539 (2020)
Lu, H., Zhang, X., Yang, S.: A learning-based iterative method for solving vehicle routing problems. In: Proceedings of the 8th International Conference on Learning Representations. Addis Ababa, Ethiopia, (2020)
Karalias, N., Loukas, A.: Erdos goes neural: an unsupervised learning framework for combinatorial optimization on graphs. Adv. Neural. Inf. Process. Syst. 33, 6659–6672 (2020)
Barrett, T., Clements, W., Foerster, J., et al.: Exploratory combinatorial optimization with reinforcement learning. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34 no. 04, pp. 3243–3250 (2020)
Wang, H., Wu, N., Yang, H., et al.: Unsupervised learning for combinatorial optimization with principled objective relaxation. Adv. Neural. Inf. Process. Syst. 35, 31444–31458 (2022)
Aarts, E., Aarts, E., Lenstra, J.: Local search in combinatorial optimization[M]. Princeton University Press (2003)
Sivanandam, S., Deepa, S.: Genetic Algorithms[M], pp. 15–37. Berlin, Heidelberg, Introduction to genetic algorithms. Springer (2008)
Dorigo, M., Birattari, M., Stutzle, T.: Ant colony optimization. IEEE Comput. Intell. Mag. 1(4), 28–39 (2006)
Wu, C., Shankari, K., Kamar, E., et al.: Optimizing the diamond lane: a more tractable carpool problem and algorithms. In: 2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC). IEEE, pp. 1389–1396 (2016)
Croes, G.: A method for solving traveling-salesman problems. Oper. Res. 6(6), 791–812 (1958)
Helsgaun, K.: An effective implementation of the Lin-Kernighan traveling salesman heuristic. Eur. J. Oper. Res. 126(1), 106–130 (2000)
Lin, S., Kernighan, B.: An effective heuristic algorithm for the traveling-salesman problem. Oper. Res. 21(2), 498–516 (1973)
Lin, S.: Computer solutions of the traveling salesman problem. Bell Labs Tech. J. 44(10), 2245–2269 (1965)
Helsgaun, K.: An Extension Of The Lin-kernighan-helsgaun Tsp Solver for Constrained Traveling Salesman nd Vehicle Routing Problems, pp. 24–50. Roskilde University, Roskilde (2017)
Vidal, T.: Hybrid genetic search for the CVRP: Open-source implementation and SWAP* neighborhood. Comput. Oper. Res. 140, 105643 (2022)
Vidal, T., Crainic, T., Gendreau, M., et al.: A hybrid genetic algorithm for multidepot and periodic vehicle routing problems. Oper. Res. 60(3), 611–624 (2012)
Bourel, M., Canale, E., Robledo, F., et al.: Complexity and heuristics for the weighted max cut-clique problem. Int. Trans. Oper. Res. 29(2), 908–928 (2022)
Nogueira, B., Pinheiro, R., Subramanian, A.: A hybrid iterated local search heuristic for the maximum weight independent set problem. Optimiz. Lett. 12(3), 567–583 (2018)
Vinyals, O., Fortunato, M., Jaitly, N.: vertexer networks. Adv. Neural Inform. Process. Systems, vol. 28 (2015)
Li, Z., Chen, Q., Koltun, V.: Combinatorial optimization with graph convolutional networks and guided tree search. Adv. Neural Inform. Process. Syst. vol. 31 (2018)
Selsam, D., Lamm, M., Bünz, B., et al.L Learning a SAT solver from single-bit supervision[C]. In: 7th International Conference on Learning Representations(ICLR 2019) (2019)
Bello, I., Pham, H., Le, Q., et al.: Neural combinatorial optimization with reinforcement learning. arXiv preprint arXiv:1611.09940 (2016)
Khalil, E., Dai, H., Zhang, Y., et al.: Learning combinatorial optimization algorithms over graphs[J]. In: Advances in Neural Information Processing Systems, vol. 30 (2017)
Deudon, M., Cournut, P., Lacoste, A., et al.: Learning heuristics for the tsp by policy gradient. In: International conference on the integration of constraint programming, artificial intelligence, and operations research. Springer, Cham, pp. 170–181 (2018). https://doi.org/10.1007/978-3-319-93031-2_12
Kool, W., Van Hoof, H., Welling, M.: Attention, Learn to Solve Routing Problems!. In: 7th International Conference on Learning Representations (ICLR 2019) (2019)
Hopfield, J., Tank, D.: Neural computation of decisions in optimization problemsD. Biol. Cybern. 52(3), 141–152 (1985)
Smith, K.: Neural networks for combinatorial optimization: a review of more than a decade of research. INFORMS J. Comput. 11(1), 15–34 (1999)
Amizadeh, S., Matusevych, S., Weimer, M.: Learning to solve circuit-SAT: an unsupervised differentiable approach. In: International Conference on Learning Representations (2018)
Álvarez-Miranda, E., Ljubić, I., Mutzel, P.: The Maximum Weight Connected Subgraph Problem, pp. 245–270. Festschrift for Martin Grötschel, Facets of combinatorial optimization (2013)
Wei, Q., Tang, W., Jiang, C., et al.: Online equivalent modeling of active distribution network based on improved reinforcement learning algorithm. Guangdong Electric Power 34(11), 19–26 (2021)
Acknowledgments
This paper was supported by the Science and Technology Project of State Grid: Research on artificial intelligence analysis technology of available transmission capacity (ATC) of the key section under multiple power grid operation modes (5100-202255020A-1-1-ZN).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Liu, H. et al. (2024). Deep Reinforcement Learning Based on Greed for the Critical Cross-Section Identification Problem. In: Xu, C., et al. Data Science. ICPCSEE 2024. Communications in Computer and Information Science, vol 2213. Springer, Singapore. https://doi.org/10.1007/978-981-97-8743-2_9
Download citation
DOI: https://doi.org/10.1007/978-981-97-8743-2_9
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-8742-5
Online ISBN: 978-981-97-8743-2
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)