Abstract
Digitization of scanned Piping and Instrumentation diagrams (P&ID), widely used in manufacturing or mechanical industries such as oil and gas over several decades, has become a critical bottleneck in dynamic inventory management and creation of smart P&IDs that are compatible with the latest CAD tools. Historically, P&ID sheets have been manually generated at the design stage, before being scanned and stored as PDFs. Current digitization initiatives involve manual processing and are consequently very time consuming, labour intensive and error-prone. Thanks to advances in image processing, machine and deep learning techniques there is an emerging body of work on P&ID digitization. However, existing solutions face several challenges owing to the variation in the scale, size and noise in the P&IDs, the sheer complexity and crowdedness within the drawings, domain knowledge required to interpret the drawings and the very minute visual differences among symbols. This motivates our current solution called Digitize-PID which comprises of an end-to-end pipeline for detection of core components from P&IDs like pipes, symbols and textual information, followed by their association with each other and eventually, the validation and correction of output data based on inherent domain knowledge. A novel and efficient kernel-based line detection and a two-step method for detection of complex symbols based on a fine-grained deep recognition technique is presented in the paper. In addition, we have created an annotated synthetic dataset, Dataset-P&ID, of 500 P&IDs by incorporating different types of noise and complex symbols which is made available for public use (currently there exists no public P&ID dataset). We evaluate our proposed method on this synthetic dataset and a real-world anonymized private dataset of 12 P&ID sheets. Results show that Digitize-PID outperforms the existing state-of-the-art for P&ID digitization.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Ablameyko, S., Uchida, S.: Recognition of engineering drawing entities: review of approaches. Int. J. Image Graph. 7, 709–733 (2007)
Baek, Y., Lee, B., Han, D., Yun, S., Lee, H.: Character region awareness for text detection (CRAFT). In: Conference on Computer Vision and Pattern Recognition (CVPR) (2019). https://arxiv.org/abs/1904.01941
Cordella, L., Vento, M.: Symbol recognition in documents: a collection of techniques? IJDAR 3, 73–88 (2000). https://doi.org/10.1007/s100320000036
Elyan, E., Garcia, C.M., Jayne, C.: Symbols classification in engineering drawings. In: 2018 International Joint Conference on Neural Networks (IJCNN), pp. 1–8 (2018). https://doi.org/10.1109/IJCNN.2018.8489087
Ester, M., Kriegel, H.P., Sander, J., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of the Second International Conference on Knowledge Discovery and Data Mining. KDD 1996, pp. 226–231. AAAI Press (1996)
Fu, L., Kara, L.: From engineering diagrams to engineering models: visual recognition and applications. Comput. Aided Des. 43, 278–292 (2011). https://doi.org/10.1016/j.cad.2010.12.011
Gellaboina, M., Venkoparao, V.: Graphic symbol recognition using auto associative neural network model, pp. 297–301 (2009). https://doi.org/10.1109/ICAPR.2009.45
Ishii, M., Ito, Y., Yamamoto, M., Harada, H., Iwasaki, M.: An automatic recognition system for piping and instrument diagrams. Syst. Comput. Jpn. 20, 32–46 (2007). https://doi.org/10.1002/scj.4690200304
Kang, S.O., Lee, E.B., Baek, H.K.: A digitization and conversion tool for imaged drawings to intelligent piping and instrumentation diagrams (P&ID). Energies 12, 2593 (2019). https://doi.org/10.3390/en12132593
Kanungo, T., Haralick, R.M., Dori, D.: Understanding engineering drawings: a survey
Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. CoRR abs/1411.4038 (2014). http://arxiv.org/abs/1411.4038
Moreno-garcía, C., Elyan, E., Jayne, C.: New trends on digitisation of complex engineering drawings. Neural Comput. Appl. 31(6), 1695–1712 (2019). https://doi.org/10.1007/s00521-018-3583-1
Nazemi, A., Murray, I., Mcmeekin, D.: Mathematical information retrieval (MIR) from scanned pdf documents and MathML conversion. IPSJ Trans. Comput. Vis. Appl. 6, 132–142 (2014). https://doi.org/10.2197/ipsjtcva.6.132
Rahul, R., Paliwal, S., Sharma, M., Vig, L.: Automatic information extraction from piping and instrumentation diagrams. In: ICPRAM (2019)
Tian, Z., Huang, W., He, T., He, P., Qiao, Y.: Detecting text in natural image with connectionist text proposal network. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9912, pp. 56–72. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46484-8_4
Weisstein, E.W.: Convex hull. From MathWorld-A Wolfram Web Resource. https://mathworld.wolfram.com/ConvexHull.html
Zhang, D., Lu, G.: Review of shape representation and description techniques. Pattern Recogn. 37, 1–19 (2004). https://doi.org/10.1016/j.patcog.2003.07.008
Zhang, F., Zhai, G., Li, M., Liu, Y.: Three-branch and mutil-scale learning for fine-grained image recognition (TBMSL-NET). arXiv preprint arXiv:2003.09150 (2020)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Paliwal, S., Jain, A., Sharma, M., Vig, L. (2021). Digitize-PID: Automatic Digitization of Piping and Instrumentation Diagrams. In: Gupta, M., Ramakrishnan, G. (eds) Trends and Applications in Knowledge Discovery and Data Mining. PAKDD 2021. Lecture Notes in Computer Science(), vol 12705. Springer, Cham. https://doi.org/10.1007/978-3-030-75015-2_17
Download citation
DOI: https://doi.org/10.1007/978-3-030-75015-2_17
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-75014-5
Online ISBN: 978-3-030-75015-2
eBook Packages: Computer ScienceComputer Science (R0)