Skip to main content

Digitize-PID: Automatic Digitization of Piping and Instrumentation Diagrams

  • Conference paper
  • First Online:
Trends and Applications in Knowledge Discovery and Data Mining (PAKDD 2021)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 12705))

Included in the following conference series:

Abstract

Digitization of scanned Piping and Instrumentation diagrams (P&ID), widely used in manufacturing or mechanical industries such as oil and gas over several decades, has become a critical bottleneck in dynamic inventory management and creation of smart P&IDs that are compatible with the latest CAD tools. Historically, P&ID sheets have been manually generated at the design stage, before being scanned and stored as PDFs. Current digitization initiatives involve manual processing and are consequently very time consuming, labour intensive and error-prone. Thanks to advances in image processing, machine and deep learning techniques there is an emerging body of work on P&ID digitization. However, existing solutions face several challenges owing to the variation in the scale, size and noise in the P&IDs, the sheer complexity and crowdedness within the drawings, domain knowledge required to interpret the drawings and the very minute visual differences among symbols. This motivates our current solution called Digitize-PID which comprises of an end-to-end pipeline for detection of core components from P&IDs like pipes, symbols and textual information, followed by their association with each other and eventually, the validation and correction of output data based on inherent domain knowledge. A novel and efficient kernel-based line detection and a two-step method for detection of complex symbols based on a fine-grained deep recognition technique is presented in the paper. In addition, we have created an annotated synthetic dataset, Dataset-P&ID, of 500 P&IDs by incorporating different types of noise and complex symbols which is made available for public use (currently there exists no public P&ID dataset). We evaluate our proposed method on this synthetic dataset and a real-world anonymized private dataset of 12 P&ID sheets. Results show that Digitize-PID outperforms the existing state-of-the-art for P&ID digitization.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    https://drive.google.com/drive/folders/1Br09_gOKkHsxBOZxH9ojxrnJaxHn333P.

References

  1. Ablameyko, S., Uchida, S.: Recognition of engineering drawing entities: review of approaches. Int. J. Image Graph. 7, 709–733 (2007)

    Article  Google Scholar 

  2. Baek, Y., Lee, B., Han, D., Yun, S., Lee, H.: Character region awareness for text detection (CRAFT). In: Conference on Computer Vision and Pattern Recognition (CVPR) (2019). https://arxiv.org/abs/1904.01941

  3. Cordella, L., Vento, M.: Symbol recognition in documents: a collection of techniques? IJDAR 3, 73–88 (2000). https://doi.org/10.1007/s100320000036

    Article  Google Scholar 

  4. Elyan, E., Garcia, C.M., Jayne, C.: Symbols classification in engineering drawings. In: 2018 International Joint Conference on Neural Networks (IJCNN), pp. 1–8 (2018). https://doi.org/10.1109/IJCNN.2018.8489087

  5. Ester, M., Kriegel, H.P., Sander, J., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of the Second International Conference on Knowledge Discovery and Data Mining. KDD 1996, pp. 226–231. AAAI Press (1996)

    Google Scholar 

  6. Fu, L., Kara, L.: From engineering diagrams to engineering models: visual recognition and applications. Comput. Aided Des. 43, 278–292 (2011). https://doi.org/10.1016/j.cad.2010.12.011

    Article  Google Scholar 

  7. Gellaboina, M., Venkoparao, V.: Graphic symbol recognition using auto associative neural network model, pp. 297–301 (2009). https://doi.org/10.1109/ICAPR.2009.45

  8. Ishii, M., Ito, Y., Yamamoto, M., Harada, H., Iwasaki, M.: An automatic recognition system for piping and instrument diagrams. Syst. Comput. Jpn. 20, 32–46 (2007). https://doi.org/10.1002/scj.4690200304

    Article  Google Scholar 

  9. Kang, S.O., Lee, E.B., Baek, H.K.: A digitization and conversion tool for imaged drawings to intelligent piping and instrumentation diagrams (P&ID). Energies 12, 2593 (2019). https://doi.org/10.3390/en12132593

    Article  Google Scholar 

  10. Kanungo, T., Haralick, R.M., Dori, D.: Understanding engineering drawings: a survey

    Google Scholar 

  11. Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. CoRR abs/1411.4038 (2014). http://arxiv.org/abs/1411.4038

  12. Moreno-garcía, C., Elyan, E., Jayne, C.: New trends on digitisation of complex engineering drawings. Neural Comput. Appl. 31(6), 1695–1712 (2019). https://doi.org/10.1007/s00521-018-3583-1

    Article  Google Scholar 

  13. Nazemi, A., Murray, I., Mcmeekin, D.: Mathematical information retrieval (MIR) from scanned pdf documents and MathML conversion. IPSJ Trans. Comput. Vis. Appl. 6, 132–142 (2014). https://doi.org/10.2197/ipsjtcva.6.132

    Article  Google Scholar 

  14. Rahul, R., Paliwal, S., Sharma, M., Vig, L.: Automatic information extraction from piping and instrumentation diagrams. In: ICPRAM (2019)

    Google Scholar 

  15. Tian, Z., Huang, W., He, T., He, P., Qiao, Y.: Detecting text in natural image with connectionist text proposal network. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9912, pp. 56–72. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46484-8_4

    Chapter  Google Scholar 

  16. Weisstein, E.W.: Convex hull. From MathWorld-A Wolfram Web Resource. https://mathworld.wolfram.com/ConvexHull.html

  17. Zhang, D., Lu, G.: Review of shape representation and description techniques. Pattern Recogn. 37, 1–19 (2004). https://doi.org/10.1016/j.patcog.2003.07.008

    Article  Google Scholar 

  18. Zhang, F., Zhai, G., Li, M., Liu, Y.: Three-branch and mutil-scale learning for fine-grained image recognition (TBMSL-NET). arXiv preprint arXiv:2003.09150 (2020)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shubham Paliwal .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Paliwal, S., Jain, A., Sharma, M., Vig, L. (2021). Digitize-PID: Automatic Digitization of Piping and Instrumentation Diagrams. In: Gupta, M., Ramakrishnan, G. (eds) Trends and Applications in Knowledge Discovery and Data Mining. PAKDD 2021. Lecture Notes in Computer Science(), vol 12705. Springer, Cham. https://doi.org/10.1007/978-3-030-75015-2_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-75015-2_17

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-75014-5

  • Online ISBN: 978-3-030-75015-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics