Skip to main content

Template Based Industrial Big Data Information Extraction and Query System

  • Conference paper
  • First Online:
Data Mining and Big Data (DMBD 2017)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10387))

Included in the following conference series:

Abstract

Currently, with the rapid development of industry, the amount of data generated by industrial enterprises and industrial business website is exponential growth, and the big data has different types. In this paper, we design and implement an industrial big data information acquisition and query system. The system is based on big data acquisition and analysis of industry news data and industrial products data. We use a template based information acquisition method to crawl data from industry related news data and industry products data. We also discuss the query performance of text industry data with text index and only by SQL without index. The system is useful for analysis the hot news in industrial field and industry public opinion, and it is also useful for providing reference and rapid search and comparison of the relevant industrial products price, inventory and other information.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Viktor, M.S., Kenneth, C.: Big Data: A Revolution That Will Trans-Form How We Live, Work, and Think. Houghton Mifflin Harcourt, Boston (2013)

    Google Scholar 

  2. Jsoup Open Source Project Distributed under the Liberal MIT License. http://jsoup.org/

  3. Wang, J., Wu, J., Zhang, Y., He, G.: Content information extraction of theme web pages based on tag information. In: 7th IEEE International Symposium on Computational Intelligence and Design, pp. 501–504. IEEE Press, Los Alamitos, CA (2015)

    Google Scholar 

  4. He, G., Wang, J., Zhang, Y., Peng, Y.: Keyword extraction of web pages based on domain thesaurus. In: 3th IEEE International Conference on Cloud Computing and Intelligence Systems, pp. 310–315. IEEE Press, Los Alamitos, CA (2014)

    Google Scholar 

  5. Theobald, M., Schenkel, R., Weikum, G.: Classification and focused crawling for semistructured data. In: Blanken, H., Grabs, T., Schek, H.-J., Schenkel, R., Weikum, G. (eds.) Intelligent Search on XML Data. LNCS, vol. 2818, pp. 145–157. Springer, Heidelberg (2003). doi:10.1007/978-3-540-45194-5_10

    Chapter  Google Scholar 

  6. Wang, J., Yang, S., Wang, Y., Han, C.: The crawling and analysis of agricultural products big data based on Jsoup. In: 12th IEEE International Conference on Fuzzy Systems and Knowledge Discovery, pp. 1231–1236. IEEE Press, Los Alamitos, CA (2016)

    Google Scholar 

  7. Bootstrap Front-end Frameworks and Open Source Projects licensed by MIT. http://getbootstrap.com/

  8. Jia, M., Xu, H., Wang, J., Bai, Y., Liu, B., Wang, J.: Handling big data of online social networks on a small machine. J. Comput. Soc. Netw. 2(1), 1–12 (2015)

    Article  Google Scholar 

Download references

Acknowledgments

This work was supported by the Key Laboratory of machine intelligence and advanced computing (MSC-201707A); Capital Normal University interdisciplinary research project; Capital Normal University science and technology innovation platform project.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jie Wang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Wang, J., Peng, Y., Lin, Y., Wang, K. (2017). Template Based Industrial Big Data Information Extraction and Query System. In: Tan, Y., Takagi, H., Shi, Y. (eds) Data Mining and Big Data. DMBD 2017. Lecture Notes in Computer Science(), vol 10387. Springer, Cham. https://doi.org/10.1007/978-3-319-61845-6_25

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-61845-6_25

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-61844-9

  • Online ISBN: 978-3-319-61845-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics