Skip to main content

Design and Implementation of Analyzer Management System Based on Elasticsearch

  • Conference paper
  • First Online:
Web Information Systems and Applications (WISA 2022)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13579))

Included in the following conference series:

Abstract

Elasticsearch is one of the most popular full-text search and analytics engine. It can store, search, and analyze big volumes of data in near real time. Before searching, Elasticsearch will build multiple inverted indexes for the data. The analyzer plays a crucial role in this process. An appropriate analyzer can segment text into semantically meaningful words, which can significantly improve the query accuracy. However, the default analyzer has limited performance in the Chinese context. Existing methods generally replace the default analyzer with manual configuration to optimize the query effect. The cost of service downtime caused by manually updating analyzers is often unacceptable in production environments. Based on Elasticsearch’s Restful-API, we have implemented a framework for dynamic configuration of analyzers in a cluster environment. The framework supports common Chinese analyzers and provides a visual interface. Experiments show that the framework proposed in this paper reduces the update and maintenance time cost of the analyzer in the online environment by 94% compared to manual update. At the same time, compared with the default analyzer configuration of Elasticsearch, the accuracy of the system based on this framework is improved by 30%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 89.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 119.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Elasticsearch Guide. https://www.elastic.co/guide/index.html.. Accessed 19 Mar 2022

  2. Divya, M.S., Goyal, S.K.: ElasticSearch: an advanced and quick search technique to handle voluminous data [J]. Compusoft 2(6), 171 (2013)

    Google Scholar 

  3. Qiu, Q., Xie, Z., Liang W., Li, W.: DGeoSegmenter: a dictionary-based Chinese word segmenter for the geoscience domain[J]. Comput. Geosci. 121, 1–11 2018

    Google Scholar 

  4. Huihui, S., Xin. N.: Research and Implementation of Chinese Automatic Word Segmentation System Based on Complex Network Features [J]. Wireless Commun. Mobile Comput. 2022, 1–10 (2022)

    Google Scholar 

  5. Lu, H., Hong, Y., Yang, Y., Duan, L., Badar, N.: Towards user-oriented RBAC model. In: Wang, L., Shafiq, B. (eds.) DBSec 2013. LNCS, vol. 7964, pp. 81–96. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-39256-6_6

    Chapter  Google Scholar 

  6. Takase, W., Nakamura, T., Watase, Y., et al.: A solution for secure use of Kibana and Elasticsearch in multi-user environment[J]. arXiv preprint arXiv:1706.10040 (2017)

  7. Wei, B., Dai, J., Deng, L., et al.: An Optimization Method for Elasticsearch Index Shard Number[C]. In: 2020 16th International Conference on Computational Intelligence and Security (CIS). IEEE, pp. 191–195 (2020)

    Google Scholar 

  8. Reelsen A. Using elasticsearch, logstash and kibana to create realtime dashboards J]

    Google Scholar 

  9. Varis, J.: Enhancing a product’s development and debugging with supporting product development [J] (2020)

    Google Scholar 

  10. Jibiao, J., Xinghua, Q.I.: Design and realization of online query system of Chinese-English terminology of Chinese medicine [J]. China Terminol. 24(2), 92 (2022)

    Google Scholar 

  11. Concepcion, A.I., Zeigler, B.P.: DEVS formalism: a framework for hierarchical model development[J]. IEEE Trans. Softw. Eng. 14(2), 228–241 (1988)

    Article  Google Scholar 

  12. Cruz, J.P., Kaji, Y., Yanai, N.: RBAC-SC: role-based access control using smart contract [J]. Ieee Access 6, 12240–12251 (2018)

    Article  Google Scholar 

  13. William Stallings. Role-Based Access Control in Computer Security. https://www.informit.com/articles/article.aspx?p=782116.. Accessed 19 Mar 2022

  14. Pi, C., Nie, P., Feng, Y., Xu, L.: Design and implementation of annotation system for character behavior & event based on spring boot. In: Xing, C., Fu, X., Zhang, Y., Zhang, G., Borjigin, C. (eds.) WISA 2021. LNCS, vol. 12999, pp. 756–763. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-87571-8_66

    Chapter  Google Scholar 

  15. Chinotan. SpringBoot Actuator. https://my.oschina.net/u/3266761/blog/2960. Accessed 19 Mar 2022

  16. Dhulavvagol, P.M., Bhajantri, V.H., Totad, S.G.: Performance Analysis of Distributed Processing System using Shard Selection Techniques on Elasticsearch [J]. Procedia Comput. Sci 167, 126–136 (2020)

    Google Scholar 

  17. Fan Zhang., F.: Design and Implementation of Physical Education Video Teaching System Based on Spring MVC Architecture [C]. Durham University, Suffolk University. In: Proceedings of the 4th International Conference on Information and Education Innovations (ICIEI 2019. Durham University, Suffolk University: SCIence and Engineering Institute (SCIEI), pp. 117–120 (2019)

    Google Scholar 

  18. Coronel, J.B., Mock, S.: Designsafe: using elasticsearch to share and search data on a science web portal[M]. In: Proceedings of the Practice and Experience in Advanced Research Computing 2017 on Sustainability, Success and Impact, pp. 1–3 (2017)

    Google Scholar 

  19. Arora, M., Kanjilal, U., Varshney, D.: Evaluation of information retrieval: precision and recall [J]. Int. J. Indian Cult. Bus. Manage. 12(2), 224–236 (2016)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Peng Nie .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Sun, J., Nie, P., Xu, L., Zhang, H. (2022). Design and Implementation of Analyzer Management System Based on Elasticsearch. In: Zhao, X., Yang, S., Wang, X., Li, J. (eds) Web Information Systems and Applications. WISA 2022. Lecture Notes in Computer Science, vol 13579. Springer, Cham. https://doi.org/10.1007/978-3-031-20309-1_22

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-20309-1_22

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-20308-4

  • Online ISBN: 978-3-031-20309-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics