Skip to main content

Improving Semantic Clustering of EWID Reports by Using Heterogeneous Data Types

  • Conference paper
Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing (RSFDGrC 2013)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8170))

  • 1170 Accesses

Abstract

In this article we investigate an impact of inclusion of different data types into a clustering process. As a case-study we use reports from the EWID database which is a system used by Polish State Fire Service for documenting incidents. Each incident reported in that database is characterized by a set of quantitative attributes and by natural language descriptions of the cause, scene and the course of actions undergone by firefighters. We show that the utilization of both of those data types for a clustering purpose can be beneficial in terms of semantic homogeneity of the resulting groups. We argue that such clusters might serve as a useful tool in the firefighters’ training process.

This work was partially supported by the Polish National Science Centre grants 2011/01/B/ST6/03867 and 2012/05/B/ST6/03215, and by the Polish National Centre for Research and Development (NCBiR) - grant O ROB/0010/03/001 under Defence and Security Programmes and Projects: “Modern engineering tools for decision support for commanders of the State Fire Service of Poland during Fire&Rescue operations in buildings”.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Rousseeuw, P.: Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. Journal of Computational and Applied Mathematics 20, 53–65 (1987)

    Article  MATH  Google Scholar 

  2. Caliński, T., Harabasz, J.: A dendrite method for cluster analysis. Communications in Statistics-Theory and Methods 3(1), 1–27 (1974)

    Article  MATH  Google Scholar 

  3. Holmes, M., Wang, Y., Ziedins, I.: The application of data mining tools and statistical techniques to identify patterns and changes in fire events. Technical Report 95, New Zealand Fire Service, Auckland, NZ (May 2009)

    Google Scholar 

  4. Xiangxin, L.: Rational judging method of fire station layout based on data mining. In: 2nd IEEE International Conference on Emergency Management and Management Sciences (ICEMMS), pp. 455–458. IEEE (2011)

    Google Scholar 

  5. Krasuski, A., Kreński, K., Łazowy, S.: A method for estimating the efficiency of commanding in the State Fire Service of Poland. Fire Technology 48(4), 795–805 (2012)

    Article  Google Scholar 

  6. Kreński, K., Krasuski, A., Łazowy, S.: Data mining and shallow text analysis for the data of State Fire Service. In: Proceedings of Concurrency, Specification and Programming - XXth International Workshop, CS&P 2011, Białystok University of Technology, pp. 313–321 (2011)

    Google Scholar 

  7. Krasuski, A., Ślęzak, D., Kreński, K., Łazowy, S.: Granular knowledge discovery framework. In: Pechenizkiy, M., Wojciechowski, M. (eds.) New Trends in Databases & Inform. AISC, vol. 185, pp. 109–118. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  8. Elzinga, P., Poelmans, J., Viaene, S., Dedene, G., Morsing, S.: Terrorist threat assessment with formal concept analysis. In: Proceedings of the 2010 IEEE International Conference on Intelligence and Security Informatics (ISI), pp. 77–82. IEEE (2010)

    Google Scholar 

  9. Poelmans, J., Elzinga, P., Viaene, S., Hulle, M.M.V., Dedene, G.: Gaining insight in domestic violence with emergent self organizing maps. Expert Syst. Appl. 36, 11864–11874 (2009)

    Article  Google Scholar 

  10. Poelmans, J., Elzinga, P., Dedene, G., Viaene, S., Kuznetsov, S.: A concept discovery approach for fighting human trafficking and forced prostitution. In: Andrews, S., Polovina, S., Hill, R., Akhgar, B. (eds.) ICCS-ConceptStruct 2011. LNCS, vol. 6828, pp. 201–214. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  11. Szczuka, M.S., Janusz, A.: Semantic clustering of scientific articles using explicit semantic analysis. T. Rough Sets 16, 83–102 (2013)

    Article  Google Scholar 

  12. Graeger, A., Cimolino, U., de Vries, H., Sümersen, J.: Einsatz-und Abschnittsleitung: Das Einsatz-Führungs-System (EFS). Ecomed Sicherheit (2009)

    Google Scholar 

  13. Krasuski, A., Wasilewski, P.: The Detection of Outlying Fire Service’s Reports. The FCA Driven Analytics. In: Cellier, P., Distel, F. (eds.) Contributions to the 11th International Conference on Formal Concept Analysis, TU Dresden, pp. 35–50 (2013)

    Google Scholar 

  14. R Development Core Team: R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria (2008)

    Google Scholar 

  15. Kaufman, L., Rousseeuw, P.: Finding Groups in Data: An Introduction to Cluster Analysis. Wiley Interscience, New York (1990)

    Book  Google Scholar 

  16. Janusz, A., Ślęzak, D., Nguyen, H.S.: Unsupervised similarity learning from textual data. Fundamenta Informaticae 119(3), 319–336 (2012)

    MathSciNet  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Janusz, A., Krasuski, A., Szczuka, M. (2013). Improving Semantic Clustering of EWID Reports by Using Heterogeneous Data Types. In: Ciucci, D., Inuiguchi, M., Yao, Y., Ślęzak, D., Wang, G. (eds) Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing. RSFDGrC 2013. Lecture Notes in Computer Science(), vol 8170. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41218-9_33

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-41218-9_33

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-41217-2

  • Online ISBN: 978-3-642-41218-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics