Skip to main content
Log in

Data clustering and analyzing techniques using hierarchical clustering method

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Data clustering and analyzing techniques are studied by using hierarchical clustering method. A matrix of words is constructed with a randomly chosen RSS list. By collecting data from this list a matrix is built. In the matrix each row corresponds to a article and each column represents a word. Based on the matrix a hierarchical clustering algorithm is designed. In this algorithm the Pearson correlation coefficient is used to compute the distances among different contents. The dendrogram is used to describe the hierarchical relationship of contents and words. And the 2-D graph also is used to represent the dendrogram in another format.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

References

  1. Capota M, Andrade N, Pouwelse J, et al. (2013) Investment strategies for credit-based P2P communities. Parallel, Distributed and Network-Based Processing (PDP), 2013 21st Euromicro International Conference on. IEEE:437–443

  2. Getahun F, Chbeir R (2013) RSS query algebra: towards a better news management[J]. Inf Sci 237:313–342

    Article  Google Scholar 

  3. Hoffman D, Wang H Y, Chang M, et al. (2009) Grammar based testing of html injection vulnerabilities in rss feeds. Testing: Academic and Industrial Conference-Practice and Research Techniques, 2009. TAIC PART’09. IEEE: 105–110

  4. Hoffman D, Wang HY, Chang M et al (2010) Two case studies in grammar-based test generation. J Syst Softw 83(12):2369–2378

    Article  Google Scholar 

  5. Kim S, Kim BC, Lee JY (2013) DDoS analysis using correlation coefficient based on kolmogorov complexity. Grid Pervasive Comput 7861:443–452

    Article  Google Scholar 

  6. Parimala M, Lopez D, Senthilkumar NC (2011) A survey on density based clustering algorithms for mining large spatial databases. Int J Adv Sci Technol 31:59–66

    Google Scholar 

  7. Segaran T, O’Brien MT (2007) Programming Collective Intelligence, O’Reilly Media, Sebastopol

  8. Shi P (2009) An efficient approach for clustering web access patterns from web logs. Int J Adv Sci Technol 5:1–13

    Google Scholar 

  9. Wang L, Wang T, Yang R (2009) Data compression algorithm based on hierarchical cluster model for sensor networks. Int J Adv Sci Technol 2:72–84

    Google Scholar 

Download references

Acknowledgments

This research is supported by Natural Science Foundation of Heilongjiang Province (F201034); Scientific Research Foundation for Doctor of Harbin University of Commerce (12DL024)

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Qing he Pan.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Hu, W., he Pan, Q. Data clustering and analyzing techniques using hierarchical clustering method. Multimed Tools Appl 74, 8495–8504 (2015). https://doi.org/10.1007/s11042-013-1611-9

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-013-1611-9

Keywords

Navigation