Abstract
Fundamental and technical analysis that investors generally use to select the best stocks cannot provide information regarding the similarity of stock price characteristics of companies in one sector. Even though companies are in the same sector, each company has a different ability to earn profits and overcome financial difficulties. So, clustering is done to find the stock prices of companies with the same characteristics in one sector. This research uses data on industrial and consumer sector companies’ stock prices because the industrial and consumer sectors are one of the largest sectors in Indonesia. The variables used in this research are open, close, and HML (High Minus Low) stock prices. Several clustering methods that can be used to cluster time series data are Fuzzy C-Means and Fuzzy C-Medoids. In addition, this research also uses several approaches with ACF (Autocorrelation Function) and PACF (Partial Autocorrelation Function), which can handle data with high dimensions and allow the comparison of time series data with different lengths. Based on the highest FS (Fuzzy Silhouette) value, the empirical results show that the two best methods for clustering open, close, and HML stock prices are Fuzzy C-Means and Fuzzy C-Medoids. The clustering results using Fuzzy C-Means are the same for open stock prices and close stock prices data. Meanwhile, there are different clustering results for HML stock price data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
BPS: Berita Resmi Statistik. Badan Pusat Statistik, Indonesia (2022)
Nair, B.B., Saravana Kumar, P.K., Sakthivel, N.R., Vipin, U.: Clustering stock price time series data to generate stock trading recommendations: an empirical study. Expert Syst. Appl0 70, 20–36 (2017). https://doi.org/10.1016/j.eswa.2016.11.002
Liao, T.W.: clustering of time series data - a survey. Pattern Recogn. 38, 1857–1874 (2005)
D’Urso, P. : Fuzzy clustering. Handbook of Cluster Analysis, pp. 545–574. Taylor & Francis Group, LLC, Boca Raton (2016)
Liu, Y., Chen, J., Shuai, W., Liu, Z., Chao, H.: Incremental Fuzzy C Medoids clustering of time series data using dynamic time warping distance. PLoS One 13(5), e0197499 (2018). https://doi.org/10.1371/journal.pone.0197499
D’Urso, P., De Giovanni, L., Massari, R., D’Ecclesia, R.L., Maharaj, E.A.: Cepstral-based clustering of financial time series. Expert Syst. Appl. 161, 113705 (2020). https://doi.org/10.1016/j.eswa.2020.113705
D’Urso, P., Giovanni, L.D., Massari, R.: GARCH-based robust clustering of time series. Fuzzy Sets Syst. 305, 1–28 (2016)
Long, N.C., Wisitpongphan, N., Meesad, P., Unger, H.: Clustering stock data for multi-objective portfolio optimization. Int. J. Comput. Intell. Appl. 13(02), 1450011 (2014). https://doi.org/10.1142/S1469026814500114
D’Urso, P., Maharaj, E.A.: Autocorrelation-based fuzzy clustering of time series. Fuzzy Sets Syst. 160(24), 3565–3589 (2009)
Cryer, J.D., Chan, K.-S.: Time Series Analysis with Applications in R, 2nd edn. Springer Science+Business Media, LLC, New York (2008)
Ke, Z., Zhang, Z.: Testing autocorrelation and partial autocorrelation: asymptotic methods versus resampling techniques. Br. J. Math. Stat. Psychol. 71(1), 96–116 (2018)
Wedel, M., Kamakura, W.A.: Market Segment: Conceptual and Methodological Foundations, 2nd edn. Kluwer Academic Publishers (2000)
Kruse, R., Döring, C., Lesot, M.-J.: Fundamentals of fuzzy clustering. In: Advances in Fuzzy Clustering and its Applications, pp. 3–30. John Wiley & Sons Ltd (2007)
Maharaj, E.A., D’Urso, P., Caiado, J.: Time Series Clustering and Classification. Taylor & Francis Group, LLC, Boca Raton (2019)
McBratney, A.B., Moore, A.W.: Application of fuzzy sets to climatic classification. Agric. For. Meteorol. 35, 165–185 (1985)
Heiser, W.J., Groenen, P.J.F.: Cluster differences scaling with a within-clusters loss component and a fuzzy successive approximation strategy to avoid local minima. Psychometrika 62, 63–83 (1997)
Everitt, B.S., Landau, S., Leese, M., Stahl, D.: Cluster Analysis, 5th edn. John Wiley & Sons Ltd, London (2011)
D’Urso, P.: Fuzzy clustering for data time arrays with inlier and outlier time trajectories. IEEE Trans. Fuzzy Syst. 13(5), 583–604 (2005)
Dunn, J.C.: A fuzzy relative of the ISODATA process and its use in detecting compact well-separated clusters. J. Cybern. 3(3), 32–57 (1974)
Bezdek, J.C.: Numerical taxonomy with fuzzy sets. J. Math. Biol. 1, 57–71 (1974)
MacQueen, J.: Some methods for classification and analysis of multivariate observations. In: Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, vol. 1, pp. 281-297 (1967)
Bezdek, J.C.: Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum Press, New York (1981)
Krishnapuram, R., Joshi, A., Yi, L.: A fuzzy relative of the k-Medoids algorithm with application to web document and snippet clustering. In: 1999 IEEE International Fuzzy Systems Conference Proceedings, vol. 3, pp. 1281–1286 (1999)
Krishnapuram, R., Joshi, A., Nasraoui, O., Yi, L.: Low-complexity fuzzy relational clustering algorithms for web mining. IEEE Trans. Fuzzy Syst. 9(4), 595–607 (2001)
Suhartono, N.S., Prastyo, D.D.: Design of experiment to optimize the architecture of deep learning for nonlinear time series forecasting. Procedia Comput. Sci. 144, 269–276 (2018). https://doi.org/10.1016/j.procs.2018.10.528
Suhartono, P.D., Saputri, F.F., Amalia, D.D., Prastyo, B.S., Ulama, S.: Model selection in feedforward neural networks for forecasting inflow and outflow in Indonesia. In: Mohamed, A., Berry, M.W., Yap, B.W. (eds.) Soft Computing in Data Science, pp. 95–105. Springer Singapore, Singapore (2017). https://doi.org/10.1007/978-981-10-7242-0_8
Prastyo, D.D., Nabila, F.S., Suhartono, M.H., Lee, N.S., Fam, S.-F.: VAR and GSTAR-based feature selection in support vector regression for multivariate Spatio-temporal forecasting. In: Yap, B.W., Mohamed, A.H., Berry, M.W. (eds.) Soft Computing in Data Science: 4th International Conference, SCDS 2018, Bangkok, Thailand, August 15-16, 2018, Proceedings, pp. 46–57. Springer Singapore, Singapore (2019). https://doi.org/10.1007/978-981-13-3441-2_4
Dubes, R., Jain, A.K.: Clustering techniques: the user’s dilemma. Pattern Recogn. 8, 247–260 (1976)
Sardá-Espinosa, A.: Time-series clustering in R Using the dtwclust Package. The R Journal 11(1), 22 (2019). https://doi.org/10.32614/RJ-2019-023
Dave, R.N.: Validating fuzzy partitions obtained through C-shells clustering. Pattern Recogn. Lett. 17(6), 613–623 (1996)
Kwon, S.: Cluster validity index for fuzzy clustering. Electron. Lett. 34(22), 2176–2177 (1998)
Tang, Y., Sun, F., Sun, Z.: Improved validation index for fuzzy clustering. In: American Control Conference, Portland (2005)
Zahid, N., Limouri, M., Essaid, A.: A new cluster-validity for fuzzy clustering. Pattern Recogn. 32(7), 1089–1097 (1999)
Pakhira, M.K., Bandyopadhyay, S., Maulik, U.: Validity index for crisp and fuzzy clusters. Pattern Recogn. 37(3), 487–501 (2004)
Wang, W., Zhang, Y.: On fuzzy cluster validity indices. Fuzzy Sets Syst. 158(19), 2095–2117 (2007)
Campello, R.J.G.B., Hruschka, E.R.: A Fuzzy extension of the silhouette width criterion for cluster analysis. Fuzzy Sets Syst. 157, 2858–2875 (2006)
Rahayu, S.P., Febryani, S., Kusuma, H.B., Suhartono, Prastyo, D.D.: The approach of Fuzzy C-Means cluster and factor analysis on economic potential mapping of regency/city in east java province. In: International Conference on Science and Applied Science (ICSAS) 2018, Surakarta, Indonesia (2018)
Acknowledgment
This research is supported by Deputi Bidang Penguatan Riset dan Pengembangan, Kementerian Pendidikan, Kebudayaan, Riset, dan Teknologi & Badan Riset dan Inovasi Nasional under the scheme Penelitian Dasar, number of contract 008/E5/PG.02.00.PT/2022 & 1506/PKS/ITS/2022. The authors thank DRPM ITS for the support. The authors also thank the reviewers whose comments and valuable suggestions helped improve the quality of this paper.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Muda, M.A., Prastyo, D.D., Akbar, M.S. (2023). Clustering Stock Prices of Industrial and Consumer Sector Companies in Indonesia Using Fuzzy C-Means and Fuzzy C-Medoids Involving ACF and PACF. In: Yusoff, M., Hai, T., Kassim, M., Mohamed, A., Kita, E. (eds) Soft Computing in Data Science. SCDS 2023. Communications in Computer and Information Science, vol 1771. Springer, Singapore. https://doi.org/10.1007/978-981-99-0405-1_20
Download citation
DOI: https://doi.org/10.1007/978-981-99-0405-1_20
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-0404-4
Online ISBN: 978-981-99-0405-1
eBook Packages: Computer ScienceComputer Science (R0)