Skip to main content

Genetic Algorithm-Based Matrix Factorization for Missing Value Prediction

  • Conference paper
  • First Online:

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 776))

Abstract

Sparsity is a major problem in the areas like data mining and pattern recognition. In recommender systems, predictions based on these few observations lead to avoidance of inherent latent features of the user corresponding to the item. Similarly, in different crowdsourcing based opinion aggregation models, there is a minimal chance to obtain opinions from all the crowd workers. Even this sparsity problem has an extensive effect in predicting actual rating of a particular item due to limited and incomplete observations. To deal with this issue, in this article, a genetic algorithm based matrix factorization technique is proposed to estimate the missing entries in the response matrix that contains workers’ responses over some questions. We have created three synthetic datasets and used one real-life dataset to show the efficacy of the proposed method over the other state-of-the-art approaches.

This is a preview of subscription content, log in via an institution.

References

  1. Adomavicius, G., Tuzhilin, A.: Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions. IEEE Trans. Knowl. Data Eng. 6, 734–749 (2005)

    Article  Google Scholar 

  2. Berry, M., Browne, M., Langville, A., Pauca, V., Plemmons, R.: Algorithms and applications for approximate nonnegative matrix factorization. Comput. Stat. Data Anal. 52(2), 155–173 (2007)

    Article  MathSciNet  MATH  Google Scholar 

  3. Brabham, D.C.: Detecting stable clusters using principal component analysis. Methods Mol. Biol. 224(10), 159–182 (2013)

    Google Scholar 

  4. Brunet, J., Tamayo, P., Golub, T., Mesirov, J.: Metagenes and molecular pattern discovery using matrix factorization. Proc. Nat. Acad. Sci. 101, 1464–1469 (2004)

    Article  Google Scholar 

  5. Christidis, K., Mentzas, G.: A topic-based recommender system for electronic marketplace platforms. Expert Syst. Appl. 40(11), 4370–4379 (2013)

    Article  Google Scholar 

  6. Cichocki, A., Zdunek, R., Amari, S.: Hierarchical ALS algorithms for nonnegative matrix and 3D tensor factorization. In: Davies, M.E., James, C.J., Abdallah, S.A., Plumbley, M.D. (eds.) ICA 2007. LNCS, vol. 4666, pp. 169–176. Springer, Heidelberg (2007). doi:10.1007/978-3-540-74494-8_22

    Chapter  Google Scholar 

  7. Demartini, G., Difallah, D.E., Mauroax, C.: Zencrowd: leveraging probabilistic reasoning and crowdsourcing techniques for large scale entity linking. In: Proceedings of the 21st International Conference on World Wide Web, Lyon, France, pp. 469–478 (2012)

    Google Scholar 

  8. Friedman, A., Berkovsky, S., Kaafar, M.A.: A differential privacy framework for matrix factorization recommender systems. User Model. User-Adap. Inter. 26(5), 425–458 (2016)

    Article  Google Scholar 

  9. Kim, J., He, Y., Park, H.: Algorithms for nonnegative matrix and tensor factorizations: a unified view based on block coordinate descent framework. J. Global Optim. 58(2), 285–319 (2014)

    Article  MathSciNet  MATH  Google Scholar 

  10. Kim, J., Park, H.: Fast nonnegative matrix factorization: an active-set-like method and comparisons. SIAM J. Sci. Comput. (SISC) 33(6), 3261–3281 (2011)

    Article  MathSciNet  MATH  Google Scholar 

  11. Kittur, A., Nickerson, J.V., Bernstein, M., Gerber, E., Shaw, A., Zimmerman, J., Lease, M., Horton, J.: The future of crowd work. In: Proceedings of the CSCW, pp. 1301–1318 (2013)

    Google Scholar 

  12. Lee, D., Seung, H.: Learning the parts of objects by non-negative matrix factorization. Nature 401, 788–791 (1999)

    Article  MATH  Google Scholar 

  13. Lee, D., Seung, H.: Algorithms for non-negative matrix factorization. Proc. Adv. Neural Inf. Process. Syst. 13, 556–562 (2001)

    Google Scholar 

  14. Lin, C.J.: Projected gradient methods for nonnegative matrix factorization. Neural Comput. 19(10), 2756–2779 (2007)

    Article  MathSciNet  MATH  Google Scholar 

  15. Luo, X., Liu, H., Gou, G., Xia, Y., Zhu, Q.: A parallel matrix factorization based recommender by alternating stochastic gradient decent. Eng. Appl. Artif. Intell. 25(7), 1403–1412 (2012)

    Article  Google Scholar 

  16. Maulik, U., Bandyopadhyay, S.: Genetic algorithm based clustering technique. Pattern Recogn. 32, 1455–1465 (2000)

    Article  Google Scholar 

  17. Mukhopadhyay, A., Maulik, U., Bandyopadhyay, S.: A survey of multiobjective evolutionary clustering. ACM Comput. Surv. 47(4), 61:1–61:46 (2015)

    Article  Google Scholar 

  18. Paatero, P.: Least squares formulation of robust non-negative factor analysis. Chemometr. Intell. Lab. Syst. 37(1), 23–35 (1997)

    Article  Google Scholar 

  19. Paatero, P., Tapper, U.: Positive matrix factorization: a non-negative factor model with optimal utilization of error estimates of data value. Environmetrics 5(2), 111–126 (1994)

    Article  Google Scholar 

  20. Ross, J., Irani, L., Silberman, M., Zaldivar, A., Tomilson, B.: Who are the crowdworkers? Shifting demographics in mechanical turk. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 2863–2872 (2010)

    Google Scholar 

  21. Xu, W., Liu, X., Gong, Y.: Document clustering based on non-negative matrix factorization. In: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, vol. 101, pp. 267–273. ACM Press (2003)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sujoy Chatterjee .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Chatterjee, S., Mukhopadhyay, A. (2017). Genetic Algorithm-Based Matrix Factorization for Missing Value Prediction. In: Mandal, J., Dutta, P., Mukhopadhyay, S. (eds) Computational Intelligence, Communications, and Business Analytics. CICBA 2017. Communications in Computer and Information Science, vol 776. Springer, Singapore. https://doi.org/10.1007/978-981-10-6430-2_39

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-6430-2_39

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-6429-6

  • Online ISBN: 978-981-10-6430-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics