ABSTRACT
Collaborative filtering (CF) algorithm uses the preferences expressed by previous users of items being studied and is widely applied to build recommender systems. A collaborative filter predicts items that a user will like based on the vote similar users gave to that item. In this study, we use CF to estimate how much the knowledge of the presence or absence of one software feature can contribute to the correct prediction of the presence or absence of each of the possible remaining features. Completed software project documentations from the Master in Information Technology programs of selected Northern Luzon higher education institutions were first collected. An analysis of these documents revealed 26 unique software features and yielded a binary matrix indicating the presence or absence of a feature in a specific project. Leave-one-out cross-validation was performed to estimate the predictive power of each element of a given holdout vector, using the 26x26 cosine similarity matrix generated from the remaining vectors. The results show that, on average, knowing correctly the presence or absence of only 1 feature can predict with an accuracy of about 58% the presence or absence of the remaining features. This is 8% better than that of a naïve 50-50 random binary guessing algorithm, and somehow indicates the amount of information contributed by one feature value under the CF algorithm.
- Nuseibeh, B. and Easterbrook, S. (2000) Requirements Engineering: A Roadmap. Proceedings of the Conference on the Future of Software Engineering, 35--46. http://dx.doi.org/10.1145/336512.336523Google ScholarDigital Library
- Hussain, Azham & Mkpojiogu, Emmanuel & Kamal, Fazillah. (2016). The Role of Requirements in the Success or Failure of Software Projects. EJ Econjournals. 6. 6--7.Google Scholar
- Verner, June & Sampson, Jennifer & Cerpa, Narciso. (2008). What factors lead to software project failure?. Proceedings of the 2nd International Conference on Research Challenges in Information Science, RCIS 2008. 71--80. 10.1109/RCIS.2008.4632095.Google ScholarCross Ref
- Nayana Vaidya, A.R Prof. Khachane, "Recommender Systems-The need of the Ecommerce", IEEE International Conference on Computing Methodologies and Communication, 2017.Google Scholar
- Cappella, J., Yang, S., & Lee, S. (2015). Constructing Recommendation Systems for Effective Health Messages Using Content, Collaborative, and Hybrid Algorithms. The Annals of the American Academy of Political and Social Science, 659, 290--306. Retrieved from http://www.jstor.org/stable/24541864.Google ScholarCross Ref
- Ruiz Z., Salvador J., Garcia-Rodriguez J. (2017) A Survey of Machine Learning Methods for Big Data. In: Ferrández Vicente J., Álvarez-Sánchez J., de la Paz López F., Toledo Moreo J., Adeli H. (eds) Biomedical Applications Based on Natural and Artificial Computing. IWINAC 2017. Lecture Notes in Computer Science, vol 10338. Springer, Cham.Google ScholarCross Ref
- Grbovic, Mihajlo & Cheng, Haibin. (2018). Real-time Personalization using Embeddings for Search Ranking at Airbnb. Proceedings of the the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. Pages 311--320.Google ScholarDigital Library
- Hidalgo, Reynald Jay & Fernandez, Proceso. (2015). Functional Requirements Identification Using Item-to-Item Collaborative Filtering. International Journal of Information and Education Technology. 5. 758--762. 10.7763/IJIET.2015.V5.606.Google Scholar
- Zhang, Feng & Gong, Ti & E. Lee, Victor & Zhao, Gansen & Rong, Chunming & Qu, Guangzhi. (2016). Fast Algorithms to Evaluate Collaborative Filtering Recommender Systems. Knowledge-Based Systems. 96. 10.1016/j.knosys.2015.12.025.Google ScholarDigital Library
- Ying, Y., Feinberg, F., & Wedel, M. (2006). Leveraging Missing Ratings to Improve Online Recommendation Systems. Journal of Marketing Research, 43(3), 355--365. Retrieved from http://www.jstor.org/stable/30162410.Google ScholarCross Ref
- Zeng, W., Zhu, Y. X., Lü, L., & Zhou, T. (2011). Negative ratings play a positive role in information filtering. Physica A: Statistical Mechanics and its Applications, 390(23), 4486--4493.Google Scholar
- Albatayneh, N., Ghauth, K., & Chua, F. (2018). Utilizing Learners' Negative Ratings in Semantic Content-based Recommender System for e-Learning Forum. Journal of Educational Technology & Society Vol 21, No. 1, pp. 112--125.Google Scholar
- Koren Y., Bell R. (2011) Advances in Collaborative Filtering. In: Ricci F., Rokach L., Shapira B., Kantor P. (eds) Recommender Systems Handbook. Springer, Boston, MA.Google ScholarCross Ref
- Grossman, Robert; Seni, Giovanni; Elder, John; Agarwal, Nitin; Liu, Huan (2010). "Ensemble Methods in Data Mining: Improving Accuracy Through Combining Predictions". Synthesis Lectures on Data Mining and Knowledge Discovery. Morgan & Claypool. 2: 1--126. doi:10.2200/S00240ED1V01Y200912DMK002.Google Scholar
Index Terms
- Using Collaborative Filtering Algorithm to Estimate the Predictive Power of a Functional Requirement
Recommendations
Trust-based collaborative filtering: tackling the cold start problem using regular equivalence
RecSys '18: Proceedings of the 12th ACM Conference on Recommender SystemsUser-based Collaborative Filtering (CF) is one of the most popular approaches to create recommender systems. This approach is based on finding the most relevant k users from whose rating history we can extract items to recommend. CF, however, suffers ...
A hybrid approach for improving predictive accuracy of collaborative filtering algorithms
Recommender systems represent a class of personalized systems that aim at predicting a user's interest on information items available in the application domain, operating upon user-driven ratings on items and/or item features. One of the most widely ...
Unifying user-based and item-based collaborative filtering approaches by similarity fusion
SIGIR '06: Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrievalMemory-based methods for collaborative filtering predict new ratings by averaging (weighted) ratings between, respectively, pairs of similar users or items. In practice, a large number of ratings from similar users or similar items are not available, ...
Comments