skip to main content
10.1145/3170358.3170373acmotherconferencesArticle/Chapter ViewAbstractPublication PageslakConference Proceedingsconference-collections
research-article

Coenrollment networks and their relationship to grades in undergraduate education

Published:07 March 2018Publication History

ABSTRACT

In this paper, we evaluate the complete undergraduate coenrollment network over a decade of education at a large American public university. We provide descriptive properties of the network, demonstrating that the coenrollment networks evaluated follow power-law degree distributions similar to many other large-scale networks; that they reveal strong performance-based assortativity; and that network-based features can significantly improve GPA-based student performance predictors. We then implement a network-based, multi-view classification model to predict students' final course grades. In particular, we adapt a structural modeling approach from [19, 34], whereby we model the university-wide undergraduate coenrollment network as an undirected graph. We compare the performance of our predictor to traditional methods used for grade prediction in undergraduate university courses, and demonstrate that a multi-view ensembling approach outperforms both prior "flat" and network-based models for grade prediction across several classification metrics. These findings demonstrate the usefulness of combining diverse approaches in models of student success, and demonstrate specific network-based modeling strategies which are likely to be most effective for grade prediction.

References

  1. Paul Adams. 2006. Exploring social constructivism: theories and practicalities. Education 3--13 34, 3 (Oct. 2006), 243--257.Google ScholarGoogle Scholar
  2. J S Antrobus, R Dobbelaer, and S Salzinger. 1988. Social networks and college success, or grade point average and the friendly connection. Social networks of children, adolescents, and college students 227 (1988), 260.Google ScholarGoogle Scholar
  3. Sara Baker, Adalbert Mayer, and Steven L Puller. 2011. Do more diverse environments increase the diversity of subsequent interaction? Evidence from random dorm assignment. Econ. Lett. 110, 2 (2011), 110--112.Google ScholarGoogle ScholarCross RefCross Ref
  4. Timothy T Baldwin, Michael D Bedell, and Jonathan L Johnson. 1997. The social fabric of a team-based MBA program: Network effects on student satisfaction and performance. Acad. Manage. J. 40, 6 (1997), 1369--1397.Google ScholarGoogle Scholar
  5. L Becchetti, C Castillo, D Donato, and others. 2008. Link analysis for web spam detection. ACM Transactions on (2008). Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. James Bennett, Stan Lanning, and Others. 2007. The netflix prize. In Proceedings of KDD cup and workshop, Vol. 2007. brettb.net, 35.Google ScholarGoogle Scholar
  7. Smriti Bhagat, Irina Rozenbaum, and Graham Cormode. 2007. Applying Link-based Classification to Label Blogs. In Proceedings of the 9th WebKDD and 1st SNA-KDD 2007 Workshop on Web Mining and Social Network Analysis (WebKDD/SNA-KDD '07). ACM, New York, NY, USA, 92--101. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Susan Biancani and Daniel McFarland. 2013. Social Networks Research in Higher Education. Educ. Stud. 4 (2013), 85--126.Google ScholarGoogle Scholar
  9. Giorgio Brunello, Maria de Paola, and Vincenzo Scoppa. 2010. PEER EFFECTS IN HIGHER EDUCATION: DOES THE FIELD OF STUDY MATTER? Econ. Inq. 48, 3 (July 2010), 621--634.Google ScholarGoogle Scholar
  10. Soumen Chakrabarti, Byron Dom, and Piotr Indyk. 1998. Enhanced Hypertext Categorization Using Hyperlinks. In Proceedings of the 1998 ACM SIGMOD International Conference on Management of Data (SIGMOD '98). ACM, New York, NY, USA, 307--318. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Tianqi Chen and Carlos Guestrin. 2016. XGBoost: A Scalable Tree Boosting System. In Proceedings of the 22Nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '16). ACM, New York, NY, USA, 785--794. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Corinna Cortes, Daryl Pregibon, and Chris Volinsky. 2001. Communities of Interest. In Advances in Intelligent Data Analysis. Springer, Berlin, Heidelberg, 105--114. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Thomas GDietterich. 2000. Ensemble Methods in Machine Learning. In Multiple Classifier Systems. Springer, Berlin, Heidelberg, 1--15. Google ScholarGoogle ScholarCross RefCross Ref
  14. Pedro Domingos and Matt Richardson. 2001. Mining the Network Value of Customers. In Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '01). ACM, New York, NY, USA, 57--66. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Tom Fawcett and Foster Provost. 1997. Adaptive Fraud Detection. Data Min. Knowl. Discov. 1, 3 (Sept. 1997), 291--316. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Gigi Foster. 2006. It's not your peers, and it's not your friends: Some progress toward understanding the educational peer effect mechanism. J. Public Econ. 90, 8--9 (2006), 1455--1475.Google ScholarGoogle ScholarCross RefCross Ref
  17. Nir Friedman, Lise Getoor, Daphne Koller, and Avi Pfeffer. 1999. Learning probabilistic relational models. In IJCAI, Vol. 99. robotics.stanford.edu, 1300--1309. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Dragan Gašević, Amal Zouaq, and Robert Janzen. 2013. "Choose your classmates, your GPA is at stake!" The association of cross-class social ties and academic performance. Am. Behav. Sci. 57, 10 (2013), 1460--1479.Google ScholarGoogle ScholarCross RefCross Ref
  19. Lise Getoor. 2005. Link-based Classification. In Advanced Methods for Knowledge Discovery from Complex Data. Springer London, 189--207.Google ScholarGoogle Scholar
  20. Lise Getoor and Christopher P Diehl. 2005. Link Mining: A Survey. SIGKDD Explor. Newsl. 7, 2 (Dec. 2005), 3--12. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Lise Getoor and Lilyana Mihalkova. 2011. Learning Statistical Models from Relational Data. In Proceedings of the 2011 ACM SIGMOD International Conference on Management of Data (SIGMOD '11). ACM, New York, NY, USA, 1195--1198. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. A F Hadwin and Sanna Järvelä. 2011. Introduction to a special issue on social aspects of self-regulated learning: Where social and self meet in the strategic regulation of learning. Teach. Coll. Rec. 113, 2 (2011), 235--239.Google ScholarGoogle ScholarCross RefCross Ref
  23. David J Hand and Robert J Till. 2001. A Simple Generalisation of the Area Under the ROC Curve for Multiple Class Classification Problems. Mach. Learn. 45, 2 (Nov. 2001), 171--186. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Shawndra Hill, Foster Provost, and Chris Volinsky. 2006. Network-Based Marketing: Identifying Likely Adopters via Consumer Networks. Stat. Sci. 21, 2 (2006), 256--276.Google ScholarGoogle ScholarCross RefCross Ref
  25. Jessica Hoel, Jeffrey Parker, and Jon Rivenburg. 2005. Peer effects: do first-year classmates, roommates, and dormmates affect studentsâĂŹ academic success. In Higher education data sharing consortium winter conference, Santa Fe, NM. Citeseer.Google ScholarGoogle Scholar
  26. Feng Jiang and Wentao Li. 2017. Who Will Be the Next to Drop Out? Anticipating Dropouts in MOOCs with Multi-View Features. International Journal of Performability Engineering 13, 2 (2017).Google ScholarGoogle Scholar
  27. Yuheng Jiang and Lukasz Golab. {n. d.}. On Competition for Undergraduate Co-op Placements: A Graph Mining Approach. ({n. d.}).Google ScholarGoogle Scholar
  28. David W Johnson and Roger T Johnson. 2009. An Educational Psychology Success Story: Social Interdependence Theory and Cooperative Learning. Educ. Res. 38, 5 (June 2009), 365--379.Google ScholarGoogle ScholarCross RefCross Ref
  29. Daphne Koller. 1999. Probabilistic Relational Models. In Inductive Logic Programming. Springer, Berlin, Heidelberg, 3--13. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Gueorgi Kossinets and Duncan J Watts. 2006. Empirical analysis of an evolving social network. Science 311, 5757 (Jan. 2006), 88--90.Google ScholarGoogle ScholarCross RefCross Ref
  31. J Lafferty, A McCallum, and F Pereira. {n. d.}. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. ({n. d.}).Google ScholarGoogle Scholar
  32. Conrad Lee, Thomas Scherngell, and Michael J Barber. 2011. Investigating an online social network using spatial interaction models. Soc. Networks 33, 2 (2011), 129--133.Google ScholarGoogle ScholarCross RefCross Ref
  33. W Li, M Gao, H Li, Q Xiong, J Wen, and Z Wu. 2016. Dropout prediction in MOOCs using behavior features and multi-view semi-supervised learning. In 2016 International Joint Conference on Neural Networks (IJCNN). ieeexplore.ieee.org, 3130--3137.Google ScholarGoogle ScholarCross RefCross Ref
  34. Qing Lu and Lise Getoor. 2003. Link-based classification. In ICML, Vol. 3. vvvvw.aaai.org, 496--503. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Q Lu and L Getoor. 2003. Link-based classification using labeled and unlabeled data. In ICML 2003 workshop on The Continuum from Labeled to Unlabeled Data in Machine Learning and Data Mining. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. David S Lyle. 2007. Estimating and Interpreting Peer and Role Model Effects from Randomly Assigned Social Groups at West Point. Rev. Econ. Stat. 89, 2 (April 2007), 289--299.Google ScholarGoogle Scholar
  37. Collin F Lynch, Tiffany Barnes, Linting Xue, and Niki Gitinabard. 2017. Graph-based Educational Data Mining (G-EDM 2017). In Proceedings of the 10th International Conference on Educational Data Mining, Xiangen Hu, Tiffany Barnes, Arnon Hershkovitz, and Luc Paquette (Eds.). 472--473.Google ScholarGoogle Scholar
  38. Patrick J McEwan and Kristen A Soderberg. 2006. Roommate Effects on Grades: Evidence from First-Year Housing Assignments. Res. High. Educ. 47, 3 (May 2006), 347--370.Google ScholarGoogle Scholar
  39. M Newman. 2003. The Structure and Function of Complex Networks. SIAM Rev. 45, 2 (Jan. 2003), 167--256.Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. M E Newman. 2001. The structure of scientific collaboration networks. Proc. Natl. Acad. Sci. U. S. A. 98, 2 (Jan. 2001), 404--409.Google ScholarGoogle ScholarCross RefCross Ref
  41. M E J Newman. 2003. Mixing patterns in networks. Phys. Rev. E Stat. Nonlin. Soft Matter Phys. 67, 2 Pt 2 (Feb. 2003), 026126.Google ScholarGoogle ScholarCross RefCross Ref
  42. Hyo-Jung Oh, Sung Hyon Myaeng, and Mann-Ho Lee. 2000. A Practical Hypertext Catergorization Method Using Links and Incrementally Available Class Information. In Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '00). ACM, New York, NY, USA, 264--271. Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. Shashank Pandit, Duen Horng Chau, Samuel Wang, and Christos Faloutsos. 2007. Netprobe: A Fast and Scalable System for Fraud Detection in Online Auction Networks. In Proceedings of the 16th International Conference on World Wide Web (WWW '07). ACM, New York, NY, USA, 201--210. Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. Foster Provost, Brian Dalessandro, Rod Hook, Xiaohan Zhang, and Alan Murray. 2009. Audience Selection for On-line Brand Advertising: Privacy-friendly Social Network Targeting. In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '09). ACM, New York, NY, USA, 707--716. Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. S Redner. 1998. How popular is your paper? An empirical study of the citation distribution. Eur. Phys. J. B 4, 2 (July 1998), 131--134.Google ScholarGoogle ScholarCross RefCross Ref
  46. Marco Tulio Ribeiro, Sameer Singh, and Carlos Guestrin. 2016. Why should i trust you?: Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. dl.acm.org, 1135--1144. Google ScholarGoogle ScholarDigital LibraryDigital Library
  47. M Riedmiller and H Braun. 1993. A direct adaptive method for faster backpropagation learning: the RPROP algorithm. In IEEE International Conference on Neural Networks. ieeexplore.ieee.org, 586--591 vol.1.Google ScholarGoogle ScholarCross RefCross Ref
  48. Bruce Sacerdote. 2000. Peer Effects with Random Assignment: Results for Dartmouth Roommates. (Jan. 2000).Google ScholarGoogle ScholarCross RefCross Ref
  49. Sarah Schrire. 2006. Knowledge building in asynchronous discussion groups: Going beyond quantitative analysis. Comput. Educ. 46, 1 (Jan. 2006), 49--70. Google ScholarGoogle ScholarDigital LibraryDigital Library
  50. John J Siegfried and Michael A Gleason. 2006. Academic roommate peer effects. Unpublished manuscript, Vanderbilt Univ., Nashville (2006).Google ScholarGoogle Scholar
  51. Ralph Stinebrickner and Todd R Stinebrickner. 2006. What can be learned about peer effects using college roommates? Evidence from new survey data and students from disadvantaged backgrounds. J. Public Econ. 90, 8--9 (2006), 1435--1454.Google ScholarGoogle ScholarCross RefCross Ref
  52. Shiliang Sun. 2013. A survey of multi-view machine learning. Neural Comput. Appl. 23, 7-8 (Dec. 2013), 2031--2038.Google ScholarGoogle ScholarCross RefCross Ref
  53. Lei Tang and Huan Liu. 2011. Leveraging social media networks for classification. Data Min. Knowl. Discov. 23, 3 (Nov. 2011), 447--478. Google ScholarGoogle ScholarDigital LibraryDigital Library
  54. Ben Taskar, Pieter Abbeel, and Daphne Koller. 2002. Discriminative Probabilistic Models for Relational Data. In Proceedings of the Eighteenth Conference on Uncertainty in Artificial Intelligence (UAI'02). Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, 485--492. Google ScholarGoogle ScholarDigital LibraryDigital Library
  55. Andreas Töscher, Michael Jahrer, and Robert M Bell. 2009. The bigchaos solution to the netflix grand prize. Netflix prize documentation (2009), 1--52.Google ScholarGoogle Scholar
  56. A Traud, E Kelsic, P Mucha, and M Porter. 2011. Comparing Community Structure to Characteristics in Online Collegiate Social Networks. SIAM Rev. 53, 3 (Jan. 2011), 526--543. Google ScholarGoogle ScholarDigital LibraryDigital Library
  57. Anneleen Van Assche, Celine Vens, Hendrik Blockeel, and Sašo Dzeroski. 2004. A random forest approach to relational learning. In Workshop on Statistical Relational Learning.Google ScholarGoogle Scholar
  58. Jacob Whitehill, Kiran Mohan, Daniel Seaton, Yigal Rosen, and Dustin Tingley. 2017. Delving Deeper into MOOC Student Dropout Prediction. (Feb. 2017). arXiv:cs.AI/1702.06404Google ScholarGoogle Scholar
  59. Andreas Wimmer and Kevin Lewis. 2010. Beyond and Below Racial Homophily: ERG Models of a Friendship Network Documented on Facebook. Am. J. Sociol. 116, 2 (2010), 583--642.Google ScholarGoogle ScholarCross RefCross Ref
  60. Gordon Winston and David Zimmerman. 2004. Peer effects in higher education. In College choices: The economics of where to go, when to go, and how to pay for it. University of Chicago Press, 395--424.Google ScholarGoogle Scholar
  61. David H Wolpert. 1992. Stacked generalization. Neural Netw. 5, 2 (1992), 241--259. Google ScholarGoogle ScholarDigital LibraryDigital Library
  62. Chang Xu, Dacheng Tao, and Chao Xu. 2013. A Survey on Multi-view Learning. (April 2013). arXiv:cs.LG/1304.5634Google ScholarGoogle Scholar
  63. Zhijie Xu and Shiliang Sun. 2010. An Algorithm on Multi-View Adaboost. In Neural Information Processing. Theory and Algorithms (Lecture Notes in Computer Science). Springer, Berlin, Heidelberg, 355--362. Google ScholarGoogle ScholarDigital LibraryDigital Library
  64. Tong Zhang, Alexandrin Popescul, and Byron Dom. 2006. Linear Prediction Models with Graph Regularization for Web-page Categorization. In Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '06). ACM, New York, NY, USA, 821--826. Google ScholarGoogle ScholarDigital LibraryDigital Library
  65. David J Zimmerman. 2003. Peer Effects in Academic Outcomes: Evidence from a Natural Experiment. Rev. Econ. Stat. 85, 1 (Feb. 2003), 9--23.Google ScholarGoogle Scholar

Index Terms

  1. Coenrollment networks and their relationship to grades in undergraduate education

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      LAK '18: Proceedings of the 8th International Conference on Learning Analytics and Knowledge
      March 2018
      489 pages
      ISBN:9781450364003
      DOI:10.1145/3170358

      Copyright © 2018 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 7 March 2018

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      LAK '18 Paper Acceptance Rate35of115submissions,30%Overall Acceptance Rate236of782submissions,30%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader