Abstract
Learning Analytics (LA) provides a rich set of methods, techniques, and tools to analyze learners’ data. However, educators without a background in data analysis and statistical methods experience difficulty comprehending the potentials and pitfalls of learning analytics based pedagogical practices and Engineering Sciences experience this difficulty. This chapter documents a set of exemplars used to demonstrate learning analytics applications in daily classroom activities. These exemplars have been designed and used mainly to train newly recruited teachers on data analysis methods during faculty induction programs. Exemplars demonstrate the application of statistical methods such as hypothesis testing, analysis of variance (ANOVA), correlation analysis, and regression analysis. Each use case’s broad objective is to describe the application’s context so that teachers can apply it in a similar situation. The chapter provides ready-to-use examples for conducting teachers training programs on Learning Analytics.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
M. Kiseleva, N. Kiseleva, E. Kiselev, Changes in education under the influence of digital technologies: main problems and risk of division. KnE Soc. Sci. 22–28 (2020)
R.E. Slavin, How evidence-based reform will transform research and practice in education. Educ. Psychol. 55(1), 21–31 (2020)
M.M. Chiu, B.W.-Y. Chow, S.W. Joh, How to assign students into sections to raise learning, in Proceedings of the Seventh International Learning Analytics & Knowledge Conference (ACM, 2017), pp. 95–104
D. Babik, S. Stevens, A.E. Waters, Comparison of ranking and rating scales in online peer assessment: simulation approach, in Proceedings of the 9th International Conference on Learning Analytics & Knowledge (2019), pp. 205–209
T.L. Varao-Sousa, C. Mills, A. Kingstone, Where you are, not what you see: the impact of learning environment on mind wandering and material retention, in Proceedings of the 9th International Conference on Learning Analytics & Knowledge (2019), pp. 421–425
X. Hu, F. Li, R. Kong, Can background music facilitate learning? Preliminary results on reading comprehension, in Proceedings of the 9th International Conference on Learning Analytics & Knowledge (2019), pp. 101–105
L. Chen, A. Dubrawski, Learning from learning curves: discovering interpretable learning trajectories, in Proceedings of the Seventh International Learning Analytics & Knowledge Conference (ACM, 2017), pp. 544–545
S.A. Adjei, A.F. Botelho, N.T. Heffernan, Sequencing content in an adaptive testing system: the role of choice, in Proceedings of the Seventh International Learning Analytics & Knowledge Conference (ACM, 2017), pp. 178–182
A. Aghababyan, N. Lewkow, R. Baker, Exploring the asymmetry of metacognition, in Proceedings of the Seventh International Learning Analytics & Knowledge Conference (ACM, 2017), pp. 115–119
L.K. Allen, C.A. Perret, A.D. Likens, D.S. McNamara, What’d you say again?: recurrence quantification analysis as a method for analyzing the dynamics of discourse in a reading strategy tutor, in LAK (2017), pp. 373–382
W. Matcha, D. Gǎsević, N.A.A. Uzir, J. Jovanović, A. Pardo, Analytics of learning strategies: associations with academic performance and feedback, in Proceedings of the 9th International Conference on Learning Analytics & Knowledge (2019), pp. 461–470
Y. Chen, B. Yu, X. Zhang, Y. Yu, Topic modeling for evaluating students’ reflective writing: a case study of pre-service teachers’ journals, in Proceedings of the Sixth International Conference on Learning Analytics & Knowledge (ACM, 2016), pp. 1–5
S. Slade, P. Prinsloo, M. Khalil, Learning analytics at the intersections of student trust, disclosure and benefit, in Proceedings of the 9th International Conference on Learning Analytics & Knowledge (2019), pp. 235–244
S.A. Adjei, A.F. Botelho, N.T. Heffernan, Predicting student performance on post-requisite skills using prerequisite skill data: an alternative method for refining prerequisite skill structures, in Proceedings of the Sixth International Conference on Learning Analytics & Knowledge (ACM, 2016), pages 469–473
T. Hecking, D. Doberstein, H.U. Hoppe, Predicting the wellfunctioning of learning groups under privacy restrictions, in Proceedings of the 9th International Conference on Learning Analytics & Knowledge (2019), pp. 245–249
M.L. Bote-Lorenzo, E. Ǵomez-Śanchez, Predicting the decrease of engagement indicators in a mooc, in Proceedings of the Seventh International Learning Analytics & Knowledge Conference (2017), pp. 143–147
U.D. Kumar, Business Analytics: The Science of Data-Driven Decision Making. Wiley (2017)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Appendix: A Primer on Statistical Terms and Definitions
Appendix: A Primer on Statistical Terms and Definitions
The following are most commonly used terms from Statistics. Here we reproduce their definitions from [17]
-
1.
Population It is the set of all possible data for a given context.
-
2.
Sample It is the subset taken from population.
-
3.
Types of Data A Data can be broadly classified into four categories. These are: (i) Continuous, (ii) Discrete, (iii) Ordinal, and (iv) Nominal. Discrete and continuous data is numeric type data. A continuous random variable such as temperature may take any value from the number space. A discrete random variable such as color can take a fixed set of values. For example, red, blue green etc. An implicit order of hierarchy is understood in case of ordinal type of variable such performance level may be excellent, very good, good and fair. No such implicit order is assumed in case of nominal type of random variable
-
4.
Mean is the arithmetical average value of data and is one of the most frequently measures of central tendency. It is defined as:
$$\mu = \mathop \sum \limits_{i = 1}^{n} \frac{{x_{i} }}{n}$$ -
5.
Mode is the most frequently occurring value in the data set. Mode is the only measure of central tendency which is valid for qualitative (nominal) data since the mean and median for nominal data are meaningless. In the bar chart (and histogram), mode is the tallest column.
-
6.
Median It is the value that divides the data in to two equal parts, that is the proportion of observations below median and above median will be 50%. Median is much more stable than the mean value that is adding a new observation may not change the median significantly. However the drawback of median is that it is not calculated using the entire data like in the case of mean.
-
7.
Variance It is the average of the squared differences from the Mean.
$${\text{var}} = \mathop \sum \limits_{i = 1}^{n} \frac{{\left( {x_{i} - \mu } \right)^{2} }}{n}$$ -
8.
Standard Deviation It is a measurement of how far data is spread out from the mean, or average. The Standard Deviation is a measure of how spreads out numbers are. It is typically defined as
$$\sigma = \sqrt {\text{var}}$$ -
9.
Normal Distribution It is a continuous probability distribution that is symmetrical on both sides of the mean, so the right side of the center is a mirror image of the left side. The area under the normal distribution curve represents probability and the total area under the curve sums to one. It is symmetrical bell-shaped graph as shown in Fig. 6.5.
-
10.
Co-Variance It signifies the direction of the linear relationship between the two variables. By direction we mean if the variables are directly proportional or inversely proportional to each other. Increasing the value of one variable might have a positive or a negative impact on the value of the other variable.
$$Cov_{x, y} = \frac{{\sum \left( {x_{i} - \mu_{x} } \right)\left( {y_{i} - \mu_{y} } \right)}}{n - 1}$$ -
11.
Correlation It is a measure of the strength and direction of relationship that exists between two random variables. It is a measure of association between two variables.
-
12.
Pearson Correlation Coefficient measure the strength of the linear association relationship using numerical measure
$$PCC = \frac{{Cov_{x, y} }}{{\sigma_{x} \sigma_{y} }}$$ -
13.
Mean Square Error It measures the average of the squares of the errors i.e., the average squared difference between the estimated values (ŷ) and actual value (y).
$$MSE = \mathop \sum \limits_{i = 1}^{n} \frac{{\left( {y_{i} - \hat{y}} \right)^{2} }}{n}$$ -
14.
t-test is used when the population follows a normal distribution and population standard deviation is unknown. It shows how significant the differences between groups are.
-
15.
z-test is a statistical test to determine whether two population means are different when the variances are known and the sample size is large. It can be used to test hypotheses in which the z-test follows a normal distribution.
-
16.
Chi-Squared test It is used for testing relationships between categorical variables.
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this chapter
Cite this chapter
Kiwelekar, A.W., Laddha, M.D., Netak, L.D. (2022). Exemplar Use-Cases for Training Teachers on Learning Analytics. In: Ivanović, M., Klašnja-Milićević, A., Jain, L.C. (eds) Handbook on Intelligent Techniques in the Educational Process. Learning and Analytics in Intelligent Systems, vol 29. Springer, Cham. https://doi.org/10.1007/978-3-031-04662-9_6
Download citation
DOI: https://doi.org/10.1007/978-3-031-04662-9_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-04661-2
Online ISBN: 978-3-031-04662-9
eBook Packages: EducationEducation (R0)