1 Introduction
With the advent of the Industry 4.0 era, technology has reshaped people's lives and has led to a significant increase in work stress [1, 2]. Facing this greater work pressure, people need leisure and entertainment for respite, and they have dramatically increased their online leisure activities [3]. Scholars are more concerned than ever about the relationship between recovery from work and subjective well-being among employees [4]. Internet leisure, a common after-work lifestyle, has become a key topic for exploring subjective well-being in the new era. Previous studies of leisure activities have often reported inconsistent results [5]. Some findings suggest that subjective well-being derives mainly from passive activities such as watching TV, while activities such as exercise and socializing make no contribution [6]. However, Liu and Da reported that both passive and active activities promote subjective well-being. Some theories have focused on the psychological mechanisms triggered by activities [7]. The DRAMMA model integrates the four post-work recovery experiences from the recovery theory and the three basic psychological needs from Self-Determination Theory (SDT) into six psychological needs that make unique contributions to subjective well-being [8, 9]. However, recent studies have raised concerns about the DRAMMA model's design, because analyzing the six mechanisms together does not allow their internal levels to be examined. Therefore, a deeper investigation into the relationships between these different needs and the theories of DRAMMA, SDT, and the recovery theory is needed [10]. Similarly, recovery theory research has proposed that observing the four psychological experiences in combination does not clarify which experiences drive recovery [11], and therefore the combinations of psychological needs should be explored [12, 13]. While the Conservation of Resources (COR) theory serves as a universal explanatory framework for various psychological processes, it still needs to be systematized [5] so as to distinguish the characteristics and mechanisms of resources in leisure [4]. This study attempts to comprehensively explore internet leisure mechanisms from a systematic COR perspective. It considers the levels of psychological detachment (representing the recovery theory) and SDT, discusses the paths to subjective well-being through these two strategies for different activities, and seeks to uncover the underlying regularities linking the nature of activities and resource levels, with direct implications for practice.
Traditional psychological approaches are based on researchers’ hypotheses: questionnaires or experimental procedures are developed, data are collected, and analysis is conducted. The main objective is to study the causal relationships of psychological processes; once causal relationships are extracted, they enable better prediction of the future. However, the predictions of traditional methods predominantly emphasize qualitative explanation and cannot be used for quantitative prediction. Artificial intelligence algorithms, on the other hand, can mine logical relationships from large amounts of data and further improve the predictive accuracy of models through various algorithms and iteration techniques. Once researchers obtain the optimal algorithm model, they can use it to quantify the influence of each item, understand the importance of each item in the research scenario, and achieve more timely and effective prediction and adjustment of subjective well-being.
Furthermore, it is worth noting that this paper has amassed over 3500 citations (according to Google Scholar, March 2, 2024) [14]. In the XGBoost algorithm, the objective function in Eq. (1) contains functions as parameters, making it challenging to optimize using traditional methods. As a result, the model is trained in an additive manner using a second-order Taylor approximation. To use this approximation, the objective function is rewritten as Eq. (2). By eliminating constant terms, the objective function can be simplified into a sum of quadratic functions of a single variable, as shown in Eq. (3). This simplified form can be conveniently minimized by employing a greedy algorithm.
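To make the second-order approximation more concrete, the following minimal sketch (an illustration under stated assumptions, not the implementation used in this study) computes, for a squared-error loss, the per-sample first and second derivatives and the leaf weight that minimizes the resulting one-variable quadratic, w* = -G / (H + lambda). The function names, the toy data, and the regularization value `lam` are assumptions introduced for illustration.

```python
# Illustrative sketch only: function names, toy data and `lam` are assumptions.

def grad_hess_squared_error(y_true, y_pred):
    """Derivatives of 0.5 * (y_pred - y_true)**2 with respect to y_pred."""
    grads = [p - t for t, p in zip(y_true, y_pred)]  # first derivative
    hess = [1.0 for _ in y_true]                     # second derivative is constant
    return grads, hess


def leaf_objective(grads, hess, w, lam=1.0):
    """Simplified quadratic objective for one leaf: G*w + 0.5*(H + lam)*w**2."""
    G, H = sum(grads), sum(hess)
    return G * w + 0.5 * (H + lam) * w * w


def optimal_leaf_weight(grads, hess, lam=1.0):
    """Closed-form minimizer of the quadratic above: w* = -G / (H + lam)."""
    G, H = sum(grads), sum(hess)
    return -G / (H + lam)


# Toy example: three samples falling in one leaf, current prediction 0 for all.
y_true = [3.0, 4.0, 5.0]
y_pred = [0.0, 0.0, 0.0]
g, h = grad_hess_squared_error(y_true, y_pred)
w_star = optimal_leaf_weight(g, h, lam=1.0)
print("optimal leaf weight:", w_star)
```

Because each leaf contributes an independent quadratic in its own weight, the weights can be solved in closed form, and only the tree structure has to be searched greedily.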
In the XGBoost algorithm, the previous classifiers are held fixed and a new weak classifier is added at each iteration to improve the performance of the current model. This process continues, with each new classifier focusing on areas where the previous ones performed poorly. The general flow of the XGBoost algorithm is illustrated in Figure 1.
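The additive flow described above can be sketched with a deliberately simplified boosting loop. This is plain gradient boosting on residuals with one-split regression stumps, not XGBoost itself (it omits regularization and the second-order terms). The data, function names, and settings are hypothetical.

```python
# Illustrative sketch only: a tiny residual-fitting boosting loop, not XGBoost.

def fit_stump(x, residuals):
    """Fit a one-split regression stump to the residuals by least squares.

    Assumes x contains at least two distinct values.
    """
    best = None
    for thr in sorted(set(x))[:-1]:  # drop the largest value so both sides are non-empty
        left = [r for xi, r in zip(x, residuals) if xi <= thr]
        right = [r for xi, r in zip(x, residuals) if xi > thr]
        lmean = sum(left) / len(left)
        rmean = sum(right) / len(right)
        sse = sum((r - lmean) ** 2 for r in left) + sum((r - rmean) ** 2 for r in right)
        if best is None or sse < best[0]:
            best = (sse, thr, lmean, rmean)
    _, thr, lmean, rmean = best
    return lambda xi: lmean if xi <= thr else rmean


def boost(x, y, n_rounds=20, learning_rate=0.5):
    """Keep earlier learners fixed and add one new stump per round."""
    base = sum(y) / len(y)
    learners = []
    pred = [base] * len(y)
    history = []  # training mean squared error after each round
    for _ in range(n_rounds):
        residuals = [yi - pi for yi, pi in zip(y, pred)]
        stump = fit_stump(x, residuals)  # new learner targets the current errors
        learners.append(stump)
        pred = [pi + learning_rate * stump(xi) for xi, pi in zip(x, pred)]
        history.append(sum((yi - pi) ** 2 for yi, pi in zip(y, pred)) / len(y))

    def predict(xi):
        return base + learning_rate * sum(s(xi) for s in learners)

    return predict, history


# Hypothetical toy data.
x = [1, 2, 3, 4, 5, 6]
y = [1.0, 1.2, 3.1, 3.0, 5.2, 5.1]
predict, history = boost(x, y)
print("training MSE after first and last round:", history[0], history[-1])
```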
XGBoost's performance and accuracy have been extensively tested and validated in various real-world applications. For instance, it has been successfully applied in sentiment analysis [15], which involves analyzing and understanding emotions, opinions, and attitudes expressed in text data. XGBoost has demonstrated its capability to effectively classify and analyze sentiment, allowing for a deeper understanding of people's reactions and opinions.
Additionally, XGBoost has been used for student performance prediction [17] and for depression prediction [16], where it aids in the early identification and prediction of depressive symptoms or disorders. By leveraging ensemble learning techniques, XGBoost can analyze various factors and indicators to predict an individual's risk or likelihood of experiencing depression. These practical implementations highlight the robustness and versatility of XGBoost as an efficient tool in diverse domains and its ability to tackle complex problems.
3 Data Analysis and Results
The demographic variables include age, marital status, occupation, years of work, position level, monthly income, and average work hours per day. In the initial model, all data were converted into numerical variables, and the XGBoost algorithm was used.
3.1 Model parameter freezing
In the XGBoost model, since the dependent variable is continuous, the objective and scoring are set as reg:squarederror and neg_mean_squared_error, respectively. The booster used is the commonly used gbtree, which iteratively trains a series of decision trees for prediction. This tree-based model can handle various types of data and has strong fitting and expressive power.
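As a brief illustration of the scoring choice: scikit-learn scorers follow a "greater is better" convention, so the mean squared error is negated. The sketch below reproduces that quantity in plain Python and records the two fixed settings named in the text. The helper name `neg_mse`, the dictionary name, and the toy values are assumptions for illustration.

```python
# Illustrative sketch only: `neg_mse` and FIXED_PARAMS are hypothetical names.

# The two settings fixed in the text, written as a plain parameter dictionary.
FIXED_PARAMS = {"objective": "reg:squarederror", "booster": "gbtree"}


def neg_mse(y_true, y_pred):
    """Negated mean squared error: values closer to zero indicate a better fit."""
    mse = sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)
    return -mse


# Toy values: only the last prediction is off, by 2.
score = neg_mse([1.0, 2.0, 3.0], [1.0, 2.0, 5.0])
print("neg MSE:", score)
```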
3.2 Model parameter grid search
In designing the model's hyperparameters, we used the GridSearchCV method from sklearn.model_selection to perform a grid search over the hyperparameters shown in Table 1.
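A grid search of this kind could be set up roughly as follows. This is a sketch under assumptions, not the study's actual configuration: the candidate values in `PARAM_GRID`, the helper names, and the choice of 5-fold cross-validation are hypothetical, and the real grid is the one given in Table 1. The scikit-learn and xgboost imports are placed inside `build_search` so that the grid itself can be inspected without those packages installed.

```python
from itertools import product

# Hypothetical candidate values for illustration; the actual grid is in Table 1.
PARAM_GRID = {
    "max_depth": [3, 5, 7],
    "learning_rate": [0.01, 0.1],
    "n_estimators": [200, 400],
    "subsample": [0.7, 1.0],
    "colsample_bytree": [0.8, 1.0],
    "min_child_weight": [1, 3],
}


def iter_candidates(grid):
    """Yield every hyperparameter combination that an exhaustive grid search evaluates."""
    keys = list(grid)
    for values in product(*(grid[k] for k in keys)):
        yield dict(zip(keys, values))


def build_search(cv=5):
    """Construct (but do not fit) the grid search; cv=5 is an assumed setting."""
    from sklearn.model_selection import GridSearchCV
    from xgboost import XGBRegressor

    model = XGBRegressor(objective="reg:squarederror", booster="gbtree")
    return GridSearchCV(model, PARAM_GRID, scoring="neg_mean_squared_error", cv=cv)


n_candidates = sum(1 for _ in iter_candidates(PARAM_GRID))
print("combinations to evaluate:", n_candidates)
```

With training data available (for example, hypothetical arrays `X_train` and `y_train`), one would call `search = build_search()`, then `search.fit(X_train, y_train)`, and read the selected values from `search.best_params_`.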
3.3 Model results
After training, the model's test set accuracy was 96.96%, with 32 out of 33 predictions being correct. The evaluation of the model's performance is illustrated in Table 2, which presents the confusion matrix comparing the predicted results with the actual results. The accuracy of the model's predictions was calculated using Table 3. It should be noted that the accuracy can vary with the dataset and the specific prediction task.
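The accuracy calculation can be sketched as the sum of the diagonal of the confusion matrix divided by the total number of predictions. In the example below, only the totals (32 correct out of 33) come from the text; the 2x2 layout and the individual cell counts are hypothetical placeholders, not the values in Table 2.

```python
# Illustrative sketch only: the cell counts below are hypothetical placeholders.

def accuracy_from_confusion(matrix):
    """Accuracy = correctly classified cases (diagonal) / all cases."""
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    total = sum(sum(row) for row in matrix)
    return correct / total


# Rows: actual class, columns: predicted class.
confusion = [
    [15, 1],
    [0, 17],
]
acc = accuracy_from_confusion(confusion)
print(f"accuracy: {acc:.4f}")
```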
Using the model's feature importance scores (model.feature_importances_), we can identify and rank the ten most important factors among the model's independent variables. Figure 2 displays the results of this evaluation, showing the ten most important variables:
① The extent to which my professional growth feels impeded, with an importance of 0.142;
② In internet leisure, I have the ability to learn new knowledge or skills, with an importance of 0.052;
③ In internet leisure, I can make independent decisions about how to choose and engage with activities, with an importance of 0.051;
④ In internet leisure, I can escape from the physical exhaustion caused by work, with an importance of 0.037;
⑤ The workload that needs to be managed within the prescribed time period, with an importance of 0.031;
⑥ The lack of clarity regarding my professional duties and expectations, with an importance of 0.030;
⑦ In online leisure activities, I have rid myself of the physical exertion of work, with an importance of 0.026;
⑧ If I were to live again, there is hardly anything that I would want to change, with an importance of 0.024;
⑨ age, with an importance of 0.023;
⑩ education, with an importance of 0.023.
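The ranking step can be sketched as a simple sort over (feature, importance) pairs. The importance values below are the ten reported above; the shortened feature labels and the helper name `top_k` are assumptions for illustration. With a fitted model, the mapping would presumably be built from the feature names and the model's importance scores rather than typed in by hand.

```python
# Illustrative sketch only: labels are shortened, hypothetical names for the items above.

def top_k(importances, k=10):
    """Return the k (feature, importance) pairs with the largest importance."""
    return sorted(importances.items(), key=lambda kv: kv[1], reverse=True)[:k]


reported = {
    "professional growth impeded": 0.142,
    "learn new knowledge or skills": 0.052,
    "independent decisions": 0.051,
    "escape physical exhaustion": 0.037,
    "workload within prescribed time": 0.031,
    "unclear duties and expectations": 0.030,
    "rid of physical exertion": 0.026,
    "would change hardly anything": 0.024,
    "age": 0.023,
    "education": 0.023,
}

ranking = top_k(reported, k=3)
combined = sum(reported.values())  # combined importance of the ten reported items
for name, value in ranking:
    print(f"{name}: {value:.3f}")
```

Summing the ten reported values shows how much of the model's total importance these items jointly carry, which helps put the single largest item in context.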
4 Discussion
In establishing the algorithm model, grid search was used to determine the optimal hyperparameter values. Our independent variables comprise 66 items, and the depth of a decision tree generally does not exceed the number of independent variables; within the grid search, max_depth was determined to be optimal at 5. After analyzing the values of each parameter in the grid search, the optimal solution was found at a learning_rate of 0.01, an eta of 0.001, and n_estimators of 400. This indicates that the model used shallow trees and a small weight for each decision tree, while fitting with a larger number of trees. To prevent overfitting, subsample was set to 0.7, colsample_bytree to 0.8, and min_child_weight to 3, which were determined to be the optimal values.
Among the survey items, ‘The extent to which my professional growth feels impeded’ accounted for about 14% of the feature importance. Consistent with established theoretical perspectives, extreme stress can adversely affect one's sense of well-being. Chronic stress and ongoing discomfort can give rise to both psychological and physiological ailments, including anxiety, depressive disorders, and various somatic conditions, consequently eroding one's well-being [18]. The analysis of occupational stress factors corroborates both the Stressor Detachment Model and the Recovery Paradox [19, 20], reinforcing the notion that the pursuit of strategic resources is predicated upon fulfilling basic psychological needs [21]. Additionally, the item ‘In internet leisure, I have the ability to learn new knowledge or skills’ had an importance of 0.052. These observations support the importance of a positive self-concept, as posited by Self-Determination Theory, and of the proactive cultivation of resources as a means to bolster happiness.
Appendices
Chinese Employee SWB Survey
Please recall your life over the past month and, for each of the following items, choose the degree that best matches your true feelings: [1-5 Likert rating scale]
1.1 Involvement in online achievement activities, such as study, physical exercises.
1.2 Involvement in online social activities, such as study, physical exercises.
1.3 Involvement in online timeout activities, such as watching short videos, listening to music.
2.1 During virtual engagements, I sense the autonomy to determine my own decision-making process.
2.2 I feel pressure in online activities.
2.3 In online activities, I get along well with people.
2.4 In online activities, I seldom communicate with others.
2.5 In online activities, on average, I feel I can freely express my opinions and ideas.
2.6 In online activities, I have the capability to learn new knowledge or skills.
2.7 In online activities, I can be myself and can show my true inner self.
3.1 In online leisure activities, I put aside all work-related thinking and ideas.
3.2 In online leisure activities, I put aside all work-related emotions.
3.3 In online leisure activities, I have emotionally distanced myself from work.
3.4 In online leisure activities, I have rid myself of the physical exertion of work.