A Trust-Based Prediction Approach for Recommendation System

Wang, Peng; Huang, Haiping; Zhu, Jie; Qi, Lingtao

doi:10.1007/978-3-319-94472-2_12

Peng Wang²⁰,
Haiping Huang ORCID: orcid.org/0000-0002-4392-3599²⁰,
Jie Zhu²⁰ &
…
Lingtao Qi²⁰

Part of the book series: Lecture Notes in Computer Science ((LNPSE,volume 10975))

Included in the following conference series:

World Congress on Services

1294 Accesses
4 Citations

Abstract

The recommendation system has been widely used in e-commerce, but still suffers from data sparsity and cold-start problems. This paper combines the user trust relationship with the collaborative filtering recommendation system and puts forward the recommendation approach based on trust delivery (TDR), in order to solve the above two problems. Through calculating the quantifying trust values between users, the prediction score of an unrated item can be figured out to achieve effective recommendation. Compared with other recommendation algorithms, TDR achieves better performance on standard Mean Absolute Error (MAE) and Coverage.

You have full access to this open access chapter, Download conference paper PDF

ETBRec: a novel recommendation algorithm combining the double influence of trust relationship and expert users

Article 28 April 2021

A trust-based collaborative filtering algorithm for E-commerce recommendation system

Article Open access 29 June 2018

Context-aware recommender system using trust network

Article 06 January 2021

Keywords

1 Introduction

In e-commerce applications, the main task of the recommendation system is how to determine which items satisfy the users’ demands or interests and then recommend them to users, in order to reduce searching time and improves users’ experience [1]. At present, the research on recommendation system mainly focuses on the analyzing and modeling of the user’s interest preference and historical behavior data, so as to accurately predict the future behaviors of users. The most representative achievement, such as the collaborative filtering recommendation system, have been gradually applied in the business fields.

However, the classic collaborative filtering recommendation algorithm still faces great challenges, especially for the following two issues:

1.
Data sparsity problem: with the rapid growth of users and items in recommendation system, the number and variety of recommended items increases fast, and the overlap of similarity selections between users will be less and less, which causes the data is becoming sparse.
2.
User cold-start problem: for some new users in the recommendation system, there is little historical data about them because they have not rated to any item, thus it is difficult for the system to give an accurate personalized recommendation.

Based on the above problems, this paper designs a new recommendation approach. Based on the trust delivery relationship between users, even with the sparse score matrix, it can still calculate the user prediction ratings for given items which are not rated. And meanwhile, it can guarantee desirable performance on the Mean Absolute Error (MAE) and Coverage.

This paper is organized as follows: Sect. 2 describes the related work and background knowledge; Sect. 3 introduces the recommendation approach based on trust delivery (TDR); the experimental results and analysis are presented in Sect. 4; Sect. 5 summarizes the whole paper.

2 Related Work and Background

2.1 Related Work

The basic idea of collaborative filtering recommendation [2] is to analyze the similarity between users or items based on historical data, and then to use similarity to predict the current user’s preference of items. They can be divided into user-based recommendation and item-based recommendation [3]. Aiming at the existing problems such as data-sparsity and cold-start, many researchers have provided improved methods. Zhao et al. proposed several new similarity definition methods for different requirements; however, there is no obvious improvement on data-sparsity and cold-start problems [4]. Hwang et al. proposed a collaborative filtering algorithm using uncertain nearest neighbors, which alleviates the sparsity of data to some extent; its defect lies in it is hard to balance the recommended results between user group and item group [5]. Shambour et al. combined the trust relationship between users with collaborative filtering algorithm, and then proposed TSF algorithm to solve both the data-sparsity and cold-start problems [6]. Similarly, O’Donovan et al. [7] divided the user trust relationship into explicit trust relationship and implicit trust one, for the purpose of overcoming the above two problems. However, [6, 7] failed in giving the precise definitions of the degree of trust among users without considerations of combining the number of items rated by two users with the average score of other items.

2.2 Background Knowledge

The ‘trust’ in this paper is the expectation that whether the target users can provide valuable rating ability to other users [8]. Trust is a measurement of the degree of trust between users, which has a range of [a, b]. For example, in the range of [0,1], 0 indicates distrust instead 1 means complete trust. The trust relationship usually has the following characteristics: (1) Asymmetry; (2) Transitivity; (3) Dynamic: with the change of time and interactions between users, the trust value may change accordingly.

As shown in Fig. 1, the trust relationship between users can be represented by a directed-graph due to asymmetric trust relationship. User c trusts user b with the trust value 0.8, user b trusts user a the trust value 0.2, then user c is also trust user a.

3 A Trust-Based Delivery Recommendation Method

In this section, we first describe the recommendation approach based on trust delivery (TDR), and then give an example. Suppose that the original score matrix is R_m*n, it means that m users rate to n items, which can cause serious cold-start problem due to the matrix sparsity. The structure of TDR is shown in Fig. 2.

3.1 Calculate the Pearson Correlation Between Users

Pearson correlation is used to calculate the similarity between users, which is calculated by (1):

$$ \rho_{a,b} = \frac{{\sum\limits_{i = 1}^{{I_{a,b} }} {\left( {R_{a,i} - \bar{R}_{a} } \right) \times \left( {R_{b,i} - \bar{R}_{b} } \right)} }}{{\sqrt {\sum\limits_{i = 1}^{{I_{a,b} }} {\left( {R_{a,i} - \bar{R}_{a} } \right)^{2} } } \times \sqrt {\sum\limits_{i = 1}^{{I_{a,b} }} {\left( {R_{b,i} - \bar{R}_{b} } \right)^{2} } } }} $$

(1)

where R_a,i and R_b,i ∈ [1, 5] represent the ratings of user a and user b on item i respectively, R_a and $ \overline{\text{R}}_{b} \, \in \,\left[ { 1,{ 5}} \right] $ represent the average rating values of user a and user b on all items separately. I_a,b is the set of co-rated items by both user a and user b. Based on (1), we can find two users with high similar similarity. However, when user a and user b have only one co-rated item, according to (1), the value of ρ_a,b is 1. It is obviously unreasonable, so we need to add the correction parameter β_a,b, which is calculated by (2), where |N_a| is the number of items rated by user a, |N_b| is the number of items rated by user b, and |N_a,b| is that have been commonly rated by a and b.

$$ \beta_{a,b} = \frac{{\left| {N_{a,b} } \right|}}{{\left| {N_{a} } \right| + \left| {N_{b} } \right| - \left| {N_{a,b} } \right|}} $$

(2)

Therefore, Pearson Correlation between users is redefined as (3):

$$ \rho^{{\prime }}_{a,b} = \left\{ {\begin{array}{*{20}l} {\rho_{a,b} \times \beta_{a,b} ,} \hfill & {\left| {N_{a,b} } \right| \ne 0} \hfill \\ {NONE,} \hfill & {\left| {N_{a,b} } \right| = 0} \hfill \\ \end{array} } \right. $$

(3)

3.2 Trust Calculation and Propagation

It takes the original score matrix as input and calculates the direct trust values between the selected user a and a group of k neighbor users. After calculating the direct trust value, the indirect trust values can be derived using the transitivity of trust. The calculation of direct trust value is shown as (4) and (5):

$$ D_{a,b} = \frac{1}{2}\left( {\delta (\bar{R}_{a} ) + \delta (\bar{R}_{b} )} \right)\, \times \,\beta_{a,b} $$

(4)

$$ \delta \left( x \right) = \left\{ {\begin{array}{*{20}l} {1{ - }\frac{ 1}{{x + \left| {N_{a,b} } \right|}},} \hfill & {\left| {N_{a,b} } \right| \ne 0} \hfill \\ {NULL,} \hfill & {\left| {N_{a,b} } \right|} \hfill \\ \end{array} } \right. $$

(5)

where R_a, $ \overline{R}_{b} \, \in \,\left[ { 1,{ 5}} \right] $ and |N_a,b| denotes the number of items co-rated by user a and user b. If user a and user b have a lot of common rated items, then the value of |N_a,b| will be larger, and the value of δ(x) will be larger, so the value of D_a,b will be larger too. If the value of |N_a,b| is 0, it means that user a and user b do not share the common rated items, and they have no direct trust relationship.

However, not all users have direct trust relationship, for example, user a and user c in Fig. 1. While relying on user b, indirect trust relationship can be explored between them. So, the trust propagation can be calculated by (6):

$$ DP_{a,c} = Min\{ D_{a,b} ,D_{b,c} \} \times \delta \left( {\bar{R}_{a} } \right) $$

(6)

where Min{x,y} represents the minimum value of x and y. Finally, based on (7), we can calculate the predicted scores of the non-rated items of user a.

$$ PR_{a,i} = \frac{1}{{1 + D_{a,b} }} \times \bar{R}_{a} + \frac{{D_{a,b} }}{{1 + D_{a,b} }} \times \left( {R_{b,i} - \bar{R}_{b} } \right) $$

(7)

3.3 Example Analysis

Suppose that 4 users can rate 6 kinds of items, the value range is [1, 5], and the greater the score value is, the higher satisfaction the user have. The user’s original scoring matrix is shown in Table 1. N indicates the user does not rate the item. It can be found that user U₁ is a cold start user, which means this user only rates few items.

Table 1. User original scoring matrix

Full size table

According to (1), (2) and (3), we can calculate the Pearson Correlation between users. The calculation results are shown in Table 2. Then, according to (4) and (5), users’ trust values can be calculated, Table 1 shows that U₁ and U₃, U₁ and U₄ have no public ratings, which means there are no direct trust relationship between them. Therefore, the trust values between these two groups of users cannot be calculated directly, which can be marked with “NULL”, as shown in Table 3.

Table 2. User person correlation matrix

Full size table

Table 3. User direct trust value matrix

Full size table

According to Table 3, U₁ can establish indirect trust relationship with U₃ and U₄ via U₂, and the specific indirect trust values can be calculated in (6). The complete user trust value matrix will be constructed, as shown in Table 4. Finally, based on the direct or indirect trust values of users, the scores of the users’ unrated items are predicted by (7), as shown in Table 5.

Table 4. User trust matrix

Full size table

Table 5. User predicted score matrix

Full size table

4 Experimental Evaluation and Discussion

The MovieLens 100 K data set is selected as the rating databases of recommendation system, and this data set is provided by the GroupLens research team at the University of Minnesota U.S, which includes 100000 ratings records of 943 users to 1682 movies. Each rating record includes a user ID, an item ID, a value of rating in the interval [1, 5] and a TimeStamp.

4.1 Evaluation Metrics

The standard Mean Absolute Error (MAE) has been used to evaluate the quality of recommendations. MAE is the most widely used metric to measure the accuracy of recommendations. Note that a lower MAE value represents a higher recommendation accuracy. The measurement for MAE can be given by (8):

$$ MAE = \frac{{\sum\limits_{i = 1}^{n} {\left| {RA_{a,i} - RP_{a,i} } \right|} }}{n} $$

(8)

where PA_a,i denotes actual rating, RP_a,i represents system’s predicted rating, and n denotes the number of available items.

Coverage is used to evaluate the ability of a specific recommendation system to provide recommendations, which is calculated by (9):

$$ Cov = \frac{{\left| {N_{p} } \right|}}{n} $$

(9)

where |N_p| denotes the number of items for which a prediction rating has been made, n denotes the number of available items.

Precision and Recall are usually used to evaluate the efficiency of recommendation system. Precision can be figured out with (10):

$$ P_{u} = \frac{{N_{tp} }}{{N_{tp} + N_{fp} }} $$

(10)

where N_tp represents the number of items which are recommended and meanwhile appreciated by users; N_fp represents the number of items which are recommended while not approved by users.

Similarly, Recall can be defined by (11):

$$ R_{u} = \frac{{N_{tp} }}{{N_{tp} + N_{tn} }} $$

(11)

Where N_tn represents the number of items which are not recommended while liked by users. In this paper, if the user’s rating of item is greater than or equal to 3, that means the item is preferred by this user.

In general, the more comprehensive evaluation F-Measure can be obtained by weighting and averaging Precision and Recall, as shown in (12):

$$ F - Measure = \frac{{\left( {\alpha^{2} + 1} \right)P_{u} \times R_{u} }}{{\alpha^{2} \left( {P_{u} + R_{u} } \right)}} $$

(12)

where the value of α is usually set to 1.

4.2 Experimental Results and Analysis

In this section, TDR will be compared with other typical recommendation algorithms to verify the performance of our proposal. The data sets come from MovieLens. Zhao-UCF is a user-based collaborative filtering recommendation algorithm and its similarity is calculated using the Person correlation [4]. TSF algorithm [6] and O’Donovan-Trust algorithm [7] are based on trust relationship. These algorithms have been viewed in the related work.

We calculate the values of MAE of these recommendation algorithms by changing the number of user neighbors. Figure 3(a) shows that TDR has the highest recommended accuracy with any given number of neighbors. It owes to the rational utilization of user trust relationship, which has significantly improved the recommended accuracy, especially the number of neighbour is between 30 and 70. Figure 3(b) illustrates that TDR has the highest Coverage with any given number of neighbors. This is because TDR not only adopts the improved Pearson correlation, but also employs the user’s trust relationship, which causes a significant improvement in recommendation coverage compared with other recommendation algorithms.

MAE and Coverage of TDR are measured when the number of cold-start users varying, as shown in Fig. 4(a) and (b). It can be found that in comparison with other recommended algorithms, TDR achieves better performance on both MAE and Coverage. The experimental results show the effectiveness of TDR when dealing with the problem of cold-start users.

Figure 5(a) and (b) show the Precision and Recall rates of the TDR, respectively. Figure 5(a) illustrates that TDR keeps a high Precision rate in any given neighbors. Figure 5(b) shows that TDR also has a high Recall rate compared with other algorithms. The experimental results demonstrate the user satisfaction of TDR.

Figure 6 shows the F-Measure performance obtained by weighting and averaging the Precision and Recall rates. Both of them in Fig. 5(a) and (b) are higher than others in any given number of neighbors, so in Fig. 6 the F-Measure of TDR has an advantage over other algorithms.

5 Conclusions

In view of data sparsity and cold-start problems in recommendation system, we design a novel recommendation approach based on trust delivery (TDR), which quantifies user trust relationship to calculate user’s trust values and scores for unrated items. Through experimental comparisons with typical algorithms, TDR achieves more desirable performance in terms of recommendation accuracy and coverage, and meanwhile the trust transitive relationship between users can effectively solve the cold user problem and provide higher recommendation satisfaction.

Most of the existing methods focus on the trust relationship network among users. However, there is few research on the distrust relationship among users, which should become one of our future work. In addition, the improvement of similarity measure between items deserves further concern in order to alleviate the case of item cold-start and improve the quality of recommendation.

References

Dakhel, A.M., Malazi, H.T., Mahdavi, M.: A social recommender system using item asymmetric correlation. Appl. Intell. 5, 1–14 (2017)
Google Scholar
Zheng, Z., Ma, H., Lyu, M.R.: A collaborative filtering based web service recommender system. In: IEEE International Conference on Web Services, Los Angeles, CA, USA, 6–10 July 2009, pp. 437–444. IEEE, July 2009
Google Scholar
Zhao, Y.S., Liu, Y.P., Zeng, Q.A.: A weight-based item recommendation approach for electronic commerce systems. Electron. Commer. Res. 17(2), 205–226 (2017)
Article Google Scholar
Zhao, Z.D., Shang, M.S.: User-based collaborative-filtering recommendation algorithms on Hadoop. In: International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, 9–10 January 2010, pp. 478–481. IEEE, January 2010
Google Scholar
Hwang, C.-S., Chen, Y.-P.: Using trust in collaborative filtering recommendation. In: Okuno, H.G., Ali, M. (eds.) IEA/AIE 2007. LNCS (LNAI), vol. 4570, pp. 1052–1060. Springer, Heidelberg (2007). https://doi.org/10.1007/978-3-540-73325-6_105
Chapter Google Scholar
Shambour, Q., Lu, J.: A trust-semantic fusion-based recommendation approach for e-business applications. Decis. Support Syst. 54(1), 768–780 (2012)
Article Google Scholar
O’Donovan, J., Smyth, B.: Trust in recommender systems. In: International Conference on Intelligent User Interfaces, San Diego, California, USA, 10–13 January 2005, pp. 167–174. ACM, January 2005
Google Scholar
Cao, G., Kuang, L.: Identifying core users based on trust relationships and interest similarity in recommender system. In: IEEE International Conference on Web Services, San Francisco, USA, 27 June–2 July 2016, pp. 284–291. IEEE, June–July 2016
Google Scholar

Download references

Acknowledgement

This work was supported by the National Natural Science Foundation of P. R. China (No. 61672297), the Key Research and Development Program of Jiangsu Province (Social Development Program, No. BE2017742).

Author information

Authors and Affiliations

Nanjing University of Posts and Telecommunications, Nanjing, Jiangsu, China
Peng Wang, Haiping Huang, Jie Zhu & Lingtao Qi

Authors

Peng Wang
View author publications
You can also search for this author in PubMed Google Scholar
Haiping Huang
View author publications
You can also search for this author in PubMed Google Scholar
Jie Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Lingtao Qi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Haiping Huang .

Editor information

Editors and Affiliations

GoDaddy, Kirkland, Washington, USA
Alvin Yang
Deloitte Consulting, New York, New York, USA
Siva Kantamneni
IBM Thomas J. Watson Research Center, Yorktown Heights, New York, USA
Ying Li
Etihad Airways, Abu Dhabi, United Arab Emirates
Awel Dico
Huawei Technologies, Shenzhen, China
Xiangang Chen
Siemens Healthcare Diagnostics, Sacramento, California, USA
Rajesh Subramanyan
Kingdee International Software Group Co., Ltd, Shenzhen, China
Liang-Jie Zhang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wang, P., Huang, H., Zhu, J., Qi, L. (2018). A Trust-Based Prediction Approach for Recommendation System. In: Yang, A., et al. Services – SERVICES 2018. SERVICES 2018. Lecture Notes in Computer Science(), vol 10975. Springer, Cham. https://doi.org/10.1007/978-3-319-94472-2_12

Download citation

DOI: https://doi.org/10.1007/978-3-319-94472-2_12
Published: 23 June 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-94471-5
Online ISBN: 978-3-319-94472-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

A Trust-Based Prediction Approach for Recommendation System

Abstract

Similar content being viewed by others

ETBRec: a novel recommendation algorithm combining the double influence of trust relationship and expert users

A trust-based collaborative filtering algorithm for E-commerce recommendation system

Context-aware recommender system using trust network

Keywords

1 Introduction

2 Related Work and Background

2.1 Related Work

2.2 Background Knowledge

3 A Trust-Based Delivery Recommendation Method

3.1 Calculate the Pearson Correlation Between Users

3.2 Trust Calculation and Propagation

3.3 Example Analysis

4 Experimental Evaluation and Discussion

4.1 Evaluation Metrics

4.2 Experimental Results and Analysis

5 Conclusions

References

Acknowledgement

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us