Elsevier

Decision Support Systems

Volume 54, Issue 2, January 2013, Pages 1055-1064
Decision Support Systems

Mining consensus preference graphs from users' ranking data

https://doi.org/10.1016/j.dss.2012.10.031Get rights and content

Abstract

The group ranking problem consists of constructing coherent aggregated results from preference data provided by decision makers. Traditionally, the output of a group ranking problem can be classified into ranking lists and maximum consensus sequences. In this study, we propose a consensus preference graph approach to represent the coherent aggregated results of users' preferences. The advantages of our approach are that (1) the graph is built based on users' consensuses, (2) the graph can be understood intuitively, and (3) the relationships between items can be easily seen. An algorithm is developed to construct the consensus preference graph from users' total ranking data. Finally, extensive experiments are carried out using synthetic and real data sets. The experimental results indicate that the proposed method is computationally efficient, and can effectively identify consensus graphs.

Highlights

► This study proposes a new output type of group ranking problem. ► The new output is a consensus preference graph of items. ► The output graph can be understood intuitively and the relationships between items can be easily seen. ► The experimental results show that the proposed method is efficient and effective.

Introduction

The group ranking decision process involves aggregating individual rankings to obtain a representative group ranking. In other words, the group ranking algorithm generates consolidated ranking results that represent the group will, preference, or decision based on decision makers' preference data. In recent decades, the group ranking problem has become an important and interesting issue in decision making [11], [15], machine learning [14], web search strategies [2], [7] and others. The essence of this problem is how to consolidate and aggregate decision makers' rankings to obtain a group ranking that represents “better coherent” ordering in regards to the decision makers' rankings.

Generally, the traditional group ranking problem can be classified using three aspects: the completeness of the user-provided preference information, the input format used to express users' preferences, and the type of compromised output results. The group ranking problem can be roughly classified into two major approaches based on the completeness of the decision maker's preference information: the total ranking approach [7], [17], [20], [21], [22], [23] and the partial ranking approach [3], [4], [8], [9], [10], [18], [19]. The former requires individuals to appraise all items (called alternatives), while the latter appraises only a subset of items. There are three typical input formats decision makers can use express their item preferences: weighting models, pair-wise comparisons, and ranking lists. All three formats have been used in previous studies to express individuals' input preferences. These formats may not be perfect, but they express user preferences reasonably well in most practical situations. Depending on the input format adopted, users are asked to rank (in the ranking list model), rate (in the weighting model), or compare (in the pair-wise comparison model) the items. After all preference data have been collected, an algorithm is applied to generate the consolidated output results. In previous research, output results could be divided into two main types. One is a total ranking list, which is an ordering list of all items that represent the achieved consensus. The other is a maximum consensus sequence, which gives the longest ranking lists of items that agree with the majority and disagree with the minority.

Unfortunately, both output formats have their own weaknesses. Most previous ranking list approaches attempted to minimize the total disagreement between multiple input rankings in order to obtain an overall ranking list that represented the achieved consensus. This disregards the fact that user opinions may be discordant and have no consensus, forcing a complete ranking result even if there is no consensus or only a slight consensus. In such a situation, what we obtain is merely the algorithm output, since different algorithms derive different ranking results due to their different designs. To overcome this weakness, Chen and Cheng [5], [6] proposed the maximum consensus approach, which generates only those maximum sequences on which users have consensus, meaning they are agreed upon by a majority of users and disagreed with by a minority of users. However, this approach may generate many maximum consensus sequences, making the results fragmented and difficult to understand and use.

Therefore, we propose a method that finds consensus preferences and represents these relationships as a graph. This is called a preference graph, where the relationships are agreed upon by majority of users and disagreed with by only a minority of users. Accordingly, we develop algorithms to discover preference graphs from users' ranking lists, and use the graphs to present the preferences of all users.

Example 1

Suppose we have the three ranking lists shown in Table 1. We will show their consolidated results in the ranked order, maximum consensus sequence, and preference graph formats.

Using a total ranking list, we may get the result {A  C > B  D > E}, which represents a coherent ranking of all items. There is no consensus, however, on the rankings of A and C in the preference data. The reason they are arranged that way is simply because we are forced into a complete ranking.

Using maximum consensus sequences, the longest patterns of coherent item rankings are {A  D > E} and {C > B > E}. The problem is that it may output many consensus sequences that need to be checked. Additionally, the preference between items A and C is unknown.

Using our approach, the result may look like Fig. 1. Items in the same cluster are similarly preferred by users. Therefore, items A and C are similarly preferred, and B and D are similarly preferred. Additionally, G1 is preferred more than G2 and G3, and G2 is preferred more than G3. There are several advantages to this approach. First, the graph is built based on users' consensuses. Second, the graph can be intuitively understood. Third, relationships between items can be observed from the graph.

This paper is divided into six sections. Our motivations are discussed in Section 1. Section 2 reviews related works. Section 3 defines the problem of mining consensus preference graphs and provides definitions. Section 4 introduces the preference graph mining algorithm. Experimental results are presented in Section 5. Finally, we draw conclusions in Section 6.

Section snippets

Related work

In this section, we review literature regarding the group ranking problem. As shown in Table 2, the group ranking problem can be classified using three features: the completeness of input preference information, the type of input format, and the compromised output format.

When looking at the completeness of users' item appraisals, the group ranking problem can be identified as using the total ranking approach or the partial ranking approach. In the total ranking approach, all individuals have to

Problem definition

In this section, we formally define the problem of mining consensus preference graphs from users' ranking data. Let U = {u1, u2,…,um} and I = {i1, i2,…,in} denote the sets of all users and all items, respectively. Each user ui creates a ranked list of all items that expresses his/her preferences. The ranked list of user ui can be represented as a sequence Si = {a1  a2 ⊕…⊕ an}.

Each user sequence must satisfy the following conditions: first, an item aj  I, where 1  j  n, cannot appear more than once in a

The algorithm

In this section, we propose a genetic algorithm (GA) to discover preference graphs from users' total ranking lists. The procedure is listed below.

  • 1.

    Sort the items' scores and partition the sorted data into k groups.

  • 2.

    Iteratively generate |P| chromosomes by the following steps.

    • (1)

      For each item score, randomly increase or decrease by a percentage no more than R%.

    • (2)

      Sort the data and partition them into k groups.

  • 3.

    Build the preference graph for every chromosome.

  • 4.

    Iteratively use GA algorithm to re-cluster

Experiments

To evaluate the efficiency and effectiveness of the proposed preference graph algorithm, we performed several experiments using synthetic data sets. In this section, we first describe the generation of the synthetic data set and the comparisons of run time and objective function. In the second portion, a real case study is applied to show the usefulness of consensus sequence mining in practice.

Conclusions

Generally, traditional group ranking problems can be classified according to the completeness of the user-provided preference information, the types of compromise outcomes, and the format used to express user preferences. In this work, we proposed a method that can find maximum agreeable preferences and represent the results as a graph. This is called a preference graph. An algorithm was developed to find a preference graph from users' ranking data. Extensive experiments were also carried out

Acknowledgment

It is our pleasure to acknowledge the anonymous reviewers for their valuable suggestions and the careful reading of our manuscript. The authors would like to express our gratitude to these reviewers for their suggestions that helped to substantially improve our paper. This study was supported by the National Science Council of Taiwan under grant NSC 97-2410-H-031-056 and 101-2410-H-008-008-MY3.

Yen-Liang Chen is Professor of Information Management at National Central University of Taiwan. He received his Ph.D. degree in computer science from National Tsing Hua University, Hsinchu, Taiwan. His current research interests include data mining, social network analysis, and decision making models. He has published papers in Decision Support Systems, IEEE Transactions on Software Engineering, IEEE Transactions on Knowledge and Data Engineering, IEEE Transactions on SMC, Information &

References (23)

  • W. Cohen

    Learning to order things

    Journal of Artificial Intelligence Research

    (1999)
  • Cited by (12)

    • Post-consensus analysis of group decision making processes by means of a graph theoretic and an association rules mining approach

      2020, Omega (United Kingdom)
      Citation Excerpt :

      The term “consensus graph” has been used frequently in the literature but in different contexts. For instance, [4,5,40,48] describe graph theoretic approaches for visualizing ranking decision by experts and then capturing the “average” or “most representative” ranking decision that somehow, reflects the best consensus among the various experts. The use of the term “consensus graph” in this paper is different as it refers to a graph which expresses whether any pair of experts agree on some ranking of the alternatives or not.

    • A new consensus mining approach to group ranking problems involving different intensities of preferences

      2019, Computers and Industrial Engineering
      Citation Excerpt :

      Borda count (Borda, 1781) accumulates the ranks of each alternative specified by users to achieve a total ranking list. Most group ranking methods assume that every voter is expected to articulate preferred order among alternatives (Arrow, 1983; Chen, Cheng, & Huang, 2013; Cheng, Chen, & Chiang, 2016). However, although voters have the same preferences, the intensities of their preferences may be quite different.

    • Discovering diverse human behavior from two-dimensional preferences

      2018, Knowledge-Based Systems
      Citation Excerpt :

      Completeness of preference ranking: When items are numerous, it is troublesome for users to provide all considerations for the items. According to the completeness of the information provided by users, the methods can be divided into the total ranking approach [4,10,38] and the partial ranking approach [1,7,36,39]. Consider a set of four items, A, B, C and D.

    View all citing articles on Scopus

    Yen-Liang Chen is Professor of Information Management at National Central University of Taiwan. He received his Ph.D. degree in computer science from National Tsing Hua University, Hsinchu, Taiwan. His current research interests include data mining, social network analysis, and decision making models. He has published papers in Decision Support Systems, IEEE Transactions on Software Engineering, IEEE Transactions on Knowledge and Data Engineering, IEEE Transactions on SMC, Information & Management, Information Processing & Management, Journal of American Society of Information Science and Technology, Information Systems, Operations Research, Naval Research Logistics, Transportation Research — part B, European Journal of Operational Research, and many others. He is the former editor-in-chief of Journal of Information Management and that of Journal of e-Business.

    Li-Chen Cheng is an Associate Professor of Department of Computer Science and Information Management, Soochow University, Taipei, Taiwan. She received her Ph.D. degree in information management from National Central University, Chung-Li, Taiwan. Her current research interests include data mining, information retrieval and EC technologies. She has published papers in Decision Support Systems, Electronic Commerce Research and Applications, European Journal of Operational Research, and many others.

    Po-Hsiang Huang received the M.S. degree in Information Management from National Central University, Chung-Li, Taiwan. His research interests include data mining, information systems and EC technologies.

    View full text