Mining consensus preference graphs from users' ranking data

doi:10.1016/j.dss.2012.10.031

Decision Support Systems

Volume 54, Issue 2, January 2013, Pages 1055-1064

https://doi.org/10.1016/j.dss.2012.10.031 Get rights and content

Abstract

The group ranking problem consists of constructing coherent aggregated results from preference data provided by decision makers. Traditionally, the output of a group ranking problem can be classified into ranking lists and maximum consensus sequences. In this study, we propose a consensus preference graph approach to represent the coherent aggregated results of users' preferences. The advantages of our approach are that (1) the graph is built based on users' consensuses, (2) the graph can be understood intuitively, and (3) the relationships between items can be easily seen. An algorithm is developed to construct the consensus preference graph from users' total ranking data. Finally, extensive experiments are carried out using synthetic and real data sets. The experimental results indicate that the proposed method is computationally efficient, and can effectively identify consensus graphs.

Highlights

► This study proposes a new output type of group ranking problem. ► The new output is a consensus preference graph of items. ► The output graph can be understood intuitively and the relationships between items can be easily seen. ► The experimental results show that the proposed method is efficient and effective.

Introduction

The group ranking decision process involves aggregating individual rankings to obtain a representative group ranking. In other words, the group ranking algorithm generates consolidated ranking results that represent the group will, preference, or decision based on decision makers' preference data. In recent decades, the group ranking problem has become an important and interesting issue in decision making [11], [15], machine learning [14], web search strategies [2], [7] and others. The essence of this problem is how to consolidate and aggregate decision makers' rankings to obtain a group ranking that represents “better coherent” ordering in regards to the decision makers' rankings.

Generally, the traditional group ranking problem can be classified using three aspects: the completeness of the user-provided preference information, the input format used to express users' preferences, and the type of compromised output results. The group ranking problem can be roughly classified into two major approaches based on the completeness of the decision maker's preference information: the total ranking approach [7], [17], [20], [21], [22], [23] and the partial ranking approach [3], [4], [8], [9], [10], [18], [19]. The former requires individuals to appraise all items (called alternatives), while the latter appraises only a subset of items. There are three typical input formats decision makers can use express their item preferences: weighting models, pair-wise comparisons, and ranking lists. All three formats have been used in previous studies to express individuals' input preferences. These formats may not be perfect, but they express user preferences reasonably well in most practical situations. Depending on the input format adopted, users are asked to rank (in the ranking list model), rate (in the weighting model), or compare (in the pair-wise comparison model) the items. After all preference data have been collected, an algorithm is applied to generate the consolidated output results. In previous research, output results could be divided into two main types. One is a total ranking list, which is an ordering list of all items that represent the achieved consensus. The other is a maximum consensus sequence, which gives the longest ranking lists of items that agree with the majority and disagree with the minority.

Unfortunately, both output formats have their own weaknesses. Most previous ranking list approaches attempted to minimize the total disagreement between multiple input rankings in order to obtain an overall ranking list that represented the achieved consensus. This disregards the fact that user opinions may be discordant and have no consensus, forcing a complete ranking result even if there is no consensus or only a slight consensus. In such a situation, what we obtain is merely the algorithm output, since different algorithms derive different ranking results due to their different designs. To overcome this weakness, Chen and Cheng [5], [6] proposed the maximum consensus approach, which generates only those maximum sequences on which users have consensus, meaning they are agreed upon by a majority of users and disagreed with by a minority of users. However, this approach may generate many maximum consensus sequences, making the results fragmented and difficult to understand and use.

Therefore, we propose a method that finds consensus preferences and represents these relationships as a graph. This is called a preference graph, where the relationships are agreed upon by majority of users and disagreed with by only a minority of users. Accordingly, we develop algorithms to discover preference graphs from users' ranking lists, and use the graphs to present the preferences of all users.

Example 1

Suppose we have the three ranking lists shown in Table 1. We will show their consolidated results in the ranked order, maximum consensus sequence, and preference graph formats.

Using a total ranking list, we may get the result {A ≥ C > B ≥ D > E}, which represents a coherent ranking of all items. There is no consensus, however, on the rankings of A and C in the preference data. The reason they are arranged that way is simply because we are forced into a complete ranking.

Using maximum consensus sequences, the longest patterns of coherent item rankings are {A ≥ D > E} and {C > B > E}. The problem is that it may output many consensus sequences that need to be checked. Additionally, the preference between items A and C is unknown.

Using our approach, the result may look like Fig. 1. Items in the same cluster are similarly preferred by users. Therefore, items A and C are similarly preferred, and B and D are similarly preferred. Additionally, G₁ is preferred more than G₂ and G₃, and G₂ is preferred more than G₃. There are several advantages to this approach. First, the graph is built based on users' consensuses. Second, the graph can be intuitively understood. Third, relationships between items can be observed from the graph.

This paper is divided into six sections. Our motivations are discussed in Section 1. Section 2 reviews related works. Section 3 defines the problem of mining consensus preference graphs and provides definitions. Section 4 introduces the preference graph mining algorithm. Experimental results are presented in Section 5. Finally, we draw conclusions in Section 6.

Section snippets

Related work

In this section, we review literature regarding the group ranking problem. As shown in Table 2, the group ranking problem can be classified using three features: the completeness of input preference information, the type of input format, and the compromised output format.

When looking at the completeness of users' item appraisals, the group ranking problem can be identified as using the total ranking approach or the partial ranking approach. In the total ranking approach, all individuals have to

Problem definition

In this section, we formally define the problem of mining consensus preference graphs from users' ranking data. Let U = {u₁, u₂,…,u_m} and I = {i₁, i₂,…,i_n} denote the sets of all users and all items, respectively. Each user u_i creates a ranked list of all items that expresses his/her preferences. The ranked list of user u_i can be represented as a sequence S_i = {a₁ ⊕ a₂ ⊕…⊕ a_n}.

Each user sequence must satisfy the following conditions: first, an item a_j ∈ I, where 1 ≤ j ≤ n, cannot appear more than once in a

The algorithm

In this section, we propose a genetic algorithm (GA) to discover preference graphs from users' total ranking lists. The procedure is listed below.

1.
Sort the items' scores and partition the sorted data into k groups.
2.
Iteratively generate |P| chromosomes by the following steps.
- (1)
  For each item score, randomly increase or decrease by a percentage no more than R%.
- (2)
  Sort the data and partition them into k groups.
3.
Build the preference graph for every chromosome.
4.
Iteratively use GA algorithm to re-cluster

Experiments

To evaluate the efficiency and effectiveness of the proposed preference graph algorithm, we performed several experiments using synthetic data sets. In this section, we first describe the generation of the synthetic data set and the comparisons of run time and objective function. In the second portion, a real case study is applied to show the usefulness of consensus sequence mining in practice.

Conclusions

Generally, traditional group ranking problems can be classified according to the completeness of the user-provided preference information, the types of compromise outcomes, and the format used to express user preferences. In this work, we proposed a method that can find maximum agreeable preferences and represent the results as a graph. This is called a preference graph. An algorithm was developed to find a preference graph from users' ranking data. Extensive experiments were also carried out

Acknowledgment

It is our pleasure to acknowledge the anonymous reviewers for their valuable suggestions and the careful reading of our manuscript. The authors would like to express our gratitude to these reviewers for their suggestions that helped to substantially improve our paper. This study was supported by the National Science Council of Taiwan under grant NSC 97-2410-H-031-056 and 101-2410-H-008-008-MY3.

References (23)

Y.L. Chen et al.
Mining maximum consensus sequences from group ranking data
European Journal of Operational Research
(2009)
Y.L. Chen et al.
An approach to group ranking decisions in a dynamic environment
Decision Support Systems
(2010)
W.D. Cook et al.
Creating a consensus ranking of proposals from reviewer's partial ordinal rankings
Computers & Operations Research
(2007)
S. Damart et al.
Supporting groups in sorting decisions: methodology and use of a multi-criteria aggregation/disaggregation DSS
Decision Support Systems
(2007)
E. Fernandez et al.
An agent model based on ideas of concordance and discordance for group ranking problems
Decision Support Systems
(2005)
S. Greco et al.
Ordinal regression revisited: multiple criteria ranking using a set of additive value functions
European Journal of Operational Research
(2008)
J. Bartholdi et al.
Voting schemes for which it can be difficult to tell who won the election
Social Choice and Welfare
(1989)
M.M.S. Beg et al.
Soft computing techniques for rank aggregation on the World Wide Web
World Wide Web: Internet and Web Information Systems
(2003)
K. Bogart
Preference structures I: distances between transitive preference relations
Journal of Mathematical Sociology
(1973)
K. Bogart
Preference structures II: distances between asymmetric relations
SIAM Journal of Applied Mathematics
(1975)

W. Cohen

Learning to order things

Journal of Artificial Intelligence Research

(1999)

Cited by (12)

Post-consensus analysis of group decision making processes by means of a graph theoretic and an association rules mining approach
2020, Omega (United Kingdom)
Citation Excerpt :
The term “consensus graph” has been used frequently in the literature but in different contexts. For instance, [4,5,40,48] describe graph theoretic approaches for visualizing ranking decision by experts and then capturing the “average” or “most representative” ranking decision that somehow, reflects the best consensus among the various experts. The use of the term “consensus graph” in this paper is different as it refers to a graph which expresses whether any pair of experts agree on some ranking of the alternatives or not.
It is assumed that a group of experts is tasked to evaluate (rank) a finite set of alternatives during a group decision making (GDM) session. The GDM session may go through a number of iterations (stages) to reach a consensus. At each iteration at least one of the experts changes his/her ranking of some of the alternatives. The session terminates when a consensus has been reached or no expert is willing to alter his/her ranking. In the latter case a compromised consensus is somehow determined. It is also assumed that a data recording mechanism exists that keeps log files with information on each session and the iterations involved at each session. Such logs describe how each individual expert has ranked the alternatives at each iteration of each session. It is also assumed that there is a supervisory authority of this GDM process. This authority wishes to analyze the log files to extract any actionable insights. An approach based on some graph theoretic and the mining of association rules is proposed to identify any dynamics that may exist in the way the experts make ranking decisions. Such analysis may reveal unknown, but potentially useful information, on the way the experts make decisions and also on the way the experts may interact with each other. Knowing such relationships may be pivotal on the way the groups of experts need to be formed and operate during the GDM sessions. Some experimental results based on synthetic data are described and analyzed in terms of the proposed approaches.
A new consensus mining approach to group ranking problems involving different intensities of preferences
2019, Computers and Industrial Engineering
Citation Excerpt :
Borda count (Borda, 1781) accumulates the ranks of each alternative specified by users to achieve a total ranking list. Most group ranking methods assume that every voter is expected to articulate preferred order among alternatives (Arrow, 1983; Chen, Cheng, & Huang, 2013; Cheng, Chen, & Chiang, 2016). However, although voters have the same preferences, the intensities of their preferences may be quite different.
Discovering the group priority from a set of user preferences plays an important role in group decision making because of its extensive applications in practice. In most group ranking problems, users are supposed to input a ranking of alternatives regardless of the intensities of preference. However, if two users specify that they prefer A to B, the intensities of preference may be quite different. In addition, most researchers have tried to determine a total ranking list by minimizing total differences among user preferences, but users might have little consensus on the final results. This study aims to propose a new consensus-based approach for group ranking problems involving different intensities of preference. Stemming from the concept of consensus mining, consensus relationships are discovered by three accumulation matrices and consensus thresholds. An optimization model incorporating the consensus relationships and the concept of Borda majority count is then developed to derive a total ranking list. Compared to previous studies, the proposed approach can treat group ranking problems involving different intensities of preference, reduce the occurrence of ties, and achieve a total ranking list reflecting the consensus preference of the majority of users.
Discovering diverse human behavior from two-dimensional preferences
2018, Knowledge-Based Systems
Citation Excerpt :
Completeness of preference ranking: When items are numerous, it is troublesome for users to provide all considerations for the items. According to the completeness of the information provided by users, the methods can be divided into the total ranking approach [4,10,38] and the partial ranking approach [1,7,36,39]. Consider a set of four items, A, B, C and D.
Among the many types of ambiguous and diverse human behaviors, preference ranking and preferred temporal relationships are two human subjective perceptions, and both can be expressed as sequences. For example, information management researchers prefer journal A over B, and this preference can be represented as the ranking sequence: (A > B). When the order of submission is first A and then B, it can be expressed as the temporal sequence: (A → B). In practice, these two preference sequences may be applicable to people with regard to the same items at the same time, which is called “two-dimensional preference” in this study. Based on these concepts, this study defines a novel model and an associated algorithm for mining two-dimensional patterns by combining preference ranking and temporal sequences. The discovered two-dimensional patterns can be categorized into eight types, including consensus, ranking-compromise, temporal-compromise, and conflict patterns. Two experiments in two application areas, namely journal submission and stock purchase, were designed to collect authentic datasets, and demonstrate their managerial meaning. The experimental results show that in most cases, the ordering of the temporal sequences follows the preference ranking sequence, except for several two-dimensional patterns involving high-risk items.
Identifying conflict patterns to reach a consensus - A novel group decision approach
2016, European Journal of Operational Research
In recent years, the group ranking problem has become an important subject of study. In most group ranking problems, the focus is on identifying consensuses. No previous research has involved identifying conflicting opinions, called conflict patterns in this paper, among decision-makers. We define conflict patterns as orderings of alternatives that have roughly the same numbers of advantages and disadvantages. Conflict patterns can reveal the ranking of which alternatives are the most controversial among decision-makers and who the supporters and opponents are. Using conflict pattern data, decision-makers can communicate with people with differing opinions and attempt to resolve the differences.
In this study, an algorithm, Mining Conflict Patterns, was developed to identify conflict patterns from users’ partial ranking data. Extensive experiments were conducted using synthetic and real data sets. The results indicate that the proposed method is computationally efficient and can effectively identify conflict patterns among all users.
Organisational structure and performance of consensus decisions through mutual influences: A computer simulation approach
2016, Decision Support Systems
This paper models and simulates the formation of consensus through information sharing and social influences among bounded rational individuals connected through a communication network, who collectively decide whether to adopt a new project, policy or idea (innovation in short) or not. Next we examine the sensitivity of group fallibility in the collective adoption decision and the time spent on reaching consensus to parameters of the model such as the economic value of the innovation, the connectivity and size of the network and the degree of social influences. We find that group consensus decisions reduce the probability of commission errors to negligible values (adopting value-destroying innovations), but the probability of omission errors (rejecting value-creating ones) and the time to reach consensus are sensitive to the exogenous parameters with some trade-offs: higher average connectivity of communication networks increases the likelihood of making omission errors in adoption but reduces the time to reach consensus. The results of this research are relevant to managerial decisions affecting the performance of consensus by mutual influences on decision making in the expanding production and exchange systems of information-intensive networks of empowered workers.
A new group ranking approach for ordinal preferences based on group maximum consensus sequences
2016, European Journal of Operational Research
Group ranking problems involve aggregating individual rankings to generate group ranking which represents consolidated group preference. Group ranking problems are commonly applied in real-world decision-making problems; however, supporting a group decision-making process is difficult due to the existence of multiple decision-makers, each with his/her own opinions. Hence, determining how to best aid the group ranking process is an important consideration. This study aims to determine a total ranking list which meets group consensus preferences for group ranking problems. A new group consensus mining approach based on the concept of tournament matrices and directed graphs is first developed; an optimization model involving maximum consensus sequences is then constructed to achieve a total ranking list. Compared to previous methods, the proposed approach can generate a total ranking list involving group consensus preferences. It can also determine maximum consensus sequences without the need for tedious candidate generation processes, while also providing flexibility in solving ranking problems using different input preferences that vary in format and completeness. In addition, consensus levels are adjustable.

View all citing articles on Scopus

Yen-Liang Chen is Professor of Information Management at National Central University of Taiwan. He received his Ph.D. degree in computer science from National Tsing Hua University, Hsinchu, Taiwan. His current research interests include data mining, social network analysis, and decision making models. He has published papers in Decision Support Systems, IEEE Transactions on Software Engineering, IEEE Transactions on Knowledge and Data Engineering, IEEE Transactions on SMC, Information & Management, Information Processing & Management, Journal of American Society of Information Science and Technology, Information Systems, Operations Research, Naval Research Logistics, Transportation Research — part B, European Journal of Operational Research, and many others. He is the former editor-in-chief of Journal of Information Management and that of Journal of e-Business.

Li-Chen Cheng is an Associate Professor of Department of Computer Science and Information Management, Soochow University, Taipei, Taiwan. She received her Ph.D. degree in information management from National Central University, Chung-Li, Taiwan. Her current research interests include data mining, information retrieval and EC technologies. She has published papers in Decision Support Systems, Electronic Commerce Research and Applications, European Journal of Operational Research, and many others.

Po-Hsiang Huang received the M.S. degree in Information Management from National Central University, Chung-Li, Taiwan. His research interests include data mining, information systems and EC technologies.

View full text

Mining consensus preference graphs from users' ranking data

Abstract

Highlights

Introduction

Section snippets

Related work

Problem definition

The algorithm

Experiments

Conclusions

Acknowledgment

European Journal of Operational Research

Decision Support Systems

Computers & Operations Research

Decision Support Systems

Decision Support Systems

European Journal of Operational Research

Voting schemes for which it can be difficult to tell who won the election

Social Choice and Welfare

Soft computing techniques for rank aggregation on the World Wide Web

World Wide Web: Internet and Web Information Systems

Preference structures I: distances between transitive preference relations

Journal of Mathematical Sociology

Preference structures II: distances between asymmetric relations

SIAM Journal of Applied Mathematics

Learning to order things

Journal of Artificial Intelligence Research