Decision Support
Mining maximum consensus sequences from group ranking data

https://doi.org/10.1016/j.ejor.2008.09.004Get rights and content

Abstract

In the last decade, the problem of getting a consensus group ranking from all users’ ranking data has received increased attention due to its widespread applications. Previous research solved this problem by consolidating the opinions of all users, thereby obtaining an ordering list of all items that represent the achieved consensus. The weakness of this approach, however, is that it always produces a ranking list of all items, regardless of how many conflicts exist among users. This work rejects the forced agreement of all items. Instead, we define a new concept, maximum consensus sequences, which are the longest ranking lists of items that agree with the majority and disagree only with the minority. Based on this concept, algorithm MCS is developed to determine the maximum consensus sequences from users’ ranking data, and also to identify conflict items that need further negotiation. Extensive experiments are carried out using synthetic data sets, and the results indicate that the proposed method is computationally efficient. Finally, we discuss how the identified consensus sequences and conflict items information can be used in practice.

Introduction

In recent decades, the group ranking problem has become an important and interesting study in decision making [8], [9], [10], machine learning [14], web search strategies [3], [7], and sports and behavioral issues. The essence of this problem is how to consolidate and aggregate decision makers’ rankings to obtain a group ranking that is representative of “better coherent” ordering for the decision makers’ rankings.

According to the completeness of preference information provided by decision makers, the group ranking problem can be roughly classified into two major approaches, the total ranking approach and the partial ranking approach. The former needs individuals to appraise all alternatives, while the latter only a subset of alternatives. Moreover, the compromised results of these approaches can be divided into two types: a full order and a partial order. Basically, requesting a full order result is a common goal in both the total ranking approach and the partial ranking approach. However, quite a few works in the partial ranking approach generate partial order results [9], [10], [16], [18], because when the provided preference information was too fragmented, the final result was hard to consolidate [9]. Each approach has its own advantages and has been successfully used in many applications. In this paper, we follow the line of the total ranking approach and propose a new group ranking algorithm which can generate partial order compromised results.

Roughly speaking, the goal of most total ranking methods is to determine a full ordering list of items that expresses the consensus achieved among a group of decision makers. Therefore, the advantage of these researches is that no matter how much users’ preferences conflict, an ordering list of all items to represent the consensus is always produced. Unfortunately, this advantage is also a disadvantage, because when there is no consensus or only slight consensus on items’ rankings, the previous approach still generates a total ordering list using their ranking algorithms. In such a situation, what we obtain is really not a consensus list, but merely the output of algorithms. Relying on this list to make any decision would be risky.

In the following, we use an example to explain the situation. Assume that we have four users U1, U2, U3, and U4 and five proposals A, B, C, D, and E. Table 1 shows the users’ ranking lists of the proposals, where “X > Y” means X is preferable to Y, and “X  Y” means X is at least as good as Y.

As seen in Table 1, all users agreed with the relationship {A > D > E}, but there is a conflict of opinion on proposals {A, C}, {B, C}, and {C, D}. In this situation, different algorithms will produce different results for these conflict items. For example, one algorithm may generate list {A > B > D > C > E}, but another may generate {C  A > B > D > E}. It is unwise to use either list to make decisions, because users only have consensus on {A > D > E} while the other parts are actually determined by the algorithm’s design.

This example indicates that we may have consensus on some items and conflicts on other items. In such situations, consensus decision-making theory stresses that we need a process to achieve the most agreeable decision among participants [25]. To support this decision process, we must have an algorithm that can find the maximum consensus lists from users’ ranking data and also identify conflict items that need further negotiation.

Traditionally, there are three formats to express users’ preferences about items in the total ranking approach. These formats include weights/scores of items, set of pairwise comparisons on the items and ranking lists of items. In this paper, we assume that an individual’s preferences can be represented as a total ranking list. An ordering sequence of items is called a consensus if a majority of users agree on this ordering and only a minority of users disagrees. Accordingly, we propose an algorithm to discover maximum consensus sequences from users’ ranking lists, where the maximum consensus sequences represent the maximum possible consensuses that can be achieved among most of users. By applying our algorithm to the above example, we can generate two types of results: the maximum consensus sequences, such as {A > D > E} in the above example, and the conflict items where users have no consensus.

Our approach has the following advantages in supporting group decision making:

  • The majority preference can be met by the maximum consensus sequence proposed in this study. The degree of majority is adjustable.

  • Users may have different opinions on items. During the decision process, users need to negotiate and resolve the objections of the minority in order to achieve the most agreements. Our algorithm can identify the conflict items that need further negotiation.

  • Most previous research consolidated all users’ preferences into a total ranking list. This may suppress or ignore the opinions of the minority and could make the achieved decision difficult to implement. In our approach, we find only the maximum agreeable consensus from users, rather than forcefully producing a total ranking list for all users.

In the next section, we briefly review previous research about group ranking. Then, the problem definition and the proposed concept, consensus sequence, are described in detail. In the methodological section, an algorithm is established to find the maximum consensus sequences. In addition, a series of experiments is conducted to analyze the proposed method. After the experimental section, we will discuss how to apply the consensuses sequence in practice. Finally, we summarize some valuable and constructive conclusions.

Section snippets

Sequential pattern mining

Agrawal and Srikant first introduced the problem of mining sequential patterns and proposed algorithms AprioriAll [26]. The basic idea of this approach can be stated as follows. We are given a set of sequences, called data-sequences, as the input data. Each data-sequence is a list of transactions, where each transaction contains a set of literals, called items. This research aims to find all of the subsequences whose ratios of appearance exceed the user-specified minimum support threshold. For

Problem definition

In this section, we formally define the problem of mining consensus sequences. Let U = {u1,u2,  ,um} and I = {i1,i2,   ,in} denote all users and the set of all distinct items, respectively. Each user uk has a ranked list of items to express his or her preferences. Let Sk be a user sequence that denotes the total ranking list of user uk. This can be represented as follows:Sk={ia1ia2ian}iariasifrs,in which iajI, 1  aj  n and ⊕  {>, ⩾, =}.

Each user sequence must satisfy three conditions. First, no

Methodology

We propose an algorithm, the MCS algorithm, for mining maximum consensus sequences from all users’ ranking list data. Parameters cmp_minsup and cf_maxsup, defined in Definition 8, are used to filter out maximum consensus sequences. A sequence with length k is referred to as a k-sequence. Let Lk denote the set of all k-consensus sequences, and Ck the set of candidate k-consensus sequences. The MCS algorithm which has three major steps is shown below. To help readers understand the process of the

Experimental results

In this section, we perform a simulation study to evaluate the efficiency and effectiveness of the proposed algorithms MCS. Synthetic data sets are generated to compare the run time and scalability. The proposed algorithm is implemented in Java language and runs on an Intel Celeron 2.6 GHz PC with 2 gigabytes of main memory running the Windows 2000 operating system.

Discussion

This section includes two subsections to illustrate how the identified consensus sequences and conflict items information can be used in practice. In the first subsection, we discuss how the consensus mining approach can find various interesting patterns by setting different minimum comply and maximum conflict supports. In the second subsection, we use a small numerical example to demonstrate a decision process which can help a group of decision makers to reach consensus on the ranking of all

Conclusion

Most existing methods in the group ranking problem focus on generating a total ordering list from users’ ranking data. Unfortunately, when users have no consensus or only little consensus on items, this approach would mislead decision makers. Therefore, this work rejected the forced agreement of all items. Instead, we defined a new concept, maximum consensus sequences. The proposed algorithm, MCS, is developed to find the maximum consensus sequences from users’ ranking data and identified

Acknowledgements

It is our pleasure to acknowledge the anonymous reviewers for their valuable suggestions and the careful reading of our manuscript. The authors would like to express our gratitude to these reviewers for their suggestions that helped to substantially improve our paper.

References (27)

  • K. Bogart

    Preference structures I: Distances between transitive preference relations

    Journal of Math Sociology

    (1973)
  • K. Bogart

    Preference structures II: Distances between asymmetric relations

    SIAM Journal of Applied Mathematics

    (1975)
  • W. Cohen

    Learning to order things

    The Journal of Artificial Intelligence Research

    (1999)
  • Cited by (43)

    • Mining consensus sequence in multi-criteria group decision making with incomparability of alternatives and conflicts of experts

      2022, Information Sciences
      Citation Excerpt :

      It is hard to make a uniform definition. For example, Chen and Cheng [5] defined that a strict preference relation and an indifference relation are in a disagreement while Ma [20] did not define so. After the definition, an opinion that agrees with the majority and disagrees only with the minority was output as the group result, where “majority” and “minority” were specified by preset thresholds.

    View all citing articles on Scopus
    View full text