skip to main content
10.1145/1142473.1142547acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
Article

DADA: a data cube for dominant relationship analysis

Published: 27 June 2006 Publication History

Abstract

The concept of dominance has recently attracted much interest in the context of skyline computation. Given an N-dimensional data set S, a point p is said to dominate q if p is better than q in at least one dimension and equal to or better than it in the remaining dimensions. In this paper, we propose extending the concept of dominance for business analysis from a microeconomic perspective. More specifically, we propose a new form of analysis, called Dominant Relationship Analysis (DRA), which aims to provide insight into the dominant relationships between products and potential buyers. By analyzing such relationships, companies can position their products more effectively while remaining profitable.To support DRA, we propose a novel data cube called DADA (Data Cube for Dominant Relationship Analysis), which captures the dominant relationships between products and customers. Three types of queries called Dominant Relationship Queries (DRQs) are consequently proposed for analysis purposes: 1)Linear Optimization Queries (LOQ), 2)Subspace Analysis Queries (SAQ), and 3)Comparative Dominant Queries (CDQ). Algorithms are designed for efficient computation of DADA and answering the DRQs using DADA. Results of our comprehensive experiments show the effectiveness and efficiency of DADA and its associated query processing strategies.

References

[1]
{1} S. Agarwal, R. Agrawal, P. Deshpande, A. Gupta, J. Naughton, R. Ramakrishnan, and S. Sarawagi. On the Computation of Multidimensional Aggregates. In VLDB, pages 506-521, 1996.
[2]
{2} D. A. K. Alexander Hinneburg. Optimal grid-clustering: Towards breaking the curse of dimensionality in high-dimensional clustering. In VLDB, 1999.
[3]
{3} K. S. Beyer and R. Ramakrishnan. Bottom-up computation of sparse and iceberg cubes. In SIGMOD 1999, Proceedings ACM SIGMOD International Conference on Management of Data, June 1-3, 1999, Philadelphia, Pennsylvania, USA, pages 359-370, 1999.
[4]
{4} G. Birkhoff. Lattice Theory. American Mathematical Society Colloquium Publications, Rhode Island, 1973.
[5]
{5} S. Börzsönyi, D. Kossmann, and K. Stocker. The skyline operator. In ICDE, 2001.
[6]
{6} T. Brijs, G. Swinnen, K. Vanhoof, and G. Wets. Using association rules for product assortment decisions: A case study. In KDD, pages 254-260, 1999.
[7]
{7} C. Y. Chan, H. V. Jagadish, K.-L. Tan, A. K. H. Tung, and Z. Zhang. Finding k-dominant skyline in high dimensional space. In ACM SIGMOD, 2006.
[8]
{8} C. Y. Chan, H. V. Jagadish, K.-L. Tan, A. K. H. Tung, and Z. Zhang. On high dimensional skylines. In EDBT, pages 478-495, 2006.
[9]
{9} Q. Chen, M. Hsu, and U. Dayal. A data-warehouse/OLAP framework for scalable telecommunication tandem traffic analysis. In ICDE, pages 201-210, 2000.
[10]
{10} B. Davey and H. Priestley. Introduction to Lattices and Order. Cambridge University Press, 1990.
[11]
{11} M. Ester, R. Ge, W. Jin, and Z. Hu. A microeconomic data mining problem: customer-oriented catalog segmentation. In KDD, pages 557-562, 2004.
[12]
{12} R. Godin, R. Missaoui, and H. Alaoui. Incremental concept formation algorithms based on galois (concept) lattices. Computational Intelligence, 11:246-267, 1995.
[13]
{13} J. Gray, A. Bosworth, A. Layman, and H. Pirahesh. Data cube: A relational aggregation operator generalizing group-by, cross-tab, and sub-total. In ICDE, pages 152-159, 1996.
[14]
{14} J. Gray, S. Chaudhuri, A. Bosworth, A. Layman, D. Reichart, M. Venkatrao, F. Pellow, and H. Pirahesh. Data cube: A relational aggregation operator generalizing group-by, cross-tab, and sub totals. Data Min. Knowl. Discov., 1(1):29-53, 1997.
[15]
{15} J. Han. Olap mining: Integration of olap with data mining. In Database Semantics-7, pages 3-20, 1997.
[16]
{16} V. Harinarayan, A. Rajaraman, and J. Ullman. Implementing data cubes efficiently. In ACM SIGMOD, pages 205-216, 1996.
[17]
{17} C.-T. Ho, R. Agrawal, N. Megiddo, and R. Srikant. Range queries in olap data cubes. In SIGMOD Conference, pages 73-88, 1997.
[18]
{18} J. Kleinberg, C. Papadimitriou, and P. Raghavan. Segmentation problems. In STOC, 1998.
[19]
{19} J. Kleinberg, C. Papadimitriou, and P. Raghavan. A microeconomic view of data mining. In Data Min. Knowl. Discov., 2(4): 311-322, 1998.
[20]
{20} D. Kossmann, F. Ramsak, and S. Rost. Shooting stars in the sky: An online algorithm for skyline queries. In VLDB, 2002.
[21]
{21} C. Li, G. Cong, A. K. H. Tung, and S. Wang. Incremental maintenance of quotient cube for median. In KDD, pages 226-235, New York, NY, USA, 2004. ACM Press.
[22]
{22} D. Papadias, Y. Tao, G. Fu, and B. Seeger. An optimal and progressive algorithm for skyline queries. In SIGMOD, 2003.
[23]
{23} K. Ross and D. Srivastava. Fast Computation of Sparse Datacubes. In VLDB, pages 116-125, 1997.
[24]
{24} N. Roussopoulos, Y. Kotidis, and M. Roussopoulos. Cubetree: organization of and bulk incremental updates on the data cube. In ACM SIGMOD, pages 89-99, 1997.
[25]
{25} Y. Sismanis, A. Deligiannakis, N. Roussopoulos, and Y. Kotidis. Dwarf: shrinking the petacube. In SIGMOD Conference, pages 464-475, 2002.
[26]
{26} K. L. Tan, P. K. Eng, and B. C. Ooi. Efficient progressive skyline computation. In VLDB, 2001.
[27]
{27} K. Wang, S. Zhou, and J. Han. Profit mining: From patterns to actions. In EDBT, pages 70-87, 2002.
[28]
{28} R. C.-W. Wong, A. W.-C. Fu, and K. Wang. Mpis: Maximal-profit item selection with cross-selling considerations. In ICDM, pages 371-378, 2003.
[29]
{29} J. T. Yao. Sensitivity analysis for data mining. In Proceedings of The 22nd International Conference of NAFIPS (the North American Fuzzy Information Processing Society), pages 272-277, 2003.
[30]
{30} Y. Yuan, X. Lin, Q. Liu, W. Wang, J. X. Yu, and Q. Zhang. Efficient computation of skyline cube. In VLDB, pages 241-252, 2005.
[31]
{31} Z. Zhang, X. Guo, H. Lu, A. K. H. Tung, and N. Wang. Discovering strong skyline points in high dimensional spaces. In CIKM, pages 247-248, 2005.

Cited By

View all
  • (2023)Strategies to Selecting Most Profitable Products by Price SettingsInternational Journal of Information Technology & Decision Making10.1142/S021962202350043823:02(885-915)Online publication date: 17-May-2023
  • (2023)Quantifying the competitiveness of a dataset in relation to general preferencesThe VLDB Journal10.1007/s00778-023-00804-133:1(231-250)Online publication date: 8-Aug-2023
  • (2022)Mining Top-K Competitors by Eliminating the K-Least Items from Unstructured DatasetInnovations in Electronics and Communication Engineering10.1007/978-981-16-8512-5_54(505-514)Online publication date: 13-Mar-2022
  • Show More Cited By

Index Terms

  1. DADA: a data cube for dominant relationship analysis

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SIGMOD '06: Proceedings of the 2006 ACM SIGMOD international conference on Management of data
    June 2006
    830 pages
    ISBN:1595934340
    DOI:10.1145/1142473
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 27 June 2006

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. data cube
    2. dominant relationship analysis
    3. microeconomic data mining
    4. skyline

    Qualifiers

    • Article

    Conference

    SIGMOD/PODS06
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 785 of 4,003 submissions, 20%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)16
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 01 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Strategies to Selecting Most Profitable Products by Price SettingsInternational Journal of Information Technology & Decision Making10.1142/S021962202350043823:02(885-915)Online publication date: 17-May-2023
    • (2023)Quantifying the competitiveness of a dataset in relation to general preferencesThe VLDB Journal10.1007/s00778-023-00804-133:1(231-250)Online publication date: 8-Aug-2023
    • (2022)Mining Top-K Competitors by Eliminating the K-Least Items from Unstructured DatasetInnovations in Electronics and Communication Engineering10.1007/978-981-16-8512-5_54(505-514)Online publication date: 13-Mar-2022
    • (2022)Event Cube for Suicidal Event Analysis: A Case StudyWeb Information Systems Engineering – WISE 202110.1007/978-3-030-90888-1_39(512-526)Online publication date: 1-Jan-2022
    • (2021)On m-Impact Regions and Standing Top-k Influence ProblemsProceedings of the 2021 International Conference on Management of Data10.1145/3448016.3452832(1784-1796)Online publication date: 9-Jun-2021
    • (2021)K-truss community most favorites query based on top-tWorld Wide Web10.1007/s11280-021-00947-7Online publication date: 22-Oct-2021
    • (2021)Finding the most profitable candidate product by dynamic skyline and parallel processingDistributed and Parallel Databases10.1007/s10619-021-07323-4Online publication date: 18-Mar-2021
    • (2020)Efficient column-oriented processing for mutual subspace skyline queriesSoft Computing10.1007/s00500-020-04875-yOnline publication date: 21-Mar-2020
    • (2019)Creating top ranking options in the continuous option and preference spaceProceedings of the VLDB Endowment10.14778/3339490.333950012:10(1181-1194)Online publication date: 1-Jun-2019
    • (2019)Privacy-Aware Set-based Information Analysis2019 International Conference on Sustainable Information Engineering and Technology (SIET)10.1109/SIET48054.2019.8986065(1-4)Online publication date: Sep-2019
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media