Abstract
Creating a new product that dominates all its competitors is one of the main objectives in marketing. Nevertheless, this might not be feasible since in practice the development process is confined by some constraints, e.g., limited funding or low target selling price. We model these constraints by a constraint function, which determines the feasible characteristics of a new product. Given such a budget, our task is to decide the best possible features of the new product that maximize its profitability. In general, a product is marketable if it dominates a large set of existing products, while it is not dominated by many. Based on this, we define dominance relationship analysis and use it to measure the profitability of the new product. The decision problem is then modeled as a budget constrained optimization query (BOQ). Computing BOQ is challenging due to the exponential increase in the search space with dimensionality. We propose a divide-and-conquer based framework, which outperforms a baseline approach in terms of not only execution time but also space complexity. Based on the proposed framework, we further study an approximation solution, which provides a good trade-off between computation cost and quality of result.
Similar content being viewed by others
Notes
\(C(x)\) can be a piecewise function. The solution of piecewise constraint functions is discussed in Appendix B.
Based on http://www.cpubenchmark.net/cpu_list.php (Nov 2011).
Collected from http://www.compusa.com (Nov 2011).
Tight bounds cannot be derived unless we know the locations of the remaining objects. This algorithm refines MBRs by accessing the objects one-by-one; tight bounds for all finest MBRs will be established eventually after having accessed all objects.
References
Beckmann N, Kriegel H-P, Schneider R, Seeger B (1990) The R*-tree: an efficient and robust access method for points and rectangles. In: SIGMOD conference, pp 322–331
Börzsönyi S, Kossmann D, Stocker K (2001) The skyline operator. In: ICDE, pp 421–430
Chomicki J, Godfrey P, Gryz J, Liang D (2003) Skyline with presorting. In: ICDE, pp 717–816
Cormen TH, Leiserson CE, Rivest RL, Stein C (2009) Introduction to algorithms, 3rd edn. The MIT Press, Cambridge
Feng H, Song G, Zheng Y, Xia J (2003) A deadline and budget constrained cost-time optimization algorithm for scheduling dependent tasks in grid computing. In: GCC (2), pp 113–120
Household dataset (2008). http://www.ipums.org/
Kossmann D, Ramsak F, Rost S (2002) Shooting stars in the sky: an online algorithm for skyline queries. In: VLDB, pp 275–286
Kung HT, Luccio F, Preparata FP (1975) On finding the maxima of a set of vectors. J ACM 22(4):469–476
Li C, Ooi BC, Tung AKH, Wang S (2006) DADA: a data cube for dominant relationship analysis. In: SIGMOD conference, pp 659–670
Li C, Tung AKH, Jin W, Ester M (2007) On dominating your neighborhood profitably. In: VLDB, pp 818–829
Lin C-Y, Koh J-L, Chen AL (2012) Determining k-most demanding products with maximum expected number of total customers. IEEE Trans Knowl Data Eng (99), 1 (preprints)
Lu H, Jensen CS (2012) Upgrading uncompetitive products economically. In: ICDE, pp 977–988
Miah M, Das G, Hristidis V, Mannila H (2008) Standing out in a crowd: selecting attributes for maximum visibility. In: ICDE, pp 356–365
NBA Basketball Statistics (2009) NBA basketball statistics http://www.databasebasketball.com/
Oliver D (2010) JoBS methods descriptions for the scientific study of basketball. Website. http://www.powerbasketball.com/theywin2.html
Papadias D, Tao Y, Fu G, Seeger B (2005) Progressive skyline computation in database systems. ACM Trans Database Syst 30(1):41–82
Papadopoulos AN, Lyritsis A, Nanopoulos A, Manolopoulos Y (2007) Domination mining and querying. In: DaWaK, pp 145–156
Peng Y, Wong RC-W, Wan Q (2012) Finding top-k preferable products. IEEE Trans Knowl Data Eng 24(10):1774–1788
Pro Basketball in USAToday (2009). http://content.usatoday.com/sports/basketball/nba/salaries/default.aspx
Pujowidianto NA, Lee LH, Chen C-H, Yap CM (2009) Optimal computing budget allocation for constrained optimization. In: Winter simulation conference, pp 584–589
Tan K-L, Eng P-K, Ooi BC (2001) Efficient progressive skyline computation. In: VLDB, pp 301–310
Wan Q, Wong RC-W, Ilyas IF, Özsu MT, Peng Y (2009) Creating competitive products. PVLDB 2(1):898–909
Wan Q, Wong RC-W, Peng Y (2011) Finding top-k profitable products. In: ICDE, pp 1055–1066
Wu T, Sun Y, Li C, Han J (2010) Region-based online promotion analysis. In: EDBT, pp 63–74
Wu T, Xin D, Mei Q, Han J (2009) Promotion analysis in multi-dimensional space. PVLDB 2(1):109–120
Yang Z, Li L, Kitsuregawa M (2008) Efficient querying relaxed dominant relationship between product Items based on rank aggregation. In: AAAI, pp 1261–1266
Yang Z, Wang B, Kitsuregawa M (2007) General dominant relationship analysis based on partial order models. In: SAC, pp 470–474
Yiu ML, Mamoulis N (2009) Multi-dimensional top-k dominating queries. VLDB J 18(3):695–718
Zhang S, Mamoulis N, Cheung DW (2009) Scalable skyline computation using object-based space partitioning. In: SIGMOD conference, pp 483–494
Zhang Y, Jia Y, Jin W (2011) Promotional subspace mining with EProbe framework. In: CIKM, pp 2185–2188
Zhang Z, Lakshmanan LVS, Tung AKH (2009) On domination game analysis for microeconomic data mining. ACM Trans Knowl Discov Data 2(4):18:1–18:27
Zhou Y, Chakrabarty D, Lukose RM (2008) Budget constrained bidding in keyword auctions and online knapsack problems. In: WWW, pp 1243–1244
Zhu L, Li C, Tung AKH, Wang S (2012) Microeconomic analysis using dominant relationship analysis. Knowl Inf Syst 30(1):179–211
Acknowledgments
This work was supported by grant HKU 714212E from Hong Kong RGC and grant MYRG109(Y1-L3)-FST12-ULH from University of Macau Research Committee.
Author information
Authors and Affiliations
Corresponding author
Appendices
Appendix A: Compact MBR computation
Given an MBR \(m\), a budget plane \(\mathcal {BP}\), and an orthogonal hyperplane, we can partition \(m\) into (at most) two compact MBRs as shown in Fig. 8a. In this section, we show how we can compute the boundaries of these new MBRs in \(O(d^2)\) time. For the sake of presentation, we assume that the hyperplane splits \(m\) into exactly two MBRs \(m_1\) and \(m_2\). Assume that \(m\) is split by hyperplane \(x[j]=v\) (i.e., parallel to dimension \(j\) at value \(v\)). Let \(m^l[i]\) (\(m^u[i]\)) be the lower (upper) bound of \(m\) in dimension \(i\). Then, two loose MBRs, whose union equals \(m\), can be defined by the following equations.
Note that the values of \(m^u_1\) and \(m^l_2\) are the tightest values already; in addition, \(m^l_1[j]\) and \(m^u_2[j]\) are also tightest. However, we can tighten the values (\(m^l_1\) and \(m^u_2\)) in other dimensions, so that \(m_1\) and \(m_2\) tightly enclose \({\mathcal {BP}}\) using the following method. To find the maximum bound within an MBR in dimension \(i\), we need to set the minimum values in all other dimensions and solve \(C(x)=B\). For example, \(m^l_1[i]\) can be computed by solving the equation \(C(x)=B\), by setting \(x[j]=m^u_1[j],\forall {j\ne i}\).
For instance, in Fig. 19, assume that we want to partition \(m\) using hyperplane y\(=0.5\). Initially, \(m\) is split into two MBRs \(m_1=\{m^u_1,m^l_1\}=\{[0,0.5],[1,1]\}\) and \(m_2=\{m^u_2,m^l_2\}=\{[0,0],[1,0.5]\}\). Then, if \(C=\frac{1}{xy}\) and \(B=0.5\), we tighten \(m^l_1[0]\) by solving \(m^l_1[0]\cdot m^u_1[1] = B\), which gives \(m^l_1[0]=0.5/0.5=0.25\). Therefore, the tightmost bound of \(m_1\) becomes \(\{[0,0.5],[0.25,1]\}\). In general, \(d-1\) boundaries can be tightened for each new MBR and each tightening requires \(O(d)\) computations, so the overall cost for tightening an MBR is \(O(d^2)\).
Appendix B: Solution of piecewise constraint functions
Even though our work does not address all non-monotonic constraint functions, our proposed techniques are able to compute GenBOQ if the constraint function \(C(x)\) is a piecewise function, which is composed of a set of finite monotonic functions. For instance,
The complete piecewise function may be non-monotonic. Figure 20 illustrates a non-monotonic piecewise function that composes of three finite monotonic functions. In case of such a function, we can partition the domain area into 9 subareas. To compute the result of GenBOQ, we first identify the relevant subareas of each finite monotonic function. For instance, the relevant areas of the red function are 2, 3, 4, 5, 6, 7, and 8 since there is no profitable region of the red function dominating or dominated by areas 1 and 9. Accordingly, we apply our algorithms to compute the GenBOQ result of the red constraint function using the data in the relevant areas. The final result can be straightforwardly produced by combining the result of each piecewise monotonic function.
Rights and permissions
About this article
Cite this article
Ge, S., U, L.H., Mamoulis, N. et al. Dominance relationship analysis with budget constraints. Knowl Inf Syst 42, 409–440 (2015). https://doi.org/10.1007/s10115-013-0694-y
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10115-013-0694-y