Abstract
In the big data context, decision makers usually face the problem of evaluating environmental efficiencies of a massive number of decision making units (DMUs) using the data envelopment analysis (DEA) method. However, standard implementations of the traditional DEA calculation process will consume much time when the data set is very large. To eliminate this limitation of DEA applied to big data, firstly, the slacks-based measure (SBM) model is extended considering undesirable outputs and the variable returns to scale (VRS) assumption for environmental efficiency evaluation of the DMUs. Then, an approach comprised of two algorithms is proposed for environmental efficiency evaluation when the number of DMUs is massive. The set of DMUs is partitioned into subsets, a technique which facilitates the application of a parallel computing mechanism. Algorithm 1 can be used for identifying the environment efficient DMUs in any DMU set. Further, Algorithm 2 (a parallel computing algorithm) shows how to use the proposed model and Algorithm 1 in parallel to find the environmental efficiencies of all DMUs. A simulation shows that the parallel computing design helps to significantly reduce calculation time when completing environmental efficiency evaluation tasks with large data sets, compared with using the traditional calculation processes. Finally, the proposed approach is applied to do environmental efficiency analysis of transportation systems.


Similar content being viewed by others
Notes
This mechanism is an easy way to do parallel computing. The whole efficiency evaluation task is separated into ten parts, and each computer does one part of the ten independently. Each computer contains all the DMUs but it only calculates efficiency for its assigned part of the DMU set. All the computers work concurrently and the final calculation time will be no larger than the time needed by the last computer to complete its assigned task.
References
Andersen, P., & Petersen, N. C. (1993). A procedure for ranking efficient units in data envelopment analysis. Management Science, 39(10), 1261–1264.
An, Q., Chen, H., Wu, J., & Liang, L. (2015). Measuring slacks-based efficiency for commercial banks in China by using a two-stage DEA model with undesirable output. Annals of Operations Research, 235(1), 13–35.
An, Q., Yan, H., Wu, J., & Liang, L. (2016). Internal resource waste and centralization degree in two-stage systems: An efficiency analysis. Omega, 61, 89–99.
Badau, F. (2015). Ranking trade resistance variables using data envelopment analysis. European Journal of Operational Research, 247(3), 978–986.
Bizer, C., Boncz, P., Brodie, M. L., & Erling, O. (2012). The meaningful use of big data: Four perspectives-four challenges. ACM SIGMOD Record, 40(4), 56–60.
Chang, Y. T., Zhang, N., Danao, D., & Zhang, N. (2013). Environmental efficiency analysis of transportation system in China: A non-radial DEA approach. Energy Policy, 58, 277–283.
Chang, Y. T., Park, H. S., Jeong, J. B., & Lee, J. W. (2014). Evaluating economic and environmental efficiency of global airlines: A SBM-DEA approach. Transportation Research Part D: Transport and Environment, 27, 46–50.
Charnes, A., & Cooper, W. W. (1962). Programming with linear fractional functionals. Naval Research logistics quarterly, 9(3–4), 181–186.
Charnes, A., Cooper, W. W., Lewin, A. Y., & Seiford, L. M. (Eds.). (2013). Data envelopment analysis: Theory, methodology, and applications. Berlin: Springer Science & Business Media.
Charnes, A., Cooper, W. W., & Rhodes, E. (1978). Measuring the efficiency of decision making units. European Journal of Operational Research, 2(6), 429–444.
Chen, C. M. (2009). A network-DEA model with new efficiency measures to incorporate the dynamic effect in production networks. European Journal of Operational Research, 194(3), 687–699.
Chen, C. P., & Zhang, C. Y. (2014). Data-intensive applications, challenges, techniques and technologies: A survey on Big Data. Information Sciences, 275, 314–347.
Cook, W. D., & Kress, M. (1999). Characterizing an equitable allocation of shared costs: A DEA approach. European Journal of Operational Research, 119(3), 652–661.
Cook, W. D., & Zhu, J. (2005). Allocation of shared costs among decision making units: A DEA approach. Computers & Operations Research, 32(8), 2171–2178.
Cook, W. D., Liang, L., Zha, Y., & Zhu, J. (2009). A modified super-efficiency DEA model for infeasibility. Journal of the Operational Research Society, 60(2), 276–281.
Dubey, R., & Gunasekaran, A. (2015). Education and training for successful career in Big Data and Business Analytics. Industrial and Commercial Training, 47(4), 174–181.
Dubey, R., Gunasekaran, A., Childe, S. J., Wamba, S. F., & Papadopoulos, T. (2016). The impact of big data on world-class sustainable manufacturing. The International Journal of Advanced Manufacturing Technology, 84(1), 631–645.
Dulá, J. H. (2011). An algorithm for data envelopment analysis. INFORMS Journal on Computing, 23(2), 284–296.
Du, J., Wang, J., Chen, Y., Chou, S. Y., & Zhu, J. (2014). Incorporating health outcomes in Pennsylvania hospital efficiency: An additive super-efficiency DEA approach. Annals of Operations Research, 221(1), 161–172.
Fang, L. (2015). Centralized resource allocation based on efficiency analysis for step-by-step improvement paths. Omega, 51, 24–28.
Färe, R., & Grosskopf, S. (2000). Network dea. Socio-economic Planning Sciences, 34(1), 35–49.
Golany, B., & Tamir, E. (1995). Evaluating efficiency-effectiveness-equality trade-offs: A data envelopment analysis approach. Management Science, 41(7), 1172–1184.
Johnson, J. E. (2012). Big data+ big analytics = big opportunity: Big data is dominating the strategy discussion for many financial executives. As these market dynamics continue to evolve, expectations will continue to shift about what should be disclosed, when and to whom. Financial. Executive, 28(6), 50–54.
Khodabakhshi, M., & Aryavash, K. (2014). The fair allocation of common fixed cost or revenue using DEA concept. Annals of Operations Research, 214(1), 187–194.
Lee, T., Yeo, G. T., & Thai, V. V. (2014). Environmental efficiency analysis of port cities: Slacks-based measure data envelopment analysis approach. Transport Policy, 33, 82–88.
Liang, L., Wu, J., Cook, W. D., & Zhu, J. (2008). The DEA game cross-efficiency model and its Nash equilibrium. Operations Research, 56(5), 1278–1288.
Liang, L., Yang, F., Cook, W. D., & Zhu, J. (2006). DEA models for supply chain efficiency evaluation. Annals of Operations Research, 145(1), 35–49.
Li, H., Fang, K., Yang, W., Wang, D., & Hong, X. (2013). Regional environmental efficiency evaluation in China: Analysis based on the Super-SBM model with undesirable outputs. Mathematical and Computer Modelling, 58(5), 1018–1031.
Li, L., Hao, T., & Chi, T. (2016). Evaluation on China’s forestry resources efficiency based on big data. Journal of Cleaner Production. doi:10.1016/j.jclepro.2016.02.078.
Li, Y., Lei, X., Dai, Q., & Liang, L. (2015). Performance evaluation of participating nations at the 2012 London Summer Olympics by a two-stage data envelopment analysis. European Journal of Operational Research, 243(3), 964–973.
Liu, X., Chu, J., Yin, P., & Sun, J. (2016). DEA cross-efficiency evaluation considering undesirable output and ranking priority: A case study of eco-efficiency analysis of coal-fired power plants. Journal of Cleaner Production. doi:10.1016/j.jclepro.2016.04.069.
Manyika, J., Chui, M., Brown, B., Bughin, J., Dobbs, R., Roxburgh, C., & Byers, A. H. (2011). Big data: The next frontier for innovation, competition, and productivity. http://www.mckinsey.com/business-functions/business-technology/our-insights/big-data-the-next-frontier-for-innovation. Accessed May 2011.
Mar-Molinero, C., Prior, D., Segovia, M. M., & Portillo, F. (2014). On centralized resource utilization and its reallocation by using DEA. Annals of Operations Research, 221(1), 273–283.
Moreno, P., & Lozano, S. (2014). A network DEA assessment of team efficiency in the NBA. Annals of Operations Research, 214(1), 99–124.
Ohlhorst, F. J. (2012). Big data analytics: Turning big data into big money. New York: Wiley.
Schroeck, M., Shockley, R., Smart, J., Romero-Morales, D., & Tufano, P. (2012). Analytics: The real-world use of big data. Somers: IBM Global Business Services.
Seiford, L. M., & Zhu, J. (2002). Modeling undesirable factors in efficiency evaluation. European Journal of Operational Research, 142(1), 16–20.
Sexton, T. R., Silkman, R. H., & Hogan, A. J. (1986). Data envelopment analysis: Critique and extensions. New Directions for Program Evaluation, 1986(32), 73–105.
Song, M. L., Fisher, R., Wang, J. L., & Cui, L. B. (2016). Environmental performance evaluation with big data: Theories and methods. Annals of Operations Research. doi:10.1007/s10479-016-2158-8.
Tone, K. (2001). A slacks-based measure of efficiency in data envelopment analysis. European Journal of Operational Research, 130(3), 498–509.
Tone, K. (2004). Dealing with undesirable outputs in DEA: A slacks-based measure (SBM) approach (pp. 44–45). Toronto: Presentation at NAPW III.
Tone, K., & Tsutsui, M. (2009). Network DEA: A slacks-based measure approach. European Journal of Operational Research, 197(1), 243–252.
Tone, K., & Tsutsui, M. (2010). Dynamic DEA: A slacks-based measure approach. Omega, 38(3), 145–156.
Tone, K., & Tsutsui, M. (2014). Dynamic DEA with network structure: A slacks-based measure approach. Omega, 42(1), 124–131.
Wang, K., Huang, W., Wu, J., & Liu, Y. N. (2014). Efficiency measures of the Chinese commercial banking system using an additive two-stage DEA. Omega, 44, 5–20.
Wei, Q., Zhang, J., & Zhang, X. (2000). An inverse DEA model for inputs/outputs estimate. European Journal of Operational Research, 121(1), 151–163.
Wu, J., Chu, J., Sun, J., & Zhu, Q. (2016a). DEA cross-efficiency evaluation based on Pareto improvement. European Journal of Operational Research, 248(2), 571–579.
Wu, J., Chu, J., Sun, J., Zhu, Q., & Liang, L. (2016b). Extended secondary goal models for weights selection in DEA cross-efficiency evaluation. Computers & Industrial Engineering, 93, 143–151.
Wu, J., Chu, J., Zhu, Q., Yin, P., & Liang, L. (2016c). DEA cross-efficiency evaluation based on satisfaction degree: An application to technology selection. International Journal of Production Research. doi:10.1080/00207543.2016.1148278.
Wu, J., Xiong, B., An, Q., Sun, J., & Wu, H. (2015). Total-factor energy efficiency evaluation of Chinese industry by using two-stage DEA model with shared inputs. Annals of Operations Research. doi:10.1007/s10479-015-1938-x.
Yang, H., & Pollitt, M. (2009). Incorporating both undesirable outputs and uncontrollable variables into DEA: The performance of Chinese coal-fired power plants. European Journal of Operational Research, 197(3), 1095–1105.
Zhou, Y., Xing, X., Fang, K., Liang, D., & Xu, C. (2013). Environmental efficiency analysis of power industry in China based on an entropy SBM model. Energy Policy, 57, 68–75.
Zhu, Q., Wu, J., Li, X., & Xiong, B. (2016). China’s regional natural resource allocation and utilization: A DEA-based approach in a big data environment. Journal of Cleaner Production. doi:10.1016/j.jclepro.2016.02.100.
Acknowledgments
The research is supported by the National Natural Science Funds of China (Nos. 71222106, 71110107024, 71171001, 71471001, and 71501139), Research Fund for the Doctoral Program of Higher Education of China (No. 20133402110028), Foundation for the Authors of National Excellent Doctoral Dissertation of P. R. China (No. 201279), Fundamental Research Funds for the Central Universities (No. WK2040160008), Top-Notch Young Talents Program of China, and Internet of Things Industry Development Research Base Biding Project, Nanjing University of Posts and Telecommunications (No. JDS215005).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Chu, JF., Wu, J. & Song, ML. An SBM-DEA model with parallel computing design for environmental efficiency evaluation in the big data context: a transportation system application. Ann Oper Res 270, 105–124 (2018). https://doi.org/10.1007/s10479-016-2264-7
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10479-016-2264-7