skip to main content
10.1145/3417990.3422004acmconferencesArticle/Chapter ViewAbstractPublication PagesmodelsConference Proceedingsconference-collections
demonstration

User-centred tooling for modelling of big data applications

Published: 26 October 2020 Publication History

Abstract

We outline the key requirements for a Big Data modelling recommender tool. Our web-based tool is suitable for capturing system requirements in big data analytics applications involving diverse stakeholders. It promotes awareness of the datasets and algorithm implementations that are available to leverage in the design of the solution. We implement these ideas in BiDaML-web, a proof of concept recommender system for Big Data applications, and evaluate the tool using an empirical study with a group of 16 target end-users. Participants found the integrated recommender and technique suggestion tools helpful and highly rated the overall BiDaML web-based modelling experience. BiDaML-web is available at https://bidaml.web.app/ and the source code can be accessed at https://github.com/tarunverma23/bidaml.

References

[1]
Iman Avazpour, Teerat Pitakrat, Lars Grunske, and John Grundy. 2014. Dimensions and metrics for evaluating recommendation systems. In Recommendation systems in software engineering. Springer, 245--273.
[2]
Tim Dwyer, Kim Marriott, and Michael Wybrow. 2008. Dunnart: A constraint-based network diagram authoring tool. In International Symposium on Graph Drawing. Springer, 420--431.
[3]
Hourieh Khalajzadeh, Mohamed Abdelrazek, John Grundy, John Hosking, and Qiang He. 2018. A survey of current end-user data analytics tool support. In 2018 IEEE International Congress on Big Data (BigData Congress). IEEE, 41--48.
[4]
Hourieh Khalajzadeh, Mohamed Abdelrazek, John Grundy, John Hosking, and Qiang He. 2019. BiDaML: A Suite of Visual Languages for Supporting End-User Data Analytics. In 2019 IEEE International Congress on Big Data (BigDataCongress). IEEE, 93--97.
[5]
Hourieh Khalajzadeh, Mohamed Abdelrazek, John Grundy, John G Hosking, and Qiang He. 2019. Survey and Analysis of Current End-user Data Analytics Tool Support. IEEE Transactions on Big Data (2019).
[6]
Hourieh Khalajzadeh, Andrew Simmons, Mohamed Abdelrazek, John Grundy, John Hosking, and Qiang He. 2020. An End-to-End Model-based Approach to Support Big Data Analytics Development. Journal of Computer Languages (2020), 100964.
[7]
Miryung Kim, Thomas Zimmermann, Robert DeLine, and Andrew Begel. 2017. Data scientists in software teams: State of the art and challenges. IEEE Transactions on Software Engineering 44, 11 (2017), 1024--1038.
[8]
Sara Landset, Taghi M Khoshgoftaar, Aaron N Richter, and Tawfiq Hasanin. 2015. A survey of open source tools for machine learning with big data in the Hadoop ecosystem. Journal of Big Data 2, 1 (2015), 24.
[9]
Stephann Makri, Ann Blandford, Mel Woods, Sarah Sharples, and Deborah Maxwell. 2014. "Making my own luck": Serendipity strategies and how to support them in digital information environments. Journal of the Association for Information Science and Technology 65, 11 (2014), 2179--2194.
[10]
Daniel Moody. 2009. The "physics" of notations: toward a scientific basis for constructing visual notations in software engineering. IEEE Transactions on software engineering 35, 6 (2009), 756--779.
[11]
Ivens Portugal, Paulo Alencar, and Donald Cowan. 2016. A preliminary survey on domain-specific languages for machine learning in big data. In 2016 IEEE International Conference on Software Science, Technology and Engineering (SWSTE). IEEE, 108--110.
[12]
Carlton E Sapp. 2017. Preparing and architecting for machine learning. Gartner Technical Professional Advice (2017), 1--37.
[13]
David Sculley, Gary Holt, Daniel Golovin, Eugene Davydov, Todd Phillips, Dietmar Ebner, Vinay Chaudhary, Michael Young, Jean-Francois Crespo, and Dan Dennison. 2015. Hidden technical debt in machine learning systems. In Advances in neural information processing systems. 2503--2511.
[14]
Juha-Pekka Tolvanen and Matti Rossi. 2003. MetaEdit+ defining and using domain-specific modeling languages and code generators. In Companion of the 18th annual ACM SIGPLAN conference on Object-oriented programming, systems, languages, and applications. 92--93.
[15]
Amy X Zhang, Michael Muller, and Dakuo Wang. 2020. How do Data Science Workers Collaborate? Roles, Workflows, and Tools. Proc. ACM Hum.-Comput. Interact. 4, CSCW1 (2020).

Cited By

View all
  • (2022)VisionProceedings of the 25th International Conference on Model Driven Engineering Languages and Systems: Companion Proceedings10.1145/3550356.3559100(929-933)Online publication date: 23-Oct-2022
  • (2021)BiDaML in Practice: Collaborative Modeling of Big Data Analytics Application RequirementsEvaluation of Novel Approaches to Software Engineering10.1007/978-3-030-70006-5_5(106-129)Online publication date: 27-Feb-2021

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
MODELS '20: Proceedings of the 23rd ACM/IEEE International Conference on Model Driven Engineering Languages and Systems: Companion Proceedings
October 2020
713 pages
ISBN:9781450381352
DOI:10.1145/3417990
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Sponsors

In-Cooperation

  • IEEE CS

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 26 October 2020

Check for updates

Author Tags

  1. BiDaML
  2. big data applications
  3. recommender

Qualifiers

  • Demonstration

Conference

MODELS '20
Sponsor:

Acceptance Rates

Overall Acceptance Rate 144 of 506 submissions, 28%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)11
  • Downloads (Last 6 weeks)1
Reflects downloads up to 02 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2022)VisionProceedings of the 25th International Conference on Model Driven Engineering Languages and Systems: Companion Proceedings10.1145/3550356.3559100(929-933)Online publication date: 23-Oct-2022
  • (2021)BiDaML in Practice: Collaborative Modeling of Big Data Analytics Application RequirementsEvaluation of Novel Approaches to Software Engineering10.1007/978-3-030-70006-5_5(106-129)Online publication date: 27-Feb-2021

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media