Skip to main content
Log in

An experimental comparison of ER and UML class diagrams for data modelling

  • Published:
Empirical Software Engineering Aims and scope Submit manuscript

Abstract

We present the results of three sets of controlled experiments aimed at analysing whether UML class diagrams are more comprehensible than ER diagrams during data models maintenance. In particular, we considered the support given by the two notations in the comprehension and interpretation of data models, comprehension of the change to perform to meet a change request, and detection of defects contained in a data model. The experiments involved university students with different levels of ability and experience. The results demonstrate that using UML class diagrams subjects achieved better comprehension levels. With regard to the support given by the two notations during maintenance activities the results demonstrate that the two notations give the same support, while in general UML class diagrams provide a better support with respect to ER diagrams during verification activities.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Notes

  1. See De Lucia et al. (2008a) for the complete material used in the three sets of controlled experiments.

  2. The rule of seven is the generally accepted claim that people can hold approximately seven chunks or units of information in their short-term memory at a time (Miller 1956).

  3. We decided to select such a threshold as it represents the median of the possible grades for any exam to be passed by a student in an Italian University (min 18/30 and max 30/30).

  4. Also in this case we decided to select such a threshold as it represents the median of the possible grades (min 60/100 and max 100/100).

  5. A longitudinal study is a research study conducted over a period of time (Wohlin et al. 2000). In our case, each controlled experiment is a longitudinal study because it was organised in two laboratory sessions.

References

  • Antoniol G, Canfora G, Casazza G, De Lucia A, Merlo E (2002) Recovering traceability links between code and documentation. IEEE Trans Softw Eng 28(10):970–983

    Article  Google Scholar 

  • Argwal R, De P, Sinha AP (1999) Comprehending object and processes models: an empirical study. IEEE Trans Softw Eng 25(4):541–556

    Article  Google Scholar 

  • Arisholm E, Sjoberg D (2004) Evaluating the effect of a delegated versus centralized control style on the maintainability of object-oriented software. IEEE Trans Softw Eng 30(8):521–534

    Article  Google Scholar 

  • Baeza-Yates R, Ribeiro-Neto B (1999) Modern information retrieval. Addison-Wesley, Reading

    Google Scholar 

  • Basili VR, Selby RW, Hutchens DH (1986) Experimentation in software engineering. IEEE Trans Softw Eng 12(7):758–773

    Google Scholar 

  • Bock D, Ryan T (1993) Accuracy in modeling with extended entity relationship and object oriented data models. J Database Manage 4(4):30–39

    Google Scholar 

  • Briand L, Labiche Y, Di Penta M, Yan-Bondoc H (2005) An experimental investigation of formality in UML-based development. IEEE Trans Softw Eng 31(10):833–849

    Article  Google Scholar 

  • Chen PP (1976) The entity-relationship model: toward a unified view of data. ACM Trans Database Syst 1(1):1–36

    Article  Google Scholar 

  • Conallen J (1999) Building web applications with UML. Addison-Wesley object technology series. Addison-Wesley, Reading

    Google Scholar 

  • Conover WJ (1998) Practical nonparametric statistics, 3rd edn. Wiley, New York

    Google Scholar 

  • Cruz-Lemus JA, Genero M, Manso ME, Piattini M (2005) Evaluating the effect of composite states on the understandability of UML statechart diagrams. In: Proceedings of 8th ACM/IEEE international conference on model driven engineering languages and systems. Springer, Montego Bay, pp 113–125

    Chapter  Google Scholar 

  • De Lucia A, Fasano F, Francese R, Tortora G (2004) ADAMS: an artefact-based process support system. In: Proceedings of 16th international conference on software engineering and knowledge engineering. KSI, Banff, pp 31–36

    Google Scholar 

  • De Lucia A, Gravino C, Oliveto R, Tortora G (2008a) An experimental comparison of ER and UML class diagrams for data modelling: experimental material. Technical report. www.sesa.dmi.unisa.it/reportUMLvsER.pdf

  • De Lucia A, Gravino C, Oliveto R, Tortora G (2008b) Assessing the support of ER and UML class diagrams during maintenance activities on data models. In: Proceedings of the 12th European conference on software maintenance and reengineering. IEEE, Athens, Greece, pp 173–182

    Chapter  Google Scholar 

  • De Lucia A, Gravino C, Oliveto R, Tortora G (2008c) Data model comprehension: an empirical comparison of ER and UML class diagrams. In: Proceedings of the 16th IEEE international conference on program comprehension. IEEE, Amsterdam, pp 93–102

    Chapter  Google Scholar 

  • Devore JL, Farnum N (1999) Applied statistics for engineers and scientists. Duxbury

  • Downs E, Clare P, Coe I (1992) Structured systems analysis and design method: application and context. Prentice Hall, Englewood Cliffs

    Google Scholar 

  • Gane C, Sarson T (1979) Structured systems analysis: tools and techniques. Prentice-Hall, Englewood Cliffs

    Google Scholar 

  • Gemino W, Wand Y (1997) Empirical comparison of objected oriented and dataflow models. In: Proceedings of international conference on information systems. ACM, Atlanta, pp 446–447

    Google Scholar 

  • Gemino A, Wand Y (2005) Complexity and clarity in conceptual modeling: comparison of mandatory and optional properties. Data Knowl Eng 55(3):301–326

    Article  Google Scholar 

  • Henderson PB (2003) Mathematical reasoning in software engineering education. Commun ACM 46(9):45–50

    Article  Google Scholar 

  • Hungerford BC, Eierman MA (2004) The communication effectiveness of system models using the UML versus structured techniques: a field experiment. American Journal of Business 20(2):35–43

    Google Scholar 

  • Juristo N, Moreno A (2001) Basics of software engineering experimentation. Kluwer Academic, Dordrecht

    MATH  Google Scholar 

  • Kitchenham B, Fry J, Linkman S (2003) The case against cross-over designs in software engineering. In: Proceedings of 11th annual international workshop on software technology and engineering practice, pp 65–67

  • Kuzniarz L, Staron M, Wholin C (2004) An empirical study on using stereotypes to improve understanding on UML models. In: Proceedings of 12th IEEE international workshop on program comprehension. IEEE CS, Bari, pp 14–23

    Chapter  Google Scholar 

  • Miller GA (1956) The magical number seven, plus or minus two: some limits on our capacity for processing information. Psychol Rev 63(2):81–97

    Article  Google Scholar 

  • Navathe SB (1992) Evolution of data modeling for databases. Commun ACM 35(9):112–123

    Article  Google Scholar 

  • OMG (2005) Object constraint language (OCL) specification, version 2.0. http://www.omg.org/technology/documents/formal/uml.htm

  • Oppenheim AN (1992) Questionnaire design, interviewing and attitude measurement. Pinter, London

    Google Scholar 

  • Otero C, Dolado JJ (2002) An initial experimental assessment of the dynamic modelling in UML. Empirical Software Engineering 7(1):27–47

    Article  MATH  Google Scholar 

  • Palvia P, Lio C, To P (1992) The impact of conceptual data models on end-user performance. J Database Manage 3(4):4–15

    Google Scholar 

  • Purchase HC, Wellanda R, McGillb M, Colpoysb L (2004) Comprehension of diagram syntax: an empirical study of entity relationship notations. Int J Human-comput Stud 61(2):187–203

    Article  Google Scholar 

  • Purchase HC, Colpoys L, McGill M, Carrington D, Britton C (2001) UML class diagram syntax: an empirical study of comprehension. In: Proceedings of Australian symposium on information visualisation. Australian Computer Society, Sydney, pp 113–120

    Google Scholar 

  • Reynoso L, Genero M, Piattini M, Manso ME (2006) Does object coupling really affect the understanding and modifying of UML expressions? In: Proceedings of 21st annual ACM symposium on applied computing. ACM, Dijon, pp 1721–1727

    Google Scholar 

  • Ricca F, Di Penta M, Torchiano M, Tonella P, Ceccato M (2007) The role of experience and ability in comprehension tasks supported by UML stereotypes. In: Proceedings of 29th international conference on software engineering. IEEE Computer Society, Minneapolis, pp 375–384

    Google Scholar 

  • Rumbaugh J, Jacobson I, Booch G (2004) Unified modeling language reference manual. Addison-Wesley, Reading

    Google Scholar 

  • Shoval P, Frumermann I (1994) OO and EER conceptual schemas: a comparison of user comprehension. J Database Manage 5(4):28–38

    Google Scholar 

  • Shoval P, Shiran S (1997) Entity-relationship and object-oriented data modeling - an experimental comparison of design quality. Data Knowl Eng 21(3):297–315

    Article  MATH  Google Scholar 

  • Torchiano M (2004) Empirical assessment of UML static object diagrams. In: Proceedings of 12th international workshop in program comprehension. IEEE Computer Society, Bari, pp 226–229

    Chapter  Google Scholar 

  • Wang S (1996) Two MIS analysis methods: an experimental comparison. J Educ Bus 61(3):136–141

    Google Scholar 

  • Wohlin C, Runeson P, Host M, Ohlsson MC, Regnell B, Wesslen A (2000) Experimentation in software engineering—an introduction. Kluwer, Deventer

    MATH  Google Scholar 

  • Zimmermann T, Weissgerber P, Diehl S, Zeller A (2005) Mining version histories to guide software changes. IEEE Trans Softw Eng 31(6):429–445

    Article  Google Scholar 

Download references

Acknowledgements

We would like to thank the anonymous reviewers for their detailed, constructive, and thoughtful comments that helped us to improve the presentation of the results in this paper. Special thanks are also due to the students who were involved in the experiment as subjects.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Rocco Oliveto.

Additional information

Editor: Erik Arisholm

The work described in this paper is supported by the project METAMORPHOS (MEthods and Tools for migrAting software systeMs towards web and service Oriented aRchitectures: exPerimental evaluation, usability, and tecHnOlogy tranSfer), funded by MiUR (Ministero dell’Università e della Ricerca) under grant PRIN-2006-2006098097.

Rights and permissions

Reprints and permissions

About this article

Cite this article

De Lucia, A., Gravino, C., Oliveto, R. et al. An experimental comparison of ER and UML class diagrams for data modelling. Empir Software Eng 15, 455–492 (2010). https://doi.org/10.1007/s10664-009-9127-7

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10664-009-9127-7

Keywords

Navigation