Skip to main content
Log in

The effect of the number of concepts on the readability of schemas: an empirical study with data models

  • Original Article
  • Published:
Requirements Engineering Aims and scope Submit manuscript

Abstract

The number of concepts in a model has been frequently used in the literature to measure the ease of use in creating model schemas. However, to the best of our knowledge, nobody has looked at its effect on the readability of the model schemas after they have been created. The readability of a model schema is important in situations where the schemas are created by one team of analysts and read by other analysts, system developers, or maintenance administrators. Given the recent trend of models with increasing numbers of concepts such as the unified modeling language (UML), the effect of the number of concepts (NOC) on the readability of schemas has become increasingly important. In this work, we operationalize readability along three dimensions: effectiveness, efficiency, and learnability. We draw on the Bunge Wand Weber (BWW) framework, as well as the signal detection recognition theory and the ACT theory from cognitive psychology to formulate hypotheses and conduct an experiment to study the effects of the NOC in a data model on these readability dimensions. Our work makes the following contributions: (a) it extends the operationalization of the readability construct, and (b) unlike earlier empirical work that has focused exclusively on comparing models that differ along several dimensions, this work proposes an empirical methodology that isolates the effect of a model-independent variable (the NOC) on readability. From a practical perspective, our findings have implications both for creators of new models, as well as for practitioners who use currently available models for creating schemas to communicate requirements during the entire lifecycle of a system.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1

Similar content being viewed by others

Notes

  1. In this work, the terms “conceptual model” or “model” refer to the modeling method. We refer to the application of a modeling method for a particular situation as a “model schema”.

References

  1. Amberg MA (1996) A pattern oriented approach to a methodical evaluation of modeling methods. Aust J Inf Syst 4(1):3–10

    Google Scholar 

  2. Anderson JR (1978) Arguments concerning representations for mental imagery. Psychol Rev 85:249–277

    Article  Google Scholar 

  3. Anderson JR (1995) Cognitive psychology and its implications, 5th edn. W.H. Freeman, New York

    Google Scholar 

  4. Bajaj A, Ram S (1996) A content specification for business process models. Aust J Inf Syst 4(1):22–31

    Article  Google Scholar 

  5. Batra D, Srinivasan A (1992) A review and analysis of the usability of data modeling environments. Int J Man Mach Stud 36:395–417

    Article  Google Scholar 

  6. Bock D, Ryan T (1993) Accuracy in modeling with extended entity relationship and O–O data models. J Database Manag 4(4):30–39

    Google Scholar 

  7. Booch G (1994) Object oriented analysis and design with applications. Benjamin/Cummings, Redwood City

    Google Scholar 

  8. Booch G, Ivar J, James R (1997) UML Distilled. Addison Wesley, Reading

    Google Scholar 

  9. Brosey M, Schneiderman B (1978) Two experimental comparisons of relational and hierarchical database models. Int J Man Mach Stud 10:625–637

    Google Scholar 

  10. Castellini X (1998) Evaluation of models defined with charts of concepts: application to the UML model. In: Paper read at third workshop on the evaluation of modeling methods in systems analysis and design, in concjunction with CAiSE, at Pisa

  11. Chen PP (1976) The entity-relationship model: towards a unified model of data. ACM Trans Database Syst 1(1):9–36

    Article  Google Scholar 

  12. Daniel K, Hirshleifer D, Subrahmanyan A (1998) Investor psychology and security marlet under- and over-confidence. J Finance 53:1839–1885

    Article  Google Scholar 

  13. Egan JP (1958) Recognition memory and the operating characteristics. Indiana University, Bloomington

    Google Scholar 

  14. Fromkin HL, Streufert S (1976) Laboratory experimentation. In: Dunnette MD (ed) Handbook of industrial psychology. Rand-Mcnally, Chicago

    Google Scholar 

  15. Gemino A, Wand Y (2001) Towards common dimensions in empirical comparisons of conceptual modeling techniques. In: Paper read at 7th CAiSE/IFIP-WG8.1 international workshop on the evaluation of modeling methods in systems analysis and design, Toronto

  16. Hardgrave BC, Dalal N (1995) Comparing object oriented and extended entity relationship models. J Database Manag 6(3):15–21

    Google Scholar 

  17. Juhn S, Naumann JD (1985) The effectiveness of data representation characteristics on user validation. In: Paper read at international conference on information systems, Indianopolis

  18. Kahnemann D, Slovic P, Tversky A (1982) Judgment under uncertainty: heuristics and biases. Cambridge University Press, London

    Google Scholar 

  19. Kim Y-G, March SE (1995) Comparing data modeling formalisms. Commun ACM 38(6):103–113

    Article  Google Scholar 

  20. Kramer B, Luqi (1991) Towards former models of software engineering processes. J Syst Softw 15:63–74

    Article  Google Scholar 

  21. Long De BJ, Shleifer A, Summers LH, Waldmann R (1990) Noise trader risk in financial markets. J Polit Econ 98:703–738

    Article  Google Scholar 

  22. Mantha RW (1987) Data flow and data structure modeling for database requirements determination: a comparative study. MIS Q December:531–545

    Google Scholar 

  23. Marcos E, Cervera J, Fernandez L (1999) Evaluation of data models: a complexity metric. In: Paper read at 4th Caise/IFIP 8.1 international workshop on evaluation of modeling methods in systems analysis and design, Heidelberg

  24. Moynihan A (1996) An attempt to compare OO and functional decomposition in communicating information system functionality to users. In: Paper read at workshop on evaluation of modeling methods in systems analysis and design, CAiSE, Heraklion, Crete, 20–24 May 1996

  25. Nielsen J (1993) Usability engineering. Academic, New York

    Google Scholar 

  26. Olle TW (1986) In: Proceedings of the IFIP WG 8.1 working conference on the comparative review of ISD methodologies: improving the practice, Borth Holland

    CAS  PubMed  Google Scholar 

  27. Palvia PC, Liao C, To PL (1992) The impact of conceptual models on end-user performance. J Database Manag 3(4):4–15

    Google Scholar 

  28. Pearson PD, Johnson DD (1978) Teaching reading comprehension. Holt, New York

    Google Scholar 

  29. Peleg M, Dori D (2000) The model multiplicity problem: experimenting with real time specification methods. IEEE Trans Softw Eng 26(6):1–18

    Article  Google Scholar 

  30. Reed SK (1988) Cognition theory and applications. Brooks/Cole Publishing, Belmont

    Google Scholar 

  31. Reeves WW (1996) Cognition and complexity: the cognitive science of managing complexity. Scarecrow Press, Lanham

    Google Scholar 

  32. Rossi M, Brinkkemper S (1996) Complexity metrics for systems development methods and techniques. Inf Syst 21(2):209–227

    Article  Google Scholar 

  33. Shneiderman B (1998) Designing the user interface, 3rd edn. Addison Wesley, Reading

    Google Scholar 

  34. Shoval P, Even-Chaime M (1987) Database schema design: an experimental comparison between normalization and information analysis. Database 18(3):30–39

    Google Scholar 

  35. Shoval P, Frummerman I (1994) OO and EER schemas: a comparison of user comprehension. J Database Manag 5(4):28–38

    Google Scholar 

  36. Siau K, Cao Q (2001) Unified modeling language (UML)—A complexity analysis. J Database Manag Jan–Mar:26–34

    Google Scholar 

  37. Svensson O (1981) Are we all less risky and more skilful than our fellow drivers? Acta Psychologica 47:143–148

    Article  Google Scholar 

  38. Topi H, Ramesh V (2002) Human factors on research on data modeling: a review of prior research, an extended framework and future research directions. J Database Manag 13(2):3–19

    Google Scholar 

  39. Wand Y, Weber R (1995) On the deep structure of information systems. Inf Syst J 5:203–223

    Google Scholar 

  40. Wand Y, Weber R (2002) Information systems and conceptual modeling: a research agenda. Inf Syst Res 13(4):363–376

    Google Scholar 

  41. Weber R (1997) Ontological foundations of information systems. Coopers and Lybrand, Melbourne

    Google Scholar 

  42. Wonnacott TH, Wonnacott RJ (1984) Introductory statistics for business and economics, 3rd edn. Wiley, New York

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Akhilesh Bajaj.

Appendix

Appendix

Figure 2 shows the library schema for the base line NOC model. Figure 3 shows the library schema for the higher level NOC model. The questionnaire following Figure 2 was used to measure the mapping from the schema to the underlying domain.

Fig. 2
figure 2

Base line NOC model library schema

Fig. 3
figure 3

Higher level NOC model library schema

1.1 Questionnaire to test mapping to the underlying domain

For each question, please select the right choice (only one choice per question). Our answers should not be based on actual library systems, but on what is represented in the model schema:

1. Every book needs to have at least one subject area

_____True

_____False

_____Can’t tell from the model schema

  

2. Users can reserve and checkout the same book at the same time

_____True

_____False

_____Can’t tell from the model schema

  

3. An author can write books in multiple subject areas

_____True

_____False

_____Can’t tell from the model schema

  

4. A book can be on multiple shelves at the same time

_____True

_____False

_____Can’t tell from the model schema

  

5. A reading area can be near multiple shelves

_____True

_____False

_____Can’t tell from the model schema

  

6. From the schema, we can determine which user is sitting on which chair in the library

_____True

_____False

_____Can’t tell from the model schema

  

7. We can find out the number of books checked out by the user in a year

_____True

_____False

_____Can’t tell from the model schema

  

8. A book can have multiple checkouts on the same date

_____True

_____False

_____Can’t tell from the model schema

  

9. We can tell which users are interested in which subject areas

_____True

_____False

_____Can’t tell from the model schema

  

10. We can tell which author has the most checkouts

_____True

_____False

_____Can’t tell from the model schema

Rights and permissions

Reprints and permissions

About this article

Cite this article

Bajaj, A. The effect of the number of concepts on the readability of schemas: an empirical study with data models. Requirements Eng 9, 261–270 (2004). https://doi.org/10.1007/s00766-004-0202-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00766-004-0202-8

Keywords

Navigation