A Measure of the Modularisation of Sequential Software Versions Using Random Graph Theory

Arzoky, Mahir; Swift, Stephen; Counsell, Steve; Cain, James

doi:10.1007/978-3-319-14358-3_10

Part of the book series: Lecture Notes in Business Information Processing (LNBIP, volume 199)

Included in the following conference series:

International Conference on Agile Software Development

Abstract

Software module clustering is the problem of automatically partitioning the structure of a software system, using low-level dependencies in the source code, in order to understand and improve the system’s architecture. Munch, a clustering tool based on search-based software engineering techniques, was used to modularise a unique dataset of sequential source code software versions. This paper investigates whether the dataset used for the modularisation resembles a random graph by computing the probabilities of observing certain connectivity. Modularisation is not possible with data that resembles a random graph. This paper demonstrates that our real-world time-series dataset does not resemble a random graph, except for small sections where there were large maintenance activities. Furthermore, the random graph metric can be used as a tool to indicate areas of interest in the dataset, without the need to run the modularisation.
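
The idea of computing the probability of an observed connectivity under a random graph model can be illustrated with a minimal sketch. It is not the metric defined in the paper, whose details are not given in the abstract. It assumes an undirected Gilbert G(n, p) random graph without self-loops, in which each of the n(n-1)/2 possible edges is present independently with probability p, so the edge count is binomially distributed. The function names `edge_count_pmf` and `edge_count_tail` are hypothetical and chosen only for this illustration.

```python
from math import comb


def edge_count_pmf(n: int, p: float, m: int) -> float:
    """Probability of exactly m edges in an undirected G(n, p) graph.

    Assumption for this sketch: no self-loops, each possible edge is
    present independently with probability p.
    """
    pairs = n * (n - 1) // 2  # number of possible edges
    if not 0 <= m <= pairs:
        return 0.0
    return comb(pairs, m) * p**m * (1 - p) ** (pairs - m)


def edge_count_tail(n: int, p: float, m: int) -> float:
    """Probability of observing at least m edges under the same model."""
    pairs = n * (n - 1) // 2
    return sum(edge_count_pmf(n, p, k) for k in range(max(m, 0), pairs + 1))
```

For example, with n = 4 nodes and p = 0.5 there are 6 possible edges, so `edge_count_pmf(4, 0.5, 3)` evaluates to 20/64 = 0.3125. A very small tail probability for an observed edge count would suggest that the observed connectivity is unlikely to have arisen from this random model.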

Author information

Authors and Affiliations

Brunel University, Middlesex, UK
Mahir Arzoky, Stephen Swift & Steve Counsell
Quantel Limited, Newbury, UK
James Cain

Editor information

Editors and Affiliations

SINTEF, Trondheim, Norway
Torgeir Dingsøyr & Nils Brede Moe
University of Cagliari, Cagliari, Italy
Roberto Tonelli
Brunel University London, Uxbridge, UK
Steve Counsell
Free University of Bozen-Bolzano, Bolzano, Italy
Cigdem Gencel
Blekinge Institute of Technology, Karlskrona, Sweden
Kai Petersen

Cite this paper

Arzoky, M., Swift, S., Counsell, S., Cain, J. (2014). A Measure of the Modularisation of Sequential Software Versions Using Random Graph Theory. In: Dingsøyr, T., Moe, N.B., Tonelli, R., Counsell, S., Gencel, C., Petersen, K. (eds) Agile Methods. Large-Scale Development, Refactoring, Testing, and Estimation. XP 2014. Lecture Notes in Business Information Processing, vol 199. Springer, Cham. https://doi.org/10.1007/978-3-319-14358-3_10

DOI: https://doi.org/10.1007/978-3-319-14358-3_10
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-14357-6
Online ISBN: 978-3-319-14358-3
eBook Packages: Computer Science, Computer Science (R0)
