ABSTRACT
Developers continuously implement changes to meet user demands. In the context of test-driven development, before any new code is added, a test case should be written to ensure that new changes do not introduce bugs. During this process, developers and testers might adopt poor design choices, which may lead to the introduction of so-called Test Smells in the code. Test Smells are bad solutions for implementing or designing test code. We perform a broad study to investigate participants' perceptions of the presence of Test Smells. We analyze whether factors related to participants' profiles, concerning background and experience, influence their perception of Test Smells. We also analyze whether the heuristics adopted by developers influence their perceptions of the existence of Test Smells. We analyze commits of open-source projects to identify the introduction of Test Smells. Then, we conduct an empirical study with 25 participants who evaluate instances of 10 different smell types. For each Test Smell type, we analyze the agreement among participants and assess the influence of different factors on their evaluations. Altogether, more than 1,250 evaluations were made. The results indicate that participants show low agreement in detecting all 10 Test Smell types analyzed in our study. The results also suggest that factors related to background and experience do not have a consistent effect on the agreement among participants. On the other hand, the results indicate that agreement is consistently influenced by specific heuristics employed by participants. Our findings reveal that participants detect Test Smells in significantly different ways. As a consequence, these findings raise questions about the results of previous studies that do not account for the different perceptions of participants when detecting Test Smells.
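As a minimal illustration (not taken from the study itself), consider Assertion Roulette, one of the commonly studied smell types: a test method that piles up several assertions without explanatory messages, so a failure does not tell the reader which expectation broke. The sketch below, in Python's standard `unittest` framework with a hypothetical `order` fixture, contrasts a smelly test with a refactored one.

```python
import unittest


class OrderTest(unittest.TestCase):
    # Smelly: Assertion Roulette. Several unrelated assertions share one
    # test method, and none carries a message, so a failure report does
    # not indicate which expectation was violated.
    def test_order_totals(self):
        order = {"items": 3, "subtotal": 30.0, "tax": 3.0, "total": 33.0}
        self.assertEqual(order["items"], 3)
        self.assertEqual(order["subtotal"], 30.0)
        self.assertEqual(order["tax"], 3.0)
        self.assertEqual(order["total"], 33.0)

    # Refactored: one concern per test, with a message that explains
    # the expectation when it fails.
    def test_total_includes_tax(self):
        order = {"subtotal": 30.0, "tax": 3.0, "total": 33.0}
        self.assertEqual(
            order["total"],
            order["subtotal"] + order["tax"],
            "total should equal subtotal plus tax",
        )
```

Both tests pass; the difference only becomes visible when one fails, which is precisely why such smells are easy to overlook and why perceptions of them may vary between developers.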
Do you see any problem? On the Developers Perceptions in Test Smells Detection