skip to main content
10.1145/3597926.3598076acmconferencesArticle/Chapter ViewAbstractPublication PagesisstaConference Proceedingsconference-collections
research-article

What You See Is What You Get? It Is Not the Case! Detecting Misleading Icons for Mobile Applications

Published: 13 July 2023 Publication History

Abstract

With the prevalence of smartphones, people nowadays can access a wide variety of services through diverse apps. A good Graphical User Interface (GUI) can make an app more appealing and competitive in app markets. Icon widgets, as an essential part of an app’s GUI, leverage icons to visually convey their functionalities to facilitate user interactions. Whereas, designing intuitive icon widgets can be a non-trivial job. Developers should follow a series of guidelines and make appropriate choices from a plethora of possibilities. Inappropriately designed or misused icons may cause user confusion, lead to wrong operations, and even result in security risks (e.g., revenue loss and privacy leakage). To investigate the problem, we manually checked 9,075 icons of 1,111 top-ranked commercial apps from Google Play and found 640 misleading icons in 312 ( ‍28%) of these apps. This shows that misleading icons are prevalent among real-world apps, even the top ones.
Manually identifying misleading icons to improve app quality is time-consuming and laborious. In this work, we propose the first framework, IconSeer, to automatically detect misleading icons for mobile apps. Our basic idea is to find the discrepancies between the commonly perceived intentions of an icon and the actual functionality of the corresponding icon widget. IconSeer takes an Android app as input and reports potential misleading icons. It is powered by a comprehensive icon-intention mapping constructed by analyzing 268,353 icons collected from 15,571 popular Android apps in Google Play. The mapping includes 179 icon classes and 852 intention classes. Given an icon widget under analysis, IconSeer first employs a pre-trained open-set deep learning model to infer the possible icon class and the potential intentions. IconSeer then extracts developer-specified text properties of the icon widget, which indicate the widget’s actual functionality. Finally, IconSeer determines whether an icon is misleading by comparing the semantic similarity between the inferred intentions and the extracted text properties of the widget. We have evaluated IconSeer on the 1,111 Android apps with manually established ground truth. IconSeer successfully identified 1,172 inconsistencies (with an accuracy of 0.86), among which we further found 482 real misleading icons.

References

[1]
2019. Android Accessibility Guideline. https://developer.android.com/guide/
[2]
2022. 10times. https://play.google.com/store/apps/details?id=com.tentimes
[3]
2022. ABC. https://play.google.com/store/apps/details?id=com.disney.datg.videoplatforms.android.abc
[4]
2022. Accessibility. https://developer.android.com/guide/topics/ui/accessibility/apps
[5]
2022. Android Styles and Themes. https://developer.android.com/develop/ui/views/theming/themes
[6]
2022. AntennaPod. https://play.google.com/store/apps/details?id=de.danoeh.antennapod
[7]
2022. BabyCam. https://play.google.com/store/apps/details?id=com.arjonasoftware.babycam
[8]
2022. Canny edge detector. https://en.wikipedia.org/wiki/Canny_edge_detector
[9]
2022. Content Labels. https://support.google.com/accessibility/android/answer/7158690
[10]
2022. Cool DJ Club Theme. https://play.google.com/store/apps/details?id=com.ikeyboard.theme.cool.dj.club
[11]
2022. Cross-entropy loss. https://pytorch.org/docs/stable/generated/torch.nn.CrossEntropyLoss.html
[12]
2022. HD Wallpapers. https://play.google.com/store/apps/details?id=info.androidstation.hdwallpaper
[13]
2022. HP Print Service Plugin. https://play.google.com/store/apps/details?id=com.hp.android.printservice
[14]
2022. IconFont. https://www.iconfont.cn//
[15]
2022. IconSeer. https://sites.google.com/view/iconseer
[16]
2022. Layouts. https://developer.android.com/develop/ui/views/layout/declaring-layout
[17]
2022. nltk. https://www.nltk.org/
[18]
2022. PaddleOCR. https://github.com/PaddlePaddle/PaddleOCR
[19]
2022. Paint and Drawing Fun. https://play.google.com/store/apps/details?id=com.kidspaint.kaushalmehra.drawingfun
[20]
2022. Speak and Translate Languages. https://play.google.com/store/apps/details?id=com.speakandtranslate.voicetranslator.alllanguages
[21]
2022. wordninja. https://github.com/keredson/wordninja
[22]
Ian Alexander. 2000. An introduction to qualitative research. Eur. J. Inf. Syst., 9, 2 (2000), 127–128. https://doi.org/10.1057/palgrave.ejis.3000350
[23]
Abdulaziz Alshayban, Iftekhar Ahmed, and Sam Malek. 2020. Accessibility Issues in Android Apps: State of Affairs, Sentiments, and Ways Forward. In 2020 IEEE/ACM 42nd International Conference on Software Engineering (ICSE). 1323–1334.
[24]
Abhijit Bendale and Terrance E. Boult. 2016. Towards Open Set Deep Networks. In 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016, Las Vegas, NV, USA, June 27-30, 2016. IEEE Computer Society, 1563–1572. https://doi.org/10.1109/CVPR.2016.173
[25]
Chunyang Chen, Ting Su, Guozhu Meng, Zhenchang Xing, and Yang Liu. 2018. From UI design image to GUI skeleton: a neural machine translator to bootstrap mobile GUI implementation. In Proceedings of the 40th International Conference on Software Engineering, ICSE 2018, Gothenburg, Sweden, May 27 - June 03, 2018, Michel Chaudron, Ivica Crnkovic, Marsha Chechik, and Mark Harman (Eds.). ACM, 665–676. https://doi.org/10.1145/3180155.3180240
[26]
Jieshan Chen, Chunyang Chen, Zhenchang Xing, Xiwei Xu, Liming Zhu, Guoqiang Li, and Jinshui Wang. 2020. Unblind your apps: predicting natural-language labels for mobile GUI components by deep learning. In ICSE ’20: 42nd International Conference on Software Engineering, Seoul, South Korea, 27 June - 19 July, 2020, Gregg Rothermel and Doo-Hwan Bae (Eds.). ACM, 322–334. https://doi.org/10.1145/3377811.3380327
[27]
Sen Chen, Chunyang Chen, Lingling Fan, Mingming Fan, Xian Zhan, and Yang Liu. 2021. Accessible or Not An Empirical Investigation of Android App Accessibility. IEEE Transactions on Software Engineering, 1–1. https://doi.org/10.1109/TSE.2021.3108162
[28]
Sidong Feng, Minmin Jiang, Tingting Zhou, Yankun Zhen, and Chunyang Chen. 2022. Auto-Icon+: An Automated End-to-End Code Generation Tool for Icon Designs in UI Development. ACM Trans. Interact. Intell. Syst., apr, issn:2160-6455 https://doi.org/10.1145/3531065 Just Accepted.
[29]
Sidong Feng, Suyu Ma, Jinzhong Yu, Chunyang Chen, Tingting Zhou, and Yankun Zhen. 2021. Auto-Icon: An Automated Code Generation Tool for Icon Designs Assisting in UI Development. In IUI ’21: 26th International Conference on Intelligent User Interfaces, College Station, TX, USA, April 13-17, 2021, Tracy Hammond, Katrien Verbert, Dennis Parra, Bart P. Knijnenburg, John O’Donovan, and Paul Teale (Eds.). ACM, 59–69. https://doi.org/10.1145/3397481.3450671
[30]
Margherita Grandini, Enrico Bagli, and Giorgio Visani. 2020. Metrics for Multi-Class Classification: an Overview. CoRR, abs/2008.05756 (2020), arXiv:2008.05756. arxiv:2008.05756
[31]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep Residual Learning for Image Recognition. In 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016, Las Vegas, NV, USA, June 27-30, 2016. IEEE Computer Society, 770–778. https://doi.org/10.1109/CVPR.2016.90
[32]
Gao Huang, Zhuang Liu, and Kilian Q. Weinberger. 2016. Densely Connected Convolutional Networks. CoRR, abs/1608.06993 (2016), arXiv:1608.06993. arxiv:1608.06993
[33]
Diederik P. Kingma and Jimmy Ba. 2015. Adam: A Method for Stochastic Optimization. In 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7-9, 2015, Conference Track Proceedings, Yoshua Bengio and Yann LeCun (Eds.). arxiv:1412.6980
[34]
Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. 2012. ImageNet Classification with Deep Convolutional Neural Networks. In Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012, Lake Tahoe, Nevada, United States, Peter L. Bartlett, Fernando C. N. Pereira, Christopher J. C. Burges, Léon Bottou, and Kilian Q. Weinberger (Eds.). 1106–1114. https://proceedings.neurips.cc/paper/2012/hash/c399862d3b9d6b76c8436e924a68c45b-Abstract.html
[35]
Yuxuan Li, Ruitao Feng, Sen Chen, Qianyu Guo, Lingling Fan, and Xiaohong Li. 2021. IconChecker: Anomaly Detection of Icon-Behaviors for Android Apps. In 2021 28th Asia-Pacific Software Engineering Conference (APSEC). 202–212. https://doi.org/10.1109/APSEC53868.2021.00028
[36]
Yuanchun Li, Ziyue Yang, Yao Guo, and Xiangqun Chen. 2017. DroidBot: a lightweight UI-guided test input generator for Android. In Proceedings of the 39th International Conference on Software Engineering, ICSE 2017, Buenos Aires, Argentina, May 20-28, 2017 - Companion Volume, Sebastián Uchitel, Alessandro Orso, and Martin P. Robillard (Eds.). IEEE Computer Society, 23–26. https://doi.org/10.1109/ICSE-C.2017.8
[37]
Hsuan Lin, Yu-Chen Hsieh, and Wei Lin. 2016. A Preliminary Study on How the Icon Composition and Background of Graphical Icons Affect Users’ Preference Levels. In Human Aspects of IT for the Aged Population. Design for Aging - Second International Conference, ITAP 2016, Held as Part of HCI International 2016, Toronto, ON, Canada, July 17-22, 2016, Proceedings, Part I, Jia Zhou and Gavriel Salvendy (Eds.) (Lecture Notes in Computer Science, Vol. 9754). Springer, 360–370. https://doi.org/10.1007/978-3-319-39943-0_35
[38]
Hsuan Lin, Yu-Chen Hsieh, and Wei Lin. 2017. Shape Design and Exploration of 2D and 3D Graphical Icons. In Human Aspects of IT for the Aged Population. Applications, Services and Contexts - Third International Conference, ITAP 2017, Held as Part of HCI International 2017, Vancouver, BC, Canada, July 9-14, 2017, Proceedings, Part II, Jia Zhou and Gavriel Salvendy (Eds.) (Lecture Notes in Computer Science, Vol. 10298). Springer, 79–91. https://doi.org/10.1007/978-3-319-58536-9_7
[39]
Thomas F. Liu, Mark Craft, Jason Situ, Ersin Yumer, Radomir Mech, and Ranjitha Kumar. 2018. Learning Design Semantics for Mobile Apps. In The 31st Annual ACM Symposium on User Interface Software and Technology (UIST ’18). ACM, New York, NY, USA. 569–579. isbn:978-1-4503-5948-1 https://doi.org/10.1145/3242587.3242650
[40]
Aiguo Lu and Chengqi Xue. 2020. A Study on Search Performance and Threshold Range of Icons. In Engineering Psychology and Cognitive Ergonomics. Mental Workload, Human Physiology, and Human Energy - 17th International Conference, EPCE 2020, Held as Part of the 22nd HCI International Conference, HCII 2020, Copenhagen, Denmark, July 19-24, 2020, Proceedings, Part I, Don Harris and Wen-Chin Li (Eds.) (Lecture Notes in Computer Science, Vol. 12186). Springer, 62–68. https://doi.org/10.1007/978-3-030-49044-7_6
[41]
Zijing Luo, Chengqi Xue, Yafeng Niu, Xinyue Wang, Bingzheng Shi, Lingcun Qiu, and Yi Xie. 2019. An Evaluation Method of the Influence of Icon Shape Complexity on Visual Search Based on Eye Tracking. In Design, User Experience, and Usability. User Experience in Advanced Technological Environments - 8th International Conference, DUXU 2019, Held as Part of the 21st HCI International Conference, HCII 2019, Orlando, FL, USA, July 26-31, 2019, Proceedings, Part II, Aaron Marcus and Wentao Wang (Eds.) (Lecture Notes in Computer Science, Vol. 11584). Springer, 44–55. https://doi.org/10.1007/978-3-030-23541-3_4
[42]
Forough Mehralian, Navid Salehnamadi, and Sam Malek. 2021. Data-Driven Accessibility Repair Revisited: On the Effectiveness of Generating Labels for Icons in Android Apps. In Proc. ESEC/FSE (ESEC/FSE 2021). Association for Computing Machinery, New York, NY, USA. 107–118. isbn:9781450385626 https://doi.org/10.1145/3468264.3468604
[43]
Kevin Moran, Boyang Li, Carlos Bernal-Cárdenas, Dan Jelf, and Denys Poshyvanyk. 2018. Automated reporting of GUI design violations for mobile apps. In Proceedings of the 40th International Conference on Software Engineering, ICSE 2018, Gothenburg, Sweden, May 27 - June 03, 2018, Michel Chaudron, Ivica Crnkovic, Marsha Chechik, and Mark Harman (Eds.). ACM, 165–175. https://doi.org/10.1145/3180155.3180246
[44]
Simone Mutti, Yanick Fratantonio, Antonio Bianchi, Luca Invernizzi, Jacopo Corbetta, Dhilung Kirat, Christopher Kruegel, and Giovanni Vigna. 2015. Baredroid: Large-scale analysis of android apps on real devices. In Proceedings of the 31st Annual Computer Security Applications Conference. 71–80.
[45]
Jeffrey Pennington, Richard Socher, and Christopher D. Manning. 2014. Glove: Global Vectors for Word Representation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, EMNLP 2014, October 25-29, 2014, Doha, Qatar, A meeting of SIGDAT, a Special Interest Group of the ACL, Alessandro Moschitti, Bo Pang, and Walter Daelemans (Eds.). ACL, 1532–1543. https://doi.org/10.3115/v1/d14-1162
[46]
Karen Simonyan and Andrew Zisserman. 2015. Very Deep Convolutional Networks for Large-Scale Image Recognition. In 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7-9, 2015, Conference Track Proceedings, Yoshua Bengio and Yann LeCun (Eds.). arxiv:1409.1556
[47]
Mick Smythwood, Siné McDougall, and Mirsad Hadzikadic. 2019. Search-Efficacy of Modern Icons Varying in Appeal and Visual Complexity. In Design, User Experience, and Usability. User Experience in Advanced Technological Environments - 8th International Conference, DUXU 2019, Held as Part of the 21st HCI International Conference, HCII 2019, Orlando, FL, USA, July 26-31, 2019, Proceedings, Part II, Aaron Marcus and Wentao Wang (Eds.) (Lecture Notes in Computer Science, Vol. 11584). Springer, 94–104. https://doi.org/10.1007/978-3-030-23541-3_8
[48]
Niko Sünderhauf, Oliver Brock, Walter J. Scheirer, Raia Hadsell, Dieter Fox, Jürgen Leitner, Ben Upcroft, Pieter Abbeel, Wolfram Burgard, Michael Milford, and Peter Corke. 2018. The Limits and Potentials of Deep Learning for Robotics. CoRR, abs/1804.06557 (2018), arXiv:1804.06557. arxiv:1804.06557
[49]
Mingxing Tan and Quoc V. Le. 2019. EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks. In Proceedings of the 36th International Conference on Machine Learning, ICML 2019, 9-15 June 2019, Long Beach, California, USA, Kamalika Chaudhuri and Ruslan Salakhutdinov (Eds.) (Proceedings of Machine Learning Research, Vol. 97). PMLR, 6105–6114. http://proceedings.mlr.press/v97/tan19a.html
[50]
Mengyue Wang and Xin Li. 2017. Effects of the aesthetic design of icons on app downloads: evidence from an android market. Electron. Commer. Res., 17, 1 (2017), 83–102. https://doi.org/10.1007/s10660-016-9245-4
[51]
Zhou Wang, Alan C. Bovik, Hamid R. Sheikh, and Eero P. Simoncelli. 2004. Image quality assessment: from error visibility to structural similarity. IEEE Trans. Image Process., 13, 4 (2004), 600–612. https://doi.org/10.1109/TIP.2003.819861
[52]
Michael J. Wilber, Walter J. Scheirer, Phil Leitner, Brian Heflin, James Zott, Daniel Reinke, David K. Delaney, and Terrance E. Boult. 2013. Animal recognition in the Mojave Desert: Vision tools for field biologists. In 2013 IEEE Workshop on Applications of Computer Vision, WACV 2013, Clearwater Beach, FL, USA, January 15-17, 2013. IEEE Computer Society, 206–213. https://doi.org/10.1109/WACV.2013.6475020
[53]
Shengqu Xi, Shao Yang, Xusheng Xiao, Yuan Yao, Yayuan Xiong, Fengyuan Xu, Haoyu Wang, Peng Gao, Zhuotao Liu, Feng Xu, and Jian Lu. 2019. DeepIntent: Deep Icon-Behavior Learning for Detecting Intention-Behavior Discrepancy in Mobile Apps. In Proceedings of the 2019 ACM SIGSAC Conference on Computer and Communications Security, CCS 2019, London, UK, November 11-15, 2019, Lorenzo Cavallaro, Johannes Kinder, XiaoFeng Wang, and Jonathan Katz (Eds.). ACM, 2421–2436. https://doi.org/10.1145/3319535.3363193
[54]
Xusheng Xiao, Xiaoyin Wang, Zhihao Cao, Hanlin Wang, and Peng Gao. 2019. IconIntent: automatic identification of sensitive UI widgets based on icon classification for Android apps. In Proceedings of the 41st International Conference on Software Engineering, ICSE 2019, Montreal, QC, Canada, May 25-31, 2019, Joanne M. Atlee, Tevfik Bultan, and Jon Whittle (Eds.). IEEE / ACM, 257–268. https://doi.org/10.1109/ICSE.2019.00041
[55]
Bo Yang, Zhenchang Xing, Xin Xia, Chunyang Chen, Deheng Ye, and Shanping Li. 2021. Don’t Do That! Hunting Down Visual Design Smells in Complex UIs against Design Guidelines. In 43rd IEEE/ACM International Conference on Software Engineering, ICSE 2021, Madrid, Spain, 22-30 May 2021. IEEE, 761–772. https://doi.org/10.1109/ICSE43902.2021.00075
[56]
Dehai Zhao, Zhenchang Xing, Chunyang Chen, Xiwei Xu, Liming Zhu, Guoqiang Li, and Jinshui Wang. 2020. Seenomaly: vision-based linting of GUI animation effects against design-don’t guidelines. In ICSE ’20: 42nd International Conference on Software Engineering, Seoul, South Korea, 27 June - 19 July, 2020, Gregg Rothermel and Doo-Hwan Bae (Eds.). ACM, 1286–1297. https://doi.org/10.1145/3377811.3380411

Cited By

View all
  • (2024)VisionTasker: Mobile Task Automation Using Vision Based UI Understanding and LLM Task PlanningProceedings of the 37th Annual ACM Symposium on User Interface Software and Technology10.1145/3654777.3676386(1-17)Online publication date: 13-Oct-2024

Index Terms

  1. What You See Is What You Get? It Is Not the Case! Detecting Misleading Icons for Mobile Applications

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    ISSTA 2023: Proceedings of the 32nd ACM SIGSOFT International Symposium on Software Testing and Analysis
    July 2023
    1554 pages
    ISBN:9798400702211
    DOI:10.1145/3597926
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 13 July 2023

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Android Apps
    2. Deep Learning
    3. Discrepancy Detection
    4. Icon Design

    Qualifiers

    • Research-article

    Conference

    ISSTA '23
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 58 of 213 submissions, 27%

    Upcoming Conference

    ISSTA '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)103
    • Downloads (Last 6 weeks)6
    Reflects downloads up to 05 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)VisionTasker: Mobile Task Automation Using Vision Based UI Understanding and LLM Task PlanningProceedings of the 37th Annual ACM Symposium on User Interface Software and Technology10.1145/3654777.3676386(1-17)Online publication date: 13-Oct-2024

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media