FedSpam: Privacy Preserving SMS Spam Prediction

Sidhpura, Jiten; Shah, Parshwa; Veerkhare, Rudresh; Godbole, Anand

doi:10.1007/978-981-99-1645-0_5

Jiten Sidhpura¹⁰,
Parshwa Shah¹⁰,
Rudresh Veerkhare¹⁰ &
…
Anand Godbole¹⁰

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1793))

Included in the following conference series:

International Conference on Neural Information Processing

839 Accesses

Abstract

SMSes are a great way to communicate via short text. But, some people take advantage of this service by spamming innocent people. Deep learning models need more SMS data to be adaptive and accurate. But, SMS data being sensitive, should not leave the device premises. Therefore, we have proposed a federated learning approach in our research study. Initially, a distilBERT model having validation accuracy of 98% is transported to mobile clients. Mobile clients train this local model via the SMSes received and send their local model weights to server for aggregation. The process is done iteratively making the model robust and resistant to latest spam techniques. Model prediction analysis is done at server side using global model to check which words in message influence spam and ham. On-device training experiment is conducted on a client and it is observed that the losses of the global model converge after every iteration.

J. Sidhpura, P. Shah, R. Veerkhare—Contributed equally to this work.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 99.00; Price excludes VAT (USA)

Softcover Book: USD 129.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

HOW MANY SMARTPHONES ARE IN THE WORLD? https://www.bankmycell.com/blog/how-many-phones-are-in-the-world. Accessed 19 April 2022
Mishra, S., Soni, D.: DSmishSMS-A System to Detect Smishing SMS. Neural Comput. Appl. (2021)
Google Scholar
Julis, M., Alagesan, S.: Spam detection in sms using machine learning through text mining. Int. J. Sci. Technol. Res. 9, 498–503 (2020)
Google Scholar
Gomaa, W.: The impact of deep learning techniques on SMS spam filtering. Int. J. Adv. Comput. Sci. Appl. 11 (2020). https://doi.org/10.14569/IJACSA.2020.0110167
Liu, X., Lu, H., Nayak, A.: A spam transformer model for SMS spam detection. IEEE Access 9, 80253–80263 (2021). https://doi.org/10.1109/ACCESS.2021.3081479
Article Google Scholar
Rojas-Galeano, S.: Using BERT Encoding to Tackle the Mad-lib Attack in SMS Spam Detection (2021)
Google Scholar
Sanh, V., Debut, L., Chaumond, J., Wolf, T.: DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter (2019)
Google Scholar
Metsis, V., Androutsopoulos, I., Paliouras, G.: Spam filtering with Naive Bayes - which Naive Bayes? In: Proceedings of the 3rd Conference on Email and Anti-Spam (CEAS 2006), Mountain View, CA, USA (2006)
Google Scholar
Almeida, T.A., Gómez Hidalgo, J.M., Yamakami, A.: Contributions to the study of SMS spam filtering: new collection and results. In: Proceedings of the 2011 ACM Symposium on Document Engineering (DOCENG 2011), Mountain View, CA, USA (2011)
Google Scholar
Nuruzzaman, M.T., Lee, C., Choi, D.: Independent and personal SMS spam filtering. In: 2011 IEEE 11th International Conference on Computer and Information Technology, pp. 429–435 (2011). https://doi.org/10.1109/CIT.2011.23
McMahan, B., et al.: Communication-efficient learning of deep networks from decentralized data. Artificial intelligence and statistics. PMLR (2017)
Google Scholar
Your voice & audio data stays private while Google Assistant improves. https://support.google.com/assistant/answer/10176224?hl=en#zippy=. Accessed 19 April 2022
Hard, A., et al.: Federated Learning for Mobile Keyboard Prediction (2018)
Google Scholar
Lundberg, S.M., Lee, S.I.: A unified approach to interpreting model predictions. In: Advances in Neural Information Processing Systems, 30 (2017)
Google Scholar
Beutel, D.J., et al.: Flower: a friendly federated learning research framework. arXiv preprint arXiv:2007.14390 (2020)

Download references

Author information

Authors and Affiliations

Sardar Patel Institute of Technology, Mumbai, India
Jiten Sidhpura, Parshwa Shah, Rudresh Veerkhare & Anand Godbole

Authors

Jiten Sidhpura
View author publications
You can also search for this author in PubMed Google Scholar
Parshwa Shah
View author publications
You can also search for this author in PubMed Google Scholar
Rudresh Veerkhare
View author publications
You can also search for this author in PubMed Google Scholar
Anand Godbole
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jiten Sidhpura .

Editor information

Editors and Affiliations

Indian Institute of Technology Indore, Indore, India
Mohammad Tanveer
Indian Institute of Information Technology - Allahabad, Prayagraj, India
Sonali Agarwal
Kobe University, Kobe, Japan
Seiichi Ozawa
Indian Institute of Technology Patna, Patna, India
Asif Ekbal
University of Innsbruck, Innsbruck, Austria
Adam Jatowt

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sidhpura, J., Shah, P., Veerkhare, R., Godbole, A. (2023). FedSpam: Privacy Preserving SMS Spam Prediction. In: Tanveer, M., Agarwal, S., Ozawa, S., Ekbal, A., Jatowt, A. (eds) Neural Information Processing. ICONIP 2022. Communications in Computer and Information Science, vol 1793. Springer, Singapore. https://doi.org/10.1007/978-981-99-1645-0_5

Download citation

DOI: https://doi.org/10.1007/978-981-99-1645-0_5
Published: 14 April 2023
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-1644-3
Online ISBN: 978-981-99-1645-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics