Skip to main content

FedSpam: Privacy Preserving SMS Spam Prediction

  • Conference paper
  • First Online:
Neural Information Processing (ICONIP 2022)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1793))

Included in the following conference series:

  • 839 Accesses

Abstract

SMSes are a great way to communicate via short text. But, some people take advantage of this service by spamming innocent people. Deep learning models need more SMS data to be adaptive and accurate. But, SMS data being sensitive, should not leave the device premises. Therefore, we have proposed a federated learning approach in our research study. Initially, a distilBERT model having validation accuracy of 98% is transported to mobile clients. Mobile clients train this local model via the SMSes received and send their local model weights to server for aggregation. The process is done iteratively making the model robust and resistant to latest spam techniques. Model prediction analysis is done at server side using global model to check which words in message influence spam and ham. On-device training experiment is conducted on a client and it is observed that the losses of the global model converge after every iteration.

J. Sidhpura, P. Shah, R. Veerkhare—Contributed equally to this work.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 99.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 129.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. HOW MANY SMARTPHONES ARE IN THE WORLD? https://www.bankmycell.com/blog/how-many-phones-are-in-the-world. Accessed 19 April 2022

  2. Mishra, S., Soni, D.: DSmishSMS-A System to Detect Smishing SMS. Neural Comput. Appl. (2021)

    Google Scholar 

  3. Julis, M., Alagesan, S.: Spam detection in sms using machine learning through text mining. Int. J. Sci. Technol. Res. 9, 498–503 (2020)

    Google Scholar 

  4. Gomaa, W.: The impact of deep learning techniques on SMS spam filtering. Int. J. Adv. Comput. Sci. Appl. 11 (2020). https://doi.org/10.14569/IJACSA.2020.0110167

  5. Liu, X., Lu, H., Nayak, A.: A spam transformer model for SMS spam detection. IEEE Access 9, 80253–80263 (2021). https://doi.org/10.1109/ACCESS.2021.3081479

    Article  Google Scholar 

  6. Rojas-Galeano, S.: Using BERT Encoding to Tackle the Mad-lib Attack in SMS Spam Detection (2021)

    Google Scholar 

  7. Sanh, V., Debut, L., Chaumond, J., Wolf, T.: DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter (2019)

    Google Scholar 

  8. Metsis, V., Androutsopoulos, I., Paliouras, G.: Spam filtering with Naive Bayes - which Naive Bayes? In: Proceedings of the 3rd Conference on Email and Anti-Spam (CEAS 2006), Mountain View, CA, USA (2006)

    Google Scholar 

  9. Almeida, T.A., Gómez Hidalgo, J.M., Yamakami, A.: Contributions to the study of SMS spam filtering: new collection and results. In: Proceedings of the 2011 ACM Symposium on Document Engineering (DOCENG 2011), Mountain View, CA, USA (2011)

    Google Scholar 

  10. Nuruzzaman, M.T., Lee, C., Choi, D.: Independent and personal SMS spam filtering. In: 2011 IEEE 11th International Conference on Computer and Information Technology, pp. 429–435 (2011). https://doi.org/10.1109/CIT.2011.23

  11. McMahan, B., et al.: Communication-efficient learning of deep networks from decentralized data. Artificial intelligence and statistics. PMLR (2017)

    Google Scholar 

  12. Your voice & audio data stays private while Google Assistant improves. https://support.google.com/assistant/answer/10176224?hl=en#zippy=. Accessed 19 April 2022

  13. Hard, A., et al.: Federated Learning for Mobile Keyboard Prediction (2018)

    Google Scholar 

  14. Lundberg, S.M., Lee, S.I.: A unified approach to interpreting model predictions. In: Advances in Neural Information Processing Systems, 30 (2017)

    Google Scholar 

  15. Beutel, D.J., et al.: Flower: a friendly federated learning research framework. arXiv preprint arXiv:2007.14390 (2020)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jiten Sidhpura .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Sidhpura, J., Shah, P., Veerkhare, R., Godbole, A. (2023). FedSpam: Privacy Preserving SMS Spam Prediction. In: Tanveer, M., Agarwal, S., Ozawa, S., Ekbal, A., Jatowt, A. (eds) Neural Information Processing. ICONIP 2022. Communications in Computer and Information Science, vol 1793. Springer, Singapore. https://doi.org/10.1007/978-981-99-1645-0_5

Download citation

  • DOI: https://doi.org/10.1007/978-981-99-1645-0_5

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-99-1644-3

  • Online ISBN: 978-981-99-1645-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics