Abstract
Sexual harassment and abuse in rideshares is a growing problem. We propose a potential solution to this by using the voice recordings from the rideshare to detect cases of sexual harassment. Emotions such as fear, anger and disgust are most highly correlated to an individual being sexually harassed. Our solution aims to identify these emotions in a woman’s voice as an indicator of sexual harassment. The Ryerson Audio-Visual Database of Emotional Speech and Song dataset was used and offered voice recordings from male and female actors speaking sentences in different emotions. We extract the Mel-Frequency Cepstral Coefficient (MFCC) of the recordings in the dataset and run it through Machine Learning methods such as CNN (Convolutional Neural Network), SVM (Support Vector Machines) and LSTM (Long-Short Term Memory). We achieved an F1-score of 95% with the CNN model on our dataset.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ontario Human Rights Commission: Identifying sexual harassment. http://www.ohrc.on.ca/en/policy-preventing-sexual-and-gender-based-harassment/2-identifying-sexual-harassment
Drucker, H., Burges, C.J., Kaufman, L., Smola, A.J., Vapnik, V.: Support vector regression machines. In: Advances in Neural Information Processing Systems, pp. 155–161 (1997)
Hensley, R.: Cracks in the ridesharing market-and how to fill them. https://www.mckinsey.com/industries/automotive-and-assembly/our-insights/cracks-in-the-ridesharing-market-and-how-to-fill-them
Hochreiter, S., Schmidhuber, J.: LSTM can solve hard long time lag problems. In: Advances in Neural Information Processing Systems, pp. 473–479 (1997)
Likitha, M., Gupta, S.R.R., Hasitha, K., Raju, A.U.: Speech based human emotion recognition using MFCC. In: International Conference on Wireless Communications, Signal Processing and Networking (WiSPNET), pp. 2257–2260. IEEE (2017)
Lim, W., Jang, D., Lee, T.: Speech emotion recognition using convolutional and recurrent neural networks. In: Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA), pp. 1–4. IEEE (2016)
Livingstone, S.R., Russo, F.A.: The Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS): a dynamic, multimodal set of facial and vocal expressions in north American English. PloS one 13(5), e0196391 (2018)
Milton, A., Roy, S.S., Selvi, S.T.: SVM scheme for speech emotion recognition using MFCC feature. Int. J. Comput. Appl. 69(9), 34–39 (2013)
Pan, Y., Shen, P., Shen, L.: Speech emotion recognition using support vector machine. Int. J. Smart Home 6(2), 101–108 (2012)
Practical-Cryptography-Website: MFCC generation algorihm. http://practicalcryptography.com/miscellaneous/machine-learning/guide-mel-frequency-cepstral-coefficients-mfccs/
Sermanet, P., Chintala, S., LeCun, Y.: Convolutional neural networks applied to house numbers digit classification. arXiv preprint arXiv:1204.3968 (2012)
Acknowledgements
Thanks to Alice Kates for her valuable feedback and Yash Gupta for his initial suggestions on the topic.
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Sakhuja, S., Cohen, R. (2020). RideSafe: Detecting Sexual Harassment in Rideshares. In: Goutte, C., Zhu, X. (eds) Advances in Artificial Intelligence. Canadian AI 2020. Lecture Notes in Computer Science(), vol 12109. Springer, Cham. https://doi.org/10.1007/978-3-030-47358-7_48
Download citation
DOI: https://doi.org/10.1007/978-3-030-47358-7_48
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-47357-0
Online ISBN: 978-3-030-47358-7
eBook Packages: Computer ScienceComputer Science (R0)