Abstract
Robust classification strongly depends on the combination of properly chosen features and the classification algorithm. This paper investigates an autoencoder for feature fusion together with recurrent neural networks such as the Long Short-Term Memory neural networks (LSTMs) in different configurations applied to a dataset of a material transport process. As an important outcome the investigations show that the application of features acquired from the autoencoder bottleneck layer in combination with a bidirectional LSTM improve the classification algorithm significantly and require fewer features in comparison to standard machine learning algorithms.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Berckmans, D., Janssens, K., Van der Auweraer, H., Sas, P., Desmet, W.: Model-based synthesis of aircraft noise to quantify human perception of sound quality and annoyance. J. Sound Vib. 311(3–5), 1175–1195 (2008). https://doi.org/10.1016/j.jsv.2007.10.018
Husakovic, A., Pfann, E., Huemer, M.: Robust machine learning based acoustic classification of a material transport process. In: Proceedings of the 14 Symposium on Neural Networks and Applications (NEUREL), Belgrade, Serbia (2018). https://doi.org/10.1109/NEUREL.2018.8587031
Bae, S.H., Choi, I., Soo Kim, N.: Acoustic scene classification using parallel combination of LSTM and CNN, DCASE2016 challenge. Technical report, Budapest, Hungary (2016)
Han, K., Wang, Y., Zhang, C., Lee, C., Hu, C.: Autoencoder inspired unsupervised feature selection. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Calgary, Canada, pp. 2941–2945 (2018). https://doi.org/10.1109/ICASSP.2018.8462261
Huang, K., Wu, C., Yang, T., Su, M., Chou, J.: Speech emotion recognition using autoencoder bottleneck features and LSTM. In: Proceedings of the 2016 International Conference on Orange Technologies (ICOT), Melbourne, Australia, pp. 1–4 (2016). https://doi.org/10.1109/ICOT.2016.8278965
He, K., Zhang, X., Ren, S., Sun, J.: Delving deep into rectifiers: surpassing human-level performance on ImageNet classification. In: Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, Chile, pp. 1026–1034 (2015). https://doi.org/10.1109/ICCV.2015.123
Nguyen, T., Pernkopf, F.: Acoustic scene classification using a convolutional neural network ensemble and nearest neighbor filters. In: Proceedings of the Detection and Classification of Acoustic Scenes and Events 2018 Workshop (DCASE), Tampere, Finland, pp. 34–38 (2018)
Han, Y., Lee, K.: Acoustic scene classification using convolutional neural network and multiple-width frequency-delta data augmentation. arXiv preprint arXiv:1607.02383 (2016)
Acknowledgment
This work has been supported by the COMET-K2 “Center for Symbiotic Mechatronics” of the Linz Center of Mechatronics (LCM) funded by the Austrian federal government and the federal state of Upper Austria.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Husaković, A., Mayrhofer, A., Pfann, E., Huemer, M., Gaich, A., Kühas, T. (2020). Acoustic Monitoring – A Deep LSTM Approach for a Material Transport Process. In: Moreno-Díaz, R., Pichler, F., Quesada-Arencibia, A. (eds) Computer Aided Systems Theory – EUROCAST 2019. EUROCAST 2019. Lecture Notes in Computer Science(), vol 12014. Springer, Cham. https://doi.org/10.1007/978-3-030-45096-0_6
Download citation
DOI: https://doi.org/10.1007/978-3-030-45096-0_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-45095-3
Online ISBN: 978-3-030-45096-0
eBook Packages: Computer ScienceComputer Science (R0)