Abstract
Blockchain is changing science and technology in a revolutionary way for its decentralized, incorruptible computing mechanism. This work explores blockchain applications in speech recognition via investigating decentralized deep learning models. The decentralized deep learning models demonstrate a good potential to handle large scale acoustic data by fusing distributed deep learning models to achieve better learning results. To the best of our knowledge, it is a pioneering work to explore blockchain technologies in speech recognition.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Indurkhya, N., Damerau, F.J.: Handbook of Natural Language Processing, 2nd edn., pp. 339–365 (2010). Chapman and Hall/CRC Press, Boca Raton (2010)
Zhang, Y.: Speech Recognition Using Deep Learning Algorithms (2013). http://cs229.stanford.edu/proj2013/
Mendis, G.J., Sabounchi, M., Wei, J.: Blockchain as a Service: An Autonomous, Privacy Preserving, Decentralized Architecture for Deep Learning (2018). https://arxiv.org/abs/1807.02515
Bengio, Y.: Learning deep architectures for AI. Found. Trends Mach. Learn. 2(1), 1–127 (2009)
Bengio, Y.: Deep learning of representations: looking forward. In: Dediu, A.-H., MartÃn-Vide, C., Mitkov, R., Truthe, B. (eds.) SLSP 2013. LNCS (LNAI), vol. 7978, pp. 1–37. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-39593-2_1
Bengio Y., Courville, A., Vincent, P.: Representation learning: a review and new perspectives. IEEE Trans. PAMI 35, 1798–1828 (2013)
Deng, L.: A tutorial survey of architectures, algorithms, and applications for deep learning. In: APSIPA Transactions on Signal and Information Processing, Cambridge University Press (2014, to appear)
Mohamed, A., Dahl, G., Hinton, G.: Deep belief networks for phone recognition. In: Proceedings of the NIPS Workshop Deep Learning for Speech Recognition and Related Applications (2009)
Deng, L., Seltzer, M., Yu, D., et al.: Binary coding of speech spectrograms using a deep auto-encoder. In: Interspeech (2010)
Dahl, G., Yu, D., Deng, L., Acero, A.: Large vocabulary continuous speech recognition with context-dependent DBN-HMMs. In: ICASSP (2011)
Dahl, G., Yu, D., Deng, L., Acero, A.: Context-dependent pre-trained deep neural networks for large vocabulary speech recognition. IEEE Trans. Audio Speech Lang Proc. 20, 30–42 (2012)
Mohamed, A., Dahl, G., Hinton, G.: Acoustic modeling using deep belief networks. IEEE Trans. Audio, Speech Lang. Proc. 20(1), 14–22 (2012)
Mohamed, A., Hinton, G., Penn, G.: Understanding how deep belief networks perform acoustic modelling. In: Proceedings of the ICASSP (2012)
Morgan, N.: Deep and wide: multiple layers in automatic speech recognition. IEEE Trans. Audio Speech Lang. Proc. 20(1), 7–13 (2012)
Goodfellow, I., Bengio, Y., Courville, A.: Deep Learning. MIT Press, Cambridge (2016). http://www.deeplearningbook.org
Deng, L., Li, J., Huang, J.T., et al.: Recent advances in deep learning for speech research at microsoft. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2013)
Qin, C.-X., Zhang, L.H.: Deep neural network based feature extraction for low-resource speech recognition. Acta Automatica Sinica 43(7), 1208–1219 (2017)
Wu, W., Cai, M., et al.: Bottleneck features and subspace Gaussian mixture models for low-resource speech recognition. J. Univ. Chin. Acad. Sci. 32(1), 97–102 (2015)
Liu, J., Zhang, W.: Research progress on key technologies of low resource speech recognition. J. Data Acquis. Process. 32(2), 205–220 (2017)
Shu, F., Qu, D., et al.: A speech recognition method using long short-term memory network in low resources. J. Xi’an Jiaotong Univ. 51(10), 120–127 (2017)
Qin, C., Zhang, L.: Acoustic modelling approach of multi-stream feature incorporated convolutional neural network for low-resource speech recognition speech recognition. J. Comput. Appl. 36(9), 2609–2615 (2016)
Graves, A., Mohamed, A.-R., Hinton, G.: Speech recognition with deep recurrent neural networks. In: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 6645–6649. IEEE (2013)
Zhang, Y., Pezeshki, M., Brakel, P., et al.: Towards end-to-end speech recognition with deep convolutional neural networks. arXiv preprint arXiv:1701.02720 (2017)
Abdel-Hamid, O., Mohamed, A.R., Jiang, H., et al.: Convolutional neural networks for speech recognition. IEEE/ACM Trans. Audio Speech Lang. Process. 22(10), 1533–1545 (2014)
Young, T., Hazarika, D., Poria, S., Cambria, E.: Recent trends in deep learning based natural language processing. arXiv preprint arXiv:1708.02709 (2017)
Schmidhuber, J.: Deep learning in neural networks: an overview. Neural Netw. 61, 85–117 (2015)
Xu, X., Pautasso, C., Zhu, L., et al.: The blockchain as a software connector. In: 13th Working IEEE/IFIP Conference on Software Architecture (WICSA), pp. 182–191 (2016)
Dennis, R., Owen, G.: Rep on the block: a next generation reputation system based on the blockchain. In: International Conference for Internet Technology and Secured Transactions (ICITST), pp. 131–138. IEEE (2015)
Watanabe, H., Fujimura, S., Nakadaira, A., et al.: Blockchain contract: a complete consensus using blockchain. In: IEEE 4th Global Conference on Consumer Electronics (GCCE), pp. 577–578 (2015)
Shokri, R., Shmatikov, V.: Privacy-preserving deep learning. In: Proceedings of the 22nd ACM SIGSAC Conference on Computer and Communications Security, pp. 1310–1321 (2015)
Konecny, J., McMahan, H.B., Ramage, D., Richtárik, P.: Federated optimization: distributed machine learning for on-device intelligence. arXiv preprint arXiv:1610.02527 (2016)
Kokkinos, Y., Margaritis, K.G.: Confifidence ratio affinity propagation in ensemble selection of neural network classifiers for distributed privacy-preserving data mining. Neurocomputing 150, 513–528 (2015)
Easton, J.: Blockchains: a distributed data ledger for the railway industry. In: Innovative Applications of Big Data in the Railway Industry. IGI Global, Hershey (2018)
Ramachandran, S., Krishnmachari, B.: Blockchain for the IoT: opportunities and challenges (2018)
Konstantinidis, I., Siaminos, G., Timplalexis, C., Zervas, P., Peristeras, V., Decker, S.: Blockchain for business applications: a systematic literature review. In: Abramowicz, W., Paschke, A. (eds.) BIS 2018. LNBIP, vol. 320, pp. 384–399. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-93931-5_28
Kosba, A., et al.: Hawk the blockchain model of cryptography and privacy-preserving smart contracts. In: 2016 IEEE Symposium on Security and Privacy, p. 848 (2016)
Theoduli, E., et al.: On the design of a blockchain-based system to facilitate healthcare data sharing. In: 17th IEEE International Conference on Trust Security and Privacy in Computing and Communications/12th IEEE International Conference on Big Data Science and Engineering (2018)
Acknowledgments
This work is partially supported by the National Natural Science Foundation of China under Grant No. 61501388 and the PSFQ (Provincial Science Foundation of Qinghai, China) under Grant No. 2016-ZJ-904.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Yang, X., Huang, H. (2020). Exploring Blockchain in Speech Recognition. In: Han, H., Wei, T., Liu, W., Han, F. (eds) Recent Advances in Data Science. IDMB 2019. Communications in Computer and Information Science, vol 1099. Springer, Singapore. https://doi.org/10.1007/978-981-15-8760-3_15
Download citation
DOI: https://doi.org/10.1007/978-981-15-8760-3_15
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-8759-7
Online ISBN: 978-981-15-8760-3
eBook Packages: Computer ScienceComputer Science (R0)