Exploring Blockchain in Speech Recognition

Yang, Xuemei; Huang, Heming

doi:10.1007/978-981-15-8760-3_15

Part of the book series: Communications in Computer and Information Science (CCIS, volume 1099)

Included in the following conference series:

International Conference on Data Science, Medicine and Bioinformatics

Abstract

Blockchain is changing science and technology in a revolutionary way through its decentralized, incorruptible computing mechanism. This work explores blockchain applications in speech recognition by investigating decentralized deep learning models. These models show good potential for handling large-scale acoustic data: distributed deep learning models are fused to achieve better learning results. To the best of our knowledge, this is a pioneering work exploring blockchain technologies in speech recognition.
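
The abstract does not say how the distributed models are fused or how the ledger is used, so the following is only a minimal sketch under stated assumptions, not the authors' method. It assumes each node contributes a per-class probability vector for one acoustic frame, that fusion is a weighted average of those vectors, and that contributions are recorded in a hash-linked list standing in for a blockchain. The function names `fuse_posteriors`, `append_block`, and `chain_is_valid` are hypothetical.

```python
import hashlib
import json

GENESIS_HASH = "0" * 64  # assumed placeholder for the first block's predecessor


def fuse_posteriors(posteriors, weights=None):
    """Weighted average of per-class probability vectors from several models."""
    if weights is None:
        weights = [1.0] * len(posteriors)
    total = sum(weights)
    n_classes = len(posteriors[0])
    return [
        sum(w * p[c] for w, p in zip(weights, posteriors)) / total
        for c in range(n_classes)
    ]


def _block_hash(prev_hash, payload):
    """SHA-256 over the previous hash and a JSON-serializable payload."""
    body = json.dumps({"prev": prev_hash, "payload": payload}, sort_keys=True)
    return hashlib.sha256(body.encode("utf-8")).hexdigest()


def append_block(chain, payload):
    """Append a payload to a hash-linked list (a toy stand-in for a ledger)."""
    prev_hash = chain[-1]["hash"] if chain else GENESIS_HASH
    block = {
        "prev": prev_hash,
        "payload": payload,
        "hash": _block_hash(prev_hash, payload),
    }
    chain.append(block)
    return block


def chain_is_valid(chain):
    """Recompute every hash and check that each block links to its predecessor."""
    prev_hash = GENESIS_HASH
    for block in chain:
        if block["prev"] != prev_hash:
            return False
        if block["hash"] != _block_hash(block["prev"], block["payload"]):
            return False
        prev_hash = block["hash"]
    return True


if __name__ == "__main__":
    # Hypothetical posteriors from three nodes over three phone classes.
    node_outputs = [[0.7, 0.2, 0.1], [0.6, 0.3, 0.1], [0.2, 0.5, 0.3]]
    chain = []
    for node_id, posterior in enumerate(node_outputs):
        append_block(chain, {"node": node_id, "posterior": posterior})
    print(fuse_posteriors(node_outputs))
    print(chain_is_valid(chain))
```

Averaging posteriors is only one of several possible fusion rules; parameter averaging or ensemble voting would fit the same description. The hash chain here only shows why recorded contributions are hard to alter unnoticed. It has no consensus protocol or network layer.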

Acknowledgments

This work is partially supported by the National Natural Science Foundation of China under Grant No. 61501388 and the PSFQ (Provincial Science Foundation of Qinghai, China) under Grant No. 2016-ZJ-904.

Author information

Authors and Affiliations

Xianyang Normal University, Xianyang, 712000, China
Xuemei Yang
Qinghai Normal University, Xining, 810008, Qinghai, China
Heming Huang

Corresponding author

Correspondence to Xuemei Yang.

Editor information

Editors and Affiliations

Fordham University, New York, NY, USA
Henry Han
Guangxi University, Nanning, China
Tie Wei
Guangzhou University, Guangzhou, China
Wenbin Liu
Jiangsu University, Zhenjiang, China
Fei Han

About this paper

Cite this paper

Yang, X., Huang, H. (2020). Exploring Blockchain in Speech Recognition. In: Han, H., Wei, T., Liu, W., Han, F. (eds) Recent Advances in Data Science. IDMB 2019. Communications in Computer and Information Science, vol 1099. Springer, Singapore. https://doi.org/10.1007/978-981-15-8760-3_15

DOI: https://doi.org/10.1007/978-981-15-8760-3_15
Published: 29 September 2020
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-8759-7
Online ISBN: 978-981-15-8760-3
eBook Packages: Computer Science, Computer Science (R0)
