skip to main content
10.1145/2897073.2897098acmconferencesArticle/Chapter ViewAbstractPublication PagesicseConference Proceedingsconference-collections
research-article

Evaluating BSS algorithms in a mobile context realized by a client-server architecture

Published: 14 May 2016 Publication History

Abstract

The human daily and the professional life demand a high amount of communication ability, but every fourth adult above 50 is hearing-impaired, a fraction that steadily increases in an aging society. For an autonomous, self-confident and long productive life, a good speech understanding in everyday life situations is necessary to reduce the listening effort. For this purpose, a mobile app-based assistance system based on Blind Source Separation is required that makes every-day acoustic scenarios more transparent by the opportunity of an interactive focusing on the preferred sound source in as close to real-time as possible. In case of highly costly BSS algorithms, at least an offline separation is to be provided. Developing such an app in the context of a short-term research project with limited budget to realize this goal statement makes it impossible to meet the challenge as a stand-alone solution with existing technologies and hardware. As an alternative, employing part of the required soft- and/or hardware on a remote server at least maintains the mobile context, given sufficient connectivity. For this purpose, a client-server architecture that combines Android, Java, MatlabControl and MATLAB, and that conducts separation of live recorded audio data remotely, is explained and tested. Conclusions about what is possible for the offline case are drawn from that. Tests are evaluated using a set of objective and subjective criteria. This demonstrates the possibility of realizing the assistance system in a mobile context.

References

[1]
Android Developers. Starting another Activity, October 2014.
[2]
Android Developers. Audio Capture, October 2015.
[3]
Android Developers. Audiorecord, October 2015.
[4]
Android Developers. Objectinputstream, October 2015.
[5]
Android Developers. Socket, October 2015.
[6]
Android Open Source Project. Audio, October 2015.
[7]
ARM. ARM - the architecture for the digital world, June 2015.
[8]
E. C. Cherry. Some experiments on the recognition of speech, with one and with two ears. The Journal of the Acoustical Society of America, 25(5):975--979, 1953.
[9]
CPUBoss. CPUBoss - best CPUs - all the CPUs ranked in real-time!, June 2015.
[10]
J. Drake, Z. Lanier, C. Mulliner, P. Fora, S. Ridley, and G. Wicherski. Android Hacker's Handbook. Wiley, 2014.
[11]
V. Emiya, E. Vincent, N. Harlander, and V. Hohmann. Multi-criteria subjective and objective evaluation ofaudio source separation. IEEE Transactions on Audio, Speech, and Language Processing, 9, 1 2010.
[12]
J. Fitzpatrick. An interview with Steve Furber. Commun. ACM, 54(5):34--39, May 2011.
[13]
M. Gargenta and M. Nakamura. Learning Android: Develop Mobile Apps Using Java and Eclipse. O'Reilly Media, 2014.
[14]
Gartner. Gartner: Worldwide smartphone sales to end users by operating system in 2013, February 2014.
[15]
Gartner. Gartner: Worldwide traditional PC, tablet, ultramobile and mobile phone shipments on pace to grow 7.6 percent in 2014, January 2014.
[16]
Google Project Hosting. MatlabControl - a Java API to interact with MATLAB, 2013.
[17]
Google Project Hosting. MatlabProxy (matlabcontrol documentation), 2015.
[18]
B. Hunt, R. Lipsman, and J. Rosenberg. A Guide to MATLAB: For Beginners and Experienced Users. Cambridge University Press, 2014.
[19]
V. Ingle and J. Proakis. Digital Signal Processing Using MATLAB. Cengage Learning, 2011.
[20]
Z. Koldovskyt' and P. Tichavsky. Time-domain blind separation of audio sources on the basis of a complete ICA decomposition of an observation space. Audio, Speech, and Language Processing, IEEE Transactions on, 19(2):406--416, Feb 2011.
[21]
MathWorks. MATLAB - the language of technical computing, 2014.
[22]
MathWorks. MATLAB Coder, October 2014.
[23]
MathWorks. Using TCP/IP server sockets, 2015.
[24]
J. McGovern. A Practical Guide to Enterprise Architecture. The Coad series. Prentice Hall PTR, 2004.
[25]
D. Nion, K. N. Mokios, N. D. Sidiropoulos, and A. Potamianos. Batch and adaptive PARAFAC-based blind separation of convolutive speech mixtures. Trans. Audio, Speech and Lang. Proc., 18(6):1193--1207, August 2010.
[26]
M. C. Offiah. Non-native mobile porting and multi-platform benchmarking of blind source separation algorithms. International Journal of Computing and Digital Systems, 5, 1 2016.
[27]
PassMark Software. PassMark Software - CPU benchmarks - CPU popularity in the last 90 days, June 2015.
[28]
D. A. Patterson and D. R. Ditzel. The case for the reduced instruction set computer. SIGARCH Comput. Archit. News, 8(6):25--33, October 1980.
[29]
M. Pedersen, D. Wang, J. Larsen, and U. Kjems. Two-microphone separation of speech mixtures. Neural Networks, IEEE Transactions on, 19(3):475--492, March 2008.
[30]
M. S. Pedersen, J. Larsen, U. Kjems, and L. C. Parra. A survey of convolutive blind source separation methods. In Springer Handbook of Speech Processing. Springer Press, November 2007.
[31]
Z. Rafii. REPET - codes, June 2015.
[32]
Z. Rafii and B. Pardo. Online REPET-SIM for real-time speech enhancement. In Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on, pages 848--852, May 2013.
[33]
Z. Rafii and B. Pardo. REpeating pattern extraction technique (REPET): A simple method for music/voice separation. IEEE Transactions on Audio, Speech & Language Processing, 21(1):71--82, 2013.
[34]
E. Vincent, S. Araki, and P. Bofill. The 2008 signal separation evaluation campaign: A community-based approach to large-scale evaluation. In Proceedings of the 8th International Conference on Independent Component Analysis and Signal Separation, ICA '09, pages 734--741, Berlin, Heidelberg, 2009. Springer-Verlag.
[35]
E. Vincent, R. Gribonval, and C. Fevotte. Performance measurement in blind audio source separation. Audio, Speech, and Language Processing, IEEE Transactions on, 14(4):1462--1469, July 2006.
[36]
VoIP Supply. Cisco hd voice - voip supply, 2015.
[37]
K. Whitehouse. Calling Matlab from Java, 2015.
[38]
K. Yaghmour. Embedded Android: Porting, Extending, and Customizing. Oreilly and Associate Series. O'Reilly Media, Incorporated, 2013.
[39]
H. Zaretsky, S. Flanagan, and A. Moroz. Medical Aspects of Disability, Fourth Edition: A Handbook for the Rehabilitation Professional. Medical Aspects of Disability: A Handbook for the Rehabilitation Professional. Springer Publishing Company, 2010.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
MOBILESoft '16: Proceedings of the International Conference on Mobile Software Engineering and Systems
May 2016
326 pages
ISBN:9781450341783
DOI:10.1145/2897073
© 2016 Association for Computing Machinery. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of a national government. As such, the Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 14 May 2016

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Research-article

Funding Sources

  • The financial project organization is directed by the VDI/VDE Berlin

Conference

ICSE '16
Sponsor:

Upcoming Conference

ICSE 2025

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 64
    Total Downloads
  • Downloads (Last 12 months)6
  • Downloads (Last 6 weeks)3
Reflects downloads up to 14 Jan 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media