LinL:Lost in n-best List

Meng, Peng; Shi, Yun-Qing; Huang, Liusheng; Chen, Zhili; Yang, Wei; Desoky, Abdelrahman

doi:10.1007/978-3-642-24178-9_23

Peng Meng^20,21,22,
Yun-Qing Shi²¹,
Liusheng Huang^20,22,
Zhili Chen^20,22,
Wei Yang^20,22 &
…
Abdelrahman Desoky²³

Part of the book series: Lecture Notes in Computer Science ((LNSC,volume 6958))

Included in the following conference series:

International Workshop on Information Hiding

2416 Accesses

Abstract

Translation-based steganography (TBS) is a new kind of text steganographic scheme. However, contemporary TBS methods are vulnerable to statistical attacks. Differently, this paper presents a novel TBS, namely Lost in n-best List, abbreviated as LinL, that is resilient against the current statistical attacks. LinL employs only one Statistical Machine Translator (SMT) in the encoding process which selects one of the n-best list of each cover text sentence in order to camouflage messages in stegotext. The presented theoretical analysis demonstrates that there is a classification accuracy upper bound between normal translated text and the stegotext. When the text size is 1000 sentences, the theoretical maximum classification accuracy is about 60%. The experiment results also show current steganalysis methods cannot detect LinL.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

TStego-THU: Large-Scale Text Steganalysis Dataset

An English Sentence Dictionary Based Secure Text Steganographic Technique for Message-Data Confidentiality

Information Hiding Based on Typing Errors

References

Grothoff, C., Grothoff, K., Alkhutova, L., Stutsman, R., Atallah, M.: Translation-based steganography. In: Barni, M., Herrera-Joancomartí, J., Katzenbeisser, S., Pérez-González, F. (eds.) IH 2005. LNCS, vol. 3727, pp. 219–233. Springer, Heidelberg (2005)
Chapter Google Scholar
Stutsman, R., Atallah, M., Grothoff, K.: Lost in just the translation. In: Proceedings of the 2006 ACM symposium on Applied computing, ACM New York, NY, USA (2006) 338–345
Chapter Google Scholar
Grothoff, C., Grothoff, K., Stutsman, R., Alkhutova, L., Atallah, M.: Translation-based steganography. Journal of Computer Security 17(3), 269–303 (2009)
Google Scholar
Meng, P., Hang, L., Chen, Z., Hu, Y., Yang, W.: STBS: A statistical algorithm for steganalysis of translation-based steganography. In: Böhme, R., Fong, P.W.L., Safavi-Naini, R. (eds.) IH 2010. LNCS, vol. 6387, pp. 208–220. Springer, Heidelberg (2010)
Chapter Google Scholar
Chen, Z., Huang, L., Meng, P., Yang, W., Miao, H.: Blind linguistic steganalysis against translation based steganography. In: Kim, H.-J., Shi, Y.Q., Barni, M. (eds.) IWDW 2010. LNCS, vol. 6526, pp. 251–265. Springer, Heidelberg (2011)
Chapter Google Scholar
Google: Google translator (2009), http://translate.google.cn
Systran: Systran translator (2009), https://www.systransoft.com
Linguatec: Linguatec translation, http://www.linguatec.de
Chen, B., Zhang, M., Aw, A., Li, H.: Exploiting n-best hypotheses for smt self-enhancement. In: Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers, Association for Computational Linguistics, pp. 157–160 (2008)
Google Scholar
Koehn, P., Hoang, H., Birch, A., Callison-Burch, C., Federico, M., Bertoldi, N., Cowan, B., Shen, W., Moran, C., Zens, R., et al.: Moses: Open source toolkit for statistical machine translation. In: Proceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions, Association for Computational Linguistics (2007)
Google Scholar
Bennett, K.: Linguistic steganography: Survey, analysis, and robustness concerns for hiding information in text. Purdue University, CERIAS Tech. Report (2004)
Google Scholar
Maker, K.: TEXTO, ftp://ftp.funet.fi/pub/crypt/steganography/texto.tar.gz
Wayner, P.: Disappearing cryptography: information hiding: steganography and watermarking. Morgan Kaufmann Pub., San Francisco (2008)
Google Scholar
Chapman, M., Davida, D.: Hiding the hidden: A software system for concealing ciphertext as innocuous text. Lecture Note. In: Computer Science, 335–345 (1997)
Google Scholar
Chang, C., Clark, S.: Linguistic steganography using automatically generated paraphrases. In: Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Association for Computational Linguistics (2010)
Google Scholar
Liu, T.Y., Tsai, W.H.: A new steganographic method for data hiding in microsoft word documents by a change tracking technique. IEEE Transactions on Information Forensics and Security 2(1), 24–30 (2007)
Article Google Scholar
Desoky, A.: Nostega: a novel noiseless steganography paradigm. Journal of Digital Forensic Practice 2(3), 132–139 (2008)
Article Google Scholar
Desoky, A.: Listega: list-based steganography methodology. International Journal of Information Security 8(4), 247–261 (2009)
Article Google Scholar
Desoky, A.: NORMALS: normal linguistic steganography methodology. Journal of Information Hiding and Multimedia Signal Processing 1(3), 145–171 (2010)
Google Scholar
Desoky, A.: Matlist: mature linguistic steganography methodology. Security and Communication Networks
Google Scholar
Taskiran, C., Topkara, U., Topkara, M., Delp, E.: Attacks on lexical natural language steganography systems. In: Proceedings of SPIE, vol. 6072, pp. 97–105 (2006)
Google Scholar
Zhili, C., Liusheng, H., Zhenshan, Y., Wei, Y., Lingjun, L., Xueling, Z., Xinxin, Z.: Linguistic steganography detection using statistical characteristics of correlations between words. In: Solanki, K., Sullivan, K., Madhow, U. (eds.) IH 2008. LNCS, vol. 5284, pp. 224–235. Springer, Heidelberg (2008)
Chapter Google Scholar
Zhili, C., Liusheng, H., Zhenshan, Y., Lingjun, L., Wei, Y.: A statistical algorithm for linguistic steganography detection based on distribution of words. In: Third International Conference on Availability, Reliability and Security, ARES 2008, pp. 558–563 (2008)
Google Scholar
Zhili, C., Liusheng, H., Zhenshan, Y., Xinxin, Z.: Effective linguistic steganography detection. In: IEEE 8th International Conference on Computer and Information Technology Workshops, CIT Workshops 2008, pp. 224–229 (2008)
Google Scholar
Meng, P., Hang, L., Yang, W., Chen, Z.: Attacks on translation based steganography. In: IEEE Youth Conference on Information, Computing and Telecommunication, YC-ICT 2009, pp. 227–230. IEEE, Los Alamitos (2010)
Google Scholar
Meng, P., Hang, L., Chen, Z., Yang, W., Yang, M.: Analysis and detection of translation-based steganography. Chinese Journal of Electronics 38(8), 1748–1752 (2010)
Google Scholar
Koehn, P.: MOSES, Statistical Machine Translation System, User Manual and Code Guide (2010)
Google Scholar
Anderson, T., Bahadur, R.: Classification into two multivariate normal distributions with different covariance matrices. The Annals of Mathematical Statistics 33(2), 420–431 (1962)
Article MathSciNet MATH Google Scholar
WMT08: Wmt08 news commentary (2008), http://www.statmt.org/wmt08/training-parallel.tar
Fridrich, J., Goljan, M., Hogea, D.: Steganalysis of JPEG images: Breaking the F5 algorithm. In: Petitcolas, F.A.P. (ed.) IH 2002. LNCS, vol. 2578, pp. 310–323. Springer, Heidelberg (2003)
Chapter Google Scholar
Budhia, U., Kundur, D., Zourntos, T.: Digital video steganalysis exploiting statistical visibility in the temporal domain. IEEE Transactions on Information Forensics and Security 1(4), 502–516 (2006)
Article Google Scholar

Download references

Author information

Authors and Affiliations

NHPCC, Depart. of CS. & Tech., USTC, Hefei, 230027, China
Peng Meng, Liusheng Huang, Zhili Chen & Wei Yang
New Jersey Institute of Technology, Newark, New Jersey, 07102, USA
Peng Meng & Yun-Qing Shi
Suzhou Institute for Advanced Study, USTC, Suzhou, 215123, China
Peng Meng, Liusheng Huang, Zhili Chen & Wei Yang
CSEE, University of Maryland, Baltimore County, MD, 21250, USA
Abdelrahman Desoky

Authors

Peng Meng
View author publications
You can also search for this author in PubMed Google Scholar
Yun-Qing Shi
View author publications
You can also search for this author in PubMed Google Scholar
Liusheng Huang
View author publications
You can also search for this author in PubMed Google Scholar
Zhili Chen
View author publications
You can also search for this author in PubMed Google Scholar
Wei Yang
View author publications
You can also search for this author in PubMed Google Scholar
Abdelrahman Desoky
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Digimarc Corporation, 9405 Gemini Drive, 97008, Beaverton, OR, USA
Tomáš Filler
Faculty of Electrical Engineering, Department of Cybernetics, Czech Technical University, Karlovo namesti 13, 121 35, Prague 2, Czech Republic
Tomáš Pevný
Department of Electrical and Computer Engineering, T. J. Watson School, SUNY Binghamton, 13902, Binghamton, NY, USA
Scott Craver
Department of Computer Science, University of Oxford, Wolfson Building, Parks Road, OX1 3QD, Oxford, UK
Andrew Ker

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Meng, P., Shi, YQ., Huang, L., Chen, Z., Yang, W., Desoky, A. (2011). LinL:Lost in n-best List. In: Filler, T., Pevný, T., Craver, S., Ker, A. (eds) Information Hiding. IH 2011. Lecture Notes in Computer Science, vol 6958. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24178-9_23

Download citation

DOI: https://doi.org/10.1007/978-3-642-24178-9_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-24177-2
Online ISBN: 978-3-642-24178-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics