ABSTRACT
We identify a potentially severe security problem in the use of Unicode on the Web, which arises because the Universal Character Set (UCS) contains many similar characters. Our solution is founded on evaluating the similarity of characters in the UCS. We develop a method based on the well-known Kernel Density Estimation (KDE) technique to establish a Unicode Similarity List (UC-SimList).
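To make the idea of KDE-based character similarity concrete, the following is a minimal sketch and not the authors' actual procedure. It assumes each character is available as a small binary bitmap (the 5x5 glyphs below are hand-drawn and hypothetical), estimates a Gaussian kernel density over the inked pixels, and compares two densities with the Bhattacharyya coefficient. The function names, the bandwidth value, and the choice of comparison measure are all illustrative assumptions.

```python
import math

# Hypothetical hand-drawn 5x5 bitmaps; a real system would render actual fonts.
GLYPH_O = [".###.", "#...#", "#...#", "#...#", ".###."]
GLYPH_ZERO = [".###.", "#..##", "#.#.#", "##..#", ".###."]
GLYPH_L = ["#....", "#....", "#....", "#....", "#####"]


def glyph_points(rows):
    """Return the (x, y) coordinates of the inked pixels ('#') in a bitmap."""
    return [(x, y)
            for y, row in enumerate(rows)
            for x, ch in enumerate(row)
            if ch == "#"]


def kde_on_grid(points, width, height, bandwidth=1.0):
    """Gaussian KDE of the inked pixels, sampled at every grid cell.

    The result is normalised so that the sampled values sum to 1.
    """
    density = []
    for gy in range(height):
        for gx in range(width):
            total = sum(
                math.exp(-((gx - px) ** 2 + (gy - py) ** 2)
                         / (2 * bandwidth ** 2))
                for px, py in points
            )
            density.append(total)
    norm = sum(density)
    return [d / norm for d in density]


def similarity(rows_a, rows_b, bandwidth=1.0):
    """Bhattacharyya coefficient of two glyph densities (1.0 means identical)."""
    height = max(len(rows_a), len(rows_b))
    width = max(len(row) for row in rows_a + rows_b)
    p = kde_on_grid(glyph_points(rows_a), width, height, bandwidth)
    q = kde_on_grid(glyph_points(rows_b), width, height, bandwidth)
    return sum(math.sqrt(a * b) for a, b in zip(p, q))


if __name__ == "__main__":
    print("O vs 0:", similarity(GLYPH_O, GLYPH_ZERO))
    print("O vs L:", similarity(GLYPH_O, GLYPH_L))
```

Under these assumptions, a similarity list for one character could be built by scoring it against every candidate character and keeping those whose score exceeds a chosen threshold. Smoothing the pixels with a kernel makes the comparison tolerant of small shifts and stroke differences that an exact pixel match would penalize.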
Index Terms
- Safeguard against unicode attacks: generation and applications of UC-simlist