Abstract
From 1991 to 2005, China’s High Technology Research and Development Program (HTRDP) sponsored a series of technology evaluations on Chinese information processing and intelligent human-machine interface, which is called HTRDP evaluations, or “863” evaluations in brief. This paper introduces the HTRDP evaluations in detail. The general information of the HTRDP evaluation is presented first, including the history, the concerned technology categories, the organizer, the participants, and the procedure, etc. Then the evaluations on each technology are described in detail respectively, covering Chinese word segmentation, machine translation, acoustic speech recognition, text to speech, text summarization, text categorization, information retrieval, character recognition, and face detection and recognition. For the evaluations on each technology categories, the history, the evaluation tasks, the data, the evaluation method, etc., are given. The last section concludes the paper and discusses possible future work.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Sproat R, Emerson T. The First International Chinese Word Segmentation Bakeoff. In: Proceedings of the Second SIGHAN Workshop on Chinese Language Processing. 2003, 133–143
Levow G-A, The Third International Chinese Language Processing Bakeoff: Word Segmentation and Named Entity Recognition. In: Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing. 2006, 108–117
Emerson T. The Second International Chinese Word Segmentation Bakeoff. In: Proceedings of the Fourth SIGHAN Workshop on Chinese Language Processing, 2005. 123–133
ACE evaluations: http://www.nist.gov/speech/tests/ace/index.htm
YU S W. Automatic Evaluation of Output Quality for Machine Translation Systems, Machine Translation. Netherlands: Kluwer Academic publisher, 1993, 8: 117–126
Papineni K A, Roukos S, Ward T, et al. Bleu: a method for automatic evaluation of machine translation. Technical Report RC22176 (W0109-022). IBM Research Division, Thomas J. Watson Research Center. 2001
Zhang H P, Yu H K, Xiong D Y, et al. HHMM-based Chinese Lexical Analyzer ICTCLAS, In: Proceedings of 2nd SigHan Workshop, 2003. 184–187
Och F J. Minimum error rate training in statistical machine translation. In: Proceedings of the 41st ACL, Sapporo, Japan, 2003. 160–167
Och F J. Statistical Machine Translation: From Single-Word Models to Alignment Templates. 38–39
Yasuhiro A, et al. Overview of the IWSLT04 Evaluation Campaign. 2004
Paul M, Nakaiwa H, Federico M. Towards innovative evaluation methodologies for speech translation. Working Notes of the NTCIR-4 2004 Meeting. 2004, 2(Suppl). 17–21
White J S, O’Connell T, O’Mara F. The ARPA MT evaluation methodologies: evolution, lessons, and future approaches. In: Proceedings of the AMTA, 1994, 193–205
Papineni K, Roukos S, Ward T. BLEU: a method for automatic evaluation of machine translation. In Proceedings of the 40th ACL, Philadelphia, USA, 2002. 311–318
Doddington G. Automatic evaluation of machine translation quality using n-gram co-occurrence statistics. In: Proceedings of the HLT 2002, San Diego, USA, 2002. 257–258
Turian J P, Shen L, Melamed I D. Evaluation of machine translation and its evaluation. In Proceedings of the MT Summit IX, New Orleans, USA, 2003. 386–393
Liu Q, Liu Y. Machine translation automatic evaluating method and system thereof, Chinese Patent, CN1641631-A
Martin A, Doddington TKG, Ordowski M, et al. The DET curve in assessment of detection task performance. In: Proceedings of EuroSpeech’97. 1997, Vol.4, 1895–1898
Van Santen J P H, Pols L C W, Abe M, et al. Report on the Third ESCA TTS Workshop Evaluation Procedure. Third ESCA TTS Workshop, 1998.
Benoit C, Grice M, Hazan V. The SUS test: a method for the assessment of text-to-speech synthesis intelligibility using semantically unpredictable sentences. Speech Commun. 1996, 18:381–392
Document Understanding Conference, http://duc.nist.gov/
He J, Tan A-H, Tan C-L. A Comparative Study on Chinese Text Categorization Methods. PRICAI 2000 Workshop on Text and Web Mining, 2000.
Lewis D D, Yang Y M, Rose T G, et al. RCV1 A New Benchmark Collection for Text Categorization Research. Journal of Machine Learning Research. 2004, 5: 361–397
Sebastiani F. Machine Learning in Automated Text Categorization. ACM Computing Surveys, 2002
Aas K, Eikvil L. Text Categorisation: A survey. Raport NR 941, 1999
Yang Y M. An evaluation of statistical approaches to text categorization. Information Retrieval, 1999, 1(1–2)
The Text Retrieval Conference, http://trec.nist.gov/
Harman D. The first text retrieval conference (TREC-1). Information Processing and Management, 1993, 29(4): 411–414
Voorhees E M. Overview of TREC 2005. In: Proceedings of the Text REtrieval Conference (TREC). Gaithersburg, Maryland, 2005
NII Test Collection for IR Systems, http://research.nii.ac.jp/ntcir/
Cross Language Evaluation Forum, http://www.clef-campaign.org/
Zhang J L, et al. Research on the 863 Chinese Information Retrieval Evaluation. Journal of Chinese Information Processing, 2006, 20(suppl): 19–24(in Chinese)
Shah C, Croft W B. Evaluating High Accuracy Retrieval Techniques. In: Proceedings of SIGIR’ 04, 2004
Text Retrieval Conference, http://trec.nist.gov.
Chinese Web Information Retrieval Forum, http://www.cwirf.org/.
Cheng Y X, et al. 863 Web Track Experiments at ICST-PKU. Journal of Chinese Information Processing, 2006, 20(suppl): 102–106(in Chinese)
Zhao L, et al. 2005 THUIR Report for 863 Information Retrieval Evaluation. Journal of Chinese Information Processing, 2006, (suppl): 91–95 (in Chinese)
Zhang Z C, et al. Technology Report of HIT-IRLab for Evaluation 2005 of 863 Information Retrieval. Journal of Chinese Information Processing, 2006, 20(suppl): 83–90 (in Chinese)
Xu W R, et al. PRIS Information Retrieval System Report. Journal of Chinese Information Processing, 2006, 20(suppl): 96–101(in Chinese)
Lv B B, et al. 863 Information Retrieval Evaluation-Institute of Automation. Journal of Chinese Information Processing, 2006, 20(suppl): 78–82 (in Chinese)
Kanungo T, Marton G A, Bulbul O. Performance Evaluation of Two Arabic OCR Products. In: Proceedings of AIPR Workshop on Advances in Computer. Assist Recognition, SPIE. Vol 3584
Liu C L, Jaeger S, Nakagawa M. Online Recognition of Chinese Characters: The State-of-the-Art. IEEE Transaction on Patten Analysis and Machine Intelligence, 2004, 26(2): 198–213
Marti U-V, Bunke H. A full English sentence database for off-line handwriting recognition. In: Proceedings of the 5th Int. Conf. on Document Analysis and Recognition, Bangalore, India, 1999. 705–708
Liu C P, Qian Y L, et al. 863 Testing System on Handwritten Chinese Character Recognition. Journal of Chinese Information Processing, 2000, 14(2): 2–7 (in Chinese)
Guo J, Lin Z Q, Zhang H G. A New Database Model of Off-line Handwritten Chinese Characters and Its Applications. Chinese Journal of Electronics, 2000, 28(5): 115–116 (in chinese)
Requirements and test procedure of on-line handwriting Chinese ideogram recognition. Chinese National Standard GB/T 18790-2002. July, 2002
Chinese Ideograms Coded Character Set for Information Interchange-Basic Set. Chinese National Standard GB 2312-1980, 1980
Information technology-Chinese ideograms coded character set for information interchange-Extension for the basic set. Chinese National Standard GB 18030-2000. March, 2000
Chellappa R, et al. Human and Machine Recognition of Faces: A Survey. In: Proceedings of the IEEE, 1995, 83(5): 705–741
Zhao W Y, Chellappa R, Rosenfeld A, et al. Face Recognition: A Literature Survey. ACM Computing Survey, 2003, 35(4): 399–458
Phillips P J, Moon H, Rizvi S, et al. The FERET Evaluation Methodology for Face-Recognition Algorithms. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2000, 22(10): 1090–1104
Phillips P J, Grother P J, et al. Face Recognition Vendor Test 2002: Evaluation Report, Technical Report. NISTIR 6965, National Institute of Standards and Technology, 2003
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Liu, Q., Wang, X., Liu, H. et al. HTRDP evaluations on Chinese information processing and intelligent human-machine interface. Front. Comput. Sc. China 1, 58–93 (2007). https://doi.org/10.1007/s11704-007-0007-2
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.1007/s11704-007-0007-2