Abstract
This is the second year of Kasetsart University’s participation in INEX. We participated in two tracks: Snippet Retrieval and Data Centric. This year, we introduced an XML information retrieval system built on MySQL and Sphinx, which we call More Efficient XML Information Retrieval (MEXIR). In our system, XML documents are stored in a single table with a fixed relational schema that is independent of the logical structure of the XML documents. Furthermore, we present a structure weighting function that optimizes the performance of MEXIR.
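The single-table idea can be illustrated with a minimal sketch. It is not the MEXIR schema: the table name `element`, its columns, and the helpers `shred` and `build_store` are hypothetical, SQLite from the Python standard library stands in for MySQL, and Sphinx indexing is left out. The sketch only shows how documents with arbitrary element structure can be flattened into rows of one fixed relation, keyed by a positional path.

```python
import sqlite3
import xml.etree.ElementTree as ET


def shred(doc_id, xml_text):
    """Yield (doc_id, path, content) rows, one per element that contains text."""
    root = ET.fromstring(xml_text)

    def emit(elem, path):
        # Collect all text below this element, normalising whitespace.
        parts = [t.strip() for t in elem.itertext() if t.strip()]
        if parts:
            yield (doc_id, path, " ".join(parts))
        # Count same-tag siblings to build positional paths like /a[1]/b[2].
        counts = {}
        for child in elem:
            counts[child.tag] = counts.get(child.tag, 0) + 1
            yield from emit(child, f"{path}/{child.tag}[{counts[child.tag]}]")

    yield from emit(root, f"/{root.tag}[1]")


def build_store(docs):
    """Load every document into one table whose schema never changes."""
    conn = sqlite3.connect(":memory:")  # assumption: SQLite instead of MySQL
    conn.execute(
        "CREATE TABLE element (doc_id TEXT, path TEXT, content TEXT)"
    )
    for doc_id, xml_text in docs.items():
        conn.executemany(
            "INSERT INTO element VALUES (?, ?, ?)", shred(doc_id, xml_text)
        )
    return conn


if __name__ == "__main__":
    docs = {
        "d1": "<article><title>XML retrieval</title>"
              "<sec><p>Sphinx indexes text.</p><p>MySQL stores rows.</p></sec>"
              "</article>",
    }
    conn = build_store(docs)
    for row in conn.execute("SELECT path, content FROM element ORDER BY path"):
        print(row)
```

Because the element path is stored as data rather than encoded in the table layout, a document with entirely different tags loads into the same table without any schema change.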
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wichaiwong, T., Jaruskulchai, C. (2012). MEXIR at INEX-2011. In: Geva, S., Kamps, J., Schenkel, R. (eds) Focused Retrieval of Content and Structure. INEX 2011. Lecture Notes in Computer Science, vol 7424. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35734-3_16
DOI: https://doi.org/10.1007/978-3-642-35734-3_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35733-6
Online ISBN: 978-3-642-35734-3
eBook Packages: Computer Science, Computer Science (R0)