Abstract
We propose a novel method to mine popular menu items from online reviews. In order to extract popular menu items, a crawler that uses the wrapper on search web sites was used to collect online reviews, restaurant names, and menu items. Then, unnecessary posts were removed by using the patterns. Also, post frequency was used to find the most frequently appearing menu items from online reviews in order to select the most popular menu items. In the result, the total average accuracy was 0.900.
This work was supported by the Korea Research Foundation(KRF) grant funded by the Korea government(MEST) (No. 2011-0002899).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Chang, C.H., Kayed, M., Girgis, M.R., Shaalan, K.: A Survey of Web Information Extraction Systems. IEEE Transaction on Knowledge and Data Engineering 18(10), 1411–1428 (2006)
Bertoli, C., Crescenzi, Y., Merialdo, P.: Crawling programs for wrapper-based applications. In: IEEE IRI 2008, pp. 160–165 (2008)
Yang, J., Kim, T., Choi, J.: An Interface Agent for Wrapper-Based Information Extraction. In: Barley, M.W., Kasabov, N. (eds.) PRIMA 2004. LNCS (LNAI), vol. 3371, pp. 291–302. Springer, Heidelberg (2005)
Soderland, S., Cardie, C., Mooney, R.: Learning information extraction rules for semi-structured and free text. Machine Learning (1999)
Chakrabarti, S., Berg, M., Dom, B.: Focused Crawling: A New Approach to Topic-Specific Web Resource Discovery. Computer Networks 31(11-16), 1623–1640 (1999)
Chakrabarti, S.: Mining the Web, Discovering Knowledge from Hypertext Data. Morgan Kaufmann, San Francisco (2003)
Cho, J., Garcia, H., Page, L.: Efficient Crawling through URL Ordering. Computer Networks 30(1-7), 161–172 (1998)
Nadeau, D., Sekine, S.: A Survey of Named Entity Recognition and Classification. Lingvisticae Investigationes 30(1), 3–26 (2007)
Alfonseca, E., Manandhar, S.: An Unsupervised Method for General Named Entity Recognition and Automated Concept Discovery. In: International Conference on General WordNet, pp. 1–9 (2002)
Satoshi, S., Nobata, C.: Definition, Dictionaries and Tagger for Extended Named Entity Hierarchy. In: Conference on Language Resources and Evaluation, pp. 1977–1988 (2004)
Liu, B.: Web Data Mining. Springer, Heidelberg
Tuerny, P.: Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews. In: Meeting of the Association for Computational Linguistics, pp. 417–424 (2002)
Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up? Sentiment Classification using Machine Learning Techniques. In: Conference on Empirical Methods in Natural Language Processing, pp. 79–86 (2002)
Hu, M., Liu, B.: Mining and Summarizing Customer Reviews. In: ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 168–177 (2004)
Hu, M., Liu, B.: Mining Opinion Features in Customer Reviews. In: 19th National Conference on Artificial Intelligence(AAAI 2004), pp. 755–760 (2004)
Jindal, N., Liu, B.: Mining Comparative Sentences and Relations. In: AAAI 2006 (2006)
Wing Spoon, http://www.wingspoon.cokr
Menupandotcom, http://www.menupan.com
Local Story, http://www.localstory.kr
Food N Cafe, http://www.foodncafe.com
Daum Place, http://place.daum.net
Gang, S.: Analysis of Korean Morphemes and Information Retrieval. Hungrung Publish (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Gu, Y.H., Yoo, S.J. (2011). Mining Popular Menu Items of a Restaurant from Web Reviews. In: Gong, Z., Luo, X., Chen, J., Lei, J., Wang, F.L. (eds) Web Information Systems and Mining. WISM 2011. Lecture Notes in Computer Science, vol 6988. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23982-3_31
Download citation
DOI: https://doi.org/10.1007/978-3-642-23982-3_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23981-6
Online ISBN: 978-3-642-23982-3
eBook Packages: Computer ScienceComputer Science (R0)