skip to main content
10.1145/2837185.2837269acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiiwasConference Proceedingsconference-collections
short-paper

Clustering for closely similar recipes to extract spam recipes in user-generated recipe sites

Published: 11 December 2015 Publication History

Abstract

Nowadays, many user-generated recipe sites are accessible on the internet. On user-generated recipe sites, however, are various spam recipe pages that describe closely similar recipes requiring special cooking equipment, with no preparation explanations. These spam recipes are not useful for users. In fact, they impede user's recipe searches. In this paper, we target closely similar recipes as a first step in extracting spam recipes. If user search results could be classified to identify closely similar recipes, user's recipe searches would be easier and more productive. Clustering tools of many kinds are proposed, but it is difficult to cluster closely similar recipes using only existing clustering tools because recipe sites have a unique page structure comprising a title, ingredients, directions (preparation instructions), and comments. The importance of words from each part differs. We propose a clustering method for user-generated recipe sites based on page structure and important words. Next, we conducted an experiment to measure the benefits of our proposed method. The result of experiment presents the benefits of our proposed method which classify the closely similar recipes.

References

[1]
P. Forbes and M. Zhu. Content-boosted matrix factorization for recommender systems: experiments with recipe recommendation. In Proceedings of the fifth ACM conference on Recommender systems, pages 261--264. ACM, 2011.
[2]
G. Geleijnse, P. Nachtigall, P. van Kaam, and L. Wijgergangs. A personalized recipe advice system to promote healthful choices. In Proceedings of the 16th international conference on Intelligent user interfaces, pages 437--438. ACM, 2011.
[3]
S. Hanai and A. Nadamoto. Clustering for similar recipes by using cooking ingredient. In Proceedings of the IEICE conference on Data Engineering and Food Media, pages 47--52, 2014. (in Japanese).
[4]
S. Hanai, H. Nanba, and A. Nadamoto. Clustering for closely similar recipes to extract spam recipe. In IPSJ SIG Technical Reports, pages 1--7, 2014. (in Japanese).
[5]
M. Harvey, B. Ludwig, and D. Elsweiler. You are what you eat: Learning user tastes for rating prediction. In Proceedings of String Processing and Information Retrieval, pages 153--164. Springer, 2013.
[6]
K. Ikejiri, Y. Sei, H. Nakagawa, Y. Tahara, and A. Ohsuga. A proposal of calculation method of surprising value of recipe based on ingredient. Proceedings of IEICE technical report. Data engineering, 113(214):1--6, 2013. (in Japanese).
[7]
R. D. Lawrence, G. S. Almasi, V. Kotlyar, M. Viveros, and S. S. Duri. Personalization of supermarket product recommendations. the Journal of Data Min. Knowl. Discov., pages 11--32, 2001.
[8]
Maruha-Nichiro. Investigation of cooking recipes, 2013. http://www.maruha-nichiro.co.jp/news_center/research/pdf/20130227_recipe_cyousa.pdf [Online; accessed 11-August-2015](in Japanese).
[9]
Y. Shidochi, T. Takahashi, I. Ide, and H. Murase. Finding replaceable materials in cooking recipe texts considering characteristic cooking actions. In Proceedings of the ACM Multimedia 2009 Workshop on Multimedia for Cooking and Eating Activities, pages 9--14. ACM, 2009.
[10]
Y. Sugiyama, Y. Yamakata, and K. Tanaka. Summary of similar recipe for the recipe data as a procedure information and discovery of important differences. In Proceedings of DEIM Forum 2013 D3, volume 5, 2013. (in Japanese).
[11]
M. Svensson, K. Höök, J. Laaksolahti, and A. Waern. Social navigation of food recipes. In Proceedings of the SIGCHI conference on Human factors in computing systems, pages 341--348. ACM, 2001.
[12]
C.-Y. Teng, Y.-R. Lin, and L. A. Adamic. Recipe recommendation using ingredient networks. In Proceedings of the 4th Annual ACM Web Science Conference, pages 298--307. ACM, 2012.
[13]
Y. van Pinxteren, G. Geleijnse, and P. Kamsteeg. Deriving a recipe similarity measure for recommending healthful meals. In Proceedings of the 16th international conference on Intelligent user interfaces, pages 105--114. ACM, 2011.
[14]
J. Wagner, G. Geleijnse, and A. van Halteren. Guidance and support for healthy food preparation in an augmented kitchen. In Proceedings of the 2011 Workshop on Context-awareness in Retrieval and Recommendation, pages 47--50, 2011.
[15]
Y. Zhao and G. Karypis. Comparison of agglomerative and partitional document clustering algorithms. Technical Report 02--014, University of Minnesota, 2002.

Cited By

View all
  • (2024)A Random Walk-Based Approach for Clustering of Food ItemsCyber Intelligence and Information Retrieval10.1007/978-981-97-3594-5_32(385-395)Online publication date: 19-Jul-2024
  • (2020)Extraction Method for a Recipe's Uniqueness based on Recipe Frequency and LexRank of ProceduresProceedings of the 22nd International Conference on Information Integration and Web-based Applications & Services10.1145/3428757.3429128(241-245)Online publication date: 30-Nov-2020
  • (2020)Hierarchical Clustering of World Cuisines2020 IEEE 36th International Conference on Data Engineering Workshops (ICDEW)10.1109/ICDEW49219.2020.00007(98-104)Online publication date: Apr-2020
  • Show More Cited By

Index Terms

  1. Clustering for closely similar recipes to extract spam recipes in user-generated recipe sites

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Other conferences
      iiWAS '15: Proceedings of the 17th International Conference on Information Integration and Web-based Applications & Services
      December 2015
      704 pages
      ISBN:9781450334914
      DOI:10.1145/2837185
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 11 December 2015

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. closely similar recipes
      2. clustering
      3. user-generated recipe

      Qualifiers

      • Short-paper

      Funding Sources

      Conference

      iiWAS '15

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)6
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 23 Feb 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)A Random Walk-Based Approach for Clustering of Food ItemsCyber Intelligence and Information Retrieval10.1007/978-981-97-3594-5_32(385-395)Online publication date: 19-Jul-2024
      • (2020)Extraction Method for a Recipe's Uniqueness based on Recipe Frequency and LexRank of ProceduresProceedings of the 22nd International Conference on Information Integration and Web-based Applications & Services10.1145/3428757.3429128(241-245)Online publication date: 30-Nov-2020
      • (2020)Hierarchical Clustering of World Cuisines2020 IEEE 36th International Conference on Data Engineering Workshops (ICDEW)10.1109/ICDEW49219.2020.00007(98-104)Online publication date: Apr-2020
      • (2017)Emotion-based method for latent followee recommendation in TwitterProceedings of the 19th International Conference on Information Integration and Web-based Applications & Services10.1145/3151759.3151817(121-125)Online publication date: 4-Dec-2017
      • (2017)Searching cooking recipes by focusing on common ingredientsProceedings of the 19th International Conference on Information Integration and Web-based Applications & Services10.1145/3151759.3151797(95-101)Online publication date: 4-Dec-2017
      • (2017)Method for Detecting Near-duplicate Recipe Creators Based on Cooking Instructions and Food ImagesProceedings of the 9th Workshop on Multimedia for Cooking and Eating Activities in conjunction with The 2017 International Joint Conference on Artificial Intelligence10.1145/3106668.3106676(49-54)Online publication date: 20-Aug-2017
      • (2017)Extraction of Characteristic Sets of Ingredients and Cooking Actions on Cuisine Type2017 31st International Conference on Advanced Information Networking and Applications Workshops (WAINA)10.1109/WAINA.2017.81(509-513)Online publication date: Mar-2017
      • (2017)Detecting and presenting welcome-news for tourists from user reviewsInternational Journal of Web Information Systems10.1108/IJWIS-04-2017-003313:4(354-369)Online publication date: 6-Nov-2017
      • (2016)Extracting welcome news from travel reviewsProceedings of the 18th International Conference on Information Integration and Web-based Applications and Services10.1145/3011141.3011149(141-148)Online publication date: 28-Nov-2016
      • (2016)Food Search Based on User Feedback to Assist Image-based Food Recording SystemsProceedings of the 2nd International Workshop on Multimedia Assisted Dietary Management10.1145/2986035.2986037(71-75)Online publication date: 16-Oct-2016
      • Show More Cited By

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media