skip to main content
10.1145/2479832.2479857acmconferencesArticle/Chapter ViewAbstractPublication Pagesk-capConference Proceedingsconference-collections
research-article

Automatic extraction of advice-revealing sentences foradvice mining from online forums

Published: 23 June 2013 Publication History

Abstract

Web forums often contain explicit key learnings gleaned from people's experiences since they are platforms for personal communications on sharing information with others. One of the key learnings contained inWeb forums is often expressed in the form of advice. As part of human experience mining from Web resources, we aim to provide a methodology to extract advice-revealing sentences from Web forums due to its usefulness, especially in travel domain. Instead of viewing the problem as a simple classification, we define it as a sequence labeling problem using various features. We identify three different types of features (i.e., syntactic features, context features, and sentence informativeness) and propose a new way of using Hidden Markov Model (HMM) for labeling sequential sentences, which in our experiment gave the best performance for our task. Moreover, the sentence informativeness score serves as an important feature for this task. It is worth noting that this work is the first attempt to extract advice-revealing sentences from Web forums.

References

[1]
Y. Altun, I. Tsochantaridis, and T. Hofmann. Hidden markov support vector machines. In The Proceedings of ICML, 2003.
[2]
Travel Industry Association. Executive Summaries - Travelers Use of the Internet, 2004 Edition. Washington, DC: TIA, 2005.
[3]
R. Barzilay and L. Lee. Catching the drift: Probabilistic content models with applications to generation and summarization. In The Proceedings of HLT-NAACL, pages 113--120, 2004.
[4]
Y. W. Chen and C. J. Lin. Combining svms with various feature selection strategies. Feature Extraction, 207:315--324, 2006.
[5]
K. W. Church and W. A. Gale. Poisson mixtures. Natural Language Engineering, 1(2):163--190, 1995a.
[6]
K. W. Church and W. A. Gale. Inverse document frequency (idf): A measure of deviations from poisson. In The Proceedings of Third the Workshop on Very Large Corpora, pages 121--130, 1995b.
[7]
Compete, Inc. Embracing Consumer Buzz Creates Measurement Challenges for Marketers. December 2006.
[8]
A. P. Dempster, N. M. Laird, and D. B. Rubin. Maximum likelihood from incomplete data via the em algorithm. Journal of the Royal Statistical Society, Series B, 39(1):1--38, 1977.
[9]
S. Ding, G. Cong, C. Y. Lin, and X. Zhu. Using conditional random fields to extract contexts and answers of questions from online forums. In The Proceedings of 46th Annual Meeting of the ACL, pages 710--718, June 2008.
[10]
N. Glance, M. Hurst, K. Nigam, M. Siegler, R. Stockton, and T. Tomokiyo. Deriving marketing intelligence from online discussions. In The Proceedings of the KDD, pages 419--428, 2005.
[11]
S. Kozawa, M. Okamoto, S. Nagano, K. Cho, and S. Matsubara. Advice extraction from web for providing prior information concerning outdoor activities. In The 4th International Symposium on Intelligent Interactive Multimedia Systems and Services, pages 251--260, 2011.
[12]
J. Lafferty, A. McCallum, and F. Pereira. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In The Proceedings of ICML, 2001.
[13]
B. Liu. Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data. Springer-Verlag New York, Inc., Secaucus, NJ, USA, 2006.
[14]
M.-C. D. Marneffe, B. MacCartney, and C. D. Manning. Generating typed dependency parses from phrase structure parses. LREC, 2006.
[15]
N. Nguyen and Y. Guo. Comparisons of sequence labeling algorithms and extensions. In Proceedings of the 24th international conference on Machine learning, ICML '07, pages 681--688, New York, NY, USA, 2007. ACM.
[16]
K. C. Park, Y. J. Jeong, and S. H. Myaeng. Detecting experiences from weblogs. In The Proceedings of the 48th Annual Meeting of the ACL, pages 1464--1472, 2010.
[17]
Z. Qu and Y. Liu. Finding problem solving threads in online forums. In The Proceedings of the 5th International Joint Conference on Natural Language Processing, pages 1413--1417, 2011.
[18]
Z. Qu and Y. Liu. Sentence dependency tagging in online question answering forums. In The Proceedings of the 50th Annual Meeting of the ACL, pages 554--562, 2012.
[19]
N. A. Smith, D. L. Vail, and J. D. Lafferty. Computationally efficient m-estimation of log-linear structure models. In ACL, 2007.
[20]
K. Sparck-Jones. Index term weighting. Information Storage and Retrieval, 9:619--633, 1973.
[21]
T. L. M. van Kasteren, G. Englebienne, and B. J. A. Krose. Activity recognition using semi-markov models on real world smart home datasets. J. Ambient Intell. Smart Environ., 2(3):311--325, Aug. 2010.
[22]
M. Van-Setten, S. Pokraev, and J. Koolwaaij. Context-aware recommendations in the mobile tourist application compass. In The Proceedings of the International Conference on Adaptive Hypermedia and Adaptive Web-based Systems, pages 235--244, 2004.
[23]
Alfan F. Wicaksono and S. H. Myaeng. Mining advices from weblogs. In The Proceedings of 21st ACM CIKM, pages 2347--2350, 2012.
[24]
W. Y. Yang, Y. Cao, and C. Y. Lin. A structural support vector method for extracting contexts and answers of questions from online forums. In The Proceedings of the 2009 Conference on EMNLP, volume 2, pages 514--523, 2009.

Cited By

View all
  • (2024)Understanding Online Parental Help-Seeking and Help-Giving in Early Childhood: The Design Challenges of Supporting Complex Parenting QuestionsProceedings of the ACM on Human-Computer Interaction10.1145/36536908:CSCW1(1-36)Online publication date: 26-Apr-2024
  • (2022)Explainable System for Suggestion Mining using attention2022 8th International Conference on Advanced Computing and Communication Systems (ICACCS)10.1109/ICACCS54159.2022.9785102(679-684)Online publication date: 25-Mar-2022
  • (2021)Beyond the Polarities: Sentiment Analysis of French Restaurant Reviews Using BERT-based Models2021 8th International Conference on Behavioral and Social Computing (BESC)10.1109/BESC53957.2021.9635309(1-8)Online publication date: 29-Oct-2021
  • Show More Cited By

Index Terms

  1. Automatic extraction of advice-revealing sentences foradvice mining from online forums

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    K-CAP '13: Proceedings of the seventh international conference on Knowledge capture
    June 2013
    160 pages
    ISBN:9781450321020
    DOI:10.1145/2479832
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 23 June 2013

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. advice mining
    2. extension of hmm
    3. sequence labeling

    Qualifiers

    • Research-article

    Conference

    K-CAP 2013
    Sponsor:
    K-CAP 2013: Knowledge Capture Conference
    June 23 - 26, 2013
    Banff, Canada

    Acceptance Rates

    K-CAP '13 Paper Acceptance Rate 13 of 60 submissions, 22%;
    Overall Acceptance Rate 55 of 198 submissions, 28%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)6
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 15 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Understanding Online Parental Help-Seeking and Help-Giving in Early Childhood: The Design Challenges of Supporting Complex Parenting QuestionsProceedings of the ACM on Human-Computer Interaction10.1145/36536908:CSCW1(1-36)Online publication date: 26-Apr-2024
    • (2022)Explainable System for Suggestion Mining using attention2022 8th International Conference on Advanced Computing and Communication Systems (ICACCS)10.1109/ICACCS54159.2022.9785102(679-684)Online publication date: 25-Mar-2022
    • (2021)Beyond the Polarities: Sentiment Analysis of French Restaurant Reviews Using BERT-based Models2021 8th International Conference on Behavioral and Social Computing (BESC)10.1109/BESC53957.2021.9635309(1-8)Online publication date: 29-Oct-2021
    • (2020)Knowledge components detection in User-Generated Content2020 International Conference on Intelligent Systems and Computer Vision (ISCV)10.1109/ISCV49265.2020.9204188(1-6)Online publication date: Jun-2020
    • (2020)Open Domain Suggestion Mining Leveraging Fine-Grained Analysis (Workshop Paper)2020 IEEE Sixth International Conference on Multimedia Big Data (BigMM)10.1109/BigMM50055.2020.00069(414-423)Online publication date: Sep-2020
    • (2017)On the Identification of Suggestion Intents from Vietnamese Conversational TextsProceedings of the 8th International Symposium on Information and Communication Technology10.1145/3155133.3155201(417-424)Online publication date: 7-Dec-2017
    • (2015)Automatically Mining Negative Code Examples from Software Developer Q & A ForumsProceedings of the 2015 30th IEEE/ACM International Conference on Automated Software Engineering Workshop (ASEW)10.1109/ASEW.2015.10(115-122)Online publication date: 9-Nov-2015

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media