Skip to main content

Informing the Curious Negotiator: Automatic News Extraction from the Internet

  • Chapter
Data Mining

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3755))

Abstract

Information acquisition and validation play an important role in the decision making process during negotiation. In this chapter we briefly present the framework of a smart data mining system for providing contextual information extracted from the Internet to a negotiation agent. We then present one of its components in more details – an effective automated technique for extracting relevant articles from news web sites, so that they can be used further by the mining agents. Most current techniques experience difficulties in coping with changes in web site structure and formats. The proposed extraction process is completely automatic and independent of web site formats. Proposed technique identifies regularities in both format and content of news web sites. The algorithms are applicable to both single- and multi-document web sites. Since invalid URLs can cause errors in data extraction, we also present a method for the negotiation agent to estimate the validity of the extracted data based on the frequency of the relevant words in the news title. Once the news articles are extracted the next task is to construct sets of given articles. This chapter presents a new procedure for constructing news data sets on given topics. The extracted news data set is further utilised by the parties involved in negotiation. The information retrieved from the data set can support both human and automated negotiators.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Simoff, S.J., Debenham, J.K.: Curious negotiator. In: Klusch, M., Ossowski, S., Shehory, O. (eds.) CIA 2002. LNCS (LNAI), vol. 2446, p. 104. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  2. Gerding, E.H., van Bragt, D.D.B., et al.: Multi-issue negotiation processes by evolutionary simulation: validation and social extensions. In: Proceedings Workshop on Complex Behavior in Economics. Aix-en-Provence, France (2000)

    Google Scholar 

  3. Gomes, A., Jehiel, P.: Dynamic process of social and economic interactions: On the persistence of inefficiencies. Journal of Political Economy 113(3), 626–667 (2005)

    Article  Google Scholar 

  4. Kraus, S.: Strategic Negotiation in Multiagent Environments. MIT Press, Cambridge (2001)

    MATH  Google Scholar 

  5. Ströbel, M.: Design of Roles and Protocols for Electronic Negotiations. Electronic Commerce Research Journal, Special Issue on Market Design (2001)

    Google Scholar 

  6. Milgrom, P., Weber, R.A.: Theory of Auctions with Competitive Bidding. Econometrica 50(5), 1089–1122 (1982)

    Article  MATH  Google Scholar 

  7. Watkins, M.: Breakthrough Business Negotiation-A Toolbox for Managers. Jossey-Bass (2002)

    Google Scholar 

  8. Hand, D., Mannila, H., et al.: Principles of Data Mining. MIT Press, Cambridge (2001)

    Google Scholar 

  9. Franz, M., Ittycheriah, A., et al.: First Story Detection: Combining Similarity and Novelty Based Approaches. In: Topic Detection and Tracking Workshop Report (2001)

    Google Scholar 

  10. Simoff, S.J., Debenham, J.K.: Time-constrained support for decision-making in e-market environments. In: Proceedings of the 6th International Conference of The International Society for Decision Support Systems ISDSS 2001, London, UK, pp. 193–206 (2001)

    Google Scholar 

  11. Chidlovskii, B., Ragetli, J., et al.: Automatic wrapper generation for web search engines. In: Lu, H., Zhou, A. (eds.) WAIM 2000. LNCS, vol. 1846, pp. 399–410. Springer, Heidelberg (2000)

    Chapter  Google Scholar 

  12. Freitag, D., Kushmerick, N.: Boosted wrapper induction. In: Proceedings of the 17th National Conference on Artificial Intelligence, AAAI 2000 (2000)

    Google Scholar 

  13. Kushmerick, N., Grace, B.: The wrapper induction environment. In: Workshop on Software Tools for Developing Agents, AAAI 1998 (1998)

    Google Scholar 

  14. Kushmerick, N.: Wrapper induction: Efficiency and expressiveness. Artificial Intelligence 118(1-2), 15–68 (2000)

    Article  MATH  MathSciNet  Google Scholar 

  15. Muslea, I., Minton, S., et al.: STALKER: Learning extraction rules for semistructured, Web-based information sources. In: Proceedings of AAAI 1998 Workshop on AI and Information Integration. AAAI Press, Menlo Park (1998)

    Google Scholar 

  16. Gao, X., Sterling, L.: Semi-structured Data Extraction from Heterogeneous Sources. In: Bratjevik, T., Schwartz, D., Divitini, M. (eds.) Internet-based Knowledge Management and Organizational Memories, pp. 83–102. Idea Group Publishing, USA (2000)

    Google Scholar 

  17. McKeown, K.R., Barzilay, R., et al.: Columbia multi-document summarization: Approach and evaluation. In: Proceedings of the Workshop on Text Summarization, ACM SIGIR Conference, DARPA/NIST Document Understanding Conferences, DUC (2001)

    Google Scholar 

  18. Lin, S.H., Ho, J.M.: Discovering informative content blocks from Web documents. In: Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining KDD 2002. ACM Press, New York (2002)

    Google Scholar 

  19. Salton, G.: Automatic Text Processing: The Transformation, Analysis and Retrieval of Information by Computer. Addison-Wesley, Reading (1989)

    Google Scholar 

  20. Hulth, A., Karlgren, J., Jonsson, A., Boström, H., Asker, L.: Automatic Keyword Extraction Using Domain Knowledge. In: Gelbukh, A. (ed.) CICLing 2001. LNCS, vol. 2004, p. 472. Springer, Heidelberg (2001)

    Chapter  Google Scholar 

  21. Andrade, M., Valencia, A.: Automatic extraction of keywords from scientific text: application to the knowledge domain of protein families. Bioinformatics (14), 600–607 (1998)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Zhang, D., Simoff, S.J. (2006). Informing the Curious Negotiator: Automatic News Extraction from the Internet. In: Williams, G.J., Simoff, S.J. (eds) Data Mining. Lecture Notes in Computer Science(), vol 3755. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11677437_14

Download citation

  • DOI: https://doi.org/10.1007/11677437_14

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-32547-5

  • Online ISBN: 978-3-540-32548-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics