Elsevier

Expert Systems with Applications

Volume 42, Issue 20, 15 November 2015, Pages 6807-6818
Expert Systems with Applications

Knowledge discovery in task-oriented dialogue

https://doi.org/10.1016/j.eswa.2015.05.005Get rights and content

Highlights

  • Method for knowledge discovery in task-oriented dialogues.

  • Knowledge is represented using folksonomies.

  • The knowledge represented by folksonomies can be used to interpret new utterances.

  • The folksonomy can be used to discover Topics Addressed by interlocutors.

Abstract

Knowledge discovery is the process of discovering useful knowledge in a broad range of sources, such as relational databases, images, or texts. Dialogues are generated by interaction between people using natural language and can be used as a source of information. Once discovered, knowledge needs to be represented, and there are several approaches to this. In this paper, we propose a method to discover knowledge in task-oriented dialogues by representing these dialogues through folksonomies, using a novel quadripartite model. Folksonomies are knowledge structures composed of users, tags, and resources. Dialogues and folksonomies have a social dimension in common, which renders folksonomies suited to representing knowledge discovered from dialogues. The knowledge represented by folksonomies can be used to interpret new utterances in a dialogue and detect trends, e.g., by discovering Topics Addressed by people at different time intervals, in the dialogues used to learn the folksonomies. The main difference between our approach and past techniques is that we use the characteristics (the content) of each resource in the discovery process. Experiments involving a real-world task-oriented dialogue corpus showed that using our method, learned folksonomies can interpret utterances with an accuracy of 72.32%. Moreover, another experiment showed that it is possible to use our method to determine Topics Addressed by interlocutors in dialogues.

Introduction

The process of extracting useful, implicit, and previously unknown knowledge from large amounts of data is known as Knowledge Discovery in Databases (KDD) (Lara, Lizcano, Martínez, & Pazos, 2014). KDD has been used in a broad range of applications across a variety of domains, such as to improve the analysis of marketing and business databases (Orriols-Puig, Martínez-López, Casillas, & Lee, 2013), extract knowledge from structural medical data (Esfandiary, Babavalian, Moghadam, & Tabar, 2014), and to monitor water quality using hydrological data (Alatrista-Salas et al., 2014). Due to the rapidly growing amounts of digital data, there is a pressing need for theories and tools that can support the extraction of useful information (knowledge) from them. KDD aims to map low-level data into other forms that are more compact, abstract, and useful (Fayyad, Piatetsky-Shapiro, & Smyth, 1996). The knowledge discovered may be used for analysis, trends, classification, group identification, behavior forecasting, etc. Once knowledge has been obtained, it needs to be represented in some form, such as ontologies, frames, or folksonomies.

Various sources may be used for KDD: relational databases, and structured and non-structured texts or images are examples. However, dialogues have not yet been explored as source of knowledge. Dialogues are interactions between speakers and listeners, called interlocutors, and are composed of speech acts (utterances). Of the several types of dialogues, task-oriented dialogue aims to solve a given task in a given domain. Such dialogues generate the solution to a task, requested by someone in order to accomplish something, in a concise sequence. Thus, a remarkable characteristic of task-oriented dialogues is that they involve two kinds of interlocutors: one that asks for help, and another that possesses knowledge of the relevant domain, and assists the former kind in solving the task at hand. Table 1 shows an example of a task-oriented dialogue.

According to Traum and Hinkelman (1992), one of the main characteristics of task-oriented dialogue is the dissemination of knowledge, i.e., the interlocutor with more knowledge transfers it to the one asking for help (Carletta et al., 1997). This kind of dialogue is now common on the Internet. Several companies offer customer support by way of live chat, where an attendant answers questions from customers (Elmorshidy, 2011). In this paper, we refer to as “attendant” the interlocutor who has knowledge of the domain and “user” the interlocutor who is asking for help, as in a customer support center (e.g., help desk system).

Our research in this paper aims to discover knowledge from task-oriented dialogues. Once discovered, the knowledge must be represented. Of the several alternatives available, we choose folksonomies for the following reasons: (i) both dialogue and folksonomies have in common a social dimension; (ii) folksonomies directly reflect the vocabulary of common users, and thus can represent discovered knowledge more faithfully; and (iii) folksonomies are simpler than other knowledge structures, such as ontologies. It is important to mention that since we had some success with folksonomies, we plan in future research to develop a method to learn ontologies from dialogues.

Folksonomies are structures of knowledge representation that emerge from tagging in collaborative tagging systems (Peters, 2009). Tagging is the assignment of tags to resources by users. Thus, folksonomies comprise users, tags, and resources. A resource can be any object that users are interested in tagging, such as photos and videos. In comparison with ontologies, folksonomies are simpler structures to implement and use (Echarte, Astrain, Córdoba, & Villadangos, 2007). According to Hotho, Jäschke, Schmitz, and Stumme (2006a), one of the benefits of tagging is that users do not need experience or a particular skill to participate, i.e., the folksonomies that emerge do not need to be built by knowledge engineers. Moreover, ontologies have a controlled vocabulary derived by consensus, which needs to be attained among the participants of the knowledge-building process (knowledge engineers, domain experts, etc.). By contrast, folksonomies directly reflect the vocabulary of common users, since lay users assign tags to resources (Quintarelli, 2005). As a consequence, folksonomies, unlike ontologies, are untroubled by the large amount of information at hand or the need for consensus. A major characteristic of folksonomies is their social dimension (users), which is also part of dialogues, due to the interaction between users. This characteristic renders folksonomies suitable for representing knowledge discovered from dialogues.

In this paper, we are interested in discovering knowledge in dialogues and introduce a method to learn folksonomies from task-oriented dialogues. The knowledge represented by folksonomies may be used for the interpretation of new dialogue utterances and for trend detection, e.g., discovering Topics Addressed by people at different time intervals in dialogues used to learn the folksonomies. Trending topics are the ones being discussed more than others, and are useful measures of popularity on social networking services, such as Twitter. In order to verify whether the structures created by our method are genuine folksonomies, we performed an experiment to show that they exhibit the small-world phenomenon (Milgram, 1967), which is a characteristic of folksonomies (Cattuto et al., 2007). We also confirmed that our learned folksonomies can interpret dialogue utterances. For this, we performed an experiment to measure the accuracy of the folksonomies in interpreting utterances to determine whether they belong to the domain represented by the folksonomies. Moreover, we conducted an experiment to discover trending topics using the learned folksonomies.

The main contribution of this paper is a method to facilitate the learning of folksonomies from dialogues. To the best of our knowledge, ours is the first published proposal for learning folksonomies from dialogues.

This paper is organized as follows: Section 2 presents the concept of a folksonomy, and Section 3 contains a description of the FolksDialogue method. In Section 4, we present our proposed approach to trend detection in folksonomies. The experiments that we conducted and the results obtained are detailed in Section 5. We survey related works in the area in Section 6, and offer our conclusions as well as directions for future work Section 7.

Section snippets

Folksonomies

Collaborative tagging systems are characterized by the idea of tagging resources or objects through terms or keywords (tags). Such terms are freely created by different users in their own words and serve as reference for a particular resource or object of their interest. Resources can be of different kinds depending on the tagging system. Examples of tagging systems and their resources include Delicious (URLs), Flickr (pictures), and last.fm (music). In such systems, users tag resources (URLs,

The FolksDialogue method

Our proposed method aims to discover knowledge in task-oriented dialogues, and its output is a folksonomy. In order to better explain our approach, we first present an extension of the formal definition of the tripartite model of folksonomies, described in Section 2, obtained from task-oriented dialogues.

Trend detection in dialogues

In the context of our study, trend detection refers to discovering Topics Addressed at different time intervals by interlocutors in a dialogue. Trending topics are issues that are being discussed more often than others. Trending topics are regularly detected and highlighted in a broad range of contexts nowadays. The social networking service Twitter uses all public tweets to compile a list of the most discussed topics that is updated every hour (Kang, Kim, & Chung, 2014). In our approach, we

Experiments and results

In this section, we detail experiments to test our proposed method for knowledge discovery in dialogues and report the results. In order to first check whether the structures created by our proposed method are genuine folksonomies, we performed an experiment to show that they contain the small-world phenomenon, which is a characteristic of folksonomies. Furthermore, we designed an experiment to show that the folksonomy learned using FolksDialogue can be used to interpret utterances in dialogues

Related work

We searched publications related to discovering knowledge from dialogues, but did not find a work with that particular focus. We did find a study by Trappey, Wu, Liu, and Lin (2013) that proposed a process to analyze consumer dialogues in order to discover factors that contribute to customer satisfaction and dissatisfaction in some service experience. For this, the authors used text mining techniques and clustering methods. It is important to note, however, that what the authors call

Conclusions and future work

Knowledge discovery aims to extract unknown and useful knowledge from large amounts of data. Different sources may be used in KDD. In this paper, we proposed a method to extract knowledge from dialogues and represent it through folksonomies. The method learns folksonomies from task-oriented dialogues represented by a novel quadripartite model.

The main contribution of this paper is a new method to learn folksonomies from dialogues. To the best of our knowledge, no such approach has hitherto been

Acknowledgments

Gregory Moro Puppi Wanderley would like to thank CAPES-Brazil for supporting him in this research.

References (37)

  • Echarte, F., Astrain, J. J., Córdoba, A., & Villadangos, J. (2007). Ontology of folksonomy: A new modeling method. In...
  • Elmorshidy, A. (2011). Benefits analysis of live customer support chat in E-commerce websites: Dimensions of a new...
  • D.W. Embley et al.

    Handbook of concept modeling: Theory, practice, and research challenges

    (2011)
  • N. Esfandiary et al.

    Knowledge discovery in medicine: Current issue and future trend

    Expert Systems with Applications

    (2014)
  • U. Fayyad et al.

    From data mining to knowledge discovery in databases

    AI Magazine

    (1996)
  • M. Gupta et al.

    Survey on social tagging techniques

    SIGKDD Explorations

    (2010)
  • Hotho, A., Jäschke, R., Schmitz, C., & Stumme, G. (2006a). Information retrieval in folksonomies: Search and ranking....
  • Hotho, A., Jäschke, R., Schmitz, C., & Stumme, G. (2006b). Trend detection in folksonomies. In Proceedings of first...
  • Cited by (4)

    • A rule-based support system for dissonance discovery and control applied to car driving

      2016, Expert Systems with Applications
      Citation Excerpt :

      Knowledge discovery can also be a source of inconsistency. The main principle of knowledge discovery consists in using several knowledge bases in order to merge them and discover new knowledge (Wachla & Moczulski, 2007; Lee and Wang, 2012; Ruiz, Foguem, & Grabot, 2014; Valverde-Albacete, González-Calabozo, Peñas, & Peláez-Moreno, 2016; Wanderley, Tacla, Barthès, & Paraiso, 2015; Zhang et al., 2014). It can also concern an unexpected discovery such as serendipity (McCay-Peet, Toms, & Kelloway, 2015), or creative discovery such as inventive problem solving (Yan, Zanni-Merk, Cavallucci, & Collet, 2014).

    • Using linguistic context to learn folksonomies from task-oriented dialogues

      2019, Proceedings of the 32nd International Florida Artificial Intelligence Research Society Conference, FLAIRS 2019
    View full text