Knowledge discovery in task-oriented dialogue

doi:10.1016/j.eswa.2015.05.005

Expert Systems with Applications

Volume 42, Issue 20, 15 November 2015, Pages 6807-6818

https://doi.org/10.1016/j.eswa.2015.05.005 Get rights and content

Highlights

•
Method for knowledge discovery in task-oriented dialogues.
•
Knowledge is represented using folksonomies.
•
The knowledge represented by folksonomies can be used to interpret new utterances.
•
The folksonomy can be used to discover Topics Addressed by interlocutors.

Abstract

Knowledge discovery is the process of discovering useful knowledge in a broad range of sources, such as relational databases, images, or texts. Dialogues are generated by interaction between people using natural language and can be used as a source of information. Once discovered, knowledge needs to be represented, and there are several approaches to this. In this paper, we propose a method to discover knowledge in task-oriented dialogues by representing these dialogues through folksonomies, using a novel quadripartite model. Folksonomies are knowledge structures composed of users, tags, and resources. Dialogues and folksonomies have a social dimension in common, which renders folksonomies suited to representing knowledge discovered from dialogues. The knowledge represented by folksonomies can be used to interpret new utterances in a dialogue and detect trends, e.g., by discovering Topics Addressed by people at different time intervals, in the dialogues used to learn the folksonomies. The main difference between our approach and past techniques is that we use the characteristics (the content) of each resource in the discovery process. Experiments involving a real-world task-oriented dialogue corpus showed that using our method, learned folksonomies can interpret utterances with an accuracy of 72.32%. Moreover, another experiment showed that it is possible to use our method to determine Topics Addressed by interlocutors in dialogues.

Introduction

The process of extracting useful, implicit, and previously unknown knowledge from large amounts of data is known as Knowledge Discovery in Databases (KDD) (Lara, Lizcano, Martínez, & Pazos, 2014). KDD has been used in a broad range of applications across a variety of domains, such as to improve the analysis of marketing and business databases (Orriols-Puig, Martínez-López, Casillas, & Lee, 2013), extract knowledge from structural medical data (Esfandiary, Babavalian, Moghadam, & Tabar, 2014), and to monitor water quality using hydrological data (Alatrista-Salas et al., 2014). Due to the rapidly growing amounts of digital data, there is a pressing need for theories and tools that can support the extraction of useful information (knowledge) from them. KDD aims to map low-level data into other forms that are more compact, abstract, and useful (Fayyad, Piatetsky-Shapiro, & Smyth, 1996). The knowledge discovered may be used for analysis, trends, classification, group identification, behavior forecasting, etc. Once knowledge has been obtained, it needs to be represented in some form, such as ontologies, frames, or folksonomies.

Various sources may be used for KDD: relational databases, and structured and non-structured texts or images are examples. However, dialogues have not yet been explored as source of knowledge. Dialogues are interactions between speakers and listeners, called interlocutors, and are composed of speech acts (utterances). Of the several types of dialogues, task-oriented dialogue aims to solve a given task in a given domain. Such dialogues generate the solution to a task, requested by someone in order to accomplish something, in a concise sequence. Thus, a remarkable characteristic of task-oriented dialogues is that they involve two kinds of interlocutors: one that asks for help, and another that possesses knowledge of the relevant domain, and assists the former kind in solving the task at hand. Table 1 shows an example of a task-oriented dialogue.

According to Traum and Hinkelman (1992), one of the main characteristics of task-oriented dialogue is the dissemination of knowledge, i.e., the interlocutor with more knowledge transfers it to the one asking for help (Carletta et al., 1997). This kind of dialogue is now common on the Internet. Several companies offer customer support by way of live chat, where an attendant answers questions from customers (Elmorshidy, 2011). In this paper, we refer to as “attendant” the interlocutor who has knowledge of the domain and “user” the interlocutor who is asking for help, as in a customer support center (e.g., help desk system).

Our research in this paper aims to discover knowledge from task-oriented dialogues. Once discovered, the knowledge must be represented. Of the several alternatives available, we choose folksonomies for the following reasons: (i) both dialogue and folksonomies have in common a social dimension; (ii) folksonomies directly reflect the vocabulary of common users, and thus can represent discovered knowledge more faithfully; and (iii) folksonomies are simpler than other knowledge structures, such as ontologies. It is important to mention that since we had some success with folksonomies, we plan in future research to develop a method to learn ontologies from dialogues.

Folksonomies are structures of knowledge representation that emerge from tagging in collaborative tagging systems (Peters, 2009). Tagging is the assignment of tags to resources by users. Thus, folksonomies comprise users, tags, and resources. A resource can be any object that users are interested in tagging, such as photos and videos. In comparison with ontologies, folksonomies are simpler structures to implement and use (Echarte, Astrain, Córdoba, & Villadangos, 2007). According to Hotho, Jäschke, Schmitz, and Stumme (2006a), one of the benefits of tagging is that users do not need experience or a particular skill to participate, i.e., the folksonomies that emerge do not need to be built by knowledge engineers. Moreover, ontologies have a controlled vocabulary derived by consensus, which needs to be attained among the participants of the knowledge-building process (knowledge engineers, domain experts, etc.). By contrast, folksonomies directly reflect the vocabulary of common users, since lay users assign tags to resources (Quintarelli, 2005). As a consequence, folksonomies, unlike ontologies, are untroubled by the large amount of information at hand or the need for consensus. A major characteristic of folksonomies is their social dimension (users), which is also part of dialogues, due to the interaction between users. This characteristic renders folksonomies suitable for representing knowledge discovered from dialogues.

In this paper, we are interested in discovering knowledge in dialogues and introduce a method to learn folksonomies from task-oriented dialogues. The knowledge represented by folksonomies may be used for the interpretation of new dialogue utterances and for trend detection, e.g., discovering Topics Addressed by people at different time intervals in dialogues used to learn the folksonomies. Trending topics are the ones being discussed more than others, and are useful measures of popularity on social networking services, such as Twitter. In order to verify whether the structures created by our method are genuine folksonomies, we performed an experiment to show that they exhibit the small-world phenomenon (Milgram, 1967), which is a characteristic of folksonomies (Cattuto et al., 2007). We also confirmed that our learned folksonomies can interpret dialogue utterances. For this, we performed an experiment to measure the accuracy of the folksonomies in interpreting utterances to determine whether they belong to the domain represented by the folksonomies. Moreover, we conducted an experiment to discover trending topics using the learned folksonomies.

The main contribution of this paper is a method to facilitate the learning of folksonomies from dialogues. To the best of our knowledge, ours is the first published proposal for learning folksonomies from dialogues.

This paper is organized as follows: Section 2 presents the concept of a folksonomy, and Section 3 contains a description of the FolksDialogue method. In Section 4, we present our proposed approach to trend detection in folksonomies. The experiments that we conducted and the results obtained are detailed in Section 5. We survey related works in the area in Section 6, and offer our conclusions as well as directions for future work Section 7.

Section snippets

Folksonomies

Collaborative tagging systems are characterized by the idea of tagging resources or objects through terms or keywords (tags). Such terms are freely created by different users in their own words and serve as reference for a particular resource or object of their interest. Resources can be of different kinds depending on the tagging system. Examples of tagging systems and their resources include Delicious (URLs), Flickr (pictures), and last.fm (music). In such systems, users tag resources (URLs,

The FolksDialogue method

Our proposed method aims to discover knowledge in task-oriented dialogues, and its output is a folksonomy. In order to better explain our approach, we first present an extension of the formal definition of the tripartite model of folksonomies, described in Section 2, obtained from task-oriented dialogues.

Trend detection in dialogues

In the context of our study, trend detection refers to discovering Topics Addressed at different time intervals by interlocutors in a dialogue. Trending topics are issues that are being discussed more often than others. Trending topics are regularly detected and highlighted in a broad range of contexts nowadays. The social networking service Twitter uses all public tweets to compile a list of the most discussed topics that is updated every hour (Kang, Kim, & Chung, 2014). In our approach, we

Experiments and results

In this section, we detail experiments to test our proposed method for knowledge discovery in dialogues and report the results. In order to first check whether the structures created by our proposed method are genuine folksonomies, we performed an experiment to show that they contain the small-world phenomenon, which is a characteristic of folksonomies. Furthermore, we designed an experiment to show that the folksonomy learned using FolksDialogue can be used to interpret utterances in dialogues

Related work

We searched publications related to discovering knowledge from dialogues, but did not find a work with that particular focus. We did find a study by Trappey, Wu, Liu, and Lin (2013) that proposed a process to analyze consumer dialogues in order to discover factors that contribute to customer satisfaction and dissatisfaction in some service experience. For this, the authors used text mining techniques and clustering methods. It is important to note, however, that what the authors call

Conclusions and future work

Knowledge discovery aims to extract unknown and useful knowledge from large amounts of data. Different sources may be used in KDD. In this paper, we proposed a method to extract knowledge from dialogues and represent it through folksonomies. The method learns folksonomies from task-oriented dialogues represented by a novel quadripartite model.

The main contribution of this paper is a new method to learn folksonomies from dialogues. To the best of our knowledge, no such approach has hitherto been

Acknowledgments

Gregory Moro Puppi Wanderley would like to thank CAPES-Brazil for supporting him in this research.

References (37)

S. Chojnacki et al.
Random graph generative model for folksonomy network structure approximation
Procedia Computer Science
(2010)
J. Lara et al.
Data preparation for KDD through automatic reasoning based on description logic
Information Systems
(2014)
P. Mika
Ontologies are us: A unified model of social networks and semantics
Web Semantics: Science, Services and Agents on the World Wide Web
(2007)
A. Orriols-Puig et al.
Unsupervised KDD to creatively support managers’ decision making with fuzzy association rules: A distribution channel application
Industrial Marketing Management
(2013)
H. Alatrista-Salas et al.
A knowledge discovery process for spatiotemporal data: Application to river water quality monitoring
Ecological Informatics
(2014)
Belgeman, G., Keller, P., & Smadja, F. (2006). Automated tag clustering: Improving search and exploration in the tag...
J. Carletta et al.
The reliability of a dialogue structure coding scheme
Computational Linguistics
(1997)
C. Cattuto et al.
Network properties of folksonomies
AI Communication
(2007)
CoGrOO (2014). Website: <http://cogroo.sourceforge.net/> Accessed: 07 Oct....
Delicious (2014). <www.delicious.com> Accessed: 07 Oct....

Echarte, F., Astrain, J. J., Córdoba, A., & Villadangos, J. (2007). Ontology of folksonomy: A new modeling method. In...

Elmorshidy, A. (2011). Benefits analysis of live customer support chat in E-commerce websites: Dimensions of a new...

D.W. Embley et al.

Handbook of concept modeling: Theory, practice, and research challenges

(2011)

N. Esfandiary et al.

Knowledge discovery in medicine: Current issue and future trend

Expert Systems with Applications

(2014)

U. Fayyad et al.

From data mining to knowledge discovery in databases

AI Magazine

(1996)

M. Gupta et al.

Survey on social tagging techniques

SIGKDD Explorations

(2010)

Hotho, A., Jäschke, R., Schmitz, C., & Stumme, G. (2006a). Information retrieval in folksonomies: Search and ranking....

Hotho, A., Jäschke, R., Schmitz, C., & Stumme, G. (2006b). Trend detection in folksonomies. In Proceedings of first...

Cited by (4)

A rule-based support system for dissonance discovery and control applied to car driving
2016, Expert Systems with Applications
Citation Excerpt :
Knowledge discovery can also be a source of inconsistency. The main principle of knowledge discovery consists in using several knowledge bases in order to merge them and discover new knowledge (Wachla & Moczulski, 2007; Lee and Wang, 2012; Ruiz, Foguem, & Grabot, 2014; Valverde-Albacete, González-Calabozo, Peñas, & Peláez-Moreno, 2016; Wanderley, Tacla, Barthès, & Paraiso, 2015; Zhang et al., 2014). It can also concern an unexpected discovery such as serendipity (McCay-Peet, Toms, & Kelloway, 2015), or creative discovery such as inventive problem solving (Yan, Zanni-Merk, Cavallucci, & Collet, 2014).
This paper is based on the concept of dissonance, that is, gaps or conflicts existing in a specific knowledge base or among different knowledge bases. It presents a rule-based system that assists human operators in dissonance discovery and control by taking into account two kinds of dissonance, i.e., affordance to study conflicts of use, and inconsistencies to study conflicts of intention and action, through the analysis of cognitive behavior implemented in knowledge bases. This system elaborates the knowledge base composed of rules, and analyzes the knowledge content to discover new knowledge by creating additional rules, or to identify inconsistencies when conflicts between rules occur. The affordance discovery control process uses a deductive and an inductive reasoning algorithm of which the aim is to establish new rules using existing ones. The inconsistency discovery control process applies an abductive reasoning algorithm in order to determine contradictory rules when existing rules may result in opposite intentions being accomplished. Two groups of inconsistencies are addressed: interferences involving several decision makers, and contradictions involving the same decision maker. A knowledge acquisition control process facilitates the creation of the initial rules that contain parameters such as intentions relating to the goals to be achieved, actions to be performed to achieve these intentions, objects used to carry out these actions and the decision makers who execute these actions using the corresponding objects. A feasibility study taking into account five rule bases relating to the manual use of an Automated Speed Control System (ASCS), the automated control of the car speed by the ASCS, the manual control of aquaplaning, the manual control of the car speed, and the manual control of car fuel consumption is proposed to validate the rule-based support system.
MOCA: A Motivational Online Conversational Agent for Improving Student Engagement in Collaborative Learning
2021, IEEE Transactions on Learning Technologies
Using linguistic context to learn folksonomies from task-oriented dialogues
2019, Proceedings of the 32nd International Florida Artificial Intelligence Research Society Conference, FLAIRS 2019
An integrated forecasting model of complex uncertainty system based on knowledge discovery
2016, Journal of Computers (Taiwan)

View full text

Knowledge discovery in task-oriented dialogue

Highlights

Abstract

Introduction

Section snippets

Folksonomies

The FolksDialogue method

Trend detection in dialogues

Experiments and results

Related work

Conclusions and future work

Acknowledgments

Procedia Computer Science

Information Systems

Web Semantics: Science, Services and Agents on the World Wide Web

Industrial Marketing Management

A knowledge discovery process for spatiotemporal data: Application to river water quality monitoring

Ecological Informatics

The reliability of a dialogue structure coding scheme

Computational Linguistics

Network properties of folksonomies

AI Communication

Handbook of concept modeling: Theory, practice, and research challenges

Knowledge discovery in medicine: Current issue and future trend

Expert Systems with Applications

From data mining to knowledge discovery in databases

AI Magazine

Survey on social tagging techniques

SIGKDD Explorations