Keywords

1 Introduction

Conversational user interface (CUI) is a kind of interface mode based on the chat window, which is different from the graphical user interface and natural interaction mode. The conversation interface can provide the entrance for intelligent voice, but also carries pictures, controls and hypertext forms [2, 3, 5, 7]. For complex financial service scenario, the conversation interface combines both the voice user interface (VUI) and graphical user interface (GUI) for the user to promote the overall interaction flow, in order to realize the user tasks [1, 4].

2 Background

2.1 More Natural Way of Interaction

Conversational interface based on intelligent voice technology provides a natural form for people to communicate with technology [6, 19]. From CLI (Command-line interface), to complicated GUI (Graphical User Interface), people have been searching ways to understand and communicate with machine for about thirty years (Fig. 1). Human brains are fundamentally considered that the source of speech is human. It is the first time which people use their own way to interact with machine. That means conversational interface can be considered as a much more natural way than GUI [8, 10, 12].

Fig. 1.
figure 1

History and trend

CUI is a transition from GUI to a VUI (Fig. 1). It is at the in the form of dialog [9, 17]. The form of dialogue weakens the way of accessing and operating the main information. It would bring not only the interface change but also the process alternation.

2.2 The Feasibility

Technology and human recognition system provide the feasibility for conversational interaction. Voice interaction technology has developed rapidly. People have the requirement that the machine is able to understand themselves as human beings, or be their real assistant. At present, more than one Technology Company has announced that their voice recognition rate is up to 97% [9, 11, 15]. In the other hand, the booming Social Networking Services (SNS) make conversational interface familiar with the users than any time in the history [18].

2.3 The Improvement of Experience

For a long time, the GUI has been used as an intermediary for people to interact with software. There are a lot of interactive logic and control rules in GUI, which cost users a lot of learning costs. Compared with the graphical user interface, conversational interface based on intelligent voice technology has a unique advantage [13]. People don’t have to look for the function they want among a big list. The voice user interface (VUI) should lead the whole task to reduce operation steps. At the same time, the graphical user interface (GUI) also provides the visual advantages complementary at the key touch-point. This is definitely a huge improvement in the user experience.

3 Process

Conversation user interface (CUI) is aimed to shorten the task flow and optimize user interaction with the system [16]. CUI provides the obvious advantage experience, but at the same time the open characteristic of CUI would lead to personalization of the user flow. Rather than a fixed step of GUI era, CUI design requires the user experience designer to predict user intent and create task-driven scenarios.

Establishing a financial scenario requires integration of the financial service characteristics. The following should be paid attention:

  • Security. It includes user account security and environmental security. Account security is included in the security of the user account behavior, such as login, transact etc. Environmental security includes the judgment on environmental interference, whether it is safe for trading in this environment.

  • Professional. Unlike other Fibo as personal assistant, CUI applied in financial scenarios should be designed with strong user intent. Therefore, rigorous logic and perfect feedback mechanism are necessary.

The core element of CUI is the scenario-based dialogue, which is task-driven and focuses on what users say and how chatbot responds. Therefore, the design process could be concluded as below (Fig. 2).

Fig. 2.
figure 2

Process

3.1 Establish the Scenario

“Transfer the remittance” in mobile banking have been chosen example for exploratory conversational interaction [14]. And the intelligent system provides the conversational financial service was name “Fibo”.

The scenario provides the following conveniences for the change of CUI.

  • The instructions issued by the user is clear and simple, which could be summarized in terms of “Verb plus object”.

  • It is a High frequency use function in mobile banking.

  • The normal interaction flow is less than or equal to three.

  • The number of modifiable or edited items could be controlled.

Compared to the GUI, users interact with the interface by pointing or clicking, VUI provide a “query to answer” flow for the users. If someone (A) wants to transfer money to someone (B) from his/her account, the chat flow below (Fig. 3) shows the possibilities of the whole scenario.

Fig. 3.
figure 3

Chat flow

The rounded corners rectangles with dotted border show where VUI would happen. The rounded corners rectangles with solid border show where GUI would be added to help. VUI is to guide the user to speak out their intent, and turn it into command in order to push the task flow. However, GUI is to help the user dealing with complicated information, which is hard to represent by VUI (Fig. 3).

3.2 Input

Utterance is about what users say. Utterances are composed by keyword commands, natural speech and slot.

Keyword Command.

It is what users intent to do, such as book a hotel, check the weather etc. In this case, the intent of user is to transfer money to other ones. However sometimes people would not speak out their intent. The people said that “I owe jack $500” might probably want to pay back by money transfer.

Natural Speech.

Such as filler words.

Slot.

Slot is the key information point extracted from the user instruction by the NLU. The NUL module understands the specific requirements of user instructions through this key information and its Value definition (Slot-Value). In this case, the slots are payee name and money amount.the slot-value are corresponds to “jack” and “$500” (Fig. 4).

Fig. 4.
figure 4

Utterance

However, one of the most important aspects is defining the range of what people might say. Human language is extensive and profound. In this case, the intent of “Transfer the remittance” could be presented as follow. Transfer,send,remit,pay,wire could be used to express the meaning of the transfer (Fig. 5).

Fig. 5.
figure 5

Intent language

The utterance showed above is aimed to extend built-in intents. Also providing examples of utterance could help to ensure a good experience. One-shot utterance including the key command and slot could lead the user to the destination. One-shot example is always used as a start (Fig. 6).

Fig. 6.
figure 6

One-shot start

Input is about what user communicate with the GUI shown in the dialogue box. In this case, user need to confirm payee information by browsing list and select payee account list by pointing option. The dialogue box carry the view and control for the users’ eyes and hands.

3.3 Respond

CUI responds the user through text to speech (TTS) and GUI dialogue box. Designers write a good answer scripts, played through the TTS engine into speech. The response brings the user the most intuitive experience. GUI reflects the personality of the product through interactive design and visual design, while VUI is a personalized through the emotion and speech organization. To set the VUI personality in financial scenario needs to be based on in-depth user research. Through users’ persona to insight the social need. It also needs to integrate the character of financial product, such as professional, reliable, etc.

The design of CUI requires the designer to predefine several types of responses that might arise in the conversation. The whole dialogue design is much like the design of movie script, which help users to complete tasks through query and answer.

At the same time, the research pays more attention on the touch-point which GUI should be applied. The CUI carries GUI as dialog box form to make sure the whole process in immersive environment.

There are three Key nodes in “transfer the remittance” scenario. These key nodes would produce different touch-point according to the user’s account status.

  • Payee list. The payee who is not in the list means that payee information including name and account should be newly created. There is no shortcut for inputting a long string of card Numbers, unless the payee account is mapped as an email address or cellphone Numbers. If the form of the account information is mapped as contact information, the scenario should provide VUI branch flow aimed to get the information from users’ contact list (Fig. 7).

    Fig. 7.
    figure 7

    Payee list

  • Payee account uniqueness. If the payee has more than one account in the payee list, the system should ask the user to choose one. If the utterance includes the feature information such as XYZ bankcard, which could be tagged by system, the step of payee account selection should be omitted. Otherwise, GUI selection control is recommended to apply for users’ easy choosing if there are more than three accounts (Fig. 8).

    Fig. 8.
    figure 8

    Payee account uniqueness

  • Payment account balance. If the payment account balance is insufficient, other accounts which balance is sufficient should be matched to the process. The system should ask user to confirm the alternation. If the payment account is unique, the transfer amount less than the balance should be recommended to help to continue the process (Fig. 9).

    Fig. 9.
    figure 9

    Payee account uniqueness

4 Design

In this case, two session processes have been selected in this scenario to carry out the high fidelity prototype design (Fig. 9). The design follows the following principle:

  • Maximize the entrance of voice

  • Enhance content presentation

  • Use clear and simple graphical interface controls, form styles to improve interaction efficiency.

The first session choose the one-shot session. The whole conversation is brief. GUI dialogue box is used in the penultimate step for confirmation for the transaction. With the development of biometric technology, users will no longer rely on graphical user interface to verify signatures in the future.

The second session choose the payee account adulteration session. The conversion is composed of wise response and simple control. Compared to the traditional GUI process, only one point for the option is involved in the whole process (Fig. 10).

Fig. 10.
figure 10

High fidelity prototype design

The element of CUI is dialogue box. The process of different session could be considered as the combination of different boxes.

5 Conclusion

CUI connects the graphical user interface and natural user interface. By using social scene with user familiar with, through the intelligent voice technology which help users to think and do users want, with interesting dialogue the CUI build up tasks-driven scenario. CUI have succeed in shortening the process and optimizing the user experience.

The mission driven, immersive, single-threaded elements of financial service has a good match for the CUI experience. Based on the research on CUI for financial services scene provides a practice to explore the intelligent voice commercial market.

In the future, CUI may evolve or disappear with technological innovations. But it’s still worth exploring the method of a natural interaction.