Keywords

1 Introduction

So far, robots are still largely separated from human workers in production spaces. There are only a few examples where robots and humans work together such as in a new fulfillment center of Amazon in New Jersey [1]. Through technological development and adapting standards especially in the field of safety [2], the number of shared workspaces and closer human-robot collaborations will increment. Increasing numbers of robots can reduce production costs in the long term, compensate physical limitations of human workers, and countervail production peaks and staffing bottlenecks. Whereas most of the research on human-robot collaboration focuses on technical aspects, there is still a strong need to investigate the psychological and social prerequisites for effective collaboration as pointed out by Charles et al. [3].

Human-robot interaction can vary in many ways, e.g. in the type of task, the ratio of humans and robots, or the understanding of roles [4, 5]. Instead of just coexisting, human and robot might be able to cooperate in the future, which means to share a common goal and interact in a coordinated way [6]. It might even be possible to speak of hybrid human-robot teams. Although, the term team is usually used for human teams, recent research has transferred the concept repeatedly on robots. Classically, a team is defined as two or more individuals, who have specific roles, perform interdependent tasks, are adaptable, and share a common goal [7]. The closer and more equitable the interaction becomes, the more important are human factors [8]. Charalambous and collegeagues [9] gave a first framework of human factors which influence effective implementation and execution of hybrid human-robot collaboration. Among others, the factors trust in the robots and the robot’s reliability play a crucial role.

People often take for granted that machines and robots work perfectly and act unaware of actual dangers. This automation-induced complacency effect is caused particularly through automation reliability [10]. Hancock and colleagues [11] examined critical factors for trust development in hybrid teams in a meta-analysis. They also pointed out that the robot’s performance has the strongest association with trust. Even if the robot’s performance influenced the level of trust, Salem and colleagues [12] found out that manipulating the robot’s behavior did not influence people’s decision to work with them. This leads to two hypotheses:

  • H 1 : The accuracy of a robot’s behavior influences the level of trust. A faulty robot will be perceived as less trustworthy.

  • H 2 : The accuracy of a robots’ behavior does not influence the perception of the team functionality.

Experiments have shown that a robot’s appearance systematically influences the humans’ affect towards them. This effect, originally proposed by Jentsch [13] and Freud [14], is called “Uncanny Valley” [15] after the form of the diagram which illustrates the conjecture to describe the effect. Correspondingly, humanoid robots are more popular than industrial robots (until they are too anthropomorphic and the response becomes revulsion). Humanlike appearance is also associated with human attributes, people tend for example to delegate responsibility better to humanlike then to machinelike robots [16]. This finding is supported also by Hancock and colleagues [11], who saw the characteristics of the robot (like anthropomorphism) also as an influential factor on trust. The robot’s look rises expectations of the robot’s role and behavior in the situation, wherefore hybrid collaboration can be improved trough a matching of the robot’s appearance and the given task. A robot that confirmed the expectations increases peoples’ sense of the robot’s compatibility with its job, and hence their compliance with the robot [17]. In a production environment, an industrial robot will match the situation better than a humanoid, which leads to the following hypotheses:

  • H 3 : The robot’s appearance influences the level of trust. A humanoid robot will be perceived as more trustworthy.

  • H 4 : The robot’s appearance influences the perception of the team functionality. An industrial robot will create a higher perceived team functionality.

  • H 5 : The robot’s appearance influences the performance of the collaborative task. An industrial robot will lead to a higher performance.

To test these hypotheses, a virtual reality experiment with four experimental groups was conducted which is described in the next section. Then a description of the sample follows and the results are reported. In the last section, the results and limitations are discussed and an outlook is given.

2 Method

To analyze hybrid collaboration in a controlled experiment, a Wizard-of-OzFootnote 1 experiment was designed within a virtual reality setting using a Virtual TheaterFootnote 2 and an Oculus Rift Development Kit 2 (DK2) Head-Mounted Display (after the concept described in [18, 19]).

In a virtual production hall (see Fig. 1), the participants needed to fulfil a task that could only be successfully achieved via a cooperation with the mobile, voice-controlled robot CharlesFootnote 3. To get used to the virtual environment, exploration time was provided where participants could walk freely through the production hall and get information about the different machines (e.g. lathes, bending machine) by pressing the corresponding buttons. The robot stood in the production hall and offered help operating the machines through a textual interface, which was also read aloud. Participants were told that they could give the robot instructions by saying “Go to machine…” (“Gehe zu…”) or “Press the button” (“Benutze…”). The robot could only operate machines with a red button; the human could only operate machines with a green button, which the participant had to figure out by try and error. The names of the machines were visible as numbers over the machines. The actual task instructions were given when they operated a specific machine: The task was to operate an electric chain hoist by alternately pressing a green and a red button. To ensure the power supply, human or robot needed to stand on a platform. They had to pull as much of the rope as possible within five minutes starting from the moment when the instruction was activated. The total time within the Virtual Theatre was 15 min in average.

Fig. 1.
figure 1

Impressions of (a) the production hall, (b) the chain host and (c) the floorplan.

In a 2 × 2 design, the robot’s characteristics were manipulated. The robot appeared either as an industrial or humanoid robot (see Fig. 2) and operated either reliably or imperfect in response to the human operator. Participants were randomly assigned to the conditions. The behavior of the robot in both conditions was standardized by the specifications of an activity (see Fig. 3).

Fig. 2.
figure 2

Design of the humanoid and industrial robot.

Fig. 3.
figure 3

Activity diagram for the reliable condition

The experiment was framed with an online pre- and a post-survey using SoSci SurveyFootnote 4. The pre-survey was designed to get insights into the personal characteristics of the participants and the post-survey to gain information of the individual, subjective assessment of the hybrid teamwork. An overview of the complete questionnaires is reported in Table 1.

Table 1. Questionnaires of the pre- and post-survey. Questionnaires used for the analysis in this paper are starred (*).

For the analysis in this paper, we analysed the level of trust towards the robot with an adaption of the questionnaire of Schaefer [25] as well as the team functionality with the Questionnaire of Teamwork (FAT) by Kauffeld and Frieling [23] in dependence of the robot’s characteristics. Trust was evaluated by entering a percentage value. Exemplary questions are “How much percent of the time did the robot work properly?” or “How much percent of the time did the robot work well in the team?”. Team functionality was assessed using a six-point Likert-scale with verbalized endpoints. Examples are “Our goals are realistic and achievable. – Our goals are unrealistic and unachievable.” or “We reach all goals with ease. – Sometimes we get the impression that we cannot achieve our goals.”. Reliability for the individual FAT subscales varied between α = .64 and α = 89 [23].

3 Results

In order to test the hypotheses described above, factorial analyses of variance (ANOVA) with the fixed factors appearance and behaviour of the robot were conducted using SPSS 24.

Robots which performed reliable were trusted more \( \left( {\bar{x} = 6 1. 70,SD = 1 4. 3 7} \right) \) than robots which performed faulty \( (\bar{x} = 5 5. 2 7,SD = 1 6. 6 6) \). This main effect of the robot’s accuracy on the level of trust was significant, F (1, 108) = 4.65, p = .033, ω2 = .04. Hence, hypothesis H 1 is confirmed. The main effect of appearance and the interaction effect were not significant, thus hypothesis H 3 cannot be confirmed.

Robot’s appearanceFootnote 5 influenced some FAT dimensions. There was a significant main effect of the robot’s appearance on orientation, F (1, 108) = 6.16, p = .014, ω2 = .05, and task management, F (1, 108) = 4.23, p = .039, ω2 = .04. The humanoid condition had higher values in both dimensions (\( \bar{x} = 2. 8 5,SD = 1.0 1 \) vs. \( \bar{x} = 2. 3 9,SD = . 9 7 \) for goal orientation and \( \bar{x} = 3. 1 2,SD = 1. 10 \) vs. \( \bar{x} = 2. 7 5,SD = . 8 5 \) for task management) indicating less goal orientation and task management. The dimensions can also be summarized as task reflexivity [31]. The other dimensions of the FAT, which can be summarized as social reflexivity, did not reveal significant main effects, so hypothesis H 4 can be partially confirmed. The teams with both the industrial and the humanoid robot can be classified as fully functioning teams (see Fig. 4). The main effects of behaviour and the interaction effects were not significant, too, which goes along with hypothesis H 2 .

Fig. 4.
figure 4

Team functionality of the hybrid teams after West [31].

There was a significant main effect of the robot’s appearance on the performance, F (1, 104) = 5.06, p = .027, ω2 = .05). In collaboration with the humanoid robot, the performance \( (\bar{x} = 3. 2 6,SD = 2. 2 5 ) \) was not as good as with the industrial robot \( (\bar{x} = 4. 2 6,SD = 2. 3 1) \), which goes along with hypothesis H 5 . The main effect of the robot’s behaviour as well as the interaction effect between appearance and behaviour on performance were not significant.

4 Discussion

This study examined the impact of the design of a robotic workmate (appearance and behaviour) on the trust placed in the robot, the perceived team functionality, and the objective team performance (which might be an effect of time – with more time provided for task fulfilment, an impact is assumed). It was found that a faulty robot was perceived as less trustworthy, but that its accuracy did not affect the team functionality and team performance. An industrial appearing lead to a higher task reflexivity and a better collaborative performance, but no influence of the appearance on the level of trust was found.

The lacking influence of appearance on trust and social reflexivity might be a result of the currently still predominant mental model of robots which considers robots – independently of their look – more as a tool than an actual teammate [32]. This might be also enhanced by the design of the study, which assigned the participants as an operator of the robot who commands it. Although a natural bidirectional language interface was chosen and the robot listened to a human name, which should emphasize the equality of the team members, the robots did not seem to introduce a high sense of unity.

Further work will continue to explore how the participants’ characteristics (e.g. personality, attitude towards robots) influence the performance of hybrid collaboration and the subjective perception of it. Through analyzing the interactions between human and robot during the experiment (via video analysis), it will also be considered whether human-robot teams experience team development and whether the stages of human team development can be transferred to hybrid teams.

The study gives first implications of how a robotic workmate should be designed, taking into account the interactions with a human partner. Nevertheless, the limitations of this study should be considered. First, due to technical issues and simulation sickness of some participants, comparatively few test subjects were included in each condition of the experiments; hence, the statistical power was limited. Second, the sample consisted mainly of students and it is possible that construction workers, who will be the actual end-users, will perceive hybrid collaboration in a different way. Third, habituation effects might occur, so that the described patterns might not remain over time. Fourth, a virtual environment setting was used to guarantee a safe interaction with a robot and to manipulate the robot’s characteristics easily. However, it must be explored whether the findings are transferable to real production environments.