Keywords

1 Introduction

Recent developments in IoT technology allow more and more data to be collected. For instance, in agro-food domain sensor data can improve the crop yield and quality prediction [29, 31] and optimization of food supply [34]. Those data can be analyzed through statistical analysis [13, 15], machine learning methods [12] or visual analytics [32, 40]. Visual analytics technology is often used for the descriptive analytics purposes and understanding the data. However, many dashboards fail to communicate efficiently and effectively to the user [11]. Also, novice professionals struggles to understand simple graphs, especially if the information is not clear-cut and visually prominent [33]. Therefore, presenting the information in another way than visual, such as verbal, may be beneficial for the user.

One of the ways to obtain verbalization of data is to employ the linguistic summarization approach. Linguistic summaries can automatically generate natural language like sentences with aim of capturing the essence of data [16, 24]. Linguistic summaries have proven to be useful in several applications, e.g. retail [17] and eldercare [39].

Linguistic summaries are being developed and matured as the method by many researchers, for instance by proposing new quality measures, e.g., [8, 17], or more efficient generation methods [10, 18, 28].

Yet, one of the remaining challenges is that often too many linguistic summaries are being generated [10]. Thus, it is important to determine which linguistic summaries are useful and appropriate for target users achieving a balanced trade-off between complexity and precision of information [28]. Naturally usefulness or relevance is very much context dependent. In this paper we tackle this issue, by proposing a method to detect relevant linguistic summaries in a case from the agro-food domain. We consider a case of a pig barn and a climate sensor that captures the conditions in the barn. We want to present to the farmers only the linguistic summaries that they consider useful and allowing them to take actions that improve their operations.

This paper is structured as follows. Next section describes the background on linguistic summarization, Sect. 3 describes the context of the case study and Sect. 4 the proposed the method to detect relevant summaries. Section 5 presents the results of the case study and is followed by the concluding remarks.

2 Background

In this paper we follow the approach of Yager [41], which was considerably advanced and then implemented by Kacprzyk [16], Kacprzyk and Yager [21], Kacprzyk et al. [22, 23].

In this approach linguistic data summaries are quantified propositions with two possible protoforms (or templates):

  • simple protoform:

    $$\begin{aligned} Q y\text {'s are }P; \end{aligned}$$
    (1)

    e.g. Most cars are new

  • extended protoform:

    $$\begin{aligned} Q \ R y\text {'s are }P; \end{aligned}$$
    (2)

    e.g. Most new cars are fast

where Q is the quantifier, P is the summarizer, and R is an optional qualifier, which are all modeled as fuzzy sets over appropriate domains.

The truth value, describing the validity of the summary, is the basic measure of the quality of the summary. Many methods for calculating the truth value have been proposed [8]. But the truth value is not the only quality measure of a linguistic summary. Kacprzyk et al. [17, 23] proposed four additional measures, namely the degree of specificity, the degree of appropriateness, the degree of covering, and the length of the summary. Bugarin et al. [5] were differentiating between evaluating a single summary sentence and a set of summaries. They proposed several measures that capture aspects, such as coverage, length, and specificity. An overview of different quality criteria can be found in [8, 28].

Those quality measures can be used to select a smaller set of the true summaries to be shown to the user. Three different approaches of employing the other quality measures can distinguished, namely using the thresholds [4, 39], dominance of linguistic summaries [6] and aggregating the quality measures [9, 17]. Yet, despite those methods, still too many linguistic summaries are obtained [10].

Linguistic summaries has been applied to different types of data: numerical [7, 14, 36], time series [19, 20, 30], sensor data [35, 39], texts [37], videos [1,2,3] and processes [10, 38] in different application domains, such as retail [17], eldercare [39] or finance [19].

3 Case Study in the Agro-Food Domain

In this section we describe the context of the case study as the context is very important for the relevance. We consider a case of a climate sensor installed in a pig barn. This sensor measures the temperature, air humidity and carbon dioxide (CO\(_2\)) inside the barn every 10 min. A dashboard was created to present the data collected by the sensor and some additional readings coming from the nearby weather station like outside air temperature and outside air humidity.

The dashboard initially showed the live gauges displaying the information about current values of inside temperature, humidity and CO\(_2\). Also three time series plots were introduced to display the data (temperature, humidity and CO\(_2\)) over a longer period of time, typically 28 days. However, many farmers found this dashboard difficult to understand and deduce appropriate actions. To overcome this difficulty, it was decided to extend the dashboard with linguistic summaries.

Through discussions with the domain experts and a selected group of farmers (users) a few decisions were made. Firstly, it was decided to use only simple type protoforms, as they can provide the overview on the past. Moreover it was decided to create linguistic summaries about inside humidity, CO\(_2\), and daily range of inside temperatures (maximum-minimum), as the variations in the temperature in the barn is more important that the actual values. Thirdly, three linguistic labels for each variable and five quantifiers were defined and validated. Their membership functions are depicted in Figs. 1 and 2, respectively. The quantifiers has been designed in such a way, for a given argument only one of them has a value of membership function higher than 0.7. This means that at most only one summary will be generated for a summarizer. By this, we will not present to the user both summaries as e.g. “Almost all CO\(_2\) levels are too high” and “Most CO\(_2\) levels are too high”, as the second sentence is obsolete. Also each of the variables has desired values and undesired values. Temperature range and CO\(_2\) has undesired values only on one end, while humidity has undesired values on both ends. At last it was decided also to display at most two summaries per variable for comprehensibility purposes [27].

Fig. 1.
figure 1

Membership functions for the linguistic variables of temperature range, humidity and CO\(_2\).

Also, the farmers indicated that they want to see information that triggers them for an action. Therefore linguistic summaries like “almost all CO\(_2\) values are OK” they don’t find relevant and useful.

4 Method for Selecting Relevant Linguistic Summaries

The input from the domain experts and the farmers was used to propose a method that determined which of the true summaries can be considered as relevant and presented to the users. The most important feature of this case is that there is a desired state and users are only interested when things are not as expected. When considering those two requirements we can distinguish two cases:

  • desired case - when most (if not all) data are in the desired (ok) range

  • undesired case - when majority of the data is not in the desired (ok) range

In order to develop the method that selects relevant summaries, some additional information from the users is needed. One of them is to rank of the linguistic labels for each variables. For the one tailed variables, like CO\(_2\) levels, this is rather trivial task, the more undesired the value, the higher priority has the linguistic label. For the two tailed variables, like the humidity, where desired values are in the middle of the domain, setting the priority can be more challenging, as both extreme values can have same priorities or one of them is more important than the other. We assume that priority of the qualifiers is positively correlated with the amount they describe, i.e. the bigger amount the qualifier describes, the higher the priority.

Fig. 2.
figure 2

Membership functions for quantifiers.

Second question is about what is more important to the user: summarizer or qualifier. For instance, consider the CO\(_2\) level. The question is which sentence is more useful to the user:

  • certain number of the CO\(_2\) values is slightly high

  • a few of the CO\(_2\) values is too high

In the considered case all farmers have indicated the second summary as more useful, giving the higher weight to the summarizer than qualifier.

The proposed method consist of 5 steps:

  • Step 1: Calculate all summaries with truth value \(\mathcal{T}>\theta _T\). We assumed \(\theta _T\) of 0.7. For each linguistic value at most three summaries will be created.

  • Step 2: Classify case as desired or undesired. This is done by looking at the summary with the biggest quantifier, in our case a lot, most or almost all. If the summarizer’s label is a “desired” one, we consider a case as desired, otherwise it is undesired.

  • Step 3: Prune all summaries with “desired” linguistic label. In our case, we used “OK” as “desired” linguistic label for all linguistic variables.

  • Step 4: Order the summaries according to the priority. In our case this is according to the importance of the summarizer label.

  • Step 5: Display summaries: maximally one for desired case and maximally two for undesired case.

5 Results

We will show now several examples of data with linguistic summaries obtained. Most of those examples are real data obtained from farms in the Netherlands from February 2018 to February 2019. A few dummy examples are shown to demonstrate extreme cases. We divided those examples in two groups:

  • type 1: undesired case

  • type 2: desired case

5.1 Type 1: Undesired Case

In this subsection we show three examples that we classified as undesired case. We defined the case as undesired, when most of data is not in the desired range.

The first example is depicted in Fig. 3 and shows daily range of temperature. Please note that desired values are on one extreme. Only a few data points can be described as ok. In this case two summaries are presented to the user:

  • certain number (30%-50%) of differences in temperature during the day were too high (6 \(^\circ \)C or more),

  • certain number (30%-50%) of differences in temperature during the day were slightly high (3–5 \(^\circ \)C).

The first of them is about too high differences in temperature, since this linguistic label has the highest priority than the others, as indicated by the farmers.

Fig. 3.
figure 3

Example: daily range of temperature – an undesired case

Another example (Fig. 4) depicts the levels of CO\(_2\) in a barn. Here again two summaries were obtained:

  • Few (5%–35%) of the CO\(_2\) values were too high (3200 ppm or more),

  • A lot (55%–75%) of the CO\(_2\) values were slightly high (2600–3200 ppm).

Even more that there are many more observations with only slightly high CO\(_2\) values, farmers indicated that the ordering should be done based on the importance of the summarizer, rather than qualifier.

Fig. 4.
figure 4

Example: CO\(_2\) values – an undesired case

Fig. 5.
figure 5

Example: humidity values – an undesired case (dummy data)

The last example (Fig. 5) for this type of undesired cases is when only one summary is found relevant to be shown to the user.

  • Most (80%–95%) of the humidity values were too low (below 40%).

Please note, that this data are dummy data, created for the completeness purpose.

5.2 Type 2: Desired Case

In this subsection we show two examples that we classified as desired case, that is when most of data is in the desired range.

The first example (Fig. 6) shows the shows daily range of temperature, with majority of values being ok.

Fig. 6.
figure 6

Example: daily range of temperature – an desired case

Hence, in this case only a summary shown below is presented to the user.

  • few (5%–25%) of differences in temperature during the day were slightly high (3–5 \(^\circ \)C).

The other example (Fig. 7) depicts CO\(_2\) levels. All the values were in the OK range, hence in this case no summary is presented to the user.

Fig. 7.
figure 7

Example: CO\(_2\) levels – an desired case

5.3 Evaluation with the Users

We have evaluated this method with farmers. First the farmers were using the dashboard for two weeks, next they were asked to fill the questionnaire. Only 12 farmers has filled the questionnaire. The questionnaire was based on Technology Acceptance Model (TAM) [25]. TAM assess four aspects: perceived ease of use, perceived usefulness, satisfaction and intention to use [26]. We have used the following 14 statements evaluated on the 7 point Likert scale:

  • Q1: Using LSs enable farmers to get information from the barn quickly.

  • Q2: Using LSs saves time of the farmers.

  • Q3: Using LSs improves my tracking efficiency of the barn.

  • Q4: Using LSs improves performance of the barn.

  • Q5: It is clear and understandable what are the LSs.

  • Q6: I find easy to read the LSs.

  • Q7: The words used in the LSs are easy and adequate.

  • Q8: I find good that the system calculates automatically the LSs.

  • Q9: I am completely satisfied with having sentences in the dashboard.

  • Q10: I am completely satisfied with the LSs that were presented.

  • Q11: I feel confident on using the LSs presented.

  • Q12: I can accomplish adjustments to barn quickly using LSs.

  • Q13: I will use LSs for tracking variables of the barn.

  • Q14: I would recommend to use LSs to other farmers.

The results of the survey are shown in Fig. 8.

Fig. 8.
figure 8

Average scores for the results of TAM survey

The statements with the lowest score were “using LSs improves performance of the barn” from perceived usefulness aspect and “I would recommend to use LSs to other farmers” from intention to use. Analyzing the different scores of each farmer and the comments they did, it can be explained that these scores were low because according to some of them, it was expected that the linguistic summaries showed an alert instead of showing a summary of the past. Moreover farmers are reluctant to tell other farmers what to do.

On the other hand, the highest scores were for “using LSs enable farmers to get information about the barn quickly” from perceived usefulness; “I feel confident on using the LSs presented” from satisfaction; and “I will use LSs for tracking the variables of the barn” from intention to use aspect. This shows that, in general, farmers trust in the system and believe that linguistic messages are a good tool for making decisions in the barns.

When analyzing the factors in general, ease of use was the best characteristic of LSs, whereas perceived usefulness was the lowest score among the factors, but not with a significant difference. These results showed that linguistic messages are an appropriate solution for the farmers as they find it easy to use, but there is still more room for improvement on usefulness for the information provided.

To sum up, linguistic summaries are a good tool for the farmers; however, the linguistic texts that farmers expected were more about an alerting system, rather than a summarization system, which means that for the future, there is still some potential for extending the linguistic summaries with messages that not only describe the past, but support also decision making.

6 Concluding Remarks

We have analyzed and described an application of linguistic summaries in the agro-food domain. We have focused on the relevance aspect. We determine which linguistic summaries are useful and appropriate for target users through the interviews. The designed method was implemented and tested with the users. The results of a TAM survey indicated that linguistic summaries allow farmers to understand quickly the past performance, yet the farmers would appreciate also an alert functionality and more support in decision making. In future work we will extend the linguistic summaries with the suggestions of the users, as well as work on the additional case studies to generalize the approach. This should lead to a new quality measure quantifying relevance of a linguistic summary.