Putting the crowd to work in a knowledge-based factory
Introduction
We present Crowdsourcing as a tool to facilitate machine intelligence in a knowledge-based factory. Our approach acknowledges the outstanding capacity of the human brain but, rather than trying to understand or mimic the complexity of its cognitive processes, proposes that human intelligence be employed directly at the critical steps where machine intelligence cannot match human performance.
A direct consequence of this methodology is that rather than systems requiring, say, rule-bases, inference-engines, fitness functions or databases of “case” examples, the focus becomes the construction of queries for the crowd and the development of statistical methods for the aggregation of their responses. Although this scenario (i.e. where Crowdsourcing provides the reasoning functions traditionally carried out by AI subsystems) will change the nature of individual software components, the overall architecture of problem solving systems will, in many cases, be unchanged. But Crowdsourcing offers the opportunity to do more than simply provide a neat on-line interface to human reasoning and judgement: it offers the opportunity to discover effective problem solving strategies. By definition, “machine intelligence” systems are created by programmers who have encoded problem solving strategies in the software. So while emergent behaviours are occasionally observed, we would argue that for the most part AI systems solve problems exactly as they have been designed to. This is both their limitation and their strength.
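To make the aggregation step concrete, the following is a minimal sketch of majority-vote aggregation over worker responses. The (workerID, label) response format and the example data are illustrative assumptions, not the statistical methods actually employed in the studies reported below.

from collections import Counter

def aggregate_responses(responses):
    # responses: list of (worker_id, label) pairs for a single query.
    # Returns the majority label and the fraction of workers who agreed.
    votes = Counter(label for _, label in responses)
    label, count = votes.most_common(1)[0]
    return label, count / len(responses)

# Example: three workers classify the same CAD part (illustrative data).
answers = [("W1", "bracket"), ("W2", "bracket"), ("W3", "flange")]
print(aggregate_responses(answers))  # ('bracket', 0.666...)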
Crowdsourcing, in contrast, solves problems, cleans up data, classifies content, selects options, creates new content and performs many other tasks using strategies that appear ‘opaque’ to the user. But because of the digital nature of the activity, there is an opportunity to record, observe and assess the problem solving strategies of many individuals in a way that would be extremely difficult in any other scenario. To illustrate this, Section 3 of the paper presents early results of Crowdsourced part nesting in which the results not only improve upon those generated by commercial CAM systems but could also, potentially, provide insights into how automated systems could be improved.
Finally, crucial to any industrial use of Crowdsourced “human intelligence” is the constant availability of a sufficient quantity, and quality, of on-line workers. Regardless of the task undertaken, our results suggest that the Internet is now sufficiently large and well distributed globally that commercial Crowdsourcing sites can easily provide results in only a few hours on a 24/7 basis.
This paper is divided into five sections: this introduction continues with a brief overview of machine reasoning in the knowledge-based factory and a description of the emerging technology of Internet Crowdsourcing. The next section (Section 2) describes how Crowdsourcing has been used to provide reasoning for a 3D content based retrieval system whose overall architecture is analogous to “traditional” AI applications for machine vision or speech recognition. The following section (Section 3) illustrates the opportunity for Crowdsourcing to contribute insights into problem solving strategies (in this case 2D shape nesting) as well as results. Finally, a discussion (Section 4) summarises the potential applications and the challenges of Crowdsourcing industrial tasks, before conclusions are drawn in Section 5.
Machine intelligence has been used to support industrial processes that range from computer vision for robotics to creative design. Duffy [1] presents six main machine learning techniques: agent-based learning, analogical reasoning, induction methods, genetic algorithms, knowledge compilation, and neural networks. A common element of all these methods is the need to “inform” or “teach” the system using databases of examples.
For example, analogical reasoning (i.e. finding solutions to problems by retrieving knowledge from previous experiences), induction methods (i.e. where knowledge is generated by the amalgamation of similar data and its analysis to obtain a classification), genetic algorithms (i.e. where new concepts are generated by the cross-over or ‘mutation’ of previous ones), knowledge compilation (i.e. the simplification of knowledge into more fundamental forms so that it can be reused in other situations) and neural networks (i.e. where a machine executes a learning mechanism analogous to the human brain’s by training on example data) are all strategies that could be employed in conjunction with Crowdsourced databases of examples to create a true ‘knowledge-based’ factory; a sketch of the simplest such combination follows this paragraph.
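As an illustration of how a crowdsourced example database might feed one of these strategies, here is a minimal nearest-neighbour (analogical/inductive) sketch; the feature vectors, distance metric and labels are all assumptions made for illustration.

def nearest_label(case, examples):
    # examples: list of (feature_vector, crowd_label) pairs collected
    # from workers; classify a new case by its closest labelled example.
    dist = lambda a, b: sum((x - y) ** 2 for x, y in zip(a, b))
    return min(examples, key=lambda ex: dist(case, ex[0]))[1]

# Illustrative crowd-labelled database of part descriptors.
crowd_db = [([0.1, 0.9], "sheet-metal"), ([0.8, 0.2], "machined")]
print(nearest_label([0.2, 0.8], crowd_db))  # 'sheet-metal'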
Currently, when embedded in an application, AI technologies (such as those listed above) are rarely able to work on raw data (e.g. documents, audio or image files etc.). More typically the data is analysed to identify features, or characteristics, which form the ‘language’ of the reasoning system (Fig. 1).
In a machine vision system, the features might be “edges” identified in a .jpeg image; in speech recognition software, signal processing is used to identify phonetic patterns; and in 3D CAM, depressions are identified on geometric CAD models prior to process planning. This model is unchanged by Crowdsourcing technology, and the following sections demonstrate how micro-outsourcing to the Internet can provide the functionality for both the ‘Feature Recognition’ and ‘Reasoning’ stages that are characteristic of so many problem solving architectures.
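The two-stage architecture of Fig. 1 can be summarised in a short sketch in which either stage may be implemented by an algorithm or delegated to the crowd; the toy extraction and classification functions below are purely illustrative, not real feature extractors.

def pipeline(raw, extract, reason):
    # Fig. 1 architecture: raw data -> features -> decision. Either
    # `extract` or `reason` could be an algorithm or a crowd query.
    features = extract(raw)
    return reason(features)

# Algorithmic stand-ins (toy examples for illustration only):
extract_edges = lambda pixels: [p for p in pixels if p > 128]
classify = lambda feats: "part" if len(feats) > 2 else "noise"

print(pipeline([30, 200, 210, 190], extract_edges, classify))  # 'part'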
The term ‘crowdsourcing’ was coined by Jeff Howe in 2006 as “the act of a company or institution taking a function once performed by employees and outsourcing it to an undefined (and generally large) network of people in the form of an open call” [2]. These activities are executed by people who do not necessarily know each other, and interact with the company, the ‘requester’, via virtual tools and an internet connection. They become ‘the workers’: they can access tasks, execute them, upload the results and receive various forms of payment using any web browser. This is a labour market open 24/7, with a diverse workforce available to perform tasks quickly and cheaply.
The crowdsourcing platform used in the investigations reported here was Amazon’s mTurk (www.mturk.com), which was selected because of the large number of workers available, although it should be noted that there are several alternatives (e.g. HumanGrid: http://www.humangrid.de and Crowdflower: crowdflower.com).
As shown in Fig. 2, the ‘requesters’ both design and post tasks for the Crowd to work on. In mTurk, tasks given to the ‘workers’ are called ‘HITs’ (Human Intelligence Tasks). Requesters can test workers before allowing them to accept tasks and so establish a baseline performance level for prospective workers. Requesters can also accept, or reject, the results submitted by the workers, and this decision impacts on the worker’s reputation within the mTurk system. Payments for completed tasks can be redeemed as ‘Amazon.com’ gift certificates or alternatively transferred to a worker’s bank account. Details of the mTurk interface design, how an API is used to create and post HITs and a description of the workers’ characteristics are beyond the scope of this paper but can be found (along with further details of the experimental results) in [3], [4]. With each result submitted by a worker, the requester receives an answer that includes various information about how the task was processed. One element of this data is a unique “workerID” allowing the requester to distinguish between individual workers. Using this “workerID” it is possible to analyse how many different HITs each worker completed.
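For readers who want a feel for this workflow, the following is a hedged sketch of posting a HIT and tallying completions per workerID using the present-day boto3 MTurk client (not the API used in the original studies); the question XML file, reward and HIT parameters are placeholder assumptions.

import boto3
from collections import Counter

mturk = boto3.client("mturk", region_name="us-east-1")

# Placeholder ExternalQuestion/HTMLQuestion XML describing the task form.
question_xml = open("classify_part.xml").read()

hit = mturk.create_hit(
    Title="Classify a 3D CAD part",
    Description="Look at the rendered part and pick the closest category.",
    Reward="0.05",                    # USD per assignment (illustrative)
    MaxAssignments=5,                 # five independent workers per part
    LifetimeInSeconds=24 * 3600,      # HIT visible for one day
    AssignmentDurationInSeconds=600,  # ten minutes per attempt
    Question=question_xml,
)

# Each returned assignment carries a unique WorkerId, so the requester
# can count how many HITs each worker completed.
resp = mturk.list_assignments_for_hit(HITId=hit["HIT"]["HITId"])
per_worker = Counter(a["WorkerId"] for a in resp["Assignments"])
print(per_worker.most_common())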
A definitive classification of Crowdsourcing tasks has not yet been established; however, Corney et al. [5] suggest three possible categorisations based upon: the nature of the task (creation, evaluation and organisation tasks), the nature of the crowd (‘expert’, ‘most people’ and ‘vast majority’) and the nature of the payment (voluntary contribution, rewarded at a flat rate and rewarded with a prize). Similarly, Crowdsourcing practitioners such as Chaordix (from Cambrian House [6]) describe Crowdsourcing models as a Contest (i.e. individuals submit ideas and the winner is selected by the company, ‘the requester’), a Collaboration (i.e. individuals submit their ideas or results, the crowd evolves the ideas and picks a winner) or Moderated (i.e. individuals submit their ideas, the crowd evolves those ideas, a panel appointed by ‘the requester’ selects the finalists and the crowd votes on a winner). In the last few years academics across many different disciplines have started reporting the use of Internet Crowdsourcing to support a range of research projects, e.g. social network motivators [7], relevance of evaluations and queries [8], [9], and accuracy in judgement and evaluations [10]. Despite this activity, few industrial applications of Crowdsourcing have been reported, and this gap in the literature motivated the authors to undertake the studies into 3D search and 2D part nesting reported in the following sections.
Section snippets
3D Search case study
At present, 3D models (e.g. engineering drawings) are indexed by alphanumeric ‘part-numbers’ with a format unique to each individual organisation. Although this indexing system works well in the context of on-going maintenance and development of individual parts, it offers little scope for ‘data-mining’ (i.e. exploration) of an organisation’s inventory of designs. In addition to the sourcing of parts, the application of a 3D similarity matching algorithm to large collections of parts would …
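To make the retrieval idea concrete, the following minimal sketch ranks an inventory of part numbers by descriptor similarity to a query shape; the cosine measure and the toy descriptor vectors are illustrative assumptions, not the matching algorithm used in the study.

import math

def cosine(a, b):
    # Cosine similarity between two shape-descriptor vectors.
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b)))

def rank_inventory(query, inventory):
    # Return part numbers ordered by similarity to the query descriptor.
    return sorted(inventory, key=lambda pn: cosine(query, inventory[pn]), reverse=True)

# Toy descriptors keyed by organisation-specific part numbers.
parts = {"AX-100": [1, 0, 2, 5], "AX-101": [1, 0, 2, 4], "BZ-778": [9, 3, 0, 0]}
print(rank_inventory([1, 0, 2, 5], parts))  # ['AX-100', 'AX-101', 'BZ-778']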
Crowdsourcing 2D part nesting
Other AI reasoning applications, such as planning, require very different approaches from those used in classification or recognition problems. Typically, networks of constraints are constructed and the “intelligence”, or problem solving strategy, is embedded in the algorithm used to navigate this graph [19]. Crowdsourcing offers the possibility of solving planning, and other combinatorially explosive, problems using distributed human labour. To investigate this possibility, we created an experiment …
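As a point of comparison for the crowd’s strategies, here is a toy algorithmic baseline for 2D nesting: first-fit decreasing “shelf” packing of rectangles onto a fixed-width sheet. It is an illustrative stand-in for the commercial CAM algorithms mentioned, not the method evaluated in the study.

def shelf_pack(rects, sheet_width):
    # rects: list of (width, height); returns total sheet height used.
    shelves = []  # each shelf: [remaining_width, shelf_height]
    for w, h in sorted(rects, key=lambda r: r[1], reverse=True):
        for shelf in shelves:
            if shelf[0] >= w:        # part fits on an existing shelf
                shelf[0] -= w
                break
        else:                        # otherwise open a new shelf
            shelves.append([sheet_width - w, h])
    return sum(height for _, height in shelves)

print(shelf_pack([(4, 3), (3, 3), (2, 2), (2, 1)], sheet_width=6))  # 6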
Discussion
In this paper, we have shown how Crowdsourcing can be used to carry out a variety of geometric reasoning tasks. This is a powerful testament to the flexibility of the approach presented here. Although the design and coding required to implement the HITs was not trivial, it was considerably easier than the development of new algorithms for machine cognition and learning. The crucial aspect was to formulate the right question and carefully consider the instructions provided to the mTurk workers.
In …
Conclusions
Examples of crowdsourced work for various mechanical CAD/CAM applications have been presented in this paper. Beyond simply establishing that the approach produced surprisingly good results, we learnt that it was important to present the “right question” to the crowd. Each sophisticated job was therefore simplified into several steps to ensure clarity and comprehension by the workers. Interestingly, as with Crowdsourcing applications in other domains, we have shown that preparation work (e.g. best …
References (25)
- et al., A solid modeling library for the World Wide Web, Computer Networks and ISDN Systems (1998).
- et al., Automated design of sheet metal punches for bending multiple parts in a single setup, Robotics and Computer-Integrated Manufacturing (2001).
- et al., Retrieving 3D CAD model by freehand sketches for design reuse, Advanced Engineering Informatics (2008).
- et al., Developing an engineering shape benchmark for CAD models, Computer-Aided Design (2006).
- et al., Models for production planning under uncertainty: a review, International Journal of Production Economics (2006).
- et al., Models and algorithms for three-stage two-dimensional bin packing, European Journal of Operational Research (2007).
- The “What” and “How” of learning in design, IEEE Expert: Intelligent Systems and Their Applications (1997).
- J. Howe, The rise of Crowdsourcing, Wired, issue 14.06 (2006), http://www.wired.com/wired/archive/14.06/crowds_pr.html.
- P. Jagadeesan, J. Wenzel, et al., Geometric reasoning via Internet CrowdSourcing, in: 2009 SIAM/ACM Joint Conference on …
- P. Jagadeesan, J. Wenzel, et al., Validation of Purdue engineering shape benchmark clusters by Crowdsourcing, in: …
- Outsourcing labour to the cloud, International Journal of Innovation and Sustainable Development.