An evaluation and annotation methodology for product category matching in e-commerce

doi:10.1016/j.compind.2021.103497

Computers in Industry

Volume 131, October 2021, 103497

https://doi.org/10.1016/j.compind.2021.103497 Get rights and content

Highlights

•
Product category matching is an important task in digital marketplaces and e-commerce.
•
This paper motivates, describes and formalizes the problem of product category matching.
•
The paper also presents a rigorously designed methodology and guidelines for acquiring reliable and cost-effective annotations for this task.
•
The utility of all methods presented is validated on three real-world e-commerce taxonomies.

Abstract

Product category matching is an important task in digital marketplaces and e-commerce, helping to power better search and recommendations in an online context. While variants of the problem have received some attention in academia, there is no documented guidance on how to efficiently acquire annotations for evaluating multiple (current and future) models, many of which rely on modern machine learning techniques such as neural representation learning. In this paper, we motivate and formalize the problem of product category matching in e-commerce, and present a rigorously designed set of guidelines and methodology for acquiring annotations in a cost-effective and reliable manner. We also present a methodology for using the annotations to compare solutions of two or more product category matching methods, including comparing models both before and after annotation. Three widely used e-commerce product category taxonomies, and multiple metrics, are used to demonstrate the utility of our proposals.

Introduction

The last decade has witnessed the rapid rise of e-commerce, including e-commerce marketplaces and platforms (such as eBay and Amazon) but also the adoption of e-commerce technologies by traditional retailers like Walmart and Target (Krishnamurthy, 2004, Chaffey, 2007, Hänninen et al, 2018, Mandel, 2017). In online marketplaces, e-commerce platforms, and even media relating thereof (e.g., product reviews and influencer blogs), product category matching between two independent webpages or platforms is a practical problem for users, advertisers and aggregators of information. While we define the problem formally in Section 2, Fig. 1(a) illustrates an intuitive example. One website (Walmart) may be talking about ‘runner rugs’, while another (Target) refers to the same concept as ‘runners’. The burden is on the user to find different mentions of the same product category through more intensive search (e.g., by posing different keywords in a search engine) or to be limited to the results that show up as relevant for a specific search phrase, even though a better product (described using a different phrase) or price may be available elsewhere.

In our own experience, we have found that there are several practical reasons why product category matching is an important problem, especially for media companies relying on advertising dollars. One reason is that when users are browsing media websites, including blog and product review sites, they may be exposed to a particular product that they would either like to immediately purchase, or research further for a future purchase. Linking the product category mentioned in the media post to retailers’ product webpages, many of whom may not refer to it in the same way (as illustrated above in the Walmart-Target example), is clearly valuable.

A key aspect of the problem is its domain-specific nature. In the e-commerce domain, product categories are arranged in a taxonomy, and along with the product category label, the ‘path’ leading to it from the root of the taxonomy is also an important structured attribute.¹ A fragment of this taxonomy is illustrated in Fig. 1(b) for both websites. We formally define a taxonomy in Section 2. Because of this structure, classic ‘unstructured’ solutions such as ‘string matching’ were found to be too noisy to be useful even in preliminary experiments (Ukkonen, 1985, Navarro, 2001). Instead, we hypothesize that techniques that take both the label and the path into account may be more successful in determining when two concepts match, compared with solutions that rely only on the label. Through experimental results, we show that a method that takes the structure of the taxonomy into account when matching concepts between two taxonomies indeed performs better than one that only takes labels into account. Specific contributions are enumerated below:

•
First, we formalize and define the problem of product category matching, especially as it applies to the structured version of the problem that we intuitively described through the Walmart-Target example above.
•
Second, we present a rigorous set of guidelines for acquiring annotations for the product category matching problem in an efficient, cost-effective and reliable manner. Following an experimental study, we also present feedback from actual annotators that may allow further customization and task-specific refinement of these guidelines in other enterprises.
•
Third, we present a methodology for using the acquired annotations to evaluate two or more candidate solutions for product category matching. Within e-commerce, such evaluations have been conducted either behind closed doors, or using task-specific measures and datasets that may not have validity beyond that particular enterprise.² In contrast, we present a clear and replicable description of our experimental findings and evaluation methodology.
•
Finally, using three widely used e-commerce product category taxonomies, we conduct an experimental study to evaluate two candidate solutions inspired by recent progress in representation learning. We also use the acquired annotations to evaluate models that may be developed and proposed after annotations have been collected.

Section snippets

Problem definition and research goals

The most fine-grained unit under consideration in this article is that of a concept. Concepts are fundamental components of ontologies (Fensel, 2001), and are equivalently described as types or collections of instances. However, because of the domain-specific nature of this article, we assume a less abstract definition of concepts as product categories, defined below.

Definition (Product Category): A product category is defined as an attribute of a product, representing its type. Every product

E-commerce taxonomies

We consider three taxonomies as the primary materials in this paper: Google Product Taxonomy (GPT), PriceGrabber and Walmart. Key statistics are provided in Table 1. The GPT is a list of thousands of product categories designed by Google to uniformly categorize products in a shopping feed. It is publicly available at the following link⁵ and has undergone some updates in recent years. We use the latest version for the experiments.

Annotation task construction and guidelines

In this section, we describe both the annotation guidelines, as well as how the pre-annotation models are used to generate a set of candidate concept-pairs that are then annotated using a 4-point scale (excellent, good, fair and bad). In Section 3.1 we mentioned that there are nine pairs of query-response datasets and two pre-annotation models: L₁ (Pre.) and L₂ (Retro.). In keeping with the terminology introduced in Section 2, we refer to the ‘source’ taxonomy from which the queries are issued

Findings

Using the proposed methodology, data and language representation models, we obtained a total of 4101 annotations from 25 unique editors, with each annotation comprising a ‘label’ from the 4-point scale expressed in the guidelines. For each label, we compute a standard set of statistics, expressing it as a box plot in Fig. 2. We find that the range broadens as the quality implied by the label worsens. As we will discuss shortly, the occurrence of ‘bad’ labels primarily stem from Pre. best-match

Annotator feedback and discussion

Earlier, we had detailed the annotation guidelines, and the methodology for constructing the nine annotation-tasks (each representing a ‘dataset pair’). Following the annotations, we sent a survey to each annotator to obtain valuable feedback on the task, since, as an annotation exercise, the task is relatively novel compared to other such exercises in the Web and AI literature (such as rating a webpage as relevant in response to a query).

In response to the post-annotation question, what did

Related work

The product category matching problem is related to several strands of research, as described below.

Product Recommendations and E-Commerce. The primary application domain in this paper was e-commerce. Recently, there has been an enormous growth in the e-commerce research literature in several computational communities (Park and Chu, 2009, Xiao and Benbasat, 2007, Goy et al, 2007, Ito et al., 2002). Unsurprisingly, the HCI community is no stranger to this domain, and even beyond the research

Conclusion and future work

Product category matching is an important problem that shows up in various guises in online marketplaces and digital commerce. In this paper, we described and formalized the problem, while presenting rigorous methodological solutions for addressing the problem in different contexts. We presented and described a key set of annotation guidelines both for acquiring annotations efficiently, and for using the annotations to conduct evaluations and analyses along multiple dimensions. Using three

Author statement

Nicolas Torzec: Conceptualization, Methodology, Supervision.

Chien-Chun Ni: Data curation, Methodology.

Ke Shen: Software, Visualization, Investigation, Validation, Writing – Reviewing and Editing.

Mayank Kejriwal: Writing – Original draft preparation, Conceptualization, Methodology, Supervision, Writing – Reviewing and Editing.

Conflicts of interest

The authors declare no conflicts of interest.

Declaration of Competing Interest

The authors report no declarations of interest.

References (59)

P. Bille
A survey on tree edit distance and related problems
Theor. Comput. Sci.
(2005)
C. He et al.
Interactive recommender systems: a survey of the state of the art and future research challenges and opportunities
Expert Syst. Appl.
(2016)
Y.S. Kim et al.
Development of a recommender system based on navigational and behavioral patterns of customers in e-commerce sites
Expert Syst. Appl.
(2005)
B. Lika et al.
Facing the cold start problem in recommender systems
Expert Syst. Appl.
(2014)
E. Ukkonen
Algorithms for approximate string matching
Inf. Control
(1985)
A. Akbik et al.
Contextual string embeddings for sequence labeling
Proceedings of the 27th International Conference on Computational Linguistics
(2018)
F. Almeida et al.
Word Embeddings: A Survey
(2019)
Z. Bellahsene et al.
On evaluating schema matching and mapping
Schema Matching and Mapping
(2011)
T.B. Brown et al.
Language Models are Few-Shot Learners
(2020)
D. Chaffey
E-business and E-commerce Management: Strategy, Implementation and Practice
(2007)

L. Chen et al.

Recommender systems based on user reviews: the state of the art

User Model. User-Adapt. Interact.

(2015)

J. Devlin et al.

Bert: Pre-Training of Deep Bidirectional Transformers for Language Understanding

(2018)

X.L. Dong

Challenges and innovations in building a product knowledge graph

Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

(2018)

M. Faruqui et al.

Retrofitting Word Vectors to Semantic Lexicons

(2014)

D. Fensel

Ontologies

(2001)

T. Fountain et al.

Taxonomy induction using hierarchical random graphs

Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

(2012)

A. Goy et al.

Personalization in e-commerce applications

The Adaptive Web

(2007)

N.N. Group

Ecommerce User Experience

(2020)

A. Gupta et al.

Taxonomy induction using hypernym subsequences

Proceedings of the 2017 ACM on Conference on Information and Knowledge Management

(2017)

M. Hänninen et al.

Digitalization in retailing: multi-sided platforms as drivers of industry transformation

Balt. J. Manag.

(2018)

D. Harman

Information retrieval evaluation

Synth. Lect. Inf. Concepts Retr. Serv.

(2011)

J. Hollander, M. Schlesinger, Shared annotation system and method, US Patent App. 10/936,788 (May 24...

T. Ito et al.

A group-buy protocol based on coalition formation for agent-mediated e-commerce

IJCIS

(2002)

A. Joulin et al.

Bag of Tricks for Efficient Text Classification

(2016)

N. Kim et al.

A study on the law2vec model for searching related law

J. Digit. Contents Soc.

(2017)

Y.S. Kim

Recommender system based on product taxonomy in e-commerce sites

J. Inf. Sci. Eng.

(2013)

B.P. Knijnenburg et al.

Explaining the user experience of recommender systems

User Model. User-Adapt. Interact.

(2012)

S. Krishnamurthy

A comparative analysis of ebay and amazon

Intelligent Enterprises of the 21st Century

(2004)

X.N. Lam et al.

Addressing cold-start problem in recommendation systems

Proceedings of the 2nd International Conference on Ubiquitous Information Management and Communication

(2008)

Cited by (11)

E-fulfillment cost management in omnichannel retailing: An exploratory study
2024, Computers in Industry
The purpose of this study is twofold: investigating how omnichannel (OC) retailers manage e-fulfillment costs and establishing how these costs relate to the evolution of OC retailers' e-fulfillment strategies. Experts in e-fulfillment from 34 European OC retailers across various sectors participated in an exploratory survey. The study's results reveal that although e-fulfillment costs significantly influence the evolution of e-fulfillment strategies, many OC retailers fulfilling online orders from retail stores or traditional warehouses remain unaware of the actual costs of e-fulfillment. Activities other than picking and last-mile delivery, such as inbound logistics and storage, are poorly controlled. Furthermore, complex cost metrics such as cost-to-serve—the total cost associated with delivering a specific order to a specific customer—are predominantly found among OC retailers operating fulfillment centers (FCs) in their e-fulfillment distribution networks. This underscores the need for all OC retailers to accurately assess e-fulfillment costs at multiple levels, which will be crucial for optimizing order preparation, tailoring pricing strategies, and achieving profitability, especially when operating hybrid e-fulfillment strategies where online orders are prepared in multiple facilities. As the largest study on e-fulfillment costs to date, it highlights the importance of advancing e-fulfillment cost management systems among OC retailers and adopting an approach that encompasses all e-fulfillment activities. Future research should delve into the key challenges of developing these systems, considering the operational realities of each OC retailer.
Medicine-Shelf matching strategy based on Bayesian convolutional neural network with fuzzy analytic hierarchy process
2023, Expert Systems with Applications
In pharmaceutical warehousing operations, a scientific and reasonable medicine and shelf matching strategy can improve medicine shelving efficiency and manual work efficiency. However, the traditional matching strategy has the problems of static matching and low matching efficiency, so this paper proposes the matching algorithm for drugs and shelves based on the fuzzy analytic hierarchy process and Bayesian convolutional neural network (FAHP-BCNN). First, we propose a drug-shelf matching degree model and discuss the influence of the dynamic matching process on the matching results by studying two parts: the attribute matching degree and the dynamic matching influence degree. Secondly, this paper quantitatively assessed the importance of the two matches by means of the fuzzy analytic hierarchy process. Finally, we mapped the massive matching information onto Bayesian convolutional neural network nodes, constructed a dynamic matching network model, and obtained competitive matching strategies and results. The experimental results show that the FAHP-BCNN algorithm exhibits better matching results in all three scenarios with different quantity ratios of drugs and shelf matching. Compared with the fixed cargo matching strategy of the ABC classification method, the FAHP-BCNN algorithm shows excellent performance in both the manual walking distance index and fatigue index. In summary, the drug-shelf matching algorithm based on FAHP-BCNN is effective and can provide theoretical support for pharmaceutical warehousing enterprises to perform drug-shelf matching.
Logistics distribution optimization: Fuzzy clustering analysis of e-commerce customers’ demands
2023, Computers in Industry
E-commerce customers’ demands for delivery services have become more personalized, diversified, and complex. In this paper, we conduct cluster analysis on the customer demand attributes resulting in a list of attributes including quantitative and qualitative expectations that can be relevant for creating efficient distribution routes taking into account the delivery time and customer satisfaction. A fuzzy clustering optimization method is elaborated for the treatment of above-mentioned customer attributes for distribution management in order to generate efficient delivery strategies. A case study from Shun-Feng (SF) International Express is used to demonstrate the effectiveness and practicability of the proposed method. The obtained results show that both customer satisfaction and the net profit of the enterprise have considerably increased due to an efficient distribution management.
E-commerce collaborative filtering recommendation method based on social network user relationship
2023, International Journal of Networking and Virtual Organisations
Named entity resolution in personal knowledge graphs
2023, Personal Knowledge Graphs (PKGs): Methodology, tools and applications
Automatic Semantic Typing of Pet E-commerce Products Using Crowdsourced Reviews: An Experimental Study
2023, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

View all citing articles on Scopus

View full text

An evaluation and annotation methodology for product category matching in e-commerce

Highlights

Abstract

Introduction

Section snippets

Problem definition and research goals

E-commerce taxonomies

Annotation task construction and guidelines

Findings

Annotator feedback and discussion

Related work

Conclusion and future work

Author statement

Conflicts of interest

Declaration of Competing Interest

Theor. Comput. Sci.

Expert Syst. Appl.

Expert Syst. Appl.

Expert Syst. Appl.

Inf. Control

Contextual string embeddings for sequence labeling

Proceedings of the 27th International Conference on Computational Linguistics

Word Embeddings: A Survey

On evaluating schema matching and mapping

Schema Matching and Mapping

Language Models are Few-Shot Learners

E-business and E-commerce Management: Strategy, Implementation and Practice

Recommender systems based on user reviews: the state of the art

User Model. User-Adapt. Interact.

Bert: Pre-Training of Deep Bidirectional Transformers for Language Understanding

Challenges and innovations in building a product knowledge graph

Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

Retrofitting Word Vectors to Semantic Lexicons

Ontologies

Ontologies

Taxonomy induction using hierarchical random graphs

Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

Personalization in e-commerce applications

The Adaptive Web

Ecommerce User Experience

Taxonomy induction using hypernym subsequences

Proceedings of the 2017 ACM on Conference on Information and Knowledge Management

Digitalization in retailing: multi-sided platforms as drivers of industry transformation

Balt. J. Manag.

Information retrieval evaluation

Synth. Lect. Inf. Concepts Retr. Serv.

A group-buy protocol based on coalition formation for agent-mediated e-commerce

IJCIS

Bag of Tricks for Efficient Text Classification

A study on the law2vec model for searching related law

J. Digit. Contents Soc.

Recommender system based on product taxonomy in e-commerce sites

J. Inf. Sci. Eng.

Explaining the user experience of recommender systems

User Model. User-Adapt. Interact.

A comparative analysis of ebay and amazon

Intelligent Enterprises of the 21st Century

Addressing cold-start problem in recommendation systems

Proceedings of the 2nd International Conference on Ubiquitous Information Management and Communication