1 Introduction

One of the major topics in the field of Information Retrieval is the automated optimization of retrieval effectiveness. New commercial search applications are in high public demand; search engines should therefore be equipped with techniques that process user queries extensively and yield good search results, such as Pseudo (or Blind) Relevance Feedback (PRF), among others.

PRF is an age-old method for improving retrieval effectiveness (Salton 1971; Croft and Harper 1979; Xu and Croft 1996). It is commonly a two-step process: the initial query is run against the collection, and information from the documents it retrieves is used to formulate and issue a better query. The recent literature on PRF attempts to produce new linguistic models to select terms from the retrieved documents or other external sources, e.g. (Jaleel et al. 2004; Tao and Zhai 2006; Lv and Zhai 2009b, 2010, 2014), or uses mathematical models to re-weight the chosen terms and reformulate the final query, e.g. (Singh et al. 2017; Valcarce et al. 2018).

PRF, in its classic form, involves three parameters: the number of top-ranked documents by the initial query that will be considered relevant so as to build a positive feedback model, the relative weight of the feedback model against the initial query, and the number of terms to keep in the improved query. In this paper, we deal with the automated optimization of such parameters. Based on the conjecture that the initial query’s contribution to the final query may not be necessary once a good model is built from pseudo relevant documents, we set out to optimize per query only the number of top-retrieved documents to be used for feedback.

The idea for the conjecture originates in (Arampatzis et al. 2000; Arampatzis 2001), where in an adaptive filtering context the authors introduced and employed initial query elimination/degradation. Quoting Arampatzis (2001): “The initial query is considered as carrying a worth of a certain number of relevant documents. As a result, the contribution of an initial query in training a classifier decreases with the number of relevant training documents.” Thus, as more and more training documents were becoming available during adaptive filtering, the contribution of the initial filtering query was gradually diminishing in adapting the classifier. The technique was applied successfully in TREC-9 and TREC-10 Filtering Tracks, assuming a worth of 10 and 2 relevant documents, respectively. We investigate this method at the far end of the spectrum by eliminating the initial query’s contribution altogether.

The contributions of the present study are the following. We explore the initial query elimination conjecture by arguing theoretically and investigating experimentally whether it also holds some truth in a PRF context. In this respect, we develop a PRF optimization method which disregards the initial query but builds a better positive feedback model by optimizing, per query, the number K of top-retrieved documents to be used for feedback. The optimization is based on several query performance predictors (QPPs) for the initial query, used as inputs to a linear regression model for predicting the optimal K. The machine learning pipeline of the linear regression model itself is also optimized using genetic programming via a tool which intelligently explores thousands of possible pipelines to find the best one for the data at hand. The approach requires training data, and while it may be computationally heavy in training, it is quite fast at query time.

Despite this interesting perspective, to the best of our knowledge, only a small number of previous studies have tried to solve this or a similar optimization problem in the context of PRF, e.g. (Sakai et al. 2005; Lv and Zhai 2009a; Parapar et al. 2014), and none of them used QPPs except Amati et al. (2004), who employed QPPs only to decide whether to apply PRF to a query or not. Over the years, query performance prediction has become an important research area, consisting of two primary methodologies, i.e. pre- and post-retrieval. The former studies the expected query performance before the retrieval takes place, i.e. using only the query and collection statistics. The latter also takes into consideration data produced by the retrieval, such as the result list (Markovits et al. 2012; Shtok et al. 2012). Since in PRF the initial query is always run, the latter methodology seems more suitable and is expected to be more beneficial.

The rest of the paper is organized as follows. Section 2 gives a brief overview of related works. Section 3 introduces and elaborates on the proposed method. Section 4 presents the experimental evaluation. Section 5 provides further discussion and insight, before conclusions are drawn and several directions for further research are pointed out in Sect. 6.

2 Related work

Over the years, there has been considerable interest in Query Expansion (QE). QE approaches are classified into two groups: global, which have as a primary goal the extraction of a set of terms from various data sources (external or internal, e.g. thesauri) to meaningfully augment the user’s original query, and local, which expand and re-weight the user’s original query with terms derived from the analysis of the result set. There is a large volume of studies describing the role of QE, focusing primarily on techniques to improve retrieval effectiveness, e.g. Mitra et al. (1998); Kekäläinen and Järvelin (1998); Crouch et al. (2002); Cronen-Townsend et al. (2002); Ruthven and Lalmas (2003); Abdelmgeid Amin (2008); Azad and Deepak (2019).

There are two ways to expand and/or re-weight the user’s original query with local techniques: relevance feedback and pseudo relevance feedback (PRF). A well-known method for relevance feedback is Rocchio's (1971), which is based on the vector space model; another primary study is that of Croft and Harper (1979), which takes a probabilistic approach.

Karisani et al. (2016) proposed a method to extract the most informative terms in a set of documents for PRF. A set of documents is retrieved using the user’s initial query, and then a weight is assigned to each document describing the document’s closeness to the user’s information need. These weights are used to recalculate the final query’s term weights. Experiments on standard English and Persian test collections showed MAP improvements of up to 7%.

Another method that achieved significant improvements in retrieval effectiveness was proposed by Singh et al. (2017). They explored the possibility of using a fuzzy logic-based QE approach to improve overall efficiency. The weights of each word were mixed using fuzzy rules to infer the weights of the additional query terms. Finally, after the fuzzy logic approach, they filtered out semantically irrelevant terms to further improve their results.

Lv and Zhai (2010) proposed a novel positional relevance model to reward terms close to the initial query terms in the feedback documents and avoid including irrelevant terms in the feedback model. Their proposed method is an extension of relevance models, so they set the parameters, such as the feedback coefficient, the number K of feedback documents, and the number of expansion terms, to fixed values. However, in order to check the robustness of the proposed method with regard to the K value, the authors also tried K values varying from 0 to 100. Their methods proved robust and effective compared to the standard relevance feedback models.

Parapar and Barreiro (2011) presented two different approaches for the Relevance Model (RM) (Lavrenko and Croft 2001), promoting terms under the Language Modelling framework to improve divergence in the PRF context. The first approach (KLD3) builds upon Kullback–Leibler Divergence-based query expansion in the language modelling framework. The second approach (RM3DT) is based on the Relevance Model with the promotion of Divergent Terms. The authors evaluated the performance of the proposed models on TREC collections; the RM3DT method outperformed the baseline Language Modelling (LM) retrieval model by 11–31%, and the RM3 feedback model by 0.5–23%.

Valcarce et al. (2018) examined the use of linear methods for PRF. They proposed the LiMe model, a novel formulation of the PRF task as a matrix decomposition problem. To expand the original query, they used a factorization that includes the computation of an inter-term similarity matrix. Also, for the proposed decomposition, they applied linear least squares regression with regularisation. The proposed LiMe-TF and LiMe-TF-IDF outperformed the LM (12–34%) and RM3 (0.6–5.5%) baselines on five TREC datasets. In both of the last-mentioned studies, the number K of feedback documents is tuned, per PRF model, based on training data, to the same fixed number (one of 5, 10, 25, 50, 75, 100) for all queries.

An important limitation of the aforementioned studies which use local QE techniques to improve retrieval effectiveness lies in the fact that the number K of pseudo-relevant documents is set to a specific fixed value for all queries (irrespective of whether it is optimized on some training set or not), with the most common being 5, 10, 20, 30, 50 (Raiber and Kurland 2014). Only a few researchers attempted to optimize, per query, the balance \(\alpha \) between initial query and feedback information (Lv and Zhai 2009a), or realized the importance of a good PRF document set (Sakai et al. 2005) or of K with respect to the query (Parapar et al. 2014). These constitute the most closely related works, which we review next.

Lv and Zhai (2009a) proposed three heuristics to adaptively predict the optimal balance between initial query and feedback information in PRF. To predict the balance coefficient \(\alpha \), several features were examined and combined by using a regression approach which led to robust and effective results compared with the regular fixed-coefficient feedback. In our study, we focus on the K parameter instead, eliminating \(\alpha \).

An attempt to adjust the number of pseudo-relevant documents per query was proposed by Sakai et al. (2005), called Selective Sampling. The method assumes that some of the initial top-ranked documents are not useful, so it skips those documents while it creates the set of pseudo-relevant documents S. Three parameters are introduced, \(P_{\mathrm {min}}, P_{\mathrm {max}}, P_{\mathrm {scope}}\), which are the minimum/maximum number of pseudo-relevant documents required and the total number of pseudo-relevant documents examined per query. The algorithm uses these three parameters as cutoffs, so that \(P_{\mathrm {min}} \le |S| \le P_{\mathrm {max}} \le P_{\mathrm {scope}}\), which were set via training with the NTCIR-3 Japanese test collection to 3, 10, and 50, respectively. They used 40 expansion terms which were down-weighted by a factor of 0.25 compared to the initial query terms. An evaluation on the NTCIR-4 Japanese/English test collection found that Selective Sampling outperforms traditional PRF methods almost as often as traditional PRF methods outperform Selective Sampling, i.e. effectively a tie. Our work is quite different: we do not skip documents or fix K to the same value for all queries but optimize it per query, and this proves clearly effective (as we will see in this paper).

A method (SDRM3) that tried to optimize the number of pseudo relevant documents per query was proposed by Parapar et al. (2014). The authors investigated the score distribution of the initial retrieval and tried to break it down to its relevant and non-relevant components; they formulated the problem as a threshold optimization task (similarly to what was proposed before, e.g., in Arampatzis et al. 2009) and evaluated the model’s performance on TREC collections. Significant improvements were found compared to the baseline LM retrieval model (8–17%). Although there were also improvements over the baseline (RM3) feedback model (2.3%), these were not statistically significant.

Thus, the study of Parapar et al. (2014) is the most related, as it tries to solve the same problem using the score distribution of the initial retrieval; however, there are still major differences. Firstly, in their approach, they use the training set to optimize the number of expansion terms and the balance coefficient (both with fixed values irrespective of the query), and the smoothing parameter; we eliminate the initial query and do not tune any other parameter. Secondly, while the mixture model of the relevant and non-relevant score distributions is tightly-coupled to the retrieval model employed (Arampatzis and Robertson 2011), our approach based on query performance predictors is—in principle—retrieval model invariant and certainly much faster at query time (recovering the parameters of a mixture model iteratively is much more expensive than calculating our predictors). Finally, we achieve larger improvements over the initial retrieval and over the baseline feedback model, as we will see later in our experiments.

Thus, the current study attempts to solve the optimization problem at hand by using a novel approach. To the best of our knowledge, this is the first time anyone explores query performance predictors (QPPs) to determine the optimal number of pseudo-relevant documents per query. Amati et al. (2004), who used QPPs only to determine whether or not to apply PRF to a query, used a fixed \(K=10\) when their method indicated that PRF should be applied; such a selective PRF is also included in our method, since we detect queries for which it would not be beneficial and switch it off. Moreover, our method can be used independently of the retrieval and PRF models, as QPPs can be calculated using the initial retrieval scores and a regression model can be built with relatively few training queries.

3 Optimizing pseudo relevance feedback

Let \(Q_0\) be the initial user query, expressed for some information need. Traditionally, pseudo relevance feedback (PRF) involves three parameters: the number K of top-ranked documents retrieved by \(Q_0\) to be considered as pseudo-relevant, the \(Q_0\)’s weight \(\alpha \) against the positive feedback query/model \(Q_{r,K}\) built from the K pseudo-relevant documents, and the number T of top-weighted feedback terms to be retained in the modified query \(Q_m\). Assuming vector representations for \(Q_0\), \(Q_{r,K}\), \(Q_m\), the modified query is calculated as:

$$\begin{aligned} Q_m = \alpha Q_0 + (1 - \alpha ) Q_{r,K} \, , \quad 0 \le \alpha \le 1 \, . \end{aligned}$$
(1)

Taking as an example Rocchio’s formula, it uses three weights: \(\alpha ,\beta ,\gamma \). Since there is no negative feedback in PRF, \(\gamma \) is set to zero or eliminated. Additionally, \(\beta = 1-\alpha \), since what matters practically is the relative weight of the contributions of \(Q_0\) and \(Q_{r,K}\) to \(Q_m\), i.e. there is a single free weight after all: \(\alpha \). Rocchio builds \(Q_{r,K}\) as the average pseudo-relevant document vector or centroid.
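To make Eq. 1 concrete, the following is a minimal sketch of Rocchio-style positive-only feedback in Python, assuming term-weight vectors represented as dicts; the function name and data layout are our own illustrative choices, not the paper's actual implementation.

```python
from collections import Counter

def rocchio_prf(q0_vec, doc_vecs, alpha=0.5, num_terms=20):
    """Sketch of Eq. 1: Q_m = alpha * Q_0 + (1 - alpha) * Q_{r,K}, with gamma = 0.

    q0_vec    -- term -> weight dict for the initial query Q_0
    doc_vecs  -- list of K term -> weight dicts (the pseudo-relevant documents)
    alpha     -- weight of Q_0; alpha = 0 eliminates the initial query entirely
    num_terms -- number T of top-weighted feedback terms retained (T = 20 here)
    """
    k = len(doc_vecs)
    # Q_{r,K}: centroid (average vector) of the K pseudo-relevant documents
    centroid = Counter()
    for vec in doc_vecs:
        for term, weight in vec.items():
            centroid[term] += weight / k
    # keep only the T top-weighted feedback terms
    feedback = dict(sorted(centroid.items(), key=lambda kv: kv[1], reverse=True)[:num_terms])
    # linear combination of the initial query and the positive feedback model
    q_m = Counter()
    for term, weight in q0_vec.items():
        q_m[term] += alpha * weight
    for term, weight in feedback.items():
        q_m[term] += (1 - alpha) * weight
    return dict(q_m)
```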

Of the three parameters involved (\(\alpha \), K, and T), the latter has been deemed the least important after decades of experimentation. The number of terms used for query expansion with PRF is less significant than the quality of the terms selected, as stated many times before in the literature (e.g. Sihvonen and Vakkari (2004)), so commonly T is set to 20. Since optimization of T does not seem worth the effort, we also set \(T=20\) and focus on the former two parameters.

Most previous research in PRF pre-sets K and \(\alpha \) to fixed values independent of the \(Q_0\) at hand, such as \(\alpha = 0.5\) and \(K=10\). These fixed values are usually determined experimentally by selecting the \(\alpha \) and K which maximize, on average, some effectiveness measure on a set of training queries on some benchmark corpus. We will refer to this optimization method as standard throughout the paper. Note that there is no single \(\alpha \)/K combination that optimizes all evaluation measures, but the optimal values depend on the measure of interest.

The value of \(\alpha \) denotes the degree of distrust we have in the feedback model \(Q_{r,K}\): the larger the \(\alpha \), the less confidence we have in \(Q_{r,K}\) with respect to its quality. For a given \(Q_0\) and its initial ranking, the quality of \(Q_{r,K}\) depends solely on the choice of K, for which two factors come into play:

  1. The number R of documents relevant to the information need. Assuming \(Q_0\) yields a perfect ranking (i.e. all R relevant documents are ranked above all non-relevant ones), K should not be set greater than R, otherwise \(Q_{r,K}\) (and consequently \(Q_m\)) may drift away from \(Q_0\) and achieve a worse ranking. Setting K less than R may also have an adverse effect due to a possible insufficient coverage of the topic in \(Q_{r,K}\). Accounting for imperfections in training \(Q_{r,K}\), statistical anomalies, and other effects, we can say that the best K is around R, when \(Q_0\) produces a perfect ranking.

  2. The \(Q_0\)’s effectiveness or quality of its ranking. For a less-than-perfect \(Q_0\) ranking, K should be set lower than R, since the density of relevant documents generally increases when going up the ranking and decreases when going down. In other words, of the two alternative sets of top-K documents, \(K=R-\delta \) or \(K=R+\delta \) (for a positive integer \(\delta \)), the former is expected to have a larger fraction of relevant documents than the latter. Therefore, this strategy produces a ‘cleaner’ pseudo-relevant set with respect to the fraction of relevant documents it contains.Footnote 1 In any case, when \(Q_0\) is imperfect, we pay for drift and coverage problems.

Based on the above, the optimal K can take a value up to around R. The more effective the \(Q_0\), the nearer the optimal K is to R. The less effective the \(Q_0\), the further the optimal K moves away from R to smaller values. Thus, positive correlations between the optimal K and both R and \(Q_0\) effectiveness are expected.

Since R is unknown and \(Q_0\) is imperfect in practice, it is difficult to achieve the delicate balance between drift and coverage in \(Q_{r,K}\). To prevent these effects from spilling into \(Q_m\) and to keep the focus on the user’s information need, \(\alpha \) is usually set to a value \(>0.5\), retaining a significant (safe) contribution of \(Q_0\) to \(Q_m\), larger than that of \(Q_{r,K}\).Footnote 2 In combination with using the same fixed \(\alpha \) and K values for all incoming queries, PRF’s potential may not be squeezed out in its entirety.

Based on the above, we argue that once one has a method for optimizing K per query, the \(\alpha \) parameter becomes much less important and could even be eliminated (set to zero), discarding \(Q_0\)’s contribution. A perfect \(Q_{r,K}\) could potentially encapsulate all of \(Q_0\)’s information, perhaps deeming \(Q_0\)’s contribution to \(Q_m\) unnecessary.Footnote 3 While \(Q_0\)’s effectiveness cannot be controlled during PRF (it depends on the query issued, retrieval model, collection pre-processing/indexing, etc.), since PRF is always (at least) a two-stage process, \(Q_0\)’s effectiveness and R could be estimated, guiding the selection of a better K than a pre-set fixed one. Ideally, in the extreme case, such an optimization method should even predict a \(K=0\), meaning that no PRF would be beneficial and only the \(Q_0\) should be used.

Thus, the method we propose employs query performance predictors (also known as query difficulty) to determine/predict \(Q_0\)’s effectiveness, and uses their values to predict an optimal K per query that maximizes a given effectiveness measure. In this study, we will not consider any R-predictors, although they constitute an obvious and perhaps effective extension.

3.1 Post-retrieval query performance predictors

In the Query Performance Prediction (QPP) literature, there are several quantities correlated to retrieval effectiveness, usually to MAP (Hauff 2010), but also to other measures since many measures are, in their turn, correlated to MAP, e.g. Precision@R (Manning et al. 2008). There exist pre- and post-retrieval QPPs. Since in a PRF setting the initial query is always run, it makes sense to focus on post-retrieval QPPs.

There are three main categories of post-retrieval QPP methods. The first comprises clarity-based methods, which directly measure the ambiguity of the results list with respect to the corpus (Cronen-Townsend et al. 2002). The second comprises robustness-based methods, which evaluate how robust the results are to perturbations in the query, the result list, and the retrieval method (Zhou and Croft 2007; Yom-Tov et al. 2005). Lastly, score distribution-based methods analyze the score distribution of the results list.

According to Zhou and Croft (2007), the methods of the first two categories are time-consuming. Since PRF alone more than doubles the runtime, it is not desirable to burden it further. For instance, to calculate robustness there is the need to generate a random collection by sampling from document models of the documents in the original collection, and then perform retrieval on both collections. The similarity between the two rankings is the robustness score. To calculate the clarity score one needs to estimate the query’s and the collection’s language model. Although the collection’s language model can be pre-computed during indexing, the query language model is estimated by sampling documents after the initial retrieval. For these reasons, we resort to QPPs which are based on the score distribution of the initial results list, which are easy and fast to calculate.

Consequently, we employ three post-retrieval QPPs, namely, WIG, NQC, and SMV; all three have been widely used in recent studies (Zhou and Croft 2007; Shtok et al. 2012; Tao and Wu 2014).

3.1.1 Weighted information gain (WIG)

The Weighted Information Gain (WIG) predictor was introduced by Zhou and Croft (2007) as an approach to predict query performance in web search environments. It measures the divergence between the mean retrieval score of some top documents in the result list and that of a random document in the whole corpus. Equation 2 is a simplified version of the WIG predictor formula which, according to Zhou (2008), is efficient and uses only the scores of the results:

$$\begin{aligned} \mathrm {WIG}(q, {\mathcal {M}})=\frac{1}{n} \sum _{d \in {\mathcal {D}}_{q}^{[n]}} \frac{1}{\sqrt{|q|}}(\mathrm {Score}(d)- \mathrm {Score}({\mathcal {D}})) \, , \end{aligned}$$
(2)

where n is a free parameter equal to the number of top-ranked documents used for calculating the predictor, \({\mathcal {D}}_{q}^{[n]}\) is the set of the top-n documents, and |q| is the query length. \(\mathrm {Score}(d)\) is the score assigned to document d by the retrieval model \({\mathcal {M}}\). Finally, \(\mathrm {Score}({\mathcal {D}})\) is the average score of all retrieved results.

This predictor has been used in previous studies (Tao and Wu 2014; Shtok et al. 2012). According to Markovits et al. (2012), the normalization of the WIG by the query length |q| harms the prediction quality on TREC benchmark collections, so we removed this normalization in our experiments. Lastly, we set \(n=5\), as in Zhou (2008).
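A minimal sketch of Eq. 2 as used here, computed from the initial result list's scores, with \(n=5\) and the query-length normalization dropped by default; the function and variable names are ours.

```python
import numpy as np

def wig(scores, n=5, query_length=None):
    """Simplified WIG (Eq. 2): mean score of the top-n results minus the mean
    score Score(D) of all retrieved results; the optional division by sqrt(|q|)
    is omitted by default, since it was found to harm prediction quality."""
    scores = np.asarray(scores, dtype=float)   # scores ranked by the retrieval model
    value = scores[:n].mean() - scores.mean()
    if query_length is not None:
        value /= np.sqrt(query_length)
    return value
```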

3.1.2 Normalized query commitment (NQC)

The Normalized Query Commitment (NQC) predictor, proposed by Shtok et al. (2012), estimates the amount of query drift in the list of top-retrieved documents using the standard deviation of their retrieval scores:

$$\begin{aligned} \mathrm {NQC}(q,{\mathcal {M}}) = \frac{\sqrt{\frac{1}{n} \sum _{d \in {\mathcal {D}}_{q}^{[n]}}(\mathrm {Score}(d)-{\hat{\mu }})^{2}}}{\mathrm {Score}({\mathcal {D}})} \, , \end{aligned}$$
(3)

where \({\hat{\mu}}\) is the average score of the top-n results in \({\mathcal {D}}_{q}^{[n]}\). We set \(n=100\), as recommended by Shtok et al. (2012).

3.1.3 Score magnitude and variance (SMV)

According to Tao and Wu (2014), WIG and NQC tend to work in some situations and fail in others; as a result, they developed another post-retrieval predictor, namely, the Score Magnitude and Variance (SMV):

$$\begin{aligned} \mathrm {SMV}(q, {\mathcal {M}})=\frac{\frac{1}{n} \sum _{d \in {\mathcal {D}}_{q}^{[n]}}\left( \mathrm {Score}(d)\left| \ln \frac{\mathrm {Score}(d)}{{\hat{\mu }}}\right| \right) }{\mathrm {Score}({\mathcal {D}})} \, . \end{aligned}$$
(4)

Once more, we set \(n=100\), as recommended by Shtok et al. (2012).
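Both predictors can be sketched analogously from the score list of the initial retrieval, assuming positive retrieval scores so that the logarithm in SMV is defined; again, the names are ours.

```python
import numpy as np

def nqc(scores, n=100):
    """NQC (Eq. 3): standard deviation of the top-n retrieval scores,
    normalized by the mean score Score(D) of all retrieved results."""
    scores = np.asarray(scores, dtype=float)
    return scores[:n].std() / scores.mean()

def smv(scores, n=100):
    """SMV (Eq. 4): blends score magnitude and variance over the top-n results,
    normalized by the mean score Score(D) of all retrieved results."""
    scores = np.asarray(scores, dtype=float)
    top = scores[:n]
    mu_hat = top.mean()                      # mean score of the top-n results
    return np.mean(top * np.abs(np.log(top / mu_hat))) / scores.mean()
```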

3.2 Predicting the optimal K

First, we investigated what the optimal K for MAP (\(K_{\mathrm {opt\_MAP}}\)) looks like on real data. In initial experiments with a benchmark dataset (which is described in detail in Sect. 4.1), we ran a \(Q_0\), built from its results all positive-only PRF queries \(Q_m = Q_{r,K}\) for \(K = 1\ldots 200\) with Rocchio, and evaluated them on the test corpus in order to find \(K_{\mathrm {opt\_MAP}}\). We did that for 150 different \(Q_0\)s. The min/med/avg/max \(K_{\mathrm {opt\_MAP}}\) found was 1/13/46.6/200. Two topics hit the 200 mark, suggesting that we should have also searched higher Ks; nevertheless, the distribution is quite skewed to the downside, so we are confident that these few topics which should have had \(K_{\mathrm {opt\_MAP}}>200\) will not affect our overall experimental results.

Figure 1 shows how MAP changes with K for four queries, and Table 1 gives some more quantitative information. As can be seen in the figure, MAP as a function of K is neither smooth nor monotonic. These four queries were selected as representatives of four broad and rough categories of behaviour we saw in the data: multimodal, rising-and-falling, falling, rising.

Fig. 1 MAP as a function of using the top-K documents as pseudo-relevant for PRF (with a zero contribution of the initial query)

Table 1 Example topics/queries

In the table, we can see that 3 out of 4 topics have an improved MAP with \(Q_{r,K_{\mathrm {opt\_MAP}}}\) over \(Q_0\); for these, optimizing K produces a \(Q_{r,K}\) better than \(Q_0\), so such an optimization method is beneficial. However, queries similar to 423 cannot be improved with any positive-only PRF model disregarding the initial query. Of the 150 topics in our dataset, only 19 (12.7%) fall in this category. In such cases, tuning of the parameter \(\alpha \) may be needed, but instead, we decided to incorporate these cases into our prediction model in order to switch off PRF when they are detected (we will see how below). In any case, using the same fixed K for all queries seems rather naive, and it only works on average as long as a proper fixed K is selected.

We calculated the three QPPs (Sect. 3.1) on the result lists of the 150 \(Q_0\)s and measured their correlation to the MAP of \(Q_0\), as shown in Table 2. There are statistically significant positive correlations everywhere, with a strength typical for QPPs. So, they are doing their intended job, but do they also predict \(K_{\mathrm {opt\_MAP}}\)?

Table 2 Correlation of MAP@\(Q_0\) to QPPs (significance levels .05\(^{\circ }\) and .001\(^{\bullet }\))

Table 3 shows the correlation of \(K_{\mathrm {opt\_MAP}}\) to QPPs. Unfortunately, individual QPPs seem to have almost no predictive power for \(K_{\mathrm {opt\_MAP}}\).Footnote 4 But it may be the case that their predictive power becomes significant when their values are scaled and/or non-linearly transformed and all three are combined into a single regression model.

Table 3 Correlation of \(K_{\mathrm {opt\_MAP}}\) to QPPs (all statistically insignificant)

To achieve this, we used TPOTFootnote 5 in order to transform and scale the independent variables, and project them into a high-dimensional feature space via a kernel-based method, so that they become more compatible with linear regression. TPOT is a tool that optimizes machine learning pipelines using genetic programming. It intelligently explores thousands of possible pipelines to find the best one for the data at hand. Among the transformation methods explored by TPOT are those which handle multicollinearity. Multicollinearity occurs when independent variables in a regression model are correlated (Rosipal et al. 2001); in our case, NQC and SMV exhibit a strong, statistically significant Pearson correlation.

The approach requires training data, which in our case are the values of QPPs for a set of \(Q_0\)s together with their corresponding \(K_{\mathrm {opt\_MAP}}\). If a \(Q_0\)’s MAP is greater than the MAP of \(Q_{r,{K_{\mathrm {opt\_MAP}}}}\), then we set \(K_{\mathrm {opt\_MAP}}=0\) in order to enable prediction of cases where PRF would not be beneficial. TPOT iterates through different regression models paired with feature selectors and transformers, and each iteration produces a pipeline. When some pipeline is used for regression, it is bound to have some forecast error (loss) as measured by a specified loss function. As a loss function, we selected the mean absolute error (MAE) between \(K_{\mathrm {opt\_MAP}}\) and \(K_{\mathrm {pred\_MAP}}\).Footnote 6

Within the training set, the evaluation of each pipeline’s loss is performed in a 5-fold cross-validation fashion to avoid over-fitting. Finally, the pipelines are ranked in increasing order of their loss, and the best one is selected and re-trained on the entire set of training samples. The selected pipeline is used to predict \(K_{\mathrm {pred\_MAP}}\) for an unknown query. If the pipeline predicts a \(K_{\mathrm {pred\_MAP}} \le 0\), then PRF does not take place and the \(Q_0\) is used.
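The following is a minimal sketch of how such a TPOT search could be set up; the toy feature/target arrays, the generation and population sizes, and the rounding of the prediction are illustrative assumptions, not the paper's actual data or settings.

```python
import numpy as np
from tpot import TPOTRegressor

# One row per training query: the three QPP values [WIG, NQC, SMV] of Q_0;
# the target is K_opt_MAP (set to 0 when Q_0 alone beats every Q_{r,K}).
# The numbers below are placeholders standing in for real training data.
X_train = np.array([[2.1, 0.31, 0.42], [1.4, 0.18, 0.25], [3.0, 0.45, 0.60],
                    [0.8, 0.10, 0.15], [1.9, 0.27, 0.33], [2.5, 0.39, 0.48]])
y_train = np.array([12, 40, 5, 0, 25, 8])

# Genetic-programming search over pipelines (feature transformers + regressor),
# each candidate scored by mean absolute error under 5-fold cross-validation.
tpot = TPOTRegressor(generations=5, population_size=50,
                     scoring='neg_mean_absolute_error', cv=5,
                     random_state=42, verbosity=2)
tpot.fit(X_train, y_train)
print(tpot.fitted_pipeline_)   # best pipeline, refit on the full training set

# At query time: compute the QPPs of an unseen Q_0 and predict K.
k_pred = int(round(float(tpot.predict(np.array([[1.7, 0.22, 0.30]]))[0])))
if k_pred <= 0:
    print("PRF switched off: use Q_0 only")
else:
    print(f"use the top-{k_pred} documents as pseudo-relevant")
```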

4 Evaluation

First, in Sect. 4.1, we describe the experimental setup, i.e. the main benchmark corpus, train–test splits, evaluation measures, and search engine used along with parameter settings. Then, we run two main experiments in Sect. 4.2. Finally, in Sect. 4.3, we provide results on additional corpora and/or splits as well as comparisons to state-of-the-art PRF methods.

4.1 Experimental setup

We evaluated on a TRECFootnote 7 corpus, namely, TREC Volumes 4 and 5 minus the Congressional Record (Voorhees and Harman 1999), which consists of newswire articles. We used TREC topics 301–450, in several different train/test splits as will be described below, for training and testing our regression model. The min/median/average/max numbers of relevant documents these topics have in the corpus are 3/67/93.4/474. Relevance feedback is more effective and recommended for short queries, so we used only the titles of the TREC topics.

As evaluation measures, we employed MAP, Precision@30, Precision@R, and Recall@1000, where R is the number of relevant documents of a query. While in our experiments we aim to optimize only the first measure (MAP), the latter three serve as auxiliary measures in order to get more insight. We report the macro-averages of these measures across the queries of multiple test-sets. In the published literature of PRF methods, it is usual to also report the Robustness Index (RI), introduced by Sakai et al. (2005), which gives information about the reliability of the improvements. For a set \({\mathcal {Q}}\) of test queries, it is defined as \( \mathrm {RI}({\mathcal {Q}})=(n_{+}-n_{-})/|{\mathcal {Q}}| \in [-1,1] \), where \(n_{+}\) and \(n_{-}\) are the numbers of queries that are respectively helped or hurt by the feedback method according to some evaluation measure.
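For reference, the RI computation can be sketched as follows, assuming per-query effectiveness values (e.g. average precision) are available for the baseline and feedback runs; the helper name is ours.

```python
def robustness_index(baseline, feedback):
    """RI(Q) = (n+ - n-) / |Q|, where n+ / n- count the queries helped / hurt
    by the feedback run relative to the baseline run. Both arguments map
    query ids to per-query effectiveness values (e.g. average precision)."""
    helped = sum(feedback[q] > baseline[q] for q in baseline)
    hurt = sum(feedback[q] < baseline[q] for q in baseline)
    return (helped - hurt) / len(baseline)
```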

For indexing the collection, we used TerrierFootnote 8 v4.2 with the Porter Stemmer, without removing stop words. For retrieval, we used Terrier’s default retrieval model, which is the inverse document frequency model for randomness (InL2) (Amati and van Rijsbergen 2002).

In initial test runs, we investigated the values produced by the post-retrieval QPPs (Equations 2, 3, and 4) for all initial queries \(Q_0\). Table 4 shows some statistics for the topics 301–450. One query appeared to be an outlier. Query 368 had WIG, NQC, and SMV values of 15.0, 12.8, and 10.5, respectively. Its NQC and SMV are both well above two standard deviations from the mean, while its WIG is also very large. While these do not seem like unreasonable values given that 368’s \(Q_0\) achieves a high MAP (0.4291) but not the highest in the dataset (according to Table 1), even one outlier can have adverse effects in regression; thus, we typically excluded topic 368 from all experiments when it occurred in a training set.

Table 4 Statistics of Post Retrieval Query Performance Predictors

In order to investigate the sensitivity of our regression model to the selection of queries used for training, we generated three different training/test splits of the queries 301–450. Split 1 (SPLT1) consists of the 50 queries with numbers \(301+3k, k=0\ldots 49\), for training, and the rest for testing. Similarly, the selection formulae for the training sets of SPLT2 and SPLT3 are \(302+3k\) and \(303+3k\), respectively; in all cases, the remaining 100 queries constitute the test set. Furthermore, in order to investigate the sensitivity of our regression model to the number of queries used for training, we generated three additional splits, this time with 100 training queries each. For SPLT4–6, we just reverse the training/test sets of SPLT1–3, respectively. Thus, we can train on double the number of queries; however, we test on only the 50 remaining queries instead of 100. As mentioned earlier in this section, we excluded query 368 wherever it occurred.
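The six splits can be reproduced with a few lines; this sketch follows the description above and drops the outlier topic 368 from the training sets (the function name is ours).

```python
def make_splits(first=301, last=450, outlier=368):
    """SPLT1-3: train on the 50 topics first+offset+3k (offset = 0, 1, 2),
    test on the remaining 100; SPLT4-6 swap the two sets. The outlier topic
    (368) is removed from the training sets, per the setup description."""
    all_topics = set(range(first, last + 1))
    splits = {}
    for offset in range(3):
        train = {first + offset + 3 * k for k in range(50)}
        test = all_topics - train
        splits[f"SPLT{offset + 1}"] = (sorted(train - {outlier}), sorted(test))
        splits[f"SPLT{offset + 4}"] = (sorted(test - {outlier}), sorted(train))
    return splits
```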

As explained in Sect. 3.2, we use the TPOT tool, developed by Olson et al. (2016), for determining the optimal pipeline for building our regression models on the training data. Table 5 lists the optimal settings produced by TPOT on the training set of each of the splits. Among the transformation methods obtained by the grid search are Power transformation, L1 Normalization, Robust Scaling of the input data, and kernel-based methods such as PCA, FastICA, Nystroem, and RBFsampler.

Table 5 Pipelines produced by TPOT on the training sets of the splits

For PRF, we adopted Rocchio’s formula, with an initial query \(Q_0\) weight \(\alpha \), a positive feedback weight \(\beta = 1-\alpha \), and \(\gamma = 0\) (i.e. no negative feedback used in PRF). The number of query expansion terms T is set to 20 for all experiments. Next, we will name all our runs in an “\(\alpha /K\)” fashion, referring to the two parameters of Equation 1.

4.2 Experimental results

First, we investigate how our proposed model performs against the initial query and fixed-K positive-only PRF disregarding the initial query. Then, we compare against the standard PRF optimization retaining the initial query.

4.2.1 Initial query elimination

Table 6 shows the results for optimizing K for MAP per query with our proposed method (i.e., the 0/pM run, meaning that \(Q_0\) is eliminated and \(K = K_{\mathrm {pred\_MAP}}\)), for all six train/test splits. The second column (1/0) shows the effectiveness of the initial retrieval (\(Q_0\)), while the next five columns show the effectiveness of positive-only PRF (i.e. no initial query) for five fixed values of K. Finally, the last column (0/oM) shows the effectiveness when the optimal K for MAP (\(K_{\mathrm {opt\_MAP}}\)) per query is used (i.e. the Ks we set out to predict); these MAP numbers represent the ceiling of possible effectiveness or upper bound, when the initial query is eliminated.

Recall that whenever \(K_{\mathrm {pred\_MAP}} \le 0\), 0/pM drops back to 1/0 for that topic, i.e. only the initial query \(Q_0\) is used with no PRF. This happens 4, 8, 3, 5, 4, 8 times in SPLT1–6, respectively, i.e. 4–16% of the topics with a median of 7%.Footnote 9 Similarly, whenever the MAP of \(Q_0\) is greater than the MAP of \(Q_{r,K}\) for all K, 0/oM drops back to the effectiveness of \(Q_0\) (1/0) for that topic. This happens 19 times in the 149 topics (12.7%), or 12, 13, 13, 7, 6, 6 times in SPLT1–6, respectively; i.e. 12–14% of the topics.

Table 6 Effectiveness of positive-only PRF (initial query eliminated)

The best result per measure, across the \(Q_0\)-only (1/0) or fixed-K runs (0/5–0/50), is in boldface. Across these runs, MAP is maximized for \(K=10\) (0/10) in all experiments/splits.Footnote 10 The MAP improvements of 0/10 over 1/0 are between 10.6% and 17.9%.Footnote 11 Thus, a positive-only PRF, disregarding the initial query, can result in large improvements in effectiveness, even by using a fixed K for all queries as long as a proper K is selected; here, we confirm once more that the widely-used value of \(K=10\) gives the best MAP results.

A valuable K optimization method must outperform \(Q_0\) (1/0), as well as all fixed K runs (0/5–0/50). We see that our proposed method (0/pM) always outperforms the best fixed K run (0/10) by 7.0% to 10.6% in MAP, depending on the split (but mostly affected by the amount of training data). The \(K_{\mathrm {pred\_MAP}}\) results (0/pM) are significance-tested with a bootstrap test, one-tailed, at significance levels 0.05 (\(^{\circ }\)), 0.01 (), and 0.001 (\(^{\bullet }\)), against the fixed \(K=10\) run (0/10); (\(\,^{\text{- }}\)) means non-significant. The MAP improvements achieved by \(K_{\mathrm {pred\_MAP}}\) (0/pM) over \(Q_0\) (1/0) are between 18.3% and 29.6%.

The auxiliary precision-oriented measures, Prec@R and Prec@30, also yield mostly statistically-significant improvements in tandem with MAP. In 0/pM vs. 0/10, Prec@R improves by 4.5% to 10.1% and Prec@30 by -0.2% to 11.5%, depending again on the split. These measures tend to get maximized at \(K=5\) in some splits, but we still get large improvements over these 0/5 runs. Rec@1000 does not show any significant differences and it tends to get maximized for larger fixed Ks (e.g. 0/30). These results were expected due to the fact that precision-oriented measures are correlated to MAP (MAP is also mostly sensitive to the top of the ranking), while Rec@1000 is a high-recall measure. Nevertheless, note that we still get large increases in Rec@1000 over \(Q_0\), from 10.9 to 12.6%. In any case, all these are extra improvements, since our target has been MAP optimization where we achieve the largest and most significant improvements as we showed earlier.

Regarding the sensitivity of our method (0/pM) to the selection of training queries, we can see that SPLT1–3 show similar percentage improvements in MAP (7.0% to 7.5% over 0/10). The same goes for SPLT4–6 (9.8% to 10.6% over 0/10). Thus, our method is robust. Recall that SPLT1–3 use three different training sets of 50 queries each, while SPLT4–6 use three different training sets of 100 queries each.Footnote 12

Regarding the sensitivity of our method to the size of the training set of queries, we can see that SPLT4–6 perform better than SPLT1–3. Thus, more training queries lead to a better performance. Nevertheless, one could argue that even with just 100 training queries, we may already be seeing some diminishing returns: the MAP achieved by the proposed method is nearer to the ceiling of possible achievable effectiveness (0/oM) than to the effectiveness of \(Q_0\) (1/0) in SPLT5. All in all, if not 100, a few hundred training queries may be sufficient.

Table 7 reports the RI values for MAP. The proposed PRF method (0/pM) improves a significant number of queries over the \(Q_0\)-only run 1/0 (1st column) and the fixed \(K=10\) run 0/10 (2nd column). The third column shows the RI of the fixed \(K=10\) run over the \(Q_0\)-only run. Once again we confirm that using a fixed K value is not an effective approach as it hurts almost as many queries as it improves, while our proposed method is much more reliable.

Table 7 Robustness Index (RI) for MAP

To conclude, our proposed PRF method, i.e. disregarding the contribution of the initial query and optimizing the number of pseudo relevant documents (K) per query, is a viable, robust, and effective method. It yields significant improvements both over the initial query and positive-only PRF with a fixed K for all queries.

4.2.2 Retaining initial query with fixed or predicted K

Table 8 MAP on the train sets of splits for different combinations of \(\alpha /K\)

Table 8 shows the MAP on the training sets of splits for different combinations of \(\alpha /K\); the maximum MAP per split is in boldface. What we call the standard PRF optimization method for MAP (std-pM) would use in the test set the best combination of \(\alpha /K\) found in the training set, i.e. 0.5/5 for SPLT1, 0.4/10 for SPLT2, and so on. It can be seen that, in general across all splits, MAP is maximized between 0.4–0.5 for \(\alpha \) and 5–10 for K, with 0.5/10 being overall the best combination.

Table 9 Effectiveness of PRF with standard MAP optimization (std-pM), against the pure proposed method (0/pM) and when retaining a 50% contribution of the initial query (0.5/pM). The ceiling of std-pM is std-oM, when the optimal fixed \(\alpha /K\) for a test set is found and used

Table 9 shows the results achieved by the standard method (std-pM) on the test sets of the splits. The 0/pM column re-iterates the results from Table 6 of our proposed method, but this time they are compared and statistically-tested for significance against the standard method. It can be seen that, even by eliminating the initial query, our proposed method outperforms the standard method in 5 out of 6 splits (significantly in 3 of those 5) in MAP. The only decrease in MAP is in SPLT2, and there are some decreases in the auxiliary evaluation measures (which we do not try to optimize anyway), but none of them is significant. Thus, the improvements in MAP range from -1.1% to +10.0% with a median/mean of +3.7%/+4.0%.

While we are satisfied with these improvements, we are also interested in seeing what happens if we also use some contribution of the initial query in our model. For this purpose, we did an extra run using the best overall \(\alpha =0.5\) (according to Table 8) but building the positive feedback component with our model/method; these results are shown in the .5/pM column and compared against the standard method (std-pM). This time, there are MAP improvements in all splits, ranging from +2.1% to +8.0% with a median and mean of +5.3%, significant in 5 out of 6 splits. The auxiliary measures also mostly improve, some significantly so.

Table 10 Robustness Index (RI) for MAP

To conclude, our proposed model outperforms the standard PRF optimization method. Nevertheless, there are still further improvements to be gained—albeit small ones—by using some contribution of the initial query. While our positive-only feedback model is built to be optimal without a contribution of the initial query, it still performs a bit better in 5 out of 6 splits (from +1.2% to +3.2% in MAP) when 50% of such a contribution is used. These small improvements are mostly statistically insignificant; however, it seems that there is some robustness to be gained, as can be seen in Table 10.

On a final note, the right-most column of Table 9 (std-oM) shows the potential effectiveness ceiling of the standard optimization method. For this, we grid-searched for the \(\alpha /K\) combination that maximizes MAP, but this time directly in each test set. It can be seen that the results achieved by the standard method (std-pM) are very near to their ceiling (std-oM), thus there is not much potential left for improvements. In contrast, even our best runs so far (0/pM and 0.5/pM) are far from their potential ceiling (0/oM column in Table 6), leaving ample room for improvement.

4.3 Additional experiments

In this section, first we will confirm the effectiveness of our proposed method on an additional benchmark corpus, and then compare it to PRF methods from the recent literature on several other setups.

4.3.1 Experiments with web data

Table 11 presents our results on the WT10g test collection. We applied the same pre-processing pipeline as in (Parapar and Barreiro 2011; Parapar et al. 2014; Valcarce et al. 2018), i.e. stopword removal and Porter Stemmer, as well as used the same split, i.e. trained our model using TREC topics 451–500 and evaluated using topics 501–550. Again, we optimized for MAP.

Table 11 Effectiveness of positive-only PRF — WT10g corpus

The best fixed-K for MAP is 5 (0/5 run); against this run as a baseline, the proposed method (0/pM) yields statistically significant results for MAP and Prec@R. The MAP improvement (+7.1%) is in-line with what we reported earlier in Table 6 when training with 50 queries only. While the Prec@R improvement is much larger in the WT10g corpus, this also has to do with the fact that each measure is maximized at a different fixed-K value. Against the initial query (1/0), the proposed PRF method yields a larger improvement (+34.9%) than the best we saw before in Table 6 (+29.6%). These results confirm the effectiveness of our method, strengthening the evidence. However, the proposed method is quite far from its potential ceiling of effectiveness (0/oM column), leaving ample room for improvements.

We ran the standard PRF optimization method for MAP (std-pM), as in Sect. 4.2.2, and found that MAP is maximized at .5/5; these results are shown in the 1st column of Table 12. Our proposed method (0/pM) significantly outperforms std-pM. Moreover, std-pM is practically at its potential ceiling of performance (std-oM column), resulting when using the optimal parameters found in the test set (i.e. .6/5). Thus, these results so far confirm our findings in Sect. 4.2.2. But now, using a fixed 50% contribution of the initial query together with the variable Ks predicted by our proposed method (i.e. the .5/pM run) is generally worse than the pure proposed method (0/pM). This is in contrast to the small insignificant improvements we found in Sect. 4.2.2, supporting our conjecture of initial query elimination.

Table 12 Effectiveness of PRF with standard MAP optimization (std-pM), against the pure proposed method (0/pM) and when retaining a 50% contribution of the initial query (0.5/pM). The ceiling of std-pM is std-oM, when the optimal fixed \(\alpha /K\) for the test set is found and used

Table 13 provides the RIs of some run comparisons from Tables 11 and 12. All RIs confirm our commentary above, except for a single surprise: despite the overall worse performance (-5.3% in MAP) of .5/pM compared to 0/pM, .5/pM yields a larger RI over std-pM. This surprise is in line with Valcarce et al. (2018), who also found that while their proposed method was the most effective it did not achieve the best RI value. They attributed this to the noisy nature of the WT10g collection. This suggests again, as in Sect. 4.2.2, that there may be at least some robustness to be gained when using some contribution of the initial query, but this time it comes at a cost of lower average effectiveness. On the other hand, somewhat counter-intuitively, the RI is positive when comparing 0/pM to .5/pM.

Table 13 Robustness Index (RI) for MAP — WT10g corpus

4.3.2 Comparison to the state-of-the-art

Let us now compare to other state-of-the-art PRF methods from the literature, specifically, the RM3 model which is one of the most effective PRF methods based on the language modeling framework (see e.g. Lv and Zhai (2009b, 2010)), and the methods of Parapar and Barreiro (2011); Parapar et al. (2014); Valcarce et al. (2018). We reproduce the experimental setups of these works on the TREC Volumes 4 and 5 (minus the Congressional Record) collection. While this is the same corpus we also used in the previous sections, other studies considered different training/testing splits, namely, TREC topics 301–350 for training and 351–400 or 351–450 for testing; we refer to these setups/splits as TS50 and TS100, respectively. On the WT10g corpus, the aforementioned works used the setup we described earlier in this section.

In Table 14, the first three columns present the MAP reported in the aforementioned works for LM, RM3, and the best-performing PRF method of the ones proposed in each of the studies. On TS50, SDRM3 is the best in Parapar et al. (2014), and LiMe-TF-IDF is the best in Valcarce et al. (2018). On TS100, RM3DT is the best in Parapar and Barreiro (2011). On WT10g, RM3DT is the best in Parapar and Barreiro (2011), SDRM3 is the best in Parapar et al. (2014), and LiMe-TF is the best in Valcarce et al. (2018). As a retrieval model for the initial retrieval, the baseline Language Modelling (LM) with Dirichlet smoothing was used. The RM3 model was well-tuned; using topics 301–350 for training, the optimal values of \(\mu ,e,r,\lambda \) reported in Parapar et al. (2014) were 500, 100, 10, 0.2, respectively. The \(\lambda \) value suggests that the contribution of the initial query should be high (since \(\alpha = 1 - \lambda = 0.8\)) for the TS50 and TS100 benchmark datasets. For the WT10g dataset, the values reported for \(\mu ,e,r,\lambda \) were 500, 10, 5, 0.6, respectively, suggesting a more balanced contribution.

Table 14 MAP-based comparisons to previous works on TS50, TS100, and WT10g testbeds
Table 15 Robustness Index (RI) for MAP

Table 14 also presents our results of the initial query (1/0) as well as our proposed method (0/pM) on those three setups. Note that our initial query run underperforms LM in all three testbeds (we use Terrier’s default/untuned retrieval model, i.e. the inverse document frequency model for randomness (InL2) (Amati and van Rijsbergen 2002)). Nevertheless, starting even from such a worse initial retrieval, our proposed PRF method outperforms both RM3 and the best-performing PRF method from previous literature in all testbeds, in some cases by far. The improvement over the literature’s best runs ranges from 1.3% to 21.0% with an average of 10.2%. Table 15 shows the RI values of our method and of the literature’s best-performing methods against the initial query runs (either 1/0 or LM). Our RIs are larger in 4 out of 6 cases, while they are overall high. Last, according to our method’s potential ceiling of effectiveness (0/oM), there is ample room for further improvements.

Table 16 Processing pipelines produced by TPOT for TS50, TS100 & WT10g

Once more, as explained in Sect. 3.2, we used the TPOT tool, developed by Olson et al. (2016), to determine the optimal machine learning pipelines for our method. For completeness, Table 16 lists the obtained parameters, which are similar to and in line with those of the previous experiments reported in Sect. 4.2. The transformation methods obtained by the grid search are L2 Normalization, StandardScaling, PolynomialFeatures of the input data, and kernel-based methods such as KernelPCA, FastICA, and RBFsampler.

5 Discussion

The empirical evaluation has shown that the proposed method is robust and effective. Here, we will look deeper into some data in order to gain more insight on why it works.

5.1 Optimal K and query performance

In Table 3, we saw some worrying signs (i.e. although non-significant, all correlations are negative) that may indicate a flaw in half of our argument at the beginning of Sect. 3: positive correlations between the optimal K and both R and \(Q_0\) effectiveness are expected. Since Table 3 measures the correlation of \(K_{\mathrm {opt\_MAP}}\) to QPPs and not directly to the MAP of \(Q_0\), we provide Table 17 in order to determine whether our argument holds or not. Table 17 confirms the first part of the argument (i.e. the larger the R, the larger the optimal K), but shows an anti-correlation between \(K_{\mathrm {opt\_MAP}}\) and MAP@\(Q_0\) (and therefore good QPPs). Since, as we argued in Sect. 3, the effectiveness of \(Q_0\) comes to ‘correct’ the optimal K, moving it further away from R to lower values the more difficult the \(Q_0\) is, we investigate R further.

Table 17 Correlation of \(K_{\mathrm {opt\_MAP}}\) to MAP@\(Q_0\), R, MAP@\(Q_{r,K_{\mathrm {opt\_MAP}}}\)

Table 18 shows the correlation of R to QPPs and \(Q_0\)’s effectiveness. These are all negative correlations, mostly statistically significant. This appears counter-intuitive: one might expect topics with more relevant documents to be easier, yet among the easiest topics there are many which possess a small number of relevant documents, and many difficult topics have many relevant documents. Amati et al. (2004) noticed this before, and attributed it to topic/query generality with respect to the collection. Specific queries have few relevant documents, their query terms have few occurrences in the collection, thus their relevant documents are easier to find/discriminate.

Table 18 Correlation of R to QPPs and \(Q_0\)’s effectiveness

Consequently, what we think is happening with our argument/model is the following. We detect effective queries via QPPs (Table 2). These happen to have small R (Table 18), plausibly due to the argument of Amati et al. (2004) mentioned above, generating a negative correlation between QPPs and R. Since, as we argued, the optimal K should be smaller than R,Footnote 13 the negative correlation is also transferred to between QPPs and optimal K, making our argument look flawed while it is not. There is a positive correlation between QPPs and optimal K, which is almost totally overridden (Table 3) by the small R of easy queries and large R of the difficult ones—a much stronger correlation. Indeed, when we measure the correlation between \(K_{\mathrm {opt\_MAP}}\) and QPPs but only for queries with R in a tight range, it turns positive most often. Therefore, the latter half of our argument should be better re-formulated as: a positive correlation between the optimal K and \(Q_0\) effectiveness is expected for topics with a similar R.

Although we decided to focus on \(Q_0\)’s effectiveness in this study, R has turned out to be a very important variable too. Luckily, QPPs also predict R (Table 18) via the discussed anti-correlation, which seems to have helped our regression model considerably.

5.2 Loss functions for model selection

In Sect. 3.2 it was stated that we use the mean absolute error (MAE) between the observed optimal K (\(K_{\mathrm {opt}}\)) and our forecast (\(K_{\mathrm {pred}}\)) as a loss function for model selection in training the regression model. Further experiments revealed that the choice of loss function is critical in our training method, determining its success or failure.

The absolute error (AE) is defined as AE(\(K_{\mathrm {opt}},K_{\mathrm {pred}}\)) = \(|K_{\mathrm {opt}}-K_{\mathrm {pred}}|\). Theoretically, however, AE is not the right choice in the context of PRF. Its problem is that it penalizes a given forecast error equally, irrespective of the magnitude of the observed value, e.g. AE(100,105) = AE(10,15). In PRF, the former error is expected to have a less dramatic impact on PRF effectiveness than the latter. By treating those two cases equally, using MAE for model selection in PRF produces systematically larger-than-desirable forecasts.

An alternative is the mean absolute percentage error (MAPE). The absolute percentage error is defined as APE(\(K_{\mathrm {opt}},K_{\mathrm {pred}}\)) = \(| (K_{\mathrm {opt}}-K_{\mathrm {pred}}) / K_{\mathrm {opt}} |\). APE normalizes AE’s unsuitable behaviour by measuring percentage differences. However, it puts a heavier penalty on negative errors, i.e. \(K_{\mathrm {opt}} < K_{\mathrm {pred}}\), than on positive ones. For example, APE(5,10)=100%, but APE(10,5)=50%. As a consequence, when MAPE is used to compare the accuracy of prediction methods, it is biased in that it will systematically select a method whose forecasts are too low. While this is typically considered as a drawback in the literature, it is a desirable bias in our task since negative errors are expected to decrease the density of relevant documents in the PRF set while positive errors are expected to increase it. Still, note that APE(10,5)=50% but APE(10,15)=50% also (i.e. APE is symmetric on the percentage scale), while, based again on our relevant document density argument, the former error is preferable.

Other error variants, such as the Adjusted/Symmetric MAPE (SMAPE), seem even more unsuitable. SMAPE is the mean of the symmetric APE (sAPE), which is defined as sAPE(\(K_{\mathrm {opt}},K_{\mathrm {pred}}\)) = \(|K_{\mathrm {opt}}-K_{\mathrm {pred}}| / ((|K_{\mathrm {opt}}| + |K_{\mathrm {pred}}|)/2) \). The sAPE is asymmetric (despite its name) on the percentage scale, e.g. sAPE(10,5)=66.6% but sAPE(10,15)=40%, but its asymmetry is the opposite of the desirable one: the former error is preferable. Moreover, it eliminates (as it is designed to do) APE’s desirable bias, e.g. sAPE(5,10) and sAPE(10,5) are now equal. A measure proposed by Tofallis (2015) is the log of the accuracy ratio (LAR), which in our case should be taken as an absolute value (ALAR). It is defined as ALAR(\(K_{\mathrm {opt}},K_{\mathrm {pred}}) = |\log (K_{\mathrm {pred}}/K_{\mathrm {opt}})| = |\log K_{\mathrm {pred}} - \log K_{\mathrm {opt}}| \). The mean of ALAR (MALAR) can be used as a loss function. Inspecting its properties: ALAR(100,105) < ALAR(10,15) (desirable), ALAR(5,10) = ALAR(10,5) (undesirable), ALAR(10,5) > ALAR(10,15) (undesirable). Therefore, among MAE, MAPE, SMAPE, and MALAR, the qualitatively most suitable for our task (although still not ideal) is MAPE.

A drawback of MAPE (as well as of SMAPE and MALAR) is that it cannot be used if there are observed values (and/or predicted values for SMAPE and MALAR) equal to zero, because there would be a division by zero (or a log of zero); we have a few such zeros in our problem. Nevertheless, since we are not interested in (or going to interpret) the actual MAPE value but only use MAPE as a loss function, all observed and predicted data can be shifted by adding a small positive value \(\epsilon \) without any material impact on our use. Thus, we shifted by \(\epsilon =0.5\).
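A minimal sketch of the shifted MAPE, assuming per-query lists of observed and predicted K (the function name is ours):

```python
def mape_shifted(k_opt_list, k_pred_list, eps=0.5):
    """MAPE with all observed and predicted values shifted by eps,
    so that K_opt = 0 no longer causes a division by zero."""
    errors = [
        abs(((o + eps) - (p + eps)) / (o + eps))
        for o, p in zip(k_opt_list, k_pred_list)
    ]
    return sum(errors) / len(errors)
```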

In the discussion above, we have focused on how to best measure the error between the optimal and the predicted K. In PRF, however, we are ultimately interested in the impact this error has on retrieval effectiveness, not in the error itself. In this respect, the most suitable loss function for our problem is the mean of the retrieval effectiveness error (MREE). For MAP, the retrieval effectiveness error is defined as REE\((K_{\mathrm {opt\_MAP}},K_{\mathrm {pred\_MAP}})\) = MAP@\(Q_{r,K_{\mathrm {opt\_MAP}}}\) − MAP@\(Q_{r,K_{\mathrm {pred\_MAP}}}\), falling back to MAP@\(Q_0\) for either of the two MAPs if \(K_{\mathrm {opt\_MAP}}=0\) or \(K_{\mathrm {pred\_MAP}} \le 0\), respectively. Minimizing MREE minimizes the average distance from the potential ceiling of effectiveness, i.e. the effectiveness distance between the 0/oM and 0/pM runs in Table 6.
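The sketch below illustrates REE and MREE under the assumption that the per-query MAP values at the optimal K, at the predicted K, and for the initial query are already available; the function and argument names are ours.

```python
def ree(map_at_k_opt, map_at_k_pred, map_q0, k_opt, k_pred):
    """Retrieval effectiveness error for a single query (here for MAP),
    falling back to the initial query's MAP when K_opt = 0 or K_pred <= 0."""
    eff_opt = map_q0 if k_opt == 0 else map_at_k_opt
    eff_pred = map_q0 if k_pred <= 0 else map_at_k_pred
    return eff_opt - eff_pred

def mree(per_query):
    """Mean REE over an iterable of
    (map_at_k_opt, map_at_k_pred, map_q0, k_opt, k_pred) tuples."""
    per_query = list(per_query)
    return sum(ree(*q) for q in per_query) / len(per_query)
```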

We experimented with all the above loss functions. While MREE is the theoretically correct one, it produces unstable fits across the splits, leading to overall worse effectiveness. Of the functions focusing on measuring the K-error, MAPE (the most suitable, though still not perfect) has exactly the same problem as MREE. The rest (SMAPE, MALAR, and MAE), while lacking the properties desired by the task, are more forgiving, with MAE being the most robust and the best-performing.

This counter-intuitive behavior of the loss functions can be attributed to the training data: the distribution of \(K_{\mathrm {opt\_MAP}}\) in our training sets is skewed towards small values (as follows from the numbers in the first paragraph of Sect. 3.2). This skew is sufficient to produce the desirable under-forecasts, and when combined with the additional under-forecasting bias of MAPE it becomes too much, leading to an excessive number of forecasts with \(K_{\mathrm {pred\_MAP}} \le 0\), which degenerates the method. A similar thing may be happening with MREE. Table 17 shows an anti-correlation between the optimal K and the effectiveness of the positive-only feedback query built at the optimal K, meaning that the most effective feedback happens at small Ks. Since there are many more small \(K_{\mathrm {opt\_MAP}}\) than large ones, TPOT over-focuses on low forecasts. However, at large MAPs, diminishing returns kick in: while one can achieve an absolute increase of +0.25 when MAP is, e.g., at 0.05, one can achieve at best +0.20 when MAP is already, e.g., at 0.80; inversely, while one can lose −0.20 from 0.80, one can lose at worst −0.05 from 0.05. The macro-averaged MAP we evaluate with is sensitive to these absolute differences.

Table 19 Ranking of splits by their test-set loss

As indirect support for the above analysis, Table 19 ranks the splits by their test-set loss as measured by the different loss functions. The right-most column (0/oM−0/pM) ranks the splits by the difference in mean MAP between the ceiling and our method (minimizing this difference is the target). Obviously, this is exactly the same ranking as the one produced by MREE, since the difference of means (0/oM−0/pM) equals the mean of the differences (MREE); therefore, MREE is optimal. MAPE is the second best (or the best among the functions focusing on K-errors rather than MAP-errors), with only a single swap of adjacent splits from the optimal ranking. Note that the good functions rank splits 4–6 above splits 1–3; since splits 4–6 have double the amount of training data compared to splits 1–3, smaller errors are achieved in the test sets of the former.

All in all, one should keep in mind that the theoretically optimal loss function for the task is MREE, while MAPE also has most of the desirable properties. Given a larger and more balanced training set of queries (i.e. with a more uniformly distributed \(K_{\mathrm {opt}}\) for the effectiveness measure of interest), MREE or MAPE are the loss functions one should employ. For our few and unbalanced training data, the simplest option, MAE, has been the most forgiving, robust, and effective.

6 Conclusions & directions for further research

We have proposed a method for automatic optimization of pseudo relevance feedback (PRF) in information retrieval. Based on the conjecture that the initial query’s contribution to the final/feedback query may not be necessary once a good model is built from pseudo relevant documents, we have set out to optimize—per query—only the number of top-retrieved documents to be used for feedback. The optimization has been based on several post-retrieval query performance predictors for the initial query by building a linear regression model. The regression model itself has been optimized via genetic programming by intelligently exploring thousands of possible machine learning pipelines to find the best one for the data at hand.

The approach requires training data. Experiments on several train/test splits of standard TREC benchmark corpora have shown that even by using only 50–100 training queries, the method yields statistically significant improvements in MAP of 18–35% over the initial query, 7–11% over the positive-only feedback model with the best fixed number of pseudo-relevant documents, and up to 10% (5.5% on median) over the standard method of optimizing both most important PRF parameters (i.e. the initial query’s contribution/weight and the number of feedback documents) by exhaustive/grid search on the training set. Compared to state-of-the-art PRF methods from the recent literature, our method outperforms them by up to 21.0%, with an average improvement of 10%. Moreover, the method does not seem very sensitive to the selection of training queries, although it may benefit from an increased number of them. While the training phase may be computationally heavy, the prediction phase is quite fast and usable at query time. This is due to the choice of query performance predictors, which are easy and fast to compute.

A further analysis of the experimental results has shown that we are still far from the method’s effectiveness ceiling (in contrast to the standard method, which seems to have reached its saturation point), leaving ample room for further improvements in several directions, notably: tuning the query performance predictors’ parameters (we have merely used standard values recommended in the literature), adding more query performance predictors (potentially also pre-retrieval ones), and using more (and more balanced) training data. Additionally, based on our theoretical arguments, the method may benefit from using predictors for the number of relevant documents, but this is a challenging problem in its own right.

Additional improvements, extending beyond the explored conjecture, could come from optimizing both of the most important PRF parameters with the proposed process. The experimental results have shown that there may still be some benefit from retaining some contribution from the initial query, perhaps in terms of improved robustness rather than average MAP; nevertheless, we believe that this is because the proposed method’s potential has not yet been utilized in its entirety. In any case, optimizing two parameters is more difficult than optimizing just one, and may require super-linearly more training data. As a middle ground, one could assume a fixed contribution of the initial query, e.g. 20%, take it into account when training the regression model, and optimize the other parameter with our proposed method.