Relevance Ranking for Real-Time Tweet Search

Authors: Yan Xia, Yu Sun, Tian Wang, Juan Caicedo Carvajal, Jinliang Fan, Bhargav Mangipudi, Lisa Huang, Yatharth Saraf

CIKM '20: Proceedings of the 29th ACM International Conference on Information & Knowledge Management

Pages 2829 - 2836

https://doi.org/10.1145/3340531.3412743

Published: 19 October 2020

Abstract

Relevance ranking is a key component of many search engines, including the Tweet search engine at Twitter. Users often use Tweet search to discover live discussions and different voices on trending topics or recent events. Tweet search is thus unique in its focus on real-time content, where both the retrieved content and the queries change drastically on an hourly basis. Another important property of Tweet search is that its relevance ranking takes social endorsements from other users into account, e.g., "likes" and "retweets", rather than relying mainly on clicks as implicit feedback. The relevance ranking of Tweet search is also subject to strict latency constraints because, every second, a large number of Tweets are posted and indexed while tens of thousands of queries are issued to search posted Tweets. Considering these properties and constraints, we present a relevance ranking system for Tweet search at Twitter that addresses all of these challenges. We first discuss the formation of the relevance ranking pipeline, which consists of a series of ranking models. We then present the methodology for training the models and the various groups of features we use, including real-time and personalized features. We also investigate approaches to achieving unbiased model training and to building automatic online tuning of system parameters. Experiments using online A/B testing demonstrate the effectiveness of the proposed approaches, and the proposed relevance ranking system has been deployed in production for more than three years.

Supplementary Material

MP4 File (3340531.3412743.mp4)

Relevance ranking is a key component of the Tweet search engine at Twitter. Tweet search is unique in that it focuses on real-time content, where both the retrieved content and the queries change drastically on an hourly basis. Another property of Tweet search is that it takes social endorsements from users into account, rather than relying on clicks as implicit feedback. The relevance ranking of Tweet search is also subject to strict latency constraints because of the high volume of Tweets and queries it receives. Considering these properties and constraints, we present a relevance ranking system for Tweet search at Twitter. We first discuss the formation of the relevance ranking pipeline, and then the methodology for training models and the features we use. We also investigate approaches to achieving unbiased model training and to building automatic online tuning of system parameters. This relevance ranking system has been deployed in production for more than three years.


