Skip to main content
Log in

Channel retrieval: finding relevant broadcasters on Telegram

  • Original Paper
  • Published:
Social Network Analysis and Mining Aims and scope Submit manuscript

Abstract

Mobile broadband (3G and 4G) has remarkably influenced people’s lives. For instance, the impacts of this technology are evident in transportation, education and messaging. With the advent of this technology, a new generation of messengers offering instant messaging is available to people. One of them is Telegram that comes with new features; one of them is broadcasting messages under the name of “channel.” In this paper, we introduce the channel retrieval problem which aims to find a sorted list of related channels to a user query. This problem is first modeled to the classic information retrieval problems (expert finding and blog retrieval), but since there’s a vocabulary gap between the user query and the published messages in the channels, two query expansion methods for enhancing the performance are proposed. In this paper, a dataset is generated for the channel retrieval, which is publicly available for other researchers. Our experiments on this dataset show that using a semantic approach for query expansion can enhance channel retrieval performance.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1

Source by https://telegram.org/blog/channels

Fig. 2

Source by https://telegram.org/blog/replies-mentions-hashtags

Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

Notes

  1. Instant Messaging Statistics Report, 2019–2023, February 2019 The Radicati Group, Inc.

  2. The Statistics Portal, Most popular global mobile messenger apps as of April 2018, based on the number of monthly active users (in millions), https://www.statista.com/statistics/258749/most-popular-global-mobile-messenger-apps/.

  3. 200,000,000 Monthly Active Users, https://telegram.org/blog/200-million.

  4. http://www.businessinsider.com/how-facebook-makes-money-according-to-mark-zuckerberg-2018-4.

  5. https://www.theguardian.com/technology/2010/apr/13/twitter-advertising-google.

  6. https://telegram.org/faq.

  7. https://lucene.apache.org/.

  8. https://radimrehurek.com/gensim/.

  9. https://github.com/usnistgov/trec_eval.

References

  • Albishre K, Li Y, Xu Y (2017) Effective pseudo-relevance for microblog retrieval. In: ACSW. ACM, pp 51:1–51:6

  • Al-Taie M, Kadry S, Adekunle OI (2018) Understanding expert finding systems: domains and techniques. Soc Netw Anal Min 8:57:1–57:9

    Article  Google Scholar 

  • Anglano C, Canonico M, Guazzone M (2017) Forensic analysis of telegram messenger on android smartphones. Digit Investig 23:31–49

    Article  Google Scholar 

  • Azad HK, Deepak A (2019) Query expansion techniques for information retrieval: a survey. Inf Process Manag 56(5):1698–1735

    Article  Google Scholar 

  • Balog K, Azzopardi L, de Rijke M (2009) A language modeling framework for expert finding. Inf Process Manag 45:1–19

    Article  Google Scholar 

  • Bandyopadhyay A, Ghosh K, Majumder P, Mitra M (2012) Query expansion for microblog retrieval. IJWS 1:368–380

    Article  Google Scholar 

  • Blei DM (2012) Probabilistic topic models. Commun ACM 55:77–84

    Article  Google Scholar 

  • Chy AN, Ullah MZ, Aono M (2019) Query expansion for microblog retrieval focusing on an ensemble of features. J Inf Process 27:61–76

    Google Scholar 

  • Çıtlak O, Dörterler M, Alper İ Doğru (2019) A survey on detecting spam accounts on twitter network. Soc Netw Anal Min 9(1:35

    Article  Google Scholar 

  • Cotelo JM, Cruz FL, Troyano JA (2014) Dynamic topic-related tweet retrieval. JASIST 65:513–523

    Google Scholar 

  • Cui W, Junping D, Wang D, Yuan X, Kou F, Zhou L, Zhou N (2019) Short text analysis based on dual semantic extension and deep hashing in microblog. ACM Trans Intell Syst Technol (TIST) 10(4):1–24

    Article  Google Scholar 

  • Dickerson JP, Kagan V, Subrahmanian VS (2014) Using sentiment to detect bots on twitter: are humans more opinionated than bots? In: ASONAM. IEEE Computer Society, pp 620–627

  • Elmongui HG, Mansour R, Morsy H, Khater S, El-Sharkasy A, Ibrahim R (2015) Trupi: Twitter recommendation based on users’ personal interests. In: CICLing (2), Lecture Notes in Computer Science, vol 9042. Springer, pp 272–284

  • Feynman R, Vernon F Jr (1963) The theory of a general quantum system interacting with a linear dissipative system. Ann Phys 24:118–173

    Article  MathSciNet  Google Scholar 

  • Gharebagh SS, Rostami P, Neshati M (2018) T-shaped mining: a novel approach to talent finding for agile software teams. ECIR Lect Notes Comput Sci 10772:411–423

    Article  Google Scholar 

  • Gorrab A, Kboubi F, Grand BL, Ghézala HB (2017) New hashtags’ weighting schemes for hashtag and user recommendation on twitter. In: AICCSA. IEEE, pp 564–570

  • Gregorio J, Vicente AG, Alarcos B (2017) Forensic analysis of telegram messenger for windows phone. Digit Investig 22:88–106

    Article  Google Scholar 

  • Gupta P, Satuluri V, Grewal A, Gurumurthy S, Zhabiuk V, Li Q, Lin JJ (2014) Real-time twitter recommendation: online motif detection in large dynamic graphs. PVLDB 7:1379–1380

    Google Scholar 

  • Gupta V, Hewett R (2017) Harnessing the power of hashtags in tweet analytics. In: BigData. IEEE, pp 2390–2395

  • Gurini DF, Gasparetti F, Micarelli A, Sansonetti G (2018) Temporal people-to-people recommendation on social networks with sentiment-based matrix factorization. Fut Gener Comput Syst 78:430–439

    Article  Google Scholar 

  • Han Z, Qiao W, Cui S, Kong L (2016) Time-based microblog search system. In: ICYCSEE (2), Communications in Computer and Information Science, vol 624. Springer, pp 226–228

  • Hashemi SH, Neshati M, Beigy H (2013) Expertise retrieval in bibliographic network: a topic dominance learning approach. In: Proceedings of the 22nd ACM international conference on conference on information & knowledge management, CIKM ’13. ACM, pp 1117–1126

  • Java A, Song X, Finin T, Tseng B (2007) Why we twitter: understanding microblogging usage and communities. In: WebKDD/SNA-KDD ’07: proceedings of the 9th WebKDD and 1st SNA-KDD 2007 workshop on web mining and social network analysis. ACM, pp 56–65

  • Karimzadehgan M, Zhai C (2010) Estimation of statistical translation models based on mutual information for ad hoc information retrieval. In: SIGIR. ACM, pp 323–330

  • Keikha M, Crestani F (2012) Linguistic aggregation methods in blog retrieval. Inf Process Manag 48:467–475

    Article  Google Scholar 

  • Khennak I, Drias H (2016) A firefly algorithm-based approach for pseudo-relevance feedback: application to medical database. J Med Syst 40:240

    Article  Google Scholar 

  • Kim Y, Shim K (2014) Twilite: a recommendation system for twitter using a probabilistic model based on latent Dirichlet allocation. Inf Syst 42:59–77

    Article  Google Scholar 

  • Kim J, Yun U, Pyun G, Ryang H, Lee G, Yoon E, Ryu KH (2015) A blog ranking algorithm using analysis of both blog influence and characteristics of blog posts. Cluster Comput 18:157–164

    Article  Google Scholar 

  • Li H, Xu J (2014) Semantic matching in search. Found Trends Inf Retr 7:343–469

    Article  Google Scholar 

  • Manning CD, Raghavan P, Schütze H (2008) Introduction to information retrieval. Cambridge University Press, Cambridge

    Book  Google Scholar 

  • McHugh ML (2012) Interrater reliability: the kappa statistic. Biochem Med 22(3):276–282

    Article  MathSciNet  Google Scholar 

  • Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: NIPS, pp 3111–3119

  • Myers SA, Sharma A, Gupta P, Lin JJ (2014) Information network or social network? The structure of the twitter follow graph. In: 23rd International world wide web conference, WWW ’14, Seoul, Republic of Korea, April 7–11, 2014, Companion Volume, pp 493–498

  • Neshati M (2017) On early detection of high voted q&a on stack overflow. Inf Process Manag 53:780–798

    Article  Google Scholar 

  • Neshati M, Beigy H, Hiemstra D (2014) Expert group formation using facility location analysis. Inf Process Manag 50:361–383

    Article  Google Scholar 

  • Neshati M, Fallahnejad Z, Beigy H (2017) On dynamicity of expert finding in community question answering. Inf Process Manag 53:1026–1042

    Article  Google Scholar 

  • Nobari AD, Gharebagh SS, Neshati M (2017a) Skill translation models in expert finding. In: SIGIR. ACM, pp 1057–1060

  • Nobari AD, Reshadatmand N, Neshati M (2017b) Analysis of telegram, an instant messaging service. In: CIKM. ACM, pp 2035–2038

  • Nobari AD, Neshati M, Gharebagh SS (2020) Quality-aware skill translation models for expert finding on StackOverflow. Inf Syst 87:101413

    Article  Google Scholar 

  • Ponte JM, Croft WB (1998) A language modeling approach to information retrieval. In: SIGIR. ACM, pp 275–281

  • Qader SA, Abbas AR (2019) Survey of user to user recommendation system in online social networks. Eng Technol J 37(10 A):422–428

    Google Scholar 

  • Rath B, Gao W, Ma J, Srivastava J (2018) Utilizing computational trust to identify rumor spreaders on twitter. Soc Netw Anal Min 8(1):64

    Article  Google Scholar 

  • Rey MCN, García XL, Lozano-Aguiar L (2017) Instant messaging networks as a new channel to spread the news: Use of whatsapp and telegram in the Spanish online media of proximity. In: WorldCIST (3), Springer, Advances in intelligent systems and computing, vol 571, pp 64–72

  • Rostami P, Neshati M (2019) T-shaped grouping: expert finding models to agile software teams retrieval. Expert Syst Appl 118:231–245

    Article  Google Scholar 

  • Roy D, Paul D, Mitra M, Garain U (2016) Using word embeddings for automatic query expansion. arXiv preprint arXiv:160607608

  • Sriram B, Fuhry D, Demir E, Ferhatosmanoglu H, Demirbas M (2010) Short text classification in twitter to improve information filtering. In: SIGIR. ACM, pp 841–842

  • Susanka T, Kokes J (2017) Security analysis of the telegram IM. In: ROOTS, ACM, Proceedings of the 1st reversing and offensive-oriented trends symposium

  • Wang Z, Zhang M (2015) Feedback model for microblog retrieval. DASFAA (1), Lecture notes in computer science. Springer, pp 529–544

  • Wang Y, Huang H, Feng C (2019) Query expansion with local conceptual word embeddings in microblog retrieval. IEEE Trans Knowl Data Eng. https://doi.org/10.1109/TKDE.2019.2945764

    Article  Google Scholar 

  • Xu K, Zheng X, Cai Y, Min H, Gao Z, Zhu B, Xie H, Wong TL (2018) Improving user recommendation by extracting social topics and interest topics of users in uni-directional social networks. Knowl-Based Syst 140:120–133

    Article  Google Scholar 

  • Xu K, Cai Y, Min H, Zheng X, Xie H, Wong TL (2017) Uis-lda: a user recommendation based on social connections and interests of users in uni-directional social networks. In: WI. ACM, pp 260–265

  • Yang XS (2010) Nature-inspired metaheuristic algorithms, 2nd edn. Luniver Press, Bristol

    Google Scholar 

  • Yusoff MN, Dehghantanha A, Mahmod R (2017) Network traffic forensics on firefox mobile OS: Facebook, twitter and telegram as case studies. CoRR arXiv:1706.08056 [abs]

  • Zhai C, Lafferty JD (2004) A study of smoothing methods for language models applied to information retrieval. ACM Trans Inf Syst 22:179–214

    Article  Google Scholar 

  • Zingla MA, Latiri C, Slimani Y (2016) Short query expansion for microblog retrieval. KES Procedia Comput Sci 96:225–234

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mahmood Neshati.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Jalilvand, A., Neshati, M. Channel retrieval: finding relevant broadcasters on Telegram. Soc. Netw. Anal. Min. 10, 23 (2020). https://doi.org/10.1007/s13278-020-0629-z

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s13278-020-0629-z

Keywords

Navigation