skip to main content
10.1145/3366424.3383569acmconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article

The Positioning Matters: Estimating Geographical Bias in the Multilingual Record of Biographies on Wikipedia

Published: 20 April 2020 Publication History

Abstract

This article proposes that an appropriate assessment of the geographical bias in multilingual Wikipedia's content should consider not only the number of articles linked to places, but also their internal positioning –i.e. their location in different languages and their centrality in the network of references between articles–. This idea is studied empirically, systematically evaluating the geographic concentration in the biographical coverage of globally recognized individuals (those whose biographies are found in more than 25 language versions of Wikipedia). Considering the internal positioning levels of these biographies, only 5 countries account for more than 62% of Wikipedia's biographical coverage. In turn, the inequality in coverage between countries reaches very high levels, estimated with a Gini coefficient of .84 and a Palma ratio of 207. In all the tests carried out, the inclusion of the linguistic and/or relational positioning of the articles increases the estimate of inequality in biographical coverage. This suggests that previous estimates of geographical bias, which do not consider differences in internal positioning, have underestimated the degree of inequality in the distribution of information.

References

[1]
Gruwell, L. Wikipedia's politics of exclusion: Gender, epistemology, and feminist rhetorical (in) action. Computers and Composition 37, 117–131 (2015).
[2]
Klein, M., Gupta, H., Rai, V., Konieczny, P. & Zhu, H. Monitoring the Gender Gap with Wikidata Human Gender Indicators. in Proceedings of the 12th International Symposium on Open Collaboration 1–9 (2016).
[3]
3Shane-Simpson, C. & Gillespie-Lynch, K. Examining potential mechanisms underlying the Wikipedia gender gap through a collaborative editing task. Computers in Human Behavior 66, 312–328 (2017).
[4]
Hinnosaar, M. Gender inequality in new media: Evidence from Wikipedia. Journal of Economic Behavior & Organization 163, 262–276 (2019).
[5]
Graham, M., Hogan, B., Straumann, R. K. & Medhat, A. Uneven geographies of user-generated information: patterns of increasing informational poverty. Annals of the Association of American Geographers 104, 746–764 (2014).
[6]
Graham, M. Information geographies and geographies of information. New geographies (2015).
[7]
Roll, U. Using Wikipedia page views to explore the cultural importance of global reptiles. Biological conservation 204, 42–50 (2016).
[8]
Overell, S. E. & Rüger, S. View of the world according to Wikipedia: Are we all little Steinbergs? Journal of Computational Science 2, 193–197 (2011).
[9]
Graham, M., Hale, S. A. & Stephens, M. Geographies of the World's Knowledge. (2011).
[10]
Graham, M., De Sabbata, S. & Zook, M. A. Towards a study of information geographies:(im) mutable augmentations and a mapping of the geographies of information. Geo: Geography and environment 2, 88–105 (2015).
[11]
Yu, A. Z., Ronen, S., Hu, K., Lu, T. & Hidalgo, C. A. Pantheon 1.0, a manually verified dataset of globally famous biographies. Scientific data 3, 150075 (2016).
[12]
Beytía, P. & Schobin, J. Networked Pantheon: a Relational Database of Globally Famous People. Available at SSRN 3255401 (2018).
[13]
Beytía, P. & Müller, H.-P. Towards a Digital Reflexive Sociology: Exploring the Most Globally Disseminated Sociologists on Multilingual Wikipedia. (2019).
[14]
Brin, S. & Page, L. The anatomy of a large-scale hypertextual web search engine. Computer networks and ISDN systems 30, 107–117 (1998).
[15]
Page, L., Brin, S., Motwani, R. & Winograd, T. The PageRank citation ranking: Bringing order to the web. (1999).
[16]
Gini, C. Variabilità e mutabilità. Reprinted in Memorie di metodologica statistica (Ed. Pizetti E, Salvemini, T). Rome: Libreria Eredi Virgilio Veschi (1912).
[17]
Palma, J. G. Homogeneous middles vs. heterogeneous tails, and the end of the ‘inverted-U’: It's all about the share of the rich. development and Change 42, 87–153 (2011).
[18]
Palma, J. G. Do nations just get the inequality they deserve? The “Palma Ratio” re-examined. in Inequality and Growth: Patterns and Policy 35–97 (Springer, 2016).
[19]
Hellebrandt, T. & Mauro, P. The future of worldwide income distribution. Peterson Institute for International Economics Working paper (2015).
[20]
Darvas, Z. Some are more equal than others: new estimates of global and regional inequality. (IEHAS Discussion Papers, 2016).
[21]
Guereña, A. Unearthed: land, power, and inequality in Latin America. Oxfam International (2016).

Cited By

View all
  • (2025)Demographic disparity in Wikipedia coverage: a global perspectiveEPJ Data Science10.1140/epjds/s13688-025-00530-414:1Online publication date: 21-Feb-2025
  • (2023)Fairness in Socio-Technical Systems: A Case Study of WikipediaCollaboration Technologies and Social Computing10.1007/978-3-031-42141-9_6(84-100)Online publication date: 22-Aug-2023
  • (2022)Visibility layers: a framework for systematising the gender gap in Wikipedia contentInternet Policy Review10.14763/2022.1.162111:1Online publication date: 22-Mar-2022
  • Show More Cited By

Index Terms

  1. The Positioning Matters: Estimating Geographical Bias in the Multilingual Record of Biographies on Wikipedia
          Index terms have been assigned to the content through auto-classification.

          Recommendations

          Comments

          Information & Contributors

          Information

          Published In

          cover image ACM Conferences
          WWW '20: Companion Proceedings of the Web Conference 2020
          April 2020
          854 pages
          ISBN:9781450370240
          DOI:10.1145/3366424
          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Sponsors

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          Published: 20 April 2020

          Permissions

          Request permissions for this article.

          Check for updates

          Author Tags

          1. Wikipedia
          2. geo-tagged information
          3. geographical bias
          4. information inequality

          Qualifiers

          • Research-article
          • Research
          • Refereed limited

          Conference

          WWW '20
          Sponsor:
          WWW '20: The Web Conference 2020
          April 20 - 24, 2020
          Taipei, Taiwan

          Acceptance Rates

          Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

          Contributors

          Other Metrics

          Bibliometrics & Citations

          Bibliometrics

          Article Metrics

          • Downloads (Last 12 months)19
          • Downloads (Last 6 weeks)0
          Reflects downloads up to 02 Mar 2025

          Other Metrics

          Citations

          Cited By

          View all
          • (2025)Demographic disparity in Wikipedia coverage: a global perspectiveEPJ Data Science10.1140/epjds/s13688-025-00530-414:1Online publication date: 21-Feb-2025
          • (2023)Fairness in Socio-Technical Systems: A Case Study of WikipediaCollaboration Technologies and Social Computing10.1007/978-3-031-42141-9_6(84-100)Online publication date: 22-Aug-2023
          • (2022)Visibility layers: a framework for systematising the gender gap in Wikipedia contentInternet Policy Review10.14763/2022.1.162111:1Online publication date: 22-Mar-2022
          • (2022)Visibility layers: a framework for systematising the gender gap in Wikipedia contentInternet Policy Review10.14763/2022.1.162111:1Online publication date: 22-Mar-2022
          • (2022)"We Need a Woman in Music": Exploring Wikipedia's Values on Article PriorityProceedings of the ACM on Human-Computer Interaction10.1145/35551566:CSCW2(1-28)Online publication date: 11-Nov-2022
          • (2022)An Analysis of Content Gaps Versus User Needs in the Wikidata Knowledge GraphThe Semantic Web – ISWC 202210.1007/978-3-031-19433-7_21(354-374)Online publication date: 16-Oct-2022
          • (2021)A Polyvocal and Contextualised Semantic WebThe Semantic Web10.1007/978-3-030-77385-4_30(506-512)Online publication date: 31-May-2021

          View Options

          Login options

          View options

          PDF

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          HTML Format

          View this article in HTML Format.

          HTML Format

          Figures

          Tables

          Media

          Share

          Share

          Share this Publication link

          Share on social media