Semantic Attributes for Citation Relationships: Creation and Visualization

  • Conference paper
Metadata and Semantic Research (MTSR 2017)

Part of the book series: Communications in Computer and Information Science (CCIS, volume 755)

Abstract

This paper presents a method for processing the content of research papers in binary PDF format on the server side, which gives research information systems new features for citation content analysis. The method efficiently generates JSON versions of PDF documents, which allows easier recognition of papers' references, in-text references, citation context, etc. As a result, one can parse an extended set of citation data, including the location of citations in a research paper's structure, how often the same reference is mentioned, the style of reference mentioning, and so on. Based on these data we upgrade traditional citation relationships by adding semantic attributes. By formatting these semantic data according to the W3C Web Annotation Data Model and integrating the data with annotation tools, we visualize citation relationships, their semantic attributes, and related statistics as annotations for readers of PDF documents from a research information system.
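The pipeline outlined in the abstract (JSON version of a PDF, recognition of in-text references, semantic attributes, Web Annotation output) can be illustrated with a minimal sketch. It is not the paper's implementation: the `paper` structure, the numeric `[n]` citation style, the placeholder URL, and the helper names `find_mentions` and `build_annotation` are all assumptions made for illustration. Only the annotation keys (`@context`, `type`, `motivation`, `body`, `target`, `TextQuoteSelector`) come from the W3C Web Annotation Data Model.

```python
import re
from collections import Counter

# Hypothetical, simplified stand-in for a JSON version of a PDF document.
# A real converter output would be richer; this layout is an assumption.
paper = {
    "source": "https://example.org/paper.pdf",  # placeholder URL
    "sections": [
        {"title": "Introduction",
         "text": "Citation analysis is well studied [1]. Recent work [2, 3] extends it."},
        {"title": "Method",
         "text": "We follow the approach of [3] closely."},
    ],
}

# Matches numeric in-text references such as "[1]" or "[2, 3]" (assumed style).
IN_TEXT_REF = re.compile(r"\[(\d+(?:\s*,\s*\d+)*)\]")


def find_mentions(doc):
    """Collect one record per cited reference number, with its location."""
    mentions = []
    for section in doc["sections"]:
        text = section["text"]
        for match in IN_TEXT_REF.finditer(text):
            for number in re.findall(r"\d+", match.group(1)):
                mentions.append({
                    "reference": int(number),
                    "section": section["title"],
                    "marker": match.group(0),
                    # Up to 30 characters of citation context before the marker.
                    "prefix": text[max(0, match.start() - 30):match.start()],
                })
    return mentions


def build_annotation(doc, mention, frequency):
    """Express one mention as a W3C Web Annotation (plain dict, JSON-ready)."""
    return {
        "@context": "http://www.w3.org/ns/anno.jsonld",
        "type": "Annotation",
        "motivation": "linking",
        "body": {
            "type": "TextualBody",
            "format": "text/plain",
            "value": (f"Reference {mention['reference']}: cited in section "
                      f"'{mention['section']}', mentioned {frequency} time(s)"),
        },
        "target": {
            "source": doc["source"],
            "selector": {
                "type": "TextQuoteSelector",
                "exact": mention["marker"],
                "prefix": mention["prefix"],
            },
        },
    }


mentions = find_mentions(paper)
frequency = Counter(m["reference"] for m in mentions)
annotations = [build_annotation(paper, m, frequency[m["reference"]]) for m in mentions]
```

Here the section title and the mention count play the role of semantic attributes attached to each citation relationship. An annotation tool that reads Web Annotation JSON could then anchor each record to the matching marker in the document text.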

Notes

  1. C4O, the Citation Counting and Context Characterization Ontology. http://purl.org/spar/c4o.
  2. http://dspacecris.eurocris.org/bitstream/11366/526/1/CRIS2016_paper_40_Parinov.pdf.
  3. http://www.monash.edu.au/lls/llonline/writing/science/3.3.xml.
  4. https://github.com/citeccyr.
  5. http://www.ranepa.ru/eng/.
  6. https://socionet.ru/.
  7. http://repec.org/.
  8. https://github.com/mozilla/pdf.js.
  9. E.g. PDFMiner at http://www.unixuser.org/~euske/python/pdfminer/.
  10. https://nodejs.org/.
  11. https://github.com/modesty/pdf2json.
  12. The whole file is at https://socionet.ru/citmap/convertedPDF/test_cris2016.json.
  13. https://en.wikipedia.org/wiki/Percent-encoding.
  14. https://github.com/citeccyr/pdf-stream.
  15. The whole file is at https://socionet.ru/citmap/convertedPDF/cris2016.json.
  16. Available at https://socionet.ru/collection.xml?h=spz:neicon.
  17. The whole data is at http://no-xml.socionet.ru/citmap/outputs/cris2016-refs.xml.
  18. The whole file is at http://no-xml.socionet.ru/citmap/outputs/cris2016-intext.xml.
  19. https://github.com/hypothesis/pdf.js-hypothes.is.
  20. https://sparinov.wordpress.com/2016/05/27/comments-within-full-texts-in-pdf/.
  21. https://www.w3.org/TR/annotation-model/.

Acknowledgments

Part of this research (related to the annotation tool development) is funded by the Russian Foundation for Basic Research, grant 12-07-00518-a. Another part, the development of the approach for extracting citation content data with a focus on supercomputer simulation of interactions between agents and the research community environment, is funded by an RSF grant (project No. 14-18-01968).

Author information

Corresponding author

Correspondence to Sergey Parinov.

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Parinov, S. (2017). Semantic Attributes for Citation Relationships: Creation and Visualization. In: Garoufallou, E., Virkus, S., Siatri, R., Koutsomiha, D. (eds) Metadata and Semantic Research. MTSR 2017. Communications in Computer and Information Science, vol 755. Springer, Cham. https://doi.org/10.1007/978-3-319-70863-8_28

  • DOI: https://doi.org/10.1007/978-3-319-70863-8_28

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-70862-1

  • Online ISBN: 978-3-319-70863-8

  • eBook Packages: Computer Science, Computer Science (R0)
