Skip to main content

ToDI: A Taxonomy of Derived Indices

  • Conference paper
  • First Online:
Intelligent Technologies for Interactive Entertainment (INTETAIN 2020)

Abstract

Advancements in digital technology have eased the process of gathering, generating, and altering digital data at large scale. The sheer scale of the data necessitates the development and use of smaller secondary data structured as ‘indices,’ which are typically used to locate desired subsets of the original data, thereby speeding up data referencing and retrieval operations. Many variants of such indices exist in today’s database systems, and the subject of their design is well investigated by computer scientists. However, indices are examples of data derived from existing data; and the implications of such derived indices, as well as indices derived from other indices, pose problems that require careful ethical analysis. But before being able to thoroughly discuss the full nature of such problems, let alone analyze their ethical implications, an appropriate and complete vocabulary in the form of a robust taxonomy for defining and describing the myriad variations of derived indices and their nuances is needed. This paper therefore introduces a novel taxonomy of derived indices that can be used to identify, characterise, and differentiate derived indices.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    We use this particular example to point out that such an index can have surprisingly comical uses. This example, and terrible pun, was inspired by the works of British writer and comedian John Finnemore.

References

  1. Ahmed, I., Fayyaz, A., Shahzad, A.: PostgreSQL Developer’s Guide. Packt Publishing Ltd., Birmingham (2015)

    Google Scholar 

  2. Ball, A., Darlington, M.: Briefing paper: the adobe extensible metadata platform (XMP). UKOLN research organization (2007)

    Google Scholar 

  3. Choi, K.R., Kim, K.C.: T*-tree: a main memory database index structure for real time applications. In: Proceedings of 3rd International Workshop on Real-Time Computing Systems and Applications, pp. 81–88. IEEE (1996)

    Google Scholar 

  4. Dwyer, C.: Privacy in the age of google and facebook. IEEE Technol. Soc. Mag. 30(3), 58–63 (2011)

    Article  Google Scholar 

  5. Foote, K.D.: A brief history of metadata. Online (Data Varsity) (2019)

    Google Scholar 

  6. Garcia-Molina, H., Ullman, J.D., Widom, J.: Database System Implementation, vol. 672. Prentice Hall, Upper Saddle River (2000)

    Google Scholar 

  7. Gilliland, A.J.: Setting the stage. Introduction Metadata 2, 1–19 (2008)

    Google Scholar 

  8. Gitelman, L.: Raw Data is an Oxymoron. MIT Press, Cambridge (2013)

    Book  Google Scholar 

  9. Griffel, D.M., McIntosh, S.D.: Admins: a progress report. Technical report. Center for International Studies, Massachusetts, Cambridge (1967)

    Google Scholar 

  10. Heumann, K., Mewes, H.W.: The hashed position tree (HPT): a suffix tree variant for large data sets stored on slow mass storage devices. In: 3rd South American Workshop on String Processing, pp. 101–114 (1996)

    Google Scholar 

  11. Hey, T., Trefethen, A.: The data deluge: an e-science perspective. In: Grid Computing: Making the Global Infrastructure a Reality, pp. 809–824 (2003)

    Google Scholar 

  12. Japp, R.: The top-compressed suffix tree: a disk-resident index for large sequences. In: Proceedings of the Bioinformatics Workshop at the 21st Annual British National Conference on Databases (2004)

    Google Scholar 

  13. Lawrence, B., Lowry, R., Miller, P., Snaith, H., Woolf, A.: Information in environmental data grids. Philos. Trans. R. Soc. A: Math. Phys. Eng. Sci. 367(1890), 1003–1014 (2009)

    Article  Google Scholar 

  14. Lehman, T.J., Carey, M.J.: A study of index structures for main memory database management systems. Technical report, University of Wisconsin-Madison, Department of Computer Sciences (1985)

    Google Scholar 

  15. Orr, D.A., Sanchez, L.: Alexa, did you get that? Determining the evidentiary value of data stored by the Amazon® echo. Digit. Investig. 24, 72–78 (2018)

    Article  Google Scholar 

  16. Phoophakdee, B., Zaki, M.J.: Genome-scale disk-based suffix tree indexing. In: Proceedings of the 2007 ACM SIGMOD International Conference on Management of Data, pp. 833–844 (2007)

    Google Scholar 

  17. Riley, J.: Understanding metadata. National Information Standards Organization, Washington DC, United States, p. 23 (2017). http://www.niso.org/publications/press/UnderstandingMetadata.pdf

  18. Şah, M., Wade, V.: Automatic metadata mining from multilingual enterprise content. J. Web Semant. 11, 41–62 (2012)

    Article  Google Scholar 

  19. Schmidt, D.: Google data collection. Digital Content Next [Online] (2018)

    Google Scholar 

  20. Smith, J.R., Schirling, P.: Metadata standards roundup. IEEE Multimed. 13(2), 84–88 (2006)

    Article  Google Scholar 

  21. Voigt, P., Von dem Bussche, A.: The EU general data protection regulation (GDPR). A Practical Guide, 1st edn. Springer, Heidelberg (2017). https://doi.org/10.1007/978-3-319-57959-7

    Book  Google Scholar 

  22. Wheatley, H.B.: What is an Index?: A Few Notes on Indexes and Indexers, vol. 1. [for the Index Society]. Longmans, Green (1879)

    Google Scholar 

Download references

Acknowledgement

Many thanks are due to the departments of Mathematics and Computer Science (MCS) and Computer Science and Engineering (CSEN) for their continued support of the project. And to reviewers of ToDI’s drafts for their helpful comments without which this work could not have been improved.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Maria Joseph Israel .

Editor information

Editors and Affiliations

A Appendix

A Appendix

1.1 A.1 Taxonomy and Associated Descriptors

Term

Description

Term Name

Base Index

Label

Base Index

Definition

A base index is an immediate index of a data set

Comment

A base index is different from other derived indices in that it is base/source of all other derived index (indices). All the indices described in the related work section of this paper are examples of base indices. An example of a base index is for a book could be the table of contents, which is used to index the location where each chapter or section of the presented data in the book can be found

Type of Term

Base index

Term

Description

Term Name

Derived Index

Label

Derived Index

Definition

A derived index is an index that is derived from an existing index

Comment

Derived indices may be the result of structural or functional derivation, or a combination of the two

Type of Term

Derived Index

Term

Description

Term Name

Functional Property

Label

Functional Property

Definition

Functional property refers to the functionality and purpose of the index that is being derived from

Comment

A functional property is different from other properties of an index in that it is in some ways indexes or duplicates, summarizes, supplements, and/or combines existing indices’ functionality

Type of Term

Property

Term

Description

Term Name

Structural Property

Label

Structural Property

Definition

Structural property refers to the structural aspect of the index

Comment

A structural property is different from other properties of an index in that it is in some ways mimics the look and feel of the index or indices it is derived from

Type of Term

Property

Term

Description

Term Name

Copy Index

Label

Copy Index

Definition

A copy index is an index that duplicates a portion of an existing index

Comment

Copy index is different from Carbon copy index and from other derived index in that it copies some or all of an existing index’s functionality, structure or both

Type of Term

Derived index

Term

Description

Term Name

Carbon Copy Index

Label

Carbon Copy Index

Definition

A carbon copy index is an index that duplicates an existing index a hundred percent

Comment

Carbon Copy index is different from copy index and other derived index in that it is an exact copy of an existing index or indices

Type of Term

Derived index

Term

Description

Term Name

Meta Index

Label

Meta Index

Definition

An index of an index, meaning an index that is abstracted from the original index without revealing identifiable information, that may be doubly distanced from the original base index or other derived index

Comment

Meta index is different from other derived index in that it is meta of derived index. An example of a meta index is a bibliography of books and book chapters in a particular subject area. It is a meta index of indices

Type of Term

Derived index

Term

Description

Term Name

Abridged Index

Label

Abridged Index

Definition

An abridged index is a briefer version of an existing index

Comment

Abridged index is different from other derived index in that it is shorted version of a derived index. An example of a abridged index can be seen in books which poses two indices: one that only lists the chapters and another which lists the chapters along with every section and subsection of each chapter...

Type of Term

Derived index

Term

Description

Label

Fragment Index

Term Name

Fragment Index

Definition

A fragment index is an incomplete portion of an index

Comment

Fragment index is different from other derived index in that it is an incomplete derived index. An example of a fragment index is a table of contents of a book which is missing a page

Type of Term

Derived index

Term

Description

Label

Supplemental Index

Term Name

Supplemental Index

Definition

A supplemental index is an index which supplements an existing index

Comment

An example of a supplemental index in a book is seen in various paper copy textbooks which include several additional (not included on paper) chapters on either an included digital media such as a disk or the textbook’s/author’s/publisher’s website

Type of Term

Derived index

Term

Description

Label

Tangential Index

Term Name

Tangential Index

Definition

A tangential index is an index which indexes related data to data already indexed in an existing index

Comment

For example, a book word index, listing the pages upon which a word occurs, can be enhanced with a tangential index that adds the line number or numbers in that page wherein the word occurs

Type of Term

Derived index

Term

Description

Term Name

Intersectional Index

Label

Intersectional Index

Definition

An intersectional index is an index which combines parts of existing indices into a single index

Comment

For example, a table of contents in a book that is an editorial work which brings together parts of several books and lists several chapters, sections, and subsections of each in one uniform index is an intersectional index

Type of Term

Derived index

Term

Description

Term Name

Union/Aggregate Index

Label

Union/Aggregate Index

Definition

A union/aggregate index is an index which combines 2 or more indices in their entirety into a single index

Comment

Any book series which after completion is then republished in a single volume edition will contain a master index which brings together the indices of each of the books together in a single index

Type of Term

Derived index

Rights and permissions

Reprints and permissions

Copyright information

© 2021 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Israel, M.J., Shaghaghi, N., Amer, A. (2021). ToDI: A Taxonomy of Derived Indices. In: Shaghaghi, N., Lamberti, F., Beams, B., Shariatmadari, R., Amer, A. (eds) Intelligent Technologies for Interactive Entertainment. INTETAIN 2020. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 377. Springer, Cham. https://doi.org/10.1007/978-3-030-76426-5_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-76426-5_4

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-76425-8

  • Online ISBN: 978-3-030-76426-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics