Full length article
Reprint of: Client interfaces to the Virtual Observatory Registry

https://doi.org/10.1016/j.ascom.2015.04.004Get rights and content

Abstract

The Virtual Observatory Registry is a distributed directory of information systems and other resources relevant to astronomy. To make it useful, facilities to query that directory must be provided to humans and machines alike. This article reviews the development and status of such facilities, also considering the lessons learnt from about a decade of experience with Registry interfaces. After a brief outline of the history of the standards development, it describes the use of Registry interfaces in some popular clients as well as dedicated UIs for interrogating the Registry. It continues with a thorough discussion of the design of the two most recent Registry interface standards, RegTAP on the one hand and a full-text-based interface on the other hand. The article finally lays out some of the less obvious conventions that emerged in the interaction between providers of registry records and Registry users as well as remaining challenges and current developments.

Introduction

In Demleitner et al. (2014a), henceforth Paper I, we described the design and maintenance of the Virtual Observatory (VO) Registry as a distributed information system. Conceptually, it is a collection of, by now, about 15 000 registry records. To give the Registry’s users–astronomers, the library community, or even the general public–access to this collection, facilities have to be provided that allow focused queries against it. This includes common bibliographic constraints (by author, title or abstract term, year, etc.), but also constraints specific to a registry mainly concerned with data services (e.g., supported protocols or query parameters, metadata of published tables). In the design of such facilities, several challenges have to be addressed:

  • 1.

    different users have very different expectations and requirements

  • 2.

    the underlying data collection (i.e., the set of registry records) is changing over time

  • 3.

    the underlying data structure is fairly complex, and evolves itself as new standards and techniques are introduced in the VO

  • 4.

    as many uses require only a small subset of the types of metadata contained, partial resource descriptions should be retrievable

  • 5.

    the total data set cannot efficiently be transferred to clients as a whole

  • 6.

    registry records are frequently authored by persons not entirely familiar with the data model, resulting in inconsistent quality.

In consequence, no single user interface to the Registry can be sufficient. Instead, the VO community designed client interfaces, i.e., network endpoints with rigorously defined behavior and semantics, designed for use by programs that then present the actual user interfaces to Registry data.

We will begin this paper with a brief review of the various client interfaces that are or were used in the VO (Section  2). In Section  3, we proceed to describe the use some selected clients make of these facilities and the ways they apply and expose information obtained from the registry. A major part of the paper, Section  4, is devoted to a thorough discussion of the Registry Relational Model (RegTAP for short), one of the two registry interfaces currently being developed and deployed in response to the deficiencies of previous standards. In Section  5, the other new-generation interface is described.

While laying out some common use cases of Registry data in Section  6 we also point out common query patterns. Section  7 concludes with some speculation about probable future developments.

In the following, we refer to common Registry standard texts by their abbreviated names as introduced in Paper I, and again the capitalized word “Registry” refers to the abstract concept, while concrete services are written in lower case (e.g., a “publishing registry”). Concepts from VOResource and its extensions are written in small caps.

Section snippets

History

Although only explicitly written down in 2011, the use cases collected on the IVOA wiki (IVOA Registry WG, 2011) outline some of the challenges faced by the designers of the first client interfaces to the registry in the mid-2000s—finding tables containing columns with certain physics, locating services implementing certain protocols, and the like.

While on the maintenance side of the registry the ecosystem around OAI-PMH (Open Archives Initiative, 2002) provided guidance for many technology

Registry use in clients

Many VO clients integrate Registry access, frequently without advertising the actual source of the data. Depending on the scope of the application, different parts of Registry metadata are used, and different presentations of this information appear appropriate. In the following, we look at Registry usage in a number of, we believe, representative applications, concluding with an in-depth look at TOPCAT’s use of the registry.

TAPHandle2 is a TAP client

The registry relational model

The Registry Relational Model–briefly called RegTAP for mainly technical reasons–is the successor to the Search method in RI1. It essentially defines a relational schema and rules to map VOResource records into this schema. Using TAP as an access protocol and ADQL as the query language, this is enough to completely define a client interface to the registry.

A sketch of this relational schema is given in Fig. 4. Although the authors first experimented with alternative structures that would have

Full-text based registry interface

In parallel with the relatively complex RegTAP interface, a successor to RI1’s KeywordSearch is also being developed. This full-text based registry addresses the difficulty of extracting information from the previous registry interface. As field values describing resources are mostly text, a full-text search engine such as the Apache Lucene library–as used in popular server components like ElasticSearch or Apache Solr–is suitable to index and search the contents of the registry. It was

Common registry queries

VOResource is a complex data model that sometimes offers multiple ways of expressing apparently very similar concepts. Driven by both registry record authoring practices and the queries employed by popular clients, some usage patterns have evolved that should be followed for successful Registry use. Other patterns are recommended for ease of use. We describe the patterns in RegTAP terms, but most would be equally applicable to endpoints speaking XQuery, and partly even to keyword-based services.

Open issues

Even after the introduction of RegTAP, Registry development is not completed. In addition to Registry extensions as new service and resource types are defined within the VO, several fields of work are currently actively being explored. We discuss them here as they delineate what the Registry should be doing but does not do so far, as well as to document approaches tried in the VO to problems that may similarly arise in other communities.

Conclusions

The VO Registry is an essential source of metadata about the services and data that can be used within the VO, and no non-trivial interaction with the VO can take place without using its discovery capabilities. Many VO clients embed Registry information and protocols in various forms.

By necessity, standardization of the Registry protocols occurred relatively early in the history of the VO. While standards on the server side have held out very well, the early standards on the client side have,

Acknowledgments

We thank Laurent Michel, Pierre Fernique, and Pedro Osuna for providing information on the Registry interfaces of TAPHandle, Aladin, and VOSpec.

This work was in part supported by the German Astrophysical Virtual Observatory GAVO, BMBF grant 05A11VH3.

References (37)

  • M. Demleitner et al.

    The virtual observatory registry

    Astron. Comput.

    (2014)
  • E. Kani-Zabihi et al.

    User perceptions of online public library catalogues

    Int. J. Inf. Manage.

    (2008)
  • T.P. Robitaille et al.

    Astropy: A community Python package for astronomy

    Astron. Astrophys.

    (2013)
  • Benson, K., Plante, R., Auden, E., Graham, M., Greene, G., Hill, M., Linde, T., Morris, D., O’Mullane, W., Rixon, G.,...
  • T. Boch et al.

    MOC—HEALPix Multi-Order Coverage Map

    (2014)
  • F. Bonnarel et al.

    The ALADIN interactive sky atlas. A reference tool for identification of astronomical sources

    Astron. Astrophys. Suppl.

    (2000)
  • Castro-Neves, M., Draper, P.W., 2014. SPLAT-VO: Spectral Analysis Tool for the Virtual Observatory. Astrophysics Source...
  • Demleitner, M., 2012. Towards registry interfaces 2, talk given at the Fall 2012 IVOA Interop, São Paulo. URL:...
  • Demleitner, M., 2013. News on RegTAP, talk given at the Fall 2013 IVOA Interop, Waikoloa, HI. URL:...
  • Demleitner, M., 2014. RegTAP—a new API to the VO Registry, poster presented at ADASS...
  • Demleitner, M., Dowler, P., Plante, R., Rixon, G., Taylor, M., 2012. TAPRegExt: a VOResource schema extension for...
  • M. Demleitner et al.

    IVOA Registry Relational Schema

    (2014)
  • Derriere, S., Gray, N., Mann, R., Preite Martinez, A., McDowell, J., McGlynn, T., Ochsenbein, F., Osuna, P., Rixon, G.,...
  • Dowler, P., Rixon, G., Tody, D., 2010. Table access protocol version 1.0. IVOA Recommendation, March. URL:...
  • P. Fernique et al.

    A bit of GLUe for the VO: Aladin experience

  • Graham, M., Demleitner, M., Dowler, P., Fernique, P., Laurino, O., Lemson, G., Louys, M., Salgado, J., 2013. UTypes:...
  • Graham, M., Plante, R., Tody, D., Fitzpatrick, M., 2014. PyVO: Python access to the Virtual Observatory, Astrophysics...
  • Harrison, P., 2011. Relational registry DM. IVOA Wiki page. URL:...
  • This article is a reprint of a previously published article. For citation purposes, please use the original publication details; Demleitner, 10C, pp. 88–98.

    View full text