Exploring Collections of Research Publications with Human Steerable AI

Authors:

Alberto González Martínez,

Billy Troy Wooton,

Nurit Kirshenbaum,

Dylan Kobayashi,

Jason Leigh

PEARC '20: Practice and Experience in Advanced Research Computing 2020: Catch the Wave

Pages 339 - 348

https://doi.org/10.1145/3311790.3396646

Published: 26 July 2020

Abstract

Understanding high-dimensional data sets is a complex task. Traditionally, this problem has been tackled with linear pipelines that rely on mathematical models and algorithms to summarize relationships and structure, producing a visual representation of the data in a collapsed, low-dimensional form. The main issue with these traditional pipelines is that they are driven solely by algorithms or models; without a human in the loop, they can limit sense-making by masking expected or known structure in the data. Textual data, such as that contained in research publications, is one example of unstructured, high-dimensional data: the raw text must first be converted to an abstract numeric representation that is itself high-dimensional.

In recent years, Semantic Interaction has become an interesting approach to enabling model steering in Visual Analytics systems, as it provides mechanisms with which to adjust the parameter space, explore data, and test hypotheses. To support this interaction modality, Semantic Interaction systems need to invert the computation of one or more mathematical models so that their pipelines become bidirectional. Most existing Semantic Interaction systems restrict themselves to linear models to make this bidirectionality possible. In this paper we propose an inexpensive neural encoder approach to performing backward and forward computations within Semantic Interaction pipelines for analyzing textual data. We show that this approach allows for the efficient "merging" of new instances into a previously trained model without retraining. It also provides a reverse link, allowing the parameters of a trained model to be affected by user interactions with the visual representation of the data. To demonstrate the usefulness of this approach we present the Zexplorer system, a tool for exploring large document collections of research papers with Semantic Interaction. The Zexplorer system is built as an extension to Zotero, a widely used open-source bibliography system.
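
The forward and backward computations described above can be illustrated with a small sketch. This is not the Zexplorer implementation, and the abstract does not specify the encoder's architecture. The class `TinyEncoder`, its one-hidden-layer design, the method names, and the learning rate are all hypothetical choices made for illustration. The forward pass places a new document in the 2-D layout without touching the trained parameters (the "merging" step). The `steer` method plays the role of the reverse link, nudging the parameters by gradient descent so the layout moves toward positions the user has dragged points to.

```python
import numpy as np


class TinyEncoder:
    """Hypothetical one-hidden-layer encoder: document vector -> 2-D position."""

    def __init__(self, n_features, n_hidden=8, n_out=2, seed=0):
        rng = np.random.default_rng(seed)
        self.W1 = rng.normal(scale=0.1, size=(n_features, n_hidden))
        self.b1 = np.zeros(n_hidden)
        self.W2 = rng.normal(scale=0.1, size=(n_hidden, n_out))
        self.b2 = np.zeros(n_out)

    def forward(self, X):
        """Forward computation: project documents into the 2-D layout."""
        h = np.tanh(X @ self.W1 + self.b1)
        return h @ self.W2 + self.b2

    def loss(self, X, target):
        """Mean squared distance between the layout and the target positions."""
        return float(np.mean((self.forward(X) - target) ** 2))

    def steer(self, X, target, lr=0.1, steps=200):
        """Backward computation: adjust parameters toward user-placed positions."""
        n = len(X)
        for _ in range(steps):
            h = np.tanh(X @ self.W1 + self.b1)
            y = h @ self.W2 + self.b2
            grad_y = (y - target) / n              # gradient at the output layer
            grad_h = grad_y @ self.W2.T            # back through the output weights
            grad_z = grad_h * (1.0 - h ** 2)       # back through tanh
            self.W2 -= lr * (h.T @ grad_y)
            self.b2 -= lr * grad_y.sum(axis=0)
            self.W1 -= lr * (X.T @ grad_z)
            self.b1 -= lr * grad_z.sum(axis=0)
        return self.loss(X, target)


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    docs = rng.random((6, 5))                      # stand-in document vectors
    enc = TinyEncoder(n_features=5)

    target = enc.forward(docs).copy()
    target[0] += np.array([1.0, -1.0])             # the user drags one document

    print("loss before steering:", enc.loss(docs, target))
    print("loss after steering: ", enc.steer(docs, target))

    new_doc = rng.random((1, 5))                   # merged without retraining
    print("new document position:", enc.forward(new_doc))
```

A linear projection would be simpler to invert, but the abstract's point is that a neural encoder need not be inverted analytically: the same gradient machinery used for training can carry user edits back to the parameters.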
