short-paper

Pay "Attention" to Chart Images for What You Read on Text

Authors:
Chenyu Yang, Renmin University of China, Beijing, China (ORCID: 0009-0009-9726-8380)
Ruixue Fan, Renmin University of China, Beijing, China (ORCID: 0000-0002-9826-7349)
Nan Tang, QCRI, HBKU & HKUST (GZ), Doha, Qatar (ORCID: 0000-0003-2832-0295)
Meihui Zhang, Beijing Institute of Technology, Beijing, China (ORCID: 0000-0002-0752-9877)
Xiaoman Zhao, Renmin University of China, Beijing, China (ORCID: 0000-0002-7667-3813)
Ju Fan, Renmin University of China, Beijing, China (ORCID: 0000-0003-4729-9903)
Xiaoyong Du, Renmin University of China, Beijing, China (ORCID: 0000-0002-5757-9135)

SIGMOD '23: Companion of the 2023 International Conference on Management of Data, June 2023, Pages 111–114. https://doi.org/10.1145/3555041.3589714

Published: 05 June 2023

ABSTRACT

Data visualization is changing how we understand data by showing the whys, hows, and whats behind important patterns and trends in almost every domain, including academic papers, news articles, and financial reports. However, as data visualizations grow richer and more complex, it becomes harder for users, given a text description (e.g., "fewer teens say they attended school completely online (8%)"), to pinpoint where to look on a chart (e.g., a grouped bar chart).

In this demonstration paper, we present HiChart, a system for text-chart image highlighting: when a user selects a span of text, HiChart automatically analyzes the chart image (e.g., a JPEG or PNG file) and highlights the parts that are relevant to the span. Technically, HiChart combines three techniques. (1) Reverse-engineering visualizations: given a chart image, HiChart uses computer vision techniques to recover a visualization specification in the Vega-Lite language, together with the underlying dataset. (2) Visualization calibration by data tuning: HiChart calibrates the regenerated chart by tuning the recovered dataset through value perturbation. (3) Chart highlighting for a span: HiChart maps the span to the corresponding data cells and uses the built-in highlighting functions of Vega-Lite to highlight the chart.
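
The last step, mapping a text span to data cells and highlighting them, can be illustrated with a minimal Python sketch. This is not HiChart's implementation: the dataset values, the field names (`group`, `mode`, `percent`), the keyword-and-number matching heuristic, and the helper names `match_cells` and `highlighted_spec` are all assumptions made for illustration. The sketch assumes the dataset has already been recovered from the chart image, and it uses a Vega-Lite conditional encoding as one plausible way to dim everything except the matched cells.

```python
import json
import re

# Hypothetical recovered dataset for a grouped bar chart (values are invented).
ROWS = [
    {"group": "Teens", "mode": "Completely online", "percent": 8},
    {"group": "Teens", "mode": "In person", "percent": 80},
    {"group": "Parents", "mode": "Completely online", "percent": 12},
    {"group": "Parents", "mode": "In person", "percent": 75},
]

SPAN = "fewer teens say they attended school completely online (8%)"


def match_cells(span, rows, value_field):
    """Return indices of the rows that best match the span.

    Assumed heuristic: one point for each categorical value mentioned in the
    span, plus one point if the row's numeric value appears in the span.
    """
    text = span.lower()
    numbers = {float(n) for n in re.findall(r"\d+(?:\.\d+)?", text)}
    scores = []
    for row in rows:
        score = 0
        for key, val in row.items():
            if key == value_field:
                if float(val) in numbers:
                    score += 1
            elif str(val).lower() in text:
                score += 1
        scores.append(score)
    best = max(scores, default=0)
    if best == 0:
        return []
    return [i for i, s in enumerate(scores) if s == best]


def highlighted_spec(rows, hit_indices):
    """Build a Vega-Lite spec (as a dict) that dims all non-matching bars."""
    hits = set(hit_indices)
    values = [dict(row, _hit=(i in hits)) for i, row in enumerate(rows)]
    return {
        "$schema": "https://vega.github.io/schema/vega-lite/v5.json",
        "data": {"values": values},
        "mark": "bar",
        "encoding": {
            "x": {"field": "mode", "type": "nominal"},
            "xOffset": {"field": "group"},
            "y": {"field": "percent", "type": "quantitative"},
            "color": {"field": "group", "type": "nominal"},
            # Conditional encoding: full opacity for matched cells only.
            "opacity": {
                "condition": {"test": "datum._hit", "value": 1.0},
                "value": 0.25,
            },
        },
    }


hits = match_cells(SPAN, ROWS, value_field="percent")
spec = highlighted_spec(ROWS, hits)
print(json.dumps(spec, indent=2))
```

The printed JSON can be pasted into any Vega-Lite renderer. Marking rows with a boolean `_hit` field keeps the highlighting rule independent of the chart's field names, which is a design convenience of this sketch rather than something the paper specifies.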

Supplemental Material

Presentation-HiChart.mp4 (mp4, 258.3 MB)

Index Terms

1. Human-centered computing
  1. Visualization

Published in
SIGMOD '23: Companion of the 2023 International Conference on Management of Data
June 2023
330 pages
ISBN: 9781450395076
DOI: 10.1145/3555041
General Chairs: Sudipto Das (Amazon Web Services, USA), Ippokratis Pandis (Amazon Web Services, USA)
Program Chairs: K. Selçuk Candan (Arizona State University, USA), Sihem Amer-Yahia (CNRS, Université Grenoble Alpes, France)
Copyright © 2023 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
chart highlighting
data extraction
data visualization
Qualifiers
- short-paper
Acceptance Rates
Overall Acceptance Rate: 785 of 4,003 submissions, 20%