DOI: 10.1145/2254736.2254747

Integrating and querying source code of programs working on a database

Published: 20 May 2012

Abstract

Programs and the schema of the database they work on share complex data and control dependencies, which makes it difficult to modify the schema together with the many portions of source code that depend on it. In this paper, we address the problem of exploring and analyzing the dependencies between a program and a database's schema using keyword search techniques inside a database management system (DBMS). We present QDPC, a novel system that integrates source code and a database's schema within a DBMS and allows both to be queried flexibly. The integration focuses on obtaining the approximate matches that exist between source files (class, function, and variable names) and the database's schema (table names and column names), and then storing them in summarization tables inside the DBMS. These summarization tables are then analyzed with SQL queries to find matches related to a set of keywords provided by the user. The discovered matches can be analyzed further by computing aggregations over them, and more sophisticated analysis is possible by computing OLAP cubes. Our experiments show that the integration is efficient and supports complex analysis of the dependencies inside the DBMS, and that searching for data dependencies and building OLAP cubes can both be done efficiently. Our system opens up the possibility of using keyword search for software engineering applications.
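
The pipeline described in the abstract (approximate matching, a summarization table, keyword queries, aggregation) can be illustrated with a minimal sketch. This is not QDPC's implementation: Python's `difflib.SequenceMatcher` stands in for whatever approximate matching method the paper uses, SQLite stands in for the DBMS, and the table name `match_summary`, the sample identifiers, the file names, and the 0.7 threshold are all hypothetical.

```python
import difflib
import sqlite3

# Hypothetical schema elements: (kind, name).
SCHEMA_NAMES = [
    ("table", "customer"),
    ("column", "customer_id"),
    ("table", "orders"),
    ("column", "order_date"),
]

# Hypothetical identifiers extracted from source files: (file, kind, name).
SOURCE_IDENTIFIERS = [
    ("Billing.java", "class", "CustomerDao"),
    ("Billing.java", "variable", "customerId"),
    ("Report.java", "function", "loadOrders"),
    ("Report.java", "variable", "orderDate"),
    ("Util.java", "function", "parseArgs"),
]


def normalize(name):
    """Lowercase and drop underscores so customerId and customer_id compare equal."""
    return name.replace("_", "").lower()


def similarity(a, b):
    """Approximate match score in [0, 1] (an assumed stand-in metric)."""
    return difflib.SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def build_summary(conn, threshold=0.7):
    """Store every identifier/schema pair scoring at or above the threshold."""
    conn.execute(
        "CREATE TABLE match_summary ("
        "file TEXT, kind TEXT, identifier TEXT, "
        "schema_kind TEXT, schema_name TEXT, score REAL)"
    )
    rows = []
    for file, kind, ident in SOURCE_IDENTIFIERS:
        for schema_kind, schema_name in SCHEMA_NAMES:
            score = similarity(ident, schema_name)
            if score >= threshold:
                rows.append((file, kind, ident, schema_kind, schema_name, score))
    conn.executemany("INSERT INTO match_summary VALUES (?, ?, ?, ?, ?, ?)", rows)


def search(conn, keyword):
    """Keyword search over the stored matches using a parameterized LIKE."""
    pattern = f"%{keyword}%"
    return conn.execute(
        "SELECT file, identifier, schema_name FROM match_summary "
        "WHERE identifier LIKE ? OR schema_name LIKE ? "
        "ORDER BY file, identifier, schema_name",
        (pattern, pattern),
    ).fetchall()


def matches_per_file(conn):
    """Aggregate the matches: number of stored matches per source file."""
    return dict(
        conn.execute(
            "SELECT file, COUNT(*) FROM match_summary GROUP BY file"
        ).fetchall()
    )


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    build_summary(conn)
    for row in search(conn, "order"):
        print(row)
    print(matches_per_file(conn))
```

Grouping by additional columns such as `schema_kind` or `kind` gives further aggregation levels over the same table; a full OLAP cube would compute all such groupings, which this sketch does not attempt.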

Cited By

  • (2020) Querying Big Source Code. 2020 IEEE International Conference on Big Data (Big Data), pages 5682-5684. DOI: 10.1109/BigData50022.2020.9378481. Online publication date: 10-Dec-2020
  • (2012) Querying external source code files of programs connecting to a relational database. Proceedings of the 5th Ph.D. workshop on Information and knowledge, pages 9-16. DOI: 10.1145/2389686.2389689. Online publication date: 2-Nov-2012

Published In

KEYS '12: Proceedings of the Third International Workshop on Keyword Search on Structured Data
May 2012
78 pages
ISBN: 9781450311984
DOI: 10.1145/2254736
General Chairs: Ling Tok Wang, Ge Yu, Jiaheng Lu, Wei Wang
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee.

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. DBMS
  2. integration
  3. software maintainability
  4. source code

Qualifiers

  • Research-article

Conference

SIGMOD/PODS '12