ABSTRACT
Following the NoSQL (Not Only SQL) movement, more research work and applications are looking towards graph databases for their dynamic schema and natural representation of complex data1. In order to access/ search graph data in a single source, a number of search methods have been proposed in the literature. These methods range from graph query languages, pattern queries, template/form based search, to keyword search. One key challenge of these methods is the expressiveness and ease of use trade-off for query formulation. Formulating queries becomes more challenging when querying multiple heterogeneous data sources and when users are unaware of the structure of the underlying data. This paper reviews various methods proposed in the literature to query graph modeled data in two different settings; namely, single source and multiple heterogeneous data sources. Furthermore, the paper proposes ConteSaG, a technique for transparently querying multiple heterogeneous data sources. ConteSaG employs graph database to represent data residing in local sources with no need to create complex global schema or even to upfront integrate all data in a central source. Moreover, ConteSaG provides a context-based keyword search over the graph representations. Context-based keyword search allows users to search multiple data sources by determining the context of search terms without the need to have complete knowledge about the structure of data in the local sources or writing queries in a specific query language.
- Mike Buerli, The Current State of Graph Databases. (2012)Google Scholar
- Neo4j, http://neo4j.org (2016).Google Scholar
- H. He, Querying and mining graph databases. Ph.D. Thesis, UCSB (2007)Google Scholar
- E. Prud'hommeaux and A. Seaborne. SPARQL query language for RDF. W3C, URL: http://www.w3.org/TR/rdf-sparql-query/ (Document Status Update, 26 March 2013) (last visit: 2017).Google Scholar
- R. H. Guting. GraphDB: Modeling and querying graphs in databases. In VLDB Conference, pages 297--308 (1994)Google Scholar
- Mariano P. Consens, Alberto O. Mendelzon, GraphLog: a visual formalism for real life recursion, Proceedings of the ninth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems, pp.404--416 (1990)Google ScholarDigital Library
- X. Yan, P.S. Yu, and J. Han. Graph indexing; a frequent structure-based approach. In SIGMOD, 2004.Google ScholarDigital Library
- D. Shasha, J.T.L. Wang and R. Giugno. Algorithmics and applications of tree and graph searching In PODS, (2002)Google Scholar
- Federica M., Riccardo M., Giorgio V., and Wilma P. Flexible Query Answering on graph-modeled Data. ACM, EDBT (2009)Google Scholar
- R. De Virgilio A. Maccioni, and R. Torlone. Approximate querying of rdf graphs via path alignment. Distributed and Parallel Database, pages 1--27, (2014)Google Scholar
- YuanyuanTian. Querying Graph Databases. Thesis (2008)Google Scholar
- B.P. Kelley, B. Yuan, f. Lewitter, R. Sharan, B. R. Stockwel, and T. Ideker. Pathblast: a tool for aligning of protein interaction networks. Nucleic Acids Res., pages 83--88,(2004). Google ScholarCross Ref
- M. Koyuturk, A. greme, and W. Szpenkowski. Pairwise local alignment of protein interaction networks guided by models of evaluation. In International conference on research in computational moleculer biology, pages 48--56, (2005)Google ScholarDigital Library
- R. Sharan. Conserved Patterns of Protein Interaction in Multiple species. PNAS, 102: 1974--1979, (2005) Google ScholarCross Ref
- M. Jayapandian and H. V. Jagadish. Automated Creation of a Forms-based Database Query Interface. In Proc. VLDB, Pages 695--709, (2008) Google ScholarDigital Library
- Y. Papakonstantinou, M. Petropoulos, and V. Vassalos. QURSED: Quering and Reporting Semi-structured Data. In Proc. SIGMOD, Pages 192--203, (2002) Google ScholarDigital Library
- A. Zouzies, M. Vlachos, and V. Hristidis. Templated Search over Relational Databases. ACM, November (2014)Google ScholarDigital Library
- Yunyao Li, Huahai Yang, and H. V. Jagadish. NaLIX: A generic natural language search environment for XML data. ACM Trans. Database Syst., 32(4), (2007) Google ScholarDigital Library
- A.M. Popesc, O. Etzioni, and H. Kautz. Towards a theory of natural language interfaces to databases. In Proc. IUI, Pages 149--157, (2003)Google Scholar
- F. Li and H. V. Jagadish. NaLIR: An interactive natural language interface for querying relational databases. In SIGMOD, (2014)Google ScholarDigital Library
- M. Kargar, A. An, N. Cercone, P.Godfrey, JJ.Szlichta, and X. Yu. MeanKS: Meaningful keyword search in relational databases with complex schema. In SIGMOD, (2014)Google ScholarDigital Library
- R. De Virgilio, A. Maccioni, and R. Torlone. Graph driven exploration of relational databases for efficient keyword search. In GraphQ, pages 208--2015, (2014)Google Scholar
- Z. Zeng, Z. Bao, M. Li Lee, and T. Wang Ling. Towards an interactive keyword search over relational databases. ACM 978-1-4503-3473-0/15/05, pages 259--262, (2015) Google ScholarDigital Library
- G. Bhalotia, A. Hulgeri, C. Nakhe, S. Chakrabarti, and S. Sudarshan: Keyword searching and browsing in databases using BANKS. In : Proc. 18th Int. Conf. on Data Engineering, pp. 431--440 (2002) Google ScholarCross Ref
- V. Kacholia, S. Pandit, S. Chakrabarti, S. Sudarshan, R. Desai, and H. Karambelkar: Bidirectional expansion for keyword search on graph databases. In: VLDB, pp. 505--516 ( 2005)Google ScholarDigital Library
- H. He, H. Wang, J. Yang, and P. Yu: BLINKS: Ranked keyword searches on graphs. In: Proceedings of the 2007 ACM SIGMOD international conference on Management of data, pp. 305--316 (2007) Google ScholarDigital Library
- Guoliang Li, JianhuaFeng, Feng Lin, Lizhu Zhou: Progressive Ranking for Efficient Keyword Search over Relational Databases. Proceedings of the 25th British national conference on Databases: Sharing Data, Information and Knowledge, pp. 193 -- 197(2008) Google ScholarDigital Library
- L. Qin, J. X. Yu, L. Chang, and Y. Tao.: Querying communities in relational databases. In: Proc. 25th Int. Conf. on Data Engineering, PP. 724--735 (2009). Google ScholarDigital Library
- Guoliang Li, Beng Chin Ooi, JianhuaFeng, JianyongWang, and Lizhu Zhou.: EASE: an effective3-in-1 keyword search method for unstructured, semi-structured and structured data. In: Proc.2008 ACM SIGMOD Int. Conf. On Management of Data, PP. 903--914(2008).Google Scholar
- Dalvi, B.B., Kshirsagar, M. Sudarshan, S.: Keyword search on external memory data graphs. PVLDB 1(1), pages 1189--1204 (2008)Google Scholar
- D. Wang, L. Zou, W. Pan and D. Zhao. Keyword Graph: Answering Keyword Search over Large Graphs. In Springer, pages 635--649 (2012)Google ScholarCross Ref
- Kasneci, G., Ramanath, M., Sozio, M., Suchanek, F.m., Weikum, G.: Star: Steiner tree approximation in relationship graphs. In ICDE, pages 868--879 (2009)Google Scholar
- Jaehui Park and Sang goo Lee. Keyword search in relational databases. Knowl. Inf. Syst., 26(2): pages 175--193, (2011) Google ScholarDigital Library
- Li, Guoliang; Feng, Jianhua; Zhou, Xiaofang; Wang, Jianyong: Providing built-in keyword search capabilities in RDBMS. In: VLDB, pp. 1--19 (2011)Google Scholar
- S. Agrawal, S. Chaudhuri, and G. Das: DBXplorer: A System for keyword Search over Relational Databases. In: ICDE, pp. 5--16 (2002) Google ScholarDigital Library
- V. Hristidis and Y. Papakonstantinou: DISCOVER: Keyword search in relational databases. In: Proc. 28th Int. Conf. On Very Large Data Bases, pp. 670--681 (2002) Google ScholarCross Ref
- V. Hristidis, L. Gravano, Y. Papakonstantinou: Efficient IR-Style Keyword Search over Relational Databases. In: VLDB, pp 850--861 (2003) Google ScholarCross Ref
- G. Li J. Wang, L. Zhou. Efficient keyword search for valuable LCAs over xml documents. In proceedings of the 16th ACM Conference on information and knowledge management, pp. 31--40, (2007)Google Scholar
- Yu Xu and YannisPapakonstantinou. Efficient Keyword Search for Smallest LCAs in XML Databases. In FatmaÖzcan, editor, Proceed- ings of the ACM SIGMOD International Conference on Management of Data, Baltimore, Maryland, USA, June 14--16, 2005, pages 527--538,( 2005)Google Scholar
- Yu Xu and YannisPapakonstantinou. Efficient LCA based keyword search in XML data. In Alfons Kemper, Patrick Valduriez, NoureddineMouaddib, Jens Teubner, MokraneBouzeghoub, Volker Markl, Laurent Amsaleg, and IoanaManolescu, editors, EDBT 2008, 11th International Conference on Extending Database Technology, Nantes, France, March 25--29, 2008, Proceedings, volume 261 of ACM International Conference Proceeding Series, pages 535--546. ACM, 2008.Google Scholar
- Luiz Gomes Jr. and Andr'eSantanch' The Web Within: Leveraging Web Standards and Graph Analysis to Enable Application-Level Integration of Institutional Data. Springer-Verlag Berlin Heidelberg 2015, pp. 26--54(2015).Google Scholar
- MayssamSayyadian, HieuLeKhac, AnHai Doan, and Luis Gravano. Efficient Keyword Search Across Heterogeneous Relational Databases. In Proceedings of the 23rd International Conference on Data Engineering,ICDE 2007, April 15--20, 2007, The Marmara Hotel, Istanbul, Turkey, pages 346--355, (2007)Google Scholar
- Geist, Ingolf: Keyword Search across Distributed Heterogeneous Structured Data Sources Dissertation, Otto-von-Guericke-Universität Magdeburg, (2012)Google Scholar
- ChavanAparna, SuvarnaBangar: Review on KeywordSeaech over Relational Databases. International Journal of Emerging Technology and Advanced Engineering, Volume 4, Issue 11 (2014).Google Scholar
Index Terms
- ConteSaG: context-based keyword search over multiple heterogeneous graph-modeled data
Recommendations
3se: a semi-structured search engine for heterogeneous data in graph model
CIKM '09: Proceedings of the 18th ACM conference on Information and knowledge managementAs the ubiquitous interplay of structured, semi-structured and unstructured data from different sources, neither DB-style structured query requiring knowledge of full schema and complex language, nor IR-style keyword search ignoring latent structures, ...
S2phere: Semi-Supervised Pre-training for Web Search over Heterogeneous Learning to Rank Data
KDD '23: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data MiningWhile Learning to Rank (LTR) models on top of transformers have been widely adopted to achieve decent performance, it is still challenging to train the model with sufficient data as only an extremely small number of query-webpage pairs could be annotated ...
Considering User Intention in Differential Graph Queries
Empty answers are a major problem by processing pattern matching queries in graph databases. Especially, there can be multiple reasons why a query failed. To support users in such situations, differential queries can be used that deliver missing parts ...
Comments