Abstract
In vertical search engine research, many works have been reported. But most of them focus on its key issues such as crawling, extraction, and query and few of them give a total solution for building a complete vertical search engine from scratch in a systematic method. To address this issue, we propose a model-driven method and its supporting tool SESQ. Based on a user defined ER schema for a target domain, the tool can help to build a complete search engine by integrating tasks of crawling, extraction, data management and query within one unified framework.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Nie, Z., Zhang, Y., Wen, J.R., Ma, W.Y.: Object-Level Ranking: Bringing Order to Web Objects. In: WWW 2005 (2005)
Chakrabarti, S., Berg, M., Dom, B.: Focused Crawling: a new Approach to Topic-Specific Web Resource Discovery. In: WWW 1999 (1999)
Ester, M., Kriegel, H.P., Schubert, M.: Accurate and Efficient Crawling for Relevant Websites. In: VLDB 2004 (2004)
Crescenzi, V., Mecca, G., Merialdo, P.: Automatic Web Information Extraction in the RoadRunner System. In: Workshop DASWIS of ER 2001 (2001)
Zhu, J., Nie, Z., Wen, J.R., Zhang, B., Ma, W.Y.: Simultaneous record detection and attribute labeling in web data extraction. In: SIGKDD 2006 (2006)
Zhao, H., Meng, W., Wu, Z., Raghavan, V., Yu, C.: Fully Automatic Wrapper Generation for Search Engines. In: WWW 2005 (2005)
Lin, L., Li, G., Zhou, L.: Meta-Search Based Web Resource Discovery for Object-Level Vertical Search. In: WISE 2006 (2006)
Guo, Q., Zhou, L.: Schema driven topic specific web crawling. In: DASFAA 2005 (2005)
Lin, L., Zhou, L.: Leveraging Webpage Classification for Data Object Recognition. In: IEEE/WIC/ACM Web Intelligence 2007 (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lin, L. et al. (2008). SESQ: A Model-Driven Method for Building Object Level Vertical Search Engines. In: Li, Q., Spaccapietra, S., Yu, E., Olivé, A. (eds) Conceptual Modeling - ER 2008. ER 2008. Lecture Notes in Computer Science, vol 5231. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-87877-3_39
Download citation
DOI: https://doi.org/10.1007/978-3-540-87877-3_39
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-87876-6
Online ISBN: 978-3-540-87877-3
eBook Packages: Computer ScienceComputer Science (R0)