Reference Hub5
Ranked Deep Web Page Detection Using Reinforcement Learning and Query Optimization

Ranked Deep Web Page Detection Using Reinforcement Learning and Query Optimization

Kapil Madan, Rajesh K. Bhatia
Copyright: © 2021 |Volume: 17 |Issue: 4 |Pages: 23
ISSN: 1552-6283|EISSN: 1552-6291|EISBN13: 9781799859734|DOI: 10.4018/IJSWIS.2021100106
Cite Article Cite Article

MLA

Madan, Kapil, and Rajesh K. Bhatia. "Ranked Deep Web Page Detection Using Reinforcement Learning and Query Optimization." IJSWIS vol.17, no.4 2021: pp.99-121. http://doi.org/10.4018/IJSWIS.2021100106

APA

Madan, K. & Bhatia, R. K. (2021). Ranked Deep Web Page Detection Using Reinforcement Learning and Query Optimization. International Journal on Semantic Web and Information Systems (IJSWIS), 17(4), 99-121. http://doi.org/10.4018/IJSWIS.2021100106

Chicago

Madan, Kapil, and Rajesh K. Bhatia. "Ranked Deep Web Page Detection Using Reinforcement Learning and Query Optimization," International Journal on Semantic Web and Information Systems (IJSWIS) 17, no.4: 99-121. http://doi.org/10.4018/IJSWIS.2021100106

Export Reference

Mendeley
Favorite Full-Issue Download

Abstract

This paper proposes a novel algorithm based on reinforcement learning-entitled asynchronous advantage actor-critic (A3C). Overflow queries are optimized to crawl the ranked deep web. A3C assigns the reward and penalty to the various queries. Queries are derived from the domain-based taxonomy that helps to fill the search forms. Overflow queries are the collection of queries that match with more than k number of results and only top k matched results are retrieved. Low ranked documents beyond k results are not accessible and lead to low coverage. Overflow queries are optimized to convert into non-overflow queries based on the proposed technique and lead to more coverage. As of yet, no research work has been explored by using A3C with taxonomy in the domain of ranked deep web. The experimental results show that the proposed technique outperforms the three other techniques (i.e., document frequency, random query, and high frequency) in terms of average improvement metric by 26%, 69%, and 92%, respectively.

Request Access

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.