Abstract
Full-text search is one of the most important and popular query types in document retrieval systems. With the development of fourth-generation (4G) wireless networks, wireless data broadcast has attracted considerable interest because of its scalability, flexibility, and energy efficiency for wireless mobile computing. How to apply full-text search to documents transmitted through wireless communications is thus a research topic of interest. In this paper, we propose a novel data streaming scheme (named Basic-Hash) that uses hash-based indexing and inverted list techniques to facilitate energy- and latency-efficient full-text search in wireless data broadcast. This is the first work to apply hashing to this problem, and the scheme requires much less access latency and tuning time than schemes in the previous literature. We further extend the proposed scheme by merging the hashed word indices in order to reduce the total access latency (named Merged-Hash). An information retrieval protocol is developed to work with these two schemes. The performance of Basic-Hash and Merged-Hash is examined both theoretically and empirically. Simulation results demonstrate their efficiency with respect to both energy consumption and access latency.
This work is supported by NSF grants CCF-0829993 and CCF-0514796.
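As a rough illustration of the two building blocks named in the abstract, the sketch below combines a hash function over words with per-word inverted lists. It is a generic in-memory example, not the paper's Basic-Hash or Merged-Hash broadcast layout: the function names, the use of CRC32 as the hash, the bucket count, and the sample documents are all assumptions made for illustration, and nothing here models broadcast cycles, tuning time, or access latency.

```python
import zlib
from collections import defaultdict


def bucket_of(word, num_buckets):
    """Hypothetical hash: CRC32 of the lowercased word, modulo the bucket count."""
    return zlib.crc32(word.lower().encode("utf-8")) % num_buckets


def build_hashed_index(docs, num_buckets):
    """Build one dict per hash bucket, mapping word -> sorted list of document ids."""
    buckets = [defaultdict(set) for _ in range(num_buckets)]
    for doc_id, text in docs.items():
        for word in text.lower().split():
            buckets[bucket_of(word, num_buckets)][word].add(doc_id)
    # Freeze each inverted list into a sorted list of ids.
    return [{w: sorted(ids) for w, ids in b.items()} for b in buckets]


def lookup(index, word):
    """Return the inverted list for one word by probing only its hash bucket."""
    return index[bucket_of(word, len(index))].get(word.lower(), [])


def search_all(index, words):
    """Conjunctive full-text query: ids of documents containing every query word."""
    lists = [set(lookup(index, w)) for w in words]
    return sorted(set.intersection(*lists)) if lists else []


# Assumed sample documents, for illustration only.
docs = {
    1: "wireless data broadcast",
    2: "full text search",
    3: "full text search in wireless broadcast",
}
index = build_hashed_index(docs, num_buckets=4)
result_a = search_all(index, ["wireless", "broadcast"])
result_b = search_all(index, ["full", "wireless"])
result_c = search_all(index, ["missing"])
```

In this sketch a client only inspects the single bucket that a query word hashes to, which loosely mirrors the idea of using a hash to avoid scanning the whole index.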
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yang, K., Shi, Y., Wu, W., Gao, X., Zhong, J. (2011). A Novel Hash-Based Streaming Scheme for Energy Efficient Full-Text Search in Wireless Data Broadcast. In: Yu, J.X., Kim, M.H., Unland, R. (eds) Database Systems for Advanced Applications. DASFAA 2011. Lecture Notes in Computer Science, vol 6587. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20149-3_28
DOI: https://doi.org/10.1007/978-3-642-20149-3_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-20148-6
Online ISBN: 978-3-642-20149-3
eBook Packages: Computer Science, Computer Science (R0)