Abstract
Web search engines such as Google or Yahoo determine relevance of Web pages according to the occurrence of words in the pages indexed by the engine (additional information is then used to rank these results). Unfortunately, such searches are not always sufficient to solve information needs since task-driven searchers often must distinguish between documents that share a set of keywords (i.e., a topic) but assume a different form to serve a different purpose or function. For example, before purchasing a digital camera, an individual may want to read reviews from online magazines and see the blogs in which people who have used this camera express their opinions and personal stories.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bagdanov, A., and M. Worring. 2001. Fine-grained document genre classification using first order random graphs. In Document analysis and recognition. Seattle, WA: IEEE Computer Society.
Bartlett, F. 1932/1967. Remembering: A study in experimental and social psychology. Cambridge, England: Cambridge University Press.
Crowston, K., and M. Williams. 2000. Reproduced and emergent genres of communication on the world wide web. Information Society 16(3):201â215.
Dewe, J., J. Karlgren, and I. Bretan. 1998. Assembling a balanced corpus from the internet. In 11th Nordic Conference of Computational Linguistics, 28â29 Jan 1998. Copenhagen, Denmark.
Dillon, A., and B. Gushrowski. 2000. Genres and the web: Is the personal home page the first uniquely digital genre? Journal of the American Society for Information Science 51(2):202â205.
Freund, L., C.L.A. Clarke, and E.G. Toms. 2006. Towards genre classification for IR in the workplace. In Proceedings of the 1st International Conference on Information Interaction in Context, 30â36. Copenhagen.
Görlach, M. 2004. Text types and the history of English. Trends in Linguistics: Studies and Monographs 139. New York, NY: Mouton de Gruyter.
Haas, S.W., and E.S. Grams. 2000. Readers, authors and page structure: A discussion of four questions arising from a content analysis of web pages. Journal of the American Society for Information Science 51(2):181â192.
Karjalainen, A., T. PĂ€ivĂ€rinta, P. TyrvĂ€inen, and J. Rajala. 2000. Genre-based metadata for enterprise document management. In Proceedings of the 33rd Hawaiâi International Conference on System Sciences. Wailea, Maui, Hawaii.
Karlgren, J. 2010. Conventions and mutual expectations: Understanding sources for web genres. In Genres on the Web: Computational models and empirical studies, eds. A. Mehler, S. Sharoff, and M. Santini, Text, Speech and Language Technology. Dordrecht: Springer.
Karlgren, J., and D. Cutting. 1994. Recognizing text genres with simple metrics using discriminant analysis. In Proceedings of the 15th International Conference on Computational Linguistics. Kyoto.
Kennedy, A., and M. Shepherd. 2005. Automatic identification of home pages on the web. In Proceedings of the 38th Hawaii International Conference on System Sciences. Waikoloa, Hawaii, HI.
Kessler, B., G. Nunberg, and H. Schuetze. 1997. Automatic detection of text genre. In Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and the 8th Meeting of the European Chapter of the Association for Computational Linguistics, 32â38. Madrid: Morgan Kaufmann Publishers.
KwaĆnik, B.H., and K. Crowston. 2004. A framework for creating a facetted classification for genres: Addressing issues of multidimensionality. In Proceedings of the 37th Hawaiâi International Conference on System Science (HICSS). Big Island, Hawaiâi.
KwaĆnik, B.H., Y.-L. Chun, K. Crowston, J. DâIgnazio, and J. Rubleske. 2006. Challenges in creating a taxonomy for genres of digital documents. In 2006 ISKO Conference. Vienna.
Lee, D.Y.W. 2001. Genres, registers, text types, domains, and styles: Clarifying the concepts and navigating a path through the BNC jungle. Language, Learning & Technology 5(3):37â72.
Lim, C.S., K.J. Lee, and G.C. Kim. 2005. Multiple sets of features for automatic genre classification of web documents. Information Processing & Management 41(5):1263â1276.
Meyer zu Eissen, S., and B. Stein. 2004. Genre classification of web pages: User study and feasibility analysis. In Proceedings of the 27th Annual German Conference on Artificial Intelligence (KI 04), eds. S. Biundo, T. FrĂŒhwirth, and G. Palm, 256â269. Ulm: Springer.
Nilan. M.S., J. Pomerantz, and S. Paling. 2001. Genres from the bottom up: What has the Web brought us? In Proceedings of the American Society for Information Science and Technology Conference, 330â339. Washington, DC.
Petersen, T. 1994. Art and architecture thesaurus. New York, NY: Oxford University Press.
Rosso, M.A. 2008. User-based identification of web genres. Journal of the American Society for Information Science & Technology 59(7):1053â1072.
Rosso, M.A., and S.W. Haas. 2010. Identification of web genres by user warrant. In Genres on the Web: Computational models and empirical studies, eds. A. Mehler, S. Sharoff, and M. Santini, Text, Speech and Language Technology. Dordrecht: Springer.
Roussinov, D.G., and H. Chen. 2001. Information navigation on the web by clustering and summarizing query results. Information Processing and Management 37(6):789â816.
Santini, M. 2008. Zero, single, or multi? Genre of web pages through the usersâ perspective. Information Processing and Management 44(2):702â737.
Sharoff, S. 2010. In the garden and in the jungle: Comparing genres in the BNC and Internet. In Genres on the web: Computational models and empirical studies, eds. A. Mehler, S. Sharoff, and M. Santini, Text, Speech and Language Technology. Dordrecht: Springer.
Stubbe, A., C. Ringlstetter, and K.U. Schulz. 2007. Genre as noise â noise in genre. In Proceedings of the IJCAI-2007 Workshop on Analytics for Noisy Unstructured Text Data. Hyderabad.
Swales, J.M. 1990. Genre analysis: English in academic and research settings. New York, NY: Cambridge University Press.
Toms, E.G., D.G. Campbell, and R. Blades. 1999. Does genre define the shape of information? The role of form and function in user interaction with digital documents. In American Society for Information Science; ASIS â99. Washington, DC: Information Today.
Watters, C., and M. Shepherd. 1999. Cybergenre and web functionality. In Proceedings of the 32nd Annual Hawaii International Conference on Systems Sciences. Maui, HI: IEEE Press.
Yates, S.J., and T. Sumner. 1997. Digital genres and the new burden of fixity. In Hawaiian International Conference on System Sciences (HICCS 30). Wailea, HA: IEEE Computer Press.
Acknowledgments
This research was partially supported by NSF IIS Grant 04-14482. We thank John DâIgnazio and You-Lee Chun for their contributions to this research project.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer Science+Business Media B.V.
About this chapter
Cite this chapter
Crowston, K., KwaĆnik, B., Rubleske, J. (2010). Problems in the Use-Centered Development of a Taxonomy of Web Genres. In: Mehler, A., Sharoff, S., Santini, M. (eds) Genres on the Web. Text, Speech and Language Technology, vol 42. Springer, Dordrecht. https://doi.org/10.1007/978-90-481-9178-9_4
Download citation
DOI: https://doi.org/10.1007/978-90-481-9178-9_4
Published:
Publisher Name: Springer, Dordrecht
Print ISBN: 978-90-481-9177-2
Online ISBN: 978-90-481-9178-9
eBook Packages: Computer ScienceComputer Science (R0)