ABSTRACT
Algorithms are an essential part of computational science. An algorithm search engine, which extracts pseudo-codes and their metadata from documents, and makes it searchable, has recently been developed as part of the CiteseerX suite. However, this algorithm search engine only retrieves and ranks relevant algorithms solely on textual similarity. Here, we propose a method for using the algorithm co-citation network to infer the similarity between algorithms. We apply a graph clustering algorithm on the network for algorithm recommendation and make suggestions on how to improve the current CiteseerX algorithm search engine.
- S. Bhatia, S. Lahiri, and P. Mitra. Generating synopses for document-element search. Proceeding of the 18th ACM conference on Information and knowledge management - CIKM '09, page 2003, 2009. Google ScholarDigital Library
- S. Bhatia and P. Mitra. Summarizing Figures, Tables and Algorithms in Scientific Publications to Augment Search Results. ACM Transactions on Information Systems (TOIS), pages 1--24, 2010. Google ScholarDigital Library
- S. Bhatia, P. Mitra, and C. L. Giles. Finding algorithms in scientific articles. Proceedings of the 19th international conference on World wide web - WWW '10, page 1061, 2010. Google ScholarDigital Library
- S. Bhatia, S. Tuarob, P. Mitra, and C. L. Giles. An Algorithm Search Engine for Software Developers. SUITE '11: Proceedings of 2011 ICSE Workshop on Search-driven Development: Users, Infrastructure, Tools and Evaluation, 2011, 2011. Google ScholarDigital Library
- S. Dongen. Graph clustering by flow simulation {Ph.D. dissertation}. Centers for Mathematics and Computer Science University of Utrecht, 2000.Google Scholar
- M. Hall, H. National, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, and I. H. Witten. The WEKA data mining software: an update. SIGKDD Explorations, 11(1):10--18, 2009. Google ScholarDigital Library
- H. SMALL. Co-citation in the Scientific Literature : A New Measure of the Relationship Between Two Documents. Journal of the American Society for Information Science, pages 265--269, 1973.Google Scholar
Index Terms
- Improving algorithm search using the algorithm co-citation network
Recommendations
Mining query subtopics from search log data
SIGIR '12: Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrievalMost queries in web search are ambiguous and multifaceted. Identifying the major senses and facets of queries from search log data, referred to as query subtopic mining in this paper, is a very important issue in web search. Through search log analysis, ...
Web Page Recommender System using hybrid of Genetic Algorithm and Trust for Personalized Web Search
The main challenge to effective information retrieval is to optimize the page ranking in order to retrieve relevant documents for user queries. In this article, a method is proposed which uses hybrid of genetic algorithms GA and trust for generating the ...
Improving XML search by generating and utilizing informative result snippets
Snippets are used by almost every text search engine to complement the ranking scheme in order to effectively handle user searches, which are inherently ambiguous and whose relevance semantics are difficult to assess. Despite the fact that XML is a ...
Comments