research-article

Improving algorithm search using the algorithm co-citation network

Authors:
Suppawong Tuarob, The Pennsylvania State University, University Park, PA, USA
Prasenjit Mitra, The Pennsylvania State University, University Park, PA, USA
C. Lee Giles, The Pennsylvania State University, University Park, PA, USA

JCDL '12: Proceedings of the 12th ACM/IEEE-CS joint conference on Digital Libraries, June 2012, Pages 277–280, https://doi.org/10.1145/2232817.2232869

Published: 10 June 2012

ABSTRACT

Algorithms are an essential part of computational science. An algorithm search engine, which extracts pseudo-codes and their metadata from documents and makes them searchable, has recently been developed as part of the CiteseerX suite. However, this search engine retrieves and ranks relevant algorithms solely on textual similarity. Here, we propose a method that uses the algorithm co-citation network to infer the similarity between algorithms. We apply a graph clustering algorithm to the network for algorithm recommendation and suggest ways to improve the current CiteseerX algorithm search engine.
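
The idea of inferring similarity from co-citation can be illustrated with a minimal sketch. It assumes each citing document can be mapped to the set of algorithms it cites, and it counts how many documents cite each pair together. The abstract does not say which graph clustering algorithm is applied, so connected components over thresholded edges stand in for it here. The function names, the `min_cocitations` parameter, and the toy data are all hypothetical and not taken from the paper.

```python
from collections import defaultdict
from itertools import combinations


def cocitation_counts(citing_docs):
    """Count, for each pair of algorithms, how many documents cite both."""
    counts = defaultdict(int)
    for cited in citing_docs.values():
        # Sort so each unordered pair always maps to the same key.
        for a, b in combinations(sorted(set(cited)), 2):
            counts[(a, b)] += 1
    return dict(counts)


def cluster_by_threshold(counts, min_cocitations=2):
    """Group algorithms joined by edges with weight >= min_cocitations.

    Connected components are a stand-in (an assumption of this sketch)
    for whatever graph clustering algorithm is actually used.
    """
    neighbors = defaultdict(set)
    for (a, b), weight in counts.items():
        if weight >= min_cocitations:
            neighbors[a].add(b)
            neighbors[b].add(a)

    seen, clusters = set(), []
    for start in sorted(neighbors):
        if start in seen:
            continue
        # Depth-first traversal collects one connected component.
        stack, component = [start], set()
        while stack:
            node = stack.pop()
            if node in component:
                continue
            component.add(node)
            stack.extend(neighbors[node] - component)
        seen |= component
        clusters.append(sorted(component))
    return clusters


# Hypothetical toy data: citing document -> algorithms it cites.
citing_docs = {
    "doc1": ["quicksort", "mergesort", "heapsort"],
    "doc2": ["quicksort", "mergesort"],
    "doc3": ["dijkstra", "bellman-ford"],
    "doc4": ["dijkstra", "bellman-ford", "quicksort"],
    "doc5": ["mergesort", "heapsort"],
}
counts = cocitation_counts(citing_docs)
clusters = cluster_by_threshold(counts, min_cocitations=2)
```

In this sketch, algorithms that land in the same cluster as a search hit could be offered as recommendations even when their text does not match the query, which is the kind of signal textual ranking alone does not capture.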

Index Terms

1. Information systems
  1. Information retrieval
    1. Information retrieval query processing
    2. Retrieval tasks and goals
      1. Clustering and classification
  2. Information systems applications
    1. Data mining
      1. Clustering

Published in
JCDL '12: Proceedings of the 12th ACM/IEEE-CS joint conference on Digital Libraries
June 2012
458 pages
ISBN: 9781450311540
DOI: 10.1145/2232817
General Chairs: Karim B. Boughida (The George Washington University, USA), Barrie Howard (The Library of Congress, USA)
Program Chairs: Michael L. Nelson (Old Dominion University, USA), Herbert Van de Sompel (Los Alamos National Laboratory, USA), Ingeborg Sølvberg (Norwegian University of Science & Technology, Norway)
Copyright © 2012 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
algorithm co-citation network
algorithms
clustering
Acceptance Rates
Overall Acceptance Rate: 415 of 1,482 submissions, 28%