skip to main content
10.1145/2459118.2459124acmotherconferencesArticle/Chapter ViewAbstractPublication PagescomputeConference Proceedingsconference-collections
research-article

Mining research abstracts for exploration of research communities

Published: 23 January 2012 Publication History

Abstract

Research abstracts are the 'information scents' which attract a novice researcher. To read through the entire research paper and to decide the suitability of the paper to one's research problem is a tough and abstract task. Many times, researchers do not know whether they are citing the relevant (but original?) research articles. It has been only a trial and error approach so far. To enable researchers to correctly target at the relevant and yet quality research literature, mechanisms to organise collections of research papers are essential. Though a considerable effort has been attempted earlier in this context, establishing research communities concentrated on citation based recommendations only. However, the quality and originality of research articles have not been taken into account until now. In this paper, we propose the evolution of research communities by analysing the research abstracts. We utilise Fuzzy Concept Map based approach in detecting the originality of scientific abstracts. By K-means clustering, we establish a research article hyper graph from the qualified abstracts. Later, we evolve the author clusters for every topic cluster and analyse them for redundancy. Further study on relevant bibliometrics helps us to identify a 'nucleus author' for every topic cluster.

References

[1]
F. Smeaton, G. Keogh, C. Gurrin, K. Mcdonald, and T. Sodring. 2002. Analysis of papers from twenty-five years of sigir conferences: What have we been doing for the last quarter of a century. SIGIR Forum, 36(2), 39--43.
[2]
Pothen, H. Simon, and K. P. Liou. 1990. Partitioning sparse matrices with eigenvectorsof graphs. SIAM J. Matrix Anal. Appl., 11, 430--452.
[3]
ACM Classification System, http://www.acm.org/about/class/1998
[4]
Andrew W. Moore. 2001. K-means and Hierarchical clustering.
[5]
http://www.autonlab.org/tutorials/kmeans09.pdf.
[6]
Aris, A., Shneiderman, B., Qazvinian, V., & Radev, D. 2009. Visual overviews for discovering key papers and inuences across research fronts. JASIST: Journal of the American Society for Information Science and Technology, 60(11). 2219--2228.
[7]
Association for Computing Machinery. 2011. ACM Digital Library. Retrieved from http://portal.acm.org
[8]
W. Kernighan and S. Lin. 1970. An efficient heuristic procedure for partitioning graphs. Bell System Technical Journal, 49, 291--307.
[9]
Bi Chen, Baojun Qiu, Yan Qu, Xiaolong Zhang, John Yen. 2009. Intelligent Sensemaking of Scientific Literature through Citation Scent. Sensemaking Workshop associated with CHI.
[10]
Bollacker, K. D., Lawrence, S., & Giles, C. L. 1998. CiteSeer: an autonomous Web agent for automatic retrieval and identification of interesting publications. In AGENTS '98:proc. second international conference on autonomous agents New York. 116--123.
[11]
Carayol, N. and Matt, M. 2006. Individual and collective determinants of academic scientists' productivity, Information Economics and Policy 18 (1).
[12]
Castelnuovo G. 2008. Ditching impact factors: Time for the single researcher impact factor. Bmj. 336--789.
[13]
Center for History and New Media, G. 2011. Zotero {Software}. Retrieved from http://www.zotero.org
[14]
Chen, C. 2004. Searching for intellectual turning points: Progressive knowledge domain visualization. PNAS: Proc. National Academy of Sciences of the United States of America, 101(90001), 5303--5310.
[15]
Chen, C. 2006. CiteSpace II: Detecting and visualizing emerging trends and transient patterns in scientific literature. JASIST: Journal of the American Society for Information Science and Technology, 57(3). 359--377
[16]
Chen, C., Ibekwe-SanJuan, F., & Hou, J. 2010. The structure and dynamics of cocitation clusters: A multiple-perspective cocitation analysis. JASIST: Journal of the American Society for Information Science and Technology, 61. 1386--1409.
[17]
Cornell University Library. 2011. ArXiv. Retrieved from http://arxiv.org
[18]
DBconnect: mining research community on DBLP data. 2007. in Proceedings of the 9th WebKDD and 1st SNA-KDD 2007 workshop on Web mining and social network analysis, COPYRIGHT ACM.
[19]
DBLife, http://dblife.cs.wisc.edu/
[20]
Deepika J. and Mahalakhsmi G. S. 2011, Journal Impact Factor -- A measure of quality or popularity? -- in proc. of int. conf. IICAI spl session on Advances in Web Intelligence and Data Mining -- December 2011.
[21]
Dunne, C., Shneiderman, B., Gove, R., Klavans, J. & Dorr, B. 2011, Rapid understanding of scientific paper collections: integrating statistics, textual analysis, and visualization. University of Maryland.
[22]
Elsevier. (2011). SciVerse Scopus. Retrieved from http://scopus.com/
[23]
G. S. Mahalakshmi, S. Sendhilkumar, AlaguIrulappan, PreethamMirinda. 2009. Ontology Based Relevance Analysis for Automatic Reference Tracking. International Journal of Computer Applications in Technology (IJCAT): Special Issue on Computer Applications in Knowledge-Based Systems, ISSN 0952-8091, Vol. 35, Nos. 2/3/4. 165--173.
[24]
Giles, C. L., Bollacker, K. D., & Lawrence, S. 1998. CiteSeer: an automatic citation indexing system. In DL '98: proc. 3rd ACM conference on digital libraries New York, NY, USA: ACM. 89--98.
[25]
Google. Google Scholar. 2011. Retrieved from http://scholar.google.com
[26]
Guan, T. & Wong, K. F. 1999. KPS a Web information mining algorithm. WWW8.
[27]
Han, H., Zha, H., & Giles, C. 2005. Name disambiguation in author citations using a k-way spectral clustering method. Proceedings of the 5th ACM/IEEECS Joint Conference on Digital Libraries. 334--343.
[28]
Hastie, Trevor. Tibshirani, Robert. Friedman, Jerome. 2009. 14.3.12 Hierarchical clustering (PDF). The Elements of Statistical Learning (2nd ed.). New York: Springer. 520--528.
[29]
Hirsch, J. E. 2009. An index to quantify an individual's scientific research output, Proc Natl Acad Sci USA, Vol. 102, No. 46, Springerlink, Scientometrics. 16569--16573.
[30]
Institute of Electrical and Electronics Engineers. 2011. IEEE Xplore. Retrieved from http://ieeexplore.ieee.org
[31]
JabRef Development Team. 2011. JabRef {Software}. Retrieved from http://jabref.sourceforge.net
[32]
Jiawei Han. 2009. Mining Heterogeneous Information Networks by Exploring the Power of Links, J. Gama et al. (Eds.): DS 2009, Springer-Verlag Berlin Heidelberg, LNAI 5808. 13--30.
[33]
Jose Salmeron. 2010. Modeling grey uncertainty with Fuzzy Grey Cognitive Maps, Expert Systems with Applications 37 (2010) 7581--7588.
[34]
K. Satyanarayana. 2010. Impact Factor and other indices to assess science, scientists and scientific journals. Indian Journal of Physiology and Pharmacology, Vol.54, No.3. 197--212.
[35]
Kanungo. T, Mount, D. M., Netanyahu, N. S, Piatko, C. D.; Silverman, R.; Wu, A. Y. 2002. An efficient k-means clustering algorithm: Analysis and implementation. IEEE Trans. Pattern Analysis and Machine Intelligence 24. 881--892.
[36]
Kosko B. 1986. Fuzzy Cognitive Maps. Int. Journal of Man-Machine Studies, Vol. 24. 65--75.
[37]
Lars Backstrom, Jure Leskovec. 2011. Supervised random walks: predicting and recommending links in social networks. WSDM 2011. 635--644.
[38]
Libra, http://libra.msra.cn/
[39]
Makoto Takeya, Hitoshi Sasaki, Keizo Nagaoka and Nobuyoshi Vonezawa 2004. A Performance scoring method based on quantitative comparison of Concept maps by a teacher and students. Concept maps: theory, methodology, technology, Proc. Of the first int. Conference on concept mapping, A. J. Cañas, J. D. Novak, F. M. González, eds, Pamplona, Spain.
[40]
M. E. J. Newman. 2003. The structure and function of complex networks. SIAM Review, 45(2). 167--256.
[41]
Manjula Dissanayake and Simaan M. AbouRizk. 2007. Qualitative simulation of construction performance using fuzzy cognitive maps, IEEE, Proceedings of the 2007 Winter Simulation Conference S. G. Henderson, B. Biller, M.-H. Hsieh, J. Shortle, J. D. Tew, and R. R. Barton, eds. 2134--2140.
[42]
Mario A. Nascimento, Jörg Sander, and Jeffrey Pound. 2003. Analysis of sigmod's co-authorship graph. SIGMOD Record, 32(2). 57--58.
[43]
Martin Rosvall and Carl T. Bergstrom. 2008. Maps of random walks on complex networks reveal community structure. PNAS 2008 105 (4). 1118--1123
[44]
Meho, L. I., & Spurgin, K. M. 2005. Ranking the research productivity of LIS faculty and schools: An evaluation of data sources and research methods. 1314--1331.
[45]
Mendeley Ltd. 2011. Mendeley {Software}. Retrieved from http://www.mendeley.com
[46]
Michelle Girvan and M. E. J. Newman. 2002. Community structure in social and biological networks. In Proceedings of the National Academy of Science USA, 99. 8271--8276.
[47]
Microsoft Research. 2011. Microsoft Academic Search. Retrieved from http://academic.research.microsoft.com
[48]
National Center for Biotechnology Information. 2011. Retrieved from http://ncbi.nlm.nih.gov/
[49]
Newman, M. E. J. 2001. The structure of scientific collaboration networks. PNAS: Proc. National Academy of Sciences of the United States of America, 98(2). 404--409.
[50]
Nevin Heintze. 1996. Scalable Document Fingerprinting. Proceedings of the Second USENIX Workshop on Electronic Commerce, Oakland, California, November 18--21.
[51]
Noshir S. Contractor, Peter R. Monge. 2002. Theories of Communication Networks. Oxford University Press.
[52]
NWB Team. 2006. Network Workbench {Software}. Retrieved from http://nwb.slis.indiana.edu
[53]
Osmar R. Zaïane, Jiyang Chen, and Randy Goebel. 2007. Mining Research Communities in Bibliographical Data. In Proceedings of the 9th WebKDD and 1st SNA-KDD 2007 workshop on Web mining and social network analysis (WebKDD/SNA-KDD '07). ACM, New York, NY, USA. 74--81.
[54]
Pena A. Sossa H. and Gutiérrez, F. 2005. Knowledge and Reasoning Supported by Cognitive Maps, Proceedings of Mexican International Conference on Artificial Intelligence 2005 (MICAI'05), Lecture Notes in Artificial Intelligence, Vol.3789. Springer, Monterrey, Mexico.
[55]
PROTÉGÉ, 2011, http://protege.stanford.edu
[56]
R. Mihalcea, C. Corley, and C. Strapparava. 2006. Corpus-based and knowledge-based measures of text semantic similarity. In Proceedings of AAAI-06.
[57]
Rosy Madaan, Ashutosh Dixit, A. K. Sharma, Komal Kumar Bhatia. 2010. A Framework for Incremental Hidden Web Crawler, (IJCSE) International Journal on Computer Science and Engineering Vol. 02, No. 03, 2010. 753--758.
[58]
S. Wasserman and K. Faust. 1994. Social network analysis: Methods and applications. Cambridge University Press.
[59]
SalhaAlzahrani and NaomieSalim. 2010. Fuzzy semantic-Based String Similarity for Extrinsic Plagiarism Detection. Lab Report for PAN at CLEF 2010.
[60]
Salton, G. and McGill, M. J. 1983. Introduction to Modern Information Retrieval. McGraw-Hill, New York, NY.
[61]
Seglen PO. 1997. Why the impact factor of journals should not be used for evaluating research. BMJ, Vol.314, No.7079. 498--502.
[62]
Thomson Reuters Impact Factor. 1994.
[63]
Thomson Reuters. 2011a. EndNote {Software}. Retrieved from http://www.endnote.com
[64]
Thomson Reuters. 2011b. ISI Web of Knowledge. Retrieved from http://isiwebofknowledge.com
[65]
Zhen, L., Jiang, Z. 2010. Hy-SN: Hyper-graph based semantic network. Knowledge-Based Systems 23(8). 809--816.

Cited By

View all
  • (2017)A Bibliometric Analysis of Journal of Informetrics – A Decade Study2017 Second International Conference on Recent Trends and Challenges in Computational Models (ICRTCCM)10.1109/ICRTCCM.2017.22(222-227)Online publication date: Feb-2017
  • (2016)A visual analytics approach to author name disambiguationProceedings of the 3rd IEEE/ACM International Conference on Big Data Computing, Applications and Technologies10.1145/3006299.3006302(52-60)Online publication date: 6-Dec-2016
  • (2014)Context Based Retrieval of Scientific Publications via Reader LensComputational Intelligence in Data Mining - Volume 310.1007/978-81-322-2202-6_53(583-596)Online publication date: 12-Dec-2014
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
COMPUTE '12: Proceedings of the 5th ACM COMPUTE Conference: Intelligent & scalable system technologies
January 2012
146 pages
ISBN:9781450314404
DOI:10.1145/2459118
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

  • ACM Pune Professional Chapter: ACM Pune Professional Chapter

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 23 January 2012

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. fuzzy cognitive maps
  2. originality of research publications

Qualifiers

  • Research-article

Conference

Compute '12
Sponsor:
  • ACM Pune Professional Chapter

Acceptance Rates

COMPUTE '12 Paper Acceptance Rate 18 of 116 submissions, 16%;
Overall Acceptance Rate 114 of 622 submissions, 18%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)0
Reflects downloads up to 03 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2017)A Bibliometric Analysis of Journal of Informetrics – A Decade Study2017 Second International Conference on Recent Trends and Challenges in Computational Models (ICRTCCM)10.1109/ICRTCCM.2017.22(222-227)Online publication date: Feb-2017
  • (2016)A visual analytics approach to author name disambiguationProceedings of the 3rd IEEE/ACM International Conference on Big Data Computing, Applications and Technologies10.1145/3006299.3006302(52-60)Online publication date: 6-Dec-2016
  • (2014)Context Based Retrieval of Scientific Publications via Reader LensComputational Intelligence in Data Mining - Volume 310.1007/978-81-322-2202-6_53(583-596)Online publication date: 12-Dec-2014
  • (2012)Enhancement of co-authorship networks with content-similarity informationProceedings of the International Conference on Advances in Computing, Communications and Informatics10.1145/2345396.2345593(1225-1228)Online publication date: 3-Aug-2012

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media