skip to main content
10.1145/1988688.1988726acmotherconferencesArticle/Chapter ViewAbstractPublication PageswimsConference Proceedingsconference-collections
research-article

Towards automatic thematic sheets based on discursive categories in biomedical literature

Published: 25 May 2011 Publication History

Abstract

Biological papers contain a huge amount of results and ideas that are difficult to manage. Researchers are not only interested in finding relevant information but they also need to know how the authors of papers get the results. Thus, it is interesting to identify the methods used and to distinguish for example between speculations, observations and deductions. Biologists need also to distinguish between new and prior information, especially to identify the real new output of a study. In order to respond to these needs, we propose a linguistic model based on the discursive categories. This model aims to develop the BioExcom tool for the automatic production of thematic sheets using the Contextual Exploration processing. BioExcom is already able to detect speculative sentences and to categorize them into new and prior speculation. The other categories of the model will be developed using the proposed linguistic markers.

References

[1]
Ananiadou, S., Kell, D.B., Tsujii, J. 2006.Text mining and its potential applications in systems biology. J. Tre. Bio. 24 (Oct 2006), 571--579.
[2]
Jensen, L.J., Saric, J., Bork, P. 2006. Literature mining for the biologist: from information retrieval to biological discovery. Nat Rev Genet 7, 119--129 (Feb 2006).
[3]
Rzhetsky, A., Seringhaus, M., Gersein, M. 2008. Seeking a new biology through text mining. J.Cel. 134 (Jul 2008), 9--13
[4]
Cohen, B., Hunter, L. 2008 Getting started in text mining, Plos Computational biology volume 4, issue 1 (Jan 2008)
[5]
Hu, Z.Z., Narayanaswamy, M., Ravikumar, K.E., Vijay-Shanker, K., Wu, CH. 2005. Literature mining and database annotation of protein phosphorylation using a rule-based system. J.Bio. 21 (Apr. 2005), 2759--2765
[6]
Jensen, L.J., Saric, J., Ouzonova R., Rojas, I., Bork, P. 2006. Extraction of regulatory gene/proteína Networks from Medline. Bioinformatics, Vol. 22 no. 6 2006, pages 645--650 doi:10.1093/bioinformatics/bti597
[7]
Wilbur W.J., Rzhetsky A., Shatkay H. 2006. New directions in biomedical text annotation: definitions, guidelines and corpus construction. J.BMC.Bio.7 (Jul 2006), 356
[8]
Teufel, S., Moens, M. 2002. Summarizing Scientific Articles: Experiments with Relevance and Rhetorical Status. J.Comp Lin, 28,4 (Dec 2002).
[9]
Mizuka, Y., Korhonen, A., Mullen, T., Collier, N. 2006. Zone analysis in biology articles as a basis for information extraction. J. Med. Inf. 75 (Jun 2006), 468--487.
[10]
Blais A., Atanassova I., Desclés J.P., Zhang M., Zighem, L. 2007. Discourse Automatic Annotation of Texts: an Application to Summarization. In Proceedings of the Twentieth International FLAIRS Conference, Special Track ≪Automatic Annotation and Information Retrieval: New Perspectives≫ (Key West Florida, USA, May 7--9, 2007). FLAIRS, 07
[11]
Harmsze, F.A.P. 2002 Modular structure for scientific articles in an electronic environment. Doctoral Thesis, Univérsity of Amsterdam.
[12]
Waard, A. de., Buitelaar, P., Eigner T. 2009 Identifying the Epistemic Value of Discourse Segments in Biology Texts, In Proceedings of the 8th International Conference on Computational Semantics (Tilburg, Netherland; January 7--9, 2009), 351--354. DOI= http://portal.acm.org/citation.cfm?id=1693756.1693802&coll=DL&dl=GUIDE&CFID=115124777&CFTOKEN=99232089
[13]
Rubin, V. L., Liddy, E. D., Kando, N. 2005.Certainty Identification in Texts: Categorization Model and Manual Tagging Results. In J. G. Shanahan, Y. Qu & J. Wiebe (Eds.), Computing Attitude and Affect in Text: Theory and Applications (the Information Retrieval Series). New York: Springer-Verlag, 61--76.
[14]
Thompson, P., Venturi, G., McNaught, J., Montemagni, S., Ananiadou, S. 2008. Categorising modality in biomedical texts. In Proceedings of the Workshop on Building and Evaluating 1 Resources for Biomedical Text Mining. (Marrakech, Morocco, May 26--30, 2008) LREC'08.
[15]
Light M., Qiu X.Y., Srinivasan P. 004.The Language of Bioscience: Facts, Speculations, and Statements in Between. In Proceedings of HLT-NAACL Workshop On Linking Biological Literature Ontologies And Databases, 17--24 (Boston, Massachusetts May 6, 2004). HLT-NAACL" 04.
[16]
Bray, D. 2001. Reasoning for results. J.Nat. 412 (Aug 2001), 863
[17]
Blagosklonny, MV., Pardee, AB. 2002. Conceptual biology: unearthing the gems. J.Nat. 416 (Mar 2002), 373.
[18]
Desclés, J.P. 2006. Contextual Exploration Processing for Discourse Automatic Annotations of Texts. In Proceedings of the Nineteenth Florida Artificial Intelligence Research Society International conference, (Melbourne Florida, USA, May 11_13, 2006) 281--284. FLAIRS" 06.
[19]
Djioua, B., Flores, J.G., Blais, A., Desclés, J.P., Guibert, G., Jackiewicz, A., Le Priol, F., Nait-Baha, L., Sauzay, B. 2006. EXCOM: an automatic annotation engine for semantic information, In Proceedings of the Nineteenth Florida Artificial Intelligence Research Society International conference, (Melbourne Florida, USA, May 11_13 2006) 285--290. FLAIRS" 06.
[20]
Alrahabi, M., Desclés J.P. 2009. EXCOM: Plate-forme d'annotation sémantique de textes multilingues, In Proceedings of the Natural Language Processing conference (Senlis, Canada, June 24--26 2009). TALN" 09.
[21]
Le Priol, F. 2008. Automatic annotation of images, pictures or videos comments for text mining guided by no textual data, In Proceedings of the twenty first Florida Artificial Intelligence Research Society International conference (Miami Florida, USA, May 15--17) FLAIRS" 08.
[22]
Desclés, J., Alrahabi, M., Desclés J.P., Blais A. 2009 Automatic Annotation by BioExcom for categorizing prior and new speculations in biological papers. Presented at 3rd International Symposium on Languages in Biology and Medicine, (Jeju Island, South Korea, November 8--10 2009). LBM" 09
[23]
Desclés, J., Makkaoui, O., Hacène, T. 2010. Automatic annotation of speculation in biomedical texts: new perspectives and large scale evaluation. In Proceedings of the Negation and speculation in Natural Language Processing workshop, (Uppsala, Sweden, July 10, 2010). NeSpNLP" 10 DOI=http://portal.acm.org/citation.cfm?id=1858959.1858965&coll=DL&dl=GUIDE&CFID=115124777&CFTOKEN=99232089.
[24]
Szarvas, G., Vincze, V., Farkas, R., Csirik, J. 2008. The BioScope corpus: annotation for negation, uncertainty and their scope in biomedical texts. In Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing. BioNLP ACL" 08, (Colombus, Ohio, Jun 19--20, 2008) DOI=http://portal.acm.org/citation.cfm?id=1572306.1572314&coll=DL&dl=GUIDE&CFID=115349241&CFTOKEN=402148

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
WIMS '11: Proceedings of the International Conference on Web Intelligence, Mining and Semantics
May 2011
563 pages
ISBN:9781450301480
DOI:10.1145/1988688
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 25 May 2011

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. automatic annotation
  2. biology
  3. categorization
  4. contextual exploration
  5. discourse analysis
  6. information extraction
  7. summarization

Qualifiers

  • Research-article

Conference

WIMS '11

Acceptance Rates

Overall Acceptance Rate 140 of 278 submissions, 50%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 68
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 15 Jan 2025

Other Metrics

Citations

Cited By

View all

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media