skip to main content
10.1145/3233547.3233578acmconferencesArticle/Chapter ViewAbstractPublication PagesbcbConference Proceedingsconference-collections
short-paper

What Happens When?: Interpreting Schedule of Activity Tables in Clinical Trial Documents

Published:15 August 2018Publication History

ABSTRACT

Clinical trial protocols are complex documents that must be translated manually for trial execution and management. We have developed a system to automatically transform a schedule of activity (SOA) table from a PDF document into a machine interpretable form. Our system combines semantic, structural, and NLP approaches with a "human in the loop" for verification to determine which cells contain activity or temporal information, and then to understand details of what these cells represent. Using a training and test set of 20 protocols, we assess the accuracy of identifying specific types of SOA elements. This work is the first stage of a larger effort to use artificial intelligence techniques to extract procedural logic in clinical trial documents and to create a knowledge base of protocols for insights and comparison across studies.

References

  1. Batelle . 2015. Biopharmaceutical Industry-Sponsored Clinical Trials: Impact on State Economies. Technical Report. Battelle Technology Partnership Practice.Google ScholarGoogle Scholar
  2. Ernst R Berndt and Iain M Cockburn . 2013. Price indexes for clinical trial research: a feasibility study. Technical Report. National Bureau of Economic Research.Google ScholarGoogle Scholar
  3. Sanmitra Bhattacharya and Michael N Cantor . 2013. Analysis of eligibility criteria representation in industry-standard clinical trial protocols. Journal of Biomedical Informatics Vol. 46, 5 (2013), 805--813. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Tara Borlawsky and PR Payne . 2007. Evaluating an NLP-based approach to modeling computable clinical trial eligibility criteria. In AMIA Annu Symp Proc, Vol. Vol. 878.Google ScholarGoogle Scholar
  5. Robert M Califf . 2009. Clinical research sites-the underappreciated component of the clinical research system. Jama Vol. 302, 18 (2009), 2025--2027.Google ScholarGoogle ScholarCross RefCross Ref
  6. Olivia Choudhury, Hillol Sarker, Nolan Rudolph, Morgan Foreman, Nicholas Fay, Murtaza Dhuliawala, Issa Sylla, Noor Fairoza, and Amar K Das . 2018. Enforcing Human Subject Regulations using Blockchain and Smart Contracts. Blockchain in Healthcare Today (2018).Google ScholarGoogle Scholar
  7. Clinical Data Interchange Standards Consortium . {n. d.}. CDISC Foundational Standards, https://www.cdisc.org/standards/foundational. deftempurl%https://www.cdisc.org/standards/foundational tempurlGoogle ScholarGoogle Scholar
  8. Berry De Bruijn, Simona Carini, Svetlana Kiritchenko, Joel Martin, and Ida Sim . 2008. Automated information extraction of key trial design elements from clinical trial publications. In AMIA Annual Symposium Proceedings, Vol. Vol. 2008. American Medical Informatics Association, 141.Google ScholarGoogle Scholar
  9. David Ferrucci, Anthony Levas, Sugato Bagchi, David Gondek, and Erik T. Mueller . 2013. Watson: Beyond Jeopardy! Vol. 199--200 (07 . 2013), 93--105.Google ScholarGoogle Scholar
  10. Xibin Gao, Munindar P Singh, and Pankaj Mehra . 2012. Mining business contracts for service exceptions. IEEE Transactions on Services Computing Vol. 5, 3 (2012), 333--344. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Kenneth A Getz, Stella Stergiopoulos, Michelle Marlborough, Jane Whitehill, Marla Curran, and Kenneth I Kaitin . 2015. Quantifying the magnitude and cost of collecting extraneous protocol data. American journal of therapeutics Vol. 22, 2 (2015), 117--124.Google ScholarGoogle Scholar
  12. Kenneth A Getz, Stella Stergiopoulos, Mary Short, Leon Surgeon, Randy Krauss, Sybrand Pretorius, Julian Desmond, and Derek Dunn . 2016. The impact of protocol amendments on clinical trial performance and cost. Therapeutic Innovation & Regulatory Science Vol. 50, 4 (2016), 436--441.Google ScholarGoogle ScholarCross RefCross Ref
  13. Ian Horrocks, Peter F Patel-Schneider, Harold Boley, Said Tabet, Benjamin Grosof, Mike Dean, et almbox. . 2004. SWRL: A semantic web rule language combining OWL and RuleML. W3C Member submission Vol. 21 (2004), 79.Google ScholarGoogle Scholar
  14. Svetlana Kiritchenko, Berry de Bruijn, Simona Carini, Joel Martin, and Ida Sim . 2010. ExaCT: automatic extraction of clinical trial characteristics from journal publications. BMC medical informatics and decision making Vol. 10, 1 (2010), 56.Google ScholarGoogle Scholar
  15. Edward Loper and Steven Bird . 2002. NLTK: The Natural Language Toolkit. In In Proceedings of the ACL Workshop on Effective Tools and Methodologies for Teaching Natural Language Processing and Computational Linguistics. Philadelphia: Association for Computational Linguistics. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Jake Luo, Stephen B Johnson, Albert Lai, and Chunhua Weng . 2011. Extracting Temporal Constraints from Clinical Research Eligibility Criteria Using Conditional Random Fields. Vol. 2011 (01 . 2011), 843--52.Google ScholarGoogle Scholar
  17. Sudip Mittal, Karuna P Joshi, Claudia Pearce, and Anupam Joshi . 2016. Automatic extraction of metrics from slas for cloud service management Cloud Engineering (IC2E), 2016 IEEE International Conference on. IEEE, 139--142.Google ScholarGoogle Scholar
  18. Sanjay Modgil and Peter Hammond . 2003. Decision support tools for clinical trial design. Artificial Intelligence in Medicine Vol. 27, 2 (2003), 181--200. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. NHLBI . 2018. BioLINCC Website. deftempurl%https://biolincc.nhlbi.nih.gov/ tempurlGoogle ScholarGoogle Scholar
  20. NIH US National Library of Medicine . 2018. Unified Medical Language System (UMLS), www.nlm.nih.gov/research/umls. deftempurl%https://www.nlm.nih.gov/research/umls/ tempurlGoogle ScholarGoogle Scholar
  21. Martin O'connor, Holger Knublauch, Samson Tu, Benjamin Grosof, Mike Dean, William Grosso, and Mark Musen . 2005. Supporting rule system interoperability on the semantic web with SWRL International Semantic Web Conference. Springer, 974--986. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Horacio Saggion, Adam Funk, Diana Maynard, and Kalina Bontcheva . 2007. Ontology-based information extraction for business intelligence. In The Semantic Web. Springer, 843--856. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Ravi D Shankar, Susana B Martins, Martin O'Connor, David B Parrish, and Amar K Das . 2007. An ontology-based architecture for integration of clinical trials management applications AMIA Annual Symposium Proceedings, Vol. Vol. 2007. American Medical Informatics Association, 661.Google ScholarGoogle Scholar
  24. Peter WJ Staar, Michele Dolfi, Christoph Auer, and Costas Bekas . 2018. Corpus Conversion Service: A machine learning platform to ingest documents at scale. SysML (2018).Google ScholarGoogle Scholar

Index Terms

  1. What Happens When?: Interpreting Schedule of Activity Tables in Clinical Trial Documents

              Recommendations

              Comments

              Login options

              Check if you have access through your login credentials or your institution to get full access on this article.

              Sign in
              • Published in

                cover image ACM Conferences
                BCB '18: Proceedings of the 2018 ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics
                August 2018
                727 pages
                ISBN:9781450357944
                DOI:10.1145/3233547

                Copyright © 2018 ACM

                Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

                Publisher

                Association for Computing Machinery

                New York, NY, United States

                Publication History

                • Published: 15 August 2018

                Permissions

                Request permissions about this article.

                Request Permissions

                Check for updates

                Qualifiers

                • short-paper

                Acceptance Rates

                BCB '18 Paper Acceptance Rate46of148submissions,31%Overall Acceptance Rate254of885submissions,29%
              • Article Metrics

                • Downloads (Last 12 months)7
                • Downloads (Last 6 weeks)3

                Other Metrics

              PDF Format

              View or Download as a PDF file.

              PDF

              eReader

              View online with eReader.

              eReader