Using AMT & SNOMED CT-AU to support clinical research

Similar documents
A Case Study: News Classification Based on Term Frequency

Exploration. CS : Deep Reinforcement Learning Sergey Levine

On-Line Data Analytics

Netsmart Sandbox Tour Guide Script

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining

Research computing Results

PESIT SOUTH CAMPUS 10CS71-OBJECT-ORIENTED MODELING AND DESIGN. Faculty: Mrs.Sumana Sinha No. Of Hours: 52. Outcomes

SY 6200 Behavioral Assessment, Analysis, and Intervention Spring 2016, 3 Credits

Division Strategies: Partial Quotients. Fold-Up & Practice Resource for. Students, Parents. and Teachers

Web as Corpus. Corpus Linguistics. Web as Corpus 1 / 1. Corpus Linguistics. Web as Corpus. web.pl 3 / 1. Sketch Engine. Corpus Linguistics

PH.D. IN COMPUTER SCIENCE PROGRAM (POST M.S.)

Class Subject. Phone Number

22264VIC Graduate Certificate in Bereavement Counselling and Intervention. Student Application & Agreement Form

Objectives. Chapter 2: The Representation of Knowledge. Expert Systems: Principles and Programming, Fourth Edition

Lecture 10: Reinforcement Learning

Replace difficult words for Is the language appropriate for the. younger audience. For audience?

The Role of String Similarity Metrics in Ontology Alignment

Identifying Novice Difficulties in Object Oriented Design

STA 225: Introductory Statistics (CT)

Software Maintenance

Laboratorio di Intelligenza Artificiale e Robotica

visual aid ease of creating

CAUL Principles and Guidelines for Library Services to Onshore Students at Remote Campuses to Support Teaching and Learning

PREPARING FOR THE SITE VISIT IN YOUR FUTURE

A process by any other name

Short vs. Extended Answer Questions in Computer Science Exams

PRINCE2 Foundation (2009 Edition)

Linking Task: Identifying authors and book titles in verbose queries

How To Design A Training Course By Peter Taylor

ehealth Governance Initiative: Joint Action JA-EHGov & Thematic Network SEHGovIA DELIVERABLE Version: 2.4 Date:

Automating the E-learning Personalization

Detecting English-French Cognates Using Orthographic Edit Distance

THE UNIVERSITY OF SYDNEY Semester 2, Information Sheet for MATH2068/2988 Number Theory and Cryptography

Investment in e- journals, use and research outcomes

Abstract. Janaka Jayalath Director / Information Systems, Tertiary and Vocational Education Commission, Sri Lanka.

When Student Confidence Clicks

Laboratorio di Intelligenza Artificiale e Robotica

Simulation in Radiology Education

Lisa Forster Student Functional Group - ITS. SI-net: Student Placements

WELCOME! Of Social Competency. Using Social Thinking and. Social Thinking and. the UCLA PEERS Program 5/1/2017. My Background/ Who Am I?

Managerial Decision Making

ACTIVITY INSIGHT FOR COLLEGE OF ARTS & SCIENCES FACULTY

PRINCE2 Practitioner Certification Exam Training - Brochure

Using Task Context to Improve Programmer Productivity

Disambiguation of Thai Personal Name from Online News Articles

ReinForest: Multi-Domain Dialogue Management Using Hierarchical Policies and Knowledge Ontology

WE GAVE A LAWYER BASIC MATH SKILLS, AND YOU WON T BELIEVE WHAT HAPPENED NEXT

Recognition of Prior Learning (RPL) Procedure - Higher Education

Diploma of Building and Construction (Building)

Practice Examination IREB

DIGITAL GAMING & INTERACTIVE MEDIA BACHELOR S DEGREE. Junior Year. Summer (Bridge Quarter) Fall Winter Spring GAME Credits.

Ontological spine, localization and multilingual access

Information System Design and Development (Advanced Higher) Unit. level 7 (12 SCQF credit points)

teaching issues 4 Fact sheet Generic skills Context The nature of generic skills

Assessment of Generic Skills. Discussion Paper

SCT Banner Financial Aid Needs Analysis Training Workbook January 2005 Release 7

Northern Kentucky University Department of Accounting, Finance and Business Law Financial Statement Analysis ACC 308

Ricopili: Postimputation Module. WCPG Education Day Stephan Ripke / Raymond Walters Toronto, October 2015

Postprint.

STUDENT GRADES POLICY

Critical Care Current Fellows

Ministry of Education, Republic of Palau Executive Summary

2017 P-16 Statewide Professional Development Conference What You Don t Know Can Limit You!

MATERIAL COVERED: TEXTBOOK: NOTEBOOK: EVALUATION: This course is divided into five main sections:

Visual CP Representation of Knowledge

5 th September Dear Parent/Carer of Year 10 Students GCSE PE

Tools to SUPPORT IMPLEMENTATION OF a monitoring system for regularly scheduled series

Level 1 Mathematics and Statistics, 2015

AQUA: An Ontology-Driven Question Answering System

Data Integration through Clustering and Finding Statistical Relations - Validation of Approach

Compositional Semantics

The patient-centered medical

CS 101 Computer Science I Fall Instructor Muller. Syllabus

SHEEO State Authorization Inventory. Indiana Last Updated: October 2011

Education the telstra BLuEPRint

INSTRUCTION MANUAL. Survey of Formal Education

Controlled vocabulary

Rule-based Expert Systems

An application of student learner profiling: comparison of students in different degree programs

GRADUATE STUDENT HANDBOOK Master of Science Programs in Biostatistics

Biomedical Sciences. Career Awards for Medical Scientists. Collaborative Research Travel Grants

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Truth Inference in Crowdsourcing: Is the Problem Solved?

Using Web Searches on Important Words to Create Background Sets for LSI Classification

Syntax Parsing 1. Grammars and parsing 2. Top-down and bottom-up parsing 3. Chart parsers 4. Bottom-up chart parsing 5. The Earley Algorithm

Person Centered Positive Behavior Support Plan (PC PBS) Report Scoring Criteria & Checklist (Rev ) P. 1 of 8

OVERVIEW OF CURRICULUM-BASED MEASUREMENT AS A GENERAL OUTCOME MEASURE

Ontologies vs. classification systems

1 Use complex features of a word processing application to a given brief. 2 Create a complex document. 3 Collaborate on a complex document.

CONSTRUCTION OF AN ACHIEVEMENT TEST Introduction One of the important duties of a teacher is to observe the student in the classroom, laboratory and

Mining Association Rules in Student s Assessment Data

SHEEO State Authorization Inventory. Nevada Last Updated: October 2011

Including the Microsoft Solution Framework as an agile method into the V-Modell XT

MASTER OF SCIENCE (M.S.) MAJOR IN COMPUTER SCIENCE

ABC of Programming Linda

ELM Higher Education Workshops. I. Looking for work around the globe. What does it entail? Because careers no longer stop at the border, students will

Intellectual Property

M55205-Mastering Microsoft Project 2016

Subject Inspection of Mathematics REPORT. Marian College Ballsbridge, Dublin 4 Roll number: 60500J

Transcription:

Using AMT & SNOMED CT-AU to support clinical research Simon J. McBRIDE, Michael J. LAWLEY, Hugo LEROUX and Simon GIBSON CSIRO Australian E-Health Research Centre 2 August 2012 PREVENTATIVE HEALTH FLAGSHIP & ICT CENTRE (AUSTRALIAN E-HEALTH RESEARCH CENTRE)

Overview Context Problem Solution options Method Results Limitations & Future work 2 Using AMT & SNOMED CT-AU to support clinical research Simon McBride

Australian Imaging, Biomarkers & Lifestyle Study of Ageing (AIBL) Large scale (+1,100 participants) 4.5+ year prospective, longitudinal study of ageing 4 collections of data in 18 month intervals since 2006 Research streams: Cognitive Imaging Biomarkers Lifestyle Medication definition Pharmaceuticals & Nutraceuticals Medication records include: Name Dose Frequency Duration of use 3 Using AMT & SNOMED CT-AU to support clinical research Simon McBride

Problem AIBL s medication use data quality was poor Participant self- and/or carer-assisted reporting to paper records Manual entry from paper to an electronic data capture system without medication support (free text fields) Types of issues Misspelling of medication names Incomplete records Mix of brand/product names & generic names (e.g. Cartia vs Aspirin ) Consequence Difficult to analyse medication use within the cohort due to paper records How do we Improve the quality of legacy data? Ensure this problem doesn t occur at later time points? 4 Using AMT & SNOMED CT-AU to support clinical research Simon McBride

Solution Candidates Candidate evaluation Support for use cases: How many Participants are taking <medication>? Is there a correlation between <medication> and <observation>? Match to user community expectations Long term sustainability Secret sauce? Is there value in being able to exploit semantics in the data during analysis? Candidates Commercial: MIMS Integrated Standards: Australian Medicines Terminology (AMT), SNOMED CT-AU Public domain: DrugBank 5 Using AMT & SNOMED CT-AU to support clinical research Simon McBride

Method AMT Mapping Concepts below Trade Product SNOMED CT-AU Mapping Concepts below Substance Direct algorithm Exact string match on Preferred Term, Fully Specified Name & Description (via Lucene) Least common ancestor (LCA) algorithm LCA: The common ancestors that are not ancestors of any other common ancestor Navigate hierarchy of candidate concepts ancestors to find least common of those ancestors 6 Using AMT & SNOMED CT-AU to support clinical research Simon McBride

Ontoserver API RESTful API providing a number of useful operations including XMLResponse findconcepts(term, context,...) XMLResponse findconceptsbyterm(term, max) XMLResponse concept(id) XMLResponse subsumedby(term, predicate) XMLResponse parents(term, max) XMLResponse children(term, max) 7 Using AMT & SNOMED CT-AU to support clinical research Simon McBride

Results Mapped To N % AMT Direct LCA SNOMED CT-AU Direct LCA 523 687 1210 43.2 56.8 56.1 147 200 347 42.4 57.6 16.1 Unknown 601 27.8 Total Mapped 1557 72.2 Total 2158 100 8 Using AMT & SNOMED CT-AU to support clinical research Simon McBride

Limitations & future work Relatively naive direct mapping algorithms Do different string matching algorithms improve Direct algorithm performance? More formal evaluation No ground truth when the work was done. Recently acquired later time point data that includes a manual mapping to MIMS Australia terms. Planning an evaluation and improvement of our mapping algorithm using the manual mapping ground truth. Implementation of AMT/SNOMED CT-AU concept lookup during data entry to avoid the need to complete this task in future. Implementation of tools to exploit the relationships between AMT/SNOMED CT-AU concepts in query and visualisation for research purposes Wordle example 9 Using AMT & SNOMED CT-AU to support clinical research Simon McBride

Fun with Wordle With thanks to Wordle (see http://wordle.net/) 10 Using AMT & SNOMED CT-AU to support clinical research Simon McBride

Snapper & Ontoserver for fun and profit AEHRC has tools that you can play with Ts & Cs say fun only in the first instance, talk to us if you re after profit Snapper Terminology mapping tool Free download at http://aehrc.com/snapper Ontoserver Free access to an Ontoserver instance: http://ec2-23-20-239-33.compute-1.amazonaws.com:8080/ontoserver/ Ontology server providing a useful API (SNOMED variants & AMT) RESTful API WADL at http://ec2-23-20-239-33.compute- 1.amazonaws.com:8080/ontoserver/resources/application.wadl Don t try writing it down, contact me after the session 11 Using AMT & SNOMED CT-AU to support clinical research Simon McBride

Questions, thank you & more information Simon McBride Research Project Leader Australian E-Health Research Centre t +61 7 3253 3631 e simon.mcbride@csiro.au w http://aehrc.com/ Many thanks to co-authors Dr Michael Lawley Dr Hugo Leroux Mr Simon Gibson AIBL: http://aibl.csiro.au/ Australian E-Health Research Centre (AEHRC) http://aehrc.com/ Come visit us at Booth 26 PREVENTATIVE HEALTH FLAGSHIP & ICT CENTRE (AEHRC)