Predictive Analytics 101: An Introduction to the Future of Healthcare

Similar documents
Python Machine Learning

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Lecture 1: Machine Learning Basics

Assignment 1: Predicting Amazon Review Ratings

Learning From the Past with Experiment Databases

ScienceDirect. A Framework for Clustering Cardiac Patient s Records Using Unsupervised Learning Techniques

Impact of Cluster Validity Measures on Performance of Hybrid Models Based on K-means and Decision Trees

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

Rule Learning With Negation: Issues Regarding Effectiveness

Probability and Statistics Curriculum Pacing Guide

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Artificial Neural Networks written examination

(Sub)Gradient Descent

CS Machine Learning

Multivariate k-nearest Neighbor Regression for Time Series data -

STA 225: Introductory Statistics (CT)

Rule Learning with Negation: Issues Regarding Effectiveness

Author: Justyna Kowalczys Stowarzyszenie Angielski w Medycynie (PL) Feb 2015

Word Segmentation of Off-line Handwritten Documents

Issues in the Mining of Heart Failure Datasets

Lecture 1: Basic Concepts of Machine Learning

CS 446: Machine Learning

Human Emotion Recognition From Speech

CHAPTER 4: REIMBURSEMENT STRATEGIES 24

The Good Judgment Project: A large scale test of different methods of combining expert predictions

Switchboard Language Model Improvement with Conversational Data from Gigaword

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

Time series prediction

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

Learning Methods for Fuzzy Systems

Calibration of Confidence Measures in Speech Recognition

Analysis of Hybrid Soft and Hard Computing Techniques for Forex Monitoring Systems

A Case Study: News Classification Based on Term Frequency

Purdue Data Summit Communication of Big Data Analytics. New SAT Predictive Validity Case Study

Reducing Features to Improve Bug Prediction

Radius STEM Readiness TM

Universidade do Minho Escola de Engenharia

Analysis of Emotion Recognition System through Speech Signal Using KNN & GMM Classifier

MYCIN. The MYCIN Task

Australian Journal of Basic and Applied Sciences

Active Learning. Yingyu Liang Computer Sciences 760 Fall

Learning Methods in Multilingual Speech Recognition

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

A Reinforcement Learning Variant for Control Scheduling

Higher Education Six-Year Plans

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

Axiom 2013 Team Description Paper

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Evolutive Neural Net Fuzzy Filtering: Basic Description

Redirected Inbound Call Sampling An Example of Fit for Purpose Non-probability Sample Design

Using Web Searches on Important Words to Create Background Sets for LSI Classification

Test Effort Estimation Using Neural Network

AGS THE GREAT REVIEW GAME FOR PRE-ALGEBRA (CD) CORRELATED TO CALIFORNIA CONTENT STANDARDS

CSL465/603 - Machine Learning

Exploration. CS : Deep Reinforcement Learning Sergey Levine

The lab is designed to remind you how to work with scientific data (including dealing with uncertainty) and to review experimental design.

Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

Improving recruitment, hiring, and retention practices for VA psychologists: An analysis of the benefits of Title 38

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation

Generative models and adversarial training

Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages

CLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH

Evaluation of Teach For America:

Seminar - Organic Computing

A Decision Tree Analysis of the Transfer Student Emma Gunu, MS Research Analyst Robert M Roe, PhD Executive Director of Institutional Research and

Maximizing Learning Through Course Alignment and Experience with Different Types of Knowledge

Historical maintenance relevant information roadmap for a self-learning maintenance prediction procedural approach

Applications of data mining algorithms to analysis of medical data

A Comparison of Standard and Interval Association Rules

Linking Task: Identifying authors and book titles in verbose queries

Probabilistic Latent Semantic Analysis

Speech Emotion Recognition Using Support Vector Machine

Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Conference Presentation

K-Medoid Algorithm in Clustering Student Scholarship Applicants

An Empirical Analysis of the Effects of Mexican American Studies Participation on Student Achievement within Tucson Unified School District

A Vector Space Approach for Aspect-Based Sentiment Analysis

arxiv: v2 [cs.cv] 30 Mar 2017

Product Feature-based Ratings foropinionsummarization of E-Commerce Feedback Comments

Model Ensemble for Click Prediction in Bing Search Ads

Dublin City Schools Mathematics Graded Course of Study GRADE 4

Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model

Comparison of EM and Two-Step Cluster Method for Mixed Data: An Application

Deep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach

Exposé for a Master s Thesis

OFFICE SUPPORT SPECIALIST Technical Diploma

Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA

Iowa School District Profiles. Le Mars

Certified Six Sigma Professionals International Certification Courses in Six Sigma Green Belt

Full-time MBA Program Distinguish Yourself.

Text-mining the Estonian National Electronic Health Record

Visit us at:

Algebra 1, Quarter 3, Unit 3.1. Line of Best Fit. Overview

Go fishing! Responsibility judgments when cooperation breaks down

Indian Institute of Technology, Kanpur

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

BENCHMARK TREND COMPARISON REPORT:

Transcription:

MGMA 2017 ANNUAL CONFERENCE OCT. 8-11 ANAHEIM, CA Predictive Analytics 101: An Introduction to the Future of Healthcare Frank Cohen, MBB, MPA Director, Analytics, Doctors Management LLC Clearwater, Fla. Frank Cohen does not have any financial conflicts to report at this time. 1

Learning Objectives 1. Explain the basic concepts of predictive analytics 2. Illustrate how predictive algorithms are built 3. Incorporate predictive analytics or nearpredictive analytics into audit plans Defining Analytics in General often involves studying past historical data to research potential trends, to analyze the effects of certain decisions or events, or to evaluate the performance of a given tool or scenario. The goal of is to improve the business by gaining knowledge which can be used to make improvements or changes. [www.businessdictionary.com/definition/.html] 2

Analytical Categories Descriptive Exploratory Inferential Predictive Causal Mechanistic Prescriptive Descriptive Descriptive is a preliminary stage of data processing that creates a summary of historical data to yield useful information and possibly prepare the data for further analysis. For example: Overpayment rate for a specific sample Median work RVUs reported by a physician population Percent of patients that are late or no shows Accounts receivable over time Claims summary information Descriptive describe a current/past state or condition 3

Exploratory In statistics, exploratory data analysis (EDA) is an approach to analyzing data sets to summarize their main characteristics, often with visual methods. For example: Profiles of provider transactions Provider similarity according to profiles Visual summaries of large amounts of data Eligibility data link to provider billing In EDA, the researcher takes a bit of a bird s eye view of the data in order to make sense of what is available to be reviewed Inferential Inferential statistics draws valid inferences about a population based on an analysis of a representative sample of that population. For example: Estimating overpayment from a sample to a population Gain a better understanding on how work RVUs will impact compensation Improving scheduling by getting a better estimate of uncertainty Using time series analysis to understand A/R over time Absolutely dependent upon a statistically valid random sample (SVRS) Inferential models include estimation, prediction and assessments 4

Predictive Predictive analytics is the branch of statistics which is used to make predictions about unknown future events. For example: Predicting the likelihood that a given physician will be audited in the future Predicting which procedure codes/modifiers are most likely to be targets Predicting how likely a provider is to be sued for malpractice Predicting the likelihood that a patient will return to the hospital within 30 days Predicting the time it will take for a new physician to break even Predictive analytics uses many techniques from data mining, statistics, modeling, machine learning, and artificial intelligence to analyze current data to make predictions about future. Prescriptive Prescriptive analytics is the area of business analytics (BA) dedicated to finding the best course of action for a given situation. Prescriptive analytics is related to both descriptive and predictive analytics. 5

An Analytical Approach Prediction vs. Estimation Estimation uses data to estimate (or guess) at a parameter of for some already known variable Prediction uses the data to estimate (or guess) at some random value that is not a part of the known variables or data set Estimation or prediction? Extrapolation Audit risk Impact of new drugs Healthcare insurance premiums Driving a car Piano tuners in Chicago Predictors usually have larger uncertainties than estimators 6

We cannot solve our problems with the same thinking we used when we created them. Albert Einstein LET S TALK PREDICTIVE Why Predictive Analytics? The big picture creates the need for: Strategic and financial planning Improving access to care Improving RC and profitability Improving outcomes and access Limit and mitigate risk Helping to achieve focus on priorities Leaders want to be able to look into the future What should we expect next year? What can we do to meet our objectives? 7

What can PA provide to the business? Improving efficiencies (more with less) Improve financial forecasting, ensuring long term survival Resource allocation and categorization Understand the market better More competitive New products and services Understand patient care better Predicting areas of patient dissatisfaction Statistical analysis Text Mining Machine Learning Artificial Intelligence Forecasting Optimization Exploratory analysis The Many Faces of PA 8

Supervised vs. Unsupervised Learning Supervised learning Supervised learning is an approach to machine learning where both input and desired output data are provided in the form of expected answers. Input and output data are labelled for classification to provide a learning basis for future data processing Unsupervised learning Unsupervised learning is an approach to machine learning whereby software learns from data without being given correct answers. It is an important type of artificial intelligence as it allows an AI to self improve based on large, diverse data sets such as real world experience. Basic difference in layman terms : In supervised learning, the output datasets are provided which are used to train the machine and get the desired outputs whereas in unsupervised learning no datasets are provided, instead the data are clustered into different classes Application of learning types Supervised Facial and visual recognition Sorting good claims from bad claims Cloning of EHR records Identification of patients at risk Unsupervised Facial and visual recognition Self driving cars Human behavior patterns Robotic vacuums Machine learning 9

Types of Models Classifiers place data points into unique buckets Recommenders recommends products and service a consumer is most likely to purchase based on prior behavior Numerical models include regression, time series and other areas of canonical statistics Text Mining enables high dimensional analysis of unstructured data, like text found in EHR and documentation in patient charts Neural Networks used complex weighting to find the best predictor Naïve Bayes Kth Nearest Neighbor (KNN) CART (Classification and Regression Trees) Random Forests Regression (linear, multiple, logistic, etc.) Time Series analysis MARS Boosted Trees Majority classifier Support Vector Machine (SVM) Neural Networks Natural language processing Social Network analysis (SNA) And new ones just about every day PA Algorithms 10

KNN, Clusters, CART, Decision Trees CLASSIFIERS Classifcation Classifies each data point based some set of attributes Specialty Time in practice Prior lawsuits Prior audits Internal coding reviews Patient satisfaction surveys Resignations and terminations The goal is to assign an unknown variable or record to a class based on location 11

Classification types KNN (Kth Nearest Neighbor) Classification trees and forests Neural networks Support Vector Machines (SVM) KNN (Kth Nearest Neighbor) In KNN, we place each data point into a class that is most appropriate We measure the distance between the items in a class using specific metrics Centroid measures the distance from the center of a class Medoid measures the distance from some representative point A new data point is placed into the given class based on it s distance from the closest class 12

KNN Example 1 Classifying whether a given claim may be a high risk target for audit or review Using any number of variables, such as number of ICD codes, procedure code, presence of a modifier, patient demographics, cohort comparison, etc., a claim is classified as either risk or no risk based on its distance to the cluster centroid Risk type (high, medium, low) is a function of distance thresholds Closer to centroid higher risk Classification and Regression Trees (CART) Classification and regression trees are machine learning methods for constructing prediction models from data. The models are obtained by recursively partitioning the data space and fitting a simple prediction model within each partition. As a result, the partitioning can be represented graphically as a decision tree. 13

Building a classification model Training Data Testing Data Tree ID Attrib 1 Attrib 2 Attrib 3 Class 1 Yes High 3125 Yes 2 Yes Medium 5280 Yes 3 No High 5800 No 4 Yes Low 2100 No 5 No Medium 17 No 6 No High 7050 No 7 No High 1104 No 8 Yes Low 2304 Yes 9 No Low 187 No 10 Yes Low 1874 No 11 Yes Medium 6102 Yes Tree ID Attrib 1 Attrib 2 Attrib 3 Class 12 Yes Medium 3104? 13 Yes High 2877? 14 No Low 788? 15 Yes Low 3671? 16 No Medium 2252? CART Example 2 Bivariate Tree ID Attrib 1 Attrib 2 Attrib 3 Class 12 Yes Medium 3104? 13 Yes High 2877? 14 No Low 788? 15 Yes Low 3671? 16 No Medium 2252? 14

CART Example 2 Multivariate Tree ID Attrib 1 Attrib 2 Attrib 3 Class 12 Yes Medium 3104? 13 Yes High 2877? 14 No Low 788? 15 Yes Low 3671? 16 No Medium 2252? CART Example 1 Predicting hospital stays of greater than 30 days This is a binary tree that bifurcates each step based on specific scores (or values) associated to encounter times with different healthcare teams 15

Support Vector Machines A Support Vector Machine (SVM) is a discriminative classifier formally defined by a separating hyperplane. In other words, given labeled training data (supervised learning), the algorithm outputs an optimal hyperplane which categorizes new examples. Regression and Time Series NUMERICAL MODELS 16

Regression Analysis Regression analysis is a form of predictive modelling technique which investigates the relationship between a dependent (target) and independen t variable (s) (predictor). This technique is used for forecasting, time series modelling and finding the causal effect relationship between the variables. For example: Predicting how long it will take a new physician to break even Predicting how many (and which) patients will be no shows Predicting body fat using BMI Regression analysis example 1 Predicting revenue for a newly hired physician Take revenue amounts for some number of physicians over some time period and create a slope formula Use formula top calculate outside of data range, for example, at 18 months, it is 255.5 + (638.5 * 36) (8.125 36 2 ) = $12,711.5 Value 12000 10000 8000 6000 4000 2000 Fitted Line Plot Value = 255.5 + 638.5 Value 2-8.125 Value 2^2 S 340.304 R-Sq 98.9% R-Sq(adj) 98.8% 0 0 5 10 15 Value 2 20 25 17

Regression analysis example 2 Predicting body fat percentages using BMI data Plot BMI against percent of body fat Using slope formula to predict outside of the regression Where BMI = 33.2, body fat = 23.19 + (3.286 * 33.2) ( 0.03999 * 33.2 2) = 41.905 Regression analysis example 3 Predicting long term recovery after discharge from a hospital Chart prognosis scale against number of days since discharge Use slope formula to predict some number outside of the data set At 42 days, prog = 58.6066 * 42 1.666 = 10.87 18

Time Series In descriptive modeling, or time series analysis, a time series is modeled to determine its components in terms of seasonal patterns, trends, relation to external factors, and the like. In contrast, time series forecasting uses the information in a time series (perhaps with additional information) to forecast future values of that series {Page 18 19, Practical Time Series Forecasting with R: A Hands On Guide} Time Series Example 1 Collection 80000 70000 60000 50000 40000 30000 20000 10000 0 3 6 Moving Average Plot for Collection 9 12 15 18 Month Variable Actual Fits Forecasts 95.0% PI Moving Average Length 3 Accuracy Measures MAPE 20 MAD 3493 MSD 26450913 Charge Collection Run Bonus Salary Cost P/L 1,953.35 195.34 0.10 18,333.00 21,082.95 (20,887.61) 3,649.04 656.83 0.18 18,333.00 21,082.95 (20,426.12) 3,938.22 748.26 0.19 18,333.00 21,082.95 (20,334.69) 5,075.24 1,167.31 0.23 18,333.00 21,082.95 (19,915.64) 6,356.85 1,970.62 0.31 18,333.00 21,082.95 (19,112.33) 8,206.97 2,872.44 0.35 18,333.00 21,082.95 (18,210.51) 10,697.90 4,386.14 0.41 18,333.00 21,082.95 (16,696.81) 14,113.88 5,927.83 0.42 18,333.00 21,082.95 (15,155.12) 18,775.33 8,448.90 0.45 18,333.00 21,082.95 (12,634.05) 25,144.77 11,566.60 0.46 18,333.00 21,082.95 (9,516.35) 33,844.94 16,584.02 0.49 18,333.00 21,082.95 (4,498.93) 45,729.83 23,322.21 0.51 18,333.00 21,082.95 2,239.26 61,964.80 30,362.75 0.49 18,333.00 21,082.95 9,279.80 84,142.21 43,753.95 0.52 18,333.00 21,082.95 22,671.00 114,437.11 58,362.92 0.51 1,254.44 18,333.00 22,337.39 36,025.54 111,375.00 53,460.00 0.48 519.00 18,333.00 21,601.95 31,858.05 113,575.00 60,194.75 0.53 1,529.21 18,333.00 22,612.16 37,582.59 112,915.00 60,974.10 0.54 1,646.12 18,333.00 22,729.07 38,245.04 118,085.00 63,765.90 0.54 2,064.89 18,333.00 23,147.84 40,618.07 116,167.70 61,568.88 0.53 1,735.33 18,333.00 22,818.28 38,750.60 19

Neural Networks In between the input units and output units are one or more layers of hidden units, which, together, form the majority of the artificial brain. Most neural networks are fully connected, which means each hidden unit and each output unit is connected to every unit in the layers either side. An often stated advantage of neural networks over conventional programs lies in their ability to solve problems that either do not have an algorithmic solution or a solution is too complex to find. Neural networks are well suited to tackle problems that people are good at solving, like prediction and pattern recognition NEURAL NETWORKS 20

Neural Network Example 1 Predicting Mortality Neural Network Example 2 Predicting Heart Disease 21

TEXT ANALYTICS Text Analytics Text analytics converts unstructured text data, which account for over 70% of healthcare records, into meaningful data that can be used for analysis, feedback, search engines and other purposes. For example Cloning detection Patient sentiment analysis Outcomes assessments 22

Text Analytics Example The Truth about PA Predictive modeling is all about probabilities and uncertainty Some likelihood an event occurs, not certainty PA does not necessarily work for every issue or problem Highly complex systems where co dependencies cannot be modeled How accurate do my predictive mode 23

In Conclusion... Continuing Education ACMPE credit for medical practice executives. 1 AAPC Core B, CPPM credit 1 ACHE credit for medical practice executives 1 CME AMA PRA Category 1 Credits.. 1 CNE credit for licensed nurses 1 CPE credit for certified public accountants (CPAs) 1.2 CEU credit for generic continuing education 1 Let the speakers know what you thought! Evaluations are available on the MGMA mobile app 24

Frank Cohen fcohen@drsmgmt.com 727 322 4332 Doctors Management LLC 2075 San Marinio Way North Clearwater, FL 33763 25