Session 4. Case Study of Modern Approach to Lapse Rate Assumption

Similar documents
Python Machine Learning

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

Lecture 1: Machine Learning Basics

(Sub)Gradient Descent

CS Machine Learning

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Assignment 1: Predicting Amazon Review Ratings

Northern Kentucky University Department of Accounting, Finance and Business Law Financial Statement Analysis ACC 308

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Model Ensemble for Click Prediction in Bing Search Ads

Edexcel GCSE. Statistics 1389 Paper 1H. June Mark Scheme. Statistics Edexcel GCSE

Probability and Statistics Curriculum Pacing Guide

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

A Decision Tree Analysis of the Transfer Student Emma Gunu, MS Research Analyst Robert M Roe, PhD Executive Director of Institutional Research and

Learning From the Past with Experiment Databases

CSL465/603 - Machine Learning

Axiom 2013 Team Description Paper

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

Statewide Framework Document for:

CS 446: Machine Learning

Artificial Neural Networks written examination

Diagnostic Test. Middle School Mathematics

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

Purdue Data Summit Communication of Big Data Analytics. New SAT Predictive Validity Case Study

Generative models and adversarial training

Notes on The Sciences of the Artificial Adapted from a shorter document written for course (Deciding What to Design) 1

An Introduction to Simio for Beginners

Mathematics process categories

Grade 6: Correlated to AGS Basic Math Skills

A GENERIC SPLIT PROCESS MODEL FOR ASSET MANAGEMENT DECISION-MAKING

Applications of data mining algorithms to analysis of medical data

ACTL5103 Stochastic Modelling For Actuaries. Course Outline Semester 2, 2014

learning collegiate assessment]

DEPARTMENT OF FINANCE AND ECONOMICS

Radius STEM Readiness TM

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

CHAPTER 4: REIMBURSEMENT STRATEGIES 24

FRAMEWORK FOR IDENTIFYING THE MOST LIKELY SUCCESSFUL UNDERPRIVILEGED TERTIARY STUDY BURSARY APPLICANTS

GACE Computer Science Assessment Test at a Glance

Global Television Manufacturing Industry : Trend, Profit, and Forecast Analysis Published September 2012

STA 225: Introductory Statistics (CT)

Detailed course syllabus

Functional Skills Mathematics Level 2 assessment

Algebra 1, Quarter 3, Unit 3.1. Line of Best Fit. Overview

Impact of Cluster Validity Measures on Performance of Hybrid Models Based on K-means and Decision Trees

Certified Six Sigma Professionals International Certification Courses in Six Sigma Green Belt

Universidade do Minho Escola de Engenharia

12- A whirlwind tour of statistics

Improvements to the Pruning Behavior of DNN Acoustic Models

Digital Fabrication and Aunt Sarah: Enabling Quadratic Explorations via Technology. Michael L. Connell University of Houston - Downtown

Go fishing! Responsibility judgments when cooperation breaks down

Learning Methods in Multilingual Speech Recognition

Data Structures and Algorithms

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

University of Cincinnati College of Medicine. DECISION ANALYSIS AND COST-EFFECTIVENESS BE-7068C: Spring 2016

Probabilistic Latent Semantic Analysis

On-Line Data Analytics

An Empirical Comparison of Supervised Ensemble Learning Approaches

Decision Analysis. Decision-Making Problem. Decision Analysis. Part 1 Decision Analysis and Decision Tables. Decision Analysis, Part 1

Analyzing sentiments in tweets for Tesla Model 3 using SAS Enterprise Miner and SAS Sentiment Analysis Studio

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

A Case Study: News Classification Based on Term Frequency

Numeracy Medium term plan: Summer Term Level 2C/2B Year 2 Level 2A/3C

Software Development Plan

Speech Recognition at ICSI: Broadcast News and beyond

Reducing Features to Improve Bug Prediction

STT 231 Test 1. Fill in the Letter of Your Choice to Each Question in the Scantron. Each question is worth 2 point.

Robot manipulations and development of spatial imagery

Switchboard Language Model Improvement with Conversational Data from Gigaword

UoS - College of Business Administration. Master of Business Administration (MBA)

WHEN THERE IS A mismatch between the acoustic

Diploma in Library and Information Science (Part-Time) - SH220

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

On the Combined Behavior of Autonomous Resource Management Agents

CS 101 Computer Science I Fall Instructor Muller. Syllabus

Research computing Results

The CTQ Flowdown as a Conceptual Model of Project Objectives

Knowledge Transfer in Deep Convolutional Neural Nets

B. How to write a research paper

Laboratorio di Intelligenza Artificiale e Robotica

Innovating Toward a Vibrant Learning Ecosystem:

Hard Drive 60 GB RAM 4 GB Graphics High powered graphics Input Power /1/50/60

MMOG Subscription Business Models: Table of Contents

Rule Learning With Negation: Issues Regarding Effectiveness

Beyond the Pipeline: Discrete Optimization in NLP

Experiments with SMS Translation and Stochastic Gradient Descent in Spanish Text Author Profiling

ME 443/643 Design Techniques in Mechanical Engineering. Lecture 1: Introduction

Innovative Methods for Teaching Engineering Courses

Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks

Business 712 Managerial Negotiations Fall 2011 Course Outline. Human Resources and Management Area DeGroote School of Business McMaster University

University of Groningen. Systemen, planning, netwerken Bosman, Aart

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining

It s a lean life! The Journey

Detecting English-French Cognates Using Orthographic Edit Distance

Analysis of Hybrid Soft and Hard Computing Techniques for Forex Monitoring Systems

Calibration of Confidence Measures in Speech Recognition

University of Waterloo School of Accountancy. AFM 102: Introductory Management Accounting. Fall Term 2004: Section 4

Integrating simulation into the engineering curriculum: a case study

Transcription:

SOA Predictive Analytics Seminar Taiwan 31 Aug. 2018 Taipei, Taiwan Session 4 Case Study of Modern Approach to Lapse Rate Assumption Richard Liao, ASA Stanley Hsieh

Case Study of Modern Approach to Lapse Rate Assumption RICHARD LIAO/ STANLEY HSIEH 31 August, 2018 Table of Contents Why machine learning for lapse study?...3 Machine learning preparation.. 14 Machine learning model....21 Case study analysis of outcome..30 Machine learning tool....40 Q & A 43 2

Why machine learning for lapse study? What is Machine Learning? Use statistic to give computer ability to learn Let the algorithm do the job to improve the prediction 4

What is Machine Learning? Supervised learning Learning a function with input and output Labeled training data set is used to learn a function This function can be used to map new examples Two main tasks Unsupervised learning Learning a function describing the structure of unlabeled data 5 What is Machine Learning? Regression To predict continuous outcomes Classification Two main tasks To predict discrete classes Training set For training machine learning model Validation set For machine learning model adjustment Testing set For prediction and testing prediction power 6

What Impacts Lapse Rate? What are the attributes affecting lapse rate? Only one attribute or more attributes? Should it be really time dependent? Different product types? Sales channel or even sales office, sales person? Social economic trends impact? Other factors we don t normally think of? 7 Traditional Experience Study Traditional way of lapse rate experience study usually contains a few dimensions only: Premium mode Policy year Product type Gender Sales channel Often times, the result by the above dimensions look volatile. Should more dimensions be considered? What are those? How can we find them easier? 8

Business Impact by Lapse Rate It is really, really hard to sell an insurance policy. Have we tried upmost to prevent lapse? 9 Business Impact by Lapse Rate Profit and Loss High volatility of lapse rate estimation may cause high volatility of profit and loss, especially after the implementation of IFRS17, significant difference of actual lapse realized and expected lapse becomes the source of profit and loss Market influence The ability to monitor and retain insurance policies may influence the domination of market share and corporate reputation Customer value When high value policies are sold, preventing policies from surrender is the key to keep customer value or company value 10

Business Impact by Lapse Rate Marketing strategy When knowing the possible lapse behaviors resulting from specific product types, sales behaviors, policyholders features, non-policyholders features, or other factors, insurance companies can have better position on making marketing strategy for policy sales Product design Lapse rate plays a key role when pricing a product and determining the profitability of a product. Accurate estimation of lapse rate becomes important when implementing business plan Risk management and ALM Asset and liability management and risk capital management heavily relies on the accuracy of cash flow projection. Hence, lapse rate prediction is extraordinarily crucial for the management decision 11 Linking Machine Learning with Lapse Study Supervised learning X Y Binary classification problem: Y = 1 for Surrender = 0 for Non-surrender Combine policy related data with economic data to enrich data Algorithm learns from information of data Select an appropriate machine learning model 12

Benefit of Machine Learning Approach Higher prediction power More dimensions to determine lapse behaviors More automatic assumption making process Improve short term money management 13 Machine learning preparation

Project Flow Problem Definition Data Investigation Modeling Analysis 15 Data, Resource and Business Impact Data availability Cost of data purchase or collection Privacy issue / legal issue Data quality Consistency over time regarding definitions Mindful of garbage in, garbage out Enough data counts Enough variable (attribute) counts Dealing with missing date apply common methodologies Investment in data infrastructure 16

How to succeed? Start from small and realistic goals, and build from the success to make it bigger Cooperate with subject matter experts Understand the implementation needs of the model, such as purpose, cost, time frame of each prediction, or resource supported 17 Data Types & Variable Types Independent Variable (X): Policy Related Data: premium balance, channel mode etc Economic Index: GDP, stock index, inflation, real-estate price etc Dependent Variable (Y): Y = 1 for Surrender and Y = 0 for Non-Surrender 18

Quality of Data & Data Collections Source of Data: Internet? Agent? Why do we have missing data? There is no value in learning constant data Some data is recorded recently so there is lack of historical data Communication with data engineer for data cleaning Actuarial Perspective is important for variables selection 19 Data Cleaning Techniques & Transformation Select a threshold for excluding variable with too many missing data Mean Imputation by filling data mean to missing observations We can use feature engineering to create variables Categorical variable has to be transformed into factors 20

Machine learning model Machine learning Model Generalized Linear Model Decision Tree Random Forest Gradient Boosting Machine 22

Generalized Linear Model Result can be interpreted by coefficients of variables Link Function and Distribution logit and binomial for binary classification Classical Way By using statistical test for model significance Machine Learning Way By feeding more variables for prediction power Regularization: To control overfitting of GLM Regularization tool: Ridge (L2-norm) vs Lasso (L1-norm) LASSO is widely more popular due to its penalty character 23 24

Decision Tree Decision boundary is drawn to capture non-linear trend Key idea of algorithm: recursive binary splitting Measure impurity of node by Gini Index Policy = 200 Y=90 N=110 Algorithm goes through the variables to find the variable that has lower Gini index as this variable classifies lapse behavior more distinguishably. Policy = 120 Y = 70 N=50 Policy = 80 Y = 20 N =60 25 26

Random Forest Start from idea of bagging resampling and bootstrapping Searches for the best feature among a random subset of features to de-correlate the trees Trees can be implemented by parallel computation 27 28

Gradient Boosting Machine (GBM) G(x) = F(X) + h(x) + F(X) = weaker learner Residuals = y F(X) Residuals is trained in the direction of gradient descent Add the trained residuals to weaker learner then repeat this process Train a bad tree first then train its residual to make it a better tree Generally, a powerful machine learning model 29 Case study analysis of outcome

Outcome Class Probability: p0 = Non-surrender probability and p1 = Surrender Probability Optimal Threshold Threshold that optimally decide whether each policy will surrender next quarter Predict p0 p1 0 0.99 0.01 0 0.90 0.10 1 0.11 0.89 0 0.91 0.09 0 0.87 0.13 0 0.88 0.12 1 0.12 0.92 31 Metrics To evaluate performance of model To prevent overfitting MSE (Mean Square Error): It can be used to evaluate numeric prediction like stock price prediction AUC (Area under Curve): This is what we used for the case study which is a classification problem. 32

AUC (Area under Curve) AUC = 0.95 AUC stands for Area under the ROC (Return of Characteristics) Curve Points on ROC is the False Positive Rate and True Positive Rate at certain threshold 33 Hyper-Parameter Tuning Maximum Variables Allows in a GLM : Tradeoff between model explanation and model prediction Depth of Tree: Is deeper the tree better the model? Number of Trees in a Forest: Is more trees in a forest better the model? Number of Sequential Estimators for GBM: How many time should we repeat sequential training? Grid Search vs Random Search: A tradeoff between efficiency and accuracy 34

AE Ratio It is not easy to tell which method is better here as models are compared in onedimensional space Gives some sense of model performance in one dimensional space However, machine learning model should capture all dimensions performance 35 Two-Way Lift Chart ML shows better result here as the chart consider overall dimensions Vertical axis is A/E ratio and horizontal axis is the ratio of machine learning model prediction to experience study prediction AE Ratio approach but capture more dimensions Better model is determined by whether the line is close to 1 throughout the range of horizontal axis 36

Variable Importance Chart Variable Importance: Calculate the relative influence of the variable in a machine learning model It can be used to look at the variable that has higher influence on classifying surrender policy Variables We can find the variable that is not considered by traditional experience study 37 More on the Case Study Method AUC on Testing Data Experience Study 0.70 N/A GLM 0.81 16% Decision Tree 0.80 12% Random Forest 0.87 21% GBM 0.95 29% Performance Increase by Adopt GLM for model explanation while it has shown reasonable prediction power Random forest and GBM has shown better prediction power than decision tree Adopt GBM for most accurate prediction as it has the best prediction power 38

Some Learnings from the Study Machine learning suggested new dimensions not commonly looked at before in traditional experience study: Consider the top 3 important variables Amount of policy an agent sells affect lapse rate? Re-examine the important variables for agents management Machine learning can derive a lapse function for each policy, which can be used for: Lapse simulation Value of customer calculation Continuously monitoring lapse behavior with up-to-date data and updated model 39 Machine learning tool

They are all Open-Source 41 Machine Learning Tool Hadoop Spark H2o.ai Python Jupyter Notebook Distributed data storage to store and distribute big data Data processor for data cleaning process Package for machine learning with big data Object-oriented programming language that implement Spark and H2o.ai Web-based computing interface for modelling and visualization 42

Q & A