Pricing illiquid assets A Deep Learning approach. Oded Luria Deep Learning Meetup Dec 2015

Similar documents
Python Machine Learning

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Lecture 1: Machine Learning Basics

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

Knowledge Transfer in Deep Convolutional Neural Nets

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

CS Machine Learning

Autoregressive product of multi-frame predictions can improve the accuracy of hybrid models

Generative models and adversarial training

Softprop: Softmax Neural Network Backpropagation Learning

(Sub)Gradient Descent

arxiv: v1 [cs.lg] 15 Jun 2015

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Assignment 1: Predicting Amazon Review Ratings

Learning From the Past with Experiment Databases

TRANSFER LEARNING OF WEAKLY LABELLED AUDIO. Aleksandr Diment, Tuomas Virtanen

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

Human Emotion Recognition From Speech

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation

Artificial Neural Networks written examination

Exploration. CS : Deep Reinforcement Learning Sergey Levine

Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

Probability and Statistics Curriculum Pacing Guide

Evolutive Neural Net Fuzzy Filtering: Basic Description

CSL465/603 - Machine Learning

Rule Learning With Negation: Issues Regarding Effectiveness

Essentials of Ability Testing. Joni Lakin Assistant Professor Educational Foundations, Leadership, and Technology

Switchboard Language Model Improvement with Conversational Data from Gigaword

WHEN THERE IS A mismatch between the acoustic

SARDNET: A Self-Organizing Feature Map for Sequences

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

Deep Neural Network Language Models

A Neural Network GUI Tested on Text-To-Phoneme Mapping

Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model

INPE São José dos Campos

Deep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach

arxiv: v2 [cs.cv] 30 Mar 2017

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

arxiv: v1 [cs.lg] 7 Apr 2015

Algebra 1, Quarter 3, Unit 3.1. Line of Best Fit. Overview

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

Australian Journal of Basic and Applied Sciences

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

Laboratorio di Intelligenza Artificiale e Robotica

THE world surrounding us involves multiple modalities

Speech Recognition at ICSI: Broadcast News and beyond

Modeling function word errors in DNN-HMM based LVCSR systems

A Case Study: News Classification Based on Term Frequency

Purdue Data Summit Communication of Big Data Analytics. New SAT Predictive Validity Case Study

Word Segmentation of Off-line Handwritten Documents

Knowledge-Based - Systems

WE GAVE A LAWYER BASIC MATH SKILLS, AND YOU WON T BELIEVE WHAT HAPPENED NEXT

Lecture 1: Basic Concepts of Machine Learning

Linking Task: Identifying authors and book titles in verbose queries

Seminar - Organic Computing

Semantic Segmentation with Histological Image Data: Cancer Cell vs. Stroma

MGT/MGP/MGB 261: Investment Analysis

Medical Complexity: A Pragmatic Theory

Calibration of Confidence Measures in Speech Recognition

Probabilistic Latent Semantic Analysis

OVERVIEW OF CURRICULUM-BASED MEASUREMENT AS A GENERAL OUTCOME MEASURE

Semi-Supervised GMM and DNN Acoustic Model Training with Multi-system Combination and Confidence Re-calibration

IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 17, NO. 3, MARCH

Rule Learning with Negation: Issues Regarding Effectiveness

Time series prediction

Phonetic- and Speaker-Discriminant Features for Speaker Recognition. Research Project

learning collegiate assessment]

A Review: Speech Recognition with Deep Learning Methods

Framewise Phoneme Classification with Bidirectional LSTM and Other Neural Network Architectures

Analysis of Hybrid Soft and Hard Computing Techniques for Forex Monitoring Systems

Notes on The Sciences of the Artificial Adapted from a shorter document written for course (Deciding What to Design) 1

ScienceDirect. A Framework for Clustering Cardiac Patient s Records Using Unsupervised Learning Techniques

Learning Methods in Multilingual Speech Recognition

ADVANCES IN DEEP NEURAL NETWORK APPROACHES TO SPEAKER RECOGNITION

Robust Speech Recognition using DNN-HMM Acoustic Model Combining Noise-aware training with Spectral Subtraction

The Good Judgment Project: A large scale test of different methods of combining expert predictions

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

Lip reading: Japanese vowel recognition by tracking temporal changes of lip shape

Learning Methods for Fuzzy Systems

Improvements to the Pruning Behavior of DNN Acoustic Models

A Simple VQA Model with a Few Tricks and Image Features from Bottom-up Attention

Cultivating DNN Diversity for Large Scale Video Labelling

Modeling function word errors in DNN-HMM based LVCSR systems

A Deep Bag-of-Features Model for Music Auto-Tagging

Lecture 2: Quantifiers and Approximation

Laboratorio di Intelligenza Artificiale e Robotica

Running Head: STUDENT CENTRIC INTEGRATED TECHNOLOGY

Cooperative evolutive concept learning: an empirical study

*Net Perceptions, Inc West 78th Street Suite 300 Minneapolis, MN

Глубокие рекуррентные нейронные сети для аспектно-ориентированного анализа тональности отзывов пользователей на различных языках

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

JONATHAN H. WRIGHT Department of Economics, Johns Hopkins University, 3400 N. Charles St., Baltimore MD (410)

Impact of Cluster Validity Measures on Performance of Hybrid Models Based on K-means and Decision Trees

Enhancing Unlexicalized Parsing Performance using a Wide Coverage Lexicon, Fuzzy Tag-set Mapping, and EM-HMM-based Lexical Probabilities

Model Ensemble for Click Prediction in Bing Search Ads

Chapters 1-5 Cumulative Assessment AP Statistics November 2008 Gillespie, Block 4

BUILDING CONTEXT-DEPENDENT DNN ACOUSTIC MODELS USING KULLBACK-LEIBLER DIVERGENCE-BASED STATE TYING

Universidade do Minho Escola de Engenharia

Evolution of Symbolisation in Chimpanzees and Neural Nets

Transcription:

Pricing illiquid assets A Deep Learning approach Oded Luria Deep Learning Meetup Dec 2015

Deep Learning in Nature (May 2015) Deep learning allows computational models that are composed of multiple processing layers to learn representations of data with multiple levels of abstraction. These methods have dramatically improved the state-of-the-art in speech recognition, visual object recognition, object detection and many other domains such as drug discovery and genomics Deep convolutional nets have brought about breakthroughs in processing images, video, speech and audio, whereas recurrent nets have shone light on sequential data such as text and speech. see also Jürgen Schmidhuber, Critique of Paper by "Deep Learning Conspiracy" (Nature 521 p 436). http://people.idsia.ch/~juergen/deep-learning-conspiracy.html

Takeuchi and Lee, 2013 Examine whether deep learning techniques can discover features in time series of stock prices that can successfully predict future returns Main idea: Autoencoder reduces inputs to 4 dimensional features Classifier outputs probabilities of two classes (returns below/above the median) Performance: Overall accuracy rate of 53% Open questions: Use separate autoencoders for different categories of features The impact of updating the weights over time. See also: Gilberto Batres-Estrada, 2015 http://cs229.stanford.edu/proj2013/takeuchilee-applyingdeeplearningtoenhancemomentumtradingstrategiesinstocks.pdf

Everyone is exploring (*from public sources) 4 http://robusttechhouse.com/list-of-funds-or-trading-firms-using-artificial-intelligence-or-machine-learning/

If you have a deep learning architecture that can identify those patterns across different time scales, you can arguably use that information to better forecast what will happen next Binatix s software doesn t just learn from static data points, but incorporates temporal signals, essentially how the information continually changes over time Nadav Ben-Efraim (left) and Itamar Arel (right) 5

Finance and Deep Learning? (*from public sources) PROS Lots of features lots of examples Complex data Versatile regimes Evolving market conditions Competitive advantage? CONS Weak spatial correlation/ parameter tying Requires transparent model Conservative industry

Pricing illiquid assets

Price Price uncertainty theoretical price curve Business Requirements: 1. Accurate pricing 2. Prediction confidence 3. Transparency trade price uncertainty increases Time 8

Experimental settings Data preparation Mixture of numerical, boolean & categorical features Filtering outliers Scaling Filling missing values Splitting categorical variables Some degree of feature engineering 9

Feature engineering? * Trends Cross-sectional information Information about other bonds? *Molecular Activity Challenge 10 http://blog.kaggle.com/2012/11/01/deep-learning-how-i-did-it-merck-1st-place-interview/

Deep Leaning aspects

Research methodology This (supervised regression) problem can be approached using a number of methods: 1. Using discriminative models (supervised learning) Use Trace spreads as target labels for classification \ regression problem 2. Using Hybrid models (combine supervised and unsupervised learning) Discrimination assisted with outcomes of generative \ unsupervised networks 3. More options? Classification \ Regression? Network depth? Open questions Type of units? Types of optimizers? Use Ensemble learning? 12

Regression Vs Classification for price prediction regression classification Direct approach Cost function: mean square error Difficult in estimation of example error* Unimodal model Ordered classes Cost function: categorical cross entropy Performance vs. resolution tradeoff Good error estimation for each example Supports multi-modal decisions 13

How deep should the network be? Shallow networks: Easier to train Could suffer from high bias error Deep networks: Superior when the data is complex have more parameters More difficult to train More prone to overfit? *units removed from figure 14

raw features Random Forests class predictions raw features Random Forests classifier class predictions Are the Deep Learning features better? Random Forests applied on raw features Random Forests applied on Deep Learning features *units removed from figure 15

Are the DL features better? 570-60 spread PCA applied on Deep Learning features 16

Probability Assessing prediction quality (classification net) Ordinal classification -> smooth probability distribution Indicates spread prediction certainty Similarity to bid-offer spread The predictive power of this index needs to be verified low confidence high error high confidence low error Spread [BPS] 17

Categorical Cross-entropy Bin Coverage [%] Training process Epoch 18

Ensemble Learning Avoids being trapped in saddle points/ bad minima Handles initialization problems Improves resolution 1 network 5 networks A very simple way to improve the performance of almost any machine learning algorithm is to train many different models on the same data and then to average their predictions (Hinton) Another way to exploit computing power to push performance i model averaging... After training them, the outputs of different networks can be averaged (text modified from Yoshua Bengio) 19

Integration into business processes

Conclusions Applying Deep Learning on Financial datasets is not trivial Use it when: Many features Many examples Data has complex relationships between variables Many regimes Transparency is not required (mostly) Think of ways to assess your error Use benchmark methods to assess the Deep Learning contribution 21

Thank you oded.luria@citi.com