Machine Learning. Intro to AI Bert Huang Virginia Tech

Similar documents
Lecture 1: Machine Learning Basics

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Laboratorio di Intelligenza Artificiale e Robotica

Artificial Neural Networks written examination

CSL465/603 - Machine Learning

Exploration. CS : Deep Reinforcement Learning Sergey Levine

Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA

Python Machine Learning

The Good Judgment Project: A large scale test of different methods of combining expert predictions

Laboratorio di Intelligenza Artificiale e Robotica

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Generative models and adversarial training

Australian Journal of Basic and Applied Sciences

Purdue Data Summit Communication of Big Data Analytics. New SAT Predictive Validity Case Study

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

Learning From the Past with Experiment Databases

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation

Axiom 2013 Team Description Paper

Learning Methods in Multilingual Speech Recognition

A Neural Network GUI Tested on Text-To-Phoneme Mapping

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

Semi-Supervised GMM and DNN Acoustic Model Training with Multi-system Combination and Confidence Re-calibration

arxiv: v1 [cs.lg] 15 Jun 2015

Algebra 1, Quarter 3, Unit 3.1. Line of Best Fit. Overview

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

Word Segmentation of Off-line Handwritten Documents

Active Learning. Yingyu Liang Computer Sciences 760 Fall

SINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF)

Mathematics process categories

(Sub)Gradient Descent

Lecture 1: Basic Concepts of Machine Learning

arxiv: v1 [cs.cl] 2 Apr 2017

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Getting Started with Deliberate Practice

OFFICE SUPPORT SPECIALIST Technical Diploma

Georgetown University at TREC 2017 Dynamic Domain Track

OVERVIEW OF CURRICULUM-BASED MEASUREMENT AS A GENERAL OUTCOME MEASURE

Speech Recognition at ICSI: Broadcast News and beyond

learning collegiate assessment]

Lecture 10: Reinforcement Learning

A Case Study: News Classification Based on Term Frequency

Challenges in Deep Reinforcement Learning. Sergey Levine UC Berkeley

Task Types. Duration, Work and Units Prepared by

Mining Association Rules in Student s Assessment Data

Listening and Speaking Skills of English Language of Adolescents of Government and Private Schools

Managerial Decision Making

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model

Time series prediction

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

Notes on The Sciences of the Artificial Adapted from a shorter document written for course (Deciding What to Design) 1

Robust Speech Recognition using DNN-HMM Acoustic Model Combining Noise-aware training with Spectral Subtraction

Analysis of Hybrid Soft and Hard Computing Techniques for Forex Monitoring Systems

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

AGS THE GREAT REVIEW GAME FOR PRE-ALGEBRA (CD) CORRELATED TO CALIFORNIA CONTENT STANDARDS

2017 FALL PROFESSIONAL TRAINING CALENDAR

MADERA SCIENCE FAIR 2013 Grades 4 th 6 th Project due date: Tuesday, April 9, 8:15 am Parent Night: Tuesday, April 16, 6:00 8:00 pm

Model Ensemble for Click Prediction in Bing Search Ads

MGT/MGP/MGB 261: Investment Analysis

WHEN THERE IS A mismatch between the acoustic

CHAPTER 10 UNCERTAINTY REDUCTION THEORY

Applying Fuzzy Rule-Based System on FMEA to Assess the Risks on Project-Based Software Engineering Education

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

Capitalism and Higher Education: A Failed Relationship

Deep Facial Action Unit Recognition from Partially Labeled Data

Chinese Language Parsing with Maximum-Entropy-Inspired Parser

Welcome to ACT Brain Boot Camp

Human Emotion Recognition From Speech

The lab is designed to remind you how to work with scientific data (including dealing with uncertainty) and to review experimental design.

Evolutive Neural Net Fuzzy Filtering: Basic Description

AUTOMATIC DETECTION OF PROLONGED FRICATIVE PHONEMES WITH THE HIDDEN MARKOV MODELS APPROACH 1. INTRODUCTION

Math 96: Intermediate Algebra in Context

Functional Skills Mathematics Level 2 assessment

Cal s Dinner Card Deals

Experiment Databases: Towards an Improved Experimental Methodology in Machine Learning

Changing User Attitudes to Reduce Spreadsheet Risk

Semi-Supervised Face Detection

MYCIN. The MYCIN Task

Compositional Semantics

ISFA2008U_120 A SCHEDULING REINFORCEMENT LEARNING ALGORITHM

Exposé for a Master s Thesis

CS Machine Learning

Syllabus for CHEM 4660 Introduction to Computational Chemistry Spring 2010

On the Combined Behavior of Autonomous Resource Management Agents

Circuit Simulators: A Revolutionary E-Learning Platform

Autoregressive product of multi-frame predictions can improve the accuracy of hybrid models

BUILDING CONTEXT-DEPENDENT DNN ACOUSTIC MODELS USING KULLBACK-LEIBLER DIVERGENCE-BASED STATE TYING

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

Handling Concept Drifts Using Dynamic Selection of Classifiers

A survey of multi-view machine learning

Switchboard Language Model Improvement with Conversational Data from Gigaword

IN THIS UNIT YOU LEARN HOW TO: SPEAKING 1 Work in pairs. Discuss the questions. 2 Work with a new partner. Discuss the questions.

University of Toronto Physics Practicals. University of Toronto Physics Practicals. University of Toronto Physics Practicals

TRANSFER LEARNING IN MIR: SHARING LEARNED LATENT REPRESENTATIONS FOR MUSIC AUDIO CLASSIFICATION AND SIMILARITY

Improving Conceptual Understanding of Physics with Technology

Modeling function word errors in DNN-HMM based LVCSR systems

Speech Emotion Recognition Using Support Vector Machine

Introduce yourself. Change the name out and put your information here.

Top US Tech Talent for the Top China Tech Company

Transcription:

Machine Learning Intro to AI Bert Huang Virginia Tech

Machine Learning Learning: improving with experience at some task Improve over task with respect to some performance measure based on some experience Writing computer programs that write computer programs Learning definition by Tom Mitchell

Outline Three machine learning stories/cautionary tales Deep learning definition Types of machine learning Best practices

Machine Learning Story 1 Face Detection & Recognition

What Does a Human Face Look Like?

Apple II image from wikipedia.com. Eyes added digitally.

Apple II image from wikipedia.com. Eyes added digitally.

if pixel153 > 128 & pixel154 > 128 & pixel155 > 128 & pixel156 < 64 & sqrt(pixel157) < 82 & log(pixel1132 * pixel1133) > 1. then image is a face* * (not a real face recognition program) Apple II image from wikipedia.com. Eyes added digitally.

Machine Learning Story 2 Recommender Systems

Coco Alice The Incredibles 2 Barbara Jurassic World II Cathy Black Panther

Figure from Koren, Bell, Volinksy, IEEE Computer, 2009

Applications of Recommendation Movies Books Music Medicine Education Jobs

Applications of Recommendation Movies Books Music Medicine Education Jobs

Machine Learning Story 3 Housing Markets

Wall Street in the mid-1980s turned to the quants brainy financial engineers to invent new ways to boost profits. They and their managers, though laziness and greed, built a huge financial bubble on foundations that they did not understand. It was a recipe for disaster. The journalist Felix Salmon won the American Statistical Association s Excellence in Statistical Reporting Award for 2010. We reprint his article, first published as the cover story of Wired magazine, because it brilliantly conveys complex statistical concepts ASA Excellence in Statistical Reporting Award The formula that killed Wall Street

In the years before 2008, it was hardly unth that a math wizard like David X. Li might so earn a Nobel Prize. After all, financial econom even Wall Street quants have received the in economics before, and Li s work on measuri has had more impact, more quickly, than p Nobel Prize-winning contributions to the field. A formula in statistics, misunderstood and misused, has devastated the global economy though, as dazed bankers, politicians, regulato investors survey the wreckage of the biggest fi meltdown since the Great Depression, Li is pr thankful he still has a job in finance at all. N his achievement should be dismissed. He took toriously tough nut determining correlation, seemingly disparate events are related and c 16 february2012

Pr[T A < 1, T B < 1] = φ 2 (φ 1 (F A (1)), φ 1 (F B (1)), γ) The formula that killed so many pension plans: David X. Li's Gaussian copula, as first published in 2000. Investors exploited it as a quick and fatally flawed way to assess risk. Probability Specifically, this is a joint default probability the likelihood that any two members of the pool (A and B) will both default. It s what investors are looking for, and the rest of the formula provides the answer. Copula This couples (hence the Latinate term copula) the individual probabilities associated with A and B to come up with a single number. Errors here massively increase the risk of the whole equation blowing up. Survival times The amount of time between now and when A and B can be expected to default. Li took the idea from a concept in actuarial science that charts what happens to someone s life expectancy when their spouse dies. Distribution functions The probabilities of how long A and B are likely to survive. Since these are not certainties, they can be dangerous: Small miscalculations may leave you facing much more risk than the formula indicates. Equality A dangerously precise concept, since it leaves no room for error. Clean equations help both quants and their managers forget that the real world contains a surprising amount of uncertainty, fuzziness, and precariousness. Gamma The all-powerful correlation parameter, which reduces correlation to a single constant something that should be highly improbable, if not impossible. This is the magic number that made Li s copula function irresistible.

Machine Learning Stories Face recognition Recommender systems Finance

What is deep learning? raw image input raw image input image preprocessing learnable component (neural network) edge detection another neural network object detection another neural network object identification object identification

Deep Learning Using machine learning to simultaneously train every part of the process from raw input to raw output Considered deep when compared to shallow approach of training/designing each component on its own

Types of Machine Learning Types of learning settings Supervised learning Unsupervised learning Types of learning algorithms Batch learning Online learning

Example: Digit Classification http://ufldl.stanford.edu/housenumbers/

Example: Airline Price Prediction

Example: Airline Price Prediction

Batch Supervised Learning Draw data set D = {(x 1, y 1 ), (x 2, y 2 ),..., (x n, y n )} from distribution D A Algorithm learns hypothesis h 2 H from set H of possible hypotheses A(D) =h We measure the quality of h as the expected loss: E (x,y)2d [`(y, h(x))] This quantity is known as the risk E.g., loss could be the Hamming loss `Hamming (a, b) = ( 0 if a = b 1 otherwise classification

Online Supervised Learning In step t, draw data point x from distribution D Current hypothesis h guesses the label of x Get true label from oracle O Pay penalty if h(x) is wrong (or earn reward if correct) Learning algorithm updates to new hypothesis based on this experience Does not store history

Learning Settings Supervised or unsupervised (or semi-supervised, weakly supervised, transductive ) Online or batch (or reinforcement ) Classification, regression (or structured output, clustering, dimensionality reduction )

Best Practices Try range of models with different capacity Split data into training, validation, and testing sets Measure performance on evaluation set to tune parameters Measure performance on testing set as final check

Held-out Validation

Held-out Validation Accuracy on training data Accuracy on validation data Simple 0.91 0.83 Medium 0.95 0.88 Complex 0.99 0.79 Super Complex 1.0 0.54 training data validation data

Summary Three machine learning stories One cautionary tale Deep learning definition Types of machine learning Best practices