Machine Learning Intro to AI Bert Huang Virginia Tech
Machine Learning Learning: improving with experience at some task Improve over task with respect to some performance measure based on some experience Writing computer programs that write computer programs Learning definition by Tom Mitchell
Outline Three machine learning stories/cautionary tales Deep learning definition Types of machine learning Best practices
Machine Learning Story 1 Face Detection & Recognition
What Does a Human Face Look Like?
Apple II image from wikipedia.com. Eyes added digitally.
Apple II image from wikipedia.com. Eyes added digitally.
if pixel153 > 128 & pixel154 > 128 & pixel155 > 128 & pixel156 < 64 & sqrt(pixel157) < 82 & log(pixel1132 * pixel1133) > 1. then image is a face* * (not a real face recognition program) Apple II image from wikipedia.com. Eyes added digitally.
Machine Learning Story 2 Recommender Systems
Coco Alice The Incredibles 2 Barbara Jurassic World II Cathy Black Panther
Figure from Koren, Bell, Volinksy, IEEE Computer, 2009
Applications of Recommendation Movies Books Music Medicine Education Jobs
Applications of Recommendation Movies Books Music Medicine Education Jobs
Machine Learning Story 3 Housing Markets
Wall Street in the mid-1980s turned to the quants brainy financial engineers to invent new ways to boost profits. They and their managers, though laziness and greed, built a huge financial bubble on foundations that they did not understand. It was a recipe for disaster. The journalist Felix Salmon won the American Statistical Association s Excellence in Statistical Reporting Award for 2010. We reprint his article, first published as the cover story of Wired magazine, because it brilliantly conveys complex statistical concepts ASA Excellence in Statistical Reporting Award The formula that killed Wall Street
In the years before 2008, it was hardly unth that a math wizard like David X. Li might so earn a Nobel Prize. After all, financial econom even Wall Street quants have received the in economics before, and Li s work on measuri has had more impact, more quickly, than p Nobel Prize-winning contributions to the field. A formula in statistics, misunderstood and misused, has devastated the global economy though, as dazed bankers, politicians, regulato investors survey the wreckage of the biggest fi meltdown since the Great Depression, Li is pr thankful he still has a job in finance at all. N his achievement should be dismissed. He took toriously tough nut determining correlation, seemingly disparate events are related and c 16 february2012
Pr[T A < 1, T B < 1] = φ 2 (φ 1 (F A (1)), φ 1 (F B (1)), γ) The formula that killed so many pension plans: David X. Li's Gaussian copula, as first published in 2000. Investors exploited it as a quick and fatally flawed way to assess risk. Probability Specifically, this is a joint default probability the likelihood that any two members of the pool (A and B) will both default. It s what investors are looking for, and the rest of the formula provides the answer. Copula This couples (hence the Latinate term copula) the individual probabilities associated with A and B to come up with a single number. Errors here massively increase the risk of the whole equation blowing up. Survival times The amount of time between now and when A and B can be expected to default. Li took the idea from a concept in actuarial science that charts what happens to someone s life expectancy when their spouse dies. Distribution functions The probabilities of how long A and B are likely to survive. Since these are not certainties, they can be dangerous: Small miscalculations may leave you facing much more risk than the formula indicates. Equality A dangerously precise concept, since it leaves no room for error. Clean equations help both quants and their managers forget that the real world contains a surprising amount of uncertainty, fuzziness, and precariousness. Gamma The all-powerful correlation parameter, which reduces correlation to a single constant something that should be highly improbable, if not impossible. This is the magic number that made Li s copula function irresistible.
Machine Learning Stories Face recognition Recommender systems Finance
What is deep learning? raw image input raw image input image preprocessing learnable component (neural network) edge detection another neural network object detection another neural network object identification object identification
Deep Learning Using machine learning to simultaneously train every part of the process from raw input to raw output Considered deep when compared to shallow approach of training/designing each component on its own
Types of Machine Learning Types of learning settings Supervised learning Unsupervised learning Types of learning algorithms Batch learning Online learning
Example: Digit Classification http://ufldl.stanford.edu/housenumbers/
Example: Airline Price Prediction
Example: Airline Price Prediction
Batch Supervised Learning Draw data set D = {(x 1, y 1 ), (x 2, y 2 ),..., (x n, y n )} from distribution D A Algorithm learns hypothesis h 2 H from set H of possible hypotheses A(D) =h We measure the quality of h as the expected loss: E (x,y)2d [`(y, h(x))] This quantity is known as the risk E.g., loss could be the Hamming loss `Hamming (a, b) = ( 0 if a = b 1 otherwise classification
Online Supervised Learning In step t, draw data point x from distribution D Current hypothesis h guesses the label of x Get true label from oracle O Pay penalty if h(x) is wrong (or earn reward if correct) Learning algorithm updates to new hypothesis based on this experience Does not store history
Learning Settings Supervised or unsupervised (or semi-supervised, weakly supervised, transductive ) Online or batch (or reinforcement ) Classification, regression (or structured output, clustering, dimensionality reduction )
Best Practices Try range of models with different capacity Split data into training, validation, and testing sets Measure performance on evaluation set to tune parameters Measure performance on testing set as final check
Held-out Validation
Held-out Validation Accuracy on training data Accuracy on validation data Simple 0.91 0.83 Medium 0.95 0.88 Complex 0.99 0.79 Super Complex 1.0 0.54 training data validation data
Summary Three machine learning stories One cautionary tale Deep learning definition Types of machine learning Best practices