Jun Zhu.

Similar documents
CSL465/603 - Machine Learning

Product Feature-based Ratings foropinionsummarization of E-Commerce Feedback Comments

Lecture 1: Machine Learning Basics

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Semi-Supervised Face Detection

Discriminative Learning of Beam-Search Heuristics for Planning

Speech Emotion Recognition Using Support Vector Machine

Lecture 1: Basic Concepts of Machine Learning

Wenguang Sun CAREER Award. National Science Foundation

Indian Institute of Technology, Kanpur

Python Machine Learning

Assignment 1: Predicting Amazon Review Ratings

POS tagging of Chinese Buddhist texts using Recurrent Neural Networks

Reducing Features to Improve Bug Prediction

Switchboard Language Model Improvement with Conversational Data from Gigaword

MULTILINGUAL INFORMATION ACCESS IN DIGITAL LIBRARY

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

arxiv: v1 [cs.lg] 15 Jun 2015

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Bug triage in open source systems: a review

Rule Learning With Negation: Issues Regarding Effectiveness

NCU IISR English-Korean and English-Chinese Named Entity Transliteration Using Different Grapheme Segmentation Approaches

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation

Mining Topic-level Opinion Influence in Microblog

CLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH

Australian Journal of Basic and Applied Sciences

Active Learning. Yingyu Liang Computer Sciences 760 Fall

arxiv: v1 [cs.lg] 3 May 2013

Rule Learning with Negation: Issues Regarding Effectiveness

ADVANCES IN DEEP NEURAL NETWORK APPROACHES TO SPEAKER RECOGNITION

Feature Selection based on Sampling and C4.5 Algorithm to Improve the Quality of Text Classification using Naïve Bayes

Introduction to CS 100 Overview of UK. CS September 2015

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

Probabilistic Latent Semantic Analysis

Chinese Language Parsing with Maximum-Entropy-Inspired Parser

A survey of multi-view machine learning

Identification of Opinion Leaders Using Text Mining Technique in Virtual Community

ScienceDirect. A Framework for Clustering Cardiac Patient s Records Using Unsupervised Learning Techniques

EGRHS Course Fair. Science & Math AP & IB Courses

A Comparison of Two Text Representations for Sentiment Analysis

Latent Semantic Analysis

Exploration. CS : Deep Reinforcement Learning Sergey Levine

Top US Tech Talent for the Top China Tech Company

Syntactic Patterns versus Word Alignment: Extracting Opinion Targets from Online Reviews

Semantic Segmentation with Histological Image Data: Cancer Cell vs. Stroma

Distant Supervised Relation Extraction with Wikipedia and Freebase

Learning From the Past with Experiment Databases

Truth Inference in Crowdsourcing: Is the Problem Solved?

Extracting and Ranking Product Features in Opinion Documents

We are strong in research and particularly noted in software engineering, information security and privacy, and humane gaming.

A Case Study: News Classification Based on Term Frequency

Human Emotion Recognition From Speech

CS 446: Machine Learning

Extracting Verb Expressions Implying Negative Opinions

Customized Question Handling in Data Removal Using CPHC

Learning Human Utility from Video Demonstrations for Deductive Planning in Robotics

As a high-quality international conference in the field

BYLINE [Heng Ji, Computer Science Department, New York University,

Using Web Searches on Important Words to Create Background Sets for LSI Classification

GRADUATE STUDENT HANDBOOK Master of Science Programs in Biostatistics

A Vector Space Approach for Aspect-Based Sentiment Analysis

Robust Speech Recognition using DNN-HMM Acoustic Model Combining Noise-aware training with Spectral Subtraction

Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks

A NOVEL SCHEME FOR SPEAKER RECOGNITION USING A PHONETICALLY-AWARE DEEP NEURAL NETWORK. Yun Lei Nicolas Scheffer Luciana Ferrer Mitchell McLaren

Universidade do Minho Escola de Engenharia

The MSR-NRC-SRI MT System for NIST Open Machine Translation 2008 Evaluation

PREDICTING SPEECH RECOGNITION CONFIDENCE USING DEEP LEARNING WITH WORD IDENTITY AND SCORE FEATURES

Xinyu Tang. Education. Research Interests. Honors and Awards. Professional Experience

arxiv: v2 [cs.cv] 30 Mar 2017

Henry Tirri* Petri Myllymgki

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

Multivariate k-nearest Neighbor Regression for Time Series data -

Eileen Bau CIE/USA-DFW 2014

Analysis of Emotion Recognition System through Speech Signal Using KNN & GMM Classifier

Generative models and adversarial training

Massachusetts Institute of Technology Tel: Massachusetts Avenue Room 32-D558 MA 02139

Semantic and Context-aware Linguistic Model for Bias Detection

Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model

Time series prediction

Netpix: A Method of Feature Selection Leading. to Accurate Sentiment-Based Classification Models

A Survey on Unsupervised Machine Learning Algorithms for Automation, Classification and Maintenance

Corrective Feedback and Persistent Learning for Information Extraction

Welcome to. ECML/PKDD 2004 Community meeting

Evaluating and Comparing Classifiers: Review, Some Recommendations and Limitations

COBRA: A Fast and Simple Method for Active Clustering with Pairwise Constraints

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

How do adults reason about their opponent? Typologies of players in a turn-taking game

CROSS-LANGUAGE INFORMATION RETRIEVAL USING PARAFAC2

Action Recognition and Video

Transfer Learning with Applications

Word Embedding Based Correlation Model for Question/Answer Matching

Integrating E-learning Environments with Computational Intelligence Assessment Agents

Unvoiced Landmark Detection for Segment-based Mandarin Continuous Speech Recognition

Word learning as Bayesian inference

TextGraphs: Graph-based algorithms for Natural Language Processing

A Simple VQA Model with a Few Tricks and Image Features from Bottom-up Attention

Transcription:

How Did I Get Here?

Who am I?

Jun Zhu 2011 ~ present Associate Professor, State Key Lab of Intelligent Technology and Systems, Department of Computer Science and Technology, Tsinghua University dcszj@mail.tsinghua.edu.cn http://www.ml-thu.net/~jun

01 ~ 05 Tsinghua, B.E 05 ~ 07 Tsinghua, M. E 05 ~ 09 Tsinghua, PhD Education Working and Visiting Experience 07 ~ 09 CMU Visiting Researcher 09 ~ 11 CMU Post-doc Fellow 10. 2 ~ 10.3 & 10 Stanford Visiting Researcher 04 ~ 07 MSRA, Research Intern on joint projects CMU. Stanford

Structured Learning Maximum Entropy Discrimination Markov Network (MaxEnDNet) a novel framework with sound theoretical guarantee; generalizes to latent factor models and non-parametric Bayesian inference. Learning Principles Classification Structured Prediction Max-Likelihood Estimation (Joint) Naïve Bayes HMMs (Math. Stat.,1966) Max-Likelihood Est. (Conditional) Logistic Regression CRFs (ICML, 2001) Max-Margin Learning Support Vector Machines Max-Margin MNs (NIPS, 2003) Max-Entropy Discrimination Learning Max-Entropy Discrimination MaxEnDNet (ICML, 2008) MaxEnDNet Theory Latent Variable Models Non-parametric Bayesian Representative Publications Theoretical guarantee (JMLR 2009, ICML 2008, ICML 2009a); Latent factor models (NIPS 2008, ICML 2009b, ICML 2010, NIPS 2010a,b, JMLR 2011, PAMI 2011); Non-parametric Bayesian (ICML 2011, 2012, NIPS, 2012).

Structured Learning Regularized (Nonparametric) Bayesian Inference Max-margin Supervised Topic Models (Zhu et al., JMLR 12; Jiang, Zhu, et al., NIPS 12) Infinite Latent SVMs (Zhu, Chen & Xing, NIPS 11) V U X Y Nonparametric Relational Models (Zhu, ICML 12) Nonparametric Matrix Factorization (Xu, Zhu, & Zhang, NIPS 12)

Structured Learning Sparse High-dimensional Learning fast algorithms for feature selection and structure learning of Markov networks; adaptive multi-task learning with rich features; sparse topical coding. Lasso JSTOR, 1996 Sparse Highdim Learning Compressed Sensing IEEE Trans. IT, 2004 Sparse Coding Nature, 1996 Learning MN Structure Multi-task Learning Sparse Topical Coding Representative Publications Structure learning of Markov networks (NIPS 2010c, SIGKDD 2009a, SIGKDD 2010); Multi-task learning (NIPS 2010d); Sparse topical coding (UAI 2011, SIGKDD 2011).

Practical Applications Statistical Web Data Mining a novel statistical modeling framework for robust web data extraction; bootstrapping for entity-relationship mining; probabilistic graphical models for social network analysis. Web Data Mining Web Information Extraction Entity Relationship Mining Social Network Analysis Representative Publications Information extraction (ICML 2005, SIGKDD 2006, SIGKDD 2007, ICML 2007, JMLR 2008, WWW 2009a); Entity relationship mining (WWW 2009b); Social network analysis (SIGKDD 2009b).

How Did I Get Here? where I = Jun Zhu

How Did I Get Here (Tianjin)? Thanks MSRA and

How Did I Get the Talk Title? The credits go to

How Did I Get My Career? Successful undergraduate research training on CPU design and hardware Confidence Persistence But, my heart leads me to AI and ML for graduate study and the career Credit: Wikibooks

How Did I Get to MSRA? A random chance for 0.5 year internship but, turn out to be >3 Years! very fruitful and enjoyable time

How Did I Get to CMU? 2007, sponsored visit by the government 2008, invited visit by CMU 2009, post-doc & project scientist with Sailing Lab

How Did I Get back to Tsinghua? Persuaded by Professor Bo Zhang to believe in the bright future Get the job offer after an interview Back to Tsinghua without looking for other places

How Did I Get the 973 Project? Probably the youngest team leader in 973 projects Thanks to my team members Special thanks to Professor Zongben Xu (Member of CAS) for not just selecting for titles

How Did I Get to the Future? Never! Grammar mistakes!

How Will I Get to the Future? Hard! The future is uncertain, my long march just starts I ll follow my heart, be confident, be persistent, and try all the best

Acknowledgements Advisor: Prof. Bo Zhang Mentors & Collaborators: Dr. Zaiqing Nie Dr. Ji-Rong Wen Dr. Lei Zhang Dr. Wei-Ying Ma (MSRA) Prof. Eric P. Xing (CMU) Prof. Li Fei-Fei (Stanford) Students: Funding: Amr Ahmed (CMU), Ning Chen (Tsinghua), Ni Lao (CMU), Seunghak Lee (CMU), Li-jia Li (Stanford), Xiaojiang Liu (USTC), Xiaolin Shi (Stanford), Hao Su (Stanford), Yuandong Tian (CMU), Matt Wytock (CMU). Aonan Zhang, Minjie Xu, Hugh Perkins Wei Li, Bei Chen, Kuan Liu.