Machine Learning Lab Course. Summer Term Organizational Meeting. lecturer: Prof. Dr. Stephan Günnemann. Data Mining and Analytics

Similar documents
CS Machine Learning

Lecture 1: Machine Learning Basics

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Lip reading: Japanese vowel recognition by tracking temporal changes of lip shape

Python Machine Learning

Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks

Experiments with SMS Translation and Stochastic Gradient Descent in Spanish Text Author Profiling

UniConnect: A Hosted Collaboration Platform for the Support of Teaching and Research in Universities

CSL465/603 - Machine Learning

Mining Association Rules in Student s Assessment Data

Generative models and adversarial training

Laboratorio di Intelligenza Artificiale e Robotica

Axiom 2013 Team Description Paper

Feature-oriented vs. Needs-oriented Product Access for Non-Expert Online Shoppers

Reducing Features to Improve Bug Prediction

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

Humboldt-Universität zu Berlin

Computerized Adaptive Psychological Testing A Personalisation Perspective

Reinforcement Learning by Comparing Immediate Reward

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

(Sub)Gradient Descent

USER ADAPTATION IN E-LEARNING ENVIRONMENTS

Research computing Results

Laboratorio di Intelligenza Artificiale e Robotica

A Case Study: News Classification Based on Term Frequency

Australian Journal of Basic and Applied Sciences

Home Access Center. Connecting Parents to Fulton County Schools

Rule Learning With Negation: Issues Regarding Effectiveness

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

Computer Organization I (Tietokoneen toiminta)

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

Truth Inference in Crowdsourcing: Is the Problem Solved?

Top US Tech Talent for the Top China Tech Company

Word Segmentation of Off-line Handwritten Documents

Deep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach

Modeling function word errors in DNN-HMM based LVCSR systems

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

Rule Learning with Negation: Issues Regarding Effectiveness

A Web Based Annotation Interface Based of Wheel of Emotions. Author: Philip Marsh. Project Supervisor: Irena Spasic. Project Moderator: Matthew Morgan

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Navigating the PhD Options in CMS

Welcome to. ECML/PKDD 2004 Community meeting

Modeling function word errors in DNN-HMM based LVCSR systems

Human Emotion Recognition From Speech

Active Learning. Yingyu Liang Computer Sciences 760 Fall

Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model

SYLLABUS- ACCOUNTING 5250: Advanced Auditing (SPRING 2017)

NCAA Eligibility Center High School Portal Instructions. Course Module

Impact of Cluster Validity Measures on Performance of Hybrid Models Based on K-means and Decision Trees

PeopleSoft Class Scheduling. The Mechanics of Schedule Build

Rule discovery in Web-based educational systems using Grammar-Based Genetic Programming

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

Software Development Plan

Intel-powered Classmate PC. SMART Response* Training Foils. Version 2.0

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Responsible Conduct of Research Workshop Series, Scientific Communications and Authorship -- October 13,

Forget catastrophic forgetting: AI that learns after deployment

GRADUATE PROGRAM Department of Materials Science and Engineering, Drexel University Graduate Advisor: Prof. Caroline Schauer, Ph.D.

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

Linking Task: Identifying authors and book titles in verbose queries

DOCTOR OF PHILOSOPHY HANDBOOK

Study in Berlin at the HTW. Study in Berlin at the HTW

Implementing a tool to Support KAOS-Beta Process Model Using EPF

Best Practices in Internet Ministry Released November 7, 2008

SARDNET: A Self-Organizing Feature Map for Sequences

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

Diploma in Library and Information Science (Part-Time) - SH220

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

THE WEB 2.0 AS A PLATFORM FOR THE ACQUISITION OF SKILLS, IMPROVE ACADEMIC PERFORMANCE AND DESIGNER CAREER PROMOTION IN THE UNIVERSITY

arxiv: v1 [cs.lg] 15 Jun 2015

George Mason University Graduate School of Education Program: Special Education

Learning From the Past with Experiment Databases

HIERARCHICAL DEEP LEARNING ARCHITECTURE FOR 10K OBJECTS CLASSIFICATION

P. Belsis, C. Sgouropoulou, K. Sfikas, G. Pantziou, C. Skourlas, J. Varnas

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

Softprop: Softmax Neural Network Backpropagation Learning

ACCOUNTING FOR MANAGERS BU-5190-OL Syllabus

Exposé for a Master s Thesis

The Karlsruhe Institute of Technology Translation Systems for the WMT 2011

Handling Concept Drifts Using Dynamic Selection of Classifiers

The Use of Statistical, Computational and Modelling Tools in Higher Learning Institutions: A Case Study of the University of Dodoma

Applications of data mining algorithms to analysis of medical data

COSI Meet the Majors Fall 17. Prof. Mitch Cherniack Undergraduate Advising Head (UAH), COSI Fall '17: Instructor COSI 29a

DISCLAIMER. Mechanical Mechanical and Aerospace Mechanical and Materials. Options for Final Year Thesis and Design Projects. David Mee Carl Reidsema

Guidelines for Project I Delivery and Assessment Department of Industrial and Mechanical Engineering Lebanese American University

Applications of memory-based natural language processing

CS177 Python Programming

Indian Institute of Technology, Kanpur

Online Marking of Essay-type Assignments

ON BEHAVIORAL PROCESS MODEL SIMILARITY MATCHING A CENTROID-BASED APPROACH

A new way to share, organize and learn from experiments

SEBUTHARGA NO. : SH/27/2017 SCOPE OF WORKS, TECHNICAL SPECIFICATIONS & REQUIREMENTS

Experiment Databases: Towards an Improved Experimental Methodology in Machine Learning

MAKING YOUR OWN ALEXA SKILL SHRIMAI PRABHUMOYE, ALAN W BLACK

ScienceDirect. A Framework for Clustering Cardiac Patient s Records Using Unsupervised Learning Techniques

The Moodle and joule 2 Teacher Toolkit

Introduction to Moodle

Transcription:

Machine Learning Lab Course Organizational Meeting lecturer: Prof. Dr. Stephan Günnemann Summer Term 2018

Team Prof. Dr. Stephan Günnemann Daniel Zügner This is a practical course (Praktikum) for Master students! Name of module: Large-Scale Machine Learning (IN2106, IN4192) website: ml-lab.in.tum.de 2

Why attend our Machine Learning lab course? 1. Get the chance to implement and apply state-of-the-art ML algorithms 2. Gain hands-on experience working on real-world data, solving real-world tasks (e.g. by working on one of the projects by our industry partners). Successful projects might even qualify for a subsequent master thesis. 3. Work on large-scale problems with the support of state-ofthe-art GPU computing resources. 3

Requirements Requirements for the lab course strong programming skills (Java, Python, C++, Java, etc.) strong knowledge in data mining/machine learning you should have passed relevant courses (the more, the better) - Mining Massive Datasets - Machine Learning - Our seminars self-motivation Additional selection criteria other relevant experience (projects in companies, experience as a HiWi) - you can send an overview of your experience to us (see end of slides) 4

Organization Groups of 3-4 students Each team will work on a different project, e.g. in cooperation with one of our industry partners or on a topic they have suggested themselves Groups are allowed (should) collaborate! exchange your experience with the other groups how do the other groups tackle certain problems? Technical aspects: each group will get exclusive access to at least one high-end GPU server with - 4x NVIDIA GPU w/ 11GB RAM - 10-core CPU - 256 GB RAM scale up your models and data! 5

Organization Weekly meetings (around 90-120 minutes) each group should briefly report their progress, open problems, and next steps Regular documentation of your work status reports and documentation (we might set up a wiki) use of a central code repository 6

Grading The grade is based on the whole semester sperformance! regular completion of documentation regular presentations/discussions during semester final presentation at the end of the semester - overview about what you have done, how did you implement it, what are the results, what went wrong, discussion of the framework, - each member of the team needs to present some parts 7

Content Techniques we might want to look at (if you know these, that's good!) Optimization (e.g. via gradients) Stochastic optimization Neural networks Learning with non-i.i.d. data (e.g. temporal data) Tasks: preprocessing classification profiling clustering/topic mining recommendation anomaly detection 8

Projects There are three types of projects in this lab course: Academic projects Industry projects Your own projects 9

Reproduction and improvement of a published model Can you spot inconsistencies in a recent publication s experimental setup? Can you even improve their results? Students can choose a recent algorithm (e.g. from ICLR 2018), and aim to reproduce and improve the results in the paper. Given the computational resources available to the students, they can even select large-scale models and evaluate the validity of the results and claims. This can also be a good way to lay the foundation of a new algorithm for a master thesis. 10

Industry project: Oktoberfest food classification Industry partner: ilass AG, maker of software for gastronomy and party tents (e.g. Oktoberfest). The project will be about detecting and classifying food items on images to be extracted from a video stream. Representative present today: Peter Vogel 11

Industry project: Automatic anonymization of faces Automatic anonymization of faces in image and video data is important to protect the privacy of people. Blurring or completely graying out parts in images where faces are detected means a loss of information since all facial features are removed. Goal: develop a method for face anonymization while preserving the most relevant facial features to still recognize basic information like emotions. 12

Industry project: Siemens Details to be announced. 13

Own projects You can submit a brief exposé of your project idea provided that: There is a considerable challenge from a machine learning perspective, e.g. non-i.i.d. data (graphs, temporal data), very noisy data, new application, You have a sufficiently large and challenging dataset at hand (e.g. from an open data platform), The project is suitable for a group of 3-4 students. 14

Own projects: exposé The exposé should contain a brief description of the problem and why it is important, a description of the dataset you plan to use a rough outline of an approach you would like to pursue If you are a group of students, only one student should fill in the exposé and add the others student ID Max, 3,000 characters Submit via online form (see end of slides) 15

Registration Registration via the matching system! Module name: Large-Scale Machine Learning (IN2106, IN4192) + fill out the application form (see next slide) 16

Your Experience Fill out our brief online form about your experience until 14.02.2018 you can provide us with a list of your experience in data mining/machine learning (courses, projects, ) please send a short overview only (bullet list); not a complete CV (optional) attach a brief exposé of your own project idea. Check ml-lab.in.tum.de for a link to the form. 17