FACULTY MENTOR Vasconcelos, Nuno. PROJECT TITLE Image collection with drones

Image collection with drones

The last few years have shown that the availability of large training datasets is a critical component in the design of effective image classification systems. Drones offer a new and relatively inexpensive way to collect large numbers of images of objects. We are interested in collecting datasets of objects under many views and datasets of scenes. The students will develop protocols for the use of drones in data collection and apply those protocols to the assembly of a few datasets. These will then be used to train deep learning systems for object recognition.

MS or undergraduate students; as many as apply. Candidates are expected to have basic knowledge of Python, Linux, and computer vision.

Deep learning to measure image quality

Datasets are an integral component of machine learning, and they are even more powerful if accurately labeled. We will develop an automated method for labeling drone-obtained pictures based on intuitive, human-perceivable qualities such as blurriness, brightness, contrast, noise, and over- or under-exposure. The labels associated with each image will provide a quantitative estimate of the image's characteristics, ultimately to be used in deep learning applications. Methods should be robust, flexible, automated, and scalable so that they can adequately process tens of thousands of different drone-taken images.

Candidates are expected to be adept with at least one commonly used programming language, such as C++, Java, or Python. Knowledge of Linux, OOP, computer vision, image processing, and/or machine learning is a plus, but not essential.
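As an illustration of what quantitative quality labels could look like, the sketch below computes three simple measures for a grayscale image: mean intensity for brightness, the standard deviation of intensities (RMS contrast) for contrast, and the variance of a 4-neighbour Laplacian as a sharpness score, where low values suggest blur. These particular measures, the function names, and the list-of-rows image format are assumptions chosen for illustration; the project description does not specify which measures are used.

```python
from statistics import mean, pvariance


def brightness(img):
    """Mean intensity of a grayscale image given as a list of rows."""
    return mean([p for row in img for p in row])


def rms_contrast(img):
    """Standard deviation of pixel intensities (RMS contrast)."""
    return pvariance([p for row in img for p in row]) ** 0.5


def laplacian_variance(img):
    """Variance of a 4-neighbour Laplacian over interior pixels.

    Low values suggest a blurry image; this is a common heuristic,
    not necessarily the measure used in the project.
    """
    h, w = len(img), len(img[0])
    if h < 3 or w < 3:
        raise ValueError("image must be at least 3x3")
    responses = [
        img[y - 1][x] + img[y + 1][x] + img[y][x - 1] + img[y][x + 1]
        - 4 * img[y][x]
        for y in range(1, h - 1)
        for x in range(1, w - 1)
    ]
    return pvariance(responses)


def quality_labels(img):
    """Bundle the three measures into one label dictionary (hypothetical format)."""
    return {
        "brightness": brightness(img),
        "contrast": rms_contrast(img),
        "sharpness": laplacian_variance(img),
    }
```

A uniformly gray image gets zero contrast and zero sharpness under these measures, while a high-frequency pattern such as a checkerboard gets a large sharpness score.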

Deep Learning for Object Size Estimation from Real World Images

Object size estimation from real-world images is an interesting, practical, but non-trivial problem. Our final goal is to design an algorithm that measures object size from real-world images without a reference provided in advance. The students will have two major tasks: collecting a small-scale labeled dataset and developing a weakly supervised learning algorithm for size measurement. The data can be collected by downloading labeled images from the Internet or by taking new pictures and measuring the objects within them. Using these data, a weakly supervised deep learning model will be trained to choose the best reference from an image automatically. Finally, this reference can be used to estimate object size.

Candidates are expected to have basic knowledge of mathematics and to be adept with at least one commonly used programming language, such as C++, Python, or MATLAB. Knowledge of multiple view geometry, machine learning, and computer vision is a plus.
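Once a reference object has been chosen, the final estimation step can be sketched as a simple proportion. The sketch below assumes that the reference and the target lie at roughly the same distance from the camera, so that both share one pixel-to-metric scale; that assumption, the function name, and the parameters are illustrative and not taken from the project description.

```python
def estimate_size(target_px, reference_px, reference_size):
    """Estimate a target's real-world size from a known reference.

    target_px      -- extent of the target in pixels
    reference_px   -- extent of the reference object in pixels
    reference_size -- known real-world size of the reference (any unit)

    Assumes both objects are at a similar depth, so one scale applies.
    """
    if reference_px <= 0:
        raise ValueError("reference_px must be positive")
    scale = reference_size / reference_px  # real-world units per pixel
    return target_px * scale
```

The result is expressed in the same unit as `reference_size`. When the depth assumption fails, the estimate is biased, which is one reason the choice of reference matters.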

Using synthetic data for training deep learning systems

Collecting data from the real world is very expensive. Simulated game environments, however, can supply a practically unlimited amount of synthesized data that is easy and cheap to collect. We want to explore the impact of synthesized data on real-world computer vision problems. The first step of this project is to collect a large amount of synthesized data from a simulated game engine. The next step is to train a basic model on the synthesized dataset and see how it performs in real-world computer vision tasks, e.g. object detection. We also want to explore how synthesized data can be optimally combined with real-world data to improve performance. This project aims for top-tier conference publication.

Candidates are expected to be familiar with a programming language such as C++, Python, or MATLAB, and to have strong qualitative and quantitative analysis skills. Stronger candidates will also have some knowledge of Linux, computer vision, image processing, and/or machine learning.
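One simple way to combine the two sources is to control the share of synthesized examples in each training batch. The sketch below is only an assumed baseline for such mixing, not the method of the project; `mixed_batch` and its parameters are hypothetical names.

```python
import random


def mixed_batch(real, synthetic, batch_size, synthetic_fraction, rng):
    """Draw one training batch with a fixed share of synthetic examples.

    real, synthetic    -- sequences of training examples
    synthetic_fraction -- share of the batch taken from `synthetic` (0..1)
    rng                -- a random.Random instance, for reproducibility
    """
    if not 0.0 <= synthetic_fraction <= 1.0:
        raise ValueError("synthetic_fraction must be in [0, 1]")
    n_syn = round(batch_size * synthetic_fraction)
    n_real = batch_size - n_syn
    # Sample with replacement so small real datasets can still fill a batch.
    batch = rng.choices(synthetic, k=n_syn) + rng.choices(real, k=n_real)
    rng.shuffle(batch)
    return batch


# Usage: a batch of 8 examples, one quarter of them synthetic.
rng = random.Random(0)
real = [("real", i) for i in range(5)]
synthetic = [("syn", i) for i in range(100)]
batch = mixed_batch(real, synthetic, 8, 0.25, rng)
```

Sweeping `synthetic_fraction` and comparing accuracy on real-world test data is one way to examine how much the synthesized data helps.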

Efficient Deep Learning for Drones and Smart Phones

The development of slim and accurate deep neural networks has become crucial for real-world applications, especially those deployed in embedded systems such as drones and smart phones. We are interested in building light models that make deep learning deployable in real time on drones. These models will be used to build object recognition systems. This project aims for both application and top-tier conference publication.

Candidates are expected to have basic knowledge of Python, Linux, and computer vision. FPGA skills are helpful but not required.

The role of context in object detection

The performance of object detection has improved substantially in the last few years with the introduction of deep learning systems. Contextual information extracted from scenes is useful for object detection. For example, a car does not show up on top of a tree. This project aims to characterize the relationship between contextual information and object detection performance. This involves collecting images whose objects are not easily detected by a state-of-the-art detector, training context-sensitive deep learning models, and measuring whether contextual information can help improve detection performance.

Candidates are expected to have basic knowledge of MATLAB, Python, and computer vision. Familiarity with well-known object detection frameworks, e.g. R-CNN and Faster R-CNN, is a plus.

Deep Learning for Biological Imaging

Large-scale annotated datasets are critical for learning effective classification networks. To improve the scalability of the collection process, images are typically gathered using online search engines. However, these sources can be biased with respect to characteristics such as the object's pose. In this project, we aim to validate this hypothesis by collecting a large-scale dataset of plankton species with densely sampled poses. The students will learn to operate the imaging apparatus for data collection, design protocols for analysing the resulting datasets, and train deep learning systems to understand how pose variability influences the classification performance of plankton images. This is an ongoing project in collaboration with the Scripps Institution of Oceanography.

Candidates are expected to be adept with at least one commonly used programming language, such as C/C++, Java, Python, or MATLAB. Stronger candidates will also have some knowledge of Linux, computer vision, image processing, and/or machine learning.

Multi-frame visual recognition

In recent years, the emergence of various new visual recognition algorithms has drastically changed the way computers recognize and segment objects in images. Compared to a still image, though, a short video clip consisting of a sequence of frames can potentially contain much more information for understanding the spatial relationships between object instances and scenes. We intend to apply the most recent image recognition algorithms to an input of consecutive frames and examine the margin of improvement over conventional single-frame processing. In this project, students will participate in gathering the training data, implementing a recognition algorithm, and analyzing the results.

Candidates are expected to be proficient in at least one programming language, such as Python or MATLAB, and to have basic knowledge of deep learning and computer vision. Applicants with knowledge of object detection, recognition, or tracking are preferred.
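A very simple multi-frame baseline, against which single-frame processing can be compared, is to average the per-frame class scores over a clip before choosing a label. The sketch below shows only this assumed baseline; it is not the recognition algorithm of the project, and the function names and the dictionary score format are hypothetical.

```python
def average_scores(frame_scores):
    """Average per-class scores over consecutive frames.

    frame_scores -- list of dicts mapping class name to score,
                    one dict per frame, all with the same classes.
    """
    if not frame_scores:
        raise ValueError("need at least one frame")
    n = len(frame_scores)
    return {c: sum(f[c] for f in frame_scores) / n for c in frame_scores[0]}


def predict(scores):
    """Return the class with the highest score."""
    return max(scores, key=scores.get)


# Usage: the first frame alone favours "cat", the clip as a whole favours "dog".
clip = [
    {"cat": 0.6, "dog": 0.4},
    {"cat": 0.3, "dog": 0.7},
    {"cat": 0.4, "dog": 0.6},
]
single_frame_label = predict(clip[0])
multi_frame_label = predict(average_scores(clip))
```

Comparing `single_frame_label` and `multi_frame_label` against ground truth over many clips gives one rough measure of the margin of improvement from using several frames.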

Synthesize hand gesture sequences for deep learning

Hand gesture recognition is important for human-computer interaction and communication. However, training data is scarce in this domain. We would like to build a synthesizer based on 3D gaming engines to generate hand gesture video sequences with varied backgrounds and an extensive set of gesture classes. In this project, students will learn about 3D engines and about deep learning techniques for understanding sequential data.

Candidates are expected to be familiar with Python and C++. Knowledge of graphics and machine learning is preferred.

Action prediction in videos using Convolutional Neural Networks

Recent years have seen a lot of work on accurately detecting human actions in videos, but we are still far from interpreting those actions. The next milestone for a computer vision system is to understand why the actions happened and what the agent intends to do next. We are interested in building a system that can predict an agent's future action in a video based on current and previous knowledge. The students will work on developing a deep learning system that performs this task and on validating its performance on multiple datasets.

MS students. Candidates are expected to have basic knowledge of Python, Linux, and computer vision. Experience with CNNs is expected.