Collecting and Analyzing Big Data

Size: px
Start display at page:

Download "Collecting and Analyzing Big Data"

Transcription

1 University of Oslo The Faculty of Social Sciences Oslo Summer School in Comparative Social Science Studies 2017 Collecting and Analyzing Big Data Lecturer: Associate Professor Neal Caren Department of Sociology The University of North Carolina, Chapel Hill USA Main disciplines: Sociology, Economics, Political Science, Big Data Dates: 31 July - 4 August 2017 Course Credits: 10 pts (ECTS) Limitation: 25 participants Objectives This course is an introduction to collecting and analyzing "big data" for social scientists. Over the last decade, the variety and types of data available to researchers have exploded. This includes not only contemporary data, such as from websites and social media platforms, but also historical data, from digitized interviews to 19th century newspapers. At the same time, analytic techniques from computer science are increasingly being used to solve social science problems. One week is not enough time to master the techniques for collecting and analyzing big data. You will, however, be able to establish the foundation for developing these skills. The course is designed as a practical overview. The emphasis each class will be on applying the specific techniques rather than on their mathematical basis. The course will provide an overview in that each lesson will introduce a new method in order to demonstrate the range methods. Combined, students will have the skills and resources to apply these methods to theoretically-relevant problems in the social sciences. University of Oslo The Faculty of Social Sciences

2 By the end of the course, it is expected that students will be able to: Collect data from the internet using web scraping and APIs. Read and write digital text files. Analyze data using supervised learning technique such as random forest models. Analyze data using unsupervised learning techniques such as topic models. Understand and apply current methods for analyzing texts. Link machine learning methods to relevant social science questions. Program in Python Course credits Students have the option of submitting a research paper in order to receive ECTS credits. These research papers (6000 to 8,000 words) should apply one or more of the techniques used in the course to a theoretically interesting research question. Papers should generally follow the format of a research article in the student's discipline, although the literature review may be concise than normal. Additionally, students must provide code, and where feasible, data, to replicate the analysis. This is to be completed within 8 weeks after the course. Requirements Students should have a Python distribution appropriate for data science. The recommended way to do this is to install Continuum's Anaconda Python distribution. It is free and available for all operating systems. Students are not expected to have any knowledge of Python. Reading list Müller, Andreas C. and Sarah Guido Introduction to Machine Learning with Python: A Guide for Data Scientists, O'Reilly Media, Inc. 392 pages. Additional readings will be made available. Page 2 of 7

3 COURSE OUTLINE Session 1: Big data, machine learning and the social sciences This lecture will unpack some of the major findings from the intersection of social science and big data. The focus will be on the specific tools and methods that were used. We will also review the major sources of data and tools currently available for data science. Müller, Andreas C. and Sarah Guido. Introduction. Chapter 1 (pages 1-24) in Introduction to Machine Learning with Python: A Guide for Data Scientists, O'Reilly Media, Inc O Neil, Cathy and Rachel Schutt. Introduction: What is Data Science Chapter 1 (pages 1-16) in Doing Data Science. O'Reilly Media, Inc. O Neil, Cathy and Rachel Schutt. Statistical Inference, Exploratory Data Analysis, and the Data Science Process Chapter 2 (pages 17-50) in Doing Data Science. O'Reilly Media, Inc. Session 2: Getting Started with Python This lecture will focus on getting students up and running with Python for social science applications. This includes both an overview of the elements of the Python data science stack (e.g. IPython/Jupyter, pandas, matplotlib, scikit-learn) but a more detailed introduction to working with Python. McKinney, Wes. Python Language Basics, IPython, and Jupyter Notebooks. Chapter 2 (pages 15-54) in Python for Data Analysis, 2nd Edition (2017). Page 3 of 7

4 Session 3: Harvesting data from the web: APIs Collecting big data is often done through web application programming interfaces, or APIs. This is a way for developers, or researchers, to access data stored on governmental or corporate servers. For example, Twitter, Facebook, and Yelp, all make some of their data available through APIs. This lecture will introduce the basics of collecting data from an API in Python. Mitchell, Ryan. Using APIs Chapter 4 (pages 49-70) in Web Scraping with Python: Collecting Data from the Modern Web, O'Reilly Media, Inc. "Chronicling America API." Session 4: Harvesting data from the web: Web scraping A second major source of big data is collecting it directly from websites. Web scraping involves visiting one more pages and collecting and storing the relevant information in an automated fashion. This lecture will introduce the basics of web scraping in Python. Mitchell, Ryan. Using APIs Chapter 4 (pages 49-70) in Web Scraping with Python: Collecting Data from the Modern Web O'Reilly Media, Inc. Mitchell, Ryan. Your First Web Scraper. Chapter 1 (pages 3-12) in Web Scraping with Python: Collecting Data from the Modern Web O'Reilly Media, Inc. Mitchell, Ryan. Advanced HTML Parsing Chapter 2 (pages 13-30) in Web Scraping with Python: Collecting Data from the Modern Web O'Reilly Media, Inc. Page 4 of 7

5 Session 5: Manipulating Big Data By most estimates, 80% of data analysis is cleaning and merging the data. This lecture introduces best practices for preparing your data in Python. Vanderplas, Jake. Data Manipulation with Pandas. Chapter 3 (pages ) in Python Data Science Handbook.O'Reilly Media, Inc. Session 6: Supervised Learning I You are likely familiar with supervised learning, but you probably don't call it that. Supervised learning in the machine language term for when you are modeling one variable as a function of another set of variables, such as linear or logistic regression. This lecture reviews common methods for regression and classifications such as linear regression and introduces more complex algorithms. Müller, Andreas C. and Sarah Guido. Supervised Learning. Chapter 2 (pages 25-69) in Introduction to Machine Learning with Python: A Guide for Data Scientists, O'Reilly Media, Inc. Session 7: Model Evaluation Keeping the data used to evaluate your model separate from the data used to develop your model is critical to the machine learning workflow. This is of particular concern when there are concerns about overfitting. This lecture introduces the idea of cross validation and reviews methods for evaluating model fit. Müller, Andreas C. and Sarah Guido. Model Evaluation and Improvement. Chapter 5 (pages ) in Introduction to Machine Learning with Python: A Guide for Data Scientists, O'Reilly Page 5 of 7

6 Session 8: Supervised Learning II This lecture extends on focus on supervised learning to techniques to include decision trees and random forest models. Müller, Andreas C. and Sarah Guido. Supervised Learning. Chapter 2 (pages ) in Introduction to Machine Learning with Python: A Guide for Data Scientists, O'Reilly Session 9: Working with Text Data This lecture introduces the basics of manipulating and analyzing text data, including counting and analyzing term frequencies for text categorization. Müller, Andreas C. and Sarah Guido. Working with Text Data. Chapter 7 (pages ) in Introduction to Machine Learning with Python: A Guide for Data Scientists, O'Reilly. Session 10: Unsupervised Learning with Text Data This lecture will introduce methods for analyzing themes in text data. The focus will be on topic modeling which involves assigning each document to one or multiple topics. Müller, Andreas C. and Sarah Guido. Working with Text Data. Chapter 7 (pages ) in Introduction to Machine Learning with Python: A Guide for Data Scientists, O'Reilly O Neil, Cathy and Rachel Schutt. Next-Generation Data Scientists, Hubris, and Ethics Chapter 16 (pages ) in Doing Data Science. O'Reilly Media, Inc. Page 6 of 7

7 The Lecturer Neal Caren is an Associate Professor of Sociology at the University of North Carolina, Chapel Hill. His research interests center on the quantitative analysis of protest and social movements. His work has been published in the American Sociological Review, Social Forces, Social Problems, and the Annual Review of Sociology. The data in many of his publications has been either scraped from the web, downloaded using APIs, or otherwise involved collected and analyzing texts. He is the author of a well-used publicly available script for converting Lexis-Nexis article downloads into a CSV file. For several years, he has run a graduate workshop on computational social science and digital data collection, has given external workshops on the topic, and has many several tutorials available online. He is also the editor of the social movements journal Mobilization. Page 7 of 7

Python Machine Learning

Python Machine Learning Python Machine Learning Unlock deeper insights into machine learning with this vital guide to cuttingedge predictive analytics Sebastian Raschka [ PUBLISHING 1 open source I community experience distilled

More information

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence COURSE DESCRIPTION This course presents computing tools and concepts for all stages

More information

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

Twitter Sentiment Classification on Sanders Data using Hybrid Approach IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 17, Issue 4, Ver. I (July Aug. 2015), PP 118-123 www.iosrjournals.org Twitter Sentiment Classification on Sanders

More information

CS Machine Learning

CS Machine Learning CS 478 - Machine Learning Projects Data Representation Basic testing and evaluation schemes CS 478 Data and Testing 1 Programming Issues l Program in any platform you want l Realize that you will be doing

More information

TIMSS ADVANCED 2015 USER GUIDE FOR THE INTERNATIONAL DATABASE. Pierre Foy

TIMSS ADVANCED 2015 USER GUIDE FOR THE INTERNATIONAL DATABASE. Pierre Foy TIMSS ADVANCED 2015 USER GUIDE FOR THE INTERNATIONAL DATABASE Pierre Foy TIMSS Advanced 2015 orks User Guide for the International Database Pierre Foy Contributors: Victoria A.S. Centurino, Kerry E. Cotter,

More information

Laboratorio di Intelligenza Artificiale e Robotica

Laboratorio di Intelligenza Artificiale e Robotica Laboratorio di Intelligenza Artificiale e Robotica A.A. 2008-2009 Outline 2 Machine Learning Unsupervised Learning Supervised Learning Reinforcement Learning Genetic Algorithms Genetics-Based Machine Learning

More information

CS 446: Machine Learning

CS 446: Machine Learning CS 446: Machine Learning Introduction to LBJava: a Learning Based Programming Language Writing classifiers Christos Christodoulopoulos Parisa Kordjamshidi Motivation 2 Motivation You still have not learnt

More information

(Sub)Gradient Descent

(Sub)Gradient Descent (Sub)Gradient Descent CMSC 422 MARINE CARPUAT marine@cs.umd.edu Figures credit: Piyush Rai Logistics Midterm is on Thursday 3/24 during class time closed book/internet/etc, one page of notes. will include

More information

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF Read Online and Download Ebook ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF Click link bellow and free register to download

More information

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur Module 12 Machine Learning 12.1 Instructional Objective The students should understand the concept of learning systems Students should learn about different aspects of a learning system Students should

More information

Tools and Techniques for Large-Scale Grading using Web-based Commercial Off-The-Shelf Software

Tools and Techniques for Large-Scale Grading using Web-based Commercial Off-The-Shelf Software Tools and Techniques for Large-Scale Grading using Web-based Commercial Off-The-Shelf Software Drexel University Programming Learning EXperience (DUPLEX) Departments of Mathematics and Computer Science

More information

Analyzing sentiments in tweets for Tesla Model 3 using SAS Enterprise Miner and SAS Sentiment Analysis Studio

Analyzing sentiments in tweets for Tesla Model 3 using SAS Enterprise Miner and SAS Sentiment Analysis Studio SCSUG Student Symposium 2016 Analyzing sentiments in tweets for Tesla Model 3 using SAS Enterprise Miner and SAS Sentiment Analysis Studio Praneth Guggilla, Tejaswi Jha, Goutam Chakraborty, Oklahoma State

More information

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler Machine Learning and Data Mining Ensembles of Learners Prof. Alexander Ihler Ensemble methods Why learn one classifier when you can learn many? Ensemble: combine many predictors (Weighted) combina

More information

Platform for the Development of Accessible Vocational Training

Platform for the Development of Accessible Vocational Training Platform for the Development of Accessible Vocational Training Executive Summary January/2013 Acknowledgment Supported by: FINEP Contract 03.11.0371.00 SEL PUB MCT/FINEP/FNDCT/SUBV ECONOMICA A INOVACAO

More information

MASTER OF SCIENCE (M.S.) MAJOR IN COMPUTER SCIENCE

MASTER OF SCIENCE (M.S.) MAJOR IN COMPUTER SCIENCE Master of Science (M.S.) Major in Computer Science 1 MASTER OF SCIENCE (M.S.) MAJOR IN COMPUTER SCIENCE Major Program The programs in computer science are designed to prepare students for doctoral research,

More information

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, 2013 10.12753/2066-026X-13-154 DATA MINING SOLUTIONS FOR DETERMINING STUDENT'S PROFILE Adela BÂRA,

More information

Computational Data Analysis Techniques In Economics And Finance

Computational Data Analysis Techniques In Economics And Finance Computational Data Analysis Techniques In Economics And Finance If searched for a ebook Computational Data Analysis Techniques in Economics and Finance in pdf format, in that case you come on to correct

More information

Introduction, Organization Overview of NLP, Main Issues

Introduction, Organization Overview of NLP, Main Issues HG2051 Language and the Computer Computational Linguistics with Python Introduction, Organization Overview of NLP, Main Issues Francis Bond Division of Linguistics and Multilingual Studies http://www3.ntu.edu.sg/home/fcbond/

More information

DIGITAL GAMING & INTERACTIVE MEDIA BACHELOR S DEGREE. Junior Year. Summer (Bridge Quarter) Fall Winter Spring GAME Credits.

DIGITAL GAMING & INTERACTIVE MEDIA BACHELOR S DEGREE. Junior Year. Summer (Bridge Quarter) Fall Winter Spring GAME Credits. DIGITAL GAMING & INTERACTIVE MEDIA BACHELOR S DEGREE Sample 2-Year Academic Plan DRAFT Junior Year Summer (Bridge Quarter) Fall Winter Spring MMDP/GAME 124 GAME 310 GAME 318 GAME 330 Introduction to Maya

More information

GRADUATE STUDENT HANDBOOK Master of Science Programs in Biostatistics

GRADUATE STUDENT HANDBOOK Master of Science Programs in Biostatistics 2017-2018 GRADUATE STUDENT HANDBOOK Master of Science Programs in Biostatistics Entrance requirements, program descriptions, degree requirements and other program policies for Biostatistics Master s Programs

More information

Spring 2014 SYLLABUS Michigan State University STT 430: Probability and Statistics for Engineering

Spring 2014 SYLLABUS Michigan State University STT 430: Probability and Statistics for Engineering Spring 2014 SYLLABUS Michigan State University STT 430: Probability and Statistics for Engineering Time and Place: MW 3:00-4:20pm, A126 Wells Hall Instructor: Dr. Marianne Huebner Office: A-432 Wells Hall

More information

Computerized Adaptive Psychological Testing A Personalisation Perspective

Computerized Adaptive Psychological Testing A Personalisation Perspective Psychology and the internet: An European Perspective Computerized Adaptive Psychological Testing A Personalisation Perspective Mykola Pechenizkiy mpechen@cc.jyu.fi Introduction Mixed Model of IRT and ES

More information

Lecture 1: Machine Learning Basics

Lecture 1: Machine Learning Basics 1/69 Lecture 1: Machine Learning Basics Ali Harakeh University of Waterloo WAVE Lab ali.harakeh@uwaterloo.ca May 1, 2017 2/69 Overview 1 Learning Algorithms 2 Capacity, Overfitting, and Underfitting 3

More information

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE Mingon Kang, PhD Computer Science, Kennesaw State University Self Introduction Mingon Kang, PhD Homepage: http://ksuweb.kennesaw.edu/~mkang9

More information

Web as Corpus. Corpus Linguistics. Web as Corpus 1 / 1. Corpus Linguistics. Web as Corpus. web.pl 3 / 1. Sketch Engine. Corpus Linguistics

Web as Corpus. Corpus Linguistics. Web as Corpus 1 / 1. Corpus Linguistics. Web as Corpus. web.pl 3 / 1. Sketch Engine. Corpus Linguistics (L615) Markus Dickinson Department of Linguistics, Indiana University Spring 2013 The web provides new opportunities for gathering data Viable source of disposable corpora, built ad hoc for specific purposes

More information

STA 225: Introductory Statistics (CT)

STA 225: Introductory Statistics (CT) Marshall University College of Science Mathematics Department STA 225: Introductory Statistics (CT) Course catalog description A critical thinking course in applied statistical reasoning covering basic

More information

Laboratorio di Intelligenza Artificiale e Robotica

Laboratorio di Intelligenza Artificiale e Robotica Laboratorio di Intelligenza Artificiale e Robotica A.A. 2008-2009 Outline 2 Machine Learning Unsupervised Learning Supervised Learning Reinforcement Learning Genetic Algorithms Genetics-Based Machine Learning

More information

Academic Catalog Programs & Courses Manchester Community College

Academic Catalog Programs & Courses Manchester Community College 2016 2017 Academic Catalog Programs & Courses Manchester Community College 1 Accounting and Business Administration Transfer, A.S. Program Design The Accounting and Business Administration Transfer associate

More information

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition Todd Holloway Two Lecture Series for B551 November 20 & 27, 2007 Indiana University Outline Introduction Bias and

More information

USC MARSHALL SCHOOL OF BUSINESS

USC MARSHALL SCHOOL OF BUSINESS USC MARSHALL SCHOOL OF BUSINESS SUPPLY CHAIN MANAGEMENT IOM 482 Fall 2013 INSTRUCTOR OFFICE HOURS Professor Murat Bayiz Bridge Hall, Room 401G Phone: (213) 740 5618 E-mail: murat.bayiz@marshall.usc.edu

More information

Introduction to Simulation

Introduction to Simulation Introduction to Simulation Spring 2010 Dr. Louis Luangkesorn University of Pittsburgh January 19, 2010 Dr. Louis Luangkesorn ( University of Pittsburgh ) Introduction to Simulation January 19, 2010 1 /

More information

Learning From the Past with Experiment Databases

Learning From the Past with Experiment Databases Learning From the Past with Experiment Databases Joaquin Vanschoren 1, Bernhard Pfahringer 2, and Geoff Holmes 2 1 Computer Science Dept., K.U.Leuven, Leuven, Belgium 2 Computer Science Dept., University

More information

Sociology 521: Social Statistics and Quantitative Methods I Spring Wed. 2 5, Kap 305 Computer Lab. Course Website

Sociology 521: Social Statistics and Quantitative Methods I Spring Wed. 2 5, Kap 305 Computer Lab. Course Website Sociology 521: Social Statistics and Quantitative Methods I Spring 2012 Wed. 2 5, Kap 305 Computer Lab Instructor: Tim Biblarz Office hours (Kap 352): W, 5 6pm, F, 10 11, and by appointment (213) 740 3547;

More information

Lectora a Complete elearning Solution

Lectora a Complete elearning Solution Lectora a Complete elearning Solution Irina Ioniţă 1, Liviu Ioniţă 1 (1) University Petroleum-Gas of Ploiesti, Department of Information Technology, Mathematics, Physics, Bd. Bucuresti, No.39, 100680,

More information

Probability and Statistics Curriculum Pacing Guide

Probability and Statistics Curriculum Pacing Guide Unit 1 Terms PS.SPMJ.3 PS.SPMJ.5 Plan and conduct a survey to answer a statistical question. Recognize how the plan addresses sampling technique, randomization, measurement of experimental error and methods

More information

PHD COURSE INTERMEDIATE STATISTICS USING SPSS, 2018

PHD COURSE INTERMEDIATE STATISTICS USING SPSS, 2018 1 PHD COURSE INTERMEDIATE STATISTICS USING SPSS, 2018 Department Of Psychology and Behavioural Sciences AARHUS UNIVERSITY Course coordinator: Anne Scharling Rasmussen Lectures: Ali Amidi (AA), Kaare Bro

More information

GLBL 210: Global Issues

GLBL 210: Global Issues GLBL 210: Global Issues This syllabus includes the following sections: Course Overview Required Texts Course Requirements Academic Policies Course Outline COURSE OVERVIEW Over the last two decades, there

More information

Enter the World of Polling, Survey &

Enter the World of Polling, Survey & Enter the World of Polling, Survey & Mobile Enter the World of MOBILE LEARNING INNOVATION CONTENTS Page 1. Introduction to I.C.O. Europe 3 2. What type of Learning produces the greatest effect? 4-6 3.

More information

Statistics and Data Analytics Minor

Statistics and Data Analytics Minor October 28, 2014 Page 1 of 6 PROGRAM IDENTIFICATION NAME OF THE MINOR Statistics and Data Analytics ACADEMIC PROGRAM PROPOSING THE MINOR Mathematics PROGRAM DESCRIPTION DESCRIPTION OF THE MINOR AND STUDENT

More information

THE DEPARTMENT OF DEFENSE HIGH LEVEL ARCHITECTURE. Richard M. Fujimoto

THE DEPARTMENT OF DEFENSE HIGH LEVEL ARCHITECTURE. Richard M. Fujimoto THE DEPARTMENT OF DEFENSE HIGH LEVEL ARCHITECTURE Judith S. Dahmann Defense Modeling and Simulation Office 1901 North Beauregard Street Alexandria, VA 22311, U.S.A. Richard M. Fujimoto College of Computing

More information

Online Marking of Essay-type Assignments

Online Marking of Essay-type Assignments Online Marking of Essay-type Assignments Eva Heinrich, Yuanzhi Wang Institute of Information Sciences and Technology Massey University Palmerston North, New Zealand E.Heinrich@massey.ac.nz, yuanzhi_wang@yahoo.com

More information

Radius STEM Readiness TM

Radius STEM Readiness TM Curriculum Guide Radius STEM Readiness TM While today s teens are surrounded by technology, we face a stark and imminent shortage of graduates pursuing careers in Science, Technology, Engineering, and

More information

CS 101 Computer Science I Fall Instructor Muller. Syllabus

CS 101 Computer Science I Fall Instructor Muller. Syllabus CS 101 Computer Science I Fall 2013 Instructor Muller Syllabus Welcome to CS101. This course is an introduction to the art and science of computer programming and to some of the fundamental concepts of

More information

Bluetooth mlearning Applications for the Classroom of the Future

Bluetooth mlearning Applications for the Classroom of the Future Bluetooth mlearning Applications for the Classroom of the Future Tracey J. Mehigan, Daniel C. Doolan, Sabin Tabirca Department of Computer Science, University College Cork, College Road, Cork, Ireland

More information

CS 100: Principles of Computing

CS 100: Principles of Computing CS 100: Principles of Computing Kevin Molloy August 29, 2017 1 Basic Course Information 1.1 Prerequisites: None 1.2 General Education Fulfills Mason Core requirement in Information Technology (ALL). 1.3

More information

FAQ (Frequently Asked Questions)

FAQ (Frequently Asked Questions) FAQ (Frequently Asked Questions) Q. How can we contact the DIGITAL EDUCATION PROJECT and the NATIONAL DIGITAL SCHOOLBOOK LIBRARY PROGRAM for additional information and questions? A. VISIT OUR WEBSITE at

More information

Lecture 1: Basic Concepts of Machine Learning

Lecture 1: Basic Concepts of Machine Learning Lecture 1: Basic Concepts of Machine Learning Cognitive Systems - Machine Learning Ute Schmid (lecture) Johannes Rabold (practice) Based on slides prepared March 2005 by Maximilian Röglinger, updated 2010

More information

Learning Microsoft Publisher , (Weixel et al)

Learning Microsoft Publisher , (Weixel et al) Prentice Hall Learning Microsoft Publisher 2007 2008, (Weixel et al) C O R R E L A T E D T O Mississippi Curriculum Framework for Business and Computer Technology I and II BUSINESS AND COMPUTER TECHNOLOGY

More information

Knowledge based expert systems D H A N A N J A Y K A L B A N D E

Knowledge based expert systems D H A N A N J A Y K A L B A N D E Knowledge based expert systems D H A N A N J A Y K A L B A N D E What is a knowledge based system? A Knowledge Based System or a KBS is a computer program that uses artificial intelligence to solve problems

More information

CSL465/603 - Machine Learning

CSL465/603 - Machine Learning CSL465/603 - Machine Learning Fall 2016 Narayanan C Krishnan ckn@iitrpr.ac.in Introduction CSL465/603 - Machine Learning 1 Administrative Trivia Course Structure 3-0-2 Lecture Timings Monday 9.55-10.45am

More information

DOUBLE DEGREE PROGRAM AT EURECOM. June 2017 Caroline HANRAS International Relations Manager

DOUBLE DEGREE PROGRAM AT EURECOM. June 2017 Caroline HANRAS International Relations Manager DOUBLE DEGREE PROGRAM AT EURECOM June 2017 Caroline HANRAS International Relations Manager KEY FACTS 1991 Creation by EPFL and Telecom ParisTech 3 Main Fields of Expertise 300 23 Master Students Professors

More information

Universidade do Minho Escola de Engenharia

Universidade do Minho Escola de Engenharia Universidade do Minho Escola de Engenharia Universidade do Minho Escola de Engenharia Dissertação de Mestrado Knowledge Discovery is the nontrivial extraction of implicit, previously unknown, and potentially

More information

Mining Association Rules in Student s Assessment Data

Mining Association Rules in Student s Assessment Data www.ijcsi.org 211 Mining Association Rules in Student s Assessment Data Dr. Varun Kumar 1, Anupama Chadha 2 1 Department of Computer Science and Engineering, MVN University Palwal, Haryana, India 2 Anupama

More information

From understanding perspectives to informing public policy the potential and challenges for Q findings to inform survey design

From understanding perspectives to informing public policy the potential and challenges for Q findings to inform survey design Rachel Baker From understanding perspectives to informing public policy the potential and challenges for Q findings to inform survey design Organised session: Neil McHugh, Job van Exel Session outline

More information

Strategy and Design of ICT Services

Strategy and Design of ICT Services Strategy and Design of IT Services T eaching P lan Telecommunications Engineering Strategy and Design of ICT Services Teaching guide Activity Plan Academic year: 2011/12 Term: 3 Project Name: Strategy

More information

Len Lundstrum, Ph.D., FRM

Len Lundstrum, Ph.D., FRM , Ph.D., FRM Professor of Finance Department of Finance College of Business Office: 815 753-0317 Northern Illinois University Fax: 815 753-0504 Dekalb, IL 60115 llundstrum@niu.edu Education Indiana University

More information

Capturing and Organizing Prior Student Learning with the OCW Backpack

Capturing and Organizing Prior Student Learning with the OCW Backpack Capturing and Organizing Prior Student Learning with the OCW Backpack Brian Ouellette,* Elena Gitin,** Justin Prost,*** Peter Smith**** * Vice President, KNEXT, Kaplan University Group ** Senior Research

More information

Degree Qualification Profiles Intellectual Skills

Degree Qualification Profiles Intellectual Skills Degree Qualification Profiles Intellectual Skills Intellectual Skills: These are cross-cutting skills that should transcend disciplinary boundaries. Students need all of these Intellectual Skills to acquire

More information

Vorlesung Advanced Topics in HCI (Mensch-Maschine-Interaktion 2)

Vorlesung Advanced Topics in HCI (Mensch-Maschine-Interaktion 2) Vorlesung Advanced Topics in HCI (Mensch-Maschine-Interaktion 2) Ludwig-Maximilians-Universität München LFE Medieninformatik Albrecht Schmidt & Andreas Butz WS2003/2004 http://www.medien.informatik.uni-muenchen.de/

More information

EdX Learner s Guide. Release

EdX Learner s Guide. Release EdX Learner s Guide Release Nov 18, 2017 Contents 1 Welcome! 1 1.1 Learning in a MOOC........................................... 1 1.2 If You Have Questions As You Take a Course..............................

More information

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining Dave Donnellan, School of Computer Applications Dublin City University Dublin 9 Ireland daviddonnellan@eircom.net Claus Pahl

More information

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining Dave Donnellan, School of Computer Applications Dublin City University Dublin 9 Ireland daviddonnellan@eircom.net Claus Pahl

More information

ATENEA UPC AND THE NEW "Activity Stream" or "WALL" FEATURE Jesus Alcober 1, Oriol Sánchez 2, Javier Otero 3, Ramon Martí 4

ATENEA UPC AND THE NEW Activity Stream or WALL FEATURE Jesus Alcober 1, Oriol Sánchez 2, Javier Otero 3, Ramon Martí 4 ATENEA UPC AND THE NEW "Activity Stream" or "WALL" FEATURE Jesus Alcober 1, Oriol Sánchez 2, Javier Otero 3, Ramon Martí 4 1 Universitat Politècnica de Catalunya (Spain) 2 UPCnet (Spain) 3 UPCnet (Spain)

More information

Probabilistic Latent Semantic Analysis

Probabilistic Latent Semantic Analysis Probabilistic Latent Semantic Analysis Thomas Hofmann Presentation by Ioannis Pavlopoulos & Andreas Damianou for the course of Data Mining & Exploration 1 Outline Latent Semantic Analysis o Need o Overview

More information

A Case-Based Approach To Imitation Learning in Robotic Agents

A Case-Based Approach To Imitation Learning in Robotic Agents A Case-Based Approach To Imitation Learning in Robotic Agents Tesca Fitzgerald, Ashok Goel School of Interactive Computing Georgia Institute of Technology, Atlanta, GA 30332, USA {tesca.fitzgerald,goel}@cc.gatech.edu

More information

Using Web Searches on Important Words to Create Background Sets for LSI Classification

Using Web Searches on Important Words to Create Background Sets for LSI Classification Using Web Searches on Important Words to Create Background Sets for LSI Classification Sarah Zelikovitz and Marina Kogan College of Staten Island of CUNY 2800 Victory Blvd Staten Island, NY 11314 Abstract

More information

Trip to the beach essay >>>CLICK HERE<<<

Trip to the beach essay >>>CLICK HERE<<< Trip to the beach essay >>>CLICK HERE

More information

University of Groningen. Systemen, planning, netwerken Bosman, Aart

University of Groningen. Systemen, planning, netwerken Bosman, Aart University of Groningen Systemen, planning, netwerken Bosman, Aart IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from it. Please check the document

More information

Sociology. M.A. Sociology. About the Program. Academic Regulations. M.A. Sociology with Concentration in Quantitative Methodology.

Sociology. M.A. Sociology. About the Program. Academic Regulations. M.A. Sociology with Concentration in Quantitative Methodology. Sociology M.A. Sociology M.A. Sociology with Concentration in Quantitative Methodology M.A. Sociology with Specialization in African M.A. Sociology with Specialization in Digital Humanities Ph.D. Sociology

More information

We are strong in research and particularly noted in software engineering, information security and privacy, and humane gaming.

We are strong in research and particularly noted in software engineering, information security and privacy, and humane gaming. Computer Science 1 COMPUTER SCIENCE Office: Department of Computer Science, ECS, Suite 379 Mail Code: 2155 E Wesley Avenue, Denver, CO 80208 Phone: 303-871-2458 Email: info@cs.du.edu Web Site: Computer

More information

Developing True/False Test Sheet Generating System with Diagnosing Basic Cognitive Ability

Developing True/False Test Sheet Generating System with Diagnosing Basic Cognitive Ability Developing True/False Test Sheet Generating System with Diagnosing Basic Cognitive Ability Shih-Bin Chen Dept. of Information and Computer Engineering, Chung-Yuan Christian University Chung-Li, Taiwan

More information

Ryerson University Sociology SOC 483: Advanced Research and Statistics

Ryerson University Sociology SOC 483: Advanced Research and Statistics Ryerson University Sociology SOC 483: Advanced Research and Statistics Prerequisites: SOC 481 Instructor: Paul S. Moore E-mail: psmoore@ryerson.ca Office: Sociology Department Jorgenson JOR 306 Phone:

More information

Ricopili: Postimputation Module. WCPG Education Day Stephan Ripke / Raymond Walters Toronto, October 2015

Ricopili: Postimputation Module. WCPG Education Day Stephan Ripke / Raymond Walters Toronto, October 2015 Ricopili: Postimputation Module WCPG Education Day Stephan Ripke / Raymond Walters Toronto, October 2015 Ricopili Overview Ricopili Overview postimputation, 12 steps 1) Association analysis 2) Meta analysis

More information

Courses in English. Application Development Technology. Artificial Intelligence. 2017/18 Spring Semester. Database access

Courses in English. Application Development Technology. Artificial Intelligence. 2017/18 Spring Semester. Database access The courses availability depends on the minimum number of registered students (5). If the course couldn t start, students can still complete it in the form of project work and regular consultations with

More information

From Social to Scholarly and Back Again

From Social to Scholarly and Back Again Portland State University PDXScholar Library Faculty Publications and Presentations University Library 2-24-2015 From Social to Scholarly and Back Again Jill Emery Portland State University, jemery@pdx.edu

More information

On-Line Data Analytics

On-Line Data Analytics International Journal of Computer Applications in Engineering Sciences [VOL I, ISSUE III, SEPTEMBER 2011] [ISSN: 2231-4946] On-Line Data Analytics Yugandhar Vemulapalli #, Devarapalli Raghu *, Raja Jacob

More information

Teaching Reproducible Research Inspiring New Researchers to Do More Robust and Reliable Science

Teaching Reproducible Research Inspiring New Researchers to Do More Robust and Reliable Science Transcript for 11/16 Webinar Note the transcript has been only partially checked for accuracy so please see recording: http://magazine.amstat.org/videos/education_webinars/reproducibleresearch.mp4 Teaching

More information

Automating Outcome Based Assessment

Automating Outcome Based Assessment Automating Outcome Based Assessment Suseel K Pallapu Graduate Student Department of Computing Studies Arizona State University Polytechnic (East) 01 480 449 3861 harryk@asu.edu ABSTRACT In the last decade,

More information

USER ADAPTATION IN E-LEARNING ENVIRONMENTS

USER ADAPTATION IN E-LEARNING ENVIRONMENTS USER ADAPTATION IN E-LEARNING ENVIRONMENTS Paraskevi Tzouveli Image, Video and Multimedia Systems Laboratory School of Electrical and Computer Engineering National Technical University of Athens tpar@image.

More information

Purdue Data Summit Communication of Big Data Analytics. New SAT Predictive Validity Case Study

Purdue Data Summit Communication of Big Data Analytics. New SAT Predictive Validity Case Study Purdue Data Summit 2017 Communication of Big Data Analytics New SAT Predictive Validity Case Study Paul M. Johnson, Ed.D. Associate Vice President for Enrollment Management, Research & Enrollment Information

More information

A faculty approach -learning tools. Audio Tools Tutorial and Presentation software Video Tools Authoring tools

A faculty approach -learning tools. Audio Tools Tutorial and Presentation software Video Tools Authoring tools A faculty approach -learning tools Audio Tools Tutorial and Presentation software Video Tools Authoring tools Quizz tools Powerpoint 2 Flash Content tools Web 2.0 tools RUFO Project Work visit at Paris

More information

SELECCIÓN DE CURSOS CAMPUS CIUDAD DE MÉXICO. Instructions for Course Selection

SELECCIÓN DE CURSOS CAMPUS CIUDAD DE MÉXICO. Instructions for Course Selection Instructions for Course Selection INSTRUCTIONS FOR COURSE SELECTION 1. Open the following link: https://prd28pi01.itesm.mx/recepcion/studyinmexico?ln=en 2. Click on the buttom: continue 3. Choose your

More information

Curriculum for the Bachelor Programme in Digital Media and Design at the IT University of Copenhagen

Curriculum for the Bachelor Programme in Digital Media and Design at the IT University of Copenhagen Curriculum for the Bachelor Programme in Digital Media and Design at the IT University of Copenhagen The curriculum of 1 August 2009 Revised on 17 March 2011 Revised on 20 December 2012 Revised on 19 August

More information

EDUC 998 The Doctoral Dissertation Proposal Summer 2004

EDUC 998 The Doctoral Dissertation Proposal Summer 2004 EDUC 998 The Doctoral Dissertation Proposal Summer 2004 Instructor: Joe Maxwell Office: Robinson A-353D. Office hours: by appointment Phone: 993-2119 Email: jmaxwell@gmu.edu Class meeting: Mon/Wed 4:30-7:10

More information

Indian Institute of Technology, Kanpur

Indian Institute of Technology, Kanpur Indian Institute of Technology, Kanpur Course Project - CS671A POS Tagging of Code Mixed Text Ayushman Sisodiya (12188) {ayushmn@iitk.ac.in} Donthu Vamsi Krishna (15111016) {vamsi@iitk.ac.in} Sandeep Kumar

More information

Impact of Cluster Validity Measures on Performance of Hybrid Models Based on K-means and Decision Trees

Impact of Cluster Validity Measures on Performance of Hybrid Models Based on K-means and Decision Trees Impact of Cluster Validity Measures on Performance of Hybrid Models Based on K-means and Decision Trees Mariusz Łapczy ski 1 and Bartłomiej Jefma ski 2 1 The Chair of Market Analysis and Marketing Research,

More information

The Enterprise Knowledge Portal: The Concept

The Enterprise Knowledge Portal: The Concept The Enterprise Knowledge Portal: The Concept Executive Information Systems, Inc. www.dkms.com eisai@home.com (703) 461-8823 (o) 1 A Beginning Where is the life we have lost in living! Where is the wisdom

More information

GETTING POSITIVE NEWS COVERAGE

GETTING POSITIVE NEWS COVERAGE IBTTA 2015 WEBINAR SERIES 201 Media Training GETTING POSITIVE NEWS COVERAGE This session was created by Singer Communications for IBTTA for internal IBTTA use only Why PR? Increases visibility Gets your

More information

Visual Journalism J3220 Syllabus

Visual Journalism J3220 Syllabus Visual Journalism J3220 Syllabus Section: 15CB Semester: Fall 2013 Class meeting time: Tuesday and Thursday from 4:05-6 p.m., Matherly 107 Instructor: Andrea Hall Email: andreaehall@ufl.edu Phone number:??

More information

Self Study Report Computer Science

Self Study Report Computer Science Computer Science undergraduate students have access to undergraduate teaching, and general computing facilities in three buildings. Two large classrooms are housed in the Davis Centre, which hold about

More information

SAT Results December, 2002 Authors: Chuck Dulaney and Roger Regan WCPSS SAT Scores Reach Historic High

SAT Results December, 2002 Authors: Chuck Dulaney and Roger Regan WCPSS SAT Scores Reach Historic High ABOUT THE SAT 2001-2002 SAT Results December, 2002 Authors: Chuck Dulaney and Roger Regan WCPSS SAT Scores Reach Historic High The Scholastic Assessment Test (SAT), more formally known as the SAT I: Reasoning

More information

Career Preparation for English Majors Department of English The Ohio State University

Career Preparation for English Majors Department of English The Ohio State University Course Development Note: At the request of Debra Moddelmog, Chair of the Ohio State Department of English, Ruth Friedman, the department s Career/Internship Advisor, developed the following course syllabus

More information

"On-board training tools for long term missions" Experiment Overview. 1. Abstract:

On-board training tools for long term missions Experiment Overview. 1. Abstract: "On-board training tools for long term missions" Experiment Overview 1. Abstract 2. Keywords 3. Introduction 4. Technical Equipment 5. Experimental Procedure 6. References Principal Investigators: BTE:

More information

INNOVATIONS IN TEACHING Using Interactive Digital Images of Products to Teach Pharmaceutics

INNOVATIONS IN TEACHING Using Interactive Digital Images of Products to Teach Pharmaceutics INNOVATIONS IN TEACHING Using Interactive Digital Images of Products to Teach Pharmaceutics Laura Moore Fox, PhD, Khang H. Pham, PharmD,* and Michael Dollar, BS y South Carolina College of Pharmacy Objective.

More information

McGraw-Hill Education Preparation For The GED Test 2nd Edition By McGraw-Hill Education Editors

McGraw-Hill Education Preparation For The GED Test 2nd Edition By McGraw-Hill Education Editors McGraw-Hill Education Preparation For The GED Test 2nd Edition By McGraw-Hill Education Editors If looking for the ebook McGraw-Hill Education Preparation for the GED Test 2nd Edition by McGraw-Hill Education

More information

Hongyan Ma. University of California, Los Angeles

Hongyan Ma. University of California, Los Angeles SUMMARY, 300 Young Drive North, Mailbox 951520, hym@ucla.eduhttp://polaris.gseis.ucla.edu/hma/ Objective is a faculty position in library and information science devoted to research and teaching Research

More information

Ecole Polytechnique Fédérale de Lausanne EPFL School of Computer and Communication Sciences IC. School of Computer and Communication Sciences

Ecole Polytechnique Fédérale de Lausanne EPFL School of Computer and Communication Sciences IC. School of Computer and Communication Sciences Ecole Polytechnique Fédérale de Lausanne EPFL School of Computer and Communication Sciences IC 1 WELCOME to the Master programs in Computer Science, Data Science and Communication Systems 2 TODAY S SPEAKERS

More information

Agents and environments. Intelligent Agents. Reminders. Vacuum-cleaner world. Outline. A vacuum-cleaner agent. Chapter 2 Actuators

Agents and environments. Intelligent Agents. Reminders. Vacuum-cleaner world. Outline. A vacuum-cleaner agent. Chapter 2 Actuators s and environments Percepts Intelligent s? Chapter 2 Actions s include humans, robots, softbots, thermostats, etc. The agent function maps from percept histories to actions: f : P A The agent program runs

More information

Education for an Information Age

Education for an Information Age Education for an Information Age Teaching in the Computerized Classroom 7th Edition by Bernard John Poole, MSIS University of Pittsburgh at Johnstown Johnstown, PA, USA and Elizabeth Sky-McIlvain, MLS

More information

Chinese Language Parsing with Maximum-Entropy-Inspired Parser

Chinese Language Parsing with Maximum-Entropy-Inspired Parser Chinese Language Parsing with Maximum-Entropy-Inspired Parser Heng Lian Brown University Abstract The Chinese language has many special characteristics that make parsing difficult. The performance of state-of-the-art

More information