A New Approach to Recommender Systems
|
|
- Jewel Cook
- 5 years ago
- Views:
Transcription
1 Li Haochong Shen Abstract In this project, we seek out a new way to build a recommendation system. We chose the problem of matching professor with ones research interest as a testing ground. Specifically, we gathered all publications of Stanford artificial intelligence professors and used supervised learning to train our model. We also used unsupervised learning to generate baseline reference to verify our model output in real use cases. 1. Introduction Recommender systems have been widely adopted everywhere in our life. News websites push news to you in relevant topics you selected. Streaming websites recommend new movies and TV series using on your demographic information and past viewing history. Online shopping websites hint items you are might be interested in purchasing based on your shopping habits. Most prominently, online advertising personalize advertisements for you using various heuristics. Nevertheless, there still are a lot of areas that could use the help of a recommender system where no suitable ones are implemented yet. Also, most of the work on recommender system, aside from improving quality and accuracy, focuses on cold start, scalability etc. In this project, we focus on a specific problem which represents a larger set of problems, the problem of matching ones research interest with professors. The problem is very different from other rec-. Correspondence to: Li <wl336@stanford.edu>, Haochong Shen <haochong@stanford.edu>. Project Report of CS229 Marchine Learning, Fall Copyright 2017 by the author(s). ommender problems in several ways. First, choosing a professor to pursue an advanced degree is usually a one-time decision. Secondly, the cost of wrong choice is very high. Thirdly, due to nature of research, each professors focus can be very different even if their focus is in similar or same discipline. All these properties makes this type of problems unique and cannot be solved by applying news, videos or shopping recommender systems. We propose a new approach to the problem. Our algorithm will be using a single classifier as our recommender. We chose 3 traditional machine learning models, multinomial naive Bayes (MNB), support vector machine (SVM) and logistic regression, to compare for the classifier. We also attempted to use recurrent neural network (RNN). We trained the classifier with professor names as label and we transformed their publication to a feature vector as model input. Final user will input their past papers, statement of purpose or a small paragraph of description of their interest and get professor recommendation as output. 2. Analysis of other approaches 2.1. Manual Search Most of people faced trying to find the right professor to pursue an advance degree will rely on online resources. Some will do an extensive web search using popular search engines such as Google, Baidu and Bing using specific keywords of their area of interest. They might alternatively start from reputable universities and scan through department directories and narrow done the professors with matching fields of study. This will narrow down their selection to tens of professors. Then they might read through the publications of each professor to see if there is a matching interest. Such manual search is very timeconsuming and tedious and often not exhaustive. There are thousands of institutions globally. Each
2 institution will have multiple professors working in similar area. Collectively there will be thousands of or even more publications in one area of study. It is virtually impossible to examine all of those or even find all of them. Here is when selection bias come in. One may tend to focus on more famous professors and schools and ignore some lessor known but better alternatives. One may also ignore foreign schools and favor domestic ones and miss a better match Collabrative Filtering One common approach to recommender system is collaborative filtering. Collaborative filtering methods usually are based on collecting and compiling a significant amount of record of users behaviors, choices and preferences and trying to predict what the end users preference on specific items based on the choice or preferences of other users that is similar to end user, i.e. has made similar choice or has similar preferences on other items. This type of recommender system has its advantage in that it requires no understanding of the item recommender recommends since the recommendation is based on user input thus it does not require parsing any content or understand the meaning of the item. However, such method make a very strong assumption that people who agreed and make similar choice in the past will be making similar choice in the future, and that they will like similar kinds of items in the future as they liked in the past. This assumption works well in some problem but will not work in others. In our problem of matching research interest to professors. When people further their study into on area, their focus usually diverge. One may start studying general machine learning and dive into deep learning, and then choose deep reinforcement learning and finally landing in a specific problem associated with it. Another problem is what usually called cold start problem. This type of system requires a large amount of existing data on other users in order to give acceptable recommendation. A news, video or e-commerce website have the opportunity to gather user input for a long period of time before rolling out their recommender system. There is no easy way to gather any user input for people searching for professors. Other problems of sparsity and scalability also applies to various extends. A professor usually have a very limited number of student and it will be hard to make correlation based on limited amount of samples Content-Based Recommender Another common approach is content-based filtering. Traditional content-based recommender system uses keywords to describe items and a user profile is build to indicate preference based on keywords. This approach, though very effective in some cases, loses a lot of finer-grained information. For example, in the problem of matching professor with ones research interest. Multiple professor might be labeled with same labels, e.g. supervisedlearning, deep-learning etc, but they might be focusing on different sub-domain, where there might be no appropriate model. Recommender using simple keyword tags will not yield much better results than web search. Newer recommender system extract features from the item content. A widely used algorithm is the tf-idf representation. To create a user profile, the system still needs two types of information, the model of users preference and user interaction with the recommender system. The system creates a content-based profile of user based on a weighted vector of the items features extracted from the item. The weights represents the importance of each feature is to the user and can be inferred from rated content vectors using a number of techniques. Simple approach can be an average of all rated item vector. Some other approaches uses machine learning techniques. Direct feedback from the users also helps to adjust weights on certain attributes. A key issue with content-based filtering is that the systems ability to handle cross domain content. The recommendation is usually confined in the same type of the content. This is not a problem in our use case. In fact, our approach is built upon contentbased idea. The main issue we need to solve is how to build user profile efficiently or completely bypass this step Hybrid Recommender Newer recommendation system uses both approaches described above, combining collaborative filtering and content-based filtering could be more effective in some cases. Hybrid system can be
3 implemented in several ways: by creating separate content-based and collaborative-based prediction and them combine them with some heuristics or by adding one capabilities onto the other type of recommenders system. Since collaborative-filtering technique will not work in our problems, we chose not to pursue this path. 3. Dataset and Features 3.1. Dataset Our dataset is obtained from Scopus database. We selected 10 Stanford computer science professors who are actively working in artificial intelligence and selected their publications from This gives us a total of 615 papers. Below is a distribution chart. Table 1. Professor and Published Paper Distribution Professor Teaching Publications Andrew Ng CS Chris Manning CS224N 109 Dan Boneh CS Doug James CS205A 34 Fei-fei Li CS231N 24 Jeannette Bohg CS223A 24 Mike Genesereth CS Percy Liang CS Silvio Savarese CS231A 100 Stefano Ermon CS Due to limitation of the database and to reduce training time and model size, we only selected the abstract portion of the publications based on the assumption that they are usually the most informative part and usually summarize the entire publication very well. This also minimized the need for preprocessing the data as abstract usually contains not much more than pure text. We reformatted the data into title, abstract text, year, author tab delimited format and below is one sample in our data. Smart Forms "We present Smart Forms, an innovative web forms technology for easy creation, maintenance, and evaluation of user-friendly web forms especially the ones that must implement complex laws, regulations, or business policies. In order to provide cognitive assistance to end users during form-filling, Smart Forms have built-in mechanisms for visual feedback, restriction of selectable values, and automatic form filling. Smart Forms can be created and maintained easily by declar-atively configuring rather than procedurally programming these mechanisms. We also present the Smart Forms Editor which assists a Smart Form creator in creating datadriven form UI, editing, testing and verifying formrules, and testing and debugging a form. Copyright 2016, Association for the Advancement of Artificial Intelligence( All rights reserved." 2016 mike 3.2. Features Obviously, we cannot directly use text as input to machine learning models. We build a transformer to vectorize input text. We preprocessed the input text to remove stop words and performed word stemming, which is a common step to decrease not so useful feature count and capture more correlations and connections between features. We then fit a transformer to transform input text to a termfrequency-inverse-document frequency (tf-idf) vector with various n-gram choice and used it as the final input of the classifier. Detailed description and final feature choice will be presented in following sections. 4. Methods The main problem with traditional system is the need to build user profiles. We propose to use only one main building block as the recommender. Both user input and item content go through the same input. Thus we can skip building user profile and just use user input directly to the recommender. The recommender is made up of two functional blocks, the transformer and the classifier. The transformer transforms the input text into feature vectors that can be used by the classifier. The classifier directly output the recommendation as a scoring vector and the top scored item will be used as the output Feature Extraction To extract features out of the text input, we build the aforementioned transformer and these are the paramter combination that produced the best re-
4 sults. Table 2. Feature Selection Feature Parameter N-gram 2 max df 0.8 min df 2 max features 20,000 tf-idf yes since we achieved better accuracy in train-dev set. This is understandable and shows our model have room for improvment. Notibly, multiple professors collabrated on the same paper, our training target does not consider this scenario. Also, abstract information might not be detailed enough to differentiate professors with very similar focus. Fig 1. is the output of our unsupervised learning algorithm showing professor s contribution in each area of study We also inputed the following paragraph into the model 4.2. Model Training We chose Multinomial Naive Bayes (MNB), Support Vector Machine (SVM) and Logistic Regression (LR) as the classifier block. Those models are chosen since they are widely used in language processing. For MNB and SVM, we tuned hyperparameters to achieve the best train-dev accuracy. In SVM we also tested multiple kernels, we found linear kernel to be the best performing kernel. Other higher dimension kernel results in very poor results due to the limited size of our training set Testing and Baseline For initial testing, we used other abstract of publications of the professor, doing the same transformation as input and uses the professors name as target. This will show whether the model learns enough information to differentiate between professors, which is a relatively good indicator of how well our recommender will perform. To justify this assumption, lets assume our user is the author of such abstract and used it as the input to the model, then the model should match it with the papers real author. For real testing, we need a baseline reference to see if our output are reasonable. We build a k-means clustering alogorithm to extract topics out of the publications and generate a visual results of professor s focus in different topics. 5. Results and Discussion Table 3, 4 describes the test accuracies of our initial testing using professors other publications as input. We can see that we achieved a relatively good accuracy overall accuarcy. We can see a bias problem INPUT: Reinforcement learning is the machine learning area of my most interests, because it is inspired by the behaviorist psychology. It concerned about how software agents ought to take actions in an environment so as to maximize some notion of cumulative reward.due to its generality, it can be applied to many area, such as game theory, control theory, operations research, information theory, simulation-based optimization. OUTPUT: andrew Comparing to unsupervised learning results, this is a very accurate recommendation 6. Future Works The RNN failed to train with our limited time-frame and it will be the first thing we would like to investigate given more time. Glo-Ve and word2vec captures word relationship and can potentially give model even better accuracy. We would also enhance our target label to multi-label to account for multiple authors for the same paper. We would also like to expand our dataset. Full publication should give more fine-grained information to further enhance accuracy. More professors in multiple institutions and in various area of study should be included to test the scalability of our model and investigate potential interdisciplinary recommendation. We should also develop a better overall testing method and set a clearer optimization target to ensure our model improves in the right direction.
5 Figure 1. Professor weight in each topics extracted by k-means clustering and LDA Table 3. Overall Testing Accuracy Logistic Regression MNB SVM Accuracy 83.0% 83.0% 83.0% Micro Precision 83.0% 83.0% 83.0% Macro Precision 77.9% 89.3% 78.2% Micro Recall 83.0% 83.0% 83.0% Macro Recall 77.7% 80.1% 77.7% Micro F1 83.0% 83.0% 83.0% Macro F1 77.2% 81.3% 77.2% 7. Contribution Table 5. Contribution of each Team Member Item Project Proposal Project Milestone Supervised Unsupervised RNN Project Poster Final Report Contributor Haochong, Haochong, Haochong References
6 Table 4. Testing Accuracy for each Professor Logistic Regression MNB SVM Professor Test Sample Size Precision F1-score Precision F1-score Precision F1-score Andrew Ng % 72.7% 70.0% 66.7% 80.0% 76.2% Chris Manning % 81.8% 69.2% 81.8% 64.3% 78.3% Dan Boneh % 100.0% 100.0% 100.0% 100.0% 100.0% Doug James % 100.0% 100.0% 100.0% 100.0% 100.0% Fei-fei Li 3 0.0% 0.0% 100.0% 50.0% 0.0% 0.0% Jeannette Bohg % 100.0% 100.0% 66.7% 100.0% 100.0% Mike Genesereth % 50.0% 66.7% 80.0% 50.0% 50.0% Percy Liang % 80.0% 100.0% 80.0% 100.0% 80.0% Silvio Savarese % 87.5% 87.5% 87.5% 87.5% 87.5% Stefano Ermon % 100.0% 100.0% 100.0% 100.0% 100.0% Average/Total % 80.7% 85.4% 82.1% 80.4% 80.8%
Python Machine Learning
Python Machine Learning Unlock deeper insights into machine learning with this vital guide to cuttingedge predictive analytics Sebastian Raschka [ PUBLISHING 1 open source I community experience distilled
More informationA Case Study: News Classification Based on Term Frequency
A Case Study: News Classification Based on Term Frequency Petr Kroha Faculty of Computer Science University of Technology 09107 Chemnitz Germany kroha@informatik.tu-chemnitz.de Ricardo Baeza-Yates Center
More informationProbabilistic Latent Semantic Analysis
Probabilistic Latent Semantic Analysis Thomas Hofmann Presentation by Ioannis Pavlopoulos & Andreas Damianou for the course of Data Mining & Exploration 1 Outline Latent Semantic Analysis o Need o Overview
More informationAustralian Journal of Basic and Applied Sciences
AENSI Journals Australian Journal of Basic and Applied Sciences ISSN:1991-8178 Journal home page: www.ajbasweb.com Feature Selection Technique Using Principal Component Analysis For Improving Fuzzy C-Mean
More informationAssignment 1: Predicting Amazon Review Ratings
Assignment 1: Predicting Amazon Review Ratings 1 Dataset Analysis Richard Park r2park@acsmail.ucsd.edu February 23, 2015 The dataset selected for this assignment comes from the set of Amazon reviews for
More informationLearning From the Past with Experiment Databases
Learning From the Past with Experiment Databases Joaquin Vanschoren 1, Bernhard Pfahringer 2, and Geoff Holmes 2 1 Computer Science Dept., K.U.Leuven, Leuven, Belgium 2 Computer Science Dept., University
More informationLecture 1: Machine Learning Basics
1/69 Lecture 1: Machine Learning Basics Ali Harakeh University of Waterloo WAVE Lab ali.harakeh@uwaterloo.ca May 1, 2017 2/69 Overview 1 Learning Algorithms 2 Capacity, Overfitting, and Underfitting 3
More informationTwitter Sentiment Classification on Sanders Data using Hybrid Approach
IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 17, Issue 4, Ver. I (July Aug. 2015), PP 118-123 www.iosrjournals.org Twitter Sentiment Classification on Sanders
More informationSINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF)
SINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF) Hans Christian 1 ; Mikhael Pramodana Agus 2 ; Derwin Suhartono 3 1,2,3 Computer Science Department,
More informationCS Machine Learning
CS 478 - Machine Learning Projects Data Representation Basic testing and evaluation schemes CS 478 Data and Testing 1 Programming Issues l Program in any platform you want l Realize that you will be doing
More informationSystem Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks
System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks 1 Tzu-Hsuan Yang, 2 Tzu-Hsuan Tseng, and 3 Chia-Ping Chen Department of Computer Science and Engineering
More informationLinking Task: Identifying authors and book titles in verbose queries
Linking Task: Identifying authors and book titles in verbose queries Anaïs Ollagnier, Sébastien Fournier, and Patrice Bellot Aix-Marseille University, CNRS, ENSAM, University of Toulon, LSIS UMR 7296,
More informationPredicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks
Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks Devendra Singh Chaplot, Eunhee Rhim, and Jihie Kim Samsung Electronics Co., Ltd. Seoul, South Korea {dev.chaplot,eunhee.rhim,jihie.kim}@samsung.com
More informationModule 12. Machine Learning. Version 2 CSE IIT, Kharagpur
Module 12 Machine Learning 12.1 Instructional Objective The students should understand the concept of learning systems Students should learn about different aspects of a learning system Students should
More informationarxiv: v1 [cs.cl] 2 Apr 2017
Word-Alignment-Based Segment-Level Machine Translation Evaluation using Word Embeddings Junki Matsuo and Mamoru Komachi Graduate School of System Design, Tokyo Metropolitan University, Japan matsuo-junki@ed.tmu.ac.jp,
More informationRule Learning With Negation: Issues Regarding Effectiveness
Rule Learning With Negation: Issues Regarding Effectiveness S. Chua, F. Coenen, G. Malcolm University of Liverpool Department of Computer Science, Ashton Building, Ashton Street, L69 3BX Liverpool, United
More informationSemi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.
Semi-supervised methods of text processing, and an application to medical concept extraction Yacine Jernite Text-as-Data series September 17. 2015 What do we want from text? 1. Extract information 2. Link
More informationHuman Emotion Recognition From Speech
RESEARCH ARTICLE OPEN ACCESS Human Emotion Recognition From Speech Miss. Aparna P. Wanare*, Prof. Shankar N. Dandare *(Department of Electronics & Telecommunication Engineering, Sant Gadge Baba Amravati
More informationTraining a Neural Network to Answer 8th Grade Science Questions Steven Hewitt, An Ju, Katherine Stasaski
Training a Neural Network to Answer 8th Grade Science Questions Steven Hewitt, An Ju, Katherine Stasaski Problem Statement and Background Given a collection of 8th grade science questions, possible answer
More informationActivities, Exercises, Assignments Copyright 2009 Cem Kaner 1
Patterns of activities, iti exercises and assignments Workshop on Teaching Software Testing January 31, 2009 Cem Kaner, J.D., Ph.D. kaner@kaner.com Professor of Software Engineering Florida Institute of
More information(Sub)Gradient Descent
(Sub)Gradient Descent CMSC 422 MARINE CARPUAT marine@cs.umd.edu Figures credit: Piyush Rai Logistics Midterm is on Thursday 3/24 during class time closed book/internet/etc, one page of notes. will include
More informationReducing Features to Improve Bug Prediction
Reducing Features to Improve Bug Prediction Shivkumar Shivaji, E. James Whitehead, Jr., Ram Akella University of California Santa Cruz {shiv,ejw,ram}@soe.ucsc.edu Sunghun Kim Hong Kong University of Science
More informationThe 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X
The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, 2013 10.12753/2066-026X-13-154 DATA MINING SOLUTIONS FOR DETERMINING STUDENT'S PROFILE Adela BÂRA,
More informationOPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS
OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS Václav Kocian, Eva Volná, Michal Janošek, Martin Kotyrba University of Ostrava Department of Informatics and Computers Dvořákova 7,
More informationCalibration of Confidence Measures in Speech Recognition
Submitted to IEEE Trans on Audio, Speech, and Language, July 2010 1 Calibration of Confidence Measures in Speech Recognition Dong Yu, Senior Member, IEEE, Jinyu Li, Member, IEEE, Li Deng, Fellow, IEEE
More informationUniversiteit Leiden ICT in Business
Universiteit Leiden ICT in Business Ranking of Multi-Word Terms Name: Ricardo R.M. Blikman Student-no: s1184164 Internal report number: 2012-11 Date: 07/03/2013 1st supervisor: Prof. Dr. J.N. Kok 2nd supervisor:
More informationModel Ensemble for Click Prediction in Bing Search Ads
Model Ensemble for Click Prediction in Bing Search Ads Xiaoliang Ling Microsoft Bing xiaoling@microsoft.com Hucheng Zhou Microsoft Research huzho@microsoft.com Weiwei Deng Microsoft Bing dedeng@microsoft.com
More informationSoftware Maintenance
1 What is Software Maintenance? Software Maintenance is a very broad activity that includes error corrections, enhancements of capabilities, deletion of obsolete capabilities, and optimization. 2 Categories
More informationMining Association Rules in Student s Assessment Data
www.ijcsi.org 211 Mining Association Rules in Student s Assessment Data Dr. Varun Kumar 1, Anupama Chadha 2 1 Department of Computer Science and Engineering, MVN University Palwal, Haryana, India 2 Anupama
More information*Net Perceptions, Inc West 78th Street Suite 300 Minneapolis, MN
From: AAAI Technical Report WS-98-08. Compilation copyright 1998, AAAI (www.aaai.org). All rights reserved. Recommender Systems: A GroupLens Perspective Joseph A. Konstan *t, John Riedl *t, AI Borchers,
More informationCS 446: Machine Learning
CS 446: Machine Learning Introduction to LBJava: a Learning Based Programming Language Writing classifiers Christos Christodoulopoulos Parisa Kordjamshidi Motivation 2 Motivation You still have not learnt
More informationRule Learning with Negation: Issues Regarding Effectiveness
Rule Learning with Negation: Issues Regarding Effectiveness Stephanie Chua, Frans Coenen, and Grant Malcolm University of Liverpool Department of Computer Science, Ashton Building, Ashton Street, L69 3BX
More informationActive Learning. Yingyu Liang Computer Sciences 760 Fall
Active Learning Yingyu Liang Computer Sciences 760 Fall 2017 http://pages.cs.wisc.edu/~yliang/cs760/ Some of the slides in these lectures have been adapted/borrowed from materials developed by Mark Craven,
More informationADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF
Read Online and Download Ebook ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF Click link bellow and free register to download
More informationLearning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models
Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models Stephan Gouws and GJ van Rooyen MIH Medialab, Stellenbosch University SOUTH AFRICA {stephan,gvrooyen}@ml.sun.ac.za
More informationPOS tagging of Chinese Buddhist texts using Recurrent Neural Networks
POS tagging of Chinese Buddhist texts using Recurrent Neural Networks Longlu Qin Department of East Asian Languages and Cultures longlu@stanford.edu Abstract Chinese POS tagging, as one of the most important
More informationLearning Methods for Fuzzy Systems
Learning Methods for Fuzzy Systems Rudolf Kruse and Andreas Nürnberger Department of Computer Science, University of Magdeburg Universitätsplatz, D-396 Magdeburg, Germany Phone : +49.39.67.876, Fax : +49.39.67.8
More informationA New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation
A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation SLSP-2016 October 11-12 Natalia Tomashenko 1,2,3 natalia.tomashenko@univ-lemans.fr Yuri Khokhlov 3 khokhlov@speechpro.com Yannick
More informationExploration. CS : Deep Reinforcement Learning Sergey Levine
Exploration CS 294-112: Deep Reinforcement Learning Sergey Levine Class Notes 1. Homework 4 due on Wednesday 2. Project proposal feedback sent Today s Lecture 1. What is exploration? Why is it a problem?
More informationWord Segmentation of Off-line Handwritten Documents
Word Segmentation of Off-line Handwritten Documents Chen Huang and Sargur N. Srihari {chuang5, srihari}@cedar.buffalo.edu Center of Excellence for Document Analysis and Recognition (CEDAR), Department
More informationSwitchboard Language Model Improvement with Conversational Data from Gigaword
Katholieke Universiteit Leuven Faculty of Engineering Master in Artificial Intelligence (MAI) Speech and Language Technology (SLT) Switchboard Language Model Improvement with Conversational Data from Gigaword
More informationThe Good Judgment Project: A large scale test of different methods of combining expert predictions
The Good Judgment Project: A large scale test of different methods of combining expert predictions Lyle Ungar, Barb Mellors, Jon Baron, Phil Tetlock, Jaime Ramos, Sam Swift The University of Pennsylvania
More informationAxiom 2013 Team Description Paper
Axiom 2013 Team Description Paper Mohammad Ghazanfari, S Omid Shirkhorshidi, Farbod Samsamipour, Hossein Rahmatizadeh Zagheli, Mohammad Mahdavi, Payam Mohajeri, S Abbas Alamolhoda Robotics Scientific Association
More informationSpeech Emotion Recognition Using Support Vector Machine
Speech Emotion Recognition Using Support Vector Machine Yixiong Pan, Peipei Shen and Liping Shen Department of Computer Technology Shanghai JiaoTong University, Shanghai, China panyixiong@sjtu.edu.cn,
More informationLip reading: Japanese vowel recognition by tracking temporal changes of lip shape
Lip reading: Japanese vowel recognition by tracking temporal changes of lip shape Koshi Odagiri 1, and Yoichi Muraoka 1 1 Graduate School of Fundamental/Computer Science and Engineering, Waseda University,
More informationSelf Study Report Computer Science
Computer Science undergraduate students have access to undergraduate teaching, and general computing facilities in three buildings. Two large classrooms are housed in the Davis Centre, which hold about
More informationProduct Feature-based Ratings foropinionsummarization of E-Commerce Feedback Comments
Product Feature-based Ratings foropinionsummarization of E-Commerce Feedback Comments Vijayshri Ramkrishna Ingale PG Student, Department of Computer Engineering JSPM s Imperial College of Engineering &
More informationMMOG Subscription Business Models: Table of Contents
DFC Intelligence DFC Intelligence Phone 858-780-9680 9320 Carmel Mountain Rd Fax 858-780-9671 Suite C www.dfcint.com San Diego, CA 92129 MMOG Subscription Business Models: Table of Contents November 2007
More informationKnowledge Transfer in Deep Convolutional Neural Nets
Knowledge Transfer in Deep Convolutional Neural Nets Steven Gutstein, Olac Fuentes and Eric Freudenthal Computer Science Department University of Texas at El Paso El Paso, Texas, 79968, U.S.A. Abstract
More informationWhat is a Mental Model?
Mental Models for Program Understanding Dr. Jonathan I. Maletic Computer Science Department Kent State University What is a Mental Model? Internal (mental) representation of a real system s behavior,
More informationLecture 1: Basic Concepts of Machine Learning
Lecture 1: Basic Concepts of Machine Learning Cognitive Systems - Machine Learning Ute Schmid (lecture) Johannes Rabold (practice) Based on slides prepared March 2005 by Maximilian Röglinger, updated 2010
More informationTIMSS ADVANCED 2015 USER GUIDE FOR THE INTERNATIONAL DATABASE. Pierre Foy
TIMSS ADVANCED 2015 USER GUIDE FOR THE INTERNATIONAL DATABASE Pierre Foy TIMSS Advanced 2015 orks User Guide for the International Database Pierre Foy Contributors: Victoria A.S. Centurino, Kerry E. Cotter,
More informationNotes on The Sciences of the Artificial Adapted from a shorter document written for course (Deciding What to Design) 1
Notes on The Sciences of the Artificial Adapted from a shorter document written for course 17-652 (Deciding What to Design) 1 Ali Almossawi December 29, 2005 1 Introduction The Sciences of the Artificial
More informationPhD project description. <Working title of the dissertation>
PhD project description PhD student: University of Agder (UiA) Faculty of Engineering and Science Department
More informationProbability and Statistics Curriculum Pacing Guide
Unit 1 Terms PS.SPMJ.3 PS.SPMJ.5 Plan and conduct a survey to answer a statistical question. Recognize how the plan addresses sampling technique, randomization, measurement of experimental error and methods
More informationGeorgetown University at TREC 2017 Dynamic Domain Track
Georgetown University at TREC 2017 Dynamic Domain Track Zhiwen Tang Georgetown University zt79@georgetown.edu Grace Hui Yang Georgetown University huiyang@cs.georgetown.edu Abstract TREC Dynamic Domain
More informationOrganizational Knowledge Distribution: An Experimental Evaluation
Association for Information Systems AIS Electronic Library (AISeL) AMCIS 24 Proceedings Americas Conference on Information Systems (AMCIS) 12-31-24 : An Experimental Evaluation Surendra Sarnikar University
More informationThe Moodle and joule 2 Teacher Toolkit
The Moodle and joule 2 Teacher Toolkit Moodlerooms Learning Solutions The design and development of Moodle and joule continues to be guided by social constructionist pedagogy. This refers to the idea that
More informationOnline Updating of Word Representations for Part-of-Speech Tagging
Online Updating of Word Representations for Part-of-Speech Tagging Wenpeng Yin LMU Munich wenpeng@cis.lmu.de Tobias Schnabel Cornell University tbs49@cornell.edu Hinrich Schütze LMU Munich inquiries@cislmu.org
More informationA Comparison of Two Text Representations for Sentiment Analysis
010 International Conference on Computer Application and System Modeling (ICCASM 010) A Comparison of Two Text Representations for Sentiment Analysis Jianxiong Wang School of Computer Science & Educational
More informationIterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages
Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages Nuanwan Soonthornphisaj 1 and Boonserm Kijsirikul 2 Machine Intelligence and Knowledge Discovery Laboratory Department of Computer
More informationLaboratorio di Intelligenza Artificiale e Robotica
Laboratorio di Intelligenza Artificiale e Robotica A.A. 2008-2009 Outline 2 Machine Learning Unsupervised Learning Supervised Learning Reinforcement Learning Genetic Algorithms Genetics-Based Machine Learning
More informationLanguage Acquisition Fall 2010/Winter Lexical Categories. Afra Alishahi, Heiner Drenhaus
Language Acquisition Fall 2010/Winter 2011 Lexical Categories Afra Alishahi, Heiner Drenhaus Computational Linguistics and Phonetics Saarland University Children s Sensitivity to Lexical Categories Look,
More informationP. Belsis, C. Sgouropoulou, K. Sfikas, G. Pantziou, C. Skourlas, J. Varnas
Exploiting Distance Learning Methods and Multimediaenhanced instructional content to support IT Curricula in Greek Technological Educational Institutes P. Belsis, C. Sgouropoulou, K. Sfikas, G. Pantziou,
More informationCSL465/603 - Machine Learning
CSL465/603 - Machine Learning Fall 2016 Narayanan C Krishnan ckn@iitrpr.ac.in Introduction CSL465/603 - Machine Learning 1 Administrative Trivia Course Structure 3-0-2 Lecture Timings Monday 9.55-10.45am
More informationA study of speaker adaptation for DNN-based speech synthesis
A study of speaker adaptation for DNN-based speech synthesis Zhizheng Wu, Pawel Swietojanski, Christophe Veaux, Steve Renals, Simon King The Centre for Speech Technology Research (CSTR) University of Edinburgh,
More informationOn the Combined Behavior of Autonomous Resource Management Agents
On the Combined Behavior of Autonomous Resource Management Agents Siri Fagernes 1 and Alva L. Couch 2 1 Faculty of Engineering Oslo University College Oslo, Norway siri.fagernes@iu.hio.no 2 Computer Science
More informationParsing of part-of-speech tagged Assamese Texts
IJCSI International Journal of Computer Science Issues, Vol. 6, No. 1, 2009 ISSN (Online): 1694-0784 ISSN (Print): 1694-0814 28 Parsing of part-of-speech tagged Assamese Texts Mirzanur Rahman 1, Sufal
More informationDesigning a Rubric to Assess the Modelling Phase of Student Design Projects in Upper Year Engineering Courses
Designing a Rubric to Assess the Modelling Phase of Student Design Projects in Upper Year Engineering Courses Thomas F.C. Woodhall Masters Candidate in Civil Engineering Queen s University at Kingston,
More informationISFA2008U_120 A SCHEDULING REINFORCEMENT LEARNING ALGORITHM
Proceedings of 28 ISFA 28 International Symposium on Flexible Automation Atlanta, GA, USA June 23-26, 28 ISFA28U_12 A SCHEDULING REINFORCEMENT LEARNING ALGORITHM Amit Gil, Helman Stern, Yael Edan, and
More informationThe Effect of Extensive Reading on Developing the Grammatical. Accuracy of the EFL Freshmen at Al Al-Bayt University
The Effect of Extensive Reading on Developing the Grammatical Accuracy of the EFL Freshmen at Al Al-Bayt University Kifah Rakan Alqadi Al Al-Bayt University Faculty of Arts Department of English Language
More informationNetpix: A Method of Feature Selection Leading. to Accurate Sentiment-Based Classification Models
Netpix: A Method of Feature Selection Leading to Accurate Sentiment-Based Classification Models 1 Netpix: A Method of Feature Selection Leading to Accurate Sentiment-Based Classification Models James B.
More informationUsing dialogue context to improve parsing performance in dialogue systems
Using dialogue context to improve parsing performance in dialogue systems Ivan Meza-Ruiz and Oliver Lemon School of Informatics, Edinburgh University 2 Buccleuch Place, Edinburgh I.V.Meza-Ruiz@sms.ed.ac.uk,
More informationGenerative models and adversarial training
Day 4 Lecture 1 Generative models and adversarial training Kevin McGuinness kevin.mcguinness@dcu.ie Research Fellow Insight Centre for Data Analytics Dublin City University What is a generative model?
More informationModeling function word errors in DNN-HMM based LVCSR systems
Modeling function word errors in DNN-HMM based LVCSR systems Melvin Jose Johnson Premkumar, Ankur Bapna and Sree Avinash Parchuri Department of Computer Science Department of Electrical Engineering Stanford
More informationMYCIN. The MYCIN Task
MYCIN Developed at Stanford University in 1972 Regarded as the first true expert system Assists physicians in the treatment of blood infections Many revisions and extensions over the years The MYCIN Task
More informationLecture 10: Reinforcement Learning
Lecture 1: Reinforcement Learning Cognitive Systems II - Machine Learning SS 25 Part III: Learning Programs and Strategies Q Learning, Dynamic Programming Lecture 1: Reinforcement Learning p. Motivation
More informationUniversity of Groningen. Systemen, planning, netwerken Bosman, Aart
University of Groningen Systemen, planning, netwerken Bosman, Aart IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from it. Please check the document
More informationComment-based Multi-View Clustering of Web 2.0 Items
Comment-based Multi-View Clustering of Web 2.0 Items Xiangnan He 1 Min-Yen Kan 1 Peichu Xie 2 Xiao Chen 3 1 School of Computing, National University of Singapore 2 Department of Mathematics, National University
More informationCLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH
ISSN: 0976-3104 Danti and Bhushan. ARTICLE OPEN ACCESS CLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH Ajit Danti 1 and SN Bharath Bhushan 2* 1 Department
More informationEDCI 699 Statistics: Content, Process, Application COURSE SYLLABUS: SPRING 2016
EDCI 699 Statistics: Content, Process, Application COURSE SYLLABUS: SPRING 2016 Instructor: Dr. Katy Denson, Ph.D. Office Hours: Because I live in Albuquerque, New Mexico, I won t have office hours. But
More informationWelcome to the session on ACCUPLACER Policy Development. This session will touch upon common policy decisions an institution may encounter during the
Welcome to the session on ACCUPLACER Policy Development. This session will touch upon common policy decisions an institution may encounter during the development or reevaluation of a placement program.
More informationAttributed Social Network Embedding
JOURNAL OF LATEX CLASS FILES, VOL. 14, NO. 8, MAY 2017 1 Attributed Social Network Embedding arxiv:1705.04969v1 [cs.si] 14 May 2017 Lizi Liao, Xiangnan He, Hanwang Zhang, and Tat-Seng Chua Abstract Embedding
More informationDOCTORAL SCHOOL TRAINING AND DEVELOPMENT PROGRAMME
The following resources are currently available: DOCTORAL SCHOOL TRAINING AND DEVELOPMENT PROGRAMME 2016-17 What is the Doctoral School? The main purpose of the Doctoral School is to enhance your experience
More informationDetecting English-French Cognates Using Orthographic Edit Distance
Detecting English-French Cognates Using Orthographic Edit Distance Qiongkai Xu 1,2, Albert Chen 1, Chang i 1 1 The Australian National University, College of Engineering and Computer Science 2 National
More informationUSING A RECOMMENDER TO INFLUENCE CONSUMER ENERGY USAGE
USING A RECOMMENDER TO INFLUENCE CONSUMER ENERGY USAGE Master Degree Project in Information Fusion Two years Level ECTS Autumn term and Spring term Year Henric Carlsson Supervisor: Gunnar Mathiason Examiner:
More informationCambridge NATIONALS. Creative imedia Level 1/2. UNIT R081 - Pre-Production Skills DELIVERY GUIDE
Cambridge NATIONALS Creative imedia Level 1/2 UNIT R081 - Pre-Production Skills VERSION 1 APRIL 2013 INDEX Introduction Page 3 Unit R081 - Pre-Production Skills Page 4 Learning Outcome 1 - Understand the
More informationOCR for Arabic using SIFT Descriptors With Online Failure Prediction
OCR for Arabic using SIFT Descriptors With Online Failure Prediction Andrey Stolyarenko, Nachum Dershowitz The Blavatnik School of Computer Science Tel Aviv University Tel Aviv, Israel Email: stloyare@tau.ac.il,
More informationDeep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach
#BaselOne7 Deep search Enhancing a search bar using machine learning Ilgün Ilgün & Cedric Reichenbach We are not researchers Outline I. Periscope: A search tool II. Goals III. Deep learning IV. Applying
More informationAUTOMATED TROUBLESHOOTING OF MOBILE NETWORKS USING BAYESIAN NETWORKS
AUTOMATED TROUBLESHOOTING OF MOBILE NETWORKS USING BAYESIAN NETWORKS R.Barco 1, R.Guerrero 2, G.Hylander 2, L.Nielsen 3, M.Partanen 2, S.Patel 4 1 Dpt. Ingeniería de Comunicaciones. Universidad de Málaga.
More informationSpeech Recognition at ICSI: Broadcast News and beyond
Speech Recognition at ICSI: Broadcast News and beyond Dan Ellis International Computer Science Institute, Berkeley CA Outline 1 2 3 The DARPA Broadcast News task Aspects of ICSI
More informationThe Karlsruhe Institute of Technology Translation Systems for the WMT 2011
The Karlsruhe Institute of Technology Translation Systems for the WMT 2011 Teresa Herrmann, Mohammed Mediani, Jan Niehues and Alex Waibel Karlsruhe Institute of Technology Karlsruhe, Germany firstname.lastname@kit.edu
More informationArtificial Neural Networks written examination
1 (8) Institutionen för informationsteknologi Olle Gällmo Universitetsadjunkt Adress: Lägerhyddsvägen 2 Box 337 751 05 Uppsala Artificial Neural Networks written examination Monday, May 15, 2006 9 00-14
More informationTesting A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA
Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA Testing a Moving Target How Do We Test Machine Learning Systems? Peter Varhol, Technology
More informationBootstrapping Personal Gesture Shortcuts with the Wisdom of the Crowd and Handwriting Recognition
Bootstrapping Personal Gesture Shortcuts with the Wisdom of the Crowd and Handwriting Recognition Tom Y. Ouyang * MIT CSAIL ouyang@csail.mit.edu Yang Li Google Research yangli@acm.org ABSTRACT Personal
More informationFeature Selection based on Sampling and C4.5 Algorithm to Improve the Quality of Text Classification using Naïve Bayes
Feature Selection based on Sampling and C4.5 Algorithm to Improve the Quality of Text Classification using Naïve Bayes Viviana Molano 1, Carlos Cobos 1, Martha Mendoza 1, Enrique Herrera-Viedma 2, and
More informationUniversidade do Minho Escola de Engenharia
Universidade do Minho Escola de Engenharia Universidade do Minho Escola de Engenharia Dissertação de Mestrado Knowledge Discovery is the nontrivial extraction of implicit, previously unknown, and potentially
More informationHoughton Mifflin Online Assessment System Walkthrough Guide
Houghton Mifflin Online Assessment System Walkthrough Guide Page 1 Copyright 2007 by Houghton Mifflin Company. All Rights Reserved. No part of this document may be reproduced or transmitted in any form
More informationModeling function word errors in DNN-HMM based LVCSR systems
Modeling function word errors in DNN-HMM based LVCSR systems Melvin Jose Johnson Premkumar, Ankur Bapna and Sree Avinash Parchuri Department of Computer Science Department of Electrical Engineering Stanford
More informationVisit us at:
White Paper Integrating Six Sigma and Software Testing Process for Removal of Wastage & Optimizing Resource Utilization 24 October 2013 With resources working for extended hours and in a pressurized environment,
More information