Advanced Natural Language Processing and Information Retrieval

Advanced Natural Language Processing and Information Retrieval: Course Description
Alessandro Moschitti
Department of Computer Science and Information Engineering, University of Trento
Email: moschitti@disi.unitn.it

Teachers
Prof. Alessandro Moschitti, PhD
Dr. Olga Uryupina, PhD
Dr. Antonio Uva
Dr. Massimo Nicosia
Dr. Daniele Bonadiman
Dr. Kateryna Tymoshenko, PhD
Dr. Gianni Barlacchi
Dr. Lingzhen Chen (Liah)
Dr. Irina Haponchyk

Student List https://goo.gl/0nwz5v

Course Schedule
Lectures: Thursday
11:30-13:30 (Theory), Room A213
16:30-18:30 (Laboratory), Room PC A201
In the first month, some theory lectures are held in the lab.
Consulting hours: sending an email is recommended.

Syllabus
Introduction to Information Retrieval (IR): Boolean retrieval, Vector Space Model, Feature Vectors, Document/Passage Retrieval, Search Engines, Relevance Feedback & Query Expansion, Document Filtering and Categorization, flat and hierarchical clustering, Latent Semantic Analysis, Web Crawling and the Google algorithm.
Statistical Machine Learning: Kernel Methods, Classification, Clustering, Ranking, Re-Ranking and Regression, with hints on practical machine learning.
Neural Networks: CNNs, LSTM.
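As an illustration of the Vector Space Model and Feature Vectors items above, here is a minimal sketch. It assumes whitespace tokenization, raw term counts, and a plain log inverse document frequency; these choices, the function names, and the toy documents are illustrative assumptions, not taken from the course material.

```python
import math
from collections import Counter


def tf_idf_vectors(docs):
    """Return one sparse TF-IDF vector (dict: term -> weight) per document.

    Assumption: whitespace tokenization and weight = tf * log(N / df).
    """
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)
        vectors.append({t: tf[t] * math.log(n_docs / df[t]) for t in tf})
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


# Hypothetical toy collection, for illustration only.
docs = [
    "boolean retrieval with inverted index",
    "vector space model for document retrieval",
    "kernel methods for text classification",
]
vecs = tf_idf_vectors(docs)
```

With these toy documents, the first two share the term "retrieval" and so have a positive cosine similarity, while the first and third share no terms and score zero.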

Syllabus
Performance Evaluation: Performance Measures, Performance Estimation, Cross-validation, Held-Out and n-fold Cross-validation.
Statistical Natural Language Processing:
Sequence Labeling: POS-tagging, Named Entity Recognition and Normalization.
Syntactic Parsing: shallow and deep Constituency Parsing, Dependency Syntactic Parsing.
Social Media: sentiment analysis and event extraction from Twitter.
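The n-fold cross-validation item above can be sketched as an index-splitting routine. This is a minimal illustration under stated assumptions: contiguous folds with no shuffling, and a hypothetical function name `n_fold_splits`. It is not the procedure used in the course labs.

```python
def n_fold_splits(n_examples, n_folds):
    """Yield (train_indices, test_indices) once per fold.

    Assumption: folds are contiguous blocks of indices and the data
    has already been shuffled if shuffling is wanted.
    """
    if not 2 <= n_folds <= n_examples:
        raise ValueError("n_folds must be between 2 and n_examples")
    indices = list(range(n_examples))
    # Spread the remainder so that fold sizes differ by at most one.
    base, extra = divmod(n_examples, n_folds)
    start = 0
    for fold in range(n_folds):
        size = base + (1 if fold < extra else 0)
        test = indices[start:start + size]
        train = indices[:start] + indices[start + size:]
        yield train, test
        start += size


# Each example is used for testing exactly once across the folds.
splits = list(n_fold_splits(10, 3))
```

A performance measure computed on each test fold and averaged over the folds gives the cross-validated estimate; a held-out evaluation corresponds to using a single fixed train/test split instead.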

Syllabus
Statistical Natural Language Processing:
Shallow Semantic Parsing: Predicate Argument Structures, SRL of FrameNet and PropBank, Relation Extraction (supervised and semi-supervised).
Discourse Parsing: Coreference Resolution and discourse connective classification.

Syllabus
Joint NLP and IR applications:
Deep Linguistic Analysis for Question Answering: QA tasks (open, restricted, factoid, non-factoid), NLP Representation, Question Answering Workflow, QA Pipeline, Question Classification and QA reranking.
Fine-Grained Opinion Mining: automatic review classification, deep opinion analysis, automatic product extraction and review, reputation/social media analysis.

Lab 1
Search Engines
Kernel Methods and SVMs
Automated Text Categorization
Question Classification
Answer Reranking
Syntactic Parsing and Named Entity Recognition
Sentiment Analysis
Neural Networks

Lab 2
Our UIMA pipeline implementing a pseudo Watson (4-5 lectures)
All NLP processors seen before
Question Answering full pipeline
Community Question Answering full pipeline

PART I: Essential Notions of Information Retrieval and Machine Learning
Feb 22: Alessandro
Introduction to the course and IR, performance measures, machine learning, text categorization
Mar 1: Alessandro (live video lecture)
Perceptron, SVMs (theory)
Kernel Methods, Question Classification (theory)
Practical examples on the above

PART I: Essential Notions of Information Retrieval and Machine Learning
Mar 8: Alessandro (live video lecture): Classification, Multi-classification, Ranking, Regression and Structured Output Models (theory)
Irina: Ranking, Multi-classification, Regression, Structured Perceptron (Lab)

PART II: Basics of Natural Language Processing
Mar 15: Olga: Sequence labeling: POS-tagging and Named Entity Recognition
Antonio: Sequence labeling: POS-tagging and Named Entity Recognition (Lab)
Mar 22: Olga: Coreference Resolution (theory)
Mar 22: Irina: Coreference Resolution (Lab)
Mar 29: Easter

PART II: Basics of Natural Language Processing
Apr 5: Olga: Syntactic Parsing (theory)
Antonio: Syntactic Parsing (Lab)
Apr 12: Gianni: Pandas for text data analysis
Apr 19: Kateryna: Question Answering with a UIMA pipeline
Antonio: Community Question Answering with a UIMA pipeline

PART III: Neural Networks for NLP and IR
Apr 26: Alessandro
Introduction to Neural Networks (theory)
Neural Models for NLP (theory)
May 3: Daniele
Neural Network models and implementations: PyTorch development environment, examples on Sentiment Analysis
May 10: Daniele
Neural networks for Question Answering
Convolutional Networks, Long Short-Term Memory

PART III: Neural Networks for NLP and IR
May 17: Liah: Networks for NER and sequence-to-sequence models
Massimo: Neural Networks for end-to-end systems

Where to study?
Course slides at http://disi.unitn.it/moschitti/teaching.html, ANLP-IR section (you can watch the old NLP-IR section)
Book - IR: Modern Information Retrieval. Ricardo A. Baeza-Yates and Berthier Ribeiro-Neto. Addison-Wesley Longman Publishing Co., Inc., Boston, MA, USA, 1999. ISBN: 020139829X
IIR: Introduction to Information Retrieval. Christopher D. Manning, Prabhakar Raghavan and Hinrich Schütze. Cambridge University Press, 2008.

Where to study?
Book - NLP: Foundations of Statistical Natural Language Processing. Chris Manning and Hinrich Schütze. MIT Press, Cambridge, MA, May 1999.
Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition. Second Edition. Daniel Jurafsky and James H. Martin.

Where to study?
Course slides at http://disi.unitn.it/moschitti/teaching.html, NLP-IR section
Slides of IIR available at: http://informationretrieval.org

Material Slides at

Reference Book