Bias in NLP Systems. COMP-550 Nov 30, 2017



Outline
- A4 reading discussions
- Bias in NLP systems
- Recap

A4 Reading Discussion
- Any clarification questions?
- How does this method relate to the work we discussed in class?
- What are the strengths of the approach? Limitations?
- Is it a good idea to replace parts of the model with a neural network? If so, which parts?

NLP in the Real World
NLP and AI systems are increasingly used to automate fact finding and decision making:
- Information retrieval
- Image captioning
- Automated essay grading
- School admissions decisions
- Resume and CV filtering
- Loan and insurance approval
We want to make sure the process and the decisions are fair and unbiased!

Technological Fairness?
- Hope: Use objective measures and statistical techniques to produce a fairer system, free of human biases.
- Reality: Machine learning systems can learn the biases that are inherent in the data.
- Even worse: the learned methods can produce results that are more biased than the training data!
How can this be?

Bias in Word Embedding Models
word2vec exhibits bias!
This is okay: man - woman ≈ king - queen
But this is NOT, and it is also found by word2vec: man - woman ≈ computer programmer - homemaker (Bolukbasi et al., 2016)
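The analogy arithmetic above can be sketched in a few lines. This is a minimal illustration, assuming hand-built 3-d toy vectors (not real word2vec output) whose axes are chosen to mimic a gender, a royalty, and a technology dimension; the names `EMB`, `cosine`, and `analogy` are hypothetical.

```python
import math

# Toy vectors, hand-built for illustration only (NOT real word2vec output).
# Assumed axes: [gender, royalty, technology].
EMB = {
    "man":        [1.0, 0.0, 0.0],
    "woman":      [-1.0, 0.0, 0.0],
    "king":       [1.0, 1.0, 0.0],
    "queen":      [-1.0, 1.0, 0.0],
    "programmer": [0.6, 0.0, 1.0],   # bias deliberately baked into the gender axis
    "homemaker":  [-0.6, 0.0, 0.3],
}

def cosine(u, v):
    """Cosine similarity between two vectors given as lists."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def analogy(a, b, c, emb=EMB):
    """Solve a : b :: c : ? using the vector offset b - a + c."""
    target = [y - x + z for x, y, z in zip(emb[a], emb[b], emb[c])]
    candidates = [w for w in emb if w not in (a, b, c)]
    return max(candidates, key=lambda w: cosine(emb[w], target))

print(analogy("man", "woman", "king"))        # queen
print(analogy("man", "woman", "programmer"))  # homemaker
```

The second query returns the biased answer only because the toy vectors encode a gender component for the occupation words; the point of the slide is that vectors trained on real text can pick up such a component on their own.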

Most Gender-Biased Occupations (Bolukbasi et al., 2016)

Implications of Word Association Bias
- The above results are due to counting word associations.
- Maybe this just reflects the bias in the underlying distribution of real life. Why is that so bad?
Scenario: information retrieval (search results)
- The system must produce a ranking of, say, people's home pages to show for a search query, e.g., "cmu computer science phd student".
- Given two otherwise identical webpages, an algorithm may pick a website with a man's name (e.g., John) over one with a woman's name (e.g., Mary), because the former is more distributionally similar to computer science!
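The ranking scenario can be made concrete with a small sketch. Everything here is an assumption for illustration: the 2-d vectors are invented (axes taken to be gender and "computer-science-ness"), pages and queries are represented by the mean of their word vectors, and `rank`, `mean_vector`, and `PAGES` are hypothetical names, not part of any real retrieval system.

```python
import math

# Hypothetical 2-d vectors; assumed axes: [gender, computer-science-ness].
# The CS-related words carry a small positive (male-leaning) gender component.
EMB = {
    "john": [0.8, 0.1], "mary": [-0.8, 0.1],
    "computer": [0.3, 0.9], "science": [0.2, 0.8],
    "phd": [0.1, 0.6], "student": [0.0, 0.4],
}

def mean_vector(text):
    """Average the vectors of all known words; unknown words are skipped."""
    vecs = [EMB[w] for w in text.lower().split() if w in EMB]
    return [sum(col) / len(vecs) for col in zip(*vecs)]

def cosine(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    return dot / (math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v)))

def rank(query, pages):
    """Return page names sorted by similarity to the query, best first."""
    q = mean_vector(query)
    return sorted(pages, key=lambda name: cosine(q, mean_vector(pages[name])), reverse=True)

# Two pages that are identical except for the first name.
PAGES = {
    "mary_page": "Mary computer science phd student",
    "john_page": "John computer science phd student",
}
print(rank("cmu computer science phd student", PAGES))  # ['john_page', 'mary_page']
```

The query contains no name at all, yet the page with the male name ranks first, purely because of the gender component carried by the occupation-related words.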

Visual Semantic Role Labelling
imSitu data set (Yatskar et al., 2016)

Bias Amplification in Trained Models
Result from Zhao et al. (2017)

Why does Bias Amplification Occur?
- Training data exhibits some bias.
- An automatic system is asked to produce a decision under uncertainty:
  - Ranking websites
  - Labelling an image as involving a male or female participant
- With standard loss/evaluation procedures, it is rational to favour the more frequent class if other information does not disambiguate.
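The argument above can be checked with simple arithmetic. The sketch below assumes a hypothetical training set in which 66% of the agents for some activity are labelled "woman", and a classifier whose features carry no information about gender; the numbers and function names are invented for illustration and are not results from the cited papers.

```python
from collections import Counter

def ratio(labels, target="woman"):
    """Fraction of labels equal to target."""
    return labels.count(target) / len(labels)

def majority_predictions(train_labels, n_test):
    """With uninformative features, the accuracy-maximizing rule is to
    always predict the most frequent training label."""
    majority, _ = Counter(train_labels).most_common(1)[0]
    return [majority] * n_test

def expected_accuracy_majority(p):
    """Expected accuracy of always predicting the majority class."""
    return max(p, 1 - p)

def expected_accuracy_matching(p):
    """Expected accuracy of sampling predictions at the training ratio p."""
    return p * p + (1 - p) * (1 - p)

train = ["woman"] * 66 + ["man"] * 34   # hypothetical 66/34 split
preds = majority_predictions(train, 100)

print(ratio(train))                                # 0.66
print(ratio(preds))                                # 1.0
print(expected_accuracy_majority(0.66))            # 0.66
print(round(expected_accuracy_matching(0.66), 4))  # 0.5512
```

Always predicting the majority class scores higher under standard accuracy than reproducing the training ratio, so a 66/34 skew in the data becomes a 100/0 skew in the predictions.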

Debiasing Algorithms
General technique:
1. Identify the axis or axes of bias (e.g., gender, race, religion).
2. Modify our learning or inference by adding constraints, such that the biased outcomes (as previously identified) are disfavoured.
Let's consider the method of Zhao et al. (2017).
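For word embeddings, step 1 can be illustrated by estimating a bias direction and removing each word's component along it, in the spirit of the "neutralize" step described by Bolukbasi et al. (2016). The sketch below is a simplification under stated assumptions: it uses a single hand-built word pair to define the direction (the paper estimates a subspace from several pairs), and the vectors and function names are hypothetical.

```python
import math

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def normalize(v):
    """Scale v to unit length."""
    n = math.sqrt(dot(v, v))
    return [a / n for a in v]

def bias_direction(vec_a, vec_b):
    """Unit vector along vec_a - vec_b (single-pair simplification)."""
    return normalize([a - b for a, b in zip(vec_a, vec_b)])

def neutralize(v, g):
    """Remove the component of v that lies along the unit direction g."""
    proj = dot(v, g)
    return [a - proj * b for a, b in zip(v, g)]

# Toy vectors, assumed axes: [gender, royalty, technology].
man, woman = [1.0, 0.0, 0.0], [-1.0, 0.0, 0.0]
programmer = [0.6, 0.0, 1.0]

g = bias_direction(man, woman)
neutral = neutralize(programmer, g)
print(neutral)          # [0.0, 0.0, 1.0]
print(dot(neutral, g))  # 0.0
```

After projection the occupation vector is equally similar to both gendered words, while its non-gender components are left untouched.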

Debiasing Activity Recognition
Original inference problem: $\arg\max_{y \in Y} f_\theta(y, i)$
i.e., make the decision $y$ (e.g., $y$ = {woman, meat, stove, ...}) that maximizes the score on test instance $i$.
Idea: for each activity $v$ to debias, add a constraint:
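The constraint itself is not reproduced in this transcript. As a hedged sketch, the corpus-level constraint in Zhao et al. (2017) has roughly the following form for an activity $v^*$. The symbols are assumptions drawn from that paper rather than from the slide: $b^*$ is the ratio of male agents for the activity in the training set, $\gamma$ is a tolerance margin, and $M$ and $W$ are the sets of male and female agent values.

```latex
b^* - \gamma
\;\le\;
\frac{\sum_i y^i_{v = v^*,\, r \in M}}
     {\sum_i y^i_{v = v^*,\, r \in W} + \sum_i y^i_{v = v^*,\, r \in M}}
\;\le\;
b^* + \gamma
```

In words: across all test instances $i$, the fraction of predictions for activity $v^*$ that assign a male agent should stay within $\gamma$ of the fraction observed in the training data.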

New Optimization Problem
where
- $\{Y^i\}$ represents the space of all possible label assignments to all test instances
- constraints are taken from equation (2) for each activity
This is expensive to solve exactly; use an approximate method based on Lagrange multipliers.
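The Lagrange-multiplier idea can be sketched on a much smaller problem than the one in Zhao et al. (2017). The code below is a simplification, not the authors' implementation: it assumes a single activity with a binary agent label, a hypothetical list of per-instance score margins, and a plain subgradient update with a fixed step size `eta`. The ratio constraint is rewritten as two linear inequalities, each with its own non-negative multiplier.

```python
def constrained_inference(margins, b, gamma, eta=0.1, max_iters=100):
    """Approximate corpus-level constrained inference by Lagrangian relaxation.

    margins[i] = score(woman) - score(man) for test instance i (hypothetical).
    The fraction of 'woman' predictions should lie in [b - gamma, b + gamma].
    Returns (labels, lam_up, lam_lo); labels[i] is 1 for woman, 0 for man.
    """
    hi, lo = b + gamma, b - gamma
    lam_up = lam_lo = 0.0   # multipliers for the upper and lower bound
    labels = []
    for _ in range(max_iters):
        # Per-instance inference with scores adjusted by the multipliers.
        shift = lam_up - lam_lo
        labels = [1 if m - shift > 0 else 0 for m in margins]
        women = sum(labels)
        men = len(labels) - women
        # Linear forms of the ratio constraint; a value > 0 is a violation.
        viol_up = (1 - hi) * women - hi * men
        viol_lo = lo * men - (1 - lo) * women
        if viol_up <= 0 and viol_lo <= 0:
            break
        # Subgradient step, projected back onto lambda >= 0.
        lam_up = max(0.0, lam_up + eta * viol_up)
        lam_lo = max(0.0, lam_lo + eta * viol_lo)
    return labels, lam_up, lam_lo

# Hypothetical margins: unconstrained inference predicts 'woman' for 8 of 10.
MARGINS = [0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.27, 0.1, -0.3, -0.6]

labels, lam_up, lam_lo = constrained_inference(MARGINS, b=0.6, gamma=0.05)
print(sum(1 for m in MARGINS if m > 0))  # 8
print(sum(labels))                       # 6
```

Only the two least confident "woman" predictions are flipped, which is why this kind of calibration can reduce amplification while changing few decisions. In general the procedure is approximate and is not guaranteed to satisfy every constraint within the iteration budget.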

Performance
Reduced bias amplification without much loss in classification performance!

Summary of Current Work
- Bias is a problem in NLP systems.
- Naïve methods can exacerbate the problem.
- It is possible to reduce the effect of biases without sacrificing task performance.

References
Tolga Bolukbasi, Kai-Wei Chang, James Zou, Venkatesh Saligrama, Adam Kalai. Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings. NIPS 2016.
Jieyu Zhao, Tianlu Wang, Mark Yatskar, Vicente Ordonez, Kai-Wei Chang. Men Also Like Shopping: Reducing Gender Bias Amplification using Corpus-level Constraints. EMNLP 2017.

Recap of Course
What have we done in COMP-550?

Computational Linguistics (CL)
Modelling natural language with computational models and techniques
Domains of natural language:
- Acoustic signals, phonemes, words, syntax, semantics, ...
- Speech vs. text
- Natural language understanding (or comprehension) vs. natural language generation (or production)

Computational Linguistics (CL)
Modelling natural language with computational models and techniques
Goals:
- Language technology applications
- Scientific understanding of how language works

Computational Linguistics (CL)
Modelling natural language with computational models and techniques
Methodology and techniques:
- Gathering data: language resources
- Evaluation
- Statistical methods and machine learning
- Rule-based methods

Current Trends and Challenges
Speculations about the future of NLP

Better Use of More Data
Large amounts of data are now available:
- Unlabelled
- Noisy
- May not be directly relevant to your specific problem
How do we make better use of it?
- Unsupervised or lightly supervised methods
- Prediction models that can make use of data to learn what features are important (neural networks)
- Incorporate linguistic insights with large-scale data processing

Using More Sources of Knowledge
Old set up: Annotated data set -> Feature extraction + Simple supervised learning -> Model predictions
Better model? Background text, General knowledge bases, Domain-specific constraints, Directly relevant annotated data -> Model predictions

Away From Discreteness
Discreteness is sometimes a convenient assumption, but also a problem:
- Words, phrases, sentences and labels for them
- Symbolic representations of semantics
- Motivated a lot of work in regularization and smoothing
Representation learning:
- Learn continuous-valued representations using co-occurrence statistics, or some other objective function
- e.g., vector-space semantics

Continuous-Valued Representations
cat, linguistics, NP, VP
Advantages:
- Implicitly deal with smoothness, soft boundaries
- Incorporate many sources of information in training vectors
Challenges:
- What should a good continuous representation look like?
- Evaluation is often still in terms of a discrete set of labels

Broadening Horizons
We are getting better at solving specific problems on specific benchmark data sets.
- e.g., on the WSJ corpus, POS tagging performance of >97% matches human-level performance.
Much more difficult and interesting:
- Working across multiple kinds of text and data sets
- Integrating disparate theories, domains, and tasks

Connections to Other Fields
Cognitive science and psycholinguistics
- e.g., model L1 and L2 acquisition; other human behaviour based on computational models
Human-computer interaction and information visualization
- That's nice that you have a tagger/parser/summarizer/ASR system/NLG module. Now, what do you do with it?
- Multi-modal systems and visualizations

That's It!
Good luck on your projects and finals!