Word Sense Disambiguation

Size: px
Start display at page:

Download "Word Sense Disambiguation"

Transcription

1 Word Sense Disambiguation Computational Lexical Semantics Gemma Boleda 1 Stefan Evert 2 1 Universitat Politècnica de Catalunya 2 University of Osnabrück ESSLLI. Bordeaux, France, July / 56

2 Thanks Overview These slides are based on Jurafsky & Martin (2004: chapter 20) and material by Ann Copestake (course at UPF, 2008) 2 / 56

3 Outline Overview 1 Overview / 56

4 Outline Overview 1 Overview / 56

5 Overview Word Sense Disambiguation The task of selecting the correct sense for a word in context. potentially helpful in many applications machine translation, question answering, information retrieval... we focus on WSD as a stand-alone task artificial! 5 / 56

6 Overview Word Sense Disambiguation The task of selecting the correct sense for a word in context. potentially helpful in many applications machine translation, question answering, information retrieval... we focus on WSD as a stand-alone task artificial! 5 / 56

7 Overview Word Sense Disambiguation The task of selecting the correct sense for a word in context. potentially helpful in many applications machine translation, question answering, information retrieval... we focus on WSD as a stand-alone task artificial! 5 / 56

8 Overview Word Sense Disambiguation The task of selecting the correct sense for a word in context. potentially helpful in many applications machine translation, question answering, information retrieval... we focus on WSD as a stand-alone task artificial! 5 / 56

9 WSD algorithm basic form: input: word in context, fixed inventory of word senses output: the correct word sense for that use context? words surrounding the target word: annotated? just the words in no particular order? context size? inventory? task-dependent machine translation from English to Spanish: set of Spanish translations speech synthesis: homographs with differing pronunciations (e.g., bass) stand-alone task: a lexical resource (usually, WordNet) 6 / 56

10 WSD algorithm basic form: input: word in context, fixed inventory of word senses output: the correct word sense for that use context? words surrounding the target word: annotated? just the words in no particular order? context size? inventory? task-dependent machine translation from English to Spanish: set of Spanish translations speech synthesis: homographs with differing pronunciations (e.g., bass) stand-alone task: a lexical resource (usually, WordNet) 6 / 56

11 WSD algorithm basic form: input: word in context, fixed inventory of word senses output: the correct word sense for that use context? words surrounding the target word: annotated? just the words in no particular order? context size? inventory? task-dependent machine translation from English to Spanish: set of Spanish translations speech synthesis: homographs with differing pronunciations (e.g., bass) stand-alone task: a lexical resource (usually, WordNet) 6 / 56

12 WSD algorithm basic form: input: word in context, fixed inventory of word senses output: the correct word sense for that use context? words surrounding the target word: annotated? just the words in no particular order? context size? inventory? task-dependent machine translation from English to Spanish: set of Spanish translations speech synthesis: homographs with differing pronunciations (e.g., bass) stand-alone task: a lexical resource (usually, WordNet) 6 / 56

13 An example WordNet Sense Target Word in Context bass 4... fish as Pacific salmon and striped bass and... bass 4... produce filets of smoked bass or sturgeon... bass 7... exciting jazz bass player since Ray Brown... bass 7... play bass because he doesn t have to solo... Figure: Possible inventory of sense tags for word bass 7 / 56

14 Variants of the task lexical sample task WSD for a small set of target words a number of corpus instances are selected and labeled similar to task in our case study supervised approaches; word-specific classifiers all-words WSD for all content words in a text similar to POS-tagging; but very large tagset! data sparseness not enough training data for every word 8 / 56

15 Outline Overview 1 Overview / 56

16 Feature extraction supervised approach need to identify features that are predictive of word senses fundamental (and early) insight: look at the context words bass smoked bass or jazz bass player window (e.g., 1-word window) 10 / 56

17 Feature extraction supervised approach need to identify features that are predictive of word senses fundamental (and early) insight: look at the context words bass smoked bass or jazz bass player window (e.g., 1-word window) 10 / 56

18 Feature extraction supervised approach need to identify features that are predictive of word senses fundamental (and early) insight: look at the context words bass smoked bass or jazz bass player window (e.g., 1-word window) 10 / 56

19 Feature extraction supervised approach need to identify features that are predictive of word senses fundamental (and early) insight: look at the context words bass smoked bass or jazz bass player window (e.g., 1-word window) 10 / 56

20 Method Overview process the dataset (POS-tagging, lemmatization, parsing) build feature representation encoding the relevant linguistic information two main feature types: 1 collocational features 2 bag-of-words features 11 / 56

21 Collocational features features that take order or syntactic relations into account restricted to immediate word context (usually fixed window). For example: lemma and part of speech of two-word window syntactic function of the target word 12 / 56

22 Collocational features: Example Example: (20.1) An electric guitar and bass player stand off to one side, not really part of the scene, just as a sort of nod to gringo expectations perhaps. 2-word window representation, using parts of speech: [guitar, NN, and, CC, player, NN, stand, VB] [w 2, P 2, w 1, P 1, w + 1, P + 1, w + 2, P + 2] 13 / 56

23 Collocational features: Example Example: (20.1) An electric guitar and bass player stand off to one side, not really part of the scene, just as a sort of nod to gringo expectations perhaps. 2-word window representation, using parts of speech: [guitar, NN, and, CC, player, NN, stand, VB] [w 2, P 2, w 1, P 1, w + 1, P + 1, w + 2, P + 2] 13 / 56

24 Collocational features: Example Example: (20.1) An electric guitar and bass player stand off to one side, not really part of the scene, just as a sort of nod to gringo expectations perhaps. 2-word window representation, using parts of speech: [guitar, NN, and, CC, player, NN, stand, VB] [w 2, P 2, w 1, P 1, w + 1, P + 1, w + 2, P + 2] 13 / 56

25 Bag-of-words features lexical features pre-selected words that are potentially relevant for sense distinctions. For example: for all-words task: frequent content words in the corpus for lexical sample task: content words in the sentences of the target word test for presence/absence of a certain word in the selected context 14 / 56

26 Bag-of-words features: Example Example: (20.1) An electric guitar and bass player stand off to one side, not really part of the scene, just as a sort of nod to gringo expectations perhaps. pre-selected words: [fishing, big, sound, player, fly] feature vector: [0, 0, 0, 1, 0] 15 / 56

27 Bag-of-words features: Example Example: (20.1) An electric guitar and bass player stand off to one side, not really part of the scene, just as a sort of nod to gringo expectations perhaps. pre-selected words: [fishing, big, sound, player, fly] feature vector: [0, 0, 0, 1, 0] 15 / 56

28 More on features collocational cues account for: collocational effects bass+player=bass 7 syntax-related sense differences serve breakfast to customers vs. serve Philadelphia bag of word features account for topic and domain related features resemblance to semantic fields, frames,... complementary information both feature types usually combined 16 / 56

29 Combined representation: Example simplified representation for 2 sentences: collocational features corresponding to 1-word window:... jazz bass player smoked bass or... bag-of-word features only fishing, player 17 / 56

30 Combined representation Weka wordl1 posl1 wordr1 posr1 fishing player sense jazz,cc,player,nn,0,1,s7 smoke,vbd,or,nn,0,0,s4... jazz bass player smoked bass or / 56

31 Method Overview any supervised algorithm Decision Trees (for example, J48) Decision Lists (similar to Decision Trees) Naive Bayes (probabilistic)... and tool Weka R SVMTool your own implementation / 56

32 Interim Summary supervised approaches use sense-annotated datasets need many annotated examples for every word relevant information in the context: lexico-syntactic information (collocational features) lexical information (bag of words features) information is encoded in the form of features... and a classifier is trained to distinguish different senses of a given word 20 / 56

33 Outline Overview 1 Overview / 56

34 Extrinsic evaluation long term goal: improve performance in end-to-end application extrinsic evaluation (or task-based, end-to-end, in vivo evaluation) example: Word Sense Disambiguation for (Cross-Lingual) Information Retrieval 22 / 56

35 Intrinsic evaluation however, extrinsic evaluation difficult and time consuming intrinsic evaluation (or in vitro evaluation) treat a WSD component as if it were a stand-alone system measure: sense accuracy (percentage of words correctly tagged) Accuracy = matches total method: held-out data from the same sense-tagged corpora used for training (train-test methodology) to standardize datasets and methods: SensEval and SemEval competitions example: our case study 23 / 56

36 Intrinsic evaluation however, extrinsic evaluation difficult and time consuming intrinsic evaluation (or in vitro evaluation) treat a WSD component as if it were a stand-alone system measure: sense accuracy (percentage of words correctly tagged) Accuracy = matches total method: held-out data from the same sense-tagged corpora used for training (train-test methodology) to standardize datasets and methods: SensEval and SemEval competitions example: our case study 23 / 56

37 Baseline Overview baseline: performance we would get without much knowledge / with a simple approach necessary for any Machine Learning experiment (how good is 70%?) simplest baseline: most frequent sense WordNet: first sense heuristic (senses ordered) very powerful baseline! skewed distribution of senses in corpora BUT we need access to annotated data for every word in the dataset to estimate sense frequencies this is a knowledge-laden baseline 24 / 56

38 Ceiling Overview ceiling or upper-bound for performance: inter-coder agreement all-word corpora using WordNet: A o more coarse-grained sense distinctions: A o 0.9 another possibility: avoid annotation using pseudowords banana-door however: unrealistic real polysemy is not like banana-doors! need to find better ways to create pseudowords 25 / 56

39 Outline Overview 1 Overview / 56

40 Overview sense-labeled corpora give accurate information but scarce! need other sources: dictionaries, thesaurus, selectional restrictions... idea: use dictionaries as corpora (identifying related words in definitions and examples) 27 / 56

41 An example Example: (20.10) The bank can guarantee deposits will eventually cover future tuition costs because it invests in adjustable-rate mortgage securities. bank 1 Gloss: a financial institution that accepts deposits and channels the money into lending activities Examples: he cashed a check at the bank ; that bank holds the mortgage on my home bank 2 Gloss: sloping land (especially beside a body of water) Examples: they pulled the canoe up on the bank ; he sat on the bank of the river Figure: WordNet information for two senses of bank 28 / 56

42 An example Example: (20.10) The bank can guarantee deposits will eventually cover future tuition costs because it invests in adjustable-rate mortgage securities. bank 1 Gloss: a financial institution that accepts deposits and channels the money into lending activities Examples: he cashed a check at the bank ; that bank holds the mortgage on my home bank 2 Gloss: sloping land (especially beside a body of water) Examples: they pulled the canoe up on the bank ; he sat on the bank of the river Figure: WordNet information for two senses of bank 28 / 56

43 An example Example: (20.10) The bank can guarantee deposits will eventually cover future tuition costs because it invests in adjustable-rate mortgage securities. bank 1 Gloss: a financial institution that accepts deposits and channels the money into lending activities Examples: he cashed a check at the bank ; that bank holds the mortgage on my home bank 2 Gloss: sloping land (especially beside a body of water) Examples: they pulled the canoe up on the bank ; he sat on the bank of the river Figure: WordNet information for two senses of bank 29 / 56

44 Signatures Overview signature: set of words that characterizes a given sense of a target word extracted from dictionaries, thesauri, tagged corpora,... for example (20.10): bank 1 : financial, institution, accept, deposit, channel, money, lending, activity, cash, check, hold, mortgage, home bank 2 : sloping, land, body, water, pull, canoe, bank, sit, river 30 / 56

45 Lesk Algorithm Lesk Algorithm function SIMPLIFIED LESK(word, sentence) returns best sense of word best-sense most frequent sense for word max-overlap 0 context set of words in sentence for each sense in senses of word do signature set of words in the gloss and examples of sense overlap COMPUTEOVERLAP(signature, context) if overlap > max-overlap then max-overlap overlap best-sense sense end return(best-sense) 31 / 56

46 Lesk Algorithm Example: she strolled by the river bank. best-sense bank 1 ; max-overlap 0 context {she, stroll, river} sense bank 1 : signature {financial, institution, accept, deposit, channel, money, lending, activity, cash, check, hold, mortgage, home} overlap 0; 0 > 0 fails sense bank 2 : signature {sloping, land, body, water, pull, canoe, bank, sit, river} overlap 1; 1 > 0 succeeds best-sense bank 2 ; max overlap 1 return bank 2 32 / 56

47 Lesk Algorithm Example: she strolled by the river bank. best-sense bank 1 ; max-overlap 0 context {she, stroll, river} sense bank 1 : signature {financial, institution, accept, deposit, channel, money, lending, activity, cash, check, hold, mortgage, home} overlap 0; 0 > 0 fails sense bank 2 : signature {sloping, land, body, water, pull, canoe, bank, sit, river} overlap 1; 1 > 0 succeeds best-sense bank 2 ; max overlap 1 return bank 2 32 / 56

48 Lesk Algorithm Example: she strolled by the river bank. best-sense bank 1 ; max-overlap 0 context {she, stroll, river} sense bank 1 : signature {financial, institution, accept, deposit, channel, money, lending, activity, cash, check, hold, mortgage, home} overlap 0; 0 > 0 fails sense bank 2 : signature {sloping, land, body, water, pull, canoe, bank, sit, river} overlap 1; 1 > 0 succeeds best-sense bank 2 ; max overlap 1 return bank 2 32 / 56

49 Lesk Algorithm Example: she strolled by the river bank. best-sense bank 1 ; max-overlap 0 context {she, stroll, river} sense bank 1 : signature {financial, institution, accept, deposit, channel, money, lending, activity, cash, check, hold, mortgage, home} overlap 0; 0 > 0 fails sense bank 2 : signature {sloping, land, body, water, pull, canoe, bank, sit, river} overlap 1; 1 > 0 succeeds best-sense bank 2 ; max overlap 1 return bank 2 32 / 56

50 Lesk Algorithm Example: she strolled by the river bank. best-sense bank 1 ; max-overlap 0 context {she, stroll, river} sense bank 1 : signature {financial, institution, accept, deposit, channel, money, lending, activity, cash, check, hold, mortgage, home} overlap 0; 0 > 0 fails sense bank 2 : signature {sloping, land, body, water, pull, canoe, bank, sit, river} overlap 1; 1 > 0 succeeds best-sense bank 2 ; max overlap 1 return bank 2 32 / 56

51 Overview right intuition: words that appear in dictionary definitions and examples are relevant to a given sense problem: data sparseness: dictionary entries short, not always examples Lesk algorithm currently used as baseline BUT many extensions possible and have been tried (generalizations over lemmata, corpus data, weighting,... ) AND dictionary-derived features can be used (are used) in standard supervised approaches 33 / 56

52 Interim Summary information encoded in dictionaries (definitions, examples) is useful for WSD can be used exclusively or in addition to other information (collocations, bag of words) for supervised approaches the Lesk algorithm disambiguates solely on the basis of dicionary information overlap between dictionary entry and context of word occurrence the most frequent sense and the Lesk algorithm are used as baselines for evaluation 34 / 56

53 Overview we have a huge number of classes (senses) need large hand-built resources: supervised approaches need large annotated corpora (unrealistic) dictionary methods need large dictionaries, which, even if available, often do not provide enough information alternatives: Minimally supervised WSD Unsupervised WSD both make use of unannotated data these approaches are not as successful as supervised approaches 35 / 56

54 Minimally supervised WSD: Bootstrapping for a given word, for example plant start with a small number of annotated examples (seeds) for each sense collect additional examples for each sense based on their similarity to annotated examples iterate 36 / 56

55 Bootstrapping: example plant (Yarowsky 1995) sense A: living entity; sense B: building first examples: those that appear with life (sense A) and manufacturing (sense B) Figure: Bootstrapping word senses. Figure 20.4 in Jurafsky & Martin. 37 / 56

56 Yarowsky 1995 Influential insights (used as heuristics in Yarowsky s algorithm): one sense per collocation life+plant = plant A manufacturing+plant = plant B one sense per discourse if a word appears multiple times in a text, probably all occurrences will bear the same sense also useful to enlarge datasets 38 / 56

57 Unsupervised WSD no previous knowledge no human-defined word senses simply group examples according to the similarity of the examples clustering and infer senses from that problem: hard to interpret and evaluate 39 / 56

58 Unsupervised WSD no previous knowledge no human-defined word senses simply group examples according to the similarity of the examples clustering and infer senses from that problem: hard to interpret and evaluate 39 / 56

59 Outline Overview 1 Overview / 56

60 Interim summary WSD can be framed as a standard classification task training data, feature definition, classifier, evaluation supervised approaches most useful information: syntactic and lexical context (collocational features) words related to the different senses of a given word (bag of word features) words in dictionary (thesaurus, etc.) entries other approaches try to make use of unannotated data bootstrapping, unsupervised learning would be great, but not as successful as supervised approaches (and harder to interpret and work with) 41 / 56

61 Useful empirical facts skewed distribution of senses most frequent sense baseline heuristic when no other information is available BUT distribution varies with text/corpus! (cone in geometry textbook) one sense per collocation bass+player=bass 7 simple cues for sense classification (heuristic) one sense per discourse different occurences of a word in a given text tend to be used in the same sense heuristic for classification and for data gathering 42 / 56

62 Conceptual problems the task as currently defined does no allow for generalization over different words learning is word-specific number of classes = number of senses; equal to or greater than number of words! need training data for every sense of every word most words have low frequency (Zipf s law) no chance with unknown words this wouldn t be a problem if word sense alternation were like bank 1 bank 2 (homonymy) but many alternations are systematic! (regular polysemy, metonymy, metaphor) 43 / 56

63 Conceptual problems the task as currently defined does no allow for generalization over different words learning is word-specific number of classes = number of senses; equal to or greater than number of words! need training data for every sense of every word most words have low frequency (Zipf s law) no chance with unknown words this wouldn t be a problem if word sense alternation were like bank 1 bank 2 (homonymy) but many alternations are systematic! (regular polysemy, metonymy, metaphor) 43 / 56

64 Regular polysemy conversion bank (N): financial institution bank (V): put money in a bank same for sugar, hammer, tango, etc. (also derivation: -ize) adjectives (Boleda 2007) qualitative vs. relational: cara familiar ( familiar face ) vs. reunió familiar ( family meeting ) event-related vs. qualitative: fet sabut ( known fact ) vs. home sabut ( wise man ) 44 / 56

65 Regular polysemy conversion bank (N): financial institution bank (V): put money in a bank same for sugar, hammer, tango, etc. (also derivation: -ize) adjectives (Boleda 2007) qualitative vs. relational: cara familiar ( familiar face ) vs. reunió familiar ( family meeting ) event-related vs. qualitative: fet sabut ( known fact ) vs. home sabut ( wise man ) 44 / 56

66 Regular polysemy: mass/count animal/meat chicken 1 : animal; chicken 2 : meat lamb 1 : animal; lamb 2 : meat... portions/kinds: two beers two servings of beer two types of beer generally: thing/derived substance (grinding) After several lorries had run over the body, there was rabbit splattered all over the road. 45 / 56

67 Regular polysemy verb alternations causative/inchoative (Levin 1993) John broke the window The window broke Spanish psychological verbs Le preocupa la situación (Dative + Subject) Bruna no quiere preocuparla (subject + Accusative) 46 / 56

68 Contextual coercion / Logical metonymy (Also see course by Louise McNally.) object to eventuality (Pustejovsky 1995) Mary enjoyed the book. After three martinis, Kim felt much happier. adjectives (Pustejovsky 1995): event selection fast runner vs. fast typist vs. fast car 47 / 56

69 Metonymy Overview container/content He drank a bottle of whisky. Morphology again: He drank a bottleful of whisky. (-ful suffixation) fruit/plant olive, grapefruit,... Spanish: often tree masculine (olivo, naranjo), fruit feminine (oliva, naranja) figure/ground Kim painted the door Kim walked through the door 48 / 56

70 Metonymy Overview country names Location: I live in China. Government: The US and Lybia have agreed to work together to solve... Team (sports): England won last s year World Cup. more generally: institutions Barcelona applied for the Olympic Games. The banks won t give credits now. The newspapers criticized this policy. object/person The cello is playing badly. Not so regular: contextual metaphor: The ham sandwich wants his check. (Lakoff & Johnson 1980) 49 / 56

71 Metonymy Overview country names Location: I live in China. Government: The US and Lybia have agreed to work together to solve... Team (sports): England won last s year World Cup. more generally: institutions Barcelona applied for the Olympic Games. The banks won t give credits now. The newspapers criticized this policy. object/person The cello is playing badly. Not so regular: contextual metaphor: The ham sandwich wants his check. (Lakoff & Johnson 1980) 49 / 56

72 Metaphor Overview physical mental depart 1 : physical transfer; arrive 1 : physical transfer; go 1 : physical transfer depart 2 : mental transfer; arrive 2 : mental transfer; go 2 : mental transfer concrete abstract aigua clara ( clear water ) vs. estil clar ( clear style ) cabells negres ( black hair ) vs. humor negre ( black humour ) 50 / 56

73 Metaphor Overview physical mental depart 1 : physical transfer; arrive 1 : physical transfer; go 1 : physical transfer depart 2 : mental transfer; arrive 2 : mental transfer; go 2 : mental transfer concrete abstract aigua clara ( clear water ) vs. estil clar ( clear style ) cabells negres ( black hair ) vs. humor negre ( black humour ) 50 / 56

74 To sum up pervasive systematicity in sense alternations: regular polysemy, metonymy, metaphor productive We found a little, hairy wampimuk sleeping behind the tree (McDonald & Ramscar 2001) Wampimuk soup is delicious! inherent property of language analogical reasoning (psychology again) WSD as currently handled cannot capture these regularities theoretical and practical problem! 51 / 56

75 To sum up pervasive systematicity in sense alternations: regular polysemy, metonymy, metaphor productive We found a little, hairy wampimuk sleeping behind the tree (McDonald & Ramscar 2001) Wampimuk soup is delicious! inherent property of language analogical reasoning (psychology again) WSD as currently handled cannot capture these regularities theoretical and practical problem! 51 / 56

76 To sum up pervasive systematicity in sense alternations: regular polysemy, metonymy, metaphor productive We found a little, hairy wampimuk sleeping behind the tree (McDonald & Ramscar 2001) Wampimuk soup is delicious! inherent property of language analogical reasoning (psychology again) WSD as currently handled cannot capture these regularities theoretical and practical problem! 51 / 56

77 To sum up pervasive systematicity in sense alternations: regular polysemy, metonymy, metaphor productive We found a little, hairy wampimuk sleeping behind the tree (McDonald & Ramscar 2001) Wampimuk soup is delicious! inherent property of language analogical reasoning (psychology again) WSD as currently handled cannot capture these regularities theoretical and practical problem! 51 / 56

78 WSD and regularities: what one can do generalize on FEATURES e.g., jazz MUSIC-STYLE jazz, rock, blues,... provided some lexical resource is available that encodes this information He is a jazz bass player. I love bass solos in rock music. problem: when (how) to generalize? when to stop? 52 / 56

79 WSD and regularities: what one can do generalize on FEATURES e.g., jazz MUSIC-STYLE jazz, rock, blues,... provided some lexical resource is available that encodes this information He is a jazz bass player. I love bass solos in rock music. problem: when (how) to generalize? when to stop? 52 / 56

80 WSD and regularities: what one can do generalize on FEATURES e.g., jazz MUSIC-STYLE jazz, rock, blues,... provided some lexical resource is available that encodes this information He is a jazz bass player. I love bass solos in rock music. problem: when (how) to generalize? when to stop? 52 / 56

81 WSD and regularities: what one can do generalize on FEATURES e.g., jazz MUSIC-STYLE jazz, rock, blues,... provided some lexical resource is available that encodes this information He is a jazz bass player. I love bass solos in rock music. problem: when (how) to generalize? when to stop? 52 / 56

82 WSD and regularities: what would be desirable train on chicken and use the data for lamb, wampimuk,... Resources such as WordNet encode the meat/animal distinction: WordNet info for chicken: chicken 1 : the flesh of a chicken used for food. chicken 2 : a domesticated gallinaceous bird (hyponym). chicken 3 : a person who lacks confidence. chicken 4 : a foolhardy competition. WordNet info for lamb: lamb 1 : young sheep. lamb 2 : a person easily deceived or cheated. lamb 3 : a sweet innocent mild-mannered person. lamb 4 : the flesh of a young domestic sheep eaten as food. WHAT IS MISSING: link between chicken 2 and lamb 1, chicken 1 and lamb 4 (note other senses) 53 / 56

83 WSD and regularities: what would be desirable train on chicken and use the data for lamb, wampimuk,... Resources such as WordNet encode the meat/animal distinction: WordNet info for chicken: chicken 1 : the flesh of a chicken used for food. chicken 2 : a domesticated gallinaceous bird (hyponym). chicken 3 : a person who lacks confidence. chicken 4 : a foolhardy competition. WordNet info for lamb: lamb 1 : young sheep. lamb 2 : a person easily deceived or cheated. lamb 3 : a sweet innocent mild-mannered person. lamb 4 : the flesh of a young domestic sheep eaten as food. WHAT IS MISSING: link between chicken 2 and lamb 1, chicken 1 and lamb 4 (note other senses) 53 / 56

84 WSD and regularities: what would be desirable train on chicken and use the data for lamb, wampimuk,... Resources such as WordNet encode the meat/animal distinction: WordNet info for chicken: chicken 1 : the flesh of a chicken used for food. chicken 2 : a domesticated gallinaceous bird (hyponym). chicken 3 : a person who lacks confidence. chicken 4 : a foolhardy competition. WordNet info for lamb: lamb 1 : young sheep. lamb 2 : a person easily deceived or cheated. lamb 3 : a sweet innocent mild-mannered person. lamb 4 : the flesh of a young domestic sheep eaten as food. WHAT IS MISSING: link between chicken 2 and lamb 1, chicken 1 and lamb 4 (note other senses) 53 / 56

85 Word Sense Disambiguation Computational Lexical Semantics Gemma Boleda 1 Stefan Evert 2 1 Universitat Politècnica de Catalunya 2 University of Osnabrück ESSLLI. Bordeaux, France, July / 56

86 Classifier example 1: Naive Bayes probabilistic classifier (related to HMMs) choosing the best sense amounts to choosing the most probable sense given the feature vector conditional probability BUT it is impossible to train it directly (too many feature combinations) 2 strategies: decomposing the probabilities (Bayes rules) easier to estimate making unrealistic assumption: words are independent ( Naive Bayes) training the classifier = estimating probabilities from the sense-tagged corpus 55 / 56

87 Classifier example 2: Decision Lists similar to decision trees (difference: only one condition) Rule Sense fish within window bass 4 striped bass bass 4 guitar within window bass 7 play/v bass bass 7 Figure: Decision List for word bass to learn a decision list classifier: generate and order tests according to the training data 56 / 56

Word Sense Disambiguation

Word Sense Disambiguation Word Sense Disambiguation D. De Cao R. Basili Corso di Web Mining e Retrieval a.a. 2008-9 May 21, 2009 Excerpt of the R. Mihalcea and T. Pedersen AAAI 2005 Tutorial, at: http://www.d.umn.edu/ tpederse/tutorials/advances-in-wsd-aaai-2005.ppt

More information

Language Acquisition Fall 2010/Winter Lexical Categories. Afra Alishahi, Heiner Drenhaus

Language Acquisition Fall 2010/Winter Lexical Categories. Afra Alishahi, Heiner Drenhaus Language Acquisition Fall 2010/Winter 2011 Lexical Categories Afra Alishahi, Heiner Drenhaus Computational Linguistics and Phonetics Saarland University Children s Sensitivity to Lexical Categories Look,

More information

2/15/13. POS Tagging Problem. Part-of-Speech Tagging. Example English Part-of-Speech Tagsets. More Details of the Problem. Typical Problem Cases

2/15/13. POS Tagging Problem. Part-of-Speech Tagging. Example English Part-of-Speech Tagsets. More Details of the Problem. Typical Problem Cases POS Tagging Problem Part-of-Speech Tagging L545 Spring 203 Given a sentence W Wn and a tagset of lexical categories, find the most likely tag T..Tn for each word in the sentence Example Secretariat/P is/vbz

More information

Target Language Preposition Selection an Experiment with Transformation-Based Learning and Aligned Bilingual Data

Target Language Preposition Selection an Experiment with Transformation-Based Learning and Aligned Bilingual Data Target Language Preposition Selection an Experiment with Transformation-Based Learning and Aligned Bilingual Data Ebba Gustavii Department of Linguistics and Philology, Uppsala University, Sweden ebbag@stp.ling.uu.se

More information

Enhancing Unlexicalized Parsing Performance using a Wide Coverage Lexicon, Fuzzy Tag-set Mapping, and EM-HMM-based Lexical Probabilities

Enhancing Unlexicalized Parsing Performance using a Wide Coverage Lexicon, Fuzzy Tag-set Mapping, and EM-HMM-based Lexical Probabilities Enhancing Unlexicalized Parsing Performance using a Wide Coverage Lexicon, Fuzzy Tag-set Mapping, and EM-HMM-based Lexical Probabilities Yoav Goldberg Reut Tsarfaty Meni Adler Michael Elhadad Ben Gurion

More information

Applications of memory-based natural language processing

Applications of memory-based natural language processing Applications of memory-based natural language processing Antal van den Bosch and Roser Morante ILK Research Group Tilburg University Prague, June 24, 2007 Current ILK members Principal investigator: Antal

More information

A Bayesian Learning Approach to Concept-Based Document Classification

A Bayesian Learning Approach to Concept-Based Document Classification Databases and Information Systems Group (AG5) Max-Planck-Institute for Computer Science Saarbrücken, Germany A Bayesian Learning Approach to Concept-Based Document Classification by Georgiana Ifrim Supervisors

More information

Construction Grammar. University of Jena.

Construction Grammar. University of Jena. Construction Grammar Holger Diessel University of Jena holger.diessel@uni-jena.de http://www.holger-diessel.de/ Words seem to have a prototype structure; but language does not only consist of words. What

More information

Linking Task: Identifying authors and book titles in verbose queries

Linking Task: Identifying authors and book titles in verbose queries Linking Task: Identifying authors and book titles in verbose queries Anaïs Ollagnier, Sébastien Fournier, and Patrice Bellot Aix-Marseille University, CNRS, ENSAM, University of Toulon, LSIS UMR 7296,

More information

Multilingual Sentiment and Subjectivity Analysis

Multilingual Sentiment and Subjectivity Analysis Multilingual Sentiment and Subjectivity Analysis Carmen Banea and Rada Mihalcea Department of Computer Science University of North Texas rada@cs.unt.edu, carmen.banea@gmail.com Janyce Wiebe Department

More information

Probabilistic Latent Semantic Analysis

Probabilistic Latent Semantic Analysis Probabilistic Latent Semantic Analysis Thomas Hofmann Presentation by Ioannis Pavlopoulos & Andreas Damianou for the course of Data Mining & Exploration 1 Outline Latent Semantic Analysis o Need o Overview

More information

On document relevance and lexical cohesion between query terms

On document relevance and lexical cohesion between query terms Information Processing and Management 42 (2006) 1230 1247 www.elsevier.com/locate/infoproman On document relevance and lexical cohesion between query terms Olga Vechtomova a, *, Murat Karamuftuoglu b,

More information

Assessing System Agreement and Instance Difficulty in the Lexical Sample Tasks of SENSEVAL-2

Assessing System Agreement and Instance Difficulty in the Lexical Sample Tasks of SENSEVAL-2 Assessing System Agreement and Instance Difficulty in the Lexical Sample Tasks of SENSEVAL-2 Ted Pedersen Department of Computer Science University of Minnesota Duluth, MN, 55812 USA tpederse@d.umn.edu

More information

Context Free Grammars. Many slides from Michael Collins

Context Free Grammars. Many slides from Michael Collins Context Free Grammars Many slides from Michael Collins Overview I An introduction to the parsing problem I Context free grammars I A brief(!) sketch of the syntax of English I Examples of ambiguous structures

More information

Web as Corpus. Corpus Linguistics. Web as Corpus 1 / 1. Corpus Linguistics. Web as Corpus. web.pl 3 / 1. Sketch Engine. Corpus Linguistics

Web as Corpus. Corpus Linguistics. Web as Corpus 1 / 1. Corpus Linguistics. Web as Corpus. web.pl 3 / 1. Sketch Engine. Corpus Linguistics (L615) Markus Dickinson Department of Linguistics, Indiana University Spring 2013 The web provides new opportunities for gathering data Viable source of disposable corpora, built ad hoc for specific purposes

More information

Cross Language Information Retrieval

Cross Language Information Retrieval Cross Language Information Retrieval RAFFAELLA BERNARDI UNIVERSITÀ DEGLI STUDI DI TRENTO P.ZZA VENEZIA, ROOM: 2.05, E-MAIL: BERNARDI@DISI.UNITN.IT Contents 1 Acknowledgment.............................................

More information

Leveraging Sentiment to Compute Word Similarity

Leveraging Sentiment to Compute Word Similarity Leveraging Sentiment to Compute Word Similarity Balamurali A.R., Subhabrata Mukherjee, Akshat Malu and Pushpak Bhattacharyya Dept. of Computer Science and Engineering, IIT Bombay 6th International Global

More information

have to be modeled) or isolated words. Output of the system is a grapheme-tophoneme conversion system which takes as its input the spelling of words,

have to be modeled) or isolated words. Output of the system is a grapheme-tophoneme conversion system which takes as its input the spelling of words, A Language-Independent, Data-Oriented Architecture for Grapheme-to-Phoneme Conversion Walter Daelemans and Antal van den Bosch Proceedings ESCA-IEEE speech synthesis conference, New York, September 1994

More information

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

Twitter Sentiment Classification on Sanders Data using Hybrid Approach IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 17, Issue 4, Ver. I (July Aug. 2015), PP 118-123 www.iosrjournals.org Twitter Sentiment Classification on Sanders

More information

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur Module 12 Machine Learning 12.1 Instructional Objective The students should understand the concept of learning systems Students should learn about different aspects of a learning system Students should

More information

The MEANING Multilingual Central Repository

The MEANING Multilingual Central Repository The MEANING Multilingual Central Repository J. Atserias, L. Villarejo, G. Rigau, E. Agirre, J. Carroll, B. Magnini, P. Vossen January 27, 2004 http://www.lsi.upc.es/ nlp/meaning Jordi Atserias TALP Index

More information

11/29/2010. Statistical Parsing. Statistical Parsing. Simple PCFG for ATIS English. Syntactic Disambiguation

11/29/2010. Statistical Parsing. Statistical Parsing. Simple PCFG for ATIS English. Syntactic Disambiguation tatistical Parsing (Following slides are modified from Prof. Raymond Mooney s slides.) tatistical Parsing tatistical parsing uses a probabilistic model of syntax in order to assign probabilities to each

More information

Lecture 1: Machine Learning Basics

Lecture 1: Machine Learning Basics 1/69 Lecture 1: Machine Learning Basics Ali Harakeh University of Waterloo WAVE Lab ali.harakeh@uwaterloo.ca May 1, 2017 2/69 Overview 1 Learning Algorithms 2 Capacity, Overfitting, and Underfitting 3

More information

Outline. Web as Corpus. Using Web Data for Linguistic Purposes. Ines Rehbein. NCLT, Dublin City University. nclt

Outline. Web as Corpus. Using Web Data for Linguistic Purposes. Ines Rehbein. NCLT, Dublin City University. nclt Outline Using Web Data for Linguistic Purposes NCLT, Dublin City University Outline Outline 1 Corpora as linguistic tools 2 Limitations of web data Strategies to enhance web data 3 Corpora as linguistic

More information

Exploration. CS : Deep Reinforcement Learning Sergey Levine

Exploration. CS : Deep Reinforcement Learning Sergey Levine Exploration CS 294-112: Deep Reinforcement Learning Sergey Levine Class Notes 1. Homework 4 due on Wednesday 2. Project proposal feedback sent Today s Lecture 1. What is exploration? Why is it a problem?

More information

Procedia - Social and Behavioral Sciences 141 ( 2014 ) WCLTA Using Corpus Linguistics in the Development of Writing

Procedia - Social and Behavioral Sciences 141 ( 2014 ) WCLTA Using Corpus Linguistics in the Development of Writing Available online at www.sciencedirect.com ScienceDirect Procedia - Social and Behavioral Sciences 141 ( 2014 ) 124 128 WCLTA 2013 Using Corpus Linguistics in the Development of Writing Blanka Frydrychova

More information

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17. Semi-supervised methods of text processing, and an application to medical concept extraction Yacine Jernite Text-as-Data series September 17. 2015 What do we want from text? 1. Extract information 2. Link

More information

Combining a Chinese Thesaurus with a Chinese Dictionary

Combining a Chinese Thesaurus with a Chinese Dictionary Combining a Chinese Thesaurus with a Chinese Dictionary Ji Donghong Kent Ridge Digital Labs 21 Heng Mui Keng Terrace Singapore, 119613 dhji @krdl.org.sg Gong Junping Department of Computer Science Ohio

More information

2.1 The Theory of Semantic Fields

2.1 The Theory of Semantic Fields 2 Semantic Domains In this chapter we define the concept of Semantic Domain, recently introduced in Computational Linguistics [56] and successfully exploited in NLP [29]. This notion is inspired by the

More information

CS Machine Learning

CS Machine Learning CS 478 - Machine Learning Projects Data Representation Basic testing and evaluation schemes CS 478 Data and Testing 1 Programming Issues l Program in any platform you want l Realize that you will be doing

More information

Lecture 1: Basic Concepts of Machine Learning

Lecture 1: Basic Concepts of Machine Learning Lecture 1: Basic Concepts of Machine Learning Cognitive Systems - Machine Learning Ute Schmid (lecture) Johannes Rabold (practice) Based on slides prepared March 2005 by Maximilian Röglinger, updated 2010

More information

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models Stephan Gouws and GJ van Rooyen MIH Medialab, Stellenbosch University SOUTH AFRICA {stephan,gvrooyen}@ml.sun.ac.za

More information

The stages of event extraction

The stages of event extraction The stages of event extraction David Ahn Intelligent Systems Lab Amsterdam University of Amsterdam ahn@science.uva.nl Abstract Event detection and recognition is a complex task consisting of multiple sub-tasks

More information

Genevieve L. Hartman, Ph.D.

Genevieve L. Hartman, Ph.D. Curriculum Development and the Teaching-Learning Process: The Development of Mathematical Thinking for all children Genevieve L. Hartman, Ph.D. Topics for today Part 1: Background and rationale Current

More information

EdIt: A Broad-Coverage Grammar Checker Using Pattern Grammar

EdIt: A Broad-Coverage Grammar Checker Using Pattern Grammar EdIt: A Broad-Coverage Grammar Checker Using Pattern Grammar Chung-Chi Huang Mei-Hua Chen Shih-Ting Huang Jason S. Chang Institute of Information Systems and Applications, National Tsing Hua University,

More information

Compositional Semantics

Compositional Semantics Compositional Semantics CMSC 723 / LING 723 / INST 725 MARINE CARPUAT marine@cs.umd.edu Words, bag of words Sequences Trees Meaning Representing Meaning An important goal of NLP/AI: convert natural language

More information

Ensemble Technique Utilization for Indonesian Dependency Parser

Ensemble Technique Utilization for Indonesian Dependency Parser Ensemble Technique Utilization for Indonesian Dependency Parser Arief Rahman Institut Teknologi Bandung Indonesia 23516008@std.stei.itb.ac.id Ayu Purwarianti Institut Teknologi Bandung Indonesia ayu@stei.itb.ac.id

More information

A Case Study: News Classification Based on Term Frequency

A Case Study: News Classification Based on Term Frequency A Case Study: News Classification Based on Term Frequency Petr Kroha Faculty of Computer Science University of Technology 09107 Chemnitz Germany kroha@informatik.tu-chemnitz.de Ricardo Baeza-Yates Center

More information

Distant Supervised Relation Extraction with Wikipedia and Freebase

Distant Supervised Relation Extraction with Wikipedia and Freebase Distant Supervised Relation Extraction with Wikipedia and Freebase Marcel Ackermann TU Darmstadt ackermann@tk.informatik.tu-darmstadt.de Abstract In this paper we discuss a new approach to extract relational

More information

The taming of the data:

The taming of the data: The taming of the data: Using text mining in building a corpus for diachronic analysis Stefania Degaetano-Ortlieb, Hannah Kermes, Ashraf Khamis, Jörg Knappen, Noam Ordan and Elke Teich Background Big data

More information

Word Segmentation of Off-line Handwritten Documents

Word Segmentation of Off-line Handwritten Documents Word Segmentation of Off-line Handwritten Documents Chen Huang and Sargur N. Srihari {chuang5, srihari}@cedar.buffalo.edu Center of Excellence for Document Analysis and Recognition (CEDAR), Department

More information

SEMAFOR: Frame Argument Resolution with Log-Linear Models

SEMAFOR: Frame Argument Resolution with Log-Linear Models SEMAFOR: Frame Argument Resolution with Log-Linear Models Desai Chen or, The Case of the Missing Arguments Nathan Schneider SemEval July 16, 2010 Dipanjan Das School of Computer Science Carnegie Mellon

More information

Vocabulary Usage and Intelligibility in Learner Language

Vocabulary Usage and Intelligibility in Learner Language Vocabulary Usage and Intelligibility in Learner Language Emi Izumi, 1 Kiyotaka Uchimoto 1 and Hitoshi Isahara 1 1. Introduction In verbal communication, the primary purpose of which is to convey and understand

More information

Derivational and Inflectional Morphemes in Pak-Pak Language

Derivational and Inflectional Morphemes in Pak-Pak Language Derivational and Inflectional Morphemes in Pak-Pak Language Agustina Situmorang and Tima Mariany Arifin ABSTRACT The objectives of this study are to find out the derivational and inflectional morphemes

More information

MULTILINGUAL INFORMATION ACCESS IN DIGITAL LIBRARY

MULTILINGUAL INFORMATION ACCESS IN DIGITAL LIBRARY MULTILINGUAL INFORMATION ACCESS IN DIGITAL LIBRARY Chen, Hsin-Hsi Department of Computer Science and Information Engineering National Taiwan University Taipei, Taiwan E-mail: hh_chen@csie.ntu.edu.tw Abstract

More information

A Comparison of Two Text Representations for Sentiment Analysis

A Comparison of Two Text Representations for Sentiment Analysis 010 International Conference on Computer Application and System Modeling (ICCASM 010) A Comparison of Two Text Representations for Sentiment Analysis Jianxiong Wang School of Computer Science & Educational

More information

THE ROLE OF DECISION TREES IN NATURAL LANGUAGE PROCESSING

THE ROLE OF DECISION TREES IN NATURAL LANGUAGE PROCESSING SISOM & ACOUSTICS 2015, Bucharest 21-22 May THE ROLE OF DECISION TREES IN NATURAL LANGUAGE PROCESSING MarilenaăLAZ R 1, Diana MILITARU 2 1 Military Equipment and Technologies Research Agency, Bucharest,

More information

Using dialogue context to improve parsing performance in dialogue systems

Using dialogue context to improve parsing performance in dialogue systems Using dialogue context to improve parsing performance in dialogue systems Ivan Meza-Ruiz and Oliver Lemon School of Informatics, Edinburgh University 2 Buccleuch Place, Edinburgh I.V.Meza-Ruiz@sms.ed.ac.uk,

More information

! # %& ( ) ( + ) ( &, % &. / 0!!1 2/.&, 3 ( & 2/ &,

! # %& ( ) ( + ) ( &, % &. / 0!!1 2/.&, 3 ( & 2/ &, ! # %& ( ) ( + ) ( &, % &. / 0!!1 2/.&, 3 ( & 2/ &, 4 The Interaction of Knowledge Sources in Word Sense Disambiguation Mark Stevenson Yorick Wilks University of Shef eld University of Shef eld Word sense

More information

SINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF)

SINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF) SINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF) Hans Christian 1 ; Mikhael Pramodana Agus 2 ; Derwin Suhartono 3 1,2,3 Computer Science Department,

More information

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks 1 Tzu-Hsuan Yang, 2 Tzu-Hsuan Tseng, and 3 Chia-Ping Chen Department of Computer Science and Engineering

More information

A Graph Based Authorship Identification Approach

A Graph Based Authorship Identification Approach A Graph Based Authorship Identification Approach Notebook for PAN at CLEF 2015 Helena Gómez-Adorno 1, Grigori Sidorov 1, David Pinto 2, and Ilia Markov 1 1 Center for Computing Research, Instituto Politécnico

More information

Natural Language Processing. George Konidaris

Natural Language Processing. George Konidaris Natural Language Processing George Konidaris gdk@cs.brown.edu Fall 2017 Natural Language Processing Understanding spoken/written sentences in a natural language. Major area of research in AI. Why? Humans

More information

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS Václav Kocian, Eva Volná, Michal Janošek, Martin Kotyrba University of Ostrava Department of Informatics and Computers Dvořákova 7,

More information

Short Text Understanding Through Lexical-Semantic Analysis

Short Text Understanding Through Lexical-Semantic Analysis Short Text Understanding Through Lexical-Semantic Analysis Wen Hua #1, Zhongyuan Wang 2, Haixun Wang 3, Kai Zheng #4, Xiaofang Zhou #5 School of Information, Renmin University of China, Beijing, China

More information

AQUA: An Ontology-Driven Question Answering System

AQUA: An Ontology-Driven Question Answering System AQUA: An Ontology-Driven Question Answering System Maria Vargas-Vera, Enrico Motta and John Domingue Knowledge Media Institute (KMI) The Open University, Walton Hall, Milton Keynes, MK7 6AA, United Kingdom.

More information

Prediction of Maximal Projection for Semantic Role Labeling

Prediction of Maximal Projection for Semantic Role Labeling Prediction of Maximal Projection for Semantic Role Labeling Weiwei Sun, Zhifang Sui Institute of Computational Linguistics Peking University Beijing, 100871, China {ws, szf}@pku.edu.cn Haifeng Wang Toshiba

More information

Online Updating of Word Representations for Part-of-Speech Tagging

Online Updating of Word Representations for Part-of-Speech Tagging Online Updating of Word Representations for Part-of-Speech Tagging Wenpeng Yin LMU Munich wenpeng@cis.lmu.de Tobias Schnabel Cornell University tbs49@cornell.edu Hinrich Schütze LMU Munich inquiries@cislmu.org

More information

Which verb classes and why? Research questions: Semantic Basis Hypothesis (SBH) What verb classes? Why the truth of the SBH matters

Which verb classes and why? Research questions: Semantic Basis Hypothesis (SBH) What verb classes? Why the truth of the SBH matters Which verb classes and why? ean-pierre Koenig, Gail Mauner, Anthony Davis, and reton ienvenue University at uffalo and Streamsage, Inc. Research questions: Participant roles play a role in the syntactic

More information

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

OCR for Arabic using SIFT Descriptors With Online Failure Prediction OCR for Arabic using SIFT Descriptors With Online Failure Prediction Andrey Stolyarenko, Nachum Dershowitz The Blavatnik School of Computer Science Tel Aviv University Tel Aviv, Israel Email: stloyare@tau.ac.il,

More information

The Internet as a Normative Corpus: Grammar Checking with a Search Engine

The Internet as a Normative Corpus: Grammar Checking with a Search Engine The Internet as a Normative Corpus: Grammar Checking with a Search Engine Jonas Sjöbergh KTH Nada SE-100 44 Stockholm, Sweden jsh@nada.kth.se Abstract In this paper some methods using the Internet as a

More information

Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages

Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages Nuanwan Soonthornphisaj 1 and Boonserm Kijsirikul 2 Machine Intelligence and Knowledge Discovery Laboratory Department of Computer

More information

POS tagging of Chinese Buddhist texts using Recurrent Neural Networks

POS tagging of Chinese Buddhist texts using Recurrent Neural Networks POS tagging of Chinese Buddhist texts using Recurrent Neural Networks Longlu Qin Department of East Asian Languages and Cultures longlu@stanford.edu Abstract Chinese POS tagging, as one of the most important

More information

Modeling function word errors in DNN-HMM based LVCSR systems

Modeling function word errors in DNN-HMM based LVCSR systems Modeling function word errors in DNN-HMM based LVCSR systems Melvin Jose Johnson Premkumar, Ankur Bapna and Sree Avinash Parchuri Department of Computer Science Department of Electrical Engineering Stanford

More information

A process by any other name

A process by any other name January 05, 2016 Roger Tregear A process by any other name thoughts on the conflicted use of process language What s in a name? That which we call a rose By any other name would smell as sweet. William

More information

1. Introduction. 2. The OMBI database editor

1. Introduction. 2. The OMBI database editor OMBI bilingual lexical resources: Arabic-Dutch / Dutch-Arabic Carole Tiberius, Anna Aalstein, Instituut voor Nederlandse Lexicologie Jan Hoogland, Nederlands Instituut in Marokko (NIMAR) In this paper

More information

Rule Learning With Negation: Issues Regarding Effectiveness

Rule Learning With Negation: Issues Regarding Effectiveness Rule Learning With Negation: Issues Regarding Effectiveness S. Chua, F. Coenen, G. Malcolm University of Liverpool Department of Computer Science, Ashton Building, Ashton Street, L69 3BX Liverpool, United

More information

(Sub)Gradient Descent

(Sub)Gradient Descent (Sub)Gradient Descent CMSC 422 MARINE CARPUAT marine@cs.umd.edu Figures credit: Piyush Rai Logistics Midterm is on Thursday 3/24 during class time closed book/internet/etc, one page of notes. will include

More information

Review in ICAME Journal, Volume 38, 2014, DOI: /icame

Review in ICAME Journal, Volume 38, 2014, DOI: /icame Review in ICAME Journal, Volume 38, 2014, DOI: 10.2478/icame-2014-0012 Gaëtanelle Gilquin and Sylvie De Cock (eds.). Errors and disfluencies in spoken corpora. Amsterdam: John Benjamins. 2013. 172 pp.

More information

Learning Methods in Multilingual Speech Recognition

Learning Methods in Multilingual Speech Recognition Learning Methods in Multilingual Speech Recognition Hui Lin Department of Electrical Engineering University of Washington Seattle, WA 98125 linhui@u.washington.edu Li Deng, Jasha Droppo, Dong Yu, and Alex

More information

Author: Justyna Kowalczys Stowarzyszenie Angielski w Medycynie (PL) Feb 2015

Author: Justyna Kowalczys Stowarzyszenie Angielski w Medycynie (PL)  Feb 2015 Author: Justyna Kowalczys Stowarzyszenie Angielski w Medycynie (PL) www.angielskiwmedycynie.org.pl Feb 2015 Developing speaking abilities is a prerequisite for HELP in order to promote effective communication

More information

THE VERB ARGUMENT BROWSER

THE VERB ARGUMENT BROWSER THE VERB ARGUMENT BROWSER Bálint Sass sass.balint@itk.ppke.hu Péter Pázmány Catholic University, Budapest, Hungary 11 th International Conference on Text, Speech and Dialog 8-12 September 2008, Brno PREVIEW

More information

Controlled vocabulary

Controlled vocabulary Indexing languages 6.2.2. Controlled vocabulary Overview Anyone who has struggled to find the exact search term to retrieve information about a certain subject can benefit from controlled vocabulary. Controlled

More information

Switchboard Language Model Improvement with Conversational Data from Gigaword

Switchboard Language Model Improvement with Conversational Data from Gigaword Katholieke Universiteit Leuven Faculty of Engineering Master in Artificial Intelligence (MAI) Speech and Language Technology (SLT) Switchboard Language Model Improvement with Conversational Data from Gigaword

More information

Intension, Attitude, and Tense Annotation in a High-Fidelity Semantic Representation

Intension, Attitude, and Tense Annotation in a High-Fidelity Semantic Representation Intension, Attitude, and Tense Annotation in a High-Fidelity Semantic Representation Gene Kim and Lenhart Schubert Presented by: Gene Kim April 2017 Project Overview Project: Annotate a large, topically

More information

Using Web Searches on Important Words to Create Background Sets for LSI Classification

Using Web Searches on Important Words to Create Background Sets for LSI Classification Using Web Searches on Important Words to Create Background Sets for LSI Classification Sarah Zelikovitz and Marina Kogan College of Staten Island of CUNY 2800 Victory Blvd Staten Island, NY 11314 Abstract

More information

ENGBG1 ENGBL1 Campus Linguistics. Meeting 2. Chapter 7 (Morphology) and chapter 9 (Syntax) Pia Sundqvist

ENGBG1 ENGBL1 Campus Linguistics. Meeting 2. Chapter 7 (Morphology) and chapter 9 (Syntax) Pia Sundqvist Meeting 2 Chapter 7 (Morphology) and chapter 9 (Syntax) Today s agenda Repetition of meeting 1 Mini-lecture on morphology Seminar on chapter 7, worksheet Mini-lecture on syntax Seminar on chapter 9, worksheet

More information

First Grade Standards

First Grade Standards These are the standards for what is taught throughout the year in First Grade. It is the expectation that these skills will be reinforced after they have been taught. Mathematical Practice Standards Taught

More information

INPE São José dos Campos

INPE São José dos Campos INPE-5479 PRE/1778 MONLINEAR ASPECTS OF DATA INTEGRATION FOR LAND COVER CLASSIFICATION IN A NEDRAL NETWORK ENVIRONNENT Maria Suelena S. Barros Valter Rodrigues INPE São José dos Campos 1993 SECRETARIA

More information

Cross-Lingual Text Categorization

Cross-Lingual Text Categorization Cross-Lingual Text Categorization Nuria Bel 1, Cornelis H.A. Koster 2, and Marta Villegas 1 1 Grup d Investigació en Lingüística Computacional Universitat de Barcelona, 028 - Barcelona, Spain. {nuria,tona}@gilc.ub.es

More information

Role of Pausing in Text-to-Speech Synthesis for Simultaneous Interpretation

Role of Pausing in Text-to-Speech Synthesis for Simultaneous Interpretation Role of Pausing in Text-to-Speech Synthesis for Simultaneous Interpretation Vivek Kumar Rangarajan Sridhar, John Chen, Srinivas Bangalore, Alistair Conkie AT&T abs - Research 180 Park Avenue, Florham Park,

More information

1 st Quarter (September, October, November) August/September Strand Topic Standard Notes Reading for Literature

1 st Quarter (September, October, November) August/September Strand Topic Standard Notes Reading for Literature 1 st Grade Curriculum Map Common Core Standards Language Arts 2013 2014 1 st Quarter (September, October, November) August/September Strand Topic Standard Notes Reading for Literature Key Ideas and Details

More information

A Neural Network GUI Tested on Text-To-Phoneme Mapping

A Neural Network GUI Tested on Text-To-Phoneme Mapping A Neural Network GUI Tested on Text-To-Phoneme Mapping MAARTEN TROMPPER Universiteit Utrecht m.f.a.trompper@students.uu.nl Abstract Text-to-phoneme (T2P) mapping is a necessary step in any speech synthesis

More information

Modeling function word errors in DNN-HMM based LVCSR systems

Modeling function word errors in DNN-HMM based LVCSR systems Modeling function word errors in DNN-HMM based LVCSR systems Melvin Jose Johnson Premkumar, Ankur Bapna and Sree Avinash Parchuri Department of Computer Science Department of Electrical Engineering Stanford

More information

First Grade Curriculum Highlights: In alignment with the Common Core Standards

First Grade Curriculum Highlights: In alignment with the Common Core Standards First Grade Curriculum Highlights: In alignment with the Common Core Standards ENGLISH LANGUAGE ARTS Foundational Skills Print Concepts Demonstrate understanding of the organization and basic features

More information

UNIVERSITY OF OSLO Department of Informatics. Dialog Act Recognition using Dependency Features. Master s thesis. Sindre Wetjen

UNIVERSITY OF OSLO Department of Informatics. Dialog Act Recognition using Dependency Features. Master s thesis. Sindre Wetjen UNIVERSITY OF OSLO Department of Informatics Dialog Act Recognition using Dependency Features Master s thesis Sindre Wetjen November 15, 2013 Acknowledgments First I want to thank my supervisors Lilja

More information

Indian Institute of Technology, Kanpur

Indian Institute of Technology, Kanpur Indian Institute of Technology, Kanpur Course Project - CS671A POS Tagging of Code Mixed Text Ayushman Sisodiya (12188) {ayushmn@iitk.ac.in} Donthu Vamsi Krishna (15111016) {vamsi@iitk.ac.in} Sandeep Kumar

More information

Concepts and Properties in Word Spaces

Concepts and Properties in Word Spaces Concepts and Properties in Word Spaces Marco Baroni 1 and Alessandro Lenci 2 1 University of Trento, CIMeC 2 University of Pisa, Department of Linguistics Abstract Properties play a central role in most

More information

ESSLLI 2010: Resource-light Morpho-syntactic Analysis of Highly

ESSLLI 2010: Resource-light Morpho-syntactic Analysis of Highly ESSLLI 2010: Resource-light Morpho-syntactic Analysis of Highly Inflected Languages Classical Approaches to Tagging The slides are posted on the web. The url is http://chss.montclair.edu/~feldmana/esslli10/.

More information

On Human Computer Interaction, HCI. Dr. Saif al Zahir Electrical and Computer Engineering Department UBC

On Human Computer Interaction, HCI. Dr. Saif al Zahir Electrical and Computer Engineering Department UBC On Human Computer Interaction, HCI Dr. Saif al Zahir Electrical and Computer Engineering Department UBC Human Computer Interaction HCI HCI is the study of people, computer technology, and the ways these

More information

Defragmenting Textual Data by Leveraging the Syntactic Structure of the English Language

Defragmenting Textual Data by Leveraging the Syntactic Structure of the English Language Defragmenting Textual Data by Leveraging the Syntactic Structure of the English Language Nathaniel Hayes Department of Computer Science Simpson College 701 N. C. St. Indianola, IA, 50125 nate.hayes@my.simpson.edu

More information

A Minimalist Approach to Code-Switching. In the field of linguistics, the topic of bilingualism is a broad one. There are many

A Minimalist Approach to Code-Switching. In the field of linguistics, the topic of bilingualism is a broad one. There are many Schmidt 1 Eric Schmidt Prof. Suzanne Flynn Linguistic Study of Bilingualism December 13, 2013 A Minimalist Approach to Code-Switching In the field of linguistics, the topic of bilingualism is a broad one.

More information

Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments

Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments Cristina Vertan, Walther v. Hahn University of Hamburg, Natural Language Systems Division Hamburg,

More information

Basic Parsing with Context-Free Grammars. Some slides adapted from Julia Hirschberg and Dan Jurafsky 1

Basic Parsing with Context-Free Grammars. Some slides adapted from Julia Hirschberg and Dan Jurafsky 1 Basic Parsing with Context-Free Grammars Some slides adapted from Julia Hirschberg and Dan Jurafsky 1 Announcements HW 2 to go out today. Next Tuesday most important for background to assignment Sign up

More information

University of Alberta. Large-Scale Semi-Supervised Learning for Natural Language Processing. Shane Bergsma

University of Alberta. Large-Scale Semi-Supervised Learning for Natural Language Processing. Shane Bergsma University of Alberta Large-Scale Semi-Supervised Learning for Natural Language Processing by Shane Bergsma A thesis submitted to the Faculty of Graduate Studies and Research in partial fulfillment of

More information

Part III: Semantics. Notes on Natural Language Processing. Chia-Ping Chen

Part III: Semantics. Notes on Natural Language Processing. Chia-Ping Chen Part III: Semantics Notes on Natural Language Processing Chia-Ping Chen Department of Computer Science and Engineering National Sun Yat-Sen University Kaohsiung, Taiwan ROC Part III: Semantics p. 1 Introduction

More information

Grammars & Parsing, Part 1:

Grammars & Parsing, Part 1: Grammars & Parsing, Part 1: Rules, representations, and transformations- oh my! Sentence VP The teacher Verb gave the lecture 2015-02-12 CS 562/662: Natural Language Processing Game plan for today: Review

More information

Maximizing Learning Through Course Alignment and Experience with Different Types of Knowledge

Maximizing Learning Through Course Alignment and Experience with Different Types of Knowledge Innov High Educ (2009) 34:93 103 DOI 10.1007/s10755-009-9095-2 Maximizing Learning Through Course Alignment and Experience with Different Types of Knowledge Phyllis Blumberg Published online: 3 February

More information

Netpix: A Method of Feature Selection Leading. to Accurate Sentiment-Based Classification Models

Netpix: A Method of Feature Selection Leading. to Accurate Sentiment-Based Classification Models Netpix: A Method of Feature Selection Leading to Accurate Sentiment-Based Classification Models 1 Netpix: A Method of Feature Selection Leading to Accurate Sentiment-Based Classification Models James B.

More information

Can Human Verb Associations help identify Salient Features for Semantic Verb Classification?

Can Human Verb Associations help identify Salient Features for Semantic Verb Classification? Can Human Verb Associations help identify Salient Features for Semantic Verb Classification? Sabine Schulte im Walde Institut für Maschinelle Sprachverarbeitung Universität Stuttgart Seminar für Sprachwissenschaft,

More information