SATANJEEV (Bano) BANERJEE School Other Website:

Similar documents
Xinyu Tang. Education. Research Interests. Honors and Awards. Professional Experience

ReinForest: Multi-Domain Dialogue Management Using Hierarchical Policies and Knowledge Ontology

Top US Tech Talent for the Top China Tech Company

Stephanie Ann Siler. PERSONAL INFORMATION Senior Research Scientist; Department of Psychology, Carnegie Mellon University

Speech Translation for Triage of Emergency Phonecalls in Minority Languages

Using dialogue context to improve parsing performance in dialogue systems

Word Sense Disambiguation

Spoken Language Parsing Using Phrase-Level Grammars and Trainable Classifiers

MASTER OF SCIENCE (M.S.) MAJOR IN COMPUTER SCIENCE

EECS 571 PRINCIPLES OF REAL-TIME COMPUTING Fall 10. Instructor: Kang G. Shin, 4605 CSE, ;

Speech Recognition at ICSI: Broadcast News and beyond

Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for

Vocabulary Usage and Intelligibility in Learner Language

Leveraging Sentiment to Compute Word Similarity

Multi-modal Sensing and Analysis of Poster Conversations toward Smart Posterboard

Assessing System Agreement and Instance Difficulty in the Lexical Sample Tasks of SENSEVAL-2

On-Line Data Analytics

CHAT To Your Destination

Effect of Word Complexity on L2 Vocabulary Learning

COURSE LISTING. Courses Listed. Training for Cloud with SAP SuccessFactors in Integration. 23 November 2017 (08:13 GMT) Beginner.

Robust Sense-Based Sentiment Classification

A new Dataset of Telephone-Based Human-Human Call-Center Interaction with Emotional Evaluation

FRESNO COUNTY INTELLIGENT TRANSPORTATION SYSTEMS (ITS) PLAN UPDATE

Evaluation of a Simultaneous Interpretation System and Analysis of Speech Log for User Experience Assessment

Metadata of the chapter that will be visualized in SpringerLink

AQUA: An Ontology-Driven Question Answering System

Welcome to. ECML/PKDD 2004 Community meeting

Guru: A Computer Tutor that Models Expert Human Tutors

Stochastic Calculus for Finance I (46-944) Spring 2008 Syllabus

Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks

Student Information System. Parent Quick Start Guide

COMM370, Social Media Advertising Fall 2017

CWIS 23,3. Nikolaos Avouris Human Computer Interaction Group, University of Patras, Patras, Greece

ANNUAL CURRICULUM REVIEW PROCESS for the 2016/2017 Academic Year

Hongyan Ma. University of California, Los Angeles

Collaboration Tier 1

Different Requirements Gathering Techniques and Issues. Javaria Mushtaq

Interaction Design Considerations for an Aircraft Carrier Deck Agent-based Simulation

International Student Prospectus 2015/2016. EduSpiral Consultant Services For more info call

BUS Computer Concepts and Applications for Business Fall 2012

Web as Corpus. Corpus Linguistics. Web as Corpus 1 / 1. Corpus Linguistics. Web as Corpus. web.pl 3 / 1. Sketch Engine. Corpus Linguistics

A study of speaker adaptation for DNN-based speech synthesis

Curriculum Vitae FARES FRAIJ, Ph.D. Lecturer

Semi-Supervised GMM and DNN Acoustic Model Training with Multi-system Combination and Confidence Re-calibration

Rule discovery in Web-based educational systems using Grammar-Based Genetic Programming

McGraw-Hill Connect and Create Built by Blackboard. Release Notes. Version 2.3 for Blackboard Learn 9.1

5935 Clarkston Road Clarkston, MI (248) , (248)

A Coding System for Dynamic Topic Analysis: A Computer-Mediated Discourse Analysis Technique

INTERMEDIATE ALGEBRA Course Syllabus

Atypical Prosodic Structure as an Indicator of Reading Level and Text Difficulty

Regression for Sentence-Level MT Evaluation with Pseudo References

arxiv: v1 [cs.cl] 2 Apr 2017

A cognitive perspective on pair programming

Julie Gawrylowicz. Personal Statement and Research Interests

University Faculty Details Page on DU Web-site


Dialog Act Classification Using N-Gram Algorithms

Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments

Contact Information 345 Mell Ave Atlanta, GA, Phone Number:

UCEAS: User-centred Evaluations of Adaptive Systems

Macromedia University Bachelor of Arts (B.A.) Programme Information

A Note on Structuring Employability Skills for Accounting Students

Getting the Story Right: Making Computer-Generated Stories More Entertaining

A Case Study: News Classification Based on Term Frequency

Modeling function word errors in DNN-HMM based LVCSR systems

PREDICTING SPEECH RECOGNITION CONFIDENCE USING DEEP LEARNING WITH WORD IDENTITY AND SCORE FEATURES

IMPORTANT: PLEASE READ THE FOLLOWING DIRECTIONS CAREFULLY PRIOR TO PREPARING YOUR APPLICATION PACKAGE.

Chemical Engineering Mcgill Cegep Entry

Pair Programming: When and Why it Works

A MULTI-AGENT SYSTEM FOR A DISTANCE SUPPORT IN EDUCATIONAL ROBOTICS

A BLENDED MODEL FOR NON-TRADITIONAL TEACHING AND LEARNING OF MATHEMATICS

Bachelor Class

Instructor Experience and Qualifications Professor of Business at NDNU; Over twenty-five years of experience in teaching undergraduate students.

Aclara is committed to improving your TWACS technical training experience as well as allowing you to be safe, efficient, and successful.

Learning Methods in Multilingual Speech Recognition

Autoregressive product of multi-frame predictions can improve the accuracy of hybrid models

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining

Eye Movements in Speech Technologies: an overview of current research

ZACHARY J. OSTER CURRICULUM VITAE

Courses in English. Application Development Technology. Artificial Intelligence. 2017/18 Spring Semester. Database access

Emotional Variation in Speech-Based Natural Language Generation

SkillPort Quick Start Guide 7.0

Math 181, Calculus I

medicaid and the How will the Medicaid Expansion for Adults Impact Eligibility and Coverage? Key Findings in Brief

International House VANCOUVER / WHISTLER WORK EXPERIENCE

MS-431 The Cold War Aerospace Technology Oral History Project. Creator: Wright State University. Department of Archives and Special Collections

GRADUATE STUDENT HANDBOOK Master of Science Programs in Biostatistics

Great Teachers, Great Leaders: Developing a New Teaching Framework for CCSD. Updated January 9, 2013

THE M.A. DEGREE Revised 1994 Includes All Further Revisions Through May 2012

West Georgia RESA 99 Brown School Drive Grantville, GA

Modelling interaction during small-group synchronous problem-solving activities: The Synergo approach.

Word Segmentation of Off-line Handwritten Documents

K5 Math Practice. Free Pilot Proposal Jan -Jun Boost Confidence Increase Scores Get Ahead. Studypad, Inc.

Re-evaluating the Role of Bleu in Machine Translation Research

95723 Managing Disruptive Technologies

Getting Results Continuous Improvement Plan

AGENDA LEARNING THEORIES LEARNING THEORIES. Advanced Learning Theories 2/22/2016

Sample Letter Of Teamwork Recommendation

Assignment 1: Predicting Amazon Review Ratings

Transcription:

SATANJEEV (Bano) BANERJEE School email: banerjee@cs.cmu.edu Other email: satanjeev@gmail.com Website: http://www.cs.cmu.edu/~banerjee School address 5000 Forbes Avenue, 6223 GHC Pittsburgh, PA-15213 Home address 5562 Hobart Street, #519 Pittsburgh, PA-15217 Areas of Interest: Extracting and structuring semantic information from spoken language; adapting to individual users and user groups by automatically interpreting user behavior as feedback; speech summarization; natural language processing. Education: PhD in Language Technologies: Language Technologies Institute, School of Computer Science Tentative thesis: Using Implicit Human Supervision to Automatically Understand Meetings Master of Language Technologies: Language Technologies Institute, School of Computer Science Result: GPA: 3.96/4.00 Master of Science, Computer Science: University of Minnesota, Duluth Result: GPA: 3.964/4.000 Thesis: Adapting the Lesk Algorithm for Word Sense Disambiguation to WordNet. Bachelor of Engineering, Computer Engineering: University of Pune, India Result: First Class with Distinction May 2010 (expected) Dec 2004 Dec 2002 May 2000 Work Experience: Graduate Research Fellowships/Assistantships: Working with Dr. Rudnicky on PhD thesis: Extracting implicit supervision from normal human interaction with systems through passive observation and active querying. Exploring this topic within the context of assisting meeting participants in taking notes, and learning to do so from their notetaking behavior in previous meetings. Worked with Dr. Lavie on developing METEOR, a new metric for automatic machine translation evaluation. Worked with Dr. Mostow and Dr. Beck on learning language models from speech recognition errors. Worked with Dr. Ted Pedersen on word sense disambiguation. Created and released Ngram Statistics Package and Sense Tools. Aug 2003 2004 2005 Jun 2002 Aug 2003 Sep 2000 May 2002 Summer Internship at Microsoft Research, Redmond: Worked with the Speech Research Group on improving a phone-based dialog system. Summer 2007 Graduate Teaching Assistantships: For undergraduate course on Data Structures and Algorithms in Computer Spring 2006 Page 1 of 5

Science. Held weekly recitations for 30 students, and designed and graded homework programming assignments. For undergraduate course on Entrepreneurship in Computer Science. Mentored student projects. Fall 2006 Publications: All papers are available from my webpage: http://www.cs.cmu.edu/~banerjee/publications Most Cited Papers: Banerjee and Pedersen: Extended Gloss Overlaps as a Measure of Semantic Relatedness. In Proceedings of the 18 th International Conference on Artificial Intelligence (IJCAI 03). Aug 9-15, 2003, Acapulco, Mexico. [200+ citations]. Banerjee and Lavie: METEOR: An Automatic Metric for MT Evaluation with Improved Correlation with Human Judgments. In the Workshop on Intrinsic and Extrinsic Evaluation Measures for MT and/or Summarization at the 43 rd Annual Meeting of the Association of Computational Linguistics, Ann Arbor, MI, Jun 2005. [200+ citations]. Banerjee and Pedersen: The Design, Implementation and Use of the Ngram Statistics Package. In Proceedings of the 4 th International Conference on Intelligent Text Processing and Computational Linguistics (CICLING 03). Feb 17-21, 2003, Mexico City, Mexico. [100+ citations]. Banerjee, Rose, and Rudnicky: The Necessity of a Meeting Recording and Playback System, and the Benefit of Topic-Level Annotations to Meeting Browsing. In Proceedings of the 10 th Conference on Human-Computer Interaction. Sep, 2005, Rome. [30+ citations]. Banerjee and Rudnicky: Using Simple Speech-Based Features to Detect the State of a Meeting and the Roles of the Meeting Participants. In Proceedings of the 8 th International Conference on Spoken Language Processing (Interspeech 2004 - ICSLP). Oct 4-8, 2004, Jeju Island, Korea. [30+ citations]. Other Papers, by Topic: On Meeting Understanding: Banerjee and Rudnicky: Extracting Implicit Supervision from Meeting Notes to Train a Noteworthy-Utterance Detection Model. In submission. Banerjee and Rudnicky: Detecting the Noteworthiness of Utterances in Human Meetings. In Proceedings of the 2009 Conference of the Special Interest Group on Discourse and Dialogue (SIGDial). Sep 11-12, 2009, London, UK. Banerjee and Rudnicky: An Extractive-Summarization Baseline for the Automatic Detection of Noteworthy Utterances in Multi-Party Human-Human Dialog. In Proceedings of the 2008 IEEE Workshop on Spoken Language Technology. Dec 15-18, 2008, Goa, India. Banerjee and Rudnicky: Segmenting Meetings into Agenda Items by Extracting Implicit Supervision from Human Note-Taking. In Proceedings of the 2007 Conference on Intelligent User Interfaces. Jan 28-31, 2007, Hawaii. Banerjee and Rudnicky: TextTiling Based Approach to Topic Boundary Detection in Meetings. In Proceedings of the 2006 Interspeech (ICSLP) Conference. Sep 17-21, 2006, Pittsburgh, PA. Page 2 of 5

Publications Continued: Banerjee and Rudnicky: SmartNotes: Implicit Labeling of Meeting Data through User Note-Taking and Browsing. In Proceedings of the Conference of the North American Association of Computational Linguistics - Human Languages Technology - Demonstration Track. Jun 5-7, 2006, New York, NY. Banerjee and Rudnicky: You Are What You Say: Using Meeting Participants Speech to Detect their Roles and Expertise. In the NAACL-HLT Workshop on Analyzing Conversations in Text and Speech, Jun 8, 2006, New York, NY. Rudnicky, et al: Intelligently Integrating Information from Speech and Vision Processing to Perform Light-Weight Meeting Understanding. In the Workshop on Multimodal Multiparty Meeting Processing, Oct 7, 2005, Trento, Italy. Banerjee and Rudnicky: Aspects of the Virtuality Continuum and Multi-Participant Interaction Modeling in the Artificial Agent-Assisted Meeting Scenario. In the workshop on The Virtuality Continuum Revisited, Apr 2-7, 2005, Portland, OR. Rybski, et al: Segmentation and Classification of Meetings using Multiple Information Streams. In Proceedings of the 6 th International Conference on Multimodal Interfaces, Oct 14-15, 2004, State College, PA. Banerjee, et al: Creating Multi-Modal, User-Centric Records of Meetings with the Carnegie Mellon Meeting Recorder Architecture. In the ICASSP 2004 Workshop on Meeting Recognition, May 17, 2004, Montreal, Canada. On Word Sense Disambiguation: Patwardhan, Banerjee and Pedersen: SenseRelate::TargetWord - A Generalized Framework for Word Sense Disambiguation. In the 20 th National Conference on Artificial Intelligence - Demonstration track. Jul 2005, Pittsburgh, PA. Also in Proceedings of the Demonstration and Interactive Poster Session of the 43 rd Annual Meeting of the Association for Computational Linguistics. Jun 26, 2005, Ann Arbor, MI. Patwardhan, Banerjee and Pedersen: Using Measures of Semantic Relatedness for Word Sense Disambiguation. In Proceedings of the 4 th International Conference on Intelligent Text Processing and Computational Linguistics (CICLING 03). Feb 17-21, 2003, Mexico City, Mexico. Banerjee and Pedersen: An Adapted Lesk Approach to Word Sense Disambiguation using WordNet. In Proceedings of the 3 rd International Conference on Intelligent Text Processing and Computational Linguistics (CICLING 02). Feb 17-21, 2003, Mexico City, Mexico. On Other Topics: Marge, Banerjee and Rudnicky: Using the Amazon Mechanical Turk for Transcription of Spoken Language. To appear in Proceedings of the 2010 IEEE International Conference on Acoustics, Speech and Spoken Language (ICASSP). March 14-19, 2010, Dallas, TX. Harris, Banerjee and Rudnicky: Heterogeneous Multi-Robot Dialogues for Search Tasks. In the AAAI Spring Symposium: Dialogical Robots: Verbal Interaction with Embodied Agents and Situated Devices, March 21-23, 2005, Stanford, CA. Harris, Banerjee, et al: A Research Platform for Multi-Agent Dialogue Dynamics. In the 13th International Workshop on Robot and Human Interactive Communication (RO-MAN), September 20-22, 2004, Kurashiki, Japan. Page 3 of 5

Publications Continued: Banerjee, et al: Improving Language Models by Learning from Speech Recognition Errors in a Reading Tutor that Listens. In Proceedings of the Second International Conference on Applied Artificial Intelligence, December 15-16, 2003, Kolhapur, India. Banerjee, Beck and Mostow: Evaluating the Effect of Predicting Oral Reading Miscues. In Proceedings of the Eighth European Conference on Speech Communication and Technology (Eurospeech-03), September 1-4, 2003, Geneva, Switzerland. Tam, et al: Training a Confidence Measure for a Reading Tutor that Listens. In Proceedings of the Eighth European Conference on Speech Communication and Technology (Eurospeech-03), September 1-4, 2003, Geneva, Switzerland. Released Software: The SmartNotes system. This program helps meeting participants record audio and take notes during meetings, and share and access them afterwards. It is the implementation of my PhD thesis, and has been used for meeting recording and research purposes for other projects at CMU. Webpage: http://ww.cmusmartnotes.org METEOR System for Automatic Evaluation of Machine Translation. Co-developed the original version of this system with Dr. Alon Lavie for automatic evaluation of machine translation. This has become a popular metric in the translation community. Webpage: http://www.cs.cmu.edu/~alavie/meteor Ngram Statistics Package. Co-developed the original version of this Perl toolkit with Dr. Ted Pedersen for detecting collocations from text corpora using various statistical methods. This toolkit is used widely for both research and education. Webpage: http://ww.d.umn.edu/~tpederse/code.html. SenseTools. Co-developed this Perl toolkit that perform word sense disambiguation using a supervised approach. Webpage of the software: www.d.umn.edu/~tpederse/code.html. Computer Skills: Languages: Proficient in Java, Perl, CGI, C. Medium proficiency in JavaScript, C#, C++. Toolkits: Proficient in Weka Machine Learning Toolkit, Amazon Mechanical Turk. IDEs: Proficient in Eclipse. Medium proficiency in Visual Studio. Operating systems worked on: Windows 2000, XP, Vista, 7, and UNIX (Solaris, Linux). Professional Activities: IEEE SLTC Newsletter. I am a Senior Staff Reporter at the IEEE Speech and Language Technical Committee s quarterly newsletter. I regularly contribute articles on new developments in speech research, and mentor junior reporters. Served as Paper Reviewer. Reviewed papers for journals such as Transactions on Pattern Analysis and Machine Intelligence (TPAMI), and Knowledge Engineering Review (KER), and for conferences such as Multimodal Interaction (ICMI), Natural Language Processing (ICON) and Hawaii Conference on System Sciences. Organized Young Researchers Roundtable on Spoken Dialog System. Co-organized the first annual Young Researchers Roundtable on Spoken Dialog System workshop. This is now an annually held research workshop. Conference Roommate Finding Service. Co-created and ran a web-based service to help conference attendees find roommates with whom to share their hotel rooms. This service was deployed at the Interspeech conferences in 2006, and 2007. 2005 2004 2005 2006 2007 Page 4 of 5

Other Projects: Artificial Intelligence Based Othello Player. Designed and implemented the game engine to play the board game Othello against human opponents. This program was awarded the first and second prizes at two separate inter-college competitions in India. 1998 References: Alexander I. Rudnicky Principal Systems Scientist Email: air@cs.cmu.edu Web: http://www.cs.cmu.edu/~air Alon Lavie Associate Research Professor Email: alavie@cs.cmu.edu Web: http://www.cs.cmu.edu/~alavie Carolyn Rosé Assistant Professor Email: cprose@cs.cmu.edu Web: http://www.cs.cmu.edu/~cprose Ted Pedersen Professor University of Minnesota Duluth Email: tpederse@d.umn.edu Web: http://www.d.umn.edu/~tpederse Page 5 of 5