DSAP - Digital Speech and Audio Processing

Coordinating unit: 230 - ETSETB - Barcelona School of Telecommunications Engineering
Teaching unit: 739 - TSC - Department of Signal Theory and Communications
Academic year: 2017
Degree:
- DEGREE IN TELECOMMUNICATIONS ENGINEERING (Syllabus 1992). (Teaching unit Optional)
- MASTER'S DEGREE IN INFORMATION AND COMMUNICATION TECHNOLOGIES (Syllabus 2009). (Teaching unit Optional)
- MASTER'S DEGREE IN TELECOMMUNICATIONS ENGINEERING (Syllabus 2013). (Teaching unit Optional)
ECTS credits: 5
Teaching languages: English

Teaching staff
Coordinator: Climent Nadeu
Others: Antonio Bonafonte, Javier Hernando

Opening hours
Timetable: Tuesday and Thursday from 10:00 to 13:00

Prior skills
Signal processing

Requirements
Signal processing

Degree competences to which the subject contributes

Specific:
1. Ability to apply information theory methods, adaptive modulation and channel coding, as well as advanced techniques of digital signal processing to communication and audiovisual systems.

Transversal:
2. TEAMWORK: Being able to work in an interdisciplinary team, whether as a member or as a leader, with the aim of contributing to projects pragmatically and responsibly and making commitments in view of the resources that are available.
3. EFFECTIVE USE OF INFORMATION RESOURCES: Managing the acquisition, structuring, analysis and display of data and information in the chosen area of specialisation and critically assessing the results obtained.
4. FOREIGN LANGUAGE: Achieving a level of spoken and written proficiency in a foreign language, preferably English, that meets the needs of the profession and the labour market.

Teaching methodology
- Lectures (50%)
- Application classes (with Matlab or similar) (50%)
- Team work: project, presentation
- Individual work: preparation and completion (outside the classroom) of application activities

Learning objectives of the subject

Understanding and being competent in a relevant set of concepts and techniques in the field of digital audio processing, and their application to problems arising from real applications. In particular, speech and music signals and applications will be considered.

Learning results: Ability to digitally process, in an application-oriented context, audio and speech signals, in order to analyze, model, extract information from, clean, modify, and generate/synthesize them.

Study load
Total learning time: 125h
- Hours large group: 39h (31.20%)
- Hours medium group, hours small group, guided activities: no hours listed
- Self study: 86h (68.80%)

Content

1. Introduction
- Course presentation
- Audio diversity
- Characteristics of speech and music. Production model
- Hearing and auditory modeling
- The short-time Fourier transform

2. Short-term analysis-synthesis of (quasi)periodic signals
- Filter-bank analysis/synthesis. The phase vocoder
- Filter-bank and spectrogram
- Time-scale and pitch modification
- QMF filters. MP3 coding

3. Modeling and representation of speech signals
- Production-based all-pole modeling
- Pitch determination for speech and music
- LPC-based coding used in mobile telephony
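As a rough illustration of the production-based all-pole modeling listed under topic 3, the following Python sketch estimates linear-prediction (LPC) coefficients with the autocorrelation method and the Levinson-Durbin recursion. The function names, the second-order test filter, and the use of Python instead of the Matlab mentioned in the teaching methodology are assumptions made for this example; it is not the course's own material.

```python
def autocorrelation(frame, max_lag):
    """Autocorrelation r[0..max_lag] of one analysis frame (no normalization)."""
    n = len(frame)
    return [sum(frame[i] * frame[i + lag] for i in range(n - lag))
            for lag in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Solve the autocorrelation normal equations recursively.

    Returns (a, err) with a[0] = 1, where the prediction-error filter is
    A(z) = 1 + a[1] z^-1 + ... + a[order] z^-order and the all-pole model
    is 1 / A(z). err is the final prediction-error energy.
    Assumes r[0] > 0 (a non-silent frame).
    """
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err                      # reflection coefficient
        new_a = a[:]
        for j in range(1, i):
            new_a[j] = a[j] + k * a[i - j]
        new_a[i] = k
        a = new_a
        err *= (1.0 - k * k)
    return a, err


def all_pole_impulse_response(a, n):
    """First n samples of the impulse response of 1 / A(z)."""
    h = []
    for t in range(n):
        y = 1.0 if t == 0 else 0.0
        for j in range(1, len(a)):
            if t - j >= 0:
                y -= a[j] * h[t - j]
        h.append(y)
    return h


# Hypothetical test filter: a stable second-order resonator.
true_a = [1.0, -1.3, 0.6]
signal = all_pole_impulse_response(true_a, 400)
estimated_a, error_energy = levinson_durbin(autocorrelation(signal, 2), 2)
print(estimated_a, error_energy)
```

On this toy signal the estimated coefficients should closely match `true_a`, because the autocorrelation of a decayed all-pole impulse response satisfies the same normal equations the recursion solves. Real speech frames would normally be windowed first and modeled with a higher order.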

4. Enhancement of speech and audio signals
- Cancellation: echo, interference
- Denoising: spectral subtraction, Wiener-based filtering, wavelets
- Blind source separation: ICA, CASA, NMF

5. Multi-microphone audio processing
- Room acoustics
- Array beamforming
- Acoustic source localization and tracking

6. Recognition and detection of audio and speech
- Pattern-matching approaches
- Audio activity detection
- Application to speech and speaker recognition
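To make the spectral subtraction item under topic 4 more concrete, here is a minimal single-frame sketch in Python. It subtracts a noise magnitude estimate from the noisy magnitude spectrum, applies a spectral floor, and resynthesizes with the noisy phase. The naive DFT helpers, the `floor` parameter, and the toy signal (a tone disturbed by a second, weaker tone standing in for noise) are all assumptions chosen to keep the example self-contained; a practical system would use an FFT library, overlapping windowed frames, and a noise estimate taken from speech pauses.

```python
import cmath
import math


def dft(x):
    """Naive discrete Fourier transform (O(n^2), for illustration only)."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]


def idft(spectrum):
    """Naive inverse DFT, returning the real part of each sample."""
    n = len(spectrum)
    return [sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n)
                for k in range(n)).real / n
            for t in range(n)]


def spectral_subtraction(noisy_frame, noise_mag, floor=0.02):
    """Magnitude spectral subtraction on one frame, keeping the noisy phase."""
    cleaned = []
    for value, n_mag in zip(dft(noisy_frame), noise_mag):
        mag = abs(value)
        # The floor keeps magnitudes non-negative after subtraction.
        new_mag = max(mag - n_mag, floor * mag)
        cleaned.append(cmath.rect(new_mag, cmath.phase(value)))
    return idft(cleaned)


# Toy data: the "speech" is a tone in bin 4, the "noise" a weaker tone in bin 10.
N = 64
clean = [math.cos(2 * math.pi * 4 * t / N) for t in range(N)]
noise = [0.2 * math.cos(2 * math.pi * 10 * t / N + 0.7) for t in range(N)]
noisy = [c + v for c, v in zip(clean, noise)]

noise_mag = [abs(v) for v in dft(noise)]   # idealized noise estimate
enhanced = spectral_subtraction(noisy, noise_mag)

err_before = sum((a - b) ** 2 for a, b in zip(noisy, clean))
err_after = sum((a - b) ** 2 for a, b in zip(enhanced, clean))
print(err_before, err_after)
```

Because the noise estimate here is exact and the two tones fall in different bins, the residual error is far smaller than it would be with real, randomly varying noise.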

Project realization and presentation
Learning time: 54h
- Theory classes: 3h
- Self study: 51h
Design, implementation and test of an audio processing system for a specific application.
Oral presentation of 1) project proposal, and 2) project realization.

Qualification system
- Attendance/participation in class (10%)
- Tests (30%)
- Project (50%)
- Presentation (10%)

Bibliography

Basic:
- Quatieri, T.F. Discrete-time speech signal processing: principles and practice. Upper Saddle River, NJ: Prentice Hall, 2002. ISBN 013242942X.
- Gold, B.; Morgan, N.; Ellis, D. Speech and audio signal processing: processing and perception of speech and music. 2nd rev. ed. Wiley-Blackwell, 2011. ISBN 978-0-470-19536-9.
- Dutoit, T.; Marqués, F.; Rabiner, L.R. Applied signal processing: a MATLAB-based proof of concept. New York; London: Springer, 2009. ISBN 978-0-387-74534-3.

Complementary:
- Rabiner, L.R.; Schafer, R.W. Theory and applications of digital speech processing. Prentice Hall, 2010. ISBN 9780136034285.
- Huang, Y.A.; Benesty, J. (eds.). Audio signal processing for next-generation multimedia communication systems [online]. New York: Kluwer Academic Publishing, 2004 [Consultation: 23/07/2013]. Available on: <http://link.springer.com/book/10.1007/b117685/page/1>. ISBN 1402077688.

Other resources:
- Lecture slides
- Practical work statements and programs

Audiovisual material
- Slides: slides used in lectures

Computer material
- Program code: software codes in Matlab or similar
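As a small worked example of how the qualification-system weights listed in this guide combine, the Python sketch below computes a weighted final mark. The 0 to 10 marking scale, the function name, and the sample marks are assumptions for illustration; the guide itself only states the percentages.

```python
# Weights taken from the qualification system in this guide.
WEIGHTS = {
    "attendance": 0.10,
    "tests": 0.30,
    "project": 0.50,
    "presentation": 0.10,
}


def final_mark(marks):
    """Weighted average of the component marks (assumed 0-10 scale)."""
    assert abs(sum(WEIGHTS.values()) - 1.0) < 1e-9, "weights must sum to 100%"
    return sum(WEIGHTS[name] * marks[name] for name in WEIGHTS)


# Hypothetical student marks.
example = {"attendance": 10, "tests": 6, "project": 8, "presentation": 7}
print(final_mark(example))  # 1.0 + 1.8 + 4.0 + 0.7 = 7.5 (up to float rounding)
```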