Prosody, Phonology and Phonetics

Similar documents
MARE Publication Series

International Series in Operations Research & Management Science

Pre-vocational Education in Germany and China

Lecture Notes in Artificial Intelligence 4343

Guide to Teaching Computer Science

Advances in Mathematics Education

Business Students. AACSB Accredited Business Programs

Perspectives of Information Systems

PRODUCT PLATFORM AND PRODUCT FAMILY DESIGN

Eyebrows in French talk-in-interaction

Communication and Cybernetics 17

Speech Emotion Recognition Using Support Vector Machine

The influence of metrical constraints on direct imitation across French varieties

Lecture Notes in Artificial Intelligence 7175

Developing Language Teacher Autonomy through Action Research

Second Language Learning and Teaching. Series editor Mirosław Pawlak, Kalisz, Poland

IMPLEMENTING EUROPEAN UNION EDUCATION AND TRAINING POLICY

Mandarin Lexical Tone Recognition: The Gating Paradigm

OTHER RESEARCH EXPERIENCE & AFFILIATIONS

EDUCATION IN THE INDUSTRIALISED COUNTRIES

US and Cross-National Policies, Practices, and Preparation

COMMUNICATION-BASED SYSTEMS

AUTONOMY. in the Law

A study of speaker adaptation for DNN-based speech synthesis

NATO ASI Series Advanced Science Institutes Series

Welcome to. ECML/PKDD 2004 Community meeting

A Cross-language Corpus for Studying the Phonetics and Phonology of Prominence

PRAAT ON THE WEB AN UPGRADE OF PRAAT FOR SEMI-AUTOMATIC SPEECH ANNOTATION

Lecture Notes on Mathematical Olympiad Courses

National Taiwan Normal University - List of Presidents

Athens: City And Empire Students Book (Cambridge School Classics Project) By Cambridge School Classics Project

Rhythm-typology revisited.

The recognition, evaluation and accreditation of European Postgraduate Programmes.

Instrumentation, Control & Automation Staffing. Maintenance Benchmarking Study

The NICT/ATR speech synthesis system for the Blizzard Challenge 2008

Master s Degree Programme in East Asian Studies

THE ALLEGORY OF THE CATS By David J. LeMaster

Summary and policy recommendations

Innovation & Quality in E-Learning & Standardization: Open Learning for All

Linking Task: Identifying authors and book titles in verbose queries

PeopleSoft Human Capital Management 9.2 (through Update Image 23) Hardware and Software Requirements

Annotation Pro. annotation of linguistic and paralinguistic features in speech. Katarzyna Klessa. Phon&Phon meeting

Analysis of Emotion Recognition System through Speech Signal Using KNN & GMM Classifier

Cambridge NATIONALS. Creative imedia Level 1/2. UNIT R081 - Pre-Production Skills DELIVERY GUIDE

30 Jahre Kooperation zwischen TU Darmstadt & Tongji University Shanghai

PROJECT PERIODIC REPORT

Unvoiced Landmark Detection for Segment-based Mandarin Continuous Speech Recognition

Twenty years of TIMSS in England. NFER Education Briefings. What is TIMSS?

A Practical Introduction to Teacher Training in ELT

DICE - Final Report. Project Information Project Acronym DICE Project Title

Factors Affecting the Sustainability of Sino-American Educational Ventures in Mainland China

Speech Synthesis in Noisy Environment by Enhancing Strength of Excitation and Formant Prominence

Case of the Department of Biomedical Engineering at the Lebanese. International University

Problems of the Arabic OCR: New Attitudes

FEIRONG YUAN, PH.D. Updated: April 15, 2016

Getting the Story Right: Making Computer-Generated Stories More Entertaining

Body-Conducted Speech Recognition and its Application to Speech Support System

Disambiguation of Thai Personal Name from Online News Articles

Developing Grammar in Context

MULTILINGUAL INFORMATION ACCESS IN DIGITAL LIBRARY

THE PROMOTION OF SOCIAL AWARENESS

Albert (Yan) Wang. Flow-induced Trading Pressure and Corporate Investment (with Xiaoxia Lou), Forthcoming at

University Faculty Details Page on DU Web-site

Introduction Research Teaching Cooperation Faculties. University of Oulu

To link to this article: PLEASE SCROLL DOWN FOR ARTICLE

Topic Study Group No. 25: The Role of History of Mathematics in Mathematics Education

Academic profession in Europe

Investigation on Mandarin Broadcast News Speech Recognition

Tuition fees: Experiences in Finland

Coordinating by looking back? Past experience as enabler of coordination in extreme environment

Department of Education and Skills. Memorandum

Open Education and Quality: The Need for Changing Strategies and Learning UNESCO IITE 2016, St. Petersburg by Christian M.

Use and Adaptation of Open Source Software for Capacity Building to Strengthen Health Research in Low- and Middle-Income Countries

Class-Discriminative Weighted Distortion Measure for VQ-Based Speaker Identification

Automating the E-learning Personalization

Stimulating Techniques in Micro Teaching. Puan Ng Swee Teng Ketua Program Kursus Lanjutan U48 Kolej Sains Kesihatan Bersekutu, SAS, Ulu Kinta

HIGHLIGHTS OF FINDINGS FROM MAJOR INTERNATIONAL STUDY ON PEDAGOGY AND ICT USE IN SCHOOLS

Prairie View A&M University Houston, TX P.O. Box 519; MS 2220; Hilliard Hall (281)

Pupil Premium Impact Assessment

Advanced Grammar in Use

Problems of practice-based Doctorates in Art and Design: a viewpoint from Finland

Speech Recognition at ICSI: Broadcast News and beyond

K-12 PROFESSIONAL DEVELOPMENT

Cross Language Information Retrieval

Grindelwald Tasmania 7277 Australia Tel: ++ (613)

Spoken English, TESOL and Applied Linguistics

EFL teachers and students perspectives on the use of electronic dictionaries for learning English

Researchers, speak out! Annina Huhtala, Kaskas

PUBLIC CASE REPORT Use of the GeoGebra software at upper secondary school

EDITORIAL: ICT SUPPORT FOR KNOWLEDGE MANAGEMENT IN CONSTRUCTION

Empirical research on implementation of full English teaching mode in the professional courses of the engineering doctoral students

Free Education for Open Learning: Open educational policies, strategies & access for all

Middle School Curriculum Guide

OUR GOAL:THE SUCCESS OF YOUR STAY IN FRANCE

Task-Based Language Teaching: An Insight into Teacher Practice

THE PERCEPTION AND PRODUCTION OF STRESS AND INTONATION BY CHILDREN WITH COCHLEAR IMPLANTS

REVIEW OF ONLINE INTERCULTURAL EXCHANGE: AN INTRODUCTION FOR FOREIGN LANGUAGE TEACHERS

Corpus Linguistics (L615)

What s in a Step? Toward General, Abstract Representations of Tutoring System Log Data

My First Spanish Phrases (Speak Another Language!) By Jill Kalz

Transcription:

Prosody, Phonology and Phonetics Series Editors Daniel J. Hirst CNRS Laboratoire Parole et Langage, Aix-en-Provence, France Qiuwu Ma School of Foreign Languages, Tongji University, Shanghai, China Hongwei Ding School of Foreign Languages, Tongji University, Shanghai, China

The series will publish studies in the general area of Speech Prosody with a particular (but non-exclusive) focus on the importance of phonetics and phonology in this field. The topic of speech prosody is today a far larger area of research than is often realised. The number of papers on the topic presented at large international conferences such as Interspeech and ICPhS is considerable and regularly increasing. The proposed book series would be the natural place to publish extended versions of papers presented at the Speech Prosody Conferences, in particular the papers presented in Special Sessions at the conference. This could potentially involve the publication of 3 or 4 volumes every two years ensuring a stable future for the book series. If such publications are produced fairly rapidly, they will in turn provide a strong incentive for the organisation of other special sessions at future Speech Prosody conferences. More information about this series at http://www.springer.com/series/11951

Keikichi Hirose Jianhua Tao Editors Speech Prosody in Speech Synthesis: Modeling and generation of prosody for high quality and flexible speech synthesis 2123

Editors Keikichi Hirose Graduate School of Information Science and Technology University of Tokyo Tokyo Japan Jianhua Tao Institute of Automation Chinese Academy of Sciences Beijing China ISSN 2197-8700 Prosody, Phonology and Phonetics ISBN 978-3-662-45257-8 DOI 10.1007/978-3-662-45258-5 ISSN 2197-8719 (electronic) ISBN 978-3-662-45258-5 (ebook) Library of Congress Control Number: 2014955166 Springer Berlin Heidelberg Dordrecht London Springer-Verlag Berlin Heidelberg 2015 This work is subject to copyright. All rights are reserved by the Publisher, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission or information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed. The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use. The publisher, the authors and the editors are safe to assume that the advice and information in this book are believed to be true and accurate at the date of publication. Neither the publisher nor the authors or the editors give a warranty, express or implied, with respect to the material contained herein or for any errors or omissions that may have been made. Printed on acid-free paper Springer is part of Springer Science+Business Media (www.springer.com)

Contents Part I Modeling of Prosody 1 ProZed: A Speech Prosody Editor for Linguists, Using Analysis-by-Synthesis... 3 Daniel J. Hirst 2 Degrees of Freedom in Prosody Modeling... 19 Yi Xu and Santitham Prom-on 3 Extraction, Analysis and Synthesis of Fujisaki model Parameters... 35 Hansjörg Mixdorff 4 Probabilistic Modeling of Pitch Contours Toward Prosody Synthesis and Conversion... 49 Hirokazu Kameoka Part II Para- and Non-Linguistic Issues of Prosody 5 Communicative Speech Synthesis as Pan-Linguistic Prosody Control 73 Yoshinori Sagisaka and Yoko Greenberg 6 Mandarin Stress Analysis and Prediction for Speech Synthesis... 83 Ya Li and Jianhua Tao 7 Expressivity in Interactive Speech Synthesis; Some Paralinguistic and Nonlinguistic Issues of Speech Prosody for Conversational Dialogue Systems... 97 Nick Campbell and Ya Li 8 Temporally Variable Multi attribute Morphing of Arbitrarily Many Voices for Exploratory Research of Speech Prosody... 109 Hideki Kawahara v

vi Contents Part III Control of Prosody in Speech Synthesis 9 Statistical Models for Dealing with Discontinuity of Fundamental Frequency... 123 Kai Yu 10 Use of Generation Process Model for Improved Control of Fundamental Frequency Contours in HMM-Based Speech Synthesis... 145 Keikichi Hirose 11 Tone Nucleus Model for Emotional Mandarin Speech Synthesis... 161 Miaomiao Wang 12 Emphasis, Word Prominence, and Continuous Wavelet Transform in the Control of HMM-Based Synthesis... 173 Martti Vainio, Antti Suni and Daniel Aalto 13 Exploiting Alternatives for Text-To-Speech Synthesis: From Machine to Human... 189 Nicolas Obin, Christophe Veaux and Pierre Lanchantin 14 Prosody Control and Variation Enhancement Techniques for HMM-Based Expressive Speech Synthesis... 203 Takao Kobayashi

Contributors Daniel Aalto University of Helsinki, Helsinki, Finland Nick Campbell Trinity College Dublin, The University of Dublin, Dublin, Ireland Yoko Greenberg Waseda University, Tokyo, Japan Keikichi Hirose The University of Tokyo, Tokyo, Japan Daniel J. Hirst CNRS & Aix-Marseille University, Aix-en-Provence, France Tongji University, Shanghai, China Hirokazu Kameoka The University of Tokyo, Tokyo, Japan/NTT Communication Science Laboratories, Atsugi, Japan Hideki Kawahara Wakayama University, Wakayama, Japan Takao Kobayashi Tokyo Institute of Technology, Tokyo, Japan Pierre Lanchantin Cambridge University, Cambridge, UK Ya Li Institute ofautomation, ChineseAcademy of Sciences, Beijing, China/Trinity College Dublin, The University of Dublin, Dublin, Ireland Hansjörg Mixdorff Beuth-Hochschule für Technik Berlin, Berlin, Germany Nicolas Obin IRCAM, UMR STMS IRCAM-CNRS-UPMC, Paris, France Santitham Prom-on King Mongkut s University of Technology Thonburi, Thailand Yoshinori Sagisaka Waseda University, Tokyo, Japan Antti Suni University of Helsinki, Helsinki, Finland Jianhua Tao Institute ofautomation, ChineseAcademy of Sciences, Beijing, China Martti Vainio University of Helsinki, Helsinki, Finland Christophe Veaux Centre for Speech Technology Research, Edinburgh, UK vii

viii Contributors Miaomiao Wang Toshiba China R&D Center, Beijing, China Yi Xu University College London, London, UK Kai Yu Shanghai Jiao Tong University, Shanghai, China