Cognitive Dynamic Systems

Similar documents
Advanced Grammar in Use

Developing Grammar in Context

Python Machine Learning

THE PROMOTION OF SOCIAL AWARENESS

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Guide to Teaching Computer Science

Evolutive Neural Net Fuzzy Filtering: Basic Description

A Neural Network GUI Tested on Text-To-Phoneme Mapping

Axiom 2013 Team Description Paper

Master s Programme in Computer, Communication and Information Sciences, Study guide , ELEC Majors

Reinforcement Learning by Comparing Immediate Reward

Instrumentation, Control & Automation Staffing. Maintenance Benchmarking Study

WHEN THERE IS A mismatch between the acoustic

Perspectives of Information Systems

Learning Methods for Fuzzy Systems

Cambridge, New York, Melbourne, Madrid, Cape Town, Singapore, São Paulo, Delhi

Welcome to. ECML/PKDD 2004 Community meeting

Knowledge-Based - Systems

Lecture 10: Reinforcement Learning

10.2. Behavior models

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

International Series in Operations Research & Management Science

Lecture 1: Machine Learning Basics

INPE São José dos Campos

Speech Emotion Recognition Using Support Vector Machine

Marketing Management

Communication and Cybernetics 17

AUTOMATIC DETECTION OF PROLONGED FRICATIVE PHONEMES WITH THE HIDDEN MARKOV MODELS APPROACH 1. INTRODUCTION

Human Emotion Recognition From Speech

Focus on. Learning THE ACCREDITATION MANUAL 2013 WASC EDITION

Lecture Notes on Mathematical Olympiad Courses

CONCEPT MAPS AS A DEVICE FOR LEARNING DATABASE CONCEPTS

Maximizing Learning Through Course Alignment and Experience with Different Types of Knowledge

Exploration. CS : Deep Reinforcement Learning Sergey Levine

Seminar - Organic Computing

TD(λ) and Q-Learning Based Ludo Players

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

Delaware Performance Appraisal System Building greater skills and knowledge for educators

Proposal of Pattern Recognition as a necessary and sufficient principle to Cognitive Science

A Reinforcement Learning Variant for Control Scheduling

Artificial Neural Networks written examination

CSL465/603 - Machine Learning

New Venture Financing

Software Security: Integrating Secure Software Engineering in Graduate Computer Science Curriculum

Copyright Corwin 2015

Robust Speech Recognition using DNN-HMM Acoustic Model Combining Noise-aware training with Spectral Subtraction

NANCY L. STOKEY. Visiting Professor of Economics, Department of Economics, University of Chicago,

UCLA InterActions: UCLA Journal of Education and Information Studies

Conducting the Reference Interview:

Prof. Dr. Hussein I. Anis

The Learning Model S2P: a formal and a personal dimension

EDUCATION IN THE INDUSTRIALISED COUNTRIES

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation

content First Introductory book to cover CAPM First to differentiate expected and required returns First to discuss the intrinsic value of stocks

World University Rankings. Where s India?

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

University of Southern California Hayward R. Alker Postdoctoral Fellow, Center for International Studies,

Principles of Public Speaking

International Examinations. IGCSE English as a Second Language Teacher s book. Second edition Peter Lucantoni and Lydia Kellas

AMULTIAGENT system [1] can be defined as a group of

Soft Computing based Learning for Cognitive Radio

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Learning to Schedule Straight-Line Code

Lecture 1: Basic Concepts of Machine Learning

Massachusetts Institute of Technology Tel: Massachusetts Avenue Room 32-D558 MA 02139

Culture, Tourism and the Centre for Education Statistics: Research Papers

BENG Simulation Modeling of Biological Systems. BENG 5613 Syllabus: Page 1 of 9. SPECIAL NOTE No. 1:

Time series prediction

Feature-oriented vs. Needs-oriented Product Access for Non-Expert Online Shoppers

Speaker Identification by Comparison of Smart Methods. Abstract

Jared C. Carbone May 2013

Rule Learning With Negation: Issues Regarding Effectiveness

A General Class of Noncontext Free Grammars Generating Context Free Languages

Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for

Developing Effective Teachers of Mathematics: Factors Contributing to Development in Mathematics Education for Primary School Teachers

Abstractions and the Brain

Speech Segmentation Using Probabilistic Phonetic Feature Hierarchy and Support Vector Machines

MGT/MGP/MGB 261: Investment Analysis

ISFA2008U_120 A SCHEDULING REINFORCEMENT LEARNING ALGORITHM

AGENDA LEARNING THEORIES LEARNING THEORIES. Advanced Learning Theories 2/22/2016

Phonetic- and Speaker-Discriminant Features for Speaker Recognition. Research Project

TCH_LRN 531 Frameworks for Research in Mathematics and Science Education (3 Credits)

Mathematics Faculty Win Top University Honors

How the Guppy Got its Spots:

STA 225: Introductory Statistics (CT)

Probabilistic Latent Semantic Analysis

A THESIS. By: IRENE BRAINNITA OKTARIN S

How People Learn Physics

Modeling function word errors in DNN-HMM based LVCSR systems

Calibration of Confidence Measures in Speech Recognition

PROFESSIONAL TREATMENT OF TEACHERS AND STUDENT ACADEMIC ACHIEVEMENT. James B. Chapman. Dissertation submitted to the Faculty of the Virginia

Professional Development Guideline for Instruction Professional Practice of English Pre-Service Teachers in Suan Sunandha Rajabhat University

ACTL5103 Stochastic Modelling For Actuaries. Course Outline Semester 2, 2014

InTraServ. Dissemination Plan INFORMATION SOCIETY TECHNOLOGIES (IST) PROGRAMME. Intelligent Training Service for Management Training in SMEs

UNIVERSITY OF CALIFORNIA SANTA CRUZ TOWARDS A UNIVERSAL PARAMETRIC PLAYER MODEL

Australian Journal of Basic and Applied Sciences

A study of speaker adaptation for DNN-based speech synthesis

Lecture Notes in Artificial Intelligence 4343

MBA6941, Managing Project Teams Course Syllabus. Course Description. Prerequisites. Course Textbook. Course Learning Objectives.

Transcription:

Cognitive Dynamic Systems The principles of cognition are becoming increasingly important in the areas of signal processing, communications, and control. In this ground-breaking book,, a pioneer in the field and an award-winning researcher, educator, and author, sets out the fundamental ideas of cognitive dynamic systems. Weaving together the various branches of study involved, he demonstrates the power of cognitive information processing and highlights a range of future research directions. The book begins with a discussion of the core topic, cognition, dealing in particular with the perception action cycle. Then, the foundational topics, power spectrum estimation for sensing the environment, Bayesian filtering for environmental state estimation, and dynamic programming for action in the environment, are discussed. Building on these foundations, detailed coverage of two important applications of cognition, cognitive radar and cognitive radio, is presented. Blending theory and practice, this insightful book is aimed at all graduate students and researchers looking for a thorough grounding in this fascinating field. is the Director of the Cognitive Systems Laboratory at McMaster University, Canada. He is a pioneer in adaptive signal processing theory and applications in radar and communications, areas of research that have occupied much of his professional life. For the past 10 years he has focused his entire research interests on cognitive dynamic systems: cognitive radar, cognitive radio, cognitive control, and cognition applied to the cocktail party processor for the hearing impaired. He is a Fellow of the IEEE and the Royal Society of Canada, and is the recipient of the Henry Booker Gold Medal from URSI (2002), the Honorary Degree of Doctor of Technical Sciences from ETH Zentrum, Zurich (1999), and many other medals and prizes. In addition to the seminal journal papers Cognitive radio and Cognitive radar, he has also written or co-written nearly 50 books including a number of best-selling textbooks in the fields of signal processing, communications, and neural networks and learning machines.

Cognitive Dynamic Systems Perception Action Cycle, Radar, and Radio McMaster University, Canada

CAMBRIDGE UNIVERSITY PRESS Cambridge, New York, Melbourne, Madrid, Cape Town Singapore, São Paulo, Delhi, Mexico City Cambridge University Press The Edinburgh Building, Cambridge CB2 8RU, UK Published in the United States of America by Cambridge University Press, New York Informatio on this title: /9780521114363 Cambridge University Press 2012 This publication is in copyright. Subject to statutory exception and to the provisions of relevant collective licensing agreements, no reproduction of any part may take place without the written permission of Cambridge University Press. First published 2012 Printed in the United Kingdom at the University Press, Cambridge A catalogue record of this book is available from the British Library. Library of Congress Cataloguing in Publication data Haykin, Simon Cognitive dynamic systems : perception action cycle, radar, and radio /. p. cm. Includes bibliographical references and index. ISBN 978-0-521-11436-3 (hardback) 1. Self-organizing systems. 2. Cognitive radio networks. I. Title. Q325.H39 2011 003.7 dc23 2011037991 ISBN 978-0-521-11436-3 Hardback Cambridge University Press has no responsibility for the persistence or accuracy of URLs for external or third-party internet websites referred to in this publication, and does not guarantee that any content on such websites is, or will remain, accurate or appropriate.

Contents Preface Acknowledgments ix xi 1. Introduction 1 1.1 Cognitive dynamic systems 1 1.2 The perception action cycle 2 1.3 Cognitive dynamic wireless systems: radar and radio 3 1.4 Illustrative cognitive radar experiment 4 1.5 Principle of information preservation 8 1.6 Organization of the book 10 Notes and practical references 12 2. The perception action cycle 14 2.1 Perception 14 2.2 Memory 17 2.3 Working memory 20 2.4 Attention 20 2.5 Intelligence 21 2.6 Practical benefi ts of hierarchy in the perception action cycle 23 2.7 Neural networks for parallel distributed cognitive information processing 24 2.8 Associative learning process for memory construction 29 2.9 Back-propagation algorithm 31 2.10 Recurrent multilayer perceptrons 34 2.11 Self-organized learning 35 2.12 Summary and discussion 38 Notes and practical references 40 3. Power-spectrum estimation for sensing the environment 43 3.1 The power spectrum 43 3.2 Power spectrum estimation 44 3.3 Multitaper method 47 3.4 Space time processing 52 3.5 Time frequency analysis 56 3.6 Cyclostationarity 64

vi Contents 3.7 Harmonic F-test for spectral line components 67 3.8 Summary and discussion 71 Notes and practical references 73 4. Bayesian fi ltering for state estimation of the environment 77 4.1 Probability, conditional probability, and Bayes rule 78 4.2 Bayesian inference and importance of the posterior 80 4.3 Parameter estimation and hypothesis testing: the MAP rule 83 4.4 State-space models 87 4.5 The Bayesian filter 90 4.6 Extended Kalman filter 95 4.7 Cubature Kalman filters 97 4.8 On the relationship between the cubature and unscented Kalman filters 105 4.9 The curse of dimensionality 109 4.10 Recurrent multilayer perceptrons: an application for state estimation 112 4.11 Summary and discussion 120 Notes and practical references 121 5. Dynamic programming for action in the environment 125 5.1 Markov decision processes 126 5.2 Bellman s optimality criterion 129 5.3 Policy iteration 132 5.4 Value iteration 135 5.5 Approximate dynamic programming for problems with imperfect state information 137 5.6 Reinforcement learning viewed as approximate dynamic programming 141 5.7 Q -learning 141 5.8 Temporal-difference learning 144 5.9 On the relationships between temporal-difference learning and dynamic programming 148 5.10 Linear function approximations of dynamic programming 150 5.11 Linear GQ(λ ) for predictive learning 151 5.12 Summary and discussion 161 Notes and practical references 164 6. Cognitive radar 167 6.1 Three classes of radars defined 168 6.2 The perception action cycle 169 6.3 Baseband model of radar signal transmission 170 6.4 System design considerations 175 6.5 Cubature Kalman fi lter for target-state estimation 176

Contents vii 6.6 Transition from perception to action 180 6.7 Cost-to-go function 182 6.8 Cyclic directed information-flow 184 6.9 Approximate dynamic programming for optimal control 186 6.10 The curse-of-dimensionality problem 190 6.11 Two-dimensional grid for waveform library 191 6.12 Case study: tracking a falling object in space 192 6.13 Cognitive radar with single layer of memory 199 6.14 Intelligence for dealing with environmental uncertainties 206 6.15 New phenomenon in cognitive radar: chattering 209 6.16 Cognitive radar with multiscale memory 211 6.17 The explore exploit strategy defined 214 6.18 Sparse coding 215 6.19 Summary and discussion 222 Notes and practical references 225 7. Cognitive radio 230 7.1 The spectrum-underutilization problem 231 7.2 Directed information fl ow in cognitive radio 232 7.3 Cognitive radio networks 235 7.4 Where do we fi nd the spectrum holes? 237 7.5 Multitaper method for spectrum sensing 240 7.6 Case study I: wideband ATSC-DTV signal 242 7.7 Spectrum sensing in the IEEE 802.22 standard 244 7.8 Noncooperative and cooperative classes of cognitive radio networks 244 7.9 Nash equilibrium in game theory 246 7.10 Water-filling in information theory for cognitive control 248 7.11 Orthogonal frequency-division multiplexing 251 7.12 Iterative water-fi lling controller for cognitive radio networks 252 7.13 Stochastic versus robust optimization 256 7.14 Transient behavior of cognitive radio networks, and stability of equilibrium solutions 259 7.15 Case study II: robust IWFC versus classic IWFC 260 7.16 Self-organized dynamic spectrum management 265 7.17 Cooperative cognitive radio networks 268 7.18 Emergent behavior of cognitive radio networks 270 7.19 Provision for the feedback channel 272 7.20 Summary and discussion 273 Notes and practical references 276

viii Contents 8. Epilogue 282 8.1 The perception action cycle 282 8.2 Summarizing remarks on cognitive radar and cognitive radio 283 8.3 Unexplored issues 285 Glossary 293 References 297 Index 306

Preface In my Point-of-View article, entitled Cognitive dynamic systems, Proceedings of the IEEE, November 2006, I included a footnote stating that a new book on this very topic was under preparation. At long last, here is the book that I promised then, over four years later. Just as adaptive filtering, going back to the pioneering work done by Professor Bernard Widrow and his research associates at Stanford University, represents one of the hallmarks of the twentieth century in signal processing and control, I see cognitive dynamic systems, exemplified by cognitive radar, cognitive control, and cognitive radio and other engineering systems, as one of the hallmarks of the twenty-first century. The key question is: How do we define cognition? In this book of mine, I look to the human brain as the framework for cognition. As such, cognition embodies four basic processes: perception action cycle, memory, attention, and intelligence, each of which has a specific function of its own. In identifying this list of four processes. I have left out language, the fifth distinctive characteristic of human cognition, as it is outside the scope of this book. Simply put, there is no better framework than human cognition, embodying the above four processes, for the study of cognitive dynamic systems, irrespective of application. Putting aside the introductory and epilogue chapters, the remaining six chapters of the book are organized in three main parts as follows: Chapter 2, entitled the Perception action cycle, provides an introductory treatment of the four basic processes of cognition identified above. Moreover, the latter part of the chapter presents highlights of neural networks needed for the implementation of memory. Chapters 3, 4, and 5 provide the fundamentals of cognitive dynamic systems, viewed from an engineering perspective. Specifically, Chapter 3 discusses power spectrum estimation as a basic tool for sensing the environment. Chapter 4 discusses the Bayesian filter as the optimal framework for estimating the state of the environment when it is hidden. In effect, Chapters 3 and 4 are devoted to how the environment is perceived

x Preface by dynamic systems, viewed in two different ways. Chapter 5 deals with dynamic programming as the mathematical framework for how the system takes action on the environment. Chapters 6 and 7 are devoted to two important applications of cognitive dynamic systems: cognitive radar and cognitive radio respectively; both of them are fast becoming well understood, paving the way for their practical implementation. To conclude this Preface, it is my conviction that cognition will play the role of a software-centric information-processing mechanism that will make a difference to the theory and design of a new generation of engineering systems aimed at various applications, not just radar control, and radio., Ancaster, Ontario, Canada.

Acknowledgments In the course of writing this book, I learned a great deal about human cognition from the book Cortex and Mind: Unifying Cognition by Professor J. M. Fuster, University of California at Los Angeles. Just as importantly, I learned a great deal from the many lectures on cognitive dynamic systems, cognitive radio, and cognitive radar, which I had the privilege of presenting in different parts of the world. I would like to extend my special thanks to Professor Richard Sutton and his doctoral student, Hamid Maei, University of Alberta, Canada, for introducing me to a new generation of approximate dynamic-programming algorithms, the GQ(l) and Greedy-GQ, and communicating with me by e-mail to write material presented on these two algorithms in the latter part of Chapter 5 on dynamic programming. Moreover, Hamid was gracious to read over this chapter, for which I am grateful to him. I am also indebted to Professor Yann LeCun, New York University, and his ex-doctoral student Dr Marc Aurelio Ranzato for highly insightful and helpful discussion on sparse coding. In a related context, clarifying concepts made by Dr Bruno Olshausen, University of California, Berkeley, are much appreciated. I acknowledge two insightful suggestions: (1) The notion of fore-active radar as the first step towards radar cognition, which was made by Professor Christopher Baker, Australian National University, Canberra. (2) The analogy between feedback information in cognitive radar and saccade in vision, which was made by Professor José Principe, University of Florida. I thank my ex-graduate students, Dr Ienkaran Arasaratnam, Dr Peyman Setoodeh, and Dr Yanbo Xue, and current doctoral student, Farhad Khozeimeh, for their contributions to cognitive radar and cognitive radio. I have also benefited from Dr Amin Zia, who worked with me on cognitive tracking radar as a post-doctoral fellow. I am grateful to Dr. Setoodeh for careful proof-reading of the page proofs. Moreover, I had useful comments from Dr Terrence Sejnowski, Salk Institute, LaJolla, CA, and my faculty colleague Professor Suzanne Becker, McMaster University, Canada. Turning to publication of the book by Cambridge University Press, I am particularly grateful to Dr Philip Meyler, Publishing Director, and his two colleagues, Sarah Marsh and Caroline Mowatt, in overseeing the book through its different stages of production. In a related context, I also wish to thank Peter Lewis for copy editing the manuscript before going into production.

xii Acknowledgments The writing of this book has taken me close to four years, in the course of which I must have gone through 20 revisions of the manuscript. I am truly grateful to my Administrative Coordinator, Lola Brooks, for typing those many versions. Without her patience and dedication, completion of the manuscript would not have been possible. Last but by no means least, I thank my wife, Nancy, for allowing me the time I needed to write this book. Ancaster, Ontario, Canada.