Prognostics and Health Management Approaches based on belief functions

Similar documents
Lecture 1: Machine Learning Basics

Speech Emotion Recognition Using Support Vector Machine

AUTOMATIC DETECTION OF PROLONGED FRICATIVE PHONEMES WITH THE HIDDEN MARKOV MODELS APPROACH 1. INTRODUCTION

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Learning Methods in Multilingual Speech Recognition

Semi-Supervised GMM and DNN Acoustic Model Training with Multi-system Combination and Confidence Re-calibration

Modeling function word errors in DNN-HMM based LVCSR systems

Probabilistic Latent Semantic Analysis

Lecture 1: Basic Concepts of Machine Learning

Analysis of Emotion Recognition System through Speech Signal Using KNN & GMM Classifier

WHEN THERE IS A mismatch between the acoustic

Learning Methods for Fuzzy Systems

Modeling function word errors in DNN-HMM based LVCSR systems

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 17, NO. 3, MARCH

Human Emotion Recognition From Speech

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

Automatic Pronunciation Checker

A Comparison of DHMM and DTW for Isolated Digits Recognition System of Arabic Language

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano. Graduate School of Information Science, Nara Institute of Science & Technology

Speech Recognition at ICSI: Broadcast News and beyond

Lecture 9: Speech Recognition

An Online Handwriting Recognition System For Turkish

Python Machine Learning

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

Assignment 1: Predicting Amazon Review Ratings

Autoregressive product of multi-frame predictions can improve the accuracy of hybrid models

BODY LANGUAGE ANIMATION SYNTHESIS FROM PROSODY AN HONORS THESIS SUBMITTED TO THE DEPARTMENT OF COMPUTER SCIENCE OF STANFORD UNIVERSITY

On-Line Data Analytics

Evolutive Neural Net Fuzzy Filtering: Basic Description

A study of speaker adaptation for DNN-based speech synthesis

Unsupervised Acoustic Model Training for Simultaneous Lecture Translation in Incremental and Batch Mode

(Sub)Gradient Descent

On the Formation of Phoneme Categories in DNN Acoustic Models

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

CS Machine Learning

Mining Association Rules in Student s Assessment Data

Uncertainty concepts, types, sources

Discriminative Learning of Beam-Search Heuristics for Planning

DOMAIN MISMATCH COMPENSATION FOR SPEAKER RECOGNITION USING A LIBRARY OF WHITENERS. Elliot Singer and Douglas Reynolds

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages

Speech Recognition by Indexing and Sequencing

Speech Segmentation Using Probabilistic Phonetic Feature Hierarchy and Support Vector Machines

Multisensor Data Fusion: From Algorithms And Architectural Design To Applications (Devices, Circuits, And Systems)

Software Maintenance

Comparison of EM and Two-Step Cluster Method for Mixed Data: An Application

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

Class-Discriminative Weighted Distortion Measure for VQ-Based Speaker Identification

Historical maintenance relevant information roadmap for a self-learning maintenance prediction procedural approach

Robot Learning Simultaneously a Task and How to Interpret Human Instructions

Lip reading: Japanese vowel recognition by tracking temporal changes of lip shape

Likelihood-Maximizing Beamforming for Robust Hands-Free Speech Recognition

Design Of An Automatic Speaker Recognition System Using MFCC, Vector Quantization And LBG Algorithm

Artificial Neural Networks written examination

BAUM-WELCH TRAINING FOR SEGMENT-BASED SPEECH RECOGNITION. Han Shu, I. Lee Hetherington, and James Glass

Word Segmentation of Off-line Handwritten Documents

The Evolution of Random Phenomena

Cooperative evolutive concept learning: an empirical study

Speaker recognition using universal background model on YOHO database

ACOUSTIC EVENT DETECTION IN REAL LIFE RECORDINGS

Edinburgh Research Explorer

Generative models and adversarial training

AUTOMATED TROUBLESHOOTING OF MOBILE NETWORKS USING BAYESIAN NETWORKS

COMPUTER-ASSISTED INDEPENDENT STUDY IN MULTIVARIATE CALCULUS

Issues in the Mining of Heart Failure Datasets

Quantitative Evaluation of an Intuitive Teaching Method for Industrial Robot Using a Force / Moment Direction Sensor

Rule Learning With Negation: Issues Regarding Effectiveness

Mining Student Evolution Using Associative Classification and Clustering

Introduction to Simulation

WE GAVE A LAWYER BASIC MATH SKILLS, AND YOU WON T BELIEVE WHAT HAPPENED NEXT

Laboratorio di Intelligenza Artificiale e Robotica

2/15/13. POS Tagging Problem. Part-of-Speech Tagging. Example English Part-of-Speech Tagsets. More Details of the Problem. Typical Problem Cases

Speech Recognition using Acoustic Landmarks and Binary Phonetic Feature Classifiers

Learning Probabilistic Behavior Models in Real-Time Strategy Games

Knowledge based expert systems D H A N A N J A Y K A L B A N D E

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

SARDNET: A Self-Organizing Feature Map for Sequences

Language Acquisition Fall 2010/Winter Lexical Categories. Afra Alishahi, Heiner Drenhaus

The Method of Immersion the Problem of Comparing Technical Objects in an Expert Shell in the Class of Artificial Intelligence Algorithms

Fragment Analysis and Test Case Generation using F- Measure for Adaptive Random Testing and Partitioned Block based Adaptive Random Testing

Applying Fuzzy Rule-Based System on FMEA to Assess the Risks on Project-Based Software Engineering Education

Malicious User Suppression for Cooperative Spectrum Sensing in Cognitive Radio Networks using Dixon s Outlier Detection Method

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation

Proof Theory for Syntacticians

Phonetic- and Speaker-Discriminant Features for Speaker Recognition. Research Project

Time series prediction

Laboratorio di Intelligenza Artificiale e Robotica

Active Learning. Yingyu Liang Computer Sciences 760 Fall

ADVANCES IN DEEP NEURAL NETWORK APPROACHES TO SPEAKER RECOGNITION

University of Groningen. Systemen, planning, netwerken Bosman, Aart

A Reinforcement Learning Variant for Control Scheduling

Action Recognition and Video

Analysis of Speech Recognition Models for Real Time Captioning and Post Lecture Transcription

Ph.D in Advance Machine Learning (computer science) PhD submitted, degree to be awarded on convocation, sept B.Tech in Computer science and

Chapter 10 APPLYING TOPIC MODELING TO FORENSIC DATA. 1. Introduction. Alta de Waal, Jacobus Venter and Etienne Barnard

Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model

EECS 571 PRINCIPLES OF REAL-TIME COMPUTING Fall 10. Instructor: Kang G. Shin, 4605 CSE, ;

PREDICTING SPEECH RECOGNITION CONFIDENCE USING DEEP LEARNING WITH WORD IDENTITY AND SCORE FEATURES

Transcription:

Prognostics and Health Management Approaches based on belief functions FEMTO-ST institute / Dep. of Automation and Micromechatronics systems (AS2M), Besançon Emmanuel Ramasso Collaborated work with Dr. Lisa Serir, Pr. Michèle Rombaut, Pr. Thierry Denoeux, Dr. Rafael Gouriveau, Dr. Vincent Placet, Pr. Lamine Boubakar and Pr. Noureddine Zerhouni 2/12/212 Séminaire DIS GIPSA-lab

Overview Stating the context Prognostics and Health Management Belief functions Tools Evidential Evolving Systems Partially Hidden Markov Models Evidential Case-Based Reasoning Applications Bearings : Pronostia platform Turbofan : NASA Composites materials : Applied Mechanics dept. - FEMTO-ST 2

Part 1 : Stating the context 3

FEMTO-ST 65 people in 7 departments : Energie Informatique des Systèmes Complexes (DISC) Mécanique Appliquée Micro Nano Sciences et Systèmes (MN2S) Optique Temps Fréquence Automatique et Systèmes Micromécatroniques (AS2M) 4

AS2M Dept. ~8 people, 4 teams Biomedical Micronanorobotics Strategies of Perception at MicroNano scales Control and Design PHM (Prognostics and Health Management) 7 permanents + 6 PhD in 212 5

Prognostics and health management (PHM) Industrial context 6

Prognostics and health management (PHM) Maintenance activities 7

Prognostics and health management (PHM) Example : wind farm 8

Prognostics and health management (PHM) PHM decomposition 9

Prognostics and health management (PHM) Uncertainties in PHM [PhD Lisa Serir] 1

Prognostics and health management (PHM) Uncertainties in PHM [PhD Lisa Serir] 11

Prognostics and health management (PHM) The considered approach Data-driven PHM (opposed to model-based PHM) Three main steps : Uncertainty management using belief functions Condition assessement made by a classifier or a clustering algorithm 12

Belief functions Origin 13

Belief functions Main elements A variable taking values in a finite set Ω called frame of discernment A belief mass m is a mapping from the set of subsets 2Ω to [,1] with A Ω m(a)=1, and : m(a)> A is a focal set m(a)>, A >1 imprecise information Many useful tools for pattern recognition : Generalized Bayesian Theorem Conjunctive and disjunctive rules of combination Clustering algorithms 14

Belief functions Example on clustering Given data generated by accelerometers (x1 and x2) on a bearing, detection of faults is performed by a classifier that estimates the degree of belief to each possible functioning states. Objects & mass C1 C2 C12 C3 C13 C23 C123 m1 sure cluster 2 1 m2 doubt clusters 2-3 1 m3 full ignorance 1 m4 unknown 1.2.3.1.2.1.5.3.2.2.5.3 subsets m5 general case m6 probablity dist. 15

Belief functions Example on clustering One way to define the mass function : Where Aj is the j-th subset mij 1 A j d 2 ij dij is the distance between cluster j and data point i, Euclidean or Mahalanobis Let consider two cluster, with centers C1 (-3) and C2 (3), x varies between both of them C1 x C2-3 3 16

Belief functions Example on clustering One way to define the mass function : Where Aj is the j-th subset mij 1 A j d 2 ij dij is the distance between cluster j and data point i, Euclidean or Mahalanobis Let consider two cluster, with centers C1 (-3) and C2 (3), x varies between both of them C1 x C2-3 3 17

Belief functions Example on clustering One way to define the mass function : Where Aj is the j-th subset mij 1 A j d 2 ij dij is the distance between cluster j and data point i, Euclidean or Mahalanobis Let consider two cluster, with centers C1 (-3) and C2 (3), x varies between both of them C1 x C2-3 3 18

Part 2 : Tools 19

Overview Stating the context Prognostics and Health Management Belief functions Tools Evidential Evolving Systems Partially Hidden Markov Models Evidential Case-Based Reasoning Applications Bearings : Pronostia platform Turbofan : NASA Composites materials : Applied Mechanics dept. - FEMTO-ST 2

Evidential Evolving Systems Context One way is to learn the model again... another way, is to estimate the parameters recursively. In some cases, the system parameters may change, thus the system's model may be adapted to well fit current data. Work of Lisa Serir, PhD during 29-212 Evolving approaches well studied by P. Angelov (24-212). 21

Evidential Evolving Systems The method Prognostics performed in three phases : Detection of the current state : clustering Prediction of the degradation : regression Management of uncertainty : belief functions Evolving clustering : studied in «Evidential Evolving Gustafson-Kessel Algorithm For Online Data Streams Partitioning Using Belief Function Theory, L. Serir, E. Ramasso and N. Zerhouni, International Journal of Approximate Reasoning, 53(5):747-768, July 212». Prediction : studied in «L. Serir, E. Ramasso, P. Nectoux and N. Zerhouni, E2GKpro: An evidential evolving multi-modeling approach for system behavior prediction with applications, Journal of Mechanical Systems and Signal Processing, in press, 212.» 22

Evidential Evolving Systems Evolving Clustering Principle 23

Evidential Evolving Systems Evolving Clustering Computation 24

Evidential Evolving Systems Evolving Prediction Principle 25

Evidential Evolving Systems Evolving Prediction - Computation 26

Evidential Evolving Systems Results Clustering and model's adaptation in the following dataset (used in several papers) : 27

Evidential Evolving Systems Results 28

Evidential Evolving Systems Results PRONOSTIA's dataset Platform developed in FEMTO-ST / AS2M Study of bearings degradation 29

Evidential Evolving Systems Results Feature vector at time k: x(k) = [RMS(k) PSD(k)]' Goal: Predict at k + 1 : Regressors : xin = [ x(k-18) x(k-12) x(k-6) x(k)]' Output : xout = [xrms(k+1) xpsd (k+1)]' 3

Overview Stating the context Prognostics and Health Management Belief functions Tools Evidential Evolving Systems Partially Hidden Markov Models Evidential Case-Based Reasoning Applications Bearings : Pronostia platform Turbofan : NASA Composites materials : Applied Mechanics dept. - FEMTO-ST 31

Partially Hidden Markov Models Context A Hidden Markov model (HMM) is a Markov model in which the states are not directly visible. The system output, dependent on the state, is observed. A dynamical system can be represented by a statistical model assumed to be a first-order Markov process with unobserved (hidden) states. Joint collaboration with Pr. Thierry Denoeux, Heudiasyc lab., UTC, within a PEPS CNRS HMMS are widely applied to temporal pattern recognition problems such as: speech, DNA sequence analysis, machine condition monitoring, etc. 32

Partially Hidden Markov Models Motivation We considered problems in which the states are not completely hidden but partially observed, for instance : In speech recognition, partial information about phonemes may be provided by the analysis of lip motion In machine diagnosis, experts may express probability judgements on the machine condition at different time steps 33

Partially Hidden Markov Models Problem statement We focused on parameter learning based on observations of outputs : modeled by pz(.;θ), Z=(X,Y), θ unknown parameter partial information on states expressed in the belief function framework (m) The probability function pz(z;θ) and the mass function m represent two different pieces of knowledge: pz(z;θ) represents generic knowledge about the data generating process; it corresponds to random uncertainty m represents specific knowledge about a given realization z ; it captures epistemic uncertainty, i.e., uncertainty due to lack of knowledge 34

Partially Hidden Markov Models Estimation The algorithm iterates between : Estimation of the conditional expectation of the complete-data likelihood function Maximisation of this function with report to parameters of the model both given the prior information on states «m». Inference algorithms (classification phase in PHMM + Viterbi algorithm) have been modified in the same way to integrate partial knowledge m Proof in «Making use of partial knowledge about hidden states in HMMs: an approach based on belief functions, Emmanuel Ramasso, Thierry Denœux, Revised in Dec. 212 in IEEE Transactions on Fuzzy Systems» 35

Partially Hidden Markov Models Results Simulated data We considered data generated using a HMM with three states and three-dimensional Gaussian emission probability distributions pk(xt;φk) = N(μk,Σk) : 36

Partially Hidden Markov Models Results Setting partial knowledge The following parameters were studied : Imprecision of labels : experts may express doubt or may be more or less reliable, what is the sensitivity of the model? Labeling error : experts may sometimes be wrong and imprecise, what is the sensitivity of the model? Evaluation : the PHMM generates a sequence of states (using Viterbi alg.) which is a partition : it is compared with the ground truth using the Adjusted Rand Index (value between and 1) 37

Partially Hidden Markov Models Influence of labels error «Partially» supervised learning + «unsupervised» Viterbi algorithm applied to find the sequence, compared to the real one using ARI 38

Overview Stating the context Prognostics and Health Management Belief functions Tools Evidential Evolving Systems Partially Hidden Markov Models Evidential Case-Based Reasoning Applications Bearings : Pronostia platform Turbofan : NASA Composites materials : Applied Mechanics dept. - FEMTO-ST 39

Evidential Case-Based Reasoning Context In some cases, a statistical model can not represent all cases due to the lack of information. CBR approaches are then well suited. Given a library of possible degradation signals, CBR looks for similar cases and generates an estimate of the current and future health states. CBR approaches for PHM considered in T. Wang's PhD, now at General Electric Company. Joint collaboration with Pr. Michèle Rombaut, GIPSA-lab., Grenoble, within a PEPS CNRS 4

Evidential Case-Based Reasoning Motivation In usual CBR for PHM, the states are unknown. Only prediction followed by thresholding is performed. We considered the case where partial knowledge is available about the states, in the form of a belief function 41

Evidential Case-Based Reasoning Model Assume a library of trajectories (degradation signals) A «time window» defines a block of data 42

Evidential Case-Based Reasoning Prediction standard CBR Given the block of «observed» data, find the trajectories «close to» the observations, and use them to predict the future evolution 43

Evidential Case-Based Reasoning Prediction using belief functions Each piece of trajectories close to observations may have different length Each data may be accompanied by a belief function : fusion using the cautious rule 44

Evidential Case-Based Reasoning Algorithm Main contributions Classification of prediction : Does not make use of thresholds as usual PHM methods Works for multiple health indicators Fusion with direct prediction based on CBR Details in «Joint prediction of observations and states in time-series based on belief functions, IEEE Tr. on Systems, Man and Cybernetics - Part B: Cybernetics, E. Ramasso, M. Rombaut and N. Zerhouni» 45

Evidential Case-Based Reasoning Results (some) NASA's dataset (turbofan) Evaluation : Prediction between 25 and 6 time-units in advance, considered correct if falls in [-1,13] around the true prediction 46

Evidential Case-Based Reasoning Results (some) Ex : Prediction is precise from t=9, so 18 timesteps in advance Evolution of the error on prediction, and the length of prediction 47

Conclusion Prior information can be doubtful but still useful The use of belief functions is relevant to express «partial» knowledge» about the state of the world This knowledge is easier to obtain than precise knowledge (see the example on PHMM) Even corrupted prior can be useful in case doubt is present (see the example on PHMM) The use of fusion process in the belief function framework may improve detection and prediction (see the example on evidential CBR) The use of doubt in multimodeling : Decreases the number of cluster (compact models) Improves prediction (see the example on E2GKpro) 48

Future work Theoretical : Adaptation and improvements of previous models for enhanced prognostics including Decision-making process to help end-users Information fusion with physics-based prognostics Applied Composites materials (organic matrix) analysis based on acoustic emissions, jointly with the department of applied mechanics, LaBeX «ACTION» Project started in March 212 (one paper submitted in Journal of Composites Materials Part A + 2 conf.) One PhD (started on october 212) Still looking for a postdoc candidate (starting on 2/213) 49

Thanks for attending! 5