Prognostics and health management can improve the reliability and safety of transportation systems. Data collected from diverse sources provide a chance and at the same time a challenge for data-driven PHM methods and models. The data often exhibit challenging characteristics like imbalanced data on normal and faulty conditions, noise and outliers, data points of different importance for the data-driven model, etc. In this paper, a k nearest neighbors-based fuzzy support vector machine is proposed for reducing the computational burden and tackling the issue of imbalance and outlier data, in fault detection. Fault detection is mathematically a classification problem. In this paper, the reverse nearest neighbors technique is adopted for detecting outliers and the k nearest neighbors technique is used to identify the borderline points for defining the classification hyperplane in support vector machines. Considering the position of each data point and the distribution of its nearest neighbors, a new method is proposed for calculating their estimation error costs. A real case study concerning fault detection in a braking system of a highspeed train is considered.

KNN-FSVM for Fault Detection in High-Speed Trains

Zio E.
2018-01-01

Abstract

Prognostics and health management can improve the reliability and safety of transportation systems. Data collected from diverse sources provide a chance and at the same time a challenge for data-driven PHM methods and models. The data often exhibit challenging characteristics like imbalanced data on normal and faulty conditions, noise and outliers, data points of different importance for the data-driven model, etc. In this paper, a k nearest neighbors-based fuzzy support vector machine is proposed for reducing the computational burden and tackling the issue of imbalance and outlier data, in fault detection. Fault detection is mathematically a classification problem. In this paper, the reverse nearest neighbors technique is adopted for detecting outliers and the k nearest neighbors technique is used to identify the borderline points for defining the classification hyperplane in support vector machines. Considering the position of each data point and the distribution of its nearest neighbors, a new method is proposed for calculating their estimation error costs. A real case study concerning fault detection in a braking system of a highspeed train is considered.
2018
2018 IEEE International Conference on Prognostics and Health Management, ICPHM 2018
978-1-5386-1165-4
fuzzy membership calculation; fuzzy SVM; high-speed train; imbalanced data; prognostics and health management
File in questo prodotto:
File Dimensione Formato  
20.pdf

accesso aperto

: Pre-Print (o Pre-Refereeing)
Dimensione 694.4 kB
Formato Adobe PDF
694.4 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1122488
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 4
  • ???jsp.display-item.citation.isi??? 0
social impact