We tested the ability of unsupervised machine learning approaches to separate an unknown number of artificial PD sources simultaneously acting, recorded by a multi-antenna system in the Ultra-High Frequeny range. Each pulse has been described with a minimum set of features and successively undergone to an unsupervised clustering procedure via Hierarchical Agglomerative Clustering (HAC) technique. The results have been compared, in a fully blind performance test, with those obtained from a cross-correlation analysis of the fully sampled pulses that clearly separates three groups of pulses radiated from the three PD sources. A limited set of mono-dimensional features (maximum amplitude, standard deviation, phase, power) shows a cluster detection accuracy of 58%. A different set of features (including maximum amplitude, standard deviation, phase, power, skewness value, kurtosis value), shows an accuracy of 64% while performance parameters as Precision and F1-Score show better results. Despite the sub-optimal results, the false-negative detection rate is low and the computational cost of this simplified approach is dramatically lower than the cross-correlation approach, thus allowing possible development in real-time supervising devices.

Unsupervised Machine Learning for Blind Separation of Multiple PD Sources

Polenghi M.;Ogliari E.
2022-01-01

Abstract

We tested the ability of unsupervised machine learning approaches to separate an unknown number of artificial PD sources simultaneously acting, recorded by a multi-antenna system in the Ultra-High Frequeny range. Each pulse has been described with a minimum set of features and successively undergone to an unsupervised clustering procedure via Hierarchical Agglomerative Clustering (HAC) technique. The results have been compared, in a fully blind performance test, with those obtained from a cross-correlation analysis of the fully sampled pulses that clearly separates three groups of pulses radiated from the three PD sources. A limited set of mono-dimensional features (maximum amplitude, standard deviation, phase, power) shows a cluster detection accuracy of 58%. A different set of features (including maximum amplitude, standard deviation, phase, power, skewness value, kurtosis value), shows an accuracy of 64% while performance parameters as Precision and F1-Score show better results. Despite the sub-optimal results, the false-negative detection rate is low and the computational cost of this simplified approach is dramatically lower than the cross-correlation approach, thus allowing possible development in real-time supervising devices.
2022
ICD 2022 - IEEE 2022 4th International Conference on Dielectrics, Proceedings
978-1-6654-1833-1
File in questo prodotto:
File Dimensione Formato  
Unsupervised_Machine_Learning_for_Blind_Separation_of_Multiple_PD_Sources.pdf

Accesso riservato

Dimensione 1.88 MB
Formato Adobe PDF
1.88 MB Adobe PDF   Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1223588
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact