We tested the ability of unsupervised machine learning approaches to separate an unknown number of artificial PD sources simultaneously acting, recorded by a multi-antenna system in the Ultra-High Frequeny range. Each pulse has been described with a minimum set of features and successively undergone to an unsupervised clustering procedure via Hierarchical Agglomerative Clustering (HAC) technique. The results have been compared, in a fully blind performance test, with those obtained from a cross-correlation analysis of the fully sampled pulses that clearly separates three groups of pulses radiated from the three PD sources. A limited set of mono-dimensional features (maximum amplitude, standard deviation, phase, power) shows a cluster detection accuracy of 58%. A different set of features (including maximum amplitude, standard deviation, phase, power, skewness value, kurtosis value), shows an accuracy of 64% while performance parameters as Precision and F1-Score show better results. Despite the sub-optimal results, the false-negative detection rate is low and the computational cost of this simplified approach is dramatically lower than the cross-correlation approach, thus allowing possible development in real-time supervising devices.
Unsupervised Machine Learning for Blind Separation of Multiple PD Sources
Polenghi M.;Ogliari E.
2022-01-01
Abstract
We tested the ability of unsupervised machine learning approaches to separate an unknown number of artificial PD sources simultaneously acting, recorded by a multi-antenna system in the Ultra-High Frequeny range. Each pulse has been described with a minimum set of features and successively undergone to an unsupervised clustering procedure via Hierarchical Agglomerative Clustering (HAC) technique. The results have been compared, in a fully blind performance test, with those obtained from a cross-correlation analysis of the fully sampled pulses that clearly separates three groups of pulses radiated from the three PD sources. A limited set of mono-dimensional features (maximum amplitude, standard deviation, phase, power) shows a cluster detection accuracy of 58%. A different set of features (including maximum amplitude, standard deviation, phase, power, skewness value, kurtosis value), shows an accuracy of 64% while performance parameters as Precision and F1-Score show better results. Despite the sub-optimal results, the false-negative detection rate is low and the computational cost of this simplified approach is dramatically lower than the cross-correlation approach, thus allowing possible development in real-time supervising devices.File | Dimensione | Formato | |
---|---|---|---|
Unsupervised_Machine_Learning_for_Blind_Separation_of_Multiple_PD_Sources.pdf
Accesso riservato
Dimensione
1.88 MB
Formato
Adobe PDF
|
1.88 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.