RE.PUBLIC@POLIMI pubblicazioni di ricerca del Politecnico di Milano

Validating Personas presents a significant challenge in current scientific literature, relying on costly external validation methods involving users, experts or simulations. This study aims to address this challenge by introducing a novel method to perform personas validation based on data already available during their development. The proposed method starts from previously developed Personas, and trains a clustering model on the entire dataset. Subsequently, a training-test split is repeated twenty times, with each iteration involving the training of a clustering model on the training set. Labels for the test set are then predicted by both models, and consensus is assessed through clustering consensus metrics such as Adjusted Rand Index, Adjusted Mutual Information, homogeneity, completeness and V- measure averaged across the twenty iterations. The proposed method is evaluated using a dataset comprising 1070 subjects regarding their willingness to vaccinate against COVID-19. Three unbalanced Personas are derived from previous studies, and are used as input to this novel method. Clustering consensus metrics reveal a fair agreement between clustering results, with average ARI of 0.3391, average AMI of 0.3435, average homogeneity of 0.3531, average completeness of 0.3473 and average V-measure of 0.3499. In conclusion, the proposed method exhibits robust capabilities in generalizing beyond the training data by accurately classifying new samples in their respective Personas, suggesting the possibility of reducing costs when compared to existing methods in current literature.

A Data-Driven Method to Perform Personas Validation Using Clustering Consensus Metrics

Tauro, Emanuele;Gorini, Alessandra;Bilo, Grzegorz;Caiani, Enrico Gianluca

2024-01-01

Abstract

Validating Personas presents a significant challenge in current scientific literature, relying on costly external validation methods involving users, experts or simulations. This study aims to address this challenge by introducing a novel method to perform personas validation based on data already available during their development. The proposed method starts from previously developed Personas, and trains a clustering model on the entire dataset. Subsequently, a training-test split is repeated twenty times, with each iteration involving the training of a clustering model on the training set. Labels for the test set are then predicted by both models, and consensus is assessed through clustering consensus metrics such as Adjusted Rand Index, Adjusted Mutual Information, homogeneity, completeness and V- measure averaged across the twenty iterations. The proposed method is evaluated using a dataset comprising 1070 subjects regarding their willingness to vaccinate against COVID-19. Three unbalanced Personas are derived from previous studies, and are used as input to this novel method. Clustering consensus metrics reveal a fair agreement between clustering results, with average ARI of 0.3391, average AMI of 0.3435, average homogeneity of 0.3531, average completeness of 0.3473 and average V-measure of 0.3499. In conclusion, the proposed method exhibits robust capabilities in generalizing beyond the training data by accurately classifying new samples in their respective Personas, suggesting the possibility of reducing costs when compared to existing methods in current literature.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2024
			
	Titolo del libro
	
				IFMBE Proceedings
			
	Titolo della collana
	
				IFMBE PROCEEDINGS
			
	ISBN (International Standard Book Number)
	
				9783031625015
9783031625022
			
	Parole chiave
	
				Behavioral Change
Clustering Consensus
eHealth
Personas Validation
			
	Appare nelle tipologie:
	
				04.1 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
EHB2023_InTakeCare_Final.pdf Accesso riservato : Publisher’s version Dimensione 456.93 kB Formato Adobe PDF Visualizza/Apri	456.93 kB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1279948

Citazioni

ND

0

0

ND

social impact