We propose a tree-based algorithm (μCART) for classification and regression problems in the context of functional data analysis, which allows to leverage measure learning and multiple splitting rules at the node level, with the objective of reducing error while retaining the interpretability of a tree. For each internal node, our main contribution is the idea of learning a weighted functional (Formula presented.) space by means of constrained convex optimization, which is then used to extract multiple weighted integral features from the functional predictors, in order to determine the binary split. The approach is designed to manage multiple functional predictors and/or responses, by defining suitable splitting rules and loss functions that can depend on the specific problem and can also be combined with additional scalar and categorical predictors, as the tree is grown with the original greedy CART algorithm. We focus on the case of scalar-valued functional predictors defined on unidimensional domains and illustrate the effectiveness of our method in both classification and regression tasks, through a simulation study and four real-world applications.

Measure inducing classification and regression trees for functional data

Vantini S.
2022-01-01

Abstract

We propose a tree-based algorithm (μCART) for classification and regression problems in the context of functional data analysis, which allows to leverage measure learning and multiple splitting rules at the node level, with the objective of reducing error while retaining the interpretability of a tree. For each internal node, our main contribution is the idea of learning a weighted functional (Formula presented.) space by means of constrained convex optimization, which is then used to extract multiple weighted integral features from the functional predictors, in order to determine the binary split. The approach is designed to manage multiple functional predictors and/or responses, by defining suitable splitting rules and loss functions that can depend on the specific problem and can also be combined with additional scalar and categorical predictors, as the tree is grown with the original greedy CART algorithm. We focus on the case of scalar-valued functional predictors defined on unidimensional domains and illustrate the effectiveness of our method in both classification and regression tasks, through a simulation study and four real-world applications.
2022
constrained convex optimization
decision trees
functional data analysis
high-dimensional data
splitting rule
weight function
File in questo prodotto:
File Dimensione Formato  
Statistical Analysis - 2022 - Belli - Measure inducing classification and regression trees for functional data.pdf

Accesso riservato

: Publisher’s version
Dimensione 6.13 MB
Formato Adobe PDF
6.13 MB Adobe PDF   Visualizza/Apri
11311-1205159_Vantini.pdf

Open Access dal 29/12/2022

: Post-Print (DRAFT o Author’s Accepted Manuscript-AAM)
Dimensione 3.14 MB
Formato Adobe PDF
3.14 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1205159
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 9
  • ???jsp.display-item.citation.isi??? 4
social impact