kmlShape: An Efficient Method to Cluster Longitudinal Data (Time-Series) According to Their Shapes

Christophe Genolini; René Ecochard; Mamoun Benghezal; Driss Tarak; Sandrine Andrieu; Fabien Subtil

doi:10.1371/journal.pone.0150738

Article Dans Une Revue PLoS ONE Année : 2016

kmlShape: An Efficient Method to Cluster Longitudinal Data (Time-Series) According to Their Shapes

, (1) , , (2) , (3) , (1, 4, 5)

1
2
3
4
5

Christophe Genolini

Fonction : Auteur
PersonId : 781116
ORCID : 0000-0002-3321-6364

René Ecochard

Fonction : Auteur
PersonId : 1394194
IdHAL : rene-ecochard
ORCID : 0000-0002-1695-789X

Biostatistiques santé [LBBE]

Mamoun Benghezal

Fonction : Auteur

Driss Tarak

Fonction : Auteur
PersonId : 735126
IdHAL : tarak-driss
ORCID : 0000-0001-6109-7393
IdRef : 160686962

Centre de Recherche sur le Sport et le Mouvement

Sandrine Andrieu

Fonction : Auteur
PersonId : 1347574
ORCID : 0000-0002-1142-770X
IdRef : 073351431

Epidémiologie et analyses en santé publique : risques, maladies chroniques et handicaps

Fabien Subtil

Fonction : Auteur
PersonId : 169593
IdHAL : fabien-subtil

Biostatistiques santé [LBBE]

Service de Biostatistiques [Lyon]

Hospices Civils de Lyon

Résumé

Background Longitudinal data are data in which each variable is measured repeatedly over time. One possibility for the analysis of such data is to cluster them. The majority of clustering methods group together individual that have close trajectories at given time points. These methods group trajectories that are locally close but not necessarily those that have similar shapes. However, in several circumstances, the progress of a phenomenon may be more important than the moment at which it occurs. One would thus like to achieve a partitioning where each group gathers individuals whose trajectories have similar shapes whatever the time lag between them. Method In this article, we present a longitudinal data partitioning algorithm based on the shapes of the trajectories rather than on classical distances. Because this algorithm is time consuming, we propose as well two data simplification procedures that make it applicable to high dimensional datasets. Results In an application to Alzheimer disease, this algorithm revealed a "rapid decline" patient group that was not found by the classical methods. In another application to the feminine menstrual cycle, the algorithm showed, contrarily to the current literature, that the luteinizing hormone presents two peaks in an important proportion of women (22%).

Mots clés

Alzheimer disease Leaves Elections Database and informatics methods Algorithms Analyse du Mouvement en Biomécanique Physiologie et Imagerie Data reduction Dogs Ovulation

Domaines

Biomécanique [physics.med-ph] Imagerie Physiologie [q-bio.TO]

Fichier principal

a13bdf28f2250f9c17a3b2382cb69abf.pdf (6.28 Mo)

Origine	Fichiers éditeurs autorisés sur une archive ouverte

Tarak DRISS : Connectez-vous pour contacter le contributeur

https://hal.parisnanterre.fr/hal-01467694

Soumis le : mercredi 24 janvier 2024-10:05:43

Dernière modification le : lundi 16 décembre 2024-17:38:39

Dates et versions

hal-01467694 , version 1 (24-01-2024)

Identifiants

HAL Id : hal-01467694 , version 1
DOI : 10.1371/journal.pone.0150738
PUBMEDCENTRAL : PMC4892497

Citer

Christophe Genolini, René Ecochard, Mamoun Benghezal, Driss Tarak, Sandrine Andrieu, et al.. kmlShape: An Efficient Method to Cluster Longitudinal Data (Time-Series) According to Their Shapes. PLoS ONE, 2016, 11 (6), ⟨10.1371/journal.pone.0150738⟩. ⟨hal-01467694⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

HCL CNRS UNIV-LYON1 BIOENVIS LBBE UNIV-PARIS-LUMIERES UDL UNIV-PARIS-NANTERRE UNIV-UT3 UT3-TOULOUSEINP CERPOP

228 Consultations

31 Téléchargements

kmlShape: An Efficient Method to Cluster Longitudinal Data (Time-Series) According to Their Shapes

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager