Skip to content

Functional-Data-Clustering/Functional-Data

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

66 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Functional Data for Cluster Analysis

Lists of open-access functional datasets from different fields of application. We only collect data that can be used for cluster analysis. The main objective is to facilitate comparing with existing clustering methods (for functional data) and evaluating new clustering methods. A recent comprehensive review of clustering methods for functional data is available here.

For datasets that need further processing on the linked data, a copy of them can be found in the Data folder. (This ongoing project is a bit slow, due to other commitments of the contributor.)

One-dimensional Functional Data

Name Available at Field Task Size Length Missing Value
ARC_Mobile Publisher Health Clustering 125 30/40 Yes
ArrowHead UEA & UCR Time Series Classification Repository Computer Vision Classification 211 251 No
BirdChicken UEA & UCR Time Series Classification Repository Computer Vision Classification 40 512 No
BTH_PM25 Publisher Environment Clustering 73 48 Yes
China_PM25 Publisher Environment Clustering 338 731 Yes
DiatomSizeReduction UEA & UCR Time Series Classification Repository Bioinformatics Classification 322 345 No
ECG200 UEA & UCR Time Series Classification Repository ECG Classification 200 96 No
FaceFour UEA & UCR Time Series Classification Repository Computer Vision Classification 112 350 No
Flour R (cfda) Food Classification 115 241 No
GunPoint UEA & UCR Time Series Classification Repository Motion Classification 200 150 No
Meat UEA & UCR Time Series Classification Repository Food Classification 120 448 No
Phoneme e-Book (ElemStatLearn) Speech Classification 4K+ 256 No
Strawberry UEA & UCR Time Series Classification Repository Food Classification 983 235 No
Symbols UEA & UCR Time Series Classification Repository Computer Vision Classification 1K+ 398 No
Tecator CMU StatLib Food Classification 240 100 No
... ... ... ... ... ... ...

Multi-dimensional Functional Data

Name Available at Field Task Size Length Dimension
ECG_Arrhythmia Publisher ECG Classification 10K+ 5000 12
EEG_Full UCI Machine Learning Repository EEG Classification 122 256 64
UWaveGestureLibrary UEA & UCR Time Series Classification Repository Gesture Classification 4K+ 315 3
... ... ... ... ... ... ...
... ... ... ... ... ... ...

We list below a few popular repositories, where you can find more functional datasets for cluster analysis.