Lists of open-access functional datasets from different fields of application. We only collect data that can be used for cluster analysis. The main objective is to facilitate comparing with existing clustering methods (for functional data) and evaluating new clustering methods. A recent comprehensive review of clustering methods for functional data is available here.
For datasets that need further processing on the linked data, a copy of them can be found in the Data folder. (This ongoing project is a bit slow, due to other commitments of the contributor.)
Name | Available at | Field | Task | Size | Length | Missing Value |
---|---|---|---|---|---|---|
ARC_Mobile | Publisher | Health | Clustering | 125 | 30/40 | Yes |
ArrowHead | UEA & UCR Time Series Classification Repository | Computer Vision | Classification | 211 | 251 | No |
BirdChicken | UEA & UCR Time Series Classification Repository | Computer Vision | Classification | 40 | 512 | No |
BTH_PM25 | Publisher | Environment | Clustering | 73 | 48 | Yes |
China_PM25 | Publisher | Environment | Clustering | 338 | 731 | Yes |
DiatomSizeReduction | UEA & UCR Time Series Classification Repository | Bioinformatics | Classification | 322 | 345 | No |
ECG200 | UEA & UCR Time Series Classification Repository | ECG | Classification | 200 | 96 | No |
FaceFour | UEA & UCR Time Series Classification Repository | Computer Vision | Classification | 112 | 350 | No |
Flour | R (cfda) | Food | Classification | 115 | 241 | No |
GunPoint | UEA & UCR Time Series Classification Repository | Motion | Classification | 200 | 150 | No |
Meat | UEA & UCR Time Series Classification Repository | Food | Classification | 120 | 448 | No |
Phoneme | e-Book (ElemStatLearn) | Speech | Classification | 4K+ | 256 | No |
Strawberry | UEA & UCR Time Series Classification Repository | Food | Classification | 983 | 235 | No |
Symbols | UEA & UCR Time Series Classification Repository | Computer Vision | Classification | 1K+ | 398 | No |
Tecator | CMU StatLib | Food | Classification | 240 | 100 | No |
... | ... | ... | ... | ... | ... | ... |
Name | Available at | Field | Task | Size | Length | Dimension |
---|---|---|---|---|---|---|
ECG_Arrhythmia | Publisher | ECG | Classification | 10K+ | 5000 | 12 |
EEG_Full | UCI Machine Learning Repository | EEG | Classification | 122 | 256 | 64 |
UWaveGestureLibrary | UEA & UCR Time Series Classification Repository | Gesture | Classification | 4K+ | 315 | 3 |
... | ... | ... | ... | ... | ... | ... |
... | ... | ... | ... | ... | ... | ... |
We list below a few popular repositories, where you can find more functional datasets for cluster analysis.