Recommended FL Datasets
This page lists recommended datasets for federated learning research that can be
used with Flower Datasets (flwr-datasets). To learn about the library, see the
quickstart tutorial. For a full FL example that uses Flower together with Flower
Datasets, see quickstart-pytorch.
Note
Any dataset on the Hugging Face Hub can be used with the library. This page lists only a selection of datasets that we collected and that you might find useful.
For more information about any dataset, visit its page by clicking the dataset name.
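As a rough illustration of how a Hub dataset might be used with Flower Datasets, the sketch below wraps `FederatedDataset` in a small helper that loads one training partition. The helper name `load_partition_sketch`, its parameters, and the dataset identifier in the example call are assumptions made for illustration; they do not come from this page. Substitute the identifier shown on the dataset's own Hub page.

```python
def load_partition_sketch(dataset_id: str, num_partitions: int, partition_id: int = 0):
    """Load one training partition of a Hugging Face Hub dataset.

    Hypothetical helper for illustration only; not part of flwr-datasets.
    """
    # Imported lazily so this file can be imported even when
    # flwr-datasets is not installed.
    from flwr_datasets import FederatedDataset

    # Passing an integer for a split requests IID partitioning of that
    # split into the given number of partitions.
    fds = FederatedDataset(
        dataset=dataset_id,
        partitioners={"train": num_partitions},
    )

    # The returned partition is a Hugging Face `datasets.Dataset`.
    return fds.load_partition(partition_id, "train")


# Example call (downloads data on first use; "ylecun/mnist" is an
# assumed Hub identifier, used here only as a placeholder):
# partition = load_partition_sketch("ylecun/mnist", num_partitions=10)
```

Splits that are not listed in `partitioners` stay unpartitioned, so a centralized test split, where a dataset provides one, could be loaded separately with `fds.load_split("test")`.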
Image Datasets
| Name | Size | Image Shape |
|---|---|---|
| | train 60k; test 10k | 28x28 |
| | train 50k; test 10k | 32x32x3 |
| | train 50k; test 10k | 32x32x3 |
| | train 60k; test 10k | 28x28 |
| | train 814k | 28x28 |
| | train 100k; valid 10k | 64x64x3 |
| | train 7.3k; test 2k | 16x16 |
| | train 10k | 227x227 |
| | train 90k; valid 90k; test 90k | 32x32x3 |
| | train 8.7k | varies |
| | train 15.6k | varies |
| | train 18.6k; test 4.7k | varies |
| | train 73.3k; test 26k; extra 531k | 32x32x3 |
| | train 2.1k; test 0.9k | varies |
| | train 59k; test 9k | 32x32 |
Audio Datasets
| Name | Size | Subset |
|---|---|---|
| | train 64.7k | v0.01 |
| | train 105.8k | v0.02 |
| | train 70.3k | |
| | varies | 14 versions |
| | varies | clean/other |
Tabular Datasets
| Name | Size |
|---|---|
| | train 32.6k |
| | train 8.1k |
| | train 150 |
Text Datasets
| Name | Size | Category |
|---|---|---|
| | train 1.6M; test 0.5k | Sentiment |
| | full 974; sanitized 427 | General |
| | test 164 | General |
| | varies | General |
| | train 4.8k | Financial |
| | train 0.9k; validation 0.1k; test 0.2k | Financial |
| | train 9.5k; validation 2.4k | Financial |
| | train 2M; validation 11k | Medical |
| | train 183k; validation 4.3k; test 6.2k | Medical |
| | train 10.1k; test 1.3k; validation 1.3k | Medical |