Recommended FL Datasets¶
This page lists the recommended datasets for federated learning research, which can be
used with Flower Datasets flwr-datasets
. To learn about the library, see the
quickstart tutorial . To
see the full FL example with Flower and Flower Datasets open the quickstart-pytorch.
Note
All datasets from HuggingFace Hub can be used with our library. This page presents just a set of datasets we collected that you might find useful.
For more information about any dataset, visit its page by clicking the dataset name. For more information how to use the
Image Datasets¶
Name |
Size |
Image Shape |
---|---|---|
train 60k; test 10k |
28x28 |
|
train 50k; test 10k |
32x32x3 |
|
train 50k; test 10k |
32x32x3 |
|
train 60k; test 10k |
28x28 |
|
train 814k |
28x28 |
|
train 100k; valid 10k |
64x64x3 |
|
train 7.3k; test 2k |
16x16 |
|
train 10k |
227x227 |
|
train 90k; valid 90k; test 90k |
32x32x3 |
|
train 8.7k |
varies |
|
train 15.6k |
varies |
|
train 18.6k; test 4.7k |
varies |
|
train 73.3k; test 26k; extra 531k |
32x32x3 |
|
train 2.1k; test 0.9k |
varies |
|
train 59k; test 9k |
32x32 |
Audio Datasets¶
Name |
Size |
Subset |
---|---|---|
train 64.7k |
v0.01 |
|
train 105.8k |
v0.02 |
|
train 70.3k |
||
varies |
14 versions |
|
varies |
clean/other |
Tabular Datasets¶
Name |
Size |
---|---|
train 32.6k |
|
train 8.1k |
|
train 150 |
Text Datasets¶
Name |
Size |
Category |
---|---|---|
train 1.6M; test 0.5k |
Sentiment |
|
full 974; sanitized 427 |
General |
|
test 164 |
General |
|
varies |
General |
|
train 4.8k |
Financial |
|
train 0.9k; validation 0.1k; test 0.2k |
Financial |
|
train 9.5k; validation 2.4k |
Financial |
|
train 2M; validation 11k |
Medical |
|
train 183k; validation 4.3k; test 6.2k |
Medical |
|
train 10.1k; test 1.3k; validation 1.3k |
Medical |