Leveraging Local Data Sampling Strategies to Improve Federated Learning (Extended Abstract)
Düsing C, Cimiano P, Paaßen B (2024)
In: 2024 IEEE 11th International Conference on Data Science and Advanced Analytics (DSAA). Proceedings of the International Conference on Data Science and Advanced Analytics. New York: IEEE: 471-472.
Konferenzbeitrag
| Veröffentlicht | Englisch
Download
Es wurden keine Dateien hochgeladen. Nur Publikationsnachweis!
Abstract / Bemerkung
Federated learning (FL) facilitates shared training of machine learning models while maintaining data privacy. Unfortunately, it suffers from data imbalance among participating clients, causing the performance of the shared model to drop. To diminish the negative effects of unfavorable data-specific properties, both algorithm- and data-based approaches seek to make FL more resilient against them. In this regard, data-based approaches prove to be more versatile and require less domain knowledge to be applied efficiently. Hence, they seem particularly suitable for widespread application in various FL environments. Although data-based approaches such as local data sampling have been applied to FL in the past, previous research did not provide a systematic analysis of the potential and limitations of individual data sampling strategies to improve FL. To this end, we (1) identify relevant local data sampling strategies for FL, (2) identify data-specific properties that negatively affect FL performance, and (3) provide a benchmark of local data sampling strategies regarding their effect on model performance, convergence, and training time in synthetic, real-world, and large-scale FL environments. Moreover, we propose and rigorously test a novel method for data sampling in FL that locally optimizes the choice of sampling strategy prior to FL participation. Our results show that FL can benefit from applying local data sampling in terms of performance and convergence rate, especially when data imbalance is high or the number of clients and samples is low. Furthermore, our proposed sampling strategy offers the best trade-off between model performance and training time.
Stichworte
Federated Learning;
Data Sampling;
Data Imbalance
Erscheinungsjahr
2024
Titel des Konferenzbandes
2024 IEEE 11th International Conference on Data Science and Advanced Analytics (DSAA)
Serien- oder Zeitschriftentitel
Proceedings of the International Conference on Data Science and Advanced Analytics
Seite(n)
471-472
Konferenz
IEEE 11th International Conference on Data Science and Advanced Analytics (DSAA)
Konferenzort
San Diego,CA
Konferenzdatum
2024-10-06 – 2024-10-10
ISBN
979-8-3503-6494-1,
979-8-3503-6495-8
ISSN
2472-1573
Page URI
https://pub.uni-bielefeld.de/record/3000570
Zitieren
Düsing C, Cimiano P, Paaßen B. Leveraging Local Data Sampling Strategies to Improve Federated Learning (Extended Abstract). In: 2024 IEEE 11th International Conference on Data Science and Advanced Analytics (DSAA). Proceedings of the International Conference on Data Science and Advanced Analytics. New York: IEEE; 2024: 471-472.
Düsing, C., Cimiano, P., & Paaßen, B. (2024). Leveraging Local Data Sampling Strategies to Improve Federated Learning (Extended Abstract). 2024 IEEE 11th International Conference on Data Science and Advanced Analytics (DSAA), Proceedings of the International Conference on Data Science and Advanced Analytics, 471-472. New York: IEEE. https://doi.org/10.1109/DSAA61799.2024.10722778
Düsing, Christoph, Cimiano, Philipp, and Paaßen, Benjamin. 2024. “Leveraging Local Data Sampling Strategies to Improve Federated Learning (Extended Abstract)”. In 2024 IEEE 11th International Conference on Data Science and Advanced Analytics (DSAA), 471-472. Proceedings of the International Conference on Data Science and Advanced Analytics. New York: IEEE.
Düsing, C., Cimiano, P., and Paaßen, B. (2024). “Leveraging Local Data Sampling Strategies to Improve Federated Learning (Extended Abstract)” in 2024 IEEE 11th International Conference on Data Science and Advanced Analytics (DSAA) Proceedings of the International Conference on Data Science and Advanced Analytics (New York: IEEE), 471-472.
Düsing, C., Cimiano, P., & Paaßen, B., 2024. Leveraging Local Data Sampling Strategies to Improve Federated Learning (Extended Abstract). In 2024 IEEE 11th International Conference on Data Science and Advanced Analytics (DSAA). Proceedings of the International Conference on Data Science and Advanced Analytics. New York: IEEE, pp. 471-472.
C. Düsing, P. Cimiano, and B. Paaßen, “Leveraging Local Data Sampling Strategies to Improve Federated Learning (Extended Abstract)”, 2024 IEEE 11th International Conference on Data Science and Advanced Analytics (DSAA), Proceedings of the International Conference on Data Science and Advanced Analytics, New York: IEEE, 2024, pp.471-472.
Düsing, C., Cimiano, P., Paaßen, B.: Leveraging Local Data Sampling Strategies to Improve Federated Learning (Extended Abstract). 2024 IEEE 11th International Conference on Data Science and Advanced Analytics (DSAA). Proceedings of the International Conference on Data Science and Advanced Analytics. p. 471-472. IEEE, New York (2024).
Düsing, Christoph, Cimiano, Philipp, and Paaßen, Benjamin. “Leveraging Local Data Sampling Strategies to Improve Federated Learning (Extended Abstract)”. 2024 IEEE 11th International Conference on Data Science and Advanced Analytics (DSAA). New York: IEEE, 2024. Proceedings of the International Conference on Data Science and Advanced Analytics. 471-472.
Export
Markieren/ Markierung löschen
Markierte Publikationen
Web of Science
Dieser Datensatz im Web of Science®Suchen in