Abstract
Background: Breast cancer diagnosis increasingly relies on data-driven learning from heterogeneous medical imaging sources. However, centralized deep learning approaches face major limitations, including privacy risks, institutional data silos, and limited generalization across imaging modalities.
Methods: This study proposes a privacy-aware federated learning framework integrating a hybrid CNN–Vision Transformer architecture for breast cancer classification under simulated non-identically distributed (non-IID) conditions. Public datasets representing different imaging modalities, BreakHis (histopathology), INbreast and CBIS-DDSM (mammography), and BUSI (ultrasound) are treated as independent federated clients to emulate multi-institutional collaboration. Each client trains a local model, and only the model parameters are aggregated via Federated Averaging, thereby preserving data locality. The hybrid architecture combines a ResNet-18 convolutional branch for local feature extraction with a Vision Transformer branch for global contextual representation.
Results: Across ten federated communication rounds, the global model demonstrates stable convergence under heterogeneous client distributions. The final global validation performance reaches 72.43% accuracy, AUC 0.7475, and F1-score 0.718. Evaluation on the pooled test cohort achieves approximately 84% overall accuracy with a weighted F1-score of 0.82, while malignant recall approaches 96%, prioritizing clinically critical cancer detection. Qualitative explainability analysis indicates that the model focuses on diagnostically relevant tissue regions.
Conclusion: The results demonstrate the feasibility of hybrid CNN–Vision Transformer training in a federated setting for privacy-aware breast cancer classification across heterogeneous imaging domains.
References
1. Al-Hejri AM, Sable AH, Al-Tam RM, Al-Antari MA, Alshamrani SS, Alshmrany KM, Alatebi W. A hybrid explainable federated-based vision transformer framework for breast cancer prediction via risk factors. Sci Rep. 2025: 18453 [PMID: 40419634, https://doi.org/10.1038/s41598-025-96527-0]
2. McMahan B, Moore E, Ramage D, Hampson S, y Arcas BA. Communication-efficient learning of deep networks from decentralized data. Artificial intelligence and statistics. 2017: 1273
3. Konečný J, McMahan HB, Ramage D, Richtárik P. Federated optimization: Distributed machine learning for on-device intelligence. arXiv preprint arXiv:161002527. 2016: https://doi.org/10.48550/arXiv.1610.02527]
4. Li T, Sahu AK, Zaheer M, Sanjabi M, Talwalkar A, Smith V. Federated optimization in heterogeneous networks. Proceedings of Machine learning and systems. 2020: 429
5. Dosovitskiy A, Beyer L, Kolesnikov A, Weissenborn D, Zhai X, Unterthiner T, Dehghani M, Minderer M, Heigold G, Gelly S. An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:201011929. 2020: https://doi.org/10.48550/arXiv.2010.11929]
6. Touvron H, Cord M, Douze M, Massa F, Sablayrolles A, Jégou H. Training data-efficient image transformers & distillation through attention. International conference on machine learning. 2021: 10347
7. Li Q, He B, Song D. Model-contrastive federated learning. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2021: 10713 https://doi.org/10.1109/CVPR46437.2021.01057]
8. Rieke N, Hancox J, Li W, Milletari F, Roth HR, Albarqouni S, Bakas S, Galtier MN, Landman BA, Maier-Hein K, Ourselin S, Sheller M, Summers RM, Trask A, Xu D, Baust M, Cardoso MJ. The future of digital health with federated learning. NPJ Digit Med. 2020: 119 [PMID: 33015372, https://doi.org/10.1038/s41746-020-00323-1]
9. Alotaibi M, Aljouie A, Alluhaidan N, Qureshi W, Almatar H, Alduhayan R, Alsomaie B, Almazroa A. Breast cancer classification based on convolutional neural network and image fusion approaches using ultrasound images. Heliyon. 2023: e22406 [PMID: 38074874, https://doi.org/10.1016/j.heliyon.2023.e22406]
10. Shen D, Wu G, Suk HI. Deep Learning in Medical Image Analysis. Annu Rev Biomed Eng. 2017: 221 [PMID: 28301734, https://doi.org/10.1146/annurev-bioeng-071516-044442]
11. Abdulrahman S, Tout H, Ould-Slimane H, Mourad A, Talhi C, Guizani M. A Survey on Federated Learning: The Journey From Centralized to Distributed On-Site Learning and Beyond. IEEE Internet of Things Journal. 2021: 5476 https://doi.org/10.1109/jiot.2020.3030072]
12. Salakhutdinov R, Hinton G. Deep boltzmann machines. Artificial intelligence and statistics. 2009: 448
13. Holzinger A, Biemann C, Pattichis CS, Kell DB. What do we need to build explainable AI systems for the medical domain? arXiv preprint arXiv:171209923. 2017: https://doi.org/10.48550/arXiv.1712.09923]
14. Selvaraju RR, Cogswell M, Das A, Vedantam R, Parikh D, Batra D. Grad-cam: Visual explanations from deep networks via gradient-based localization. Proceedings of the IEEE international conference on computer vision. 2017: 618
15. Brunese L, Mercaldo F, Reginelli A, Santone A. Explainable Deep Learning for Pulmonary Disease and Coronavirus COVID-19 Detection from X-rays. Comput Methods Programs Biomed. 2020: 105608 [PMID: 32599338, https://doi.org/10.1016/j.cmpb.2020.105608]
16. Perez L, Wang J. The effectiveness of data augmentation in image classification using deep learning. arXiv preprint arXiv:171204621. 2017: https://doi.org/10.48550/arXiv.1712.04621]
17. Garrucho L, Kushibar K, Jouide S, Diaz O, Igual L, Lekadir K. Domain generalization in deep learning based mass detection in mammography: A large-scale multi-center study. Artif Intell Med. 2022: 102386 [PMID: 36207090, https://doi.org/10.1016/j.artmed.2022.102386]
18. Guan H, Yap PT, Bozoki A, Liu M. Federated learning for medical image analysis: A survey. Pattern Recognit. 2024: [PMID: 38559674, https://doi.org/10.1016/j.patcog.2024.110424]
19. Kaissis GA, Makowski MR, Rückert D, Braren RF. Secure, privacy-preserving and federated machine learning in medical imaging. Nature Machine Intelligence. 2020: 305 https://doi.org/10.1038/s42256-020-0186-1]

This work is licensed under a Creative Commons Attribution 4.0 International License.
Copyright (c) 2026 Bandhan Panda, Bibek Kumar Patro, Siba Sundar Das, Santosh Kumar Kar

