International Journal For Multidisciplinary Research

E-ISSN: 2582-2160     Impact Factor: 9.24

A Widely Indexed Open Access Peer Reviewed Multidisciplinary Bi-monthly Scholarly International Journal

Call for Paper Volume 8, Issue 3 (May-June 2026) Submit your research before last 3 days of June to publish your research paper in the issue of May-June.

Privacy-Preserving Machine Learning on Financial Data: Federated Learning, Differential Privacy, and Practical Deployment Challenges in Banking

Author(s) Jeevan Krishna Paruchuri
Country United States
Abstract Machine learning on financial data sits at an awkward intersection: the data is among the most sensitive in any industry, the regulatory regime is among the strictest, and the business value of better models is large enough to keep pulling new ML workloads into production. This survey examines the two principal families of privacy-preserving ML federated learning (training across decentralized data without centralizing it) and differential privacy (adding mathematically calibrated noise to bound the information any individual record contributes to a model) through the lens of a practitioner deploying ML in a regulated banking environment. The work is grounded in concrete operational events, including a GDPR audit that surfaced A 2022 internal GDPR audit at the partner institution discovered 14 analysts with unauthorized access to a model's training feature store; this finding directly motivated the privacy-preserving redesign reported in this paper. We review the theoretical foundations (Dwork's differential privacy framework, McMahan's FedAvg algorithm, the DP-SGD training procedure) and report the privacy-utility trade-off observed in practice: at ε=1 (strong privacy) the model accuracy degrades by approximately 5%, while at ε=5 (moderate privacy) the loss is approximately 1%**. We discuss the operational realities that make federated learning hard in banking heterogeneous data across business units, communication overhead between geographically distributed sites, convergence challenges when client distributions diverge, and the difficulty of debugging models you cannot inspect end-to-end. We argue that the privacy mechanisms themselves work; the adoption barriers are organizational, regulatory, and operational rather than algorithmic. We close with practical guidance: where federated learning earns its complexity, where centralized training with strong access controls remains the right answer, and where differential privacy is most likely to deliver its promised guarantees without crippling model utility.
Field Computer Applications
Published In Volume 6, Issue 4, July-August 2024
Published On 2024-07-12
DOI https://doi.org/10.36948/ijfmr.2024.v06i04.75352

Share this