Secure Synthetic Test Data Generation Using AI and Differential Privacy
Keywords:
AI Models, Synthetic Data, Privacy Preservation, Model Accuracy, Data Generation, Differential Privacy

Abstract
This paper examines AI-based generation of synthetic test data, addressing the trade-offs among data confidentiality, scalability, and utility. As data privacy becomes increasingly critical, particularly in domains that handle sensitive records, synthetic data must serve as a viable stand-in for real-world data while maintaining strong privacy guarantees. The generation process incorporates differential privacy to prevent leakage of personal information, offering a promising resolution to these privacy concerns. The study shows that AI models can generate high-quality synthetic data without sacrificing its usefulness. A key finding is that differential privacy reliably protects data confidentiality, but it struggles to preserve maximal data utility in large-scale settings. The study highlights the need to optimize these models into a scalable, secure, and practical solution for industries that use synthetic data to test and train AI-based systems.
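The central mechanism the abstract refers to, releasing statistics with calibrated noise under differential privacy, can be sketched as follows. This is a minimal illustration of the classic Laplace mechanism, not the paper's actual implementation; all function names and parameters here are illustrative assumptions.

```python
import math
import random

def laplace_noise(scale: float) -> float:
    """Sample Laplace(0, scale) noise via inverse-CDF sampling."""
    u = random.random() - 0.5
    sign = -1.0 if u < 0 else 1.0
    return -scale * sign * math.log(1.0 - 2.0 * abs(u))

def dp_count(records, predicate, epsilon: float) -> float:
    """Release a count query with epsilon-differential privacy.

    Adding or removing one record changes the count by at most 1,
    so the sensitivity is 1 and the noise scale is 1 / epsilon.
    """
    true_count = sum(1 for r in records if predicate(r))
    sensitivity = 1.0
    return true_count + laplace_noise(sensitivity / epsilon)

# Example: a noisy count over a toy "sensitive" dataset.
random.seed(0)  # fixed seed so the sketch is reproducible
ages = [23, 37, 45, 52, 61, 29, 33]
noisy = dp_count(ages, lambda a: a >= 40, epsilon=1.0)
```

Smaller values of `epsilon` add more noise and give stronger privacy; this is the utility-versus-confidentiality trade-off the abstract describes, which becomes harder to balance as the data scale grows.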
License
Copyright (c) 2025 Well Testing Journal

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
This license requires that re-users give credit to the creator. It allows re-users to distribute, remix, adapt, and build upon the material in any medium or format, for noncommercial purposes only.