Files
Download Full Text (336 KB)
Description
There is no easily-available dataset for mastery-focused education, where mastery replaces grades while accurately reflecting student performance. Student data is restricted due to privacy & security concerns. One K-12 app was recently discovered selling unmasked data on millions of students Synthetic datasets may solve this by providing utility, privacy preservation, scalability, customization, variability, and resistance to reverse-engineering Techniques used included autoencoders, variational auto-encoders (VAE), generative adversarial networks (GAN) , and copulas combined with GANs.
Publication Date
2025
Recommended Citation
Chinodakufa, Tapiwa Amion and Ahmed, Khandaker Mamun, "Generative AI for Synthetic Data Creation: Building Mastery-Focused Educational Datasets" (2025). External Research Posters. 16.
https://scholar.dsu.edu/erposters/16

Comments
Presented at the 2025 SDSU Data Science Symposium.