Files

Download

Download Full Text (336 KB)

Description

There is no easily-available dataset for mastery-focused education, where mastery replaces grades while accurately reflecting student performance. Student data is restricted due to privacy & security concerns. One K-12 app was recently discovered selling unmasked data on millions of students Synthetic datasets may solve this by providing utility, privacy preservation, scalability, customization, variability, and resistance to reverse-engineering Techniques used included autoencoders, variational auto-encoders (VAE), generative adversarial networks (GAN) , and copulas combined with GANs.

Publication Date

2025

Comments

Presented at the 2025 SDSU Data Science Symposium.

Generative AI for Synthetic Data Creation: Building Mastery-Focused Educational Datasets

Share

COinS