I need to generate large (1TB-3TB) synthetic MySQL datasets for testing, with a number of requirements:
a) custom output formatting (SQL, CSV, fixed-len row, etc)
b) referential integrity support (ie, child tables should reference PK values, no orphans,etc)
c) able to generate multiple tables in parallel
d) preferably able to operate without a GUI and/or manual intervention
e) uses a well defined templating construct for data generation
f) preferably open source
Does anyone out there know of a product that meets at least most of these requirements?
I found a PHP based data generation script (www.generatedata.com) that is extensible in its output formatting, so it should do everything I need it to do.
My SQL Dump
MySQL musings by a self professed MySQL Geek
- Tools to generate large synthetic data sets for testing?