I've been hacking around on this package for the last few weeks at work - I had some tables that were tens of millions of rows to anonymise. I ended up adding a method to the TableDefinition which lets me set a queryChunkSize property for the table. Then I changed the dumpTableData() method to use the queryChunkSize, collecting the results until the insert statement's chunkSize was reached.
I also added an option to export to CSV instead of SQL, but that may be beyond the desired scope for this package.
Is there any appetite from BeyondCode (maybe @mechelon or @mpociot) for a PR for any of this? I haven't really contributed anything to an open source package before so I'm not sure on the etiquette. I'd need to tidy it up before submitting, so I wanted to check first.
I've been hacking around on this package for the last few weeks at work - I had some tables that were tens of millions of rows to anonymise. I ended up adding a method to the
TableDefinitionwhich lets me set aqueryChunkSizeproperty for the table. Then I changed thedumpTableData()method to use thequeryChunkSize, collecting the results until the insert statement'schunkSizewas reached.I also added an option to export to CSV instead of SQL, but that may be beyond the desired scope for this package.
Is there any appetite from BeyondCode (maybe @mechelon or @mpociot) for a PR for any of this? I haven't really contributed anything to an open source package before so I'm not sure on the etiquette. I'd need to tidy it up before submitting, so I wanted to check first.