Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Provide a "sequential" subset strategy #295

Open
exhuma opened this issue Feb 8, 2024 · 0 comments
Open

Provide a "sequential" subset strategy #295

exhuma opened this issue Feb 8, 2024 · 0 comments

Comments

@exhuma
Copy link

exhuma commented Feb 8, 2024

We have a pretty large database that uses temporal tables. For a development subset we are not interested in the whole history.

When subsetting, it's important that the samples from the history tables are sequential and have no gaps. So a random subset will not provide the data needed to test the full behaviour of the application accessing that DB.

It would be necessary to provide a "sequential" subset, using one or more columns for ordering (including asc/desc option) and specifying the size of the subset. In this particular case, a static size (in number of rows) would be better than a percentage.

A nice-to-have would be to match the subset on a table-name pattern. All the history tables live in their own schema so using a pattern like history.* would be nice as it would dynamically adapt to DB changes.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant