Skip to content

Drifted Tabular Data Generation #2052

Answered by noamzbr
LifeBoey asked this question in Q&A
Discussion options

You must be logged in to vote

Hi @LifeBoey

This is a really good question! There are no "absolute" answers here because drift detection is frequently used as a proxy for something else, for example "How well will my model do on this new data, for which I don't have labels yet?". That means that "how good" a drift detection algorithm is depends on the type of drift you expect to have in real life, and it's relation to the quality of your models' predictions.

Having said that, there's a couple of things you can do:

  1. As you suggested, you can add some characteristic noise to one or more of your features. You can visually explore the effects of adding noise in our Checks Demo. This example simply adds the same kind of noi…

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by noamzbr
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
tabular Affects deepchecks.tabular package
2 participants