The current implementation of dataset filtering in Soda Core relies heavily on the presence of timestamp columns to perform time-based data filtering. However, with the growing adoption of Iceberg tables, it's become evident that not all Iceberg tables utilise a timestamp column for partitioning or time-based queries.
This limitation poses challenges for people leveraging Iceberg tables in their data lakes, as they might encounter difficulties in using Soda Core's dataset filtering features effectively.
Would it be possible to add support for dataset filtering on Iceberg tables to Soda Core?
Thanks!
Hi Paolo, I am not familiar with Iceberg tables. Could you let me know why dataset filters would not work on such tables? What would the partitioning mechanism be when there is no timestamp column?
Note that you can use dataset filters on any column type; it doesn't have to be a timestamp. A dataset filter accepts a SQL WHERE clause, which makes it quite flexible.
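To illustrate the point about non-timestamp filters, here is a minimal sketch. The `orders` table, its `region` and `batch_id` columns, the filter name `eu_batch`, and the helper `filtered_row_count` are all hypothetical. The SodaCL string follows the `filter ... [name]` / `checks for ... [name]` form as I understand it from the Soda docs, so please verify it against your Soda Core version. The SQLite part is not how Soda Core is implemented; it only shows the idea that a dataset filter narrows a check to the rows matching a WHERE clause.

```python
import sqlite3

# Assumed SodaCL shape for a named dataset filter on non-timestamp columns.
# Dataset, column, and filter names are made up for illustration.
SODACL_SKETCH = """
filter orders [eu_batch]:
  where: region = 'EU' AND batch_id = 42

checks for orders [eu_batch]:
  - row_count > 0
"""


def filtered_row_count(conn, table, where):
    """Count rows in `table` that match the WHERE clause `where`.

    Conceptual stand-in for a filtered row_count check; the strings are
    interpolated directly, so only pass trusted values.
    """
    query = f"SELECT COUNT(*) FROM {table} WHERE {where}"
    return conn.execute(query).fetchone()[0]


# Small in-memory table partitioned by region and batch_id, no timestamp.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER, region TEXT, batch_id INTEGER)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [(1, "EU", 42), (2, "EU", 41), (3, "US", 42), (4, "EU", 42)],
)

count = filtered_row_count(conn, "orders", "region = 'EU' AND batch_id = 42")
print(count)
```

If your Iceberg tables are partitioned by something like an identity or bucket transform on a non-time column, the same kind of WHERE clause on that column should, in principle, work as the filter.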