Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Performance Issue #616

Open
arind123 opened this issue Jul 21, 2023 · 0 comments
Open

Performance Issue #616

arind123 opened this issue Jul 21, 2023 · 0 comments

Comments

@arind123
Copy link

I am trying to create a data processing pipeline that start with a catalogue of multiple Sentinel 2 tiles and multiple 10mt bands (Data worth almost 1 year so around 250 Sentinel 2 Products and 250*5 number of tif files). On that I am applying resampling, Local Algebra, Masking etc. and the same thing I am doing on some bands of 20mt resolutions. An Finally join and save the final data frame as parquet.

This whole process is taking around 15 hours to complete in Spark, Is that what it is supposed take in Rasterframes ???

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant