Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GCSFuse Read/Write Perf - gcloud storage api #1300

Open
pmorse-cr opened this issue Aug 23, 2023 · 7 comments
Open

GCSFuse Read/Write Perf - gcloud storage api #1300

pmorse-cr opened this issue Aug 23, 2023 · 7 comments
Assignees
Labels
Eng-Backlog feature request Feature request: request to add new features or functionality

Comments

@pmorse-cr
Copy link

Describe the issue
We are building a new video transcoding pipeline within Google Cloud and are looking to minimize the amount of files moving around. In video transcoding most of the time is taken in CPU/GPU, but we have found that GCSFuse is working well for smaller source files and as soon as we src >6GB video performance drops. We are testing copying to local-ssd/pd and filestore, but would like to maintain our use of GCSFuse and would like to know what the plans are to move to the new gcloud storage api/functions vs gsutil. Thank you very much~!

@pmorse-cr pmorse-cr added p1 P1 question Customer Issue: question about how to use tool labels Aug 23, 2023
@pmorse-cr
Copy link
Author

I apologize, it actually looks like it isn't a different API - but the difference in tooling within the sdk. Are there any plans to change the way the gcsfuse interacts with the API for perf improvements obtained by the new gcloud storage tooling? thanks

@raj-prince
Copy link
Collaborator

Hi @pmorse-cr,

Thank you for the showing interest in GCSFuse!

Is your workload read heavy or write heavy?

To give a little context, GCSfuse is a FUSE (Filesystem in Userspace) implementation that allows you to mount a GCS bucket as a local filesystem. This means that you can access your GCS objects as if they were regular files on your computer. GCSfuse is a good option if you need to access GCS objects with standard file system operations, such as cat, ls, cp, and mv.

As far as I know, both GCSFuse and gcloud storage use similar APIs to interact with the GCS. However, in the case of GCSFuse, requests go through the kernel (by design), which can result in a difference in the performance for some operations. Still, you can follow this doc to improve the performance based on your use-case.

-Prince

@raj-prince
Copy link
Collaborator

Hi @pmorse-cr,

In addition to the above:

  • Could you please also share the numbers you got b/w gcloud storage and gcsfuse? This will help us in understanding the gap.
  • What is the expected rate of processing data through your pipeline in order to ensure smooth execution?
  • You can refer this page, to gain a better understanding of the performance of gcsfuse for different workloads.

Also, are you talking about these improvements?
(a) CRC32C data integrity check.
(b) Graph based tasked management to parallellize the work with less overhead.

I'll discuss within team, if we can incorporate any of these in GCSFuse.

We look forward to hearing from you.

Thanks,
Prince Kumar.

@pmorse-cr
Copy link
Author

pmorse-cr commented Sep 26, 2023

Thanks very much for the reply and sorry about the delays. We are helping a Google Cloud customer build a new transcoding pipeline for VOD services delivery globally. They are currently on AWS and one of the goals is to use as much serverless as possible, while not moving files everywhere. This is why we were hoping to use GCSFuse. We have run into some performance challenges within the drm/packaging services, where it looks like we are io-bound. In regards to performance improvements with gcloud storage vs gsutil - yes the link you sent is what we found within our testing. In fact we moved the transcoding and first portion of packaging into a k8s cluster to use local and regional ssd with gcloud storage operations in front and in back of the jobs within k8s because of the slow downs. We would love to work with you all closely on this. Our customer has also referenced this article multiple times about Fuse and Transcoding, fyi - https://netflixtechblog.com/mezzfs-mounting-object-storage-in-netflixs-media-processing-platform-cda01c446ba
@raj-prince - thanks

@marcoa6
Copy link
Collaborator

marcoa6 commented Sep 27, 2023

Hi @pmorse-cr you mention serverless (Cloud Run?) but then talk about using K8S (GKE). Which will be used?
If GKE, please using the integrated GCSfuse CSI driver. If Serverless there is a potential future integration we are exploring I can make you aware of.

Re Perf: Please see performance best practices, limitations, and benchmarks.

Does the performance with gcloud meet your needs? If not, GCS is not the right solution.
If this request is to support parallel downloads within GCSfuse, it is on our backlog but dont have a date yet.

If you send me your email address, I can reach out to you with more details as i would also like to better understand the use case and see how we can collaborate

@pmorse-cr
Copy link
Author

I'd love to send you my email address and collaborate more. Is there a private way to do that vs open on this issue? Thanks @marcoa6 @raj-prince

@marcoa6 marcoa6 added feature request Feature request: request to add new features or functionality Eng-Backlog and removed question Customer Issue: question about how to use tool p1 P1 labels Sep 29, 2023
@marcoa6
Copy link
Collaborator

marcoa6 commented Sep 29, 2023

We will track this as a feature request to support parallel reads in which large objects are read in chunks in parallel to improve performance

@marcoa6 marcoa6 self-assigned this Sep 29, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Eng-Backlog feature request Feature request: request to add new features or functionality
Projects
None yet
Development

No branches or pull requests

3 participants