
Getting Buffer Validation newBufferWith must not exceed 256 MB error when I am trying to load model of 2.5 gb size #5384

Open
Pratik-7i opened this issue May 7, 2024 · 5 comments
Assignees
Labels
platform:ios MediaPipe iOS issues stat:awaiting googler Waiting for Google Engineer's Response task:LLM inference Issues related to MediaPipe LLM Inference Gen AI setup type:others issues not falling in bug, performance, support, build and install or feature

Comments

@Pratik-7i

Pratik-7i commented May 7, 2024

I have downloaded the Google iOS sample of MediaPipe and tried to load my model, which is 2.5 GB in size.

private var inference: LlmInference! = {
    // slm1.bin is my model
    let path = Bundle.main.path(forResource: "slm1", ofType: "bin")!
    let llmOptions = LlmInference.Options(modelPath: path)
    return LlmInference(options: llmOptions)
}()

The project builds successfully, but on app launch I get the following error:

-[MTLDebugDevice newBufferWithBytes:length:options:]:670: failed assertion `Buffer Validation
newBufferWith*:length 0x1f400000 must not exceed 256 MB.

I learned that a single MTLBuffer is limited to a maximum length of 256 MB. If a total allocation of more than 256 MB is needed, multiple buffers can be allocated and the data split among them, but I don't know how to do that with this SDK.
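For context, the 256 MB cap is a per-buffer Metal limit, not a total-memory limit, so the standard workaround is to slice a large payload into chunks of at most 256 MB and back each chunk with its own MTLBuffer. A minimal sketch of that idea is below; `makeChunkedBuffers` is a hypothetical helper for illustration only and is not part of the MediaPipe API, which allocates its buffers internally:

```swift
import Foundation
import Metal

/// Hypothetical helper: splits `data` into slices no larger than `maxBytes`
/// and wraps each slice in its own MTLBuffer. Illustrates the multi-buffer
/// workaround only; an app cannot apply this inside MediaPipe's own
/// allocation path.
func makeChunkedBuffers(device: MTLDevice,
                        data: Data,
                        maxBytes: Int = 256 * 1024 * 1024) -> [MTLBuffer]? {
    var buffers: [MTLBuffer] = []
    var offset = 0
    while offset < data.count {
        // Each chunk is at most maxBytes, so each buffer stays under the cap.
        let length = min(maxBytes, data.count - offset)
        let chunk = data.subdata(in: offset ..< offset + length)
        let created: MTLBuffer? = chunk.withUnsafeBytes { (raw: UnsafeRawBufferPointer) -> MTLBuffer? in
            guard let base = raw.baseAddress else { return nil }
            return device.makeBuffer(bytes: base, length: length, options: .storageModeShared)
        }
        guard let buffer = created else { return nil }
        buffers.append(buffer)
        offset += length
    }
    return buffers
}
```

Since the SDK owns its Metal allocations, this kind of chunking would have to happen inside MediaPipe's inference engine rather than in application code, which is why the issue asks the maintainers for support.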

@Pratik-7i Pratik-7i added the type:others issues not falling in bug, performance, support, build and install or feature label May 7, 2024
@kuaashish kuaashish assigned kuaashish and unassigned ayushgdev May 8, 2024
@kuaashish kuaashish added task:LLM inference Issues related to MediaPipe LLM Inference Gen AI setup platform:ios MediaPipe IOS issues labels May 8, 2024
@kuaashish
Collaborator

Hi @Pratik-7i,

This is already on our roadmap. While we have not yet tested the iOS interface with other large models, we are actively working to ensure compatibility with them, and this functionality will be available soon. Regarding your current scenario, please allow us some time to determine whether we can assist.

Thank you!!

@kuaashish kuaashish added the stat:awaiting response Waiting for user response label May 8, 2024
@kuaashish
Collaborator

Hi @priankakariatyml,

Do you know of a way to allocate multiple buffers and distribute the data among them in the SDK? Any recommendations would be greatly appreciated.

Thank you!!

@google-ml-butler google-ml-butler bot removed the stat:awaiting response Waiting for user response label May 8, 2024
@kuaashish kuaashish added the stat:awaiting googler Waiting for Google Engineer's Response label May 8, 2024
@Pratik-7i
Author

Thanks @kuaashish
We need to integrate a large model into our live project. Could you let us know when this functionality might be available?

@kuaashish
Collaborator

Hi @Pratik-7i,

We are unable to provide an exact date at this time, but rest assured it will be available soon. We will keep you informed of any updates in this same thread.

Thank you!!

@Pratik-7i
Author

Thanks @kuaashish
We will be waiting for an update.
