Add python binding for loading bin from memory. #5164

joeyballentine · 2023-11-22T03:47:29Z

This PR adds a simple Python binding for loading a bin from memory. A binding for loading a param from memory already exists, and so does the C++ function for loading a bin from memory, so this binding is all that's needed.

I've been using this exact binding in my ncnn_vulkan fork for a long time now, and it works perfectly fine.

I'm currently considering switching over to the main ncnn package, but I need a couple of features (like this one) before I can do so, so expect some more PRs from me.

tencent-adm · 2023-11-22T03:47:42Z

All committers have signed the CLA.

JeremyRand · 2023-11-23T17:48:08Z

Did you test this with both CPU and Vulkan inference?

joeyballentine · 2023-11-23T17:51:56Z

@JeremyRand I have not tested with CPU. Why would that matter?

JeremyRand · 2023-11-23T18:50:19Z

When I cherry-picked your binding some months ago and tried it with CPU inference in chaiNNer, I got an immediate segfault. My understanding is that pybind11 deallocates the memory as soon as the bound function returns, which causes a memory safety bug because ncnn doesn't make a copy of the data (it assumes you're calling from C++ and will manage the memory yourself). I suspect you didn't see a segfault in Vulkan mode because the memory management is different there (it may copy the data while uploading it to the GPU), but I didn't verify this hypothesis.
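The lifetime problem described above can be illustrated without ncnn at all. The following is a minimal pure-Python sketch, not the actual binding: `FakeNet` and `WeightBlob` are hypothetical stand-ins, and a weak reference is used to model a raw pointer that does not own the data it points to. It only shows why a loader that borrows a buffer needs something to keep that buffer alive.

```python
import gc
import weakref


class WeightBlob:
    """Hypothetical owner of the raw model bytes passed in from Python."""

    def __init__(self, data: bytes):
        self.data = data


class FakeNet:
    """Hypothetical stand-in for a native net that borrows weights instead of copying them."""

    def __init__(self):
        self._borrowed = None   # weak reference: models a non-owning raw pointer
        self._keepalive = None  # strong reference: what a safe wrapper would hold

    def load_model_mem(self, blob: WeightBlob, keep_alive: bool) -> None:
        self._borrowed = weakref.ref(blob)
        if keep_alive:
            # Tie the buffer's lifetime to the net's lifetime.
            self._keepalive = blob

    def weights_valid(self) -> bool:
        return self._borrowed is not None and self._borrowed() is not None


def load(keep_alive: bool) -> FakeNet:
    net = FakeNet()
    # The blob is a temporary: nothing else references it after this call.
    net.load_model_mem(WeightBlob(b"\x00" * 16), keep_alive)
    return net


unsafe = load(keep_alive=False)
safe = load(keep_alive=True)
gc.collect()  # make the outcome independent of reference-counting details

print(unsafe.weights_valid())  # False: the borrowed buffer is already gone
print(safe.weights_valid())    # True: the net keeps its buffer alive
```

In a real binding the unsafe case would be a dangling pointer rather than a dead weak reference, which is consistent with the immediate segfault reported for CPU inference.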

joeyballentine · 2023-11-24T18:55:48Z

Interesting. Well, IMO that isn't enough reason to leave the binding out. If it doesn't work for CPU, just don't use it for CPU inference.

Sounds to me like it's a bug in the CPU path of the C++ code, so it's out of scope for this PR and should be fixed separately.

JeremyRand · 2023-11-25T07:15:36Z

Yes, agreed that it's reasonable to have the binding if it works for Vulkan, since it's more efficient than making a copy. If it doesn't work for CPU, it might be worth documenting that explicitly, but other than that I have no objection to the concept.

nihui · 2023-12-21T06:49:38Z

We may need to add a parameter to load_model that tells ncnn whether it needs to deep-copy the weight data.
This will be available in next year's version :D
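As a rough illustration of what such a switch could mean, here is a pure-Python sketch. `FakeWeights` and its `copy` argument are hypothetical and are not the ncnn API; the sketch only contrasts an independent snapshot of the data with a zero-copy view of the caller's buffer.

```python
class FakeWeights:
    """Hypothetical holder for model weights; not part of ncnn."""

    def __init__(self, data: bytearray, copy: bool):
        # copy=True: take an independent snapshot of the bytes.
        # copy=False: keep a zero-copy view of the caller's buffer.
        self.view = bytes(data) if copy else memoryview(data)

    def first_byte(self) -> int:
        return self.view[0]


source = bytearray(b"\x01\x02\x03")
copied = FakeWeights(source, copy=True)
shared = FakeWeights(source, copy=False)

# Simulate the caller reusing or overwriting its buffer after loading.
source[0] = 0xFF

print(copied.first_byte())  # 1: the snapshot is unaffected
print(shared.first_byte())  # 255: the view sees the caller's change
```

One caveat with this analogy: a Python `memoryview` keeps its source object alive, whereas a raw pointer held on the C++ side would not, so the zero-copy case is more dangerous in a real binding than it looks here.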

Kim2091 · 2024-04-12T20:18:45Z

Implementing this would benefit many projects. Please consider doing so.

Add python binding for load_model_mem

3e40924

github-actions bot added the python label Nov 22, 2023

apply code-format changes

98ee440

kangzixiang approved these changes Jan 2, 2024

whyb approved these changes Jan 9, 2024