Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Is Qwen in the roadmap? #3583

Open
DzAvril opened this issue May 11, 2024 · 4 comments
Open

Is Qwen in the roadmap? #3583

DzAvril opened this issue May 11, 2024 · 4 comments

Comments

@DzAvril
Copy link

DzAvril commented May 11, 2024

Hello executorch team,

I have a question regarding the compatibility of executorch with Qwen, an open-source llm created by Alibaba Group (QWen) . Are there any plans to support Qwen in the future?

Thank you for your attention.

@mikekgfb
Copy link
Contributor

I have not looked at Qwen in detail, but if the architecture is similar to llama2 or llama3 it should be very easy to add, by writing a parameter json file for the model. (The MoE part may require some work?)

ExecuTorch is an OSS project and we're excited to take community contributions that meet our open source commitments and coding standards! So, if you're interested in exploring how to add support for Qwen, we'll be delighted to get a submission!

@DzAvril
Copy link
Author

DzAvril commented May 14, 2024

image
Thanks for your response @mikekgfb . In the v0.2 release note, I noticed that QW1.5 7B has been tested using torch.export. Would it be possible for you to share the test code as a reference for supporting QWen with executorch?

@JacobSzwejbka
Copy link
Contributor

@cbilgin were you involved in that testing suite

@DzAvril
Copy link
Author

DzAvril commented May 27, 2024

I apologize for reaching out once again, but would it be possible for someone to kindly provide some guidelines on how to support QWen?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants