Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to divide our own dataset into test, dev and train data and assign them labels for fine tuning process #96

Open
smruti241 opened this issue Mar 8, 2023 · 2 comments

Comments

@smruti241
Copy link

Hi @jerryji1993 , @Zhihan1996 , @project-delphi , @hjgwak , @timlautk ,

I read your paper and its very interesting. I have a dataset which consists of 6-mers only. I want to divide my dataset into test, dev and train data and assign them labels for fine tuning process directly (no pre-training required, I will use pre-trained models). Can you please tell me the procedure or any script is available in the folders of this tool? Please let me know. Thanks!

@Moeinh77
Copy link

Hi yes there is a way to load the models with HuggingFace I have done it in this repository: https://github.com/Moeinh77/Virus-DNA-Classification

@smruti241
Copy link
Author

smruti241 commented Mar 20, 2023

@Moeinh77 can you please tell me how to use it? I didnt understand properly. I have kmer data already (6-mer data). I want to use pre-trained models for fine tuning. I dont have labels added in my kmer data

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants