How to divide our own dataset into test, dev and train data and assign them labels for fine tuning process #96

smruti241 · 2023-03-08T20:13:52Z

Hi @jerryji1993 , @Zhihan1996 , @project-delphi , @hjgwak , @timlautk ,

I read your paper and its very interesting. I have a dataset which consists of 6-mers only. I want to divide my dataset into test, dev and train data and assign them labels for fine tuning process directly (no pre-training required, I will use pre-trained models). Can you please tell me the procedure or any script is available in the folders of this tool? Please let me know. Thanks!

Moeinh77 · 2023-03-20T17:56:59Z

Hi yes there is a way to load the models with HuggingFace I have done it in this repository: https://github.com/Moeinh77/Virus-DNA-Classification

smruti241 · 2023-03-20T18:57:32Z

@Moeinh77 can you please tell me how to use it? I didnt understand properly. I have kmer data already (6-mer data). I want to use pre-trained models for fine tuning. I dont have labels added in my kmer data

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to divide our own dataset into test, dev and train data and assign them labels for fine tuning process #96

How to divide our own dataset into test, dev and train data and assign them labels for fine tuning process #96

smruti241 commented Mar 8, 2023

Moeinh77 commented Mar 20, 2023

smruti241 commented Mar 20, 2023 •

edited

How to divide our own dataset into test, dev and train data and assign them labels for fine tuning process #96

How to divide our own dataset into test, dev and train data and assign them labels for fine tuning process #96

Comments

smruti241 commented Mar 8, 2023

Moeinh77 commented Mar 20, 2023

smruti241 commented Mar 20, 2023 • edited

smruti241 commented Mar 20, 2023 •

edited