Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Prepare CRAN release #25

Open
6 of 10 tasks
wleoncio opened this issue Feb 28, 2023 · 1 comment
Open
6 of 10 tasks

Prepare CRAN release #25

wleoncio opened this issue Feb 28, 2023 · 1 comment
Assignees

Comments

@wleoncio
Copy link
Member

wleoncio commented Feb 28, 2023

  • Internalize functions other than greedyMix()
  • Stop to-be-implemented functions on greedyMix()
  • Add examples to greedyMix()
  • Incorporate functions that performs clustering on greedyMix()
  • Achieve minimal input of greedyMix()
  • Achieve minimal input of greedyMix() with indMix() parsing
  • Make sure output of greedyMix() for FASTA files is correct (minimum quality for CRAN release)
  • Decide package function (rBAPS or BAPS?)
  • Create GH issue form for easy reporting
  • Add welcome message on load with URL to issues
@wleoncio wleoncio self-assigned this Feb 28, 2023
wleoncio added a commit that referenced this issue Aug 11, 2023
Apparently that function is only used if the input is Genepop or BAPS format. However, its code might be useful for reducing some of the input of greedyMix(), as it contains the calculation for them.
wleoncio added a commit that referenced this issue Aug 11, 2023
wleoncio added a commit that referenced this issue Aug 11, 2023
wleoncio added a commit that referenced this issue Aug 11, 2023
wleoncio added a commit that referenced this issue Aug 11, 2023
wleoncio added a commit that referenced this issue Aug 11, 2023
Replaced 1: with seq_len() and seq_along() (#25).

Helps working around NULL and 0 values.
wleoncio added a commit that referenced this issue Aug 11, 2023
wleoncio added a commit that referenced this issue Aug 11, 2023
wleoncio added a commit that referenced this issue Aug 11, 2023
wleoncio added a commit that referenced this issue Aug 11, 2023
wleoncio added a commit that referenced this issue Aug 11, 2023
wleoncio added a commit that referenced this issue Aug 11, 2023
It should be OK for Z to be NULL. This currently triggers an issue on initialCounts(), which should be fixed
wleoncio added a commit that referenced this issue Aug 11, 2023
wleoncio added a commit that referenced this issue Aug 11, 2023
wleoncio added a commit that referenced this issue Sep 11, 2023
wleoncio added a commit that referenced this issue Sep 11, 2023
* issue-25: (38 commits)
  Adjusted unit tests for #25
  Fixed sink() usage
  Fixed docs
  Exporting importFile()
  Improved handling of supported formats for greedyMix() (#25)
  Fixed basic parsing of FASTA files (#25)
  Increment version number to 0.0.0.9022
  Fixed syntax (#25)
  Improved printing (#25)
  Partial reversion of b034158 (#25)
  Fixed to indMix (#25)
  Incorporated handleData() on greedyMix() (#25)
  Improved handleData() to handle FASTA (#25)
  Added numeric output option to load_fasta() (#25)
  Fixed test text (#25)
  Added missing documentation for arguments (#25)
  Syntax fix (#25)
  Delayed resolution of FIXMEs (#25)
  Workaround for usage of MATLAB any() (#25)
  Fixed argument passing (#25)
  ...
@wleoncio
Copy link
Member Author

wleoncio commented Sep 18, 2023

Some ideas/insights after a meeting with Jukka:

  • Fasta dataset: generate more data
    • Replicate the dataset 10-20x for each individual (row)
    • Generate more columns (ACGT alphabet)
  • Set cluster default to 5 (all dimensions of all generic objects?)
    • Alternatively, re-dim used generics before handling them
  • Investigate df_fasta$data
    • It has 6 rows when it should have 5 (looks like MATLAB also does that)
    • The first row is just zeros: may be an error (handleData() adds a row of zeros, even on MATLAB)
    • Why 99 columns when each individual has 100 letters? (handleData() drops column with 1 unique letter)
  • Investigate ninds calculation
    • Perhaps indMix() gets stuck because ninds is at 2 (when there are 5 individuals)
    • A higher ninds, say 5, should help
  • Create new haploid data from BAPS_format_clustering_diploid.txt (are BAPS files going to be supported? No pre-processing available (importFile(), handleData())
    • Each individual (integer in last column) has two rows assigned to them (ergo, "diploid")
    • Some creatures have 4 rows (tetraploid)
    • Erase every other row to create a sinthetic haploid example

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant