Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

As a user I would like a mode that more aggressively predicts author strings #242

Open
mjy opened this issue Aug 15, 2023 · 2 comments
Open

Comments

@mjy
Copy link
Contributor

mjy commented Aug 15, 2023

Given the following assumptions are always true:

  • A string contains one or more of name, author, and year
  • That string contains only those elements (i.e. there is never going to be some element that is not one of those three)

Then I would like to set an option to more agressively predict the author element. False positives are not a major concern, the benefit of good match will be seen downstream via a edit-distance calculation element.

The canonical example is: s = aus Jones.

When I do ::Biodiversity::Parser.parse(s).dig(:authorship, :normalized) then I get parsed: false. I would like to get "Jones".

@mjy
Copy link
Contributor Author

mjy commented Aug 15, 2023

Thinking further this likely holds for Year too.

@dimus
Copy link
Member

dimus commented Mar 22, 2024

hm, parser does require the binomial for species, may be I can have an option to add a 'fake' binomial for zoological species names?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants