Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Crossref parser will fail on article bundles #61

Open
seasidesparrow opened this issue Aug 25, 2023 · 0 comments
Open

Crossref parser will fail on article bundles #61

seasidesparrow opened this issue Aug 25, 2023 · 0 comments
Assignees
Labels
bug Something isn't working

Comments

@seasidesparrow
Copy link
Member

Describe the bug
run.py will log an error and when downloaded article bundles are parsed with the crossref parser

To Reproduce
Steps to reproduce the behavior: Try parsing the file /proj/ads/abstracts/data/PHYU/ufn_2022_065_01.xml

Additional context
The crossref parser is expecting two things from text content: 1) that each record is wrapped in a tag (crossref.py L381), and 2) that there is only one record per bundle (crossref.py L372). To address the issue, we need a crossref parser that (a) doesn't require there to be only one <doi_record>, and (b) that records aren't necessarily embedded in a <crossref> tag.

@seasidesparrow seasidesparrow added the bug Something isn't working label Aug 25, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants