Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Harvester Timed Out #30

Open
bwagerson opened this issue Mar 20, 2019 · 3 comments
Open

Harvester Timed Out #30

bwagerson opened this issue Mar 20, 2019 · 3 comments

Comments

@bwagerson
Copy link

bwagerson commented Mar 20, 2019

Harvester timed out at about 677,989 out of 1,500,000 items while trying to harvest all of Arxiv.org, is there a way to pick the harvest back up where it timed out? Instead of starting at the beginning?

@bloomonkey
Copy link
Owner

Hi. See #22 for some conversation around this

@ericywl
Copy link

ericywl commented Apr 12, 2019

Sorry, but is the definitive answer to provide the resumption token? How would we get that from oai-harvest?

@bloomonkey
Copy link
Owner

Yes, the resumptionToken is the only mechanism in OAI-PMH for resuming a previous harvesting run.
I'm not sure how you'd get access to the necessary token though, as it's probably only used internal to oaiharvest to retrieve the next chunk 🤔 .
Maybe it could store the token in a local file called e.g. .resumptionToken and use this as a default for the -r option if not value is provided...

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants