Documentation
Overview
Tool to fetch a full list of DOIs from the datacite.org API, because as of Fall 2019 a full dump is not yet available (https://git.io/Je6bs, https://git.io/Je6Dg).
THIS IS THROWAWAY CODE, AS IT IS HOPEFULLY OBSOLETE SOON.
Currently (12/2019) using the "dois" endpoint, supposedly part of v2 of the datacite API:
> The current version of the REST API is version 2. If you are using the endpoints /works, /members, or /data-centers, you are using version 1.
Requests are split into time intervals of various lengths (weekly, daily, hourly, every minute) to mitigate the deep paging issue and HTTP 502s.
Errors encountered: 502, 500, 403, 400, "unexpected EOF" (maybe https://stackoverflow.com/q/21147562/89391). A strange error occurred with the minute interval: "search_after has 3 value(s) but sort has 2."
Reported issues:
- https://github.com/datacite/datacite/issues/897 ("search_after ...")
- https://github.com/datacite/datacite/issues/898 (less informative 500 on https://is.gd/uP0aJ2; 2019-10-07 16:19:00 - 16:19:59)