dcdump

command
v0.2.2
Warning

This package is not in the latest version of its module.

Published: Feb 21, 2025 License: MIT Imports: 12 Imported by: 0

Documentation

Overview

Tool to fetch a full list of DOIs from the datacite.org API, because as of Fall 2019 a full dump is not yet available (https://git.io/Je6bs, https://git.io/Je6Dg).

THIS IS THROWAWAY CODE, AS IT IS HOPEFULLY OBSOLETE SOON.

Currently (12/2019) using the "dois" endpoint, from v2 of the datacite API, supposedly.

> The current version of the REST API is version 2. If you are using the endpoints /works, /members, or /data-centers, you are using version 1.

Requests can be sliced into intervals of various sizes (weekly, daily, hourly, every minute) to mitigate the deep paging issue and HTTP 502 errors.
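The interval slicing can be sketched as follows. This is a minimal illustration, not dcdump's actual code: the names `Interval`, `splitRange`, `mustDate` and `day` are hypothetical, and the "start TO end" rendering is only an assumed display format, not a confirmed API query syntax.

```go
package main

import (
	"fmt"
	"time"
)

// day is a convenience step size (hypothetical; not taken from dcdump).
const day = 24 * time.Hour

// Interval is a half-open time window [Start, End).
type Interval struct {
	Start, End time.Time
}

// String renders the window as "start TO end" in RFC 3339 form.
// Assumption: this is a display format only.
func (iv Interval) String() string {
	return iv.Start.Format(time.RFC3339) + " TO " + iv.End.Format(time.RFC3339)
}

// splitRange cuts [start, end) into consecutive windows of at most step
// length. The last window is truncated so it never extends past end.
func splitRange(start, end time.Time, step time.Duration) []Interval {
	var out []Interval
	if step <= 0 {
		return out
	}
	for cur := start; cur.Before(end); cur = cur.Add(step) {
		next := cur.Add(step)
		if next.After(end) {
			next = end
		}
		out = append(out, Interval{Start: cur, End: next})
	}
	return out
}

// mustDate parses a YYYY-MM-DD date as UTC and panics on bad input.
func mustDate(s string) time.Time {
	t, err := time.Parse("2006-01-02", s)
	if err != nil {
		panic(err)
	}
	return t
}

func main() {
	// Three daily windows; each would be harvested with its own request series.
	for _, iv := range splitRange(mustDate("2019-12-01"), mustDate("2019-12-04"), day) {
		fmt.Println(iv)
	}
}
```

Smaller windows mean fewer results per window, so each window needs fewer pages, at the cost of more requests overall.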

Errors encountered: HTTP 502, 500, 403, 400, and "unexpected EOF" (maybe https://stackoverflow.com/q/21147562/89391). A strange error occurred with the minute interval: "search_after has 3 value(s) but sort has 2."
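One way to cope with the transient errors among these is a retry loop with exponential backoff. The sketch below is an assumption about how this could be done, not a description of dcdump's behavior: `StatusError`, `retryable` and `withRetry` are hypothetical names, and the choice to retry 5xx responses and "unexpected EOF" but not 400 or 403 is an illustrative policy.

```go
package main

import (
	"errors"
	"fmt"
	"io"
	"time"
)

// StatusError carries the HTTP status code of a failed request (hypothetical type).
type StatusError struct{ Code int }

func (e *StatusError) Error() string {
	return fmt.Sprintf("unexpected HTTP status %d", e.Code)
}

// retryable reports whether err looks transient. Assumption: 5xx responses
// and "unexpected EOF" are worth retrying; 4xx responses are not.
func retryable(err error) bool {
	var se *StatusError
	if errors.As(err, &se) {
		return se.Code >= 500
	}
	return errors.Is(err, io.ErrUnexpectedEOF)
}

// withRetry calls fn up to attempts times, doubling the delay after each
// transient failure. Non-transient errors are returned immediately.
func withRetry(attempts int, delay time.Duration, fn func() error) error {
	if attempts < 1 {
		attempts = 1
	}
	var err error
	for i := 0; i < attempts; i++ {
		if err = fn(); err == nil || !retryable(err) {
			return err
		}
		if i < attempts-1 {
			time.Sleep(delay)
			delay *= 2
		}
	}
	return fmt.Errorf("giving up after %d attempts: %w", attempts, err)
}

func main() {
	calls := 0
	err := withRetry(4, 10*time.Millisecond, func() error {
		calls++
		if calls < 3 {
			return &StatusError{Code: 502} // simulated transient failure
		}
		return nil
	})
	fmt.Println(calls, err)
}
```

In a real harvester, `fn` would wrap the HTTP request for one page and return a `*StatusError` for any non-200 response.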

Reported issues:
