Enhancement Subsystem

The enhancement subsystem is responsible for services that supplement the core e-print content and metadata to add value for arXiv readers and/or API clients. In general, the functionality provided by services in the enhancement subsystem goes beyond the core function of arXiv, but address important stakeholder or compliance requirements.

Context

Services in the enhancement subsystem support end-user workflows, either via form-driven interfaces provided by each service or via client-side applications that leverage enhancement services via their APIs.

A common theme in most enhancement services is the need to relate metadata added in reference to e-print submissions to the resulting announced e-print. For example, information about funding may be added at some point during or after submission but prior to announcement; that information must be transitive to the resulting announced e-print without further human intervention. This can generally be achieved by maintaining a mapping internally, and updating that mapping with data from the canonical announcement Event Stream.

Some enhancement services may call out to third party APIs, e.g. to collect additional metadata provided by publishers. See External links service.

Enhancement services may also generate events about their respective domains that may be of interest to services in other parts of the arXiv NG system. For example, new author disambiguation information may be of interest to the search system; this can be propagated by emitting events about new author relations that are consumed by the Search service, via the Enhancement Event Stream.

Domains & services

../_images/ng-enhancement.png

Author identity service

https://github.com/arXiv/arxiv-authors

The author identity service is responsible for mapping author name substrings in announced e-prints to stable author identifiers, as well as maintaining an alignment of identifiers in multiple relevant authority systems (including the classic arXiv author authority record).

There are several ways that information may arrive at the author identity service.

  • Author-owners may add author identifiers for individual author names on e-prints that are already announced, using a form-driven UI provided by this service.

  • Submitters may add author identifiers for individual author names via form-driven interfaces in the Submission service, which in turn submits that information to the author identity service.

  • Authorized trusted partners may programmatically add disambiguating information to this service via its API.

The author identity service exposes this information in two ways:

  • New information about author identities for announced e-prints (mapping between an author name substring and author identifiers) generated by new assertions or by updated authority alignments are propagated via the Enhancement Event Stream.

  • The author identity service provides a RESTful API that exposes both the most current state of author name mappings as well as the accession history of mapping and alignment information.

The author identity service supports both submission and announced e-print identifiers, and subscribes to the canonical announcement event stream in order to update submission-referenced author information with the announced e-print identifier.

Funding information service

https://github.com/arXiv/arxiv-funding

The funding information service is responsible to accession and make available funding information about specific arXiv e-prints. This service wraps the CrossRef Funder Registry.

Users can add information about funding agencies and grant numbers either directly via a form-driven interface provided by this service or via API by the submission interface during the submission process. Trusted and authorized platforms may also update this information via the service’s API.

The service provides a RESTful API that exposes both the current state of funding information about an e-print as well as the accession history of that information.

The funding information service supports both submission and announced

e-print identifiers, and subscribes to the canonical announcement event stream in order to update submission-referenced funding information with the announced e-print identifier.