search.services.fulltext module

Provides access to fulltext content for arXiv papers.

class search.services.fulltext.FulltextSession(endpoint)[source]

Bases: object

An HTTP session with the fulltext endpoint.

retrieve(document_id)[source]

Retrieve fulltext content for an arXiv paper.

Parameters:
  • document_id (str) – arXiv identifier, including version tag. E.g. "1234.56787v3".
  • endpoint (str) – Base URL for fulltext endpoint. Not part of this method's signature; it is the argument passed to FulltextSession(endpoint).
Returns:

Includes the content itself, creation (extraction) date, and extractor version.

Return type:

Fulltext

Raises:
  • ValueError – Raised when document_id is not a valid arXiv paper identifier.
  • IOError – Raised when unable to retrieve fulltext content.
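
The ValueError condition above depends on what counts as a valid identifier. The module's own validation rules are not shown in this reference, so the following is only a minimal sketch of such a check. The two patterns (new-style and old-style identifiers, each with a mandatory version tag) and the helper name check_document_id are assumptions made for illustration.

```python
import re

# Assumed patterns for versioned arXiv identifiers. These are illustrative;
# the fulltext module's actual validation is not documented here.
NEW_STYLE = re.compile(r"^\d{4}\.\d{4,5}v\d+$")               # e.g. 1234.56787v3
OLD_STYLE = re.compile(r"^[a-z-]+(\.[A-Z]{2})?/\d{7}v\d+$")   # e.g. hep-th/9901001v2


def check_document_id(document_id):
    """Return document_id unchanged, or raise ValueError if it looks invalid.

    Hypothetical helper: mirrors the documented contract (ValueError for an
    invalid identifier) without claiming to be the module's implementation.
    """
    if not isinstance(document_id, str) or not (
        NEW_STYLE.match(document_id) or OLD_STYLE.match(document_id)
    ):
        raise ValueError(f"Not a versioned arXiv identifier: {document_id!r}")
    return document_id
```

Under these assumed patterns, an identifier without a version tag, such as "1234.56787", is rejected, which matches the requirement that document_id include the version.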

search.services.fulltext.current_session()[source]

Get/create FulltextSession for this context.

Return type: FulltextSession

search.services.fulltext.get_session(app=None)[source]

Get a new session with the fulltext endpoint.

Return type: FulltextSession

search.services.fulltext.init_app(app=None)[source]

Set default configuration parameters for an application instance.

Return type: None

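
The difference between get_session (always a new session) and current_session (get or create for this context) can be illustrated with a small standard-library sketch. This is not the module's implementation: the FulltextSession class below is a stand-in, DEFAULT_ENDPOINT is a hypothetical value, the use of contextvars is an assumption, and the stand-in get_session takes an endpoint rather than an app.

```python
from contextvars import ContextVar


class FulltextSession:
    """Stand-in for the documented class; it only stores the endpoint."""

    def __init__(self, endpoint):
        self.endpoint = endpoint


# Hypothetical default. The real configuration is set by init_app and is
# not described in this reference.
DEFAULT_ENDPOINT = "https://fulltext.example.org/"

# One cached session per execution context (assumed caching mechanism).
_session = ContextVar("fulltext_session", default=None)


def get_session(endpoint=DEFAULT_ENDPOINT):
    """Always build and return a new session."""
    return FulltextSession(endpoint)


def current_session():
    """Return the session cached for this context, creating it on first use."""
    session = _session.get()
    if session is None:
        session = get_session()
        _session.set(session)
    return session
```

In this sketch, repeated calls to current_session() within one context return the same object, while each call to get_session() returns a distinct one.
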
search.services.fulltext.retrieve(self, document_id)[source]

Retrieve fulltext content for an arXiv paper.

Parameters:
  • document_id (str) – arXiv identifier, including version tag. E.g. "1234.56787v3".
  • endpoint (str) – Base URL for fulltext endpoint. Not part of this function's signature.
Returns:

Includes the content itself, creation (extraction) date, and extractor version.

Return type:

Fulltext

Raises:
  • ValueError – Raised when document_id is not a valid arXiv paper identifier.
  • IOError – Raised when unable to retrieve fulltext content.
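
A caller will usually want to treat the two documented exceptions differently: ValueError indicates a bad identifier supplied by the caller, while IOError indicates that the content could not be fetched. The sketch below shows one way to do that. Both the Fulltext dataclass and the retrieve function are stand-ins that return canned placeholder data without network access, and the field names content, created, and version are assumptions based on the description of the return value.

```python
from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass
class Fulltext:
    """Stand-in result type; the field names are assumptions."""

    content: str
    created: datetime
    version: str


def retrieve(document_id):
    """Stand-in for the documented function: no network, placeholder data."""
    if "v" not in document_id:
        raise ValueError(f"Missing version tag: {document_id!r}")
    if document_id == "0000.00000v1":
        # Simulated service failure for demonstration purposes only.
        raise IOError("Fulltext endpoint unavailable")
    created = datetime(2018, 1, 1, tzinfo=timezone.utc)
    return Fulltext(content="Lorem ipsum", created=created, version="0.1")


def fulltext_or_none(document_id):
    """Return the extracted text, or None when it cannot be retrieved."""
    try:
        return retrieve(document_id).content
    except IOError:
        # Retrieval failed: treat the paper as having no fulltext available.
        return None
```

ValueError is deliberately left uncaught in fulltext_or_none, on the assumption that an invalid identifier is a programming error the caller should see rather than a condition to hide.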