arxiv.canonical.domain package

Core data structures and concepts used to describe arXiv e-prints.

The structures in this module capture the central ideas and semantics of e-prints, their versions, and their content. It does not address things like how the canonical record is encoded in a key-value system, nor mechanisms for verifying completeness or integrity.

class arxiv.canonical.domain.AllEPrints(name, years)[source]

Bases: arxiv.canonical.domain.base.CanonicalBase

Represents the complete set of announced e-prints.

class arxiv.canonical.domain.AllListings(name, years)[source]

Bases: arxiv.canonical.domain.base.CanonicalBase

All listings in the canonical record.

class arxiv.canonical.domain.CanonicalFile(modified, size_bytes, content_type, ref, filename=None, is_gzipped=False)[source]

Bases: arxiv.canonical.domain.base.CanonicalBase

Represents a file in the canonical record, e.g. a source package.

content_type = None

The content type of the file.

exclude_from_comparison = {'is_gzipped', 'ref'}
filename = None

Filename in the canonical record.

classmethod from_dict(data)[source]

Reconstitute a CanonicalFile from a native dict.

Return type

CanonicalFile

is_gzipped = None

Whether or not the content at ref is served in gzipped form.

property mime_type

Convenience accessor for the MIME type of the file.

Return type

str

modified = None

Last time the file was modified.

ref = None

A reference to the location of the content of the file.

size_bytes = None

Size of the file in bytes.

to_dict()[source]

Generate a native dict from this CanonicalFile.

Return type

Dict[str, Any]

class arxiv.canonical.domain.Category[source]

Bases: str

Represents an arXiv category.

property canonical

Get the canonicalized category, if there is one.

Return type

Category

property display

String to use in display of a category.

Examples

Earth and Planetary Astrophysics (astro-ph.EP)

Return type

str

property id

Short name for category ID.

Return type

str

lookup = {'acc-phys': {'in_archive': 'acc-phys', 'is_active': False, 'is_general': False, 'name': 'Accelerator Physics'}, 'adap-org': {'in_archive': 'adap-org', 'is_active': False, 'is_general': False, 'name': 'Adaptation, Noise, and Self-Organizing Systems'}, 'alg-geom': {'in_archive': 'alg-geom', 'is_active': False, 'is_general': False, 'name': 'Algebraic Geometry'}, 'ao-sci': {'in_archive': 'ao-sci', 'is_active': False, 'is_general': False, 'name': 'Atmospheric-Oceanic Sciences'}, 'astro-ph': {'in_archive': 'astro-ph', 'is_active': False, 'is_general': False, 'name': 'Astrophysics'}, 'astro-ph.CO': {'description': 'Phenomenology of early universe, cosmic microwave background, cosmological parameters, primordial element abundances, extragalactic distance scale, large-scale structure of the universe. Groups, superclusters, voids, intergalactic medium. Particle astrophysics: dark energy, dark matter, baryogenesis, leptogenesis, inflationary models, reheating, monopoles, WIMPs, cosmic strings, primordial black holes, cosmological gravitational radiation', 'in_archive': 'astro-ph', 'is_active': True, 'is_general': False, 'name': 'Cosmology and Nongalactic Astrophysics'}, 'astro-ph.EP': {'description': 'Interplanetary medium, planetary physics, planetary astrobiology, extrasolar planets, comets, asteroids, meteorites. Structure and formation of the solar system', 'in_archive': 'astro-ph', 'is_active': True, 'is_general': False, 'name': 'Earth and Planetary Astrophysics'}, 'astro-ph.GA': {'description': 'Phenomena pertaining to galaxies or the Milky Way. Star clusters, HII regions and planetary nebulae, the interstellar medium, atomic and molecular clouds, dust. Stellar populations. Galactic structure, formation, dynamics. Galactic nuclei, bulges, disks, halo. Active Galactic Nuclei, supermassive black holes, quasars. Gravitational lens systems. The Milky Way and its contents', 'in_archive': 'astro-ph', 'is_active': True, 'is_general': False, 'name': 'Astrophysics of Galaxies'}, 'astro-ph.HE': {'description': 'Cosmic ray production, acceleration, propagation, detection. Gamma ray astronomy and bursts, X-rays, charged particles, supernovae and other explosive phenomena, stellar remnants and accretion systems, jets, microquasars, neutron stars, pulsars, black holes', 'in_archive': 'astro-ph', 'is_active': True, 'is_general': False, 'name': 'High Energy Astrophysical Phenomena'}, 'astro-ph.IM': {'description': 'Detector and telescope design, experiment proposals. Laboratory Astrophysics. Methods for data analysis, statistical methods. Software, database design', 'in_archive': 'astro-ph', 'is_active': True, 'is_general': False, 'name': 'Instrumentation and Methods for Astrophysics'}, 'astro-ph.SR': {'description': 'White dwarfs, brown dwarfs, cataclysmic variables. Star formation and protostellar systems, stellar astrobiology, binary and multiple systems of stars, stellar evolution and structure, coronas. Central stars of planetary nebulae. Helioseismology, solar neutrinos, production and detection of gravitational radiation from stellar systems', 'in_archive': 'astro-ph', 'is_active': True, 'is_general': False, 'name': 'Solar and Stellar Astrophysics'}, 'atom-ph': {'in_archive': 'atom-ph', 'is_active': False, 'is_general': False, 'name': 'Atomic, Molecular and Optical Physics'}, 'bayes-an': {'in_archive': 'bayes-an', 'is_active': False, 'is_general': False, 'name': 'Bayesian Analysis'}, 'chao-dyn': {'in_archive': 'chao-dyn', 'is_active': False, 'is_general': False, 'name': 'Chaotic Dynamics'}, 'chem-ph': {'in_archive': 'chem-ph', 'is_active': False, 'is_general': False, 'name': 'Chemical Physics'}, 'cmp-lg': {'in_archive': 'cmp-lg', 'is_active': False, 'is_general': False, 'name': 'Computation and Language'}, 'comp-gas': {'in_archive': 'comp-gas', 'is_active': False, 'is_general': False, 'name': 'Cellular Automata and Lattice Gases'}, 'cond-mat': {'in_archive': 'cond-mat', 'is_active': False, 'is_general': False, 'name': 'Condensed Matter'}, 'cond-mat.dis-nn': {'in_archive': 'cond-mat', 'is_active': True, 'is_general': False, 'name': 'Disordered Systems and Neural Networks'}, 'cond-mat.mes-hall': {'description': 'Semiconducting nanostructures: quantum dots, wires, and wells. Single electronics, spintronics, 2d electron gases, quantum Hall effect, nanotubes, graphene, plasmonic nanostructures', 'in_archive': 'cond-mat', 'is_active': True, 'is_general': False, 'name': 'Mesoscale and Nanoscale Physics'}, 'cond-mat.mtrl-sci': {'description': 'Techniques, synthesis, characterization, structure. Structural phase transitions, mechanical properties, phonons. Defects, adsorbates, interfaces', 'in_archive': 'cond-mat', 'is_active': True, 'is_general': False, 'name': 'Materials Science'}, 'cond-mat.other': {'description': 'Work in condensed matter that does not fit into the other cond-mat classifications', 'in_archive': 'cond-mat', 'is_active': True, 'is_general': False, 'name': 'Other Condensed Matter'}, 'cond-mat.quant-gas': {'description': 'Ultracold atomic and molecular gases, Bose-Einstein condensation, Feshbach resonances, spinor condensates, optical lattices, quantum simulation with cold atoms and molecules, macroscopic interference phenomena', 'in_archive': 'cond-mat', 'is_active': True, 'is_general': False, 'name': 'Quantum Gases'}, 'cond-mat.soft': {'description': 'Membranes, polymers, liquid crystals, glasses, colloids, granular matter', 'in_archive': 'cond-mat', 'is_active': True, 'is_general': False, 'name': 'Soft Condensed Matter'}, 'cond-mat.stat-mech': {'description': 'Phase transitions, thermodynamics, field theory, non-equilibrium phenomena, renormalization group and scaling, integrable models, turbulence', 'in_archive': 'cond-mat', 'is_active': True, 'is_general': False, 'name': 'Statistical Mechanics'}, 'cond-mat.str-el': {'description': 'Quantum magnetism, non-Fermi liquids, spin liquids, quantum criticality, charge density waves, metal-insulator transitions', 'in_archive': 'cond-mat', 'is_active': True, 'is_general': False, 'name': 'Strongly Correlated Electrons'}, 'cond-mat.supr-con': {'description': 'Superconductivity: theory, models, experiment. Superflow in helium', 'in_archive': 'cond-mat', 'is_active': True, 'is_general': False, 'name': 'Superconductivity'}, 'cs.AI': {'description': 'Covers all areas of AI except Vision, Robotics, Machine Learning, Multiagent Systems, and Computation and Language (Natural Language Processing), which have separate subject areas. In particular, includes Expert Systems, Theorem Proving (although this may overlap with Logic in Computer Science), Knowledge Representation, Planning, and Uncertainty in AI. Roughly includes material in ACM Subject Classes I.2.0, I.2.1, I.2.3, I.2.4, I.2.8, and I.2.11.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Artificial Intelligence'}, 'cs.AR': {'description': 'Covers systems organization and hardware architecture. Roughly includes material in ACM Subject Classes C.0, C.1, and C.5.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Hardware Architecture'}, 'cs.CC': {'description': 'Covers models of computation, complexity classes, structural complexity, complexity tradeoffs, upper and lower bounds. Roughly includes material in ACM Subject Classes F.1 (computation by abstract devices), F.2.3 (tradeoffs among complexity measures), and F.4.3 (formal languages), although some material in formal languages may be more appropriate for Logic in Computer Science. Some material in F.2.1 and F.2.2, may also be appropriate here, but is more likely to have Data Structures and Algorithms as the primary subject area.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Computational Complexity'}, 'cs.CE': {'description': 'Covers applications of computer science to the mathematical modeling of complex systems in the fields of science, engineering, and finance. Papers here are interdisciplinary and applications-oriented, focusing on techniques and tools that enable challenging computational simulations to be performed, for which the use of supercomputers or distributed computing platforms is often required. Includes material in ACM Subject Classes J.2, J.3, and J.4 (economics).', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Computational Engineering, Finance, and Science'}, 'cs.CG': {'description': 'Roughly includes material in ACM Subject Classes I.3.5 and F.2.2.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Computational Geometry'}, 'cs.CL': {'description': 'Covers natural language processing. Roughly includes material in ACM Subject Class I.2.7. Note that work on artificial languages (programming languages, logics, formal systems) that does not explicitly address natural-language issues broadly construed (natural-language processing, computational linguistics, speech, text retrieval, etc.) is not appropriate for this area.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Computation and Language'}, 'cs.CR': {'description': 'Covers all areas of cryptography and security including authentication, public key cryptosytems, proof-carrying code, etc. Roughly includes material in ACM Subject Classes D.4.6 and E.3.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Cryptography and Security'}, 'cs.CV': {'description': 'Covers image processing, computer vision, pattern recognition, and scene understanding. Roughly includes material in ACM Subject Classes I.2.10, I.4, and I.5.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Computer Vision and Pattern Recognition'}, 'cs.CY': {'description': 'Covers impact of computers on society, computer ethics, information technology and public policy, legal aspects of computing, computers and education. Roughly includes material in ACM Subject Classes K.0, K.2, K.3, K.4, K.5, and K.7.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Computers and Society'}, 'cs.DB': {'description': 'Covers database management, datamining, and data processing. Roughly includes material in ACM Subject Classes E.2, E.5, H.0, H.2, and J.1.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Databases'}, 'cs.DC': {'description': 'Covers fault-tolerance, distributed algorithms, stabilility, parallel computation, and cluster computing. Roughly includes material in ACM Subject Classes C.1.2, C.1.4, C.2.4, D.1.3, D.4.5, D.4.7, E.1.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Distributed, Parallel, and Cluster Computing'}, 'cs.DL': {'description': 'Covers all aspects of the digital library design and document and text creation. Note that there will be some overlap with Information Retrieval (which is a separate subject area). Roughly includes material in ACM Subject Classes H.3.5, H.3.6, H.3.7, I.7.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Digital Libraries'}, 'cs.DM': {'description': 'Covers combinatorics, graph theory, applications of probability. Roughly includes material in ACM Subject Classes G.2 and G.3.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Discrete Mathematics'}, 'cs.DS': {'description': 'Covers data structures and analysis of algorithms. Roughly includes material in ACM Subject Classes E.1, E.2, F.2.1, and F.2.2.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Data Structures and Algorithms'}, 'cs.ET': {'description': 'Covers approaches to information processing (computing, communication, sensing) and bio-chemical analysis based on alternatives to silicon CMOS-based technologies, such as nanoscale electronic, photonic, spin-based, superconducting, mechanical, bio-chemical and quantum technologies (this list is not exclusive). Topics of interest include (1) building blocks for emerging technologies, their scalability and adoption in larger systems, including integration with traditional technologies, (2) modeling, design and optimization of novel devices and systems, (3) models of computation, algorithm design and programming for emerging technologies.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Emerging Technologies'}, 'cs.FL': {'description': 'Covers automata theory, formal language theory, grammars, and combinatorics on words. This roughly corresponds to ACM Subject Classes F.1.1, and F.4.3. Papers dealing with computational complexity should go to cs.CC; papers dealing with logic should go to cs.LO.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Formal Languages and Automata Theory'}, 'cs.GL': {'description': 'Covers introductory material, survey material, predictions of future trends, biographies, and miscellaneous computer-science related material. Roughly includes all of ACM Subject Class A, except it does not include conference proceedings (which will be listed in the appropriate subject area).', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'General Literature'}, 'cs.GR': {'description': 'Covers all aspects of computer graphics. Roughly includes material in all of ACM Subject Class I.3, except that I.3.5 is is likely to have Computational Geometry as the primary subject area.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Graphics'}, 'cs.GT': {'description': 'Covers all theoretical and applied aspects at the intersection of computer science and game theory, including work in mechanism design, learning in games (which may overlap with Learning), foundations of agent modeling in games (which may overlap with Multiagent systems), coordination, specification and formal methods for non-cooperative computational environments. The area also deals with applications of game theory to areas such as electronic commerce.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Computer Science and Game Theory'}, 'cs.HC': {'description': 'Covers human factors, user interfaces, and collaborative computing. Roughly includes material in ACM Subject Classes H.1.2 and all of H.5, except for H.5.1, which is more likely to have Multimedia as the primary subject area.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Human-Computer Interaction'}, 'cs.IR': {'description': 'Covers indexing, dictionaries, retrieval, content and analysis. Roughly includes material in ACM Subject Classes H.3.0, H.3.1, H.3.2, H.3.3, and H.3.4.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Information Retrieval'}, 'cs.IT': {'description': 'Covers theoretical and experimental aspects of information theory and coding. Includes material in ACM Subject Class E.4 and intersects with H.1.1.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Information Theory'}, 'cs.LG': {'description': 'Covers machine learning and computational (PAC) learning. Roughly includes material in ACM Subject Class I.2.6.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Machine Learning'}, 'cs.LO': {'description': 'Covers all aspects of logic in computer science, including finite model theory, logics of programs, modal logic, and program verification. Programming language semantics should have Programming Languages as the primary subject area. Roughly includes material in ACM Subject Classes D.2.4, F.3.1, F.4.0, F.4.1, and F.4.2; some material in F.4.3 (formal languages) may also be appropriate here, although Computational Complexity is typically the more appropriate subject area.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Logic in Computer Science'}, 'cs.MA': {'description': 'Covers multiagent systems, distributed artificial intelligence, intelligent agents, coordinated interactions. and practical applications. Roughly covers ACM Subject Class I.2.11.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Multiagent Systems'}, 'cs.MM': {'description': 'Roughly includes material in ACM Subject Class H.5.1.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Multimedia'}, 'cs.MS': {'description': 'Roughly includes material in ACM Subject Class G.4.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Mathematical Software'}, 'cs.NA': {'description': 'cs.NA is an alias for math.NA. Roughly includes material in ACM Subject Class G.1.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Numerical Analysis'}, 'cs.NE': {'description': 'Covers neural networks, connectionism, genetic algorithms, artificial life, adaptive behavior. Roughly includes some material in ACM Subject Class C.1.3, I.2.6, I.5.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Neural and Evolutionary Computing'}, 'cs.NI': {'description': 'Covers all aspects of computer communication networks, including network architecture and design, network protocols, and internetwork standards (like TCP/IP). Also includes topics, such as web caching, that are directly relevant to Internet architecture and performance. Roughly includes all of ACM Subject Class C.2 except C.2.4, which is more likely to have Distributed, Parallel, and Cluster Computing as the primary subject area.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Networking and Internet Architecture'}, 'cs.OH': {'description': 'This is the classification to use for documents that do not fit anywhere else.', 'in_archive': 'cs', 'is_active': True, 'is_general': True, 'name': 'Other Computer Science'}, 'cs.OS': {'description': 'Roughly includes material in ACM Subject Classes D.4.1, D.4.2., D.4.3, D.4.4, D.4.5, D.4.7, and D.4.9.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Operating Systems'}, 'cs.PF': {'description': 'Covers performance measurement and evaluation, queueing, and simulation. Roughly includes material in ACM Subject Classes D.4.8 and K.6.2.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Performance'}, 'cs.PL': {'description': 'Covers programming language semantics, language features, programming approaches (such as object-oriented programming, functional programming, logic programming). Also includes material on compilers oriented towards programming languages; other material on compilers may be more appropriate in Architecture (AR). Roughly includes material in ACM Subject Classes D.1 and D.3.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Programming Languages'}, 'cs.RO': {'description': 'Roughly includes material in ACM Subject Class I.2.9.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Robotics'}, 'cs.SC': {'description': 'Roughly includes material in ACM Subject Class I.1.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Symbolic Computation'}, 'cs.SD': {'description': 'Covers all aspects of computing with sound, and sound as an information channel. Includes models of sound, analysis and synthesis, audio user interfaces, sonification of data, computer music, and sound signal processing. Includes ACM Subject Class H.5.5, and intersects with H.1.2, H.5.1, H.5.2, I.2.7, I.5.4, I.6.3, J.5, K.4.2.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Sound'}, 'cs.SE': {'description': 'Covers design tools, software metrics, testing and debugging, programming environments, etc. Roughly includes material in all of ACM Subject Classes D.2, except that D.2.4 (program verification) should probably have Logics in Computer Science as the primary subject area.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Software Engineering'}, 'cs.SI': {'description': 'Covers the design, analysis, and modeling of social and information networks, including their applications for on-line information access, communication, and interaction, and their roles as datasets in the exploration of questions in these and other domains, including connections to the social and biological sciences. Analysis and modeling of such networks includes topics in ACM Subject classes F.2, G.2, G.3, H.2, and I.2; applications in computing include topics in H.3, H.4, and H.5; and applications at the interface of computing and other disciplines include topics in J.1--J.7. Papers on computer communication systems and network protocols (e.g. TCP/IP) are generally a closer fit to the Networking and Internet Architecture (cs.NI) category.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Social and Information Networks'}, 'cs.SY': {'description': 'cs.SY is an alias for eess.SY. This section includes theoretical and experimental research covering all facets of automatic control systems. The section is focused on methods of control system analysis and design using tools of modeling, simulation and optimization. Specific areas of research include nonlinear, distributed, adaptive, stochastic and robust control in addition to hybrid and discrete event systems. Application areas include automotive and aerospace control systems, network control, biological systems, multiagent and cooperative control, robotics, reinforcement learning, sensor networks, control of cyber-physical and energy-related systems, and control of computing systems.', 'in_archive': 'cs', 'is_active': True, 'is_general': False, 'name': 'Systems and Control'}, 'dg-ga': {'in_archive': 'dg-ga', 'is_active': False, 'is_general': False, 'name': 'Differential Geometry'}, 'econ.EM': {'description': 'Econometric Theory, Micro-Econometrics, Macro-Econometrics, Empirical Content of Economic Relations discovered via New Methods, Methodological Aspects of the Application of Statistical Inference to Economic Data.', 'in_archive': 'econ', 'is_active': True, 'is_general': False, 'name': 'Econometrics'}, 'econ.GN': {'description': 'General methodological and applied contributions to economics.', 'in_archive': 'econ', 'is_active': True, 'is_general': False, 'name': 'General Economics'}, 'econ.TH': {'description': 'Includes theoretical contributions to Contract Theory, Decision Theory, Game Theory, General Equilibrium, Growth, Learning and Evolution, Macroeconomics, Market and Mechanism Design, and Social Choice.', 'in_archive': 'econ', 'is_active': True, 'is_general': False, 'name': 'Theoretical Economics'}, 'eess.AS': {'description': 'Theory and methods for processing signals representing audio, speech, and language, and their applications. This includes analysis, synthesis, enhancement, transformation, classification and interpretation of such signals as well as the design, development, and evaluation of associated signal processing systems. Machine learning and pattern analysis applied to any of the above areas is also welcome. Specific topics of interest include: auditory modeling and hearing aids; acoustic beamforming and source localization; classification of acoustic scenes; speaker separation; active noise control and echo cancellation; enhancement; de-reverberation; bioacoustics; music signals analysis, synthesis and modification; music information retrieval; audio for multimedia and joint audio-video processing; spoken and written language modeling, segmentation, tagging, parsing, understanding, and translation; text mining; speech production, perception, and psychoacoustics; speech analysis, synthesis, and perceptual modeling and coding; robust speech recognition; speaker recognition and characterization; deep learning, online learning, and graphical models applied to speech, audio, and language signals; and implementation aspects ranging from system architecture to fast algorithms.', 'in_archive': 'eess', 'is_active': True, 'is_general': False, 'name': 'Audio and Speech Processing'}, 'eess.IV': {'description': 'Theory, algorithms, and architectures for the formation, capture, processing, communication, analysis, and display of images, video, and multidimensional signals in a wide variety of applications. Topics of interest include: mathematical, statistical, and perceptual image and video modeling and representation; linear and nonlinear filtering, de-blurring, enhancement, restoration, and reconstruction from degraded, low-resolution or tomographic data; lossless and lossy compression and coding; segmentation, alignment, and recognition; image rendering, visualization, and printing; computational imaging, including ultrasound, tomographic and magnetic resonance imaging; and image and video analysis, synthesis, storage, search and retrieval.', 'in_archive': 'eess', 'is_active': True, 'is_general': False, 'name': 'Image and Video Processing'}, 'eess.SP': {'description': 'Theory, algorithms, performance analysis and applications of signal and data analysis, including physical modeling, processing, detection and parameter estimation, learning, mining, retrieval, and information extraction. The term "signal" includes speech, audio, sonar, radar, geophysical, physiological, (bio-) medical, image, video, and multimodal natural and man-made signals, including communication signals and data. Topics of interest include: statistical signal processing, spectral estimation and system identification; filter design, adaptive filtering / stochastic learning; (compressive) sampling, sensing, and transform-domain methods including fast algorithms; signal processing for machine learning and machine learning for signal processing applications; in-network and graph signal processing; convex and nonconvex optimization methods for signal processing applications; radar, sonar, and sensor array beamforming and direction finding; communications signal processing; low power, multi-core and system-on-chip signal processing; sensing, communication, analysis and optimization for cyber-physical systems such as power grids and the Internet of Things.', 'in_archive': 'eess', 'is_active': True, 'is_general': False, 'name': 'Signal Processing'}, 'eess.SY': {'description': 'This section includes theoretical and experimental research covering all facets of automatic control systems. The section is focused on methods of control system analysis and design using tools of modeling, simulation and optimization. Specific areas of research include nonlinear, distributed, adaptive, stochastic and robust control in addition to hybrid and discrete event systems. Application areas include automotive and aerospace control systems, network control, biological systems, multiagent and cooperative control, robotics, reinforcement learning, sensor networks, control of cyber-physical and energy-related systems, and control of computing systems.', 'in_archive': 'eess', 'is_active': True, 'is_general': False, 'name': 'Systems and Control'}, 'funct-an': {'in_archive': 'funct-an', 'is_active': False, 'is_general': False, 'name': 'Functional Analysis'}, 'gr-qc': {'in_archive': 'gr-qc', 'is_active': True, 'is_general': False, 'name': 'General Relativity and Quantum Cosmology'}, 'hep-ex': {'in_archive': 'hep-ex', 'is_active': True, 'is_general': False, 'name': 'High Energy Physics - Experiment'}, 'hep-lat': {'in_archive': 'hep-lat', 'is_active': True, 'is_general': False, 'name': 'High Energy Physics - Lattice'}, 'hep-ph': {'in_archive': 'hep-ph', 'is_active': True, 'is_general': False, 'name': 'High Energy Physics - Phenomenology'}, 'hep-th': {'in_archive': 'hep-th', 'is_active': True, 'is_general': False, 'name': 'High Energy Physics - Theory'}, 'math-ph': {'in_archive': 'math-ph', 'is_active': True, 'is_general': False, 'name': 'Mathematical Physics'}, 'math.AC': {'description': 'Commutative rings, modules, ideals, homological algebra, computational aspects, invariant theory, connections to algebraic geometry and combinatorics', 'in_archive': 'math', 'is_active': True, 'is_general': False, 'name': 'Commutative Algebra'}, 'math.AG': {'description': 'Algebraic varieties, stacks, sheaves, schemes, moduli spaces, complex geometry, quantum cohomology', 'in_archive': 'math', 'is_active': True, 'is_general': False, 'name': 'Algebraic Geometry'}, 'math.AP': {'description': "Existence and uniqueness, boundary conditions, linear and non-linear operators, stability, soliton theory, integrable PDE's, conservation laws, qualitative dynamics", 'in_archive': 'math', 'is_active': True, 'is_general': False, 'name': 'Analysis of PDEs'}, 'math.AT': {'description': 'Homotopy theory, homological algebra, algebraic treatments of manifolds', 'in_archive': 'math', 'is_active': True, 'is_general': False, 'name': 'Algebraic Topology'}, 'math.CA': {'description': "Special functions, orthogonal polynomials, harmonic analysis, ODE's, differential relations, calculus of variations, approximations, expansions, asymptotics", 'in_archive': 'math', 'is_active': True, 'is_general': False, 'name': 'Classical Analysis and ODEs'}, 'math.CO': {'description': 'Discrete mathematics, graph theory, enumeration, combinatorial optimization, Ramsey theory, combinatorial game theory', 'in_archive': 'math', 'is_active': True, 'is_general': False, 'name': 'Combinatorics'}, 'math.CT': {'description': 'Enriched categories, topoi, abelian categories, monoidal categories, homological algebra', 'in_archive': 'math', 'is_active': True, 'is_general': False, 'name': 'Category Theory'}, 'math.CV': {'description': 'Holomorphic functions, automorphic group actions and forms, pseudoconvexity, complex geometry, analytic spaces, analytic sheaves', 'in_archive': 'math', 'is_active': True, 'is_general': False, 'name': 'Complex Variables'}, 'math.DG': {'description': 'Complex, contact, Riemannian, pseudo-Riemannian and Finsler geometry, relativity, gauge theory, global analysis', 'in_archive': 'math', 'is_active': True, 'is_general': False, 'name': 'Differential Geometry'}, 'math.DS': {'description': 'Dynamics of differential equations and flows, mechanics, classical few-body problems, iterations, complex dynamics, delayed differential equations', 'in_archive': 'math', 'is_active': True, 'is_general': False, 'name': 'Dynamical Systems'}, 'math.FA': {'description': 'Banach spaces, function spaces, real functions, integral transforms, theory of distributions, measure theory', 'in_archive': 'math', 'is_active': True, 'is_general': False, 'name': 'Functional Analysis'}, 'math.GM': {'description': 'Mathematical material of general interest, topics not covered elsewhere', 'in_archive': 'math', 'is_active': True, 'is_general': True, 'name': 'General Mathematics'}, 'math.GN': {'description': 'Continuum theory, point-set topology, spaces with algebraic structure, foundations, dimension theory, local and global properties', 'in_archive': 'math', 'is_active': True, 'is_general': False, 'name': 'General Topology'}, 'math.GR': {'description': 'Finite groups, topological groups, representation theory, cohomology, classification and structure', 'in_archive': 'math', 'is_active': True, 'is_general': False, 'name': 'Group Theory'}, 'math.GT': {'description': 'Manifolds, orbifolds, polyhedra, cell complexes, foliations, geometric structures', 'in_archive': 'math', 'is_active': True, 'is_general': False, 'name': 'Geometric Topology'}, 'math.HO': {'description': 'Biographies, philosophy of mathematics, mathematics education, recreational mathematics, communication of mathematics', 'in_archive': 'math', 'is_active': True, 'is_general': False, 'name': 'History and Overview'}, 'math.IT': {'description': 'math.IT is an alias for cs.IT. Covers theoretical and experimental aspects of information theory and coding.', 'in_archive': 'math', 'is_active': True, 'is_general': False, 'name': 'Information Theory'}, 'math.KT': {'description': 'Algebraic and topological K-theory, relations with topology, commutative algebra, and operator algebras', 'in_archive': 'math', 'is_active': True, 'is_general': False, 'name': 'K-Theory and Homology'}, 'math.LO': {'description': 'Logic, set theory, point-set topology, formal mathematics', 'in_archive': 'math', 'is_active': True, 'is_general': False, 'name': 'Logic'}, 'math.MG': {'description': 'Euclidean, hyperbolic, discrete, convex, coarse geometry, comparisons in Riemannian geometry, symmetric spaces', 'in_archive': 'math', 'is_active': True, 'is_general': False, 'name': 'Metric Geometry'}, 'math.MP': {'description': 'math.MP is an alias for math-ph. Mathematical methods in quantum field theory, quantum mechanics, statistical mechanics, condensed matter, nuclear and atomic physics.', 'in_archive': 'math', 'is_active': True, 'is_general': False, 'name': 'Mathematical Physics'}, 'math.NA': {'description': 'Numerical algorithms for problems in analysis and algebra, scientific computation', 'in_archive': 'math', 'is_active': True, 'is_general': False, 'name': 'Numerical Analysis'}, 'math.NT': {'description': 'Prime numbers, diophantine equations, analytic number theory, algebraic number theory, arithmetic geometry, Galois theory', 'in_archive': 'math', 'is_active': True, 'is_general': False, 'name': 'Number Theory'}, 'math.OA': {'description': 'Algebras of operators on Hilbert space, C^*-algebras, von Neumann algebras, non-commutative geometry', 'in_archive': 'math', 'is_active': True, 'is_general': False, 'name': 'Operator Algebras'}, 'math.OC': {'description': 'Operations research, linear programming, control theory, systems theory, optimal control, game theory', 'in_archive': 'math', 'is_active': True, 'is_general': False, 'name': 'Optimization and Control'}, 'math.PR': {'description': 'Theory and applications of probability and stochastic processes: e.g. central limit theorems, large deviations, stochastic differential equations, models from statistical mechanics, queuing theory', 'in_archive': 'math', 'is_active': True, 'is_general': False, 'name': 'Probability'}, 'math.QA': {'description': 'Quantum groups, skein theories, operadic and diagrammatic algebra, quantum field theory', 'in_archive': 'math', 'is_active': True, 'is_general': False, 'name': 'Quantum Algebra'}, 'math.RA': {'description': 'Non-commutative rings and algebras, non-associative algebras, universal algebra and lattice theory, linear algebra, semigroups', 'in_archive': 'math', 'is_active': True, 'is_general': False, 'name': 'Rings and Algebras'}, 'math.RT': {'description': 'Linear representations of algebras and groups, Lie theory, associative algebras, multilinear algebra', 'in_archive': 'math', 'is_active': True, 'is_general': False, 'name': 'Representation Theory'}, 'math.SG': {'description': 'Hamiltonian systems, symplectic flows, classical integrable systems', 'in_archive': 'math', 'is_active': True, 'is_general': False, 'name': 'Symplectic Geometry'}, 'math.SP': {'description': 'Schrodinger operators, operators on manifolds, general differential operators, numerical studies, integral operators, discrete models, resonances, non-self-adjoint operators, random operators/matrices', 'in_archive': 'math', 'is_active': True, 'is_general': False, 'name': 'Spectral Theory'}, 'math.ST': {'description': 'Applied, computational and theoretical statistics: e.g. statistical inference, regression, time series, multivariate analysis, data analysis, Markov chain Monte Carlo, design of experiments, case studies', 'in_archive': 'math', 'is_active': True, 'is_general': False, 'name': 'Statistics Theory'}, 'mtrl-th': {'in_archive': 'mtrl-th', 'is_active': False, 'is_general': False, 'name': 'Materials Theory'}, 'nlin.AO': {'description': 'adaptation, self-organizing systems, statistical physics, fluctuating systems, stochastic processes, interacting particle systems, machine learning', 'in_archive': 'nlin', 'is_active': True, 'is_general': False, 'name': 'Adaptation and Self-Organizing Systems'}, 'nlin.CD': {'description': 'dynamical systems, chaos, quantum chaos, topological dynamics, cycle expansions, turbulence, propagation', 'in_archive': 'nlin', 'is_active': True, 'is_general': False, 'name': 'Chaotic Dynamics'}, 'nlin.CG': {'description': 'computational methods, time series analysis, signal processing, wavelets, lattice gases', 'in_archive': 'nlin', 'is_active': True, 'is_general': False, 'name': 'Cellular Automata and Lattice Gases'}, 'nlin.PS': {'description': 'pattern formation, coherent structures, solitons', 'in_archive': 'nlin', 'is_active': True, 'is_general': False, 'name': 'Pattern Formation and Solitons'}, 'nlin.SI': {'description': 'exactly solvable systems, integrable PDEs, integrable ODEs, Painleve analysis, integrable discrete maps, solvable lattice models, integrable quantum systems', 'in_archive': 'nlin', 'is_active': True, 'is_general': False, 'name': 'Exactly Solvable and Integrable Systems'}, 'nucl-ex': {'in_archive': 'nucl-ex', 'is_active': True, 'is_general': False, 'name': 'Nuclear Experiment'}, 'nucl-th': {'in_archive': 'nucl-th', 'is_active': True, 'is_general': False, 'name': 'Nuclear Theory'}, 'patt-sol': {'in_archive': 'patt-sol', 'is_active': False, 'is_general': False, 'name': 'Pattern Formation and Solitons'}, 'physics.acc-ph': {'in_archive': 'physics', 'is_active': True, 'is_general': False, 'name': 'Accelerator Physics'}, 'physics.ao-ph': {'in_archive': 'physics', 'is_active': True, 'is_general': False, 'name': 'Atmospheric and Oceanic Physics'}, 'physics.app-ph': {'in_archive': 'physics', 'is_active': True, 'is_general': False, 'name': 'Applied Physics'}, 'physics.atm-clus': {'in_archive': 'physics', 'is_active': True, 'is_general': False, 'name': 'Atomic and Molecular Clusters'}, 'physics.atom-ph': {'in_archive': 'physics', 'is_active': True, 'is_general': False, 'name': 'Atomic Physics'}, 'physics.bio-ph': {'in_archive': 'physics', 'is_active': True, 'is_general': False, 'name': 'Biological Physics'}, 'physics.chem-ph': {'in_archive': 'physics', 'is_active': True, 'is_general': False, 'name': 'Chemical Physics'}, 'physics.class-ph': {'in_archive': 'physics', 'is_active': True, 'is_general': False, 'name': 'Classical Physics'}, 'physics.comp-ph': {'in_archive': 'physics', 'is_active': True, 'is_general': False, 'name': 'Computational Physics'}, 'physics.data-an': {'in_archive': 'physics', 'is_active': True, 'is_general': False, 'name': 'Data Analysis, Statistics and Probability'}, 'physics.ed-ph': {'in_archive': 'physics', 'is_active': True, 'is_general': False, 'name': 'Physics Education'}, 'physics.flu-dyn': {'in_archive': 'physics', 'is_active': True, 'is_general': False, 'name': 'Fluid Dynamics'}, 'physics.gen-ph': {'in_archive': 'physics', 'is_active': True, 'is_general': True, 'name': 'General Physics'}, 'physics.geo-ph': {'in_archive': 'physics', 'is_active': True, 'is_general': False, 'name': 'Geophysics'}, 'physics.hist-ph': {'in_archive': 'physics', 'is_active': True, 'is_general': False, 'name': 'History and Philosophy of Physics'}, 'physics.ins-det': {'in_archive': 'physics', 'is_active': True, 'is_general': False, 'name': 'Instrumentation and Detectors'}, 'physics.med-ph': {'in_archive': 'physics', 'is_active': True, 'is_general': False, 'name': 'Medical Physics'}, 'physics.optics': {'in_archive': 'physics', 'is_active': True, 'is_general': False, 'name': 'Optics'}, 'physics.plasm-ph': {'in_archive': 'physics', 'is_active': True, 'is_general': False, 'name': 'Plasma Physics'}, 'physics.pop-ph': {'in_archive': 'physics', 'is_active': True, 'is_general': False, 'name': 'Popular Physics'}, 'physics.soc-ph': {'in_archive': 'physics', 'is_active': True, 'is_general': False, 'name': 'Physics and Society'}, 'physics.space-ph': {'in_archive': 'physics', 'is_active': True, 'is_general': False, 'name': 'Space Physics'}, 'plasm-ph': {'in_archive': 'plasm-ph', 'is_active': False, 'is_general': False, 'name': 'Plasma Physics'}, 'q-alg': {'in_archive': 'q-alg', 'is_active': False, 'is_general': False, 'name': 'Quantum Algebra and Topology'}, 'q-bio': {'in_archive': 'q-bio', 'is_active': False, 'is_general': False, 'name': 'Quantitative Biology'}, 'q-bio.BM': {'description': 'DNA, RNA, proteins, lipids, etc.; molecular structures and folding kinetics; molecular interactions; single-molecule manipulation.', 'in_archive': 'q-bio', 'is_active': True, 'is_general': False, 'name': 'Biomolecules'}, 'q-bio.CB': {'description': 'Cell-cell signaling and interaction; morphogenesis and development; apoptosis; bacterial conjugation; viral-host interaction; immunology', 'in_archive': 'q-bio', 'is_active': True, 'is_general': False, 'name': 'Cell Behavior'}, 'q-bio.GN': {'description': 'DNA sequencing and assembly; gene and motif finding; RNA editing and alternative splicing; genomic structure and processes (replication, transcription, methylation, etc); mutational processes.', 'in_archive': 'q-bio', 'is_active': True, 'is_general': False, 'name': 'Genomics'}, 'q-bio.MN': {'description': 'Gene regulation, signal transduction, proteomics, metabolomics, gene and enzymatic networks', 'in_archive': 'q-bio', 'is_active': True, 'is_general': False, 'name': 'Molecular Networks'}, 'q-bio.NC': {'description': 'Synapse, cortex, neuronal dynamics, neural network, sensorimotor control, behavior, attention', 'in_archive': 'q-bio', 'is_active': True, 'is_general': False, 'name': 'Neurons and Cognition'}, 'q-bio.OT': {'description': 'Work in quantitative biology that does not fit into the other q-bio classifications', 'in_archive': 'q-bio', 'is_active': True, 'is_general': True, 'name': 'Other Quantitative Biology'}, 'q-bio.PE': {'description': 'Population dynamics, spatio-temporal and epidemiological models, dynamic speciation, co-evolution, biodiversity, foodwebs, aging; molecular evolution and phylogeny; directed evolution; origin of life', 'in_archive': 'q-bio', 'is_active': True, 'is_general': False, 'name': 'Populations and Evolution'}, 'q-bio.QM': {'description': 'All experimental, numerical, statistical and mathematical contributions of value to biology', 'in_archive': 'q-bio', 'is_active': True, 'is_general': False, 'name': 'Quantitative Methods'}, 'q-bio.SC': {'description': 'Assembly and control of subcellular structures (channels, organelles, cytoskeletons, capsules, etc.); molecular motors, transport, subcellular localization; mitosis and meiosis', 'in_archive': 'q-bio', 'is_active': True, 'is_general': False, 'name': 'Subcellular Processes'}, 'q-bio.TO': {'description': 'Blood flow in vessels, biomechanics of bones, electrical waves, endocrine system, tumor growth', 'in_archive': 'q-bio', 'is_active': True, 'is_general': False, 'name': 'Tissues and Organs'}, 'q-fin.CP': {'description': 'Computational methods, including Monte Carlo, PDE, lattice and other numerical methods with applications to financial modeling', 'in_archive': 'q-fin', 'is_active': True, 'is_general': False, 'name': 'Computational Finance'}, 'q-fin.EC': {'description': 'q-fin.EC is an alias for econ.GN. Economics, including micro and macro economics, international economics, theory of the firm, labor economics, and other economic topics outside finance', 'in_archive': 'q-fin', 'is_active': True, 'is_general': False, 'name': 'Economics'}, 'q-fin.GN': {'description': 'Development of general quantitative methodologies with applications in finance', 'in_archive': 'q-fin', 'is_active': True, 'is_general': False, 'name': 'General Finance'}, 'q-fin.MF': {'description': 'Mathematical and analytical methods of finance, including stochastic, probabilistic and functional analysis, algebraic, geometric and other methods', 'in_archive': 'q-fin', 'is_active': True, 'is_general': False, 'name': 'Mathematical Finance'}, 'q-fin.PM': {'description': 'Security selection and optimization, capital allocation, investment strategies and performance measurement', 'in_archive': 'q-fin', 'is_active': True, 'is_general': False, 'name': 'Portfolio Management'}, 'q-fin.PR': {'description': 'Valuation and hedging of financial securities, their derivatives, and structured products', 'in_archive': 'q-fin', 'is_active': True, 'is_general': False, 'name': 'Pricing of Securities'}, 'q-fin.RM': {'description': 'Measurement and management of financial risks in trading, banking, insurance, corporate and other applications', 'in_archive': 'q-fin', 'is_active': True, 'is_general': False, 'name': 'Risk Management'}, 'q-fin.ST': {'description': 'Statistical, econometric and econophysics analyses with applications to financial markets and economic data', 'in_archive': 'q-fin', 'is_active': True, 'is_general': False, 'name': 'Statistical Finance'}, 'q-fin.TR': {'description': 'Market microstructure, liquidity, exchange and auction design, automated trading, agent-based modeling and market-making', 'in_archive': 'q-fin', 'is_active': True, 'is_general': False, 'name': 'Trading and Market Microstructure'}, 'quant-ph': {'in_archive': 'quant-ph', 'is_active': True, 'is_general': False, 'name': 'Quantum Physics'}, 'solv-int': {'in_archive': 'solv-int', 'is_active': False, 'is_general': False, 'name': 'Exactly Solvable and Integrable Systems'}, 'stat.AP': {'description': 'Biology, Education, Epidemiology, Engineering, Environmental Sciences, Medical, Physical Sciences, Quality Control, Social Sciences', 'in_archive': 'stat', 'is_active': True, 'is_general': False, 'name': 'Applications'}, 'stat.CO': {'description': 'Algorithms, Simulation, Visualization', 'in_archive': 'stat', 'is_active': True, 'is_general': False, 'name': 'Computation'}, 'stat.ME': {'description': 'Design, Surveys, Model Selection, Multiple Testing, Multivariate Methods, Signal and Image Processing, Time Series, Smoothing, Spatial Statistics, Survival Analysis, Nonparametric and Semiparametric Methods', 'in_archive': 'stat', 'is_active': True, 'is_general': False, 'name': 'Methodology'}, 'stat.ML': {'description': 'Classification, Graphical Models, High Dimensional Inference', 'in_archive': 'stat', 'is_active': True, 'is_general': False, 'name': 'Machine Learning'}, 'stat.OT': {'description': 'Work in statistics that does not fit into the other stat classifications', 'in_archive': 'stat', 'is_active': True, 'is_general': False, 'name': 'Other Statistics'}, 'stat.TH': {'description': 'stat.TH is an alias for math.ST. Asymptotics, Bayesian Inference, Decision Theory, Estimation, Foundations, Inference, Testing.', 'in_archive': 'stat', 'is_active': True, 'is_general': False, 'name': 'Statistics Theory'}, 'supr-con': {'in_archive': 'supr-con', 'is_active': False, 'is_general': False, 'name': 'Superconductivity'}, 'test': {'in_archive': 'test', 'is_active': False, 'is_general': False, 'name': 'Test'}, 'test.dis-nn': {'in_archive': 'test', 'is_active': False, 'is_general': False, 'name': 'Test Disruptive Networks'}, 'test.mes-hall': {'in_archive': 'test', 'is_active': False, 'is_general': False, 'name': 'Test Hall'}, 'test.mtrl-sci': {'in_archive': 'test', 'is_active': False, 'is_general': False, 'name': 'Test Mtrl-Sci'}, 'test.soft': {'in_archive': 'test', 'is_active': False, 'is_general': False, 'name': 'Test Soft'}, 'test.stat-mech': {'in_archive': 'test', 'is_active': False, 'is_general': False, 'name': 'Test Mechanics'}, 'test.str-el': {'in_archive': 'test', 'is_active': False, 'is_general': False, 'name': 'Test Electrons'}, 'test.supr-con': {'in_archive': 'test', 'is_active': False, 'is_general': False, 'name': 'Test Superconductivity'}}
property name

Get the full category name.

Return type

str

unalias()[source]

Follow any EQUIV or SUBSUMED to get the current category.

Return type

Category

class arxiv.canonical.domain.CanonicalBase[source]

Bases: object

Base class for all canonical domain classes.

exclude_from_comparison = {}

Names of attributes not to be used in __eq__ comparisons.

class arxiv.canonical.domain.CanonicalBaseCollection[source]

Bases: arxiv.canonical.domain.base.CanonicalBase

Base class for domain classes that act as collections.

class arxiv.canonical.domain.ContentType[source]

Bases: enum.Enum

Characterization of the content type of an individual bitstream.

abs = 'abs'
dvi = 'dvi'
property ext

The preferred filename extension for this ContentType.

Return type

str

from_filename(filename) = <bound method ContentType.from_filename of <enum 'ContentType'>>[source]
from_mimetype(mime) = <bound method ContentType.from_mimetype of <enum 'ContentType'>>[source]
html = 'html'
json = 'json'
make_filename(identifier, is_gzipped=False)[source]

Make a filename for a bitstream with this ContentType.

Return type

str

property mime_type

The MIME content type for this ContentType.

Return type

str

pdf = 'pdf'
ps = 'ps'
tar = 'tar'
tex = 'tex'
class arxiv.canonical.domain.EPrint(identifier, versions)[source]

Bases: arxiv.canonical.domain.base.CanonicalBase

Core concept of an e-print in the canonical record.

An e-print is a collection of one or more sequential :class:`.Version`s, generally representing projections of a single scholarly work over time.

E-prints are identified by :class:`.Identifier`s.

property announced_date

Date on which the first version of this e-print was announced.

Return type

Optional[date]

property is_withdrawn

Indicate whether this e-print has been withdrawn.

Return type

bool

property size_kilobytes

Indicate the size of the current version of this e-print in kb.

Return type

int

class arxiv.canonical.domain.EPrintDay(date, eprints)[source]

Bases: arxiv.canonical.domain.base.CanonicalBase

E-prints originally announced on a specific day.

class arxiv.canonical.domain.EPrintMonth(name, days)[source]

Bases: arxiv.canonical.domain.base.CanonicalBase

E-prints originally announced in a particular calendar month.

property month

int

Type

rtype

property year

int

Type

rtype

class arxiv.canonical.domain.EPrintYear(year, months)[source]

Bases: arxiv.canonical.domain.base.CanonicalBase

E-prints originally announced in a particular calendar year.

class arxiv.canonical.domain.Event(identifier, event_date, event_type, version, categories=None, description='', is_legacy=False, event_agent=None)[source]

Bases: arxiv.canonical.domain.version._EventBase

An announcement-related event.

property event_id

The unique identifier for this event.

Return type

EventIdentifier

classmethod from_dict(data)[source]

Reconstitute from a native dict.

Return type

Event

classmethod get_default_shard()[source]

Get the default listing shard for this event.

Return type

str

property shard

The shard name for this event.

Return type

str

property summary

A summary of this event.

Return type

EventSummary

to_dict()[source]

Generate a native dict representation.

Return type

Dict[str, Any]

version = None

The current state of the version (i.e. after the event).

class arxiv.canonical.domain.EventIdentifier(value)[source]

Bases: str

Unique identifier for an Event.

event_date = None

Timestamp of the event.

classmethod from_parts(identifier, event_date, shard)[source]

Generate a event identifier from its parts.

Return type

EventIdentifier

shard = None

Shard ID for the event.

version_id = None

Identifier of the Version to which the event pertains.

class arxiv.canonical.domain.EventSummary(identifier, event_date, event_type, event_id, categories=None, description='', is_legacy=False, event_agent=None)[source]

Bases: arxiv.canonical.domain.version._EventBase

A lightweight description of an event.

This has all of the data of the original Event except for the state of the e-print version.

event_id = None

Unique identifier for the event.

classmethod from_dict(data)[source]

Reconstitute from a native dict.

Return type

EventSummary

to_dict()[source]

Generate a native dict representation.

Return type

Dict[str, Any]

class arxiv.canonical.domain.EventType[source]

Bases: enum.Enum

Supported event types.

CROSSLIST = 'cross'
JREF = 'jref'
MIGRATE = 'migrate'
MIGRATE_METADATA = 'migrate_metadata'
NEW = 'new'
REPLACED = 'replace'
UPDATED = 'update'
UPDATED_METADATA = 'update_metadata'
WITHDRAWN = 'withdraw'
property is_new_version

Indicate whether or not this event type results in a new version.

Return type

bool

class arxiv.canonical.domain.Identifier(value)[source]

Bases: str

An arXiv e-print identifier.

Supports both old-style (archive.category/YYMMNNN) and new-style (YYMM.NNNNN) identifiers.

property category_part

For old-style identifiers, conveys the primary category.

Return type

str

classmethod from_parts(year, month, inc)[source]

Generate a new-style identifier from its parts.

Return type

Identifier

property incremental_part

The part of the identifier that is incremental.

Return type

int

property month

Month in which the first version of the e-print was announced.

Return type

int

property numeric_part

The entire numeric component of the identifier.

For new-style identifiers, this is the entire identifier.

Return type

str

property year

Year in which the first version of the e-print was announced.

Return type

int

property yymm

Numeric part conveying the original announcement year and month.

Return type

str

exception arxiv.canonical.domain.InvalidIdentifier[source]

Bases: ValueError

A value was encountered that is not a valid arXiv identifier.

class arxiv.canonical.domain.Key(value)[source]

Bases: arxiv.canonical.domain.file.URI

The unique identifier for a bitstream in the canonical record.

class arxiv.canonical.domain.License(href)[source]

Bases: arxiv.canonical.domain.base.CanonicalBase

License under which the e-print was provided to arXiv.

classmethod from_dict(data)[source]

Reconstitute from a native dict.

Return type

License

href = None

URI of the license resource.

to_dict()[source]

Generate a native dict representation.

Return type

Dict[str, Any]

class arxiv.canonical.domain.Listing(identifier, events)[source]

Bases: arxiv.canonical.domain.base.CanonicalBase

A collection of announcement-related events on a particular day.

property date

The date of this listing.

Return type

date

property end_datetime

Timestamp of the most recent event in this listing.

Return type

datetime

events = None

Events in this listing.

classmethod from_dict(data)[source]

Reconstitute from a native dict.

Return type

Listing

identifier = None

Unique identifier for this listing, based on the date and name.

property number_of_events

Total number of events in this listing.

Return type

int

property number_of_events_by_type

Number of events in this listing by event type.

Return type

Dict[EventType, int]

property number_of_versions

Total number of :class:`.Version`s represented in this listing.

Return type

int

property start_datetime

Timestamp of the earliest event in this listing.

Return type

datetime

to_dict()[source]

Generate a native dict representation.

Return type

Dict[str, Any]

class arxiv.canonical.domain.ListingDay(date, listings)[source]

Bases: arxiv.canonical.domain.base.CanonicalBase

Represents all of the listings for a particular day.

date = None

Date on which the events occurred.

listings = None

All of the listings on this date.

class arxiv.canonical.domain.ListingIdentifier(value)[source]

Bases: str

Unique identifier for a Listing.

Comprised of an ISO-8601 date and a name string.

classmethod from_parts(date, name)[source]

Generate from date and name parts.

Return type

ListingIdentifier

class arxiv.canonical.domain.ListingYear(year, months)[source]

Bases: arxiv.canonical.domain.base.CanonicalBase

A collection of listings over a year.

months = None

All of the listings in this year.

year = None

The year of this collection.

class arxiv.canonical.domain.ListingMonth(name, listings)[source]

Bases: arxiv.canonical.domain.base.CanonicalBase

A collection of listings over a month.

listings = None

All of the listings in this month.

property month

Month represented by this block.

Return type

int

name = None

The year and month of this collection.

property year

Year represented by this block.

Return type

int

class arxiv.canonical.domain.Metadata(primary_classification, secondary_classification, title, abstract, authors, license, comments=None, journal_ref=None, report_num=None, doi=None, msc_class=None, acm_class=None)[source]

Bases: arxiv.canonical.domain.base.CanonicalBase

Submitter-provided descriptive metadata for a version.

add_secondaries(*new_secondaries)[source]

Add cross-list categories for this version.

Return type

None

property all_categories

All classification categories for this version.

Return type

List[str]

classmethod from_dict(data)[source]

Reconstitute from a native dict.

Return type

Metadata

to_dict()[source]

Generate a native dict representation.

Return type

Dict[str, Any]

class arxiv.canonical.domain.Person(full_name, last_name=None, first_name=None, suffix=None, orcid=None, author_id=None, affiliation=None)[source]

Bases: arxiv.canonical.domain.base.CanonicalBase

Represents an human person in the canonical record.

affiliation = None
author_id = None
first_name = None
classmethod from_dict(data)[source]

Reconstitute from a native dict.

Return type

Person

last_name = None
orcid = None
suffix = None
to_dict()[source]

Generate a native dict representation.

Return type

Dict[str, Any]

class arxiv.canonical.domain.SourceType(value)[source]

Bases: str

Characterizes a version source package.

property available_formats

List the available dissemination formats for this source type.

Depending on the original source type, we may not be able to provide all supported formats.

This does not include the source format. Note also that this does not enforce rules about what should be displayed as an option or provided to end users.

Return type

List[ContentType]

property has_docx

Indicate whether the source has DOCX content.

Return type

bool

property has_encrypted_source

Indicate whether the source is encryped.

Return type

bool

property has_html

Indicate whether the source has HTML content.

Return type

bool

property has_ignore

Indicate whether the source content should be ignored.

Return type

bool

property has_odf

Indicate whether the source has ODF content.

Return type

bool

property has_pdf_only

Indicate whether the source contains only a PDF.

Return type

bool

property has_pdflatex

Indicate whether the source has PDFLaTeX content.

Return type

bool

property has_ps_only

Indicate whether the source has postcript content only.

Return type

bool

class arxiv.canonical.domain.URI(value)[source]

Bases: str

A unique identifier for bitstream content.

Bitstream content may be located in a variety of places prior to canonicalization. For example, it may be located on a local filesystem, or at a remote location accessible via HTTP.

property is_canonical

Indicate whether the URI is a key in the canonical record.

Return type

bool

property is_file

Indicate whether the URI is a path to a local file.

Return type

bool

property is_http_url

Indicate whether the URI is an HTTP URL.

Return type

bool

class arxiv.canonical.domain.Version(identifier, announced_date, announced_date_first, submitted_date, updated_date, metadata, source, events=None, previous_versions=None, submitter=None, proxy=None, is_announced=False, is_withdrawn=False, is_legacy=False, reason_for_withdrawal=None, source_type=None, render=None, formats={})[source]

Bases: arxiv.canonical.domain.base.CanonicalBase

Represents a single version of an arXiv e-print in the record.

announced_date = None

Day on which this version was announced.

announced_date_first = None

Day on which the first version of the e-print was announced.

events = None

Events that are specific to this version of the e-print.

formats = None

Dissemination formats for this version.

classmethod from_dict(data)[source]

Reconstitute from a native dict.

Return type

Version

get_format(desired_format)[source]

Get a particular dissemination format for this version.

Return type

CanonicalFile

identifier = None

Unique arXiv identifier for the version.

is_announced = None

Indicate whether or not the version is announced.

is_legacy = None

Indicate whether this record was populated from the legacy system.

is_withdrawn = None

Indicate whether or not the version is withdrawn.

metadata = None

Submitter-provided descriptive metadata for the version.

property number_of_events

Numer of events described by this object (0).

Return type

Literal[0]

property number_of_versions

Number of versions described by this object (1).

Return type

Literal[1]

previous_versions = None

References to previous versions of the e-print.

proxy = None

The proxy that deposited the version on behalf of the submitter.

reason_for_withdrawal = None

The reason for the withdrawal of the e-print.

render = None

Human-readable representation of the e-print.

Usually a PDF generated from the source, but may also be a user-provided PDF.

property size_kilobytes

Size of the source package in kb.

Return type

int

source = None

The original user-submitted source package.

source_type = None

Internal code for the source type.

submitted_date = None

Timestamp when this version was submitted to arXiv.

submitter = None

Person responsible for submitting this version.

to_dict()[source]

Generate a native dict representation.

Return type

Dict[str, Any]

updated_date = None

The last time the record for this version was changed.

class arxiv.canonical.domain.VersionedIdentifier(value)[source]

Bases: str

An arXiv identifier for a specific Version.

This is an Identifier with a version (v{N}) affix.

property category_part

For old-style identifiers, conveys the primary category.

Return type

str

classmethod from_parts(arxiv_id, version)[source]

Generate a new-style versioned identifier from its parts.

Return type

VersionedIdentifier

property incremental_part

The part of the identifier that is incremental.

Return type

int

property is_old_style

Indicate whether this is an old-style identifier.

Return type

int

property month

Month in which the first version of the e-print was announced.

Return type

int

property numeric_part

The entire numeric component of the identifier.

For new-style identifiers, this is the entire identifier.

Return type

str

property year

Year in which the first version of the e-print was announced.

Return type

int

property yymm

Numeric part conveying the original announcement year and month.

Return type

str

class arxiv.canonical.domain.VersionReference(identifier, announced_date, submitted_date)[source]

Bases: arxiv.canonical.domain.base.CanonicalBase

An abridged reference to a particular Version.

announced_date = None

Date on which the version was announced.

classmethod from_dict(data)[source]

Reconstitute from a native dict.

Return type

VersionReference

identifier = None

Identifier of the version.

submitted_date = None

Date on which the version was submitted.

to_dict()[source]

Generate a native dict representation.

Return type

Dict[str, Any]

arxiv.canonical.domain.available_formats_by_ext(filename)[source]

Attempt to determine the available dissemination formats by file extension.

It sometimes (but not always) possible to infer the available dissemination formats based on the filename extension of the source package.

Note

This is largely to support format discovery in classic. In the NG canonical record, this should all be explicit.

Return type

Optional[List[ContentType]]

arxiv.canonical.domain.list_source_extensions()[source]

List all of the known filename extensions for source files.

Return type

List[str]