Dataset

arXiv Dataset

by Cornell University
ID: hf-dataset--cornell-university--arxiv
FNI Rank 53
Percentile Top 0%
Activity
→ 0.0%

arXiv dataset and metadata of 1.7M+ scholarly papers across STEM

Data Integrity 53 FNI Score
- Size
- Rows
Parquet Format
- Tokens
Dataset Information Summary
Entity Passport
Registry ID hf-dataset--cornell-university--arxiv
License CC0: Public Domain
Provider kaggle

Cite this dataset

Academic & Research Attribution

BibTeX
@misc{hf_dataset__cornell_university__arxiv,
  author = {Cornell University},
  title = {arXiv Dataset},
  year = {2026},
  howpublished = {\url{https://www.kaggle.com/datasets/Cornell-University/arxiv}},
  note = {Accessed via Free2AITools Knowledge Fortress}
}
APA Style
Cornell University. (2026). arXiv Dataset [Dataset]. Free2AITools. https://www.kaggle.com/datasets/Cornell-University/arxiv

🔬 Technical Deep Dive

⚖️ Free2AI Nexus Index

53.0
Top 0% Overall Impact
🔥 Popularity (P) 0
🚀 Velocity (V) 0
🛡️ Credibility (C) 0
🔧 Utility (U) 0
Nexus Verified Data

💬 Why this score?

The Nexus Index for arXiv Dataset aggregates Popularity (P:0), Velocity (V:0), and Credibility (C:0). The Utility score (U:0) represents deployment readiness, context efficiency, and structural reliability within the Nexus ecosystem.

Data Verified 🕐 Last Updated: Not calculated
Free2AI Nexus Index | Fair · Transparent · Explainable | Full Methodology
Downloads
104,577
Likes
1,589

👁️ Data Preview

Row-level preview not available for this dataset.

Schema structure is shown in the Field Logic panel when available.

🧬 Field Logic

Schema not yet indexed for this dataset.

Top Tier

Social Proof

HuggingFace Hub
1.6K Likes
104.6K Downloads
🔄 Daily sync (03:00 UTC)

AI Summary: Based on Kaggle metadata. Not a recommendation.

🛡️ Dataset Transparency Report

Verified data manifest for traceability and transparency.

100% Data Disclosure Active

🆔 Identity & Source

id
hf-dataset--cornell-university--arxiv
source
kaggle
author
Cornell University
license
CC0: Public Domain

⚙️ Technical Specs

architecture
null
params billions
null
context length
null

📊 Engagement & Metrics

likes
1,589
downloads
104,577

Free2AITools Constitutional Data Pipeline: Curated disclosure mode active. (V15.x Standard)