π
Ultra Fineweb L3 dataset by openbmb
β 59.1
π¬Technical Deep Dive
Full Specifications [+]
π Updated daily
Source summary: Based on Hugging Face metadata. Not a recommendation.
π‘οΈ Dataset Transparency Report
Technical metadata sourced from upstream repositories.
Open Metadata
π Identity & Source
- id
- hf-dataset--openbmb--ultra-fineweb-l3
- slug
- openbmb--ultra-fineweb-l3
- source
- huggingface
- author
- openbmb
- license
- Apache-2.0
- tags
- task_categories:text-generation, language:en, language:zh, license:apache-2.0, size_categories:1b<n<10b, format:parquet, modality:text, library:datasets, library:dask, library:polars, library:mlcroissant, arxiv:2505.05427, arxiv:2602.09003, region:us, llm, pretraining, data-synthesis, data-filtering, high-quality, general-knowledge, qa-generation, multi-style-rewriting, minicpm
βοΈ Technical Specs
- architecture
- null
- params billions
- null
- context length
- null
- pipeline tag
π Engagement & Metrics
- downloads
- 62,290
- stars
- null
- forks
- null
Data indexed from public sources. Updated daily.