πŸ“Š
Dataset

Cc12m Images 058

by Neomi26 neomi26/cc12m-images-058
Free2AITools Nexus Index
60.0
S: Semantic 50

Query-time baseline · scored live at search

A: Authority 61
P: Popularity 50
R: Recency 95
Q: Quality 50
Tech Context
Vital Performance
Data Integrity 60 FNI Score
- Size
- Rows
- Tokens
Dataset Information Summary
Entity Passport
Registry ID neomi26/cc12m-images-058
License Other
Provider huggingface
πŸ“œ

Cite this dataset

Academic & Research Attribution

BibTeX
@misc{hf_dataset_neomi26_cc12m_images_058,
  author = {Neomi26},
  title = {Cc12m Images 058 Dataset},
  year = {2026},
  howpublished = {\url{https://huggingface.co/datasets/Neomi26/cc12m-images-058}},
  note = {Accessed via Free2AITools.}
}
APA Style
Neomi26. (2026). Cc12m Images 058 [Dataset]. Free2AITools. https://huggingface.co/datasets/Neomi26/cc12m-images-058

πŸ”¬Technical Deep Dive

Full Specifications [+]

βš–οΈ Free2AITools Nexus Index V2.0

Semantic (S) 50

Query-time baseline · scored live at search

Authority (A) 61
Popularity (P) 50
Recency (R) 95
Quality (Q) 50

πŸ’¬ Index Insight

FNI V2.0 for Cc12m Images 058: Authority (A:61), Popularity (P:50), Recency (R:95), Quality (Q:50). Semantic (S) is a query-time baseline scored live at search.

Free2AITools Nexus Index

Data Sources / Provenance

Open data Updated: Live data
⬇️
Downloads
25,395

🎯 Task Categories

image-to-text

πŸ‘οΈ Data Preview

πŸ“Š

Row-level preview not available for this dataset.

Schema structure is shown in the Field Logic panel when available.

πŸ”— Explore Full Dataset β†—

🧬 Field Logic

🧬

Schema not yet indexed for this dataset.

Dataset Specification

cc12m-images-058

Subset of CC12M images re-hosted as individual JPEG files for browser access.

  • Images in this repo: 90,000
  • Image URL pattern: https://huggingface.co/datasets/Neomi26/cc12m-images-058/resolve/main/{folder}/{key}.jpg
  • Manifest: manifest.parquet β€” columns: key, image_url, source_url, caption, width, height, original_width, original_height, shard, repo, path

Part of Neomi26/cc12m-images-* series β€” 122 repos, from cc12m-images-000 to cc12m-images-121. Global index: Neomi26/cc12m-images-index

Original dataset: pixparse/cc12m-wds

πŸ“Š Structured Schema (Zero-Fabrication)

Feature Key Data Type
image Image
label ClassLabel

Estimated Rows: 90,000

Social Proof

HuggingFace Hub
25.4KDownloads
πŸ”„ Updated daily

Source summary: Based on Hugging Face metadata. Not a recommendation.

πŸ“Š FNI Methodology πŸ“š Knowledge Baseℹ️ Verify with original source

πŸ›‘οΈ Dataset Transparency Report

Technical metadata sourced from upstream repositories.

Open Metadata

πŸ†” Identity & Source

id
hf-dataset--neomi26--cc12m-images-058
slug
neomi26--cc12m-images-058
source
huggingface
author
Neomi26
license
Other
tags
task_categories:image-to-text, license:other, size_categories:10k<n<100k, format:imagefolder, modality:image, library:datasets, library:mlcroissant, region:us, cc12m, images

βš™οΈ Technical Specs

architecture
null
params billions
null
context length
null
pipeline tag

πŸ“Š Engagement & Metrics

downloads
25,395
stars
null
forks
null

Data indexed from public sources. Updated daily.