Dataset

GroundCUA

by Elonmusk V2 elonmusk-v2/groundcua
Free2AITools Nexus Index (FNI) Score: 59.4
S: Semantic 50 (query-time baseline, scored live at search)
A: Authority 61
P: Popularity 51
R: Recency 86
Q: Quality 50
Dataset Information Summary
Entity Passport
Registry ID elonmusk-v2/groundcua
License MIT
Provider huggingface

Cite this dataset

Academic & Research Attribution

BibTeX
@misc{hf_dataset_elonmusk_v2_groundcua,
  author = {Elonmusk V2},
  title = {GroundCUA Dataset},
  year = {2026},
  howpublished = {\url{https://huggingface.co/datasets/ElonMusk-v2/GroundCUA}},
  note = {Accessed via Free2AITools.}
}
APA Style
Elonmusk V2. (2026). GroundCUA [Dataset]. Free2AITools. https://huggingface.co/datasets/ElonMusk-v2/GroundCUA

Technical Deep Dive


Index Insight

FNI V2.0 for GroundCUA: Authority (A:61), Popularity (P:51), Recency (R:86), Quality (Q:50). Semantic (S) is a query-time baseline scored live at search.

Data Sources / Provenance

Open data · Updated: Live data
Downloads: 28,243

Task Categories

image-to-text


Dataset Specification

GroundCUA: Grounding Computer Use Agents on Human Demonstrations

Website | Paper | Dataset | Models

GroundCUA Overview

GroundCUA Dataset

GroundCUA is a large and diverse dataset of real UI screenshots paired with structured annotations for building multimodal computer use agents. It covers 87 software platforms across productivity tools, browsers, creative tools, communication apps, development environments, and system utilities. GroundCUA is designed for research on GUI grounding, UI perception, and vision-language-action models that interact with computers.


Highlights

  • 87 platforms spanning Windows, macOS, Linux, and cross-platform apps
  • Annotated UI elements with bounding boxes, text, and coarse semantic categories
  • SHA-256 file pairing between screenshots and JSON annotations
  • Supports research on GUI grounding, multimodal agents, and UI understanding
  • MIT license for broad academic and open source use
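
To make the annotation highlights concrete, here is a minimal Python sketch of how one annotated UI element might be represented and used for grounding. The class name `UIElement`, its field names, and the `(x_min, y_min, x_max, y_max)` pixel box convention are assumptions made for illustration; the actual JSON keys and coordinate format should be checked against the dataset itself.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)
class UIElement:
    """One annotated element. Field names are hypothetical, not the dataset schema."""

    bbox: tuple[float, float, float, float]  # assumed (x_min, y_min, x_max, y_max) in pixels
    text: str                                # visible label or text content
    category: str                            # coarse semantic category

    def center(self) -> tuple[float, float]:
        """Midpoint of the box, a common target for a click action."""
        x_min, y_min, x_max, y_max = self.bbox
        return ((x_min + x_max) / 2, (y_min + y_max) / 2)

    def area(self) -> float:
        x_min, y_min, x_max, y_max = self.bbox
        return max(0.0, x_max - x_min) * max(0.0, y_max - y_min)

    def contains(self, x: float, y: float) -> bool:
        """True if the point lies inside the box (edges included)."""
        x_min, y_min, x_max, y_max = self.bbox
        return x_min <= x <= x_max and y_min <= y <= y_max


def element_at(elements: Iterable[UIElement], x: float, y: float) -> Optional[UIElement]:
    """Return the smallest element containing (x, y), or None if nothing matches."""
    hits = [e for e in elements if e.contains(x, y)]
    return min(hits, key=UIElement.area, default=None)
```

Choosing the smallest containing box is one simple way to resolve nested elements, such as a button inside a toolbar; other tie-breaking rules are equally plausible.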

Dataset Structure

GroundCUA/
├── data/              # JSON annotation files
├── images/            # Screenshot images
└── README.md

Directory Layout

Each platform appears as a directory name inside both data/ and images/.

  • data/PlatformName/ contains annotation JSON files
  • images/PlatformName/ contains the corresponding screenshot images
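
The layout above can be walked with a short Python sketch that pairs each annotation file with its screenshot. It assumes a local copy of the dataset and that an annotation and its screenshot share the same file stem (the hash-based name mentioned in the highlights); the function name `pair_files` is hypothetical, and no particular image extension is assumed.

```python
from pathlib import Path


def pair_files(root):
    """Pair data/<Platform>/<stem>.json with images/<Platform>/<stem>.<ext>.

    Assumes matching file stems between the two trees (an assumption, not
    something this sketch verifies against the real dataset).
    Returns (pairs, unpaired): pairs is a list of
    (platform, annotation_path, image_path); unpaired lists annotation
    files with no matching image.
    """
    root = Path(root)
    pairs, unpaired = [], []
    for platform_dir in sorted((root / "data").iterdir()):
        if not platform_dir.is_dir():
            continue
        image_dir = root / "images" / platform_dir.name
        # Index this platform's images by stem so the extension does not matter.
        images = {}
        if image_dir.is_dir():
            images = {p.stem: p for p in image_dir.iterdir() if p.is_file()}
        for annotation in sorted(platform_dir.glob("*.json")):
            image = images.get(annotation.stem)
            if image is None:
                unpaired.append(annotation)
            else:
                pairs.append((platform_dir.name, annotation, image))
    return pairs, unpaired
```

Calling `pair_files("GroundCUA")` on a local copy would return the matched pairs together with any annotation files that lack a screenshot, which is a quick integrity check before training.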


Source summary: Based on Hugging Face metadata. Not a recommendation.


Dataset Transparency Report

Technical metadata sourced from upstream repositories.

Open Metadata

Identity & Source

id: hf-dataset--elonmusk-v2--groundcua
slug: elonmusk-v2--groundcua
source: huggingface
author: Elonmusk V2
license: MIT
tags: task_categories:image-to-text, language:en, license:mit, size_categories:1m<n<10m, modality:image, arxiv:2511.07332, region:us, computer_use, agents, grounding, multimodal, ui-vision, groundcua
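
The tag list mixes `key:value` entries (such as `license:mit`) with bare keywords (such as `grounding`). A small Python sketch, with the hypothetical helper name `parse_tags`, shows one way to separate the two; it assumes tags are comma-separated and that only the first colon in a tag separates key from value.

```python
def parse_tags(raw):
    """Split a comma-separated tag string into keyed and bare tags.

    Returns (keyed, bare): keyed maps each key to a list of values,
    bare is the list of tags that contain no colon.
    """
    keyed = {}
    bare = []
    for tag in (part.strip() for part in raw.split(",")):
        if not tag:
            continue  # skip empty fragments such as a trailing comma
        key, sep, value = tag.partition(":")  # split on the first colon only
        if sep:
            keyed.setdefault(key, []).append(value)
        else:
            bare.append(tag)
    return keyed, bare
```

Applied to the tags above, the keyed part would carry entries such as `arxiv` and `size_categories`, while the bare part would hold the topical keywords.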


Engagement & Metrics

downloads: 28,243
stars: null
forks: null

Data indexed from public sources. Updated daily.