πŸ“Š
Dataset

E Gmd

by Schism Audio schism-audio/e-gmd
Free2AITools Nexus Index
59.8
S: Semantic 50

Query-time baseline · scored live at search

A: Authority 61
P: Popularity 51
R: Recency 89
Q: Quality 50
Tech Context
Vital Performance
Data Integrity 59.8 FNI Score
- Size
- Rows
- Tokens
Dataset Information Summary
Entity Passport
Registry ID schism-audio/e-gmd
License CC-BY-4.0
Provider huggingface
πŸ“œ

Cite this dataset

Academic & Research Attribution

BibTeX
@misc{hf_dataset_schism_audio_e_gmd,
  author = {Schism Audio},
  title = {E Gmd Dataset},
  year = {2026},
  howpublished = {\url{https://huggingface.co/datasets/schism-audio/e-gmd}},
  note = {Accessed via Free2AITools.}
}
APA Style
Schism Audio. (2026). E Gmd [Dataset]. Free2AITools. https://huggingface.co/datasets/schism-audio/e-gmd

πŸ”¬Technical Deep Dive

Full Specifications [+]

βš–οΈ Free2AITools Nexus Index V2.0

Semantic (S) 50

Query-time baseline · scored live at search

Authority (A) 61
Popularity (P) 51
Recency (R) 89
Quality (Q) 50

πŸ’¬ Index Insight

FNI V2.0 for E Gmd: Authority (A:61), Popularity (P:51), Recency (R:89), Quality (Q:50). Semantic (S) is a query-time baseline scored live at search.

Free2AITools Nexus Index

Data Sources / Provenance

Open data Updated: Live data
⬇️
Downloads
31,088

🎯 Task Categories

audio-classification

πŸ‘οΈ Data Preview

πŸ“Š

Row-level preview not available for this dataset.

Schema structure is shown in the Field Logic panel when available.

πŸ”— Explore Full Dataset β†—

🧬 Field Logic

🧬

Schema not yet indexed for this dataset.

Dataset Specification

Expanded Groove MIDI Dataset (E-GMD)

This repository mirrors version 1.0.0 of Google's Expanded Groove MIDI Dataset (E-GMD) for access through the Hugging Face Hub.

E-GMD is a large dataset of human drum performances with audio recordings annotated in MIDI. It contains 444.5 hours of audio from 43 drum kits, with the same train, validation, and test split definitions as the original Groove MIDI Dataset.

Quick Start

python
from datasets import load_dataset

ds = load_dataset("schism-audio/e-gmd", split="train", streaming=True)
first = next(iter(ds))
print(first["audio"], first["midi_path"], first["split"])

Repository Layout

The original archive contains some session folders with more than 10,000 files, which exceeds Hugging Face Hub's per-folder repository limit. This mirror keeps the original filenames and original drummer/session paths, but stages files under split and kit directories:

text
audio/{split}/{kit_slug}/{original_drummer/session/path}.wav
midi/{split}/{kit_slug}/{original_drummer/session/path}.midi
metadata.csv
metadata/{split}.csv
metadata/all.csv
e-gmd-v1.0.0.csv

The root metadata.csv follows the AudioFolder convention and contains one row per WAV file. The file_name column points at the audio file, and the remaining columns preserve E-GMD metadata plus the paired MIDI path.

The metadata/*.csv files are retained from the original mirror and add these path columns:

  • file_name: audio path relative to the split folder
  • audio_path: audio path relative to the repository root
  • midi_path: paired MIDI path relative to the repository root
  • original_audio_filename: original archive audio path
  • `original_midi_filen

Social Proof

HuggingFace Hub
31.1KDownloads
πŸ”„ Updated daily

Source summary: Based on Hugging Face metadata. Not a recommendation.

πŸ“Š FNI Methodology πŸ“š Knowledge Baseℹ️ Verify with original source

πŸ›‘οΈ Dataset Transparency Report

Technical metadata sourced from upstream repositories.

Open Metadata

πŸ†” Identity & Source

id
hf-dataset--schism-audio--e-gmd
slug
schism-audio--e-gmd
source
huggingface
author
Schism Audio
license
CC-BY-4.0
tags
task_categories:audio-classification, license:cc-by-4.0, size_categories:10k<n<100k, format:audiofolder, modality:audio, modality:text, library:datasets, library:mlcroissant, arxiv:2004.00188, region:us, midi, drums, percussion, drum-transcription, music-information-retrieval, automatic-drum-transcription

βš™οΈ Technical Specs

architecture
null
params billions
null
context length
null
pipeline tag

πŸ“Š Engagement & Metrics

downloads
31,088
stars
null
forks
null

Data indexed from public sources. Updated daily.