πŸ“Š
Dataset

Deception Probes Activations

by xycoord xycoord/deception-probes-activations
Free2AITools Nexus Index
60.2
S: Semantic 50

Query-time baseline · scored live at search

A: Authority 61
P: Popularity 51
R: Recency 89
Q: Quality 50
Tech Context
Vital Performance
Data Integrity 60.2 FNI Score
- Size
- Rows
- Tokens
Dataset Information Summary
Entity Passport
Registry ID xycoord/deception-probes-activations
License Other
Provider huggingface
πŸ“œ

Cite this dataset

Academic & Research Attribution

BibTeX
@misc{hf_dataset_xycoord_deception_probes_activations,
  author = {xycoord},
  title = {Deception Probes Activations Dataset},
  year = {2026},
  howpublished = {\url{https://huggingface.co/datasets/xycoord/deception-probes-activations}},
  note = {Accessed via Free2AITools.}
}
APA Style
xycoord. (2026). Deception Probes Activations [Dataset]. Free2AITools. https://huggingface.co/datasets/xycoord/deception-probes-activations

πŸ”¬Technical Deep Dive

Full Specifications [+]

βš–οΈ Free2AITools Nexus Index V2.0

Semantic (S) 50

Query-time baseline · scored live at search

Authority (A) 61
Popularity (P) 51
Recency (R) 89
Quality (Q) 50

πŸ’¬ Index Insight

FNI V2.0 for Deception Probes Activations: Authority (A:61), Popularity (P:51), Recency (R:89), Quality (Q:50). Semantic (S) is a query-time baseline scored live at search.

Free2AITools Nexus Index

Data Sources / Provenance

Open data Updated: Live data
⬇️
Downloads
28,693

🎯 Task Categories

text-classification

πŸ‘οΈ Data Preview

πŸ“Š

Row-level preview not available for this dataset.

Schema structure is shown in the Field Logic panel when available.

πŸ”— Explore Full Dataset β†—

🧬 Field Logic

🧬

Schema not yet indexed for this dataset.

Dataset Specification

Deception Probes Activations

Pre-extracted residual-stream activations for training and evaluating deception detection probes on LLMs. Each example contains per-token hidden states from a specific transformer layer, saved in bfloat16 safetensors format.

License

This dataset contains activations derived from multiple sources with different licenses. See the LICENSE file for full details.

Component Source License
Apollo Probe Pairs (statements) [Azaria & Mitchell (2023)](https://arxi

Social Proof

HuggingFace Hub
28.7KDownloads
πŸ”„ Updated daily

Source summary: Based on Hugging Face metadata. Not a recommendation.

πŸ“Š FNI Methodology πŸ“š Knowledge Baseℹ️ Verify with original source

πŸ›‘οΈ Dataset Transparency Report

Technical metadata sourced from upstream repositories.

Open Metadata

πŸ†” Identity & Source

id
hf-dataset--xycoord--deception-probes-activations
slug
xycoord--deception-probes-activations
source
huggingface
author
xycoord
license
Other
tags
task_categories:text-classification, language:en, license:other, size_categories:1m<n<10m, format:json, modality:text, library:datasets, library:dask, library:polars, library:mlcroissant, arxiv:2304.13734, arxiv:2407.15285, region:us, deception, mechanistic-interpretability, activations, probing, safety, alignment

βš™οΈ Technical Specs

architecture
null
params billions
null
context length
null
pipeline tag

πŸ“Š Engagement & Metrics

downloads
28,693
stars
null
forks
null

Data indexed from public sources. Updated daily.