NVIDIA Progress Prize submission
This is the Github repository to the Progress Prize winning submission for NVIDIA Nemotron Model Reasoning Challenge.
Resources on Kaggle
Tabs on nemotron.huikang.dev
- Base β Grid of competition problems colored by how the base model (pre-fine-tuning) does on each: solved / partially solved / unsolved across its generation runs. Click a problem for its prompt, parsed transformation table, answer, per-run extracted answer, and the token-level generation trace colored by logprob.
- Synthetic β Same problem set as Base, but colored by investigation status (rule found / hypothesis formed / rule unknown). Click a problem for its prompt, parsed transformation, answer, submission, reasoning text, and investigation notes.
- Corpus β Sortable table of training corpus entries with masked, unmasked, and total token counts per row. Filter by category or problem ID; open a row to see the token-level trace with masking highlighted.
- Training β Per-problem table of step, loss-token count, and minimum logprob across training epochs. Select an epoch and a row to see token-level logprob changes against the base model.
- Metrics β Index of training runs (LR, backend, epochs, batch, LoRA rank, examples, tokens, steps). Click a run to see its per-step charts: loss per token (overall and by category), min logprob by category, gradient norm, learning rate, and step time. Cmd+click a legend entry to isolate that category.
Running the webpage locally
Serves the static site at http://localhost:33304/.
Executing training
Orig