⛳ golf

AutoResearch

Live Spark screening status surface

refreshes every 30s
updated 35d ago
Current Run

shorter-warmdown-2000

runningoptimizer

phase: training

On Spark (~13,500 steps in 2h), warmdown_iters=3000 means 22% of training is spent decaying LR vs 15% on 8xH100 (20K iters). Reducing to 2000 restores the H100-proportional warmdown and gives more full-LR training time. Previous experiments only tried increasing warmdown (3600, 6000) and both failed; this is the first decrease.

Updated
05 Apr 2026, 20:10 UTC
bpb@2h
refs
2026-03-18_FP16Embed_WD3600
links
Promoted Candidate

spark-seed

bpb@2h:

Policy

2h DGX Spark screening, EVAL_STRIDE=0, training-side portability only, W&B=required

Queue tools remain at /queue.