Mech Interp on Tiny Models

Promise: 81 (initial)

Activation threshold: 62%

$310 of $500

About

A systematic study of circuits and features in sub-1B parameter LMs, looking for transferable interpretability primitives.

Mech-interp researcher, formerly Anthropic interpretability team.

Prior ventures

01 Sub-1B model selection + circuit map
Select 5 base models, map known circuits, publish methodology.
Pending
Deadline: in 14d
Tranche: $0.60
Outputs: Methodology doc, Circuit map dataset
02 Transferable feature catalog
Identify ≥20 features that transfer across ≥3 of the selected models.
Pending
Deadline: in 9w
Tranche: $0.60
Outputs: Feature catalog, Replication notebook
03 Whitepaper + dataset release
Submit to NeurIPS / ICLR with public dataset.
Pending
Deadline: in 17w
Tranche: $0.60
Outputs: Whitepaper, Public dataset

Activates at: $500 pool

Monthly allowance: $50

Auto-liquidate: Progress < 30 for 30d

Auto-pivot on disputes: ON