s
Repositories
Select a repository to view its commits, contributors, and more.public
sparse-dictionary-learning
An Open Source Implementation of Anthropic's Paper: "Towards Monosemanticity: Decomposing Language Models with Dictionary Learning"
Python
35
4
0
Updated 11 days ago
public
AC-Solver
A long-horizon, sparse-reward math environment for reinforcement learning. Official code repo for "What makes Math problems hard for reinforcement learning: A case study".
Jupyter Notebook
14
4
1
Updated 13 days ago
public
attn_saes
HTML
1
0
0
Updated 3 months ago
public
feature-interface
HTML
0
0
0
Updated 10 months ago
public
shehper.github.io
0
0
0
Updated 4 months ago