Partially rewriting an LLM in natural language

Using interpretations of SAE latents to simulate activations.

More from this stream

Recomended