Mech Interp → Understanding interpretable processes inside a complex object (i.e. nn). Should be a useful process, an example is circuit discovery
Mar 19, 20261 min read
Mech Interp → Understanding interpretable processes inside a complex object (i.e. nn). Should be a useful process, an example is circuit discovery