feat: Sparse probing eval based on "Are SAEs useful" paper by chanind · Pull Request #79 · adamkarvonen/SAEBench

chanind · 2025-10-04T17:46:05Z

This PR adds a sparse-probing eval called sparse_probing_sae_probes, named to keep it separate from the original sparse-probing eval in SAEBench. The eval is based on the SAE-Probes paper, "Are Sparse Autoencoders Useful? A Case Study in Sparse Probing". The sparse-probing tasks from this paper offer two benefits:

Lots of datasets: the SAE-Probes paper evaluates on over 140 sparse-probing datasets.
Cross-validated probing: the SAE-Probes paper tunes its probes heavily, which gives a stronger, more realistic baseline to compare against.
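
For readers unfamiliar with the technique, here is a minimal sketch of cross-validated sparse probing on synthetic data. It is not the sae-probes implementation: the function names, the mean-difference selection rule, the grid of `C` values, and the synthetic activations are all assumptions made for illustration. The idea is to pick a handful of latents on the training split, then fit a logistic-regression probe on only those latents, with the regularization strength chosen by cross-validation.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, train_test_split


def top_k_latents(acts: np.ndarray, labels: np.ndarray, k: int) -> np.ndarray:
    """Return indices of the k latents with the largest absolute
    difference in mean activation between the two classes.
    (Assumed selection heuristic, chosen for illustration.)"""
    diff = acts[labels == 1].mean(axis=0) - acts[labels == 0].mean(axis=0)
    return np.argsort(-np.abs(diff))[:k]


def sparse_probe_accuracy(acts, labels, k=4, seed=0):
    """Select k latents on the training split, fit a cross-validated
    logistic-regression probe on them, and score on held-out data."""
    x_train, x_test, y_train, y_test = train_test_split(
        acts, labels, test_size=0.25, random_state=seed, stratify=labels
    )
    # Latent selection uses training data only, to avoid test leakage.
    idx = top_k_latents(x_train, y_train, k)
    # 5-fold cross-validation over a small, hypothetical grid of C values.
    probe = GridSearchCV(
        LogisticRegression(max_iter=1000),
        param_grid={"C": [0.01, 0.1, 1.0, 10.0]},
        cv=5,
    )
    probe.fit(x_train[:, idx], y_train)
    return idx, probe.score(x_test[:, idx], y_test)


# Synthetic stand-in for SAE activations: non-negative noise, with
# latent 7 carrying the label signal.
rng = np.random.default_rng(0)
n_samples, n_latents = 400, 64
labels = rng.integers(0, 2, size=n_samples)
acts = np.abs(rng.normal(size=(n_samples, n_latents)))
acts[:, 7] += 3.0 * labels

idx, acc = sparse_probe_accuracy(acts, labels, k=4)
print("selected latents:", idx, "held-out accuracy:", acc)
```

Selecting latents on the training split before fitting keeps the held-out score honest; selecting on the full dataset would leak label information into the probe.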

This benchmark wraps the standalone sae-probes package and writes its results in SAEBench format.

chanind added 2 commits October 4, 2025 18:30

polishing sae-probes sparse probing eval

850ee49

updating more docs / eval locations

e196506

chanind marked this pull request as draft October 12, 2025 23:04

chanind added 3 commits December 23, 2025 15:28

Merge branch 'main' into sae-probes-sparse-probing

dacaf5e

fixing formatting and import in test

db48f97

ignoring type error

d3f8836

chanind marked this pull request as ready for review December 23, 2025 20:58

chanind changed the title ~~Sparse probing eval based on "Are SAEs useful" paper~~ feat: Sparse probing eval based on "Are SAEs useful" paper Dec 23, 2025

updating README

5a3a63d

chanind marked this pull request as draft December 23, 2025 22:28

chanind added 2 commits December 23, 2025 18:58

nest individual results under SAE name

7e70996

updating readme

9eaffa8

chanind marked this pull request as ready for review December 24, 2025 00:39

adamkarvonen merged commit c1a27da into adamkarvonen:main Dec 30, 2025
4 checks passed

adamkarvonen merged 8 commits into adamkarvonen:main from chanind:sae-probes-sparse-probing


2 participants
