News

A stacked sparse autoencoder is a deep learning architecture composed of multiple autoencoder layers, each extracting features at a different level of abstraction.
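As a rough sketch of that idea (the class name, layer sizes, and L1 coefficient below are illustrative, not taken from any of the papers mentioned here):

```python
import torch
import torch.nn as nn

class SparseAutoencoder(nn.Module):
    """One autoencoder layer: encode the input into a wider, sparsely
    active code, then decode the code back into a reconstruction."""
    def __init__(self, d_in: int, d_code: int):
        super().__init__()
        self.encoder = nn.Linear(d_in, d_code)
        self.decoder = nn.Linear(d_code, d_in)

    def forward(self, x):
        code = torch.relu(self.encoder(x))   # sparse feature activations
        recon = self.decoder(code)           # reconstruction of the input
        return code, recon

# Stacking: each layer consumes the codes of the one before it, so deeper
# layers extract features at higher levels of abstraction.
sae1 = SparseAutoencoder(d_in=256, d_code=1024)
sae2 = SparseAutoencoder(d_in=1024, d_code=4096)

x = torch.randn(8, 256)
code1, recon1 = sae1(x)
code2, recon2 = sae2(code1)

# Sparsity is typically encouraged with an L1 penalty on the codes.
loss = nn.functional.mse_loss(recon1, x) + 1e-3 * code1.abs().mean()
```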
New research by Google DeepMind shows how sparse autoencoders (SAEs) with a JumpReLU activation can help interpret LLMs.
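At its core, JumpReLU is a ReLU whose cutoff sits at a learnable threshold θ rather than at zero: pre-activations at or below θ are zeroed, which pushes the autoencoder toward sparser feature codes. A minimal sketch (the published method learns θ per feature and trains it with straight-through estimators, which this omits):

```python
import torch

def jump_relu(z: torch.Tensor, theta: torch.Tensor) -> torch.Tensor:
    """JumpReLU: pass values through unchanged where they exceed the
    threshold theta, and output zero everywhere else."""
    return z * (z > theta).to(z.dtype)

# With theta = 0.5, small positive activations are suppressed, not just negatives.
z = torch.tensor([-1.0, 0.2, 0.6, 2.0])
print(jump_relu(z, torch.tensor(0.5)))  # tensor([0.0000, 0.0000, 0.6000, 2.0000])
```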
Anthropic opened a window into the ‘black box’ where ‘features’ steer a large language model’s output. OpenAI dug into the same concept two weeks later with a deep dive into sparse autoencoders.
To find features (categories of data that represent a larger concept) in its AI model Gemma, DeepMind ran a tool known as a “sparse autoencoder” on each of the model’s layers.
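A toy illustration of that per-layer setup (the stand-in model, shapes, and encoders below are made up; in practice each SAE is trained separately on activations captured from the real model):

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
# Stand-in for a language model: a stack of blocks whose activations we probe.
blocks = nn.ModuleList([nn.Linear(64, 64) for _ in range(4)])
# One sparse-autoencoder encoder per layer, widening 64 dims to 512 features.
sae_encoders = nn.ModuleList([nn.Linear(64, 512) for _ in range(4)])

x = torch.randn(8, 64)                    # batch of activations entering layer 0
features_per_layer = []
for block, enc in zip(blocks, sae_encoders):
    x = torch.relu(block(x))              # this layer's activations
    features_per_layer.append(torch.relu(enc(x)))  # sparse features for this layer
```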
The researchers proposed, and subsequently tried, various workarounds, achieving good results in 2023 on very small language models with a so-called “sparse autoencoder”.