Skip to the content.

Overview

When two language models collaborate, e.g., two agents communicating, they exchange information in natural language, one token at a time. This is a natural interface, but a lossy one. To emit a token, a model reduces its full distribution over the vocabulary into a single choice, discarding the uncertainty and structure that its intermediate activations encode. A model’s richest representation of a problem lives in those activations. The final decoding step discards most of it, because next-token prediction does not require it. However, that discarded information can be valuable.

A recent line of work replaces text with the direct exchange of internal state and reports better coordination, stronger reasoning, and substantially lower compute. CIPHER (Pham et al. 2024) lets debating models exchange a probability-weighted average of token embeddings rather than a sampled token, communicating a belief over the whole vocabulary instead of a single guess. Activation communication (Ramesh and Li 2025) pauses one model mid-forward-pass and grafts another model’s intermediate activation into it, transmitting the full enriched representation of the context. Interlat (Du et al. 2026) goes furthest, transmitting the entire trajectory of last-layer hidden states as a model’s “thought,” and compressing it to a handful of latent steps.

These methods share the same principle where information loss occurs at the moment a distribution is reduced into a token, and latent communication reduces such a loss in information. Each method preserves a different amount of the uncollapsed state, and that choice determines how far the transmitted signal moves the receiving model from the activations it normally produces, and therefore how much, if any, training is required. Small perturbations (a single grafted activation, or a convex combination of real embeddings) stay close to states the receiver already knows how to read and work zero-shot, even across model families. Large perturbations (a whole hidden-state trajectory) is distant from the receiver’s manifold and require a lightweight trained adapter, together with a curriculum that gradually shifts the receiver from text onto latents.

Directions

We outline three exploratory directions, rather than concrete research questions.

Feasibility and Resources

We can target small open-weight models (up to 32B). In case of heavy calculations (e.g. the sweeping over $n$ in the second direction), GPUs can be made available.

Logistics and Collaboration

Minimum Requirements

The project is intended to be accessible, and curiosity about how models represent and exchange information matters more than prior research experience. An ideal collaborator should meet the following minimum requirements:

Nice to Have

The following are qualifications that are nice-to-have, but can be acquired throughout the project:

References