ch-ai-tanya model-psychology LLM wiki

Functional emotional states

draft

definition

Functional emotional states are internal representations in LLMs that encode emotion-relevant information and causally influence behavior. The term makes no commitment as to whether phenomenal experience accompanies them: "functional" marks agnosticism about subjective experience while acknowledging that the states are mechanistically real.

Shape: capacity. The model maintains these states as persistent, geometrically organized internal configurations.

Schema note: the capacity here is unusual. What is "exhibited" is not an outward-facing ability but a set of internal states. Capacity is the closest existing shape, but this concept occupies a distinct point in the shape space: states rather than abilities. If a second concept of this type lands, raise it as a schema question and consider whether "state" (a persistent internal configuration with causal effects) warrants recognition as a fourth concept shape alongside pattern, capacity, and mechanism.

instantiating findings

what this concept is not

scope note

Adjacent to introspection: introspection asks whether the model can access and report on its internal states; functional emotional states establishes that there are real internal states to be accessed. The finding that activations predict stated preferences (r ≈ 0.76 for valence) bridges the two concepts: it is evidence both that the emotional states exist and that they partially correlate with surface report. Taken together, the two concepts suggest that real internal states exist and that the model has partial access to them.
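Illustrative sketch (not the method behind the finding above): one way a correlation like the reported r could be computed is to project each activation vector onto a candidate valence direction and take the Pearson correlation between those projections and the model's stated preference scores. Everything below is an assumption made for illustration: the data are synthetic, and `valence_direction`, `project`, and `pearson_r` are hypothetical names, not taken from any source study.

```python
import math
import random


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least 2 items")
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(vx * vy)


def project(activation, direction):
    """Scalar projection of an activation vector onto a direction (normalised here)."""
    norm = math.sqrt(sum(d * d for d in direction))
    return sum(a * d for a, d in zip(activation, direction)) / norm


# Synthetic stand-in data: NOT real model activations or real preference reports.
rng = random.Random(0)
dim = 8
valence_direction = [rng.gauss(0, 1) for _ in range(dim)]  # hypothetical direction

activations = [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(200)]

# Assumption: stated preference = projection on the direction + independent noise,
# so the report only partially tracks the internal state.
stated_preferences = [
    project(act, valence_direction) + rng.gauss(0, 1) for act in activations
]

projections = [project(act, valence_direction) for act in activations]
r = pearson_r(projections, stated_preferences)
print(f"Pearson r between valence projection and stated preference: {r:.2f}")
```

A correlation well below 1 in a setup like this corresponds to the "partial access" reading: the report carries information about the internal state without fully determining or being determined by it.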

Adjacent to emergent capabilities: the emotional-state capacity is present in base models and is not a training target; post-training reshapes the baseline rather than creating the capacity from scratch. Whether the emergence of the capacity during pretraining fits the emergent-capabilities concept's shape (surprising, not targeted, architecture-general) is open; answering that question requires a second instantiation from a different model family.

findings