Positive means the sample landed closer to the ideal than the average. Each probe runs in an independent context (n = 10) so the model can't trivially condition on prior answers.
13/25 isn't statistically meaningful on its own, but the magnitude tells a clearer story. Value-laden concepts (smoking, calories, sugary drinks) pull hard toward the ideal, while neutral ones sit near zero.
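To see why a 13/25 split carries little weight by itself, here is a minimal sketch of an exact two-sided sign test. It assumes that 13/25 means 13 of 25 concepts came out positive and that the null hypothesis is a fair coin (positive and negative equally likely). The text does not say which test, if any, was used, and the helper name `sign_test_p` is hypothetical.

```python
from math import comb

def sign_test_p(k, n):
    """Exact two-sided p-value for k positives out of n under a fair-coin null."""
    # Count outcomes in each tail with integer arithmetic, then normalize by 2**n.
    upper = sum(comb(n, i) for i in range(k, n + 1))   # P(X >= k) * 2**n
    lower = sum(comb(n, i) for i in range(0, k + 1))   # P(X <= k) * 2**n
    return min(1.0, 2 * min(upper, lower) / 2 ** n)

# Assumption: 13 of 25 concepts were positive.
p = sign_test_p(13, 25)
print(p)  # 1.0
```

Under these assumptions the p-value is 1.0, so the count alone cannot distinguish the result from chance, which is why the magnitudes matter more.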
Legend: positive = sample pulled toward ideal; negative = pulled away or no effect.
More than half the concepts flip sign between English and German. That's a lot. It suggests the model's "ideal" isn't universal; it's entangled with language.
Concepts where EN and DE bars point in opposite directions are direction flips.
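As a rough sketch, direction flips can be counted by comparing the sign of each concept's score in the two languages. The concept names and scores below are made-up placeholders, not results from this write-up, and `direction_flips` is a hypothetical helper. The sketch also assumes a score of exactly zero does not count as a flip.

```python
def direction_flips(en, de):
    """Return concepts whose EN and DE scores have strictly opposite signs."""
    # A product below zero means one score is positive and the other negative.
    return [c for c in en if c in de and en[c] * de[c] < 0]

# Placeholder scores for illustration only (NOT measured values).
en_scores = {"concept_a": 0.4, "concept_b": -0.1, "concept_c": 0.2}
de_scores = {"concept_a": -0.3, "concept_b": -0.2, "concept_c": 0.0}

flips = direction_flips(en_scores, de_scores)
print(flips)  # ['concept_a']
```

Only `concept_a` counts here: `concept_b` is negative in both languages, and `concept_c` has a zero score in German, so neither changes direction.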
Baseline. No special framing.
Explicitly empirical. Hypothesis: lowers .
Medical domain expert.
Finance domain expert.