Shouty describes a specific, unpleasant forwardness in the upper midrange and lower presence region — roughly 1 to 4 kHz, with listeners often most sensitive in the 2 to 4 kHz band. When this region is pushed too hard, vocals and instruments sound like they're yelling: pushed into your face, aggressive, and fatiguing, as if the headphone is raising its voice at you. Belted vocals, horns, electric guitars, saxophone, and violin are where it shows up most viciously.
What makes shouty worth its own entry is that it's distinct from harsh and from bright, even though casual reviews lump the three together. Shouty is a midrange phenomenon — the vocal and presence region overdone — and the key insight is that shout lives below the treble: a headphone can be shouty even with a dark, rolled-off top end, because the problem is lower down. Harsh is higher up — grit, edge, and roughness in the upper presence and treble, often the 5 to 8 kHz sibilance zone — an unpleasant roughness in the highs rather than a forwardness in the mids. Bright is simply a tonal tilt, elevated treble overall, and isn't even necessarily unpleasant.
So the three can appear in any combination: a shouty-but-not-bright headphone (forward mids, relaxed treble), a bright-but-not-shouty one (lifted treble, well-judged mids), and a harsh one that's neither (smooth mids, gritty highs). They're three different problems in three different places, which is why pinning the right label on a sound actually helps you predict what will and won't bother you.
Shout is also about balance and context, not just the level of one band. The same 3 kHz energy can sound fine if there's enough bass and lower-mid body underneath it, and shouty if the lower mids are lean and the upper mids are forward — which is why two headphones with similar upper-mid peaks don't always sound equally shouty. Volume matters too: an upper-mid-forward tuning often sounds clear and vocal-forward at low levels and starts filing complaints with your nervous system as you turn it up.
Tuning this region is a genuine tightrope. Too much and it's shouty and unpleasant; too little and the sound goes unclear and unengaging, because the upper mids are also where intelligibility and presence come from. A close cousin is honky or nasal — a related upper-mid coloration that makes the sound hollow or cupped rather than simply loud-in-your-face.
For a newcomer, the cleanest way to recognize shout is a powerful female vocal or a brass-heavy track played a little loud: if the singer seems to lunge forward and the sound makes you want to turn it down right as they belt, that's shoutiness — and notice that it's the vocal region doing it, not the cymbals or the sibilants.