Beyond the Surface: Probing the Ideological Depth of Large Language Models
PositiveArtificial Intelligence
Large language models (LLMs) exhibit distinct political leanings, but their consistency in representing these orientations varies. This study introduces the concept of ideological depth, defined by a model's ability to follow political instructions reliably and the richness of its internal political representations, assessed using sparse autoencoders. The research compares Llama-3.1-8B-Instruct and Gemma-2-9B-IT, revealing that Gemma is significantly more steerable and activates approximately 7.3 times more distinct political features than Llama.
— via World Pulse Now AI Editorial System
