CountSteer: Steering Attention for Object Counting in Diffusion Models
Positive | Artificial Intelligence
CountSteer is a new method for making text-to-image diffusion models generate the number of objects a prompt asks for. These models typically struggle with numerical instructions, yet the research indicates that they are implicitly aware of their own counting accuracy. CountSteer exploits this by steering the model's cross-attention hidden states during inference, which improves object-count accuracy by 4% without sacrificing visual quality.
— via World Pulse Now AI Editorial System
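
To make the idea of steering hidden states at inference time concrete, here is a minimal NumPy sketch of generic activation steering. It is an illustration under stated assumptions, not CountSteer's actual implementation: the summary does not say how the steering direction is obtained, so the difference-of-means direction below, the function names `steering_vector` and `steer`, the `strength` parameter, and the synthetic data are all hypothetical.

```python
import numpy as np


def steering_vector(correct_states, incorrect_states):
    """Assumed recipe: mean hidden state of correct-count runs
    minus mean hidden state of incorrect-count runs."""
    return correct_states.mean(axis=0) - incorrect_states.mean(axis=0)


def steer(hidden, direction, strength=1.0):
    """Shift every token's hidden state along the unit-normalised direction."""
    unit = direction / (np.linalg.norm(direction) + 1e-8)
    return hidden + strength * unit


# Synthetic stand-ins for cross-attention hidden states (hypothetical data).
rng = np.random.default_rng(0)
dim = 8
correct = rng.normal(loc=0.5, size=(16, dim))     # runs that matched the count
incorrect = rng.normal(loc=-0.5, size=(16, dim))  # runs that missed the count

direction = steering_vector(correct, incorrect)

hidden = rng.normal(size=(4, dim))  # hidden states for 4 prompt tokens
steered = steer(hidden, direction, strength=2.0)
```

In a real diffusion pipeline the `steer` step would run inside the cross-attention layers during each denoising pass. No weights change, so an adjustment of this kind needs no retraining.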
