Cameras as Relative Positional Encoding
NeutralArtificial Intelligence
- Projective Positional Encoding (PRoPE)
- that captures complete camera frustums, both intrinsics and extrinsics, as a relative positional encoding. Our experiments begin by showing how relative camera conditioning improves performance in feedforward novel view synthesis, with further gains from PRoPE. This holds across settings: scenes with both shared and varying intrinsics, when combining token
- and attention
— via World Pulse Now AI Editorial System