PicoSAM2: Low-Latency Segmentation In-Sensor for Edge Vision Applications
PositiveArtificial Intelligence
The introduction of PicoSAM2 represents a significant leap in real-time, on-device segmentation technology, crucial for latency-sensitive applications like smart glasses and IoT devices. This lightweight model, with just 1.3 million parameters and optimized for the Sony IMX500 sensor, achieves impressive performance metrics, including 51.9% mIoU on the COCO dataset and 44.9% on LVIS. Its quantized version, at only 1.22MB, operates at 14.3 ms, making it uniquely suited for in-sensor deployment. The model's ability to perform knowledge distillation has further enhanced its capabilities, boosting LVIS performance by 3.5% mIoU and 5.1% mAP. By enabling privacy-preserving vision without the need for cloud or host processing, PicoSAM2 paves the way for more secure and efficient edge computing solutions, addressing growing concerns over data privacy in an increasingly connected world.
— via World Pulse Now AI Editorial System