arXiv

SEAOTTER: Sensor Embedded Autoencoding with One-Time Transcode for Efficient Reconstruction

Title: SEAOTTER: Leveraging Sensor Embedded Autoencoding and One-Time Transcoding for Streamlined Reconstruction

Abstract:

Robotic platforms equipped with inexpensive, energy-efficient hardware can readily acquire high-resolution visual data. However, the utility of this data is often stifled by constrained on-device processing power and narrow bandwidth, which hinder the effectiveness of traditional compression standards such as JPEG and MPEG. Although next-generation codecs like AV1 and AVIF offer superior rate-distortion performance, their computational intensity makes them unfeasible for deployment without specialized application-specific integrated circuits (ASICs). While recent asymmetric autoencoder architectures achieve high-fidelity results under strict power and bandwidth limitations, they impose excessive decoding burdens and rely on proprietary formats, thereby disregarding the extensive infrastructure developed around established standards like JPEG.

To overcome these challenges, we present SEAOTTER (Sensor Embedded Autoencoding with One-Time Transcode for Efficient Reconstruction), a compression framework tailored for cloud robotics. Recognizing that the sensor, cloud, and consumer endpoints operate under vastly different resource constraints, SEAOTTER merges the data efficiency of learned latent representations with the widespread compatibility of standard JPEG files. We address the performance degradation typically associated with naive transcoding by introducing a learnable transform for JPEG color spaces and quantization. This approach significantly enhances the accuracy of global, dense, and vision-language-based perception tasks.

By training both general-purpose and task-specific transcoding pipelines on a pre-trained, frozen encoder, SEAOTTER achieves substantial efficiency gains. When compared to AVIF at a 200:1 compression ratio, our method delivers encoding speeds seven times faster and decoding speeds 3.5 times faster, while also improving ImageNet top-1 accuracy by 8%. Crucially, these performance boosts are achieved without sacrificing compatibility with existing JPEG infrastructure. The source code is publicly accessible at https://github.com/UT-SysML/seaotter.


Source: arXiv Generated at: 2026-06-03 00:00:00 UTC

Related Articles

TikTok Billionaire Tops Ambani as Asia’s Second-Richest
Bloomberg

TikTok Billionaire Tops Ambani as Asia’s Second-Richest

TikTok founder surpasses Mukesh Ambani to become Asia’s second-richest person, marking a significant shift in the region...

Publishers in UK can opt out of Google AI search results
BBC News

Publishers in UK can opt out of Google AI search results

UK publishers can now opt out of Google’s AI search summaries, a CMA ruling designed to boost their bargaining power and...

Kioxia Edges Nearer Toyota’s Market Cap in Shakeup to Japan Inc.
Bloomberg

Kioxia Edges Nearer Toyota’s Market Cap in Shakeup to Japan Inc.

Kioxia’s market cap nears Toyota’s, signaling a major shift in Japan’s corporate hierarchy. This narrowing gap highlight...

Reuters

Morning Bid: Marvell, a fitting name for the latest AI darling

Reuters highlights Marvell as a top AI stock, noting its name perfectly suits its status as the newest market darling.

Financial Times

Tim Hayward: I built the Jaguar E-Type of computer keyboards

Tim Hayward compares his bespoke keyboard designs to the Jaguar E-Type. He explores high-end customization for personal ...

Financial Times

AI Labs: Zuckerberg’s $100bn gamble

Meta’s $100 billion AI investment aims to secure AI dominance, but questions remain whether sheer spending can outpace c...