GDIP: Object-Detection in Adverse Weather Conditions Using Gated Differentiable Image Processing

Sanket Kalwar¹*

Dhruv Patel¹*

Aakash Aanegola¹

Sourav Garg²

K Madhava Krishna¹

Accepted at ICRA 2023

* denotes equal contribution
1 Robotics Research Centre, IIIT Hyderabad, India.
2 Australian Centre for Robotic Vision at the Queensland University of Technology (QUT), Brisbane, Australia.
3 ZF Friedrichshafen TCI, Hyderabad, India

[Paper]

[GitHub]

Motivation

Gated Differentiable Image Processing (GDIP) block

real-world foggy (RTTS dataset) and dark (ExDark dataset) images with annotations

Overview

We present Gated Differentiable Image Processing (GDIP) block, a domain-agnostic architecture which adaptively enhances adverse images for object detection by performing concurrent gated-weighing of image processing operations.
GDIP provides flexible concurrent gated weighting of the individual IP operations that result in superior detection performance.
GDIP block can be integrated as single or multi-level enhancement module with an encoder
In addition, GDIP block has a unique advantage with its utility as a training regularizer, which directly improves object detection training for adverse conditions. This eliminates any image enhancement overhead during inference, unlike other state-of-the-art works, thus resulting in higher throughput.

Modules

Using our GDIP block, we propose novel variants as below:

GDIP-Yolo: a single GDIP block (fed with an adverse input and latent embeddings from an encoder) can be plugged into existing object detection networks (e.g., Yolo) and trained end-to-end with adverse condition images (fog and low-lighting).

GDIP-Yolo Forward Pass
Multi-level GDIP-Yolo (MGDIP-Yolo): a multi-level version of GDIP where an image is progressively enhanced through multiple GDIP blocks, each guided by a different layer of the image encoder. Providing access to such multiple feature scales helps it utlilize the local/global properties to selectively apply Image Processing operations.

MGDIP-Yolo Forward Pass
GDIP as regularizer: an adaptation of GDIP as training regularizer, which directly improves object detection training by learning weather-invariant features. It can be removed during inference, thus saving compute time with improved performance in adverse conditions.

GDIP as regularizer

Quantitative Analysis

Foggy setting

Quantitative results for foggy conditions on the V_N_Ts (VOCNormal Test set), V_F_Ts (VOCFoggy Test set) and realworld RTTS dataset. Best and second best mAP scores are bold and italicized, respectively

Low-lighting setting

Quantitative results for low-lighting conditions on the V_N_Ts (VOCNormal Test set), V_D_Ts (VOCDark Test set) and real-world ExDark dataset. Best and second best mAP scores are bold and italicized, respectively.

**NOTE:** IA-Yolo uses a prior (i.e. IA-Yolo (with prior)) for low-lighting setting by removing the defogging and white balance filters. Hence, we evaluate IA-Yolo (w/o prior) by incorporating the defogging and white balance filters (their same pipeline as in case of foggy setting) to do fair comparison with our approach.

GDIP in the Wild

GDIP runs seamlessly on video sequences in the wild without any need for retraining or fine tuning, showcasing its ability to detect objects by composing Image Processing operations through gating by training on only synthetic adverse data.

**Note:** how as the car moves from complete darkness to natural lighting, the gate firing changes from only gamma to gamma, defog and tone.

Contact

If you have any questions, please reach out to any of the above mentioned authors.