SUP'COM

Doctorate thesis defense of Walid Messaoud

Publié le 13-12-2024

Attention Mechanism for Deep Learning in the Analysis of Maritime Environment

Presented by

Walid Messaoud

On 14/12, 2024 at .09.H, in Amphitheater Ibn Roched, SUP'COM 2.

Thesis Committee

President	Pr. Rabeh Attia	Polytechnic School, Tunisia
Reviewers	Pr. Mounir Sayadi	ENSIT, Tunisia
	PR. Mahmoud Melkemi	ESIGELEC, France
Examiner	Dr. Sameh Najeh	SUP'COM, Tunisia
Supervisor	Pr. Fatma Abdelkefi	SUP'COM, Tunisia
Co-supervisor	Pr. Adnane Cabani	ESIGELEC, France
Co-supervisor	Dr Rim Trabelsi	University of Gabes, Tunisia

Abstract

This research introduces the pioneering progress of object detection in the maritime field, to address its distinct challenges and push technological boundaries to enhance security, ensure navigation safety, and preserve the environment. The complexity of the maritime setting, characterized by a variety of dynamic objects, diverse vessel outlines, occlusions, and constraints within annotated datasets, presents distinct challenges. In response, we undertake a groundbreaking implementation of attention mechanisms as a key solution, directing attention to crucial image regions and adapting to the ever-changing conditions of the maritime setting.

We lead the way in applying Multi-Head Self Attention (MHSA) in the maritime realm, conducting a comprehensive evaluation of various maritime datasets to identify the most suitable ones. This pioneering effort represents a significant leap forward in improving object detection within the maritime environment. Our evaluation covers six cutting-edge models that incorporate attention mechanisms and the Low-Visibility Enhancement Network (LVENet). Utilizing maritime datasets such as SeaShips and the Singapore Maritime Dataset (SMD), our approach demonstrates substantial improvements in mean Average Precision (mAP) scores.

LVENet, designed specifically to address challenges in scenarios with reduced visibility, significantly reinforces object detection. This improvement leads to a varying mean precision improvement across a spectrum of models. Our comprehensive assessment not only marks a pivotal achievement in maritime object detection but also underscores advancements in both precision and simplicity, advocating for enhanced deployment strategies for these models in real-world maritime applications.

Furthermore, we highlight the effectiveness of a proposed adaptive pruning approach, derived from Layer-wise Relevance Propagation (LRP) in attention mechanism-based models: Bottleneck Transformers (BoTNet) and Attention Augmented Convolutional Networks (AACN). Our experiments encompass benchmarks, such as ImageNet, Pascal VOC, Canadian Institute for Advanced Research-10 (CIFAR-10), and Common Objects in Context (COCO) datasets, confirming the versatility of our approach. Notably, we extend our assessment to the maritime domain, proving the effectiveness of adaptive pruning in models tailored for maritime applications, specifically BoTNet and AACN. This validation showcases an average epoch time reduction of over 30\%, preserving accuracy and highlighting the adaptability of our proposed method in enhancing computational efficiency for maritime-related tasks.

In addressing the limited datasets in the maritime domain, our exploration delves into the realm of GANs for latent space manipulation. Our proposed method leverages a sophisticated framework that integrates a classifier, auxiliary mapping composed of transformer blocks, and the implementation of conjugate gradient. This novel approach enables precise attribute modification, showcasing its efficacy in manipulating latent spaces. Subsequently, our research presents compelling results in facial attribute manipulation using FFHQ and CelebAHQ datasets, demonstrating remarkable superiority by exceeding the state-of-the-art approaches in terms of Manipulation Disentanglement Score (MDS). Extending our method to the maritime domain, we pioneer advancements by surpassing the state-of-the-art Surrogate Gradient Field (SGF) method. Specifically, our GAN-based approach achieves substantial improvements using SeaShips and SMD datasets. These results highlight our method's remarkable strides beyond the existing benchmarks, illustrating its unparalleled efficiency in both facial attribute editing and maritime image manipulation. Additionally, we prove the versatility of our approach by conducting extensive experiments with different versions of StyleGAN, showcasing its adaptability across various GAN architectures. Our proposed approach pioneers GAN's generation of real-world images in the maritime environment, establishing new frontiers in the application of GANs.

Keywords Deep Learning, Attention Mechanism, Object Detection, Image Generation, GAN, BOTNet, AACN, MHSA, Neural Network, Latent Space.

Début
12-12-2024 / 10:24
Fin
21-12-2024 /10:24
Localisation
SUP'COM

Doctorate thesis defense of Walid Messaoud

Accueil Évènements Doctorate thesis defense of Walid Messaoud

Abstract

Début

Fin

Localisation

S'abonner