Publication

Visual question answering on remote sensing images

Devis Tuia
2024
Book Chapters

Résumé

Remote sensing visual question answering (RSVQA) aims at predicting an answer to a question (both in natural language) about an overhead image. Through natural language processing, this task allows end users to extract high-level information from remote sensing data. In this chapter, we discuss some of the works that have been proposed for RSVQA. We first systematically review eight existing datasets that can be used to train and evaluate RSVQA models. We then examine contributions on RSVQA models on the visual, language, fusion of modalities and answer prediction parts. Finally, we discuss new research directions that could be pursued to advance the field.

Source officielle

https://infoscience.epfl.ch/entities/publication/9de17a65-74ce-42dd-a9b0-b4edca094bb9

À propos de ce résultat

Cette page est générée automatiquement et peut contenir des informations qui ne sont pas correctes, complètes, à jour ou pertinentes par rapport à votre recherche. Il en va de même pour toutes les autres pages de ce site. Veillez à vérifier les informations auprès des sources officielles de l'EPFL.