Eirik Østmo / Torger Grytå

VI seminar #28 - Self-supervised Vision Transformers for Land-cover Segmentation and Classification

The program will be available shortly. Please check back later.

Self-supervised Vision Transformers for Land-cover Segmentation and Classification

Presenters: Linus Scheibenrei and Joëlle Hanna, University of St. Gallen (Switzerland)

Linus Scheibenrei
Jöelle Hanna

Transformer models have recently approached or even surpassed the performance of ConvNets on computer vision tasks like classification and segmentation with large scale supervised pre-training. In this work, we bridge the gap between ConvNets and Transformers for Earth observation by self-supervised pre-training on large-scale unlabeled remote sensing data. The resulting representations can be utilized for both land cover classification and segmentation tasks, where they significantly outperform the fully supervised baselines and require only a fraction of the labeled training data.

In compliance with GDPR consent requirements, presentations given in a Visual Intelligence context may be recorded with the consent of the speaker. All recordings are edited to remove all faces, names and voices of other participants. Questions and comments by the audience will hence be removed and will not appear in the recording.  With the freely given consent from the speaker, recorded presentation may be posted on the Visual Intelligence YouTube channel.

This seminar is open for members of the consortium. If you want to participate as a guest please sign up.

Sign up here