Context and dependencies

Visual Intelligence aims to develop new deep learning methodology which exploit context, dependencies and prior knowledge in deep learning.

Motivation

Most of current deep learning systems for image analysis depend on individual pixel information, capturing solely neighborhood dependencies via convolutions. This means that the ability to incorporate context and prior knowledge, e.g. about the topology or geometry of objects, is limited.  

The ability to conform to physical models, and principles governing the image data generation and its properties is also limited, including modelling of spatial and temporal dependencies and processes. In most applications, additional expert knowledge is available but not expressed in a form understandable by a machine. Moreover, in some applications, additional data could be incorporated in the learning process. These data may come from other sources and be related indirectly to the learning task.

Solving research challenges through new deep learning methodology

Visual Intelligence has proposed new innovative methods for including expert knowledge and additional data in deep learning models. Such results include the following:

• A method for locating key points in mammograms using graph convolutional networks.

• A method which includes contextual information about depth and time-of-day into a segmentation network in marine acoustics.

• A method for combining deep learning models and Hough transform for building detection.

These methods successfully incorporate dependencies and prior knowledge into the deep learning systems. By exploiting these geometric or contextual constraints, the search space of functions which a network aims to estimate can be significantly reduced. This in turn allows for more efficient training, in particular in a setting with only limited available training data, and results in solutions aligned with the task and application.

Highlighted publications

Understanding Deep Learning via Generalization and Optimzation Analysis for Accenerated SGD
November 15, 2024
We provide a theoretical understanding on the generalization error of momentum-based accelerated variants of stochastic gradient descent.
Visual Data Diagnosis and Debiasing with Concept Graphs
October 17, 2024
We propose ConBias, a bias diagnosis and debiasing pipeline for visual datasets.
Reinventing Self-Supervised Learning: The Magic of Memory in AI Training
October 17, 2024
MaSSL is a novel approach to self-supervised learning that enhances training stability and efficiency.