Context and dependencies

Visual Intelligence aims to develop new deep learning methodology which exploit context, dependencies and prior knowledge in deep learning.

Motivation

Most of current deep learning systems for image analysis depend on individual pixel information, capturing solely neighborhood dependencies via convolutions. This means that the ability to incorporate context and prior knowledge, e.g. about the topology or geometry of objects, is limited.

The ability to conform to physical models, and principles governing the image data generation and its properties is also limited, including modelling of spatial and temporal dependencies and processes. In most applications, additional expert knowledge is available but not expressed in a form understandable by a machine. Moreover, in some applications, additional data could be incorporated in the learning process. These data may come from other sources and be related indirectly to the learning task.

Solving research challenges through new deep learning methodology

Visual Intelligence has proposed new innovative methods for including expert knowledge and additional data in deep learning models. Such results include the following:

• A method for locating key points in mammograms using graph convolutional networks.

• A method which includes contextual information about depth and time-of-day into a segmentation network in marine acoustics.

• A method for combining deep learning models and Hough transform for building detection.

These methods successfully incorporate dependencies and prior knowledge into the deep learning systems. By exploiting these geometric or contextual constraints, the search space of functions which a network aims to estimate can be significantly reduced. This in turn allows for more efficient training, in particular in a setting with only limited available training data, and results in solutions aligned with the task and application.

‍

Highlighted publications

No items found.

Other publications

Are nuclear masks all you need for improved out-of-domain generalisation? A closer look at cancer classification in histopathology

By authors:

Dhananjay Tomar, Alexander Binder, Andreas Kleppe

Published in:

Advances in Neural Information Processing Systems 2024

September 24, 2024

PSAIR: A Neuro-Symbolic Approach to Zero-Shot Visual Grounding

By authors:

Pan, Yi; Zhang, Yujia; Kampffmeyer, Michael Christian; Zhao, Xiaoguang.

Published in:

2024 International Joint Conference on Neural Networks (IJCNN). IEEE conference proceedings 2024 ISBN 979-8-3503-5931-2

September 9, 2024

View all Publications

Context and dependencies

Summary

Summary

Key partners

Key partners

Principal investigators

Anne Solberg

Arnt-Børre Salberg

Elisabeth Wetzer

Principal investigators

Anne Solberg

Arnt-Børre Salberg

Elisabeth Wetzer

Key Personnel

Key Personnel