Search Results for author: Mark O'Connor

Found 4 papers, 4 papers with code

Extending GCC-PHAT using Shift Equivariant Neural Networks

1 code implementation • 9 Aug 2022 • Axel Berg, Mark O'Connor, Kalle Åström, Magnus Oskarsson

Speaker localization using microphone arrays depends on accurate time delay estimation techniques.

Points to Patches: Enabling the Use of Self-Attention for 3D Shape Recognition

1 code implementation • 8 Apr 2022 • Axel Berg, Magnus Oskarsson, Mark O'Connor

While the Transformer architecture has become ubiquitous in the machine learning field, its adaptation to 3D shape recognition is non-trivial.

3D Feature Matching, 3D Point Cloud Classification, +2

Keyword Transformer: A Self-Attention Model for Keyword Spotting

9 code implementations • 1 Apr 2021 • Axel Berg, Mark O'Connor, Miguel Tairum Cruz

The Transformer architecture has been successful across many domains, including natural language processing, computer vision and speech recognition.

Ranked #5 on Keyword Spotting on Google Speech Commands (using extra training data)

Keyword Spotting, Speech Recognition

Deep Ordinal Regression with Label Diversity

1 code implementation • 29 Jun 2020 • Axel Berg, Magnus Oskarsson, Mark O'Connor

By discretizing the target into a set of non-overlapping classes, it has been shown that training a classifier can improve neural network accuracy compared to using a standard regression approach.

Ranked #2 on Head Pose Estimation on BIWI (MAE (trained with BIWI data) metric)

Age Estimation, Head Pose Estimation, +2
