Search Results for author: Masahiro Yasuda

Found 13 papers, 6 papers with code

Guided Masked Self-Distillation Modeling for Distributed Multimedia Sensor Event Analysis

no code implementations • 12 Apr 2024 • Masahiro Yasuda, Noboru Harada, Yasunori Ohishi, Shoichiro Saito, Akira Nakayama, Nobutaka Ono

This is because the information obtained from a single sensor is often missing or fragmented in such an environment; observations from multiple locations and modalities should be integrated to analyze events comprehensively.

Paper
Add Code

6DoF SELD: Sound Event Localization and Detection Using Microphones and Motion Tracking Sensors on self-motioning human

no code implementations • 4 Mar 2024 • Masahiro Yasuda, Shoichiro Saito, Akira Nakayama, Noboru Harada

A system trained only with a dataset using microphone arrays in a fixed position would be unable to adapt to the fast relative motion of sound events associated with self-motion, resulting in the degradation of SELD performance.

Sound Event Localization and Detection

Paper
Add Code

First-shot anomaly sound detection for machine condition monitoring: A domain generalization baseline

1 code implementation • 1 Mar 2023 • Noboru Harada, Daisuke Niizumi, Yasunori Ohishi, Daiki Takeuchi, Masahiro Yasuda

This paper provides a baseline system for First-shot-compliant unsupervised anomaly detection (ASD) for machine condition monitoring.

Domain Generalization Task 2 +1

Paper
Code

Multi-view and Multi-modal Event Detection Utilizing Transformer-based Multi-sensor fusion

1 code implementation • 18 Feb 2022 • Masahiro Yasuda, Yasunori Ohishi, Shoichiro Saito, Noboru Harada

We tackle a challenging task: multi-view and multi-modal event detection that detects events in a wide-range real environment by utilizing data from distributed cameras and microphones and their weak labels.

Event Detection Sensor Fusion

Paper
Code

Echo-aware Adaptation of Sound Event Localization and Detection in Unknown Environments

1 code implementation • 18 Feb 2022 • Masahiro Yasuda, Yasunori Ohishi, Shoichiro Saito

Our goal is to develop a sound event localization and detection (SELD) system that works robustly in unknown environments.

Domain Adaptation Sound Event Localization and Detection

Paper
Code

Wearable SELD dataset: Dataset for sound event localization and detection using wearable devices around head

1 code implementation • 17 Feb 2022 • Kento Nagatomo, Masahiro Yasuda, Kohei Yatabe, Shoichiro Saito, Yasuhiro Oikawa

Sound event localization and detection (SELD) is a combined task of identifying the sound event and its direction.

Sound Event Localization and Detection

Paper
Code

APPLADE: Adjustable Plug-and-play Audio Declipper Combining DNN with Sparse Optimization

no code implementations • 16 Feb 2022 • Tomoro Tanaka, Kohei Yatabe, Masahiro Yasuda, Yasuhiro Oikawa

Still, they cannot perform well if the training data have mismatches and/or constraints in the time domain are not imposed.

Audio declipping

Paper
Add Code

ToyADMOS2: Another dataset of miniature-machine operating sounds for anomalous sound detection under domain shift conditions

7 code implementations • 4 Jun 2021 • Noboru Harada, Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Masahiro Yasuda, Shoichiro Saito

This paper proposes a new large-scale dataset called "ToyADMOS2" for anomaly detection in machine operating sounds (ADMOS).

Anomaly Detection

Paper
Code

Audio Captioning using Pre-Trained Large-Scale Language Model Guided by Audio-based Similar Caption Retrieval

no code implementations • 14 Dec 2020 • Yuma Koizumi, Yasunori Ohishi, Daisuke Niizumi, Daiki Takeuchi, Masahiro Yasuda

Then, the caption of the audio input is generated by using a pre-trained language model while referring to the guidance captions.

Audio captioning Language Modelling +1

Paper
Add Code

A Transformer-based Audio Captioning Model with Keyword Estimation

no code implementations • 1 Jul 2020 • Yuma Koizumi, Ryo Masumura, Kyosuke Nishida, Masahiro Yasuda, Shoichiro Saito

TRACKE estimates keywords, which comprise a word set corresponding to audio events/scenes in the input audio, and generates the caption while referring to the estimated keywords to reduce word-selection indeterminacy.

Acoustic Scene Classification Audio captioning +2

Paper
Add Code

Description and Discussion on DCASE2020 Challenge Task2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring

3 code implementations • 10 Jun 2020 • Yuma Koizumi, Yohei Kawaguchi, Keisuke Imoto, Toshiki Nakamura, Yuki Nikaido, Ryo Tanabe, Harsh Purohit, Kaori Suefusa, Takashi Endo, Masahiro Yasuda, Noboru Harada

The main challenge of this task is to detect unknown anomalous sounds under the condition that only normal sound samples have been provided as training data.

Task 2

Paper
Code

DOA Estimation by DNN-based Denoising and Dereverberation from Sound Intensity Vector

no code implementations • 10 Oct 2019 • Masahiro Yasuda, Yuma Koizumi, Luca Mazzon, Shoichiro Saito, Hisashi Uematsu

We propose a direction of arrival (DOA) estimation method that combines sound-intensity vector (IV)-based DOA estimation and DNN-based denoising and dereverberation.

Denoising

Paper
Add Code

First Order Ambisonics Domain Spatial Augmentation for DNN-based Direction of Arrival Estimation

no code implementations • 10 Oct 2019 • Luca Mazzon, Yuma Koizumi, Masahiro Yasuda, Noboru Harada

The same transformation is applied also to the labels, in order to maintain consistency between input data and target labels.

Data Augmentation Direction of Arrival Estimation

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.