Releases: nttcslab/dcase2025_task4_baseline

v1.1.1

03 Jun 11:19
d8f90cf

This version, v1.1.1, is a minor update to v1.1.0 and supports the evaluation dataset used in DCASE2025T4. The download instructions for the evaluation dataset have been updated.

This is the baseline system implementation for the DCASE2025 Challenge Task 4: Spatial Semantic Segmentation of Sound Scenes (DCASE2025T4).

If you use this system, please cite the following papers:

  • Binh Thien Nguyen, Masahiro Yasuda, Daiki Takeuchi, Daisuke Niizumi, Yasunori Ohishi, Noboru Harada, “Baseline Systems and Evaluation Metrics for Spatial Semantic Segmentation of Sound Scenes,” in arXiv preprint arXiv:2503.22088, 2025, available at URL.

  • Masahiro Yasuda, Binh Thien Nguyen, Noboru Harada, Romain Serizel, Mayank Mishra, Marc Delcroix, Shoko Araki, Daiki Takeuchi, Daisuke Niizumi, Yasunori Ohishi, Tomohiro Nakatani, Takao Kawamura, Nobutaka Ono, “Description and discussion on DCASE 2025 challenge task 4: Spatial Semantic Segmentation of Sound Scenes,” in arXiv preprint arXiv:xxxx.xxxx, 2025, available at URL.

v1.1.0

02 Jun 12:07
0f453d7

This version, v1.1.0, adds support for the evaluation dataset used in DCASE2025T4.

This is the baseline system implementation for the DCASE2025 Challenge Task 4: Spatial Semantic Segmentation of Sound Scenes (DCASE2025T4).

If you use this system, please cite the following papers:

  • Binh Thien Nguyen, Masahiro Yasuda, Daiki Takeuchi, Daisuke Niizumi, Yasunori Ohishi, Noboru Harada, “Baseline Systems and Evaluation Metrics for Spatial Semantic Segmentation of Sound Scenes,” in arXiv preprint arXiv:2503.22088, 2025, available at URL.

  • Masahiro Yasuda, Binh Thien Nguyen, Noboru Harada, Romain Serizel, Mayank Mishra, Marc Delcroix, Shoko Araki, Daiki Takeuchi, Daisuke Niizumi, Yasunori Ohishi, Tomohiro Nakatani, Takao Kawamura, Nobutaka Ono, “Description and discussion on DCASE 2025 challenge task 4: Spatial Semantic Segmentation of Sound Scenes,” in arXiv preprint arXiv:xxxx.xxxx, 2025, available at URL.

v1.0.1

25 Apr 11:34
37cb1a4

This version, v1.0.1, is a minor update to the previous release, v1.0.0.
It adds config variants for other GPU types and numbers of GPUs.

This is the baseline system implementation for the DCASE2025 Challenge Task 4: Spatial Semantic Segmentation of Sound Scenes (DCASE2025T4).

If you use this system, please cite the following papers:

  • Binh Thien Nguyen, Masahiro Yasuda, Daiki Takeuchi, Daisuke Niizumi, Yasunori Ohishi, Noboru Harada, “Baseline Systems and Evaluation Metrics for Spatial Semantic Segmentation of Sound Scenes,” in arXiv preprint arXiv:2503.22088, 2025, available at URL.

  • Masahiro Yasuda, Binh Thien Nguyen, Noboru Harada, Romain Serizel, Mayank Mishra, Marc Delcroix, Shoko Araki, Daiki Takeuchi, Daisuke Niizumi, Yasunori Ohishi, Tomohiro Nakatani, Takao Kawamura, Nobutaka Ono, “Description and discussion on DCASE 2025 challenge task 4: Spatial Semantic Segmentation of Sound Scenes,” in arXiv preprint arXiv:xxxx.xxxx, 2025, available at URL.

v1.0.0

02 Apr 08:50
bd91507

This is the baseline system implementation for the DCASE2025 Challenge Task 4: Spatial Semantic Segmentation of Sound Scenes (DCASE2025T4).

If you use this system, please cite the following papers:

  • Binh Thien Nguyen, Masahiro Yasuda, Daiki Takeuchi, Daisuke Niizumi, Yasunori Ohishi, Noboru Harada, “Baseline Systems and Evaluation Metrics for Spatial Semantic Segmentation of Sound Scenes,” in arXiv preprint arXiv:2503.22088, 2025, available at URL.

  • Masahiro Yasuda, Binh Thien Nguyen, Noboru Harada, Romain Serizel, Mayank Mishra, Marc Delcroix, Shoko Araki, Daiki Takeuchi, Daisuke Niizumi, Yasunori Ohishi, Tomohiro Nakatani, Takao Kawamura, Nobutaka Ono, “Description and discussion on DCASE 2025 challenge task 4: Spatial Semantic Segmentation of Sound Scenes,” in arXiv preprint arXiv:xxxx.xxxx, 2025, available at URL.