EVENT MERL Contributes to ICASSP 2025

Date released: April 2, 2025

EVENT MERL Contributes to ICASSP 2025
Date:

Sunday, April 6, 2025 - Friday, April 11, 2025
Location:

Hyderabad, India
Description:

MERL has made numerous contributions to both the organization and technical program of ICASSP 2025, which is being held in Hyderabad, India from April 6-11, 2025.

Sponsorship

MERL is proud to be a Silver Patron of the conference and will participate in the student job fair on Thursday, April 10. Please join this session to learn more about employment opportunities at MERL, including openings for research scientists, post-docs, and interns.

MERL is pleased to be the sponsor of two IEEE Awards that will be presented at the conference. We congratulate Prof. Björn Erik Ottersten, the recipient of the 2025 IEEE Fourier Award for Signal Processing, and Prof. Shrikanth Narayanan, the recipient of the 2025 IEEE James L. Flanagan Speech and Audio Processing Award. Both awards will be presented in-person at ICASSP by Anthony Vetro, MERL President & CEO.

Technical Program

MERL is presenting 15 papers in the main conference on a wide range of topics including source separation, sound event detection, sound anomaly detection, speaker diarization, music generation, robot action generation from video, indoor airflow imaging, WiFi sensing, Doppler single-photon Lidar, optical coherence tomography, and radar imaging. Another paper on spatial audio will be presented at the Generative Data Augmentation for Real-World Signal Processing Applications (GenDA) Satellite Workshop.

MERL Researchers Petros Boufounos and Hassan Mansour will present a Tutorial on “Computational Methods in Radar Imaging” in the afternoon of Monday, April 7.

Petros Boufounos will also be giving an industry talk on Thursday April 10 at 12pm, on “A Physics-Informed Approach to Sensing".

About ICASSP

ICASSP is the flagship conference of the IEEE Signal Processing Society, and the world's largest and most comprehensive technical conference focused on the research advances and latest technological development in signal and information processing. The event has been attracting more than 4000 participants each year.
MERL Contacts:

Wael H. Ali; Petros T. Boufounos; Radu Corcodel; François Germain; Chiori Hori; Siddarth Jain; Devesh K. Jha; Toshiaki Koike-Akino; Jonathan Le Roux; Yanting Ma; Hassan Mansour; Yoshiki Masuyama; Joshua Rapp; Diego Romeres; Anthony Vetro; Pu (Perry) Wang; Gordon Wichern
External Link:

https://2025.ieeeicassp.org/
Research Areas:

Artificial Intelligence, Communications, Computational Sensing, Electronic and Photonic Devices, Machine Learning, Robotics, Signal Processing, Speech & Audio
- Related Publications
  Ebbers, J., Germain, F.G., Wilkinghoff, K., Wichern, G., Le Roux, J., "No Class Left Behind: A Closer Look at Class Balancing for Audio Tagging", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), April 2025.
  BibTeX TR2025-037 PDF
  @inproceedings{Ebbers2025mar,
  author = {Ebbers, Janek and Germain, François G and Wilkinghoff, Kevin and Wichern, Gordon and {Le Roux}, Jonathan},
  title = {{No Class Left Behind: A Closer Look at Class Balancing for Audio Tagging}},
  booktitle = {IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)},
  year = 2025,
  month = mar,
  url = {https://www.merl.com/publications/TR2025-037}
  }
  Araki, S., Ito, N., Haeb-Umbach, R., Wichern, G., Wang, Z.-Q., Mitsufuji, Y., "30+ Years of Source Separation Research: Achievements and Future Challenges", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), April 2025.
  BibTeX TR2025-036 PDF
  @inproceedings{Araki2025mar,
  author = {Araki, Shoko and Ito, Nobutaka and Haeb-Umbach, Reinhold and Wichern, Gordon and Wang, Zhong-Qiu and Mitsufuji, Yuki},
  title = {{30+ Years of Source Separation Research: Achievements and Future Challenges}},
  booktitle = {IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)},
  year = 2025,
  month = mar,
  url = {https://www.merl.com/publications/TR2025-036}
  }
  Teh, A., Ali, W.H., Rapp, J., Mansour, H., "Indoor Airflow Imaging Using Physics-Informed Schlieren Tomography", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), April 2025.
  BibTeX TR2025-035 PDF
  @inproceedings{Teh2025mar,
  author = {Teh, Arjun and Ali, Wael H. and Rapp, Joshua and Mansour, Hassan},
  title = {{Indoor Airflow Imaging Using Physics-Informed Schlieren Tomography}},
  booktitle = {IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)},
  year = 2025,
  month = mar,
  url = {https://www.merl.com/publications/TR2025-035}
  }
  Hori, C., Kambara, M., Sugiura, K., Ota, K., Khurana, S., Jain, S., Corcodel, R., Jha, D.K., Romeres, D., Le Roux, J., "Interactive Robot Action Replanning using Multimodal LLM Trained from Human Demonstration Videos", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), April 2025.
  BibTeX TR2025-034 PDF
  @inproceedings{Hori2025mar,
  author = {Hori, Chiori and Kambara, Motonari and Sugiura, Komei and Ota, Kei and Khurana, Sameer and Jain, Siddarth and Corcodel, Radu and Jha, Devesh K. and Romeres, Diego and {Le Roux}, Jonathan},
  title = {{Interactive Robot Action Replanning using Multimodal LLM Trained from Human Demonstration Videos}},
  booktitle = {IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)},
  year = 2025,
  month = mar,
  url = {https://www.merl.com/publications/TR2025-034}
  }
  Saijo, K., Ebbers, J., Germain, F.G., Khurana, S., Wichern, G., Le Roux, J., "Leveraging Audio-Only Data for Text-Queried Target Sound Extraction", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), April 2025.
  BibTeX TR2025-033 PDF
  @inproceedings{Saijo2025mar2,
  author = {Saijo, Kohei and Ebbers, Janek and Germain, François G and Khurana, Sameer and Wichern, Gordon and {Le Roux}, Jonathan},
  title = {{Leveraging Audio-Only Data for Text-Queried Target Sound Extraction}},
  booktitle = {IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)},
  year = 2025,
  month = mar,
  url = {https://www.merl.com/publications/TR2025-033}
  }
  Saijo, K., Ebbers, J., Germain, F.G., Wichern, G., Le Roux, J., "Task-Aware Unified Source Separation", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), April 2025.
  BibTeX TR2025-032 PDF
  @inproceedings{Saijo2025mar,
  author = {Saijo, Kohei and Ebbers, Janek and Germain, François G and Wichern, Gordon and {Le Roux}, Jonathan},
  title = {{Task-Aware Unified Source Separation}},
  booktitle = {IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)},
  year = 2025,
  month = mar,
  url = {https://www.merl.com/publications/TR2025-032}
  }
  Gruttadauria, E., Fontaine, M., Le Roux, J., Essid, S., "O-EENC-SD: Efficient Online End-to-End Neural Clustering for Speaker Diarization", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), April 2025.
  BibTeX TR2025-031 PDF
  @inproceedings{Gruttadauria2025mar,
  author = {Gruttadauria, Elio and Fontaine, Mathieu and {Le Roux}, Jonathan and Essid, Slim},
  title = {{O-EENC-SD: Efficient Online End-to-End Neural Clustering for Speaker Diarization}},
  booktitle = {IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)},
  year = 2025,
  month = mar,
  url = {https://www.merl.com/publications/TR2025-031}
  }
  Wilkinghoff, K., Yang, H., Ebbers, J., Germain, F.G., Wichern, G., Le Roux, J., "Keeping the Balance: Anomaly Score Calculation for Domain Generalization", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), April 2025.
  BibTeX TR2025-030 PDF
  @inproceedings{Wilkinghoff2025mar,
  author = {Wilkinghoff, Kevin and Yang, Haici and Ebbers, Janek and Germain, François G and Wichern, Gordon and {Le Roux}, Jonathan},
  title = {{Keeping the Balance: Anomaly Score Calculation for Domain Generalization}},
  booktitle = {IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)},
  year = 2025,
  month = mar,
  url = {https://www.merl.com/publications/TR2025-030}
  }
  Masuyama, Y., Wichern, G., Germain, F.G., Ick, C., Le Roux, J., "Retrieval-Augmented Neural Field for HRTF Upsampling and Personalization", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), April 2025.
  BibTeX TR2025-029 PDF Software
  @inproceedings{Masuyama2025mar,
  author = {Masuyama, Yoshiki and Wichern, Gordon and Germain, François G and Ick, Christopher and {Le Roux}, Jonathan},
  title = {{Retrieval-Augmented Neural Field for HRTF Upsampling and Personalization}},
  booktitle = {IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)},
  year = 2025,
  month = mar,
  url = {https://www.merl.com/publications/TR2025-029}
  }
  Kitichotkul, R., Rapp, J., Ma, Y., Mansour, H., "Doppler Single-Photon Lidar", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), April 2025.
  BibTeX TR2025-028 PDF
  @inproceedings{Kitichotkul2025mar,
  author = {Kitichotkul, Ruangrawee and Rapp, Joshua and Ma, Yanting and Mansour, Hassan},
  title = {{Doppler Single-Photon Lidar}},
  booktitle = {IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)},
  year = 2025,
  month = mar,
  url = {https://www.merl.com/publications/TR2025-028}
  }
  Yataka, R., Wang, P., Boufounos, P.T., Takahashi, R., "Multi-View Radar Detection Transformer with Differentiable Positional Encoding", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), April 2025.
  BibTeX TR2025-027 PDF
  @inproceedings{Yataka2025mar,
  author = {Yataka, Ryoma and Wang, Pu and Boufounos, Petros T. and Takahashi, Ryuhei},
  title = {{Multi-View Radar Detection Transformer with Differentiable Positional Encoding}},
  booktitle = {IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)},
  year = 2025,
  month = mar,
  url = {https://www.merl.com/publications/TR2025-027}
  }
  Attiah, K., Wang, P., Mansour, H., Koike-Akino, T., Boufounos, P.T., "Enabling DMG Wi-Fi Sensing in Data Transmission Intervals by Exploiting Beam Training Codebook", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), April 2025.
  BibTeX TR2025-026 PDF
  @inproceedings{Attiah2025mar,
  author = {Attiah, Kareem and Wang, Pu and Mansour, Hassan and Koike-Akino, Toshiaki and Boufounos, Petros T.},
  title = {{Enabling DMG Wi-Fi Sensing in Data Transmission Intervals by Exploiting Beam Training Codebook}},
  booktitle = {IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)},
  year = 2025,
  month = mar,
  url = {https://www.merl.com/publications/TR2025-026}
  }
  Ick, C., Wichern, G., Masuyama, Y., Germain, F.G., Le Roux, J., "Data Augmentation Using Neural Acoustic Fields With Retrieval-Augmented Pre-training", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) Satellite Workshop on Generative Data Augmentation for Real-World Signal Processing Applications (GenDA), April 2025.
  BibTeX TR2025-045 PDF
  @inproceedings{Ick2025apr,
  author = {Ick, Christopher and Wichern, Gordon and Masuyama, Yoshiki and Germain, François G and {Le Roux}, Jonathan},
  title = {{Data Augmentation Using Neural Acoustic Fields With Retrieval-Augmented Pre-training}},
  booktitle = {IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) Satellite Workshop on Generative Data Augmentation for Real-World Signal Processing Applications (GenDA)},
  year = 2025,
  month = apr,
  url = {https://www.merl.com/publications/TR2025-045}
  }
  Koo, J., Wichern, G., Germain, F.G., Khurana, S., Le Roux, J., "SMITIN: Self-Monitored Inference-Time INtervention for Generative Music Transformers", IEEE Open Journal of Signal Processing, DOI: 10.1109/OJSP.2025.3534686, Vol. 6, pp. 266-275, January 2025.
  BibTeX TR2025-012 PDF Software
  @article{Koo2025jan,
  author = {Koo, Junghyun and Wichern, Gordon and Germain, François G and Khurana, Sameer and {Le Roux}, Jonathan},
  title = {{SMITIN: Self-Monitored Inference-Time INtervention for Generative Music Transformers}},
  journal = {IEEE Open Journal of Signal Processing},
  year = 2025,
  volume = 6,
  pages = {266--275},
  month = jan,
  doi = {10.1109/OJSP.2025.3534686},
  issn = {2644-1322},
  url = {https://www.merl.com/publications/TR2025-012}
  }
  Rapp, J., Mansour, H., Boufounos, P.T., Koike-Akino, T., Parsons, K., "Multi-layered Surface Estimation for Low-cost Optical Coherence Tomography", IEEE Transactions on Computational Imaging, DOI: 10.1109/TCI.2024.3497602, Vol. 10, pp. 1706-1721, December 2024.
  BibTeX TR2024-164 PDF
  @article{Rapp2024dec,
  author = {Rapp, Joshua and Mansour, Hassan and Boufounos, Petros T. and Koike-Akino, Toshiaki and Parsons, Kieran},
  title = {{Multi-layered Surface Estimation for Low-cost Optical Coherence Tomography}},
  journal = {IEEE Transactions on Computational Imaging},
  year = 2024,
  volume = 10,
  pages = {1706--1721},
  month = dec,
  doi = {10.1109/TCI.2024.3497602},
  url = {https://www.merl.com/publications/TR2024-164}
  }
  Boeddeker, C., Subramanian, A.S., Wichern, G., Haeb-Umbach, R., Le Roux, J., "TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings", IEEE/ACM Transactions on Audio, Speech, and Language Processing, DOI: 10.1109/TASLP.2024.3350887, Vol. 32, pp. 1185-1197, February 2024.
  BibTeX TR2024-006 PDF Software
  @article{Boeddeker2024feb,
  author = {Boeddeker, Christoph and Subramanian, Aswin Shanmugam and Wichern, Gordon and Haeb-Umbach, Reinhold and {Le Roux}, Jonathan},
  title = {{TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings}},
  journal = {IEEE/ACM Transactions on Audio, Speech, and Language Processing},
  year = 2024,
  volume = 32,
  pages = {1185--1197},
  month = feb,
  doi = {10.1109/TASLP.2024.3350887},
  issn = {2329-9304},
  url = {https://www.merl.com/publications/TR2024-006}
  }

Date:

Location:

Description:

MERL Contacts:

External Link:

Research Areas: