Physics-informed machine learning for audio processing

In many machine learning techniques, a model is trained on a large amount of data to accomplish a specific task. In audio processing, however, collecting that much training data is often difficult. Acoustic phenomena, on the other hand, obey physical laws, and these physical constraints can serve as useful prior information for machine learning models. The governing equations of sound propagation, such as the wave equation, are a prime example. Various other constraints based on physical properties can also be considered, depending on the target or task, and we aim to build new audio processing technology by efficiently incorporating them into machine learning models. Such technology is expected to require less data than conventional machine learning and to be more flexible than approaches based solely on physical models.
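As a rough illustration of how a governing equation can act as prior information, the sketch below scores a candidate sound field by combining a data-fit term at a few microphone positions with a penalty on the residual of the 1-D Helmholtz equation (the frequency-domain form of the wave equation). It is a minimal sketch under stated assumptions, not the method of any paper listed here: the function names, the 1-D setting, the grid, the 500 Hz frequency, and the penalty weight are all hypothetical choices made for illustration.

```python
import numpy as np

C = 343.0  # assumed speed of sound in air (m/s)


def helmholtz_residual(p, dx, k):
    """Finite-difference residual of the 1-D Helmholtz equation
    p'' + k^2 p = 0, evaluated at the interior grid points."""
    second_derivative = (p[2:] - 2.0 * p[1:-1] + p[:-2]) / dx**2
    return second_derivative + k**2 * p[1:-1]


def physics_informed_loss(p, mic_idx, observed, dx, k, weight=1.0):
    """Data misfit at the microphone positions plus a weighted penalty
    on the PDE residual (hypothetical loss for illustration)."""
    data_term = np.mean(np.abs(p[mic_idx] - observed) ** 2)
    pde_term = np.mean(np.abs(helmholtz_residual(p, dx, k)) ** 2)
    return data_term + weight * pde_term


freq = 500.0                      # assumed frequency (Hz)
k = 2.0 * np.pi * freq / C        # wavenumber (rad/m)
x = np.linspace(0.0, 1.0, 2001)   # 1 m line sampled every 0.5 mm
dx = x[1] - x[0]

plane_wave = np.exp(1j * k * x)          # satisfies the Helmholtz equation
non_physical = np.exp(1j * 2.0 * k * x)  # wrong wavenumber, violates it

mic_idx = np.array([0, 500, 1000, 1500, 2000])  # five hypothetical microphones
observed = plane_wave[mic_idx]

loss_valid = physics_informed_loss(plane_wave, mic_idx, observed, dx, k)
loss_invalid = physics_informed_loss(non_physical, mic_idx, observed, dx, k)
print(loss_valid, loss_invalid)
```

The physically valid plane wave yields a loss close to zero, while the candidate with the wrong wavenumber is heavily penalized even at points where no microphone observes it. This is the sense in which a physical constraint can compensate for scarce data.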

References

2025

  1. Physics-Informed Machine Learning for Sound Field Estimation: Fundamentals, state of the art, and challenges
    Shoichi Koyama, Juliano G. C. Ribeiro, Tomohiko Nakamura, Natsuki Ueno, and Mirco Pezzoli
    IEEE Signal Processing Magazine, 2025

2024

  1. Sound Field Estimation Based on Physics-Constrained Kernel Interpolation Adapted to Environment
    Juliano G. C. Ribeiro, Shoichi Koyama, Ryosuke Horiuchi, and Hiroshi Saruwatari
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024

2025

  1. Sound Field Estimation: Theories and Applications (Foundations and Trends® in Signal Processing)
    2025