Abstract: Many of the existing systems for multi-channel sound source localization and separation are built on or designed for specific microphone array geometries, which means that for a new scenario ...
Abstract: Tracking multiple objects in videos relies on modeling the spatial-temporal interactions of the objects. In this paper, we propose TransMOT, which leverages powerful graph transformers to ...
一些您可能无法访问的结果已被隐去。
显示无法访问的结果