Unifying convolution and transformer: a dual stage network equipped with cross-interactive multi-modal feature fusion and edge guidance for RGB-D salient object detection.
S. Abraham, и B. Kovoor. J. Ambient Intell. Humaniz. Comput., 15 (4):
2341-2359(апреля 2024)