Abstract: The multimodal remote sensing image matching is crucial for many applications. However, nonlinear intensity distortion (NID) significantly impairs matching performance, especially when ...
The Composed Image Retrieval (CIR) task aims to retrieve target images using a composed query consisting of a reference image and a modified text. Advanced methods often utilize contrastive learning ...
Abstract: Pixel-level adaptive convolution, which overcomes the deficiency of the spatial-invariance of standard convolution, is always limited to performing feature extraction from local patches and ...
We present Follow-Your-Emoji, a diffusion-based framework for portrait animation, which animates a reference portrait with target landmark sequences. [FollowYourEmoji ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results