Abstract: This article proposes a method, composed of a loss function and a feature extractor structure, for learning a distinctive feature representation (DIFR) for descriptors on multimodal images. The ...