VITR: Augmenting Vision Transformers with Relation-Focused Learning for Cross-Modal Information Retrieval
Authors: Gong, Y., Cosma, G., Finke, A.
Publication Date: 13/02/2023
Source: arXiv
Authors: Gong, Y., Cosma, G., Finke, A.
Publication Date: 13/02/2023
Source: arXiv