Diomataris, M.[Markos]
Co Author Listing * Grounding Consistency: Distilling Spatial Common Sense for Precise Visual Relationship Detection
* Interpretable Visual Question Answering Via Reasoning Supervision
* Self-Supervised Learning for Visual Relationship Detection through Masked Bounding Box Reconstruction