Keith Price Bibliography journal Details for clvl

Journals starting with clvl

CLVL19 * *Closing the Loop Between Vision and Language
* Are we Asking the Right Questions in MovieQA?
* Evaluating Text-to-Image Matching using Binary Image Selection (BISON)
* SUN-Spot: An RGB-D Dataset With Spatial Referring Expressions

CLVL21 * *Closing the Loop Between Vision and Language
* CIGLI: Conditional Image Generation from Language & Image
* Egocentric Biochemical Video-and-Language Dataset
* Language-guided Multi-Modal Fusion for Video Action Recognition
* Latent Variable Models for Visual Question Answering
* Semi-Autoregressive Transformer for Image Captioning
* Visual Question Answering with Textual Representations for Images
* What You Say Is Not What You Do: Studying Visio-Linguistic Models for TV Series Summarization
8 for CLVL21

CLVL23 * *Closing the Loop Between Vision and Language
* Alignment and Generation Adapter for Efficient Video-Text Understanding
* BiLMa: Bidirectional Local-Matching for Text-based Person Re-identification
* Context-VQA: Towards Context-Aware and Purposeful Visual Question Answering
* Cross-Dataset Study on the Brazilian Sign Language Translation, A
* Cross-Modal Dense Passage Retrieval for Outside Knowledge Visual Question Answering
* ECO: Ensembling Context Optimization for Vision-Language Models
* empirical study of the effect of video encoders on Temporal Video Grounding, An
* Explaining Vision and Language through Graphs of Events in Space and Time
* LLaViLo: Boosting Video Moment Retrieval via Adapter-Based Multimodal Modeling
* Mapping Memes to Words for Multimodal Hateful Meme Classification
* Multimodal Neurons in Pretrained Text-Only Transformers
* PatFig: Generating Short and Long Captions for Patent Figures
* ProVLA: Compositional Image Search with Progressive Vision-Language Alignment and Multimodal Fusion
* Sparse Linear Concept Discovery Models
* Towards Vision-Language Mechanistic Interpretability: A Causal Tracing Tool for BLIP
* Vision-Language Models Performing Zero-Shot Tasks Exhibit Disparities Between Gender Groups
* Zero-Shot and Few-Shot Video Question Answering with Multi-Modal Prompts
18 for CLVL23

Last update:10-Sep-25 13:32:00
Use price@usc.edu for comments.