Keith Price Bibliography coauth Details for nagr

Index for nagr

Nagran, A. Standard Author Listing
     with: Arnab, A.: VicTR: Video-conditioned Text Representations for Activity ...
     with: Kahatapitiya, K.: VicTR: Video-conditioned Text Representations for Ac...
     with: Ryoo, M.S.: VicTR: Video-conditioned Text Representations for Activity...

Nagrani, A. Standard Author Listing
     with: Afouras, T.: Localizing Visual Sounds the Hard Way
     with: Alabdulmohsin, I.: On Scaling Up a Multilingual Vision and Language Mo...
     with: Alahari, K.: Masking Modalities for Cross-modal Video Retrieval
     with: Albanie, S.: Learnable PINs: Cross-modal Embeddings for Person Identity
     with: Albanie, S.: Seeing Voices and Hearing Faces: Cross-Modal Biometric Ma...
     with: Amelot, J.: On Scaling Up a Multilingual Vision and Language Model
     with: Angelova, A.: On Scaling Up a Multilingual Vision and Language Model
     with: Arnab, A.: End-to-end Generative Pretraining for Multimodal Video Capt...
     with: Arnab, A.: Flexible Frame Selection for Efficient Video Reasoning
     with: Arnab, A.: On Scaling Up a Multilingual Vision and Language Model
     with: Arnab, A.: Streaming Dense Video Captioning
     with: Arnab, A.: Uncertainty-aware Weakly Supervised Action Detection from U...
     with: Arnab, A.: UnLoc: A Unified Framework for Video Localization Tasks
     with: Bain, M.: AutoAD II: The Sequel - Who, When, and What in Movie Audio D...
     with: Bain, M.: AutoAD III: The Prequel: Back to the Pixels
     with: Bain, M.: Autoad-zero: A Training-free Framework for Zero-shot Audio D...
     with: Bain, M.: AutoAD: Movie Description in Context
     with: Bain, M.: Condensed Movies: Story Based Retrieval with Contextual Embe...
     with: Bain, M.: Count, Crop and Recognise: Fine-Grained Recognition in the W...
     with: Bain, M.: Frozen in Time: A Joint Video and Image Encoder for End-to-E...
     with: Beyer, L.: On Scaling Up a Multilingual Vision and Language Model
     with: Brown, A.: Condensed Movies: Story Based Retrieval with Contextual Emb...
     with: Buch, S.: Flexible Frame Selection for Efficient Video Reasoning
     with: Buch, S.: MoReVQA: Exploring Modular Reasoning Models for Video Questi...
     with: Buch, S.: Streaming Dense Video Captioning
     with: Caron, M.: Verbs in Action: Improving verb understanding in video-lang...
     with: Changpinyo, S.: On Scaling Up a Multilingual Vision and Language Model
     with: Chen, H.L.: Localizing Visual Sounds the Hard Way
     with: Chen, X.: On Scaling Up a Multilingual Vision and Language Model
     with: Cho, M.: MoReVQA: Exploring Modular Reasoning Models for Video Questio...
     with: Damen, D.: EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric A...
     with: Darrell, T.J.: TL;DW? Summarizing Instructional Videos with Task Relev...
     with: Dehghani, M.: On Scaling Up a Multilingual Vision and Language Model
     with: Djolonga, J.: On Scaling Up a Multilingual Vision and Language Model
     with: Gabeur, V.: Masking Modalities for Cross-modal Video Retrieval
     with: Ge, W.: UnLoc: A Unified Framework for Video Localization Tasks
     with: Goodman, S.: On Scaling Up a Multilingual Vision and Language Model
     with: Han, T.: AutoAD III: The Prequel: Back to the Pixels
     with: Han, T.: Autoad-zero: A Training-free Framework for Zero-shot Audio De...
     with: Han, T.: AutoAD: Movie Description in Context
     with: Han, T.D.: AutoAD II: The Sequel - Who, When, and What in Movie Audio ...
     with: Hauth, A.: Learning Audio-Video Modalities from Image Captions
     with: Houlsby, N.: On Scaling Up a Multilingual Vision and Language Model
     with: Hu, H.: On Scaling Up a Multilingual Vision and Language Model
     with: Joshi, M.: On Scaling Up a Multilingual Vision and Language Model
     with: Kazakos, E.: EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric...
     with: Keysers, D.: On Scaling Up a Multilingual Vision and Language Model
     with: Kolesnikov, A.: On Scaling Up a Multilingual Vision and Language Model
     with: Kuehne, H.: Unbiasing through Textual Descriptions: Mitigating Represe...
     with: Laptev, I.: Vid2Seq: Large-Scale Pretraining of a Visual Language Mode...
     with: Lee, K.: On Scaling Up a Multilingual Vision and Language Model
     with: Li, G.: On Scaling Up a Multilingual Vision and Language Model
     with: Li, Y.: On Scaling Up a Multilingual Vision and Language Model
     with: Lucic, M.: On Scaling Up a Multilingual Vision and Language Model
     with: Manen, S.: Learning Audio-Video Modalities from Image Captions
     with: Miech, A.: Vid2Seq: Large-Scale Pretraining of a Visual Language Model...
     with: Min, J.: MoReVQA: Exploring Modular Reasoning Models for Video Questio...
     with: Minderer, M.: On Scaling Up a Multilingual Vision and Language Model
     with: Momeni, L.: Verbs in Action: Improving verb understanding in video-lan...
     with: Montgomery, C.: On Scaling Up a Multilingual Vision and Language Model
     with: Mustafa, B.: On Scaling Up a Multilingual Vision and Language Model
     with: Myers, A.: Streaming Dense Video Captioning
     with: Narasimhan, M.: TL;DW? Summarizing Instructional Videos with Task Rele...
     with: Padlewski, P.: On Scaling Up a Multilingual Vision and Language Model
     with: Pang, B.: On Scaling Up a Multilingual Vision and Language Model
     with: Pavetic, F.: On Scaling Up a Multilingual Vision and Language Model
     with: Piergiovanni, A.: On Scaling Up a Multilingual Vision and Language Model
     with: Pietrzyk, P.: On Scaling Up a Multilingual Vision and Language Model
     with: Pont Tuset, J.: Vid2Seq: Large-Scale Pretraining of a Visual Language ...
     with: Ritter, M.: On Scaling Up a Multilingual Vision and Language Model
     with: Rohrbach, A.: TL;DW? Summarizing Instructional Videos with Task Releva...
     with: Rong, K.: On Scaling Up a Multilingual Vision and Language Model
     with: Ross, D.: Speech2Action: Cross-Modal Supervision for Action Recognition
     with: Ross, D.: UnLoc: A Unified Framework for Video Localization Tasks
     with: Rubinstein, M.: TL;DW? Summarizing Instructional Videos with Task Rele...
     with: Ruiz, C.R.: On Scaling Up a Multilingual Vision and Language Model
     with: Rupprecht, C.: Unbiasing through Textual Descriptions: Mitigating Repr...
     with: Salz, D.: On Scaling Up a Multilingual Vision and Language Model
     with: Schiele, B.: Unbiasing through Textual Descriptions: Mitigating Repres...
     with: Schmid, C.: AVFormer: Injecting Vision into Frozen Speech Models for Z...
     with: Schmid, C.: Composable Augmentation Encoding for Video Representation ...
     with: Schmid, C.: End-to-end Generative Pretraining for Multimodal Video Cap...
     with: Schmid, C.: Flexible Frame Selection for Efficient Video Reasoning
     with: Schmid, C.: Learning Audio-Video Modalities from Image Captions
     with: Schmid, C.: Look Before you Speak: Visually Contextualized Utterances
     with: Schmid, C.: Masking Modalities for Cross-modal Video Retrieval
     with: Schmid, C.: MoReVQA: Exploring Modular Reasoning Models for Video Ques...
     with: Schmid, C.: Speech2Action: Cross-Modal Supervision for Action Recognit...
     with: Schmid, C.: Streaming Dense Video Captioning
     with: Schmid, C.: TL;DW? Summarizing Instructional Videos with Task Relevanc...
     with: Schmid, C.: Uncertainty-aware Weakly Supervised Action Detection from ...
     with: Schmid, C.: UnLoc: A Unified Framework for Video Localization Tasks
     with: Schmid, C.: Verbs in Action: Improving verb understanding in video-lan...
     with: Schmid, C.: Vid2Seq: Large-Scale Pretraining of a Visual Language Mode...
     with: Schofield, D.: Count, Crop and Recognise: Fine-Grained Recognition in ...
     with: Seo, P.H.: AVFormer: Injecting Vision into Frozen Speech Models for Ze...
     with: Seo, P.H.: End-to-end Generative Pretraining for Multimodal Video Capt...
     with: Seo, P.H.: Learning Audio-Video Modalities from Image Captions
     with: Seo, P.H.: Look Before you Speak: Visually Contextualized Utterances
     with: Seo, P.H.: Vid2Seq: Large-Scale Pretraining of a Visual Language Model...
     with: Seybold, B.: Learning Audio-Video Modalities from Image Captions
     with: Seyedhosseini, M.: On Scaling Up a Multilingual Vision and Language Mo...
     with: Shakeri, S.: On Scaling Up a Multilingual Vision and Language Model
     with: Shvetsova, N.: Unbiasing through Textual Descriptions: Mitigating Repr...
     with: Sivic, J.: Vid2Seq: Large-Scale Pretraining of a Visual Language Model...
     with: Soricut, R.: On Scaling Up a Multilingual Vision and Language Model
     with: Steiner, A.P.: On Scaling Up a Multilingual Vision and Language Model
     with: Sukthankar, R.: Speech2Action: Cross-Modal Supervision for Action Reco...
     with: Sun, C.: Composable Augmentation Encoding for Video Representation Lea...
     with: Sun, C.: Learning Audio-Video Modalities from Image Captions
     with: Sun, C.: Masking Modalities for Cross-modal Video Retrieval
     with: Sun, C.: Speech2Action: Cross-Modal Supervision for Action Recognition
     with: Sun, C.: TL;DW? Summarizing Instructional Videos with Task Relevance a...
     with: Sun, C.: Uncertainty-aware Weakly Supervised Action Detection from Unt...
     with: Tay, Y.: On Scaling Up a Multilingual Vision and Language Model
     with: Tian, Y.L.: Composable Augmentation Encoding for Video Representation ...
     with: Tschannen, M.: On Scaling Up a Multilingual Vision and Language Model
     with: Varol, G.: AutoAD II: The Sequel - Who, When, and What in Movie Audio ...
     with: Varol, G.: AutoAD III: The Prequel: Back to the Pixels
     with: Varol, G.: Autoad-zero: A Training-free Framework for Zero-shot Audio ...
     with: Varol, G.: AutoAD: Movie Description in Context
     with: Varol, G.: Frozen in Time: A Joint Video and Image Encoder for End-to-...
     with: Vedaldi, A.: Localizing Visual Sounds the Hard Way
     with: Wang, X.: On Scaling Up a Multilingual Vision and Language Model
     with: Wang, Z.H.: UnLoc: A Unified Framework for Video Localization Tasks
     with: Waters, A.: On Scaling Up a Multilingual Vision and Language Model
     with: Wu, J.L.: On Scaling Up a Multilingual Vision and Language Model
     with: Xie, J.Y.: Autoad-zero: A Training-free Framework for Zero-shot Audio ...
     with: Xie, W.: AutoAD II: The Sequel - Who, When, and What in Movie Audio De...
     with: Xie, W.: AutoAD III: The Prequel: Back to the Pixels
     with: Xie, W.: Autoad-zero: A Training-free Framework for Zero-shot Audio De...
     with: Xie, W.: AutoAD: Movie Description in Context
     with: Xie, W.: Localizing Visual Sounds the Hard Way
     with: Xiong, X.: Streaming Dense Video Captioning
     with: Xiong, X.: UnLoc: A Unified Framework for Video Localization Tasks
     with: Xu, Y.Z.: On Scaling Up a Multilingual Vision and Language Model
     with: Yan, S.: Streaming Dense Video Captioning
     with: Yan, S.: UnLoc: A Unified Framework for Video Localization Tasks
     with: Yang, A.: Vid2Seq: Large-Scale Pretraining of a Visual Language Model ...
     with: Zhai, X.H.: On Scaling Up a Multilingual Vision and Language Model
     with: Zhou, X.Y.: Streaming Dense Video Captioning
     with: Zisserman, A.: AutoAD II: The Sequel - Who, When, and What in Movie Au...
     with: Zisserman, A.: AutoAD III: The Prequel: Back to the Pixels
     with: Zisserman, A.: Autoad-zero: A Training-free Framework for Zero-shot Au...
     with: Zisserman, A.: AutoAD: Movie Description in Context
     with: Zisserman, A.: Condensed Movies: Story Based Retrieval with Contextual...
     with: Zisserman, A.: Count, Crop and Recognise: Fine-Grained Recognition in ...
     with: Zisserman, A.: EPIC-Fusion: Audio-Visual Temporal Binding for Egocentr...
     with: Zisserman, A.: Frozen in Time: A Joint Video and Image Encoder for End...
     with: Zisserman, A.: Learnable PINs: Cross-modal Embeddings for Person Ident...
     with: Zisserman, A.: Localizing Visual Sounds the Hard Way
     with: Zisserman, A.: Seeing Voices and Hearing Faces: Cross-Modal Biometric ...
     with: Zisserman, A.: Speech2Action: Cross-Modal Supervision for Action Recog...
     with: Zisserman, A.: Verbs in Action: Improving verb understanding in video-...
154 for Nagrani, A.

Nagrecha, K. Standard Author Listing
with: Vasconcelos, N.M.: Gradient-based Algorithms for Machine Teaching
with: Wang, P.: Gradient-based Algorithms for Machine Teaching

Last update: 4-Jun-26 17:05:40
Use price@usc.edu for comments.