Keith Price Bibliography coauth Details for lapt

Index for lapt

Laptev, D. Standard Author Listing
     with: Buhmann, J.M.: Convolutional Decision Trees for Feature Learning and S...
     with: Buhmann, J.M.: TI-POOLING: Transformation-Invariant Pooling for Featur...
     with: Buhmann, J.M.: Transformation-Invariant Convolutional Jungles
     with: Pollefeys, M.: TI-POOLING: Transformation-Invariant Pooling for Featur...
     with: Savinov, N.: TI-POOLING: Transformation-Invariant Pooling for Feature ...

Laptev, I. Standard Author Listing
     with: Ademtew, H.B.: All Languages Matter: Evaluating LMMs on Culturally Div...
     with: Agrawal, N.: Learning from Narrated Instruction Videos
     with: Agrawal, N.: Unsupervised Learning from Narrated Instruction Videos
     with: Ahsan, N.: All Languages Matter: Evaluating LMMs on Culturally Diverse...
     with: Akbarzadeh, A.: Galilean-diagonalized spatio-temporal interest operators
     with: Alahari, K.: Pose Estimation and Segmentation of Multiple People in St...
     with: Alahari, K.: Pose Estimation and Segmentation of People in 3D Movies
     with: Alayrac, J.: End-to-End Learning of Visual Representations From Uncura...
     with: Alayrac, J.: HowTo100M: Learning a Text-Video Embedding by Watching Hu...
     with: Alayrac, J.B.: Cross-Task Weakly Supervised Learning From Instructiona...
     with: Alayrac, J.B.: HowTo100M: Learning a Text-Video Embedding by Watching ...
     with: Alayrac, J.B.: Joint Discovery of Object States and Manipulation Actions
     with: Alayrac, J.B.: Learning Actionness via Long-range Temporal Order Verif...
     with: Alayrac, J.B.: Learning from Narrated Instruction Videos
     with: Alayrac, J.B.: Learning from Video and Text via Large-Scale Discrimina...
     with: Alayrac, J.B.: Look for the Change: Learning Object States and State-M...
     with: Alayrac, J.B.: Multi-Task Learning of Object States and State-Modifyin...
     with: Alayrac, J.B.: Thinking Fast and Slow: Efficient Text-to-Visual Retrie...
     with: Alayrac, J.B.: Unsupervised Learning from Narrated Instruction Videos
     with: Ali, A.: PairDETR: Joint Detection and Association of Human Bodies and...
     with: Amirudin, A.: All Languages Matter: Evaluating LMMs on Culturally Dive...
     with: Anwer, R.M.: All Languages Matter: Evaluating LMMs on Culturally Diver...
     with: Aremu, T.: All Languages Matter: Evaluating LMMs on Culturally Diverse...
     with: Audibert, J.Y.: Data-driven crowd analysis in videos
     with: Audibert, J.Y.: Density-aware person detection and tracking in crowds
     with: Azizov, D.: All Languages Matter: Evaluating LMMs on Culturally Divers...
     with: Azizpour, H.: Object Detection Using Strongly-Supervised Deformable Pa...
     with: Bach, F.: Automatic Annotation of Human Actions in Video
     with: Bach, F.: Finding Actors and Actions in Movies
     with: Bach, F.: Weakly Supervised Action Labeling in Videos under Ordering C...
     with: Bach, F.: Weakly-Supervised Alignment of Video with Text
     with: Baliah, S.: All Languages Matter: Evaluating LMMs on Culturally Divers...
     with: Baumgartner, A.: Automatic Extraction of Roads from Aerial Images Base...
     with: Baumgartner, A.: Automatic Road Extraction Based on Multi-Scale Modeli...
     with: Baumgartner, A.: Multi-Scale and Snakes for Automatic Road Extraction
     with: Belongie, S.J.: Periodic Motion Detection and Segmentation via Approxi...
     with: Bhatia, N.: All Languages Matter: Evaluating LMMs on Culturally Divers...
     with: Bhatkal, A.: All Languages Matter: Evaluating LMMs on Culturally Diver...
     with: Black, M.J.: Learning from Synthetic Humans
     with: Black, M.J.: Learning Joint Reconstruction of Hands and Manipulated Ob...
     with: Bogo, F.: Leveraging Photometric Consistency Over Time for Sparsely Su...
     with: Bojanowski, P.: Finding Actors and Actions in Movies
     with: Bojanowski, P.: Instance-Level Video Segmentation from Object Tracks
     with: Bojanowski, P.: Learning from Narrated Instruction Videos
     with: Bojanowski, P.: Learning from Video and Text via Large-Scale Discrimin...
     with: Bojanowski, P.: Unsupervised Learning from Narrated Instruction Videos
     with: Bojanowski, P.: Weakly Supervised Action Labeling in Videos under Orde...
     with: Bojanowski, P.: Weakly-Supervised Alignment of Video with Text
     with: Bottou, L.: Is object localization for free? - Weakly-supervised learn...
     with: Bottou, L.: Learning and Transferring Mid-level Image Representations ...
     with: Boujemaa, N.: Video copy detection: a comparative study
     with: Bretzner, L.: Hand gesture recognition using multi-scale colour featur...
     with: Buisson, O.: Video copy detection: a comparative study
     with: Cabrera, A.: All Languages Matter: Evaluating LMMs on Culturally Diver...
     with: Caputo, B.: Local velocity-adapted motion events for spatio-temporal r...
     with: Caputo, B.: Recognizing human actions: a local SVM approach
     with: Carpentier, J.: Estimating 3D Motion and Forces of Human-Object Intera...
     with: Carpentier, J.: Estimating 3D Motion and Forces of Person-Object Inter...
     with: Cavada, S.: All Languages Matter: Evaluating LMMs on Culturally Divers...
     with: Ceylan, D.: BodyNet: Volumetric Inference of 3D Human Body Shapes
     with: Chadha, A.: All Languages Matter: Evaluating LMMs on Culturally Divers...
     with: Chang, X.J.: RoomTour3D: Geometry-Aware Video-Instruction Tuning for E...
     with: Chari, V.: On pairwise costs for network flow multi-object tracking
     with: Charon, G.: P-CNN: Pose-Based CNN Features for Action Recognition
     with: Chen, L.: Video copy detection: a comparative study
     with: Chen, S.Z.: Airbert: In-Domain Pretraining for Vision-and-Language Nav...
     with: Chen, S.Z.: gSDF: Geometry-Driven Signed Distance Functions for 3D Han...
     with: Chen, S.Z.: Learning from Unlabeled 3D Environments for Vision-and-Lan...
     with: Chen, S.Z.: SUGAR: Pre-training 3D Visual Representations for Robotics
     with: Chen, S.Z.: Think Global, Act Local: Dual-scale Graph Transformer for ...
     with: Chen, Z.: AlignSDF: Pose-Aligned Signed Distance Fields for Hand-Objec...
     with: Chen, Z.: gSDF: Geometry-Driven Signed Distance Functions for 3D Hand-...
     with: Chigorin, A.: MobileFace: 3D Face Reconstruction with Efficient CNN Re...
     with: Chigorin, A.: PairDETR: Joint Detection and Association of Human Bodie...
     with: Chim, J.: All Languages Matter: Evaluating LMMs on Culturally Diverse ...
     with: Chinaev, N.: MobileFace: 3D Face Reconstruction with Efficient CNN Reg...
     with: Cho, M.: Deep Metric Learning Beyond Binary Supervision
     with: Cho, M.: Editor's Note: Special Issue on ACCV 2024
     with: Cho, M.: Unsupervised object discovery and localization in the wild: P...
     with: Cho, M.: Unsupervised Object Discovery and Tracking in Video Collections
     with: Cho, M.S.: ContextLocNet: Context-Aware Deep Network Models for Weakly...
     with: Cho, M.S.: Thin-Slicing for Pose: Learning to Understand Pose without ...
     with: Cholakkal, H.: All Languages Matter: Evaluating LMMs on Culturally Div...
     with: Choudhury, M.: All Languages Matter: Evaluating LMMs on Culturally Div...
     with: Cinbis, R.G.: Cross-Task Weakly Supervised Learning From Instructional...
     with: Damen, D.: Action Modifiers: Learning From Adverbs in Instructional Vi...
     with: Damen, D.: GenHowTo: Learning to Generate Actions and State Transforma...
     with: Damen, D.: ShowHowTo: Generating Scene-Conditioned Step-by-Step Visual...
     with: Delaitre, V.: People Watching: Human Actions as a Cue for Single View ...
     with: Delaitre, V.: Recognizing human actions in still images: A study of ba...
     with: Delaitre, V.: Scene Semantics from Long-Term Observation of People
     with: Dexter, E.: Cross-View Action Recognition from Temporal Self-similarit...
     with: Dexter, E.: Multi-view synchronization of human actions and dynamic sc...
     with: Dexter, E.: View-Independent Action Recognition from Temporal Self-Sim...
     with: Dissanayake, D.: All Languages Matter: Evaluating LMMs on Culturally D...
     with: Djanibekov, A.: All Languages Matter: Evaluating LMMs on Culturally Di...
     with: Doughty, H.: Action Modifiers: Learning From Adverbs in Instructional ...
     with: Duchenne, O.: Automatic Annotation of Human Actions in Video
     with: Eckstein, W.: Automatic Extraction of Roads from Aerial Images Based o...
     with: Efros, A.A.: People Watching: Human Actions as a Cue for Single View G...
     with: Efros, A.A.: Scene Semantics from Long-Term Observation of People
     with: Esplana, A.: All Languages Matter: Evaluating LMMs on Culturally Diver...
     with: Farestam, F.: All Languages Matter: Evaluating LMMs on Culturally Dive...
     with: Farhadi, A.: Hollywood in Homes: Crowdsourcing Data Collection for Act...
     with: Fatima, M.: All Languages Matter: Evaluating LMMs on Culturally Divers...
     with: Felsberg, M.: All Languages Matter: Evaluating LMMs on Culturally Dive...
     with: Fouhey, D.: Cross-Task Weakly Supervised Learning From Instructional V...
     with: Fouhey, D.F.: People Watching: Human Actions as a Cue for Single View ...
     with: Fouhey, D.F.: Scene Semantics from Long-Term Observation of People
     with: Gaikov, G.: PairDETR: Joint Detection and Association of Human Bodies ...
     with: Garcia, R.: Segmenter: Transformer for Semantic Segmentation
     with: Garcia, R.: SUGAR: Pre-training 3D Visual Representations for Robotics
     with: Gatti, P.: ShowHowTo: Generating Scene-Conditioned Step-by-Step Visual...
     with: Ghallabi, W.A.: All Languages Matter: Evaluating LMMs on Culturally Di...
     with: Ghasemaghaei, A.: All Languages Matter: Evaluating LMMs on Culturally ...
     with: Girshick, R.: Editorial: Deep Learning for Computer Vision
     with: Gokani, M.: All Languages Matter: Evaluating LMMs on Culturally Divers...
     with: Gorban, A.: THUMOS challenge on action recognition for videos 'in the ...
     with: Gouet Brunet, V.: Video copy detection: a comparative study
     with: Grave, E.: Weakly-Supervised Alignment of Video with Text
     with: Guhur, P.L.: Airbert: In-Domain Pretraining for Vision-and-Language Na...
     with: Guhur, P.L.: Learning from Unlabeled 3D Environments for Vision-and-La...
     with: Guhur, P.L.: Think Global, Act Local: Dual-scale Graph Transformer for...
     with: Gupta, A.: Hollywood in Homes: Crowdsourcing Data Collection for Activ...
     with: Gupta, A.: People Watching: Human Actions as a Cue for Single View Geo...
     with: Gupta, A.: Scene Semantics from Long-Term Observation of People
     with: Gupta, R.: All Languages Matter: Evaluating LMMs on Culturally Diverse...
     with: Hamerlik, E.: All Languages Matter: Evaluating LMMs on Culturally Dive...
     with: Han, M.F.: RoomTour3D: Geometry-Aware Video-Instruction Tuning for Emb...
     with: Hasson, Y.: AlignSDF: Pose-Aligned Signed Distance Fields for Hand-Obj...
     with: Hasson, Y.: Learning Joint Reconstruction of Hands and Manipulated Obj...
     with: Hasson, Y.: Leveraging Photometric Consistency Over Time for Sparsely ...
     with: Hasson, Y.: Towards Unconstrained Joint Hand-Object Reconstruction Fro...
     with: Hmaiti, Y.: All Languages Matter: Evaluating LMMs on Culturally Divers...
     with: Idrees, H.: THUMOS challenge on action recognition for videos 'in the ...
     with: Ihsani, M.K.: All Languages Matter: Evaluating LMMs on Culturally Dive...
     with: Izzati, F.A.: All Languages Matter: Evaluating LMMs on Culturally Dive...
     with: Jankovic, B.: All Languages Matter: Evaluating LMMs on Culturally Dive...
     with: Jiang, Y.G.: THUMOS challenge on action recognition for videos 'in the...
     with: Joly, A.: Video copy detection: a comparative study
     with: Junejo, I.N.: Cross-View Action Recognition from Temporal Self-similar...
     with: Junejo, I.N.: View-Independent Action Recognition from Temporal Self-S...
     with: Kalevatykh, I.: Learning Joint Reconstruction of Hands and Manipulated...
     with: Kantorov, V.: ContextLocNet: Context-Aware Deep Network Models for Wea...
     with: Kantorov, V.: Efficient Feature Extraction, Encoding, and Classificati...
     with: Kareem, A.: All Languages Matter: Evaluating LMMs on Culturally Divers...
     with: Kareem, D.: All Languages Matter: Evaluating LMMs on Culturally Divers...
     with: Khan, F.S.: All Languages Matter: Evaluating LMMs on Culturally Divers...
     with: Khan, S.: All Languages Matter: Evaluating LMMs on Culturally Diverse ...
     with: Kim, S.Y.: Deep Metric Learning Beyond Binary Supervision
     with: Klaser, A.: Evaluation of local spatio-temporal features for action re...
     with: Kokkinos, I.: Editorial: Deep Learning for Computer Vision
     with: Kuckreja, K.: All Languages Matter: Evaluating LMMs on Culturally Dive...
     with: Kukleva, A.: Learning Interactions and Relationships Between Movie Cha...
     with: Kumar, A.: All Languages Matter: Evaluating LMMs on Culturally Diverse...
     with: Kumar, V.: Long term spatio-temporal modeling for action detection
     with: Kwak, S.: Deep Metric Learning Beyond Binary Supervision
     with: Kwak, S.: Thin-Slicing for Pose: Learning to Understand Pose without E...
     with: Kwak, S.: Unsupervised object discovery and localization in the wild: ...
     with: Kwak, S.: Unsupervised Object Discovery and Tracking in Video Collecti...
     with: Laaksonen, J.: All Languages Matter: Evaluating LMMs on Culturally Div...
     with: Lacoste Julien, S.: Joint Discovery of Object States and Manipulation ...
     with: Lacoste Julien, S.: Learning from Narrated Instruction Videos
     with: Lacoste Julien, S.: On pairwise costs for network flow multi-object tr...
     with: Lacoste Julien, S.: Unsupervised Learning from Narrated Instruction Vi...
     with: Lajugie, R.: Instance-Level Video Segmentation from Object Tracks
     with: Lajugie, R.: Weakly Supervised Action Labeling in Videos under Orderin...
     with: Lajugie, R.: Weakly-Supervised Alignment of Video with Text
     with: Law To, J.: Video copy detection: a comparative study
     with: Li, K.: All Languages Matter: Evaluating LMMs on Culturally Diverse 10...
     with: Li, Z.M.: Estimating 3D Motion and Forces of Human-Object Interactions...
     with: Li, Z.M.: Estimating 3D Motion and Forces of Person-Object Interaction...
     with: Liang, X.D.: RoomTour3D: Geometry-Aware Video-Instruction Tuning for E...
     with: Lindeberg, T.: Automatic Extraction of Roads from Aerial Images Based ...
     with: Lindeberg, T.: Distance Measure and a Feature Likelihood Map Concept f...
     with: Lindeberg, T.: Galilean-diagonalized spatio-temporal interest operators
     with: Lindeberg, T.: Hand gesture recognition using multi-scale colour featu...
     with: Lindeberg, T.: Interest Point Detection and Scale Selection in Space-T...
     with: Lindeberg, T.: Local Descriptors for Spatio-temporal Recognition
     with: Lindeberg, T.: Local velocity-adapted motion events for spatio-tempora...
     with: Lindeberg, T.: multi-scale feature likelihood map for direct evaluatio...
     with: Lindeberg, T.: Recognizing human actions: a local SVM approach
     with: Lindeberg, T.: Space-time interest points
     with: Lindeberg, T.: Tracking of multi-state hand models using particle filt...
     with: Lindeberg, T.: Velocity adaptation of space-time interest points
     with: Lindeberg, T.: Velocity adaptation of spatio-temporal receptive fields...
     with: Ma, L.: RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodi...
     with: Maani, F.A.: All Languages Matter: Evaluating LMMs on Culturally Diver...
     with: Mahmood, N.: Learning from Synthetic Humans
     with: Malik, J.: Editorial: Deep Learning for Computer Vision
     with: Manjunath, S.: All Languages Matter: Evaluating LMMs on Culturally Div...
     with: Mansard, N.: Estimating 3D Motion and Forces of Human-Object Interacti...
     with: Mansard, N.: Estimating 3D Motion and Forces of Person-Object Interact...
     with: Marszalek, M.: Actions in context
     with: Marszalek, M.: Learning realistic human actions from movies
     with: Martin, X.: Learning from Synthetic Humans
     with: Maslych, M.: All Languages Matter: Evaluating LMMs on Culturally Diver...
     with: Mayer, H.: Automatic Extraction of Roads from Aerial Images Based on S...
     with: Mayer, H.: Automatic Road Extraction Based on Multi-Scale Modeling, Co...
     with: Mayer, H.: Multi-Scale and Snakes for Automatic Road Extraction
     with: Mayol Cuevas, W.: Action Modifiers: Learning From Adverbs in Instructi...
     with: Miech, A.: End-to-End Learning of Visual Representations From Uncurate...
     with: Miech, A.: HowTo100M: Learning a Text-Video Embedding by Watching Hund...
     with: Miech, A.: Just Ask: Learning to Answer Questions from Millions of Nar...
     with: Miech, A.: Learning from Video and Text via Large-Scale Discriminative...
     with: Miech, A.: Learning to Answer Visual Questions From Web Videos
     with: Miech, A.: Look for the Change: Learning Object States and State-Modif...
     with: Miech, A.: Multi-Task Learning of Object States and State-Modifying Ac...
     with: Miech, A.: Thinking Fast and Slow: Efficient Text-to-Visual Retrieval ...
     with: Miech, A.: TubeDETR: Spatio-Temporal Video Grounding with Transformers
     with: Miech, A.: Vid2Seq: Large-Scale Pretraining of a Visual Language Model...
     with: Mihaylov, M.: All Languages Matter: Evaluating LMMs on Culturally Dive...
     with: Mirkin, S.: All Languages Matter: Evaluating LMMs on Culturally Divers...
     with: More, K.: All Languages Matter: Evaluating LMMs on Culturally Diverse ...
     with: Nagrani, A.: Vid2Seq: Large-Scale Pretraining of a Visual Language Mod...
     with: Nguyen, T.: All Languages Matter: Evaluating LMMs on Culturally Divers...
     with: Obando Ceron, J.: All Languages Matter: Evaluating LMMs on Culturally ...
     with: Oisel, L.: Joint pose estimation and action recognition in image graphs
     with: Oliva, A.: Predicting Actions from Static Scenes
     with: Olsson, C.: Predicting Actions from Static Scenes
     with: Oquab, M.: ContextLocNet: Context-Aware Deep Network Models for Weakly...
     with: Oquab, M.: Is object localization for free? - Weakly-supervised learni...
     with: Oquab, M.: Learning and Transferring Mid-level Image Representations U...
     with: Osokin, A.: Context-Aware CNNs for Person Head Detection
     with: Otieno, O.: All Languages Matter: Evaluating LMMs on Culturally Divers...
     with: Papandreou, G.: Editorial: Deep Learning for Computer Vision
     with: Parizi, S.N.: Improving Bag-of-features Action Recognition with Non-lo...
     with: Parizi, S.N.: Modeling Image Context Using Object Centered Grid
     with: Perez, P.: Cross-View Action Recognition from Temporal Self-similarities
     with: Perez, P.: Joint pose estimation and action recognition in image graphs
     with: Perez, P.: Multi-view synchronization of human actions and dynamic sce...
     with: Perez, P.: Periodic Motion Detection and Segmentation via Approximate ...
     with: Perez, P.: Retrieving actions in movies
     with: Perez, P.: View-Independent Action Recognition from Temporal Self-Simi...
     with: Peyre, J.: Detecting Unseen Visual Relations Using Analogies
     with: Peyre, J.: Weakly-Supervised Learning of Visual Relations
     with: Pollefeys, M.: Leveraging Photometric Consistency Over Time for Sparse...
     with: Ponce, J.: Automatic Annotation of Human Actions in Video
     with: Ponce, J.: Finding Actors and Actions in Movies
     with: Ponce, J.: Unsupervised object discovery and localization in the wild:...
     with: Ponce, J.: Unsupervised Object Discovery and Tracking in Video Collect...
     with: Ponce, J.: Weakly Supervised Action Labeling in Videos under Ordering ...
     with: Ponce, J.: Weakly-Supervised Alignment of Video with Text
     with: Pont Tuset, J.: Vid2Seq: Large-Scale Pretraining of a Visual Language ...
     with: Qin, C.: All Languages Matter: Evaluating LMMs on Culturally Diverse 1...
     with: Rabbani, M.: All Languages Matter: Evaluating LMMs on Culturally Diver...
     with: Rabevohitra, F.H.: All Languages Matter: Evaluating LMMs on Culturally...
     with: Radionova, E.: RoomTour3D: Geometry-Aware Video-Instruction Tuning for...
     with: Raja, K.: Joint pose estimation and action recognition in image graphs
     with: Ramanan, D.: Guest Editorial: Video Recognition
     with: Ridzuan, M.: All Languages Matter: Evaluating LMMs on Culturally Diver...
     with: Rodriguez, M.D.: Data-driven crowd analysis in videos
     with: Rodriguez, M.D.: Density-aware person detection and tracking in crowds
     with: Romero, J.: Learning from Synthetic Humans
     with: Rozenfeld, B.: Learning realistic human actions from movies
     with: Russell, B.: BodyNet: Volumetric Inference of 3D Human Body Shapes
     with: Rybalchenko, D.: PairDETR: Joint Detection and Association of Human Bo...
     with: Saad, M.: All Languages Matter: Evaluating LMMs on Culturally Diverse ...
     with: Sanjeev, S.: All Languages Matter: Evaluating LMMs on Culturally Diver...
     with: Sasikumar, N.: All Languages Matter: Evaluating LMMs on Culturally Div...
     with: Schmid, C.: Actions in context
     with: Schmid, C.: Airbert: In-Domain Pretraining for Vision-and-Language Nav...
     with: Schmid, C.: AlignSDF: Pose-Aligned Signed Distance Fields for Hand-Obj...
     with: Schmid, C.: BodyNet: Volumetric Inference of 3D Human Body Shapes
     with: Schmid, C.: Detecting Unseen Visual Relations Using Analogies
     with: Schmid, C.: Evaluation of local spatio-temporal features for action re...
     with: Schmid, C.: Finding Actors and Actions in Movies
     with: Schmid, C.: gSDF: Geometry-Driven Signed Distance Functions for 3D Han...
     with: Schmid, C.: Just Ask: Learning to Answer Questions from Millions of Na...
     with: Schmid, C.: Learning from Synthetic Humans
     with: Schmid, C.: Learning from Unlabeled 3D Environments for Vision-and-Lan...
     with: Schmid, C.: Learning Joint Reconstruction of Hands and Manipulated Obj...
     with: Schmid, C.: Learning realistic human actions from movies
     with: Schmid, C.: Learning to Answer Visual Questions From Web Videos
     with: Schmid, C.: Leveraging Photometric Consistency Over Time for Sparsely ...
     with: Schmid, C.: Long-Term Temporal Convolutions for Action Recognition
     with: Schmid, C.: P-CNN: Pose-Based CNN Features for Action Recognition
     with: Schmid, C.: Segmenter: Transformer for Semantic Segmentation
     with: Schmid, C.: SUGAR: Pre-training 3D Visual Representations for Robotics
     with: Schmid, C.: Synthetic Humans for Action Recognition from Unseen Viewpo...
     with: Schmid, C.: Think Global, Act Local: Dual-scale Graph Transformer for ...
     with: Schmid, C.: Towards Unconstrained Joint Hand-Object Reconstruction Fro...
     with: Schmid, C.: TubeDETR: Spatio-Temporal Video Grounding with Transformers
     with: Schmid, C.: Unsupervised object discovery and localization in the wild...
     with: Schmid, C.: Unsupervised Object Discovery and Tracking in Video Collec...
     with: Schmid, C.: Vid2Seq: Large-Scale Pretraining of a Visual Language Mode...
     with: Schmid, C.: Weakly Supervised Action Labeling in Videos under Ordering...
     with: Schmid, C.: Weakly-Supervised Alignment of Video with Text
     with: Schmid, C.: Weakly-Supervised Learning of Visual Relations
     with: Schuldt, C.: Local velocity-adapted motion events for spatio-temporal ...
     with: Schuldt, C.: Recognizing human actions: a local SVM approach
     with: Sedlar, J.: Estimating 3D Motion and Forces of Human-Object Interactio...
     with: Sedlar, J.: Estimating 3D Motion and Forces of Person-Object Interacti...
     with: Seguin, G.: Instance-Level Video Segmentation from Object Tracks
     with: Seguin, G.: Pose Estimation and Segmentation of Multiple People in Ste...
     with: Seguin, G.: Pose Estimation and Segmentation of People in 3D Movies
     with: Seo, M.: Deep Metric Learning Beyond Binary Supervision
     with: Seo, P.H.: Vid2Seq: Large-Scale Pretraining of a Visual Language Model...
     with: Shah, M.: All Languages Matter: Evaluating LMMs on Culturally Diverse ...
     with: Shah, M.: THUMOS challenge on action recognition for videos 'in the wi...
     with: Shaker, A.M.: All Languages Matter: Evaluating LMMs on Culturally Dive...
     with: Shakya, P.: All Languages Matter: Evaluating LMMs on Culturally Divers...
     with: Shtanchaev, A.: All Languages Matter: Evaluating LMMs on Culturally Di...
     with: Sigurdsson, G.A.: Hollywood in Homes: Crowdsourcing Data Collection fo...
     with: Singh, H.: All Languages Matter: Evaluating LMMs on Culturally Diverse...
     with: Sivic, J.: Automatic Annotation of Human Actions in Video
     with: Sivic, J.: Cross-Task Weakly Supervised Learning From Instructional Vi...
     with: Sivic, J.: Data-driven crowd analysis in videos
     with: Sivic, J.: Density-aware person detection and tracking in crowds
     with: Sivic, J.: Detecting Unseen Visual Relations Using Analogies
     with: Sivic, J.: End-to-End Learning of Visual Representations From Uncurate...
     with: Sivic, J.: Estimating 3D Motion and Forces of Human-Object Interaction...
     with: Sivic, J.: Estimating 3D Motion and Forces of Person-Object Interactio...
     with: Sivic, J.: Finding Actors and Actions in Movies
     with: Sivic, J.: GenHowTo: Learning to Generate Actions and State Transforma...
     with: Sivic, J.: Guest Editorial: Video Recognition
     with: Sivic, J.: Is object localization for free? - Weakly-supervised learni...
     with: Sivic, J.: Joint Discovery of Object States and Manipulation Actions
     with: Sivic, J.: Just Ask: Learning to Answer Questions from Millions of Nar...
     with: Sivic, J.: Learning Actionness via Long-range Temporal Order Verificat...
     with: Sivic, J.: Learning and Transferring Mid-level Image Representations U...
     with: Sivic, J.: Learning from Narrated Instruction Videos
     with: Sivic, J.: Learning from Video and Text via Large-Scale Discriminative...
     with: Sivic, J.: Learning to Answer Visual Questions From Web Videos
     with: Sivic, J.: Look for the Change: Learning Object States and State-Modif...
     with: Sivic, J.: Multi-Task Learning of Object States and State-Modifying Ac...
     with: Sivic, J.: On pairwise costs for network flow multi-object tracking
     with: Sivic, J.: People Watching: Human Actions as a Cue for Single View Geo...
     with: Sivic, J.: Pose Estimation and Segmentation of Multiple People in Ster...
     with: Sivic, J.: Pose Estimation and Segmentation of People in 3D Movies
     with: Sivic, J.: Predicting Actions from Static Scenes
     with: Sivic, J.: Recognizing human actions in still images: A study of bag-o...
     with: Sivic, J.: Scene Semantics from Long-Term Observation of People
     with: Sivic, J.: ShowHowTo: Generating Scene-Conditioned Step-by-Step Visual...
     with: Sivic, J.: Thinking Fast and Slow: Efficient Text-to-Visual Retrieval ...
     with: Sivic, J.: TubeDETR: Spatio-Temporal Video Grounding with Transformers
     with: Sivic, J.: Unsupervised Learning from Narrated Instruction Videos
     with: Sivic, J.: Vid2Seq: Large-Scale Pretraining of a Visual Language Model...
     with: Sivic, J.: Weakly Supervised Action Labeling in Videos under Ordering ...
     with: Sivic, J.: Weakly-Supervised Learning of Visual Relations
     with: Smaira, L.: End-to-End Learning of Visual Representations From Uncurat...
     with: Solorio, T.: All Languages Matter: Evaluating LMMs on Culturally Diver...
     with: Soucek, T.: GenHowTo: Learning to Generate Actions and State Transform...
     with: Soucek, T.: Look for the Change: Learning Object States and State-Modi...
     with: Soucek, T.: Multi-Task Learning of Object States and State-Modifying A...
     with: Soucek, T.: ShowHowTo: Generating Scene-Conditioned Step-by-Step Visua...
     with: Srivastava, A.: All Languages Matter: Evaluating LMMs on Culturally Di...
     with: Steger, C.T.: Automatic Extraction of Roads from Aerial Images Based o...
     with: Steger, C.T.: Automatic Road Extraction Based on Multi-Scale Modeling,...
     with: Stentiford, F.W.M.: Video copy detection: a comparative study
     with: Strudel, R.: Segmenter: Transformer for Semantic Segmentation
     with: Sukthankar, R.: THUMOS challenge on action recognition for videos 'in ...
     with: Tapaswi, M.: Airbert: In-Domain Pretraining for Vision-and-Language Na...
     with: Tapaswi, M.: HowTo100M: Learning a Text-Video Embedding by Watching Hu...
     with: Tapaswi, M.: Learning from Unlabeled 3D Environments for Vision-and-La...
     with: Tapaswi, M.: Learning Interactions and Relationships Between Movie Cha...
     with: Tapaswi, M.: Long term spatio-temporal modeling for action detection
     with: Tapaswi, M.: Think Global, Act Local: Dual-scale Graph Transformer for...
     with: Targhi, A.T.: Modeling Image Context Using Object Centered Grid
     with: Tekin, B.: Leveraging Photometric Consistency Over Time for Sparsely S...
     with: Thawakar, O.: All Languages Matter: Evaluating LMMs on Culturally Dive...
     with: Toyin, H.: All Languages Matter: Evaluating LMMs on Culturally Diverse...
     with: Tran, D.: Editor's Note: Special Issue on ACCV 2024
     with: Tzionas, D.: Learning Joint Reconstruction of Hands and Manipulated Ob...
     with: Ullah, M.M.: Actlets: A novel local representation for human action re...
     with: Ullah, M.M.: Evaluation of local spatio-temporal features for action r...
     with: Ullah, M.M.: Improving Bag-of-features Action Recognition with Non-loc...
     with: Varol, G.: BodyNet: Volumetric Inference of 3D Human Body Shapes
     with: Varol, G.: Hollywood in Homes: Crowdsourcing Data Collection for Activ...
     with: Varol, G.: Learning from Synthetic Humans
     with: Varol, G.: Learning Joint Reconstruction of Hands and Manipulated Obje...
     with: Varol, G.: Long-Term Temporal Convolutions for Action Recognition
     with: Varol, G.: Synthetic Humans for Action Recognition from Unseen Viewpoi...
     with: Varol, G.: Towards Unconstrained Joint Hand-Object Reconstruction From...
     with: Vayani, A.: All Languages Matter: Evaluating LMMs on Culturally Divers...
     with: Vedaldi, A.: Editorial: Deep Learning for Computer Vision
     with: Vu, T.H.: Context-Aware CNNs for Person Head Detection
     with: Vu, T.H.: Predicting Actions from Static Scenes
     with: Wang, H.: Evaluation of local spatio-temporal features for action reco...
     with: Wang, X.G.: Editorial: Deep Learning for Computer Vision
     with: Wang, X.L.: Hollywood in Homes: Crowdsourcing Data Collection for Acti...
     with: Watawana, H.: All Languages Matter: Evaluating LMMs on Culturally Dive...
     with: Wills, J.: Periodic Motion Detection and Segmentation via Approximate ...
     with: Wray, M.: GenHowTo: Learning to Generate Actions and State Transformat...
     with: Wray, M.: ShowHowTo: Generating Scene-Conditioned Step-by-Step Visual ...
     with: Xavier, N.: All Languages Matter: Evaluating LMMs on Culturally Divers...
     with: Yan, S.C.: Editorial: Deep Learning for Computer Vision
     with: Yang, A.: Just Ask: Learning to Answer Questions from Millions of Narr...
     with: Yang, A.: Learning to Answer Visual Questions From Web Videos
     with: Yang, A.: TubeDETR: Spatio-Temporal Video Grounding with Transformers
     with: Yang, A.: Vid2Seq: Large-Scale Pretraining of a Visual Language Model ...
     with: Yang, J.: BodyNet: Volumetric Inference of 3D Human Body Shapes
     with: Yao, A.: Editor's Note: Special Issue on ACCV 2024
     with: Yuille, A.L.: Editorial: Deep Learning for Computer Vision
     with: Yumer, M.E.: BodyNet: Volumetric Inference of 3D Human Body Shapes
     with: Zagoruyko, S.: PairDETR: Joint Detection and Association of Human Bodi...
     with: Zamir, A.R.: THUMOS challenge on action recognition for videos 'in the...
     with: Zha, H.B.: Editor's Note: Special Issue on ACCV 2024
     with: Zhang, J.Y.: RoomTour3D: Geometry-Aware Video-Instruction Tuning for E...
     with: Zhang, M.: All Languages Matter: Evaluating LMMs on Culturally Diverse...
     with: Zhukov, D.: Cross-Task Weakly Supervised Learning From Instructional V...
     with: Zhukov, D.: HowTo100M: Learning a Text-Video Embedding by Watching Hun...
     with: Zhukov, D.: Learning Actionness via Long-range Temporal Order Verifica...
     with: Zhumakhanova, K.: All Languages Matter: Evaluating LMMs on Culturally ...
     with: Zhumakhanova, K.: RoomTour3D: Geometry-Aware Video-Instruction Tuning ...
     with: Zisserman, A.: End-to-End Learning of Visual Representations From Uncu...
     with: Zisserman, A.: Synthetic Humans for Action Recognition from Unseen Vie...
     with: Zisserman, A.: Thinking Fast and Slow: Efficient Text-to-Visual Retrie...
413 for Laptev, I.

Laptin, M. Standard Author Listing
     with: Bhide, S.: Reinforcement learning for instance segmentation with high-...
     with: Hilt, P.: Reinforcement learning for instance segmentation with high-l...
     with: Kaziakhmedov, E.: Reinforcement learning for instance segmentation wit...
     with: Kreshuk, A.: Reinforcement learning for instance segmentation with hig...
     with: Pape, C.: Reinforcement learning for instance segmentation with high-l...
     with: Zarvandi, M.: Reinforcement learning for instance segmentation with hi...

Last update:13-Jun-26 21:12:51
Use price@usc.edu for comments.