Laptev, D.
Standard Author Listing
with: Buhmann, J.M.: Convolutional Decision Trees for Feature Learning and S...
with: Buhmann, J.M.: TI-POOLING: Transformation-Invariant Pooling for Featur...
with: Buhmann, J.M.: Transformation-Invariant Convolutional Jungles
with: Pollefeys, M.: TI-POOLING: Transformation-Invariant Pooling for Featur...
with: Savinov, N.: TI-POOLING: Transformation-Invariant Pooling for Feature ...
Laptev, I.
Standard Author Listing
with: Agrawal, N.: Learning from Narrated Instruction Videos
with: Agrawal, N.: Unsupervised Learning from Narrated Instruction Videos
with: Akbarzadeh, A.: Galilean-diagonalized spatio-temporal interest operators
with: Alahari, K.: Pose Estimation and Segmentation of Multiple People in St...
with: Alahari, K.: Pose Estimation and Segmentation of People in 3D Movies
with: Alayrac, J.: End-to-End Learning of Visual Representations From Uncura...
with: Alayrac, J.: HowTo100M: Learning a Text-Video Embedding by Watching Hu...
with: Alayrac, J.B.: Cross-Task Weakly Supervised Learning From Instructiona...
with: Alayrac, J.B.: HowTo100M: Learning a Text-Video Embedding by Watching ...
with: Alayrac, J.B.: Joint Discovery of Object States and Manipulation Actions
with: Alayrac, J.B.: Learning Actionness via Long-range Temporal Order Verif...
with: Alayrac, J.B.: Learning from Narrated Instruction Videos
with: Alayrac, J.B.: Learning from Video and Text via Large-Scale Discrimina...
with: Alayrac, J.B.: Look for the Change: Learning Object States and State-M...
with: Alayrac, J.B.: Thinking Fast and Slow: Efficient Text-to-Visual Retrie...
with: Alayrac, J.B.: Unsupervised Learning from Narrated Instruction Videos
with: Audibert, J.Y.: Data-driven crowd analysis in videos
with: Audibert, J.Y.: Density-aware person detection and tracking in crowds
with: Azizpour, H.: Object Detection Using Strongly-Supervised Deformable Pa...
with: Bach, F.: Automatic Annotation of Human Actions in Video
with: Bach, F.: Finding Actors and Actions in Movies
with: Bach, F.: Weakly Supervised Action Labeling in Videos under Ordering C...
with: Bach, F.: Weakly-Supervised Alignment of Video with Text
with: Baumgartner, A.: Automatic Extraction of Roads from Aerial Images Base...
with: Baumgartner, A.: Automatic Road Extraction Based on Multi-Scale Modeli...
with: Baumgartner, A.: Multi-Scale and Snakes for Automatic Road Extraction
with: Belongie, S.J.: Periodic Motion Detection and Segmentation via Approxi...
with: Black, M.J.: Learning from Synthetic Humans
with: Black, M.J.: Learning Joint Reconstruction of Hands and Manipulated Ob...
with: Bogo, F.: Leveraging Photometric Consistency Over Time for Sparsely Su...
with: Bojanowski, P.: Finding Actors and Actions in Movies
with: Bojanowski, P.: Instance-Level Video Segmentation from Object Tracks
with: Bojanowski, P.: Learning from Narrated Instruction Videos
with: Bojanowski, P.: Learning from Video and Text via Large-Scale Discrimin...
with: Bojanowski, P.: Unsupervised Learning from Narrated Instruction Videos
with: Bojanowski, P.: Weakly Supervised Action Labeling in Videos under Orde...
with: Bojanowski, P.: Weakly-Supervised Alignment of Video with Text
with: Bottou, L.: Is object localization for free? - Weakly-supervised learn...
with: Bottou, L.: Learning and Transferring Mid-level Image Representations ...
with: Boujemaa, N.: Video copy detection: a comparative study
with: Bretzner, L.: Hand gesture recognition using multi-scale colour featur...
with: Buisson, O.: Video copy detection: a comparative study
with: Caputo, B.: Local velocity-adapted motion events for spatio-temporal r...
with: Caputo, B.: Recognizing human actions: a local SVM approach
with: Carpentier, J.: Estimating 3D Motion and Forces of Human-Object Intera...
with: Carpentier, J.: Estimating 3D Motion and Forces of Person-Object Inter...
with: Ceylan, D.: BodyNet: Volumetric Inference of 3D Human Body Shapes
with: Chari, V.: On pairwise costs for network flow multi-object tracking
with: Charon, G.: P-CNN: Pose-Based CNN Features for Action Recognition
with: Chen, L.: Video copy detection: a comparative study
with: Chen, S.Z.: Airbert: In-Domain Pretraining for Vision-and-Language Nav...
with: Chen, S.Z.: Learning from Unlabeled 3D Environments for Vision-and-Lan...
with: Chen, S.Z.: Think Global, Act Local: Dual-scale Graph Transformer for ...
with: Chen, Z.: AlignSDF: Pose-Aligned Signed Distance Fields for Hand-Objec...
with: Chigorin, A.: MobileFace: 3D Face Reconstruction with Efficient CNN Re...
with: Chinaev, N.: MobileFace: 3D Face Reconstruction with Efficient CNN Reg...
with: Cho, M.: Deep Metric Learning Beyond Binary Supervision
with: Cho, M.: Unsupervised object discovery and localization in the wild: P...
with: Cho, M.: Unsupervised Object Discovery and Tracking in Video Collections
with: Cho, M.S.: ContextLocNet: Context-Aware Deep Network Models for Weakly...
with: Cho, M.S.: Thin-Slicing for Pose: Learning to Understand Pose without ...
with: Cinbis, R.G.: Cross-Task Weakly Supervised Learning From Instructional...
with: Damen, D.: Action Modifiers: Learning From Adverbs in Instructional Vi...
with: Delaitre, V.: People Watching: Human Actions as a Cue for Single View ...
with: Delaitre, V.: Recognizing human actions in still images: A study of ba...
with: Delaitre, V.: Scene Semantics from Long-Term Observation of People
with: Dexter, E.: Cross-View Action Recognition from Temporal Self-similarit...
with: Dexter, E.: Multi-view synchronization of human actions and dynamic sc...
with: Dexter, E.: View-Independent Action Recognition from Temporal Self-Sim...
with: Doughty, H.: Action Modifiers: Learning From Adverbs in Instructional ...
with: Duchenne, O.: Automatic Annotation of Human Actions in Video
with: Eckstein, W.: Automatic Extraction of Roads from Aerial Images Based o...
with: Efros, A.A.: People Watching: Human Actions as a Cue for Single View G...
with: Efros, A.A.: Scene Semantics from Long-Term Observation of People
with: Farhadi, A.: Hollywood in Homes: Crowdsourcing Data Collection for Act...
with: Fouhey, D.: Cross-Task Weakly Supervised Learning From Instructional V...
with: Fouhey, D.F.: People Watching: Human Actions as a Cue for Single View ...
with: Fouhey, D.F.: Scene Semantics from Long-Term Observation of People
with: Garcia, R.: Segmenter: Transformer for Semantic Segmentation
with: Girshick, R.: Editorial: Deep Learning for Computer Vision
with: Gorban, A.: THUMOS challenge on action recognition for videos 'in the ...
with: Gouet Brunet, V.: Video copy detection: a comparative study
with: Grave, E.: Weakly-Supervised Alignment of Video with Text
with: Guhur, P.L.: Airbert: In-Domain Pretraining for Vision-and-Language Na...
with: Guhur, P.L.: Learning from Unlabeled 3D Environments for Vision-and-La...
with: Guhur, P.L.: Think Global, Act Local: Dual-scale Graph Transformer for...
with: Gupta, A.: Hollywood in Homes: Crowdsourcing Data Collection for Activ...
with: Gupta, A.: People Watching: Human Actions as a Cue for Single View Geo...
with: Gupta, A.: Scene Semantics from Long-Term Observation of People
with: Hasson, Y.: AlignSDF: Pose-Aligned Signed Distance Fields for Hand-Obj...
with: Hasson, Y.: Learning Joint Reconstruction of Hands and Manipulated Obj...
with: Hasson, Y.: Leveraging Photometric Consistency Over Time for Sparsely ...
with: Hasson, Y.: Towards Unconstrained Joint Hand-Object Reconstruction Fro...
with: Idrees, H.: THUMOS challenge on action recognition for videos 'in the ...
with: Jiang, Y.G.: THUMOS challenge on action recognition for videos 'in the...
with: Joly, A.: Video copy detection: a comparative study
with: Junejo, I.N.: Cross-View Action Recognition from Temporal Self-similar...
with: Junejo, I.N.: View-Independent Action Recognition from Temporal Self-S...
with: Kalevatykh, I.: Learning Joint Reconstruction of Hands and Manipulated...
with: Kantorov, V.: ContextLocNet: Context-Aware Deep Network Models for Wea...
with: Kantorov, V.: Efficient Feature Extraction, Encoding, and Classificati...
with: Kim, S.Y.: Deep Metric Learning Beyond Binary Supervision
with: Klaser, A.: Evaluation of local spatio-temporal features for action re...
with: Kokkinos, I.: Editorial: Deep Learning for Computer Vision
with: Kukleva, A.: Learning Interactions and Relationships Between Movie Cha...
with: Kumar, V.: Long term spatio-temporal modeling for action detection
with: Kwak, S.: Deep Metric Learning Beyond Binary Supervision
with: Kwak, S.: Thin-Slicing for Pose: Learning to Understand Pose without E...
with: Kwak, S.: Unsupervised object discovery and localization in the wild: ...
with: Kwak, S.: Unsupervised Object Discovery and Tracking in Video Collecti...
with: Lacoste Julien, S.: Joint Discovery of Object States and Manipulation ...
with: Lacoste Julien, S.: Learning from Narrated Instruction Videos
with: Lacoste Julien, S.: On pairwise costs for network flow multi-object tr...
with: Lacoste Julien, S.: Unsupervised Learning from Narrated Instruction Vi...
with: Lajugie, R.: Instance-Level Video Segmentation from Object Tracks
with: Lajugie, R.: Weakly Supervised Action Labeling in Videos under Orderin...
with: Lajugie, R.: Weakly-Supervised Alignment of Video with Text
with: Law To, J.: Video copy detection: a comparative study
with: Li, Z.M.: Estimating 3D Motion and Forces of Human-Object Interactions...
with: Li, Z.M.: Estimating 3D Motion and Forces of Person-Object Interaction...
with: Lindeberg, T.: Automatic Extraction of Roads from Aerial Images Based ...
with: Lindeberg, T.: Distance Measure and a Feature Likelihood Map Concept f...
with: Lindeberg, T.: Galilean-diagonalized spatio-temporal interest operators
with: Lindeberg, T.: Hand gesture recognition using multi-scale colour featu...
with: Lindeberg, T.: Interest Point Detection and Scale Selection in Space-T...
with: Lindeberg, T.: Local Descriptors for Spatio-temporal Recognition
with: Lindeberg, T.: Local velocity-adapted motion events for spatio-tempora...
with: Lindeberg, T.: multi-scale feature likelihood map for direct evaluatio...
with: Lindeberg, T.: Recognizing human actions: a local SVM approach
with: Lindeberg, T.: Space-time interest points
with: Lindeberg, T.: Tracking of multi-state hand models using particle filt...
with: Lindeberg, T.: Velocity adaptation of space-time interest points
with: Lindeberg, T.: Velocity adaptation of spatio-temporal receptive fields...
with: Mahmood, N.: Learning from Synthetic Humans
with: Malik, J.: Editorial: Deep Learning for Computer Vision
with: Mansard, N.: Estimating 3D Motion and Forces of Human-Object Interacti...
with: Mansard, N.: Estimating 3D Motion and Forces of Person-Object Interact...
with: Marszalek, M.: Actions in context
with: Marszalek, M.: Learning realistic human actions from movies
with: Martin, X.: Learning from Synthetic Humans
with: Mayer, H.: Automatic Extraction of Roads from Aerial Images Based on S...
with: Mayer, H.: Automatic Road Extraction Based on Multi-Scale Modeling, Co...
with: Mayer, H.: Multi-Scale and Snakes for Automatic Road Extraction
with: Mayol Cuevas, W.: Action Modifiers: Learning From Adverbs in Instructi...
with: Miech, A.: End-to-End Learning of Visual Representations From Uncurate...
with: Miech, A.: HowTo100M: Learning a Text-Video Embedding by Watching Hund...
with: Miech, A.: Just Ask: Learning to Answer Questions from Millions of Nar...
with: Miech, A.: Learning from Video and Text via Large-Scale Discriminative...
with: Miech, A.: Look for the Change: Learning Object States and State-Modif...
with: Miech, A.: Thinking Fast and Slow: Efficient Text-to-Visual Retrieval ...
with: Miech, A.: TubeDETR: Spatio-Temporal Video Grounding with Transformers
with: Oisel, L.: Joint pose estimation and action recognition in image graphs
with: Oliva, A.: Predicting Actions from Static Scenes
with: Olsson, C.: Predicting Actions from Static Scenes
with: Oquab, M.: ContextLocNet: Context-Aware Deep Network Models for Weakly...
with: Oquab, M.: Is object localization for free? - Weakly-supervised learni...
with: Oquab, M.: Learning and Transferring Mid-level Image Representations U...
with: Osokin, A.: Context-Aware CNNs for Person Head Detection
with: Papandreou, G.: Editorial: Deep Learning for Computer Vision
with: Parizi, S.N.: Improving Bag-of-features Action Recognition with Non-lo...
with: Parizi, S.N.: Modeling Image Context Using Object Centered Grid
with: Perez, P.: Cross-View Action Recognition from Temporal Self-similarities
with: Perez, P.: Joint pose estimation and action recognition in image graphs
with: Perez, P.: Multi-view synchronization of human actions and dynamic sce...
with: Perez, P.: Periodic Motion Detection and Segmentation via Approximate ...
with: Perez, P.: Retrieving actions in movies
with: Perez, P.: View-Independent Action Recognition from Temporal Self-Simi...
with: Peyre, J.: Detecting Unseen Visual Relations Using Analogies
with: Peyre, J.: Weakly-Supervised Learning of Visual Relations
with: Pollefeys, M.: Leveraging Photometric Consistency Over Time for Sparse...
with: Ponce, J.: Automatic Annotation of Human Actions in Video
with: Ponce, J.: Finding Actors and Actions in Movies
with: Ponce, J.: Unsupervised object discovery and localization in the wild:...
with: Ponce, J.: Unsupervised Object Discovery and Tracking in Video Collect...
with: Ponce, J.: Weakly Supervised Action Labeling in Videos under Ordering ...
with: Ponce, J.: Weakly-Supervised Alignment of Video with Text
with: Raja, K.: Joint pose estimation and action recognition in image graphs
with: Ramanan, D.: Guest Editorial: Video Recognition
with: Rodriguez, M.D.: Data-driven crowd analysis in videos
with: Rodriguez, M.D.: Density-aware person detection and tracking in crowds
with: Romero, J.: Learning from Synthetic Humans
with: Rozenfeld, B.: Learning realistic human actions from movies
with: Russell, B.: BodyNet: Volumetric Inference of 3D Human Body Shapes
with: Schmid, C.: Actions in context
with: Schmid, C.: Airbert: In-Domain Pretraining for Vision-and-Language Nav...
with: Schmid, C.: AlignSDF: Pose-Aligned Signed Distance Fields for Hand-Obj...
with: Schmid, C.: BodyNet: Volumetric Inference of 3D Human Body Shapes
with: Schmid, C.: Detecting Unseen Visual Relations Using Analogies
with: Schmid, C.: Evaluation of local spatio-temporal features for action re...
with: Schmid, C.: Finding Actors and Actions in Movies
with: Schmid, C.: Just Ask: Learning to Answer Questions from Millions of Na...
with: Schmid, C.: Learning from Synthetic Humans
with: Schmid, C.: Learning from Unlabeled 3D Environments for Vision-and-Lan...
with: Schmid, C.: Learning Joint Reconstruction of Hands and Manipulated Obj...
with: Schmid, C.: Learning realistic human actions from movies
with: Schmid, C.: Leveraging Photometric Consistency Over Time for Sparsely ...
with: Schmid, C.: Long-Term Temporal Convolutions for Action Recognition
with: Schmid, C.: P-CNN: Pose-Based CNN Features for Action Recognition
with: Schmid, C.: Segmenter: Transformer for Semantic Segmentation
with: Schmid, C.: Synthetic Humans for Action Recognition from Unseen Viewpo...
with: Schmid, C.: Think Global, Act Local: Dual-scale Graph Transformer for ...
with: Schmid, C.: Towards Unconstrained Joint Hand-Object Reconstruction Fro...
with: Schmid, C.: TubeDETR: Spatio-Temporal Video Grounding with Transformers
with: Schmid, C.: Unsupervised object discovery and localization in the wild...
with: Schmid, C.: Unsupervised Object Discovery and Tracking in Video Collec...
with: Schmid, C.: Weakly Supervised Action Labeling in Videos under Ordering...
with: Schmid, C.: Weakly-Supervised Alignment of Video with Text
with: Schmid, C.: Weakly-Supervised Learning of Visual Relations
with: Schuldt, C.: Local velocity-adapted motion events for spatio-temporal ...
with: Schuldt, C.: Recognizing human actions: a local SVM approach
with: Sedlar, J.: Estimating 3D Motion and Forces of Human-Object Interactio...
with: Sedlar, J.: Estimating 3D Motion and Forces of Person-Object Interacti...
with: Seguin, G.: Instance-Level Video Segmentation from Object Tracks
with: Seguin, G.: Pose Estimation and Segmentation of Multiple People in Ste...
with: Seguin, G.: Pose Estimation and Segmentation of People in 3D Movies
with: Seo, M.: Deep Metric Learning Beyond Binary Supervision
with: Shah, M.: THUMOS challenge on action recognition for videos 'in the wi...
with: Sigurdsson, G.A.: Hollywood in Homes: Crowdsourcing Data Collection fo...
with: Sivic, J.: Automatic Annotation of Human Actions in Video
with: Sivic, J.: Cross-Task Weakly Supervised Learning From Instructional Vi...
with: Sivic, J.: Data-driven crowd analysis in videos
with: Sivic, J.: Density-aware person detection and tracking in crowds
with: Sivic, J.: Detecting Unseen Visual Relations Using Analogies
with: Sivic, J.: End-to-End Learning of Visual Representations From Uncurate...
with: Sivic, J.: Estimating 3D Motion and Forces of Human-Object Interaction...
with: Sivic, J.: Estimating 3D Motion and Forces of Person-Object Interactio...
with: Sivic, J.: Finding Actors and Actions in Movies
with: Sivic, J.: Guest Editorial: Video Recognition
with: Sivic, J.: Is object localization for free? - Weakly-supervised learni...
with: Sivic, J.: Joint Discovery of Object States and Manipulation Actions
with: Sivic, J.: Just Ask: Learning to Answer Questions from Millions of Nar...
with: Sivic, J.: Learning Actionness via Long-range Temporal Order Verificat...
with: Sivic, J.: Learning and Transferring Mid-level Image Representations U...
with: Sivic, J.: Learning from Narrated Instruction Videos
with: Sivic, J.: Learning from Video and Text via Large-Scale Discriminative...
with: Sivic, J.: Look for the Change: Learning Object States and State-Modif...
with: Sivic, J.: On pairwise costs for network flow multi-object tracking
with: Sivic, J.: People Watching: Human Actions as a Cue for Single View Geo...
with: Sivic, J.: Pose Estimation and Segmentation of Multiple People in Ster...
with: Sivic, J.: Pose Estimation and Segmentation of People in 3D Movies
with: Sivic, J.: Predicting Actions from Static Scenes
with: Sivic, J.: Recognizing human actions in still images: A study of bag-o...
with: Sivic, J.: Scene Semantics from Long-Term Observation of People
with: Sivic, J.: Thinking Fast and Slow: Efficient Text-to-Visual Retrieval ...
with: Sivic, J.: TubeDETR: Spatio-Temporal Video Grounding with Transformers
with: Sivic, J.: Unsupervised Learning from Narrated Instruction Videos
with: Sivic, J.: Weakly Supervised Action Labeling in Videos under Ordering ...
with: Sivic, J.: Weakly-Supervised Learning of Visual Relations
with: Smaira, L.: End-to-End Learning of Visual Representations From Uncurat...
with: Soucek, T.: Look for the Change: Learning Object States and State-Modi...
with: Steger, C.T.: Automatic Extraction of Roads from Aerial Images Based o...
with: Steger, C.T.: Automatic Road Extraction Based on Multi-Scale Modeling,...
with: Stentiford, F.W.M.: Video copy detection: a comparative study
with: Strudel, R.: Segmenter: Transformer for Semantic Segmentation
with: Sukthankar, R.: THUMOS challenge on action recognition for videos 'in ...
with: Tapaswi, M.: Airbert: In-Domain Pretraining for Vision-and-Language Na...
with: Tapaswi, M.: HowTo100M: Learning a Text-Video Embedding by Watching Hu...
with: Tapaswi, M.: Learning from Unlabeled 3D Environments for Vision-and-La...
with: Tapaswi, M.: Learning Interactions and Relationships Between Movie Cha...
with: Tapaswi, M.: Long term spatio-temporal modeling for action detection
with: Tapaswi, M.: Think Global, Act Local: Dual-scale Graph Transformer for...
with: Targhi, A.T.: Modeling Image Context Using Object Centered Grid
with: Tekin, B.: Leveraging Photometric Consistency Over Time for Sparsely S...
with: Tzionas, D.: Learning Joint Reconstruction of Hands and Manipulated Ob...
with: Ullah, M.M.: Actlets: A novel local representation for human action re...
with: Ullah, M.M.: Evaluation of local spatio-temporal features for action r...
with: Ullah, M.M.: Improving Bag-of-features Action Recognition with Non-loc...
with: Varol, G.: BodyNet: Volumetric Inference of 3D Human Body Shapes
with: Varol, G.: Hollywood in Homes: Crowdsourcing Data Collection for Activ...
with: Varol, G.: Learning from Synthetic Humans
with: Varol, G.: Learning Joint Reconstruction of Hands and Manipulated Obje...
with: Varol, G.: Long-Term Temporal Convolutions for Action Recognition
with: Varol, G.: Synthetic Humans for Action Recognition from Unseen Viewpoi...
with: Varol, G.: Towards Unconstrained Joint Hand-Object Reconstruction From...
with: Vedaldi, A.: Editorial: Deep Learning for Computer Vision
with: Vu, T.H.: Context-Aware CNNs for Person Head Detection
with: Vu, T.H.: Predicting Actions from Static Scenes
with: Wang, H.: Evaluation of local spatio-temporal features for action reco...
with: Wang, X.G.: Editorial: Deep Learning for Computer Vision
with: Wang, X.L.: Hollywood in Homes: Crowdsourcing Data Collection for Acti...
with: Wills, J.: Periodic Motion Detection and Segmentation via Approximate ...
with: Yan, S.C.: Editorial: Deep Learning for Computer Vision
with: Yang, A.: Just Ask: Learning to Answer Questions from Millions of Narr...
with: Yang, A.: TubeDETR: Spatio-Temporal Video Grounding with Transformers
with: Yang, J.: BodyNet: Volumetric Inference of 3D Human Body Shapes
with: Yuille, A.L.: Editorial: Deep Learning for Computer Vision
with: Yumer, E.: BodyNet: Volumetric Inference of 3D Human Body Shapes
with: Zamir, A.R.: THUMOS challenge on action recognition for videos 'in the...
with: Zhukov, D.: Cross-Task Weakly Supervised Learning From Instructional V...
with: Zhukov, D.: HowTo100M: Learning a Text-Video Embedding by Watching Hun...
with: Zhukov, D.: Learning Actionness via Long-range Temporal Order Verifica...
with: Zisserman, A.: End-to-End Learning of Visual Representations From Uncu...
with: Zisserman, A.: Synthetic Humans for Action Recognition from Unseen Vie...
with: Zisserman, A.: Thinking Fast and Slow: Efficient Text-to-Visual Retrie...
299 for Laptev, I.