17.1.3.6.1 Action Localization, Action Localisation

Chapter Contents (Back)
Localization. Action Localization.
See also Action Segmentation.

Yeo, C., Ahammad, P., Ramchandran, K., Sastry, S.S.,
High-Speed Action Recognition and Localization in Compressed Domain Videos,
CirSysVideo(18), No. 8, August 2008, pp. 1006-1015.
IEEE DOI 0809
BibRef

Nga, D.H.[Do Hang], Yanai, K.[Keiji],
Automatic extraction of relevant video shots of specific actions exploiting Web data,
CVIU(118), No. 1, 2014, pp. 2-15.
Elsevier DOI 1312
BibRef
Earlier:
Automatic collection of Web video shots corresponding to specific actions using Web images,
LSVSM12(15-20).
IEEE DOI 1207
BibRef
Earlier:
Automatic construction of an action video shot database using web videos,
ICCV11(527-534).
IEEE DOI 1201
Web video. Based on text tags. BibRef

Cho, J.C.[Jung-Chan], Lee, M.[Minsik], Chang, H.J.[Hyung Jin], Oh, S.H.[Song-Hwai],
Robust action recognition using local motion and group sparsity,
PR(47), No. 5, 2014, pp. 1813-1825.
Elsevier DOI 1402
Action recognition BibRef

Wang, G.F.[Guo-Feng], Qin, X.Y.[Xue-Ying], Zhong, F.[Fan], Liu, Y.[Yue], Li, H.B.[Hong-Bo], Peng, Q.S.[Qun-Sheng], Yang, M.H.[Ming-Hsuan],
Visual Tracking via Sparse and Local Linear Coding,
IP(24), No. 11, November 2015, pp. 3796-3809.
IEEE DOI 1509
image coding BibRef
Earlier: A1, A3, A4, A6, A2, Only:
Visual Tracking in Continuous Appearance Space via Sparse Coding,
ACCV12(III:57-70).
Springer DOI 1304

See also Visual Tracking via Temporally Smooth Sparse Coding.
See also Visual Tracking via Coarse and Fine Structural Local Sparse Appearance Models. BibRef

Qi, Y.K.[Yuan-Kai], Qin, L.[Lei], Zhang, J.[Jian], Zhang, S.P.[Sheng-Ping], Huang, Q.M.[Qing-Ming], Yang, M.H.[Ming-Hsuan],
Structure-Aware Local Sparse Coding for Visual Tracking,
IP(27), No. 8, August 2018, pp. 3857-3869.
IEEE DOI 1806
image coding, image representation, image sequences, object tracking, target tracking, dictionary, template update BibRef

Jain, M.[Mihir], van Gemert, J.C.[Jan C.], Jégou, H.[Hervé], Bouthemy, P.[Patrick], Snoek, C.G.M.[Cees G. M.],
Tubelets: Unsupervised Action Proposals from Spatiotemporal Super-Voxels,
IJCV(124), No. 3, September 2017, pp. 287-311.
Springer DOI 1708
BibRef
Earlier:
Action Localization with Tubelets from Motion,
CVPR14(740-747)
IEEE DOI 1409
determine when and where certain actions appear. BibRef

Jain, M.[Mihir], van Gemert, J.C.[Jan C.], Snoek, C.G.M.[Cees G.M.],
What do 15,000 object categories tell us about classifying and localizing actions?,
CVPR15(46-55)
IEEE DOI 1510
BibRef

Yang, P.W.[Peng-Wan], Mettes, P.S.[Pascal S.], Snoek, C.G.M.[Cees G. M.],
Few-Shot Transformation of Common Actions into Time and Space,
CVPR21(16026-16035)
IEEE DOI 2111
Location awareness, Transformers, Pattern recognition, Noise measurement, Proposals BibRef

Mettes, P.S.[Pascal S.], Snoek, C.G.M.[Cees G. M.],
Spatial-Aware Object Embeddings for Zero-Shot Localization and Classification of Actions,
ICCV17(4453-4462)
IEEE DOI 1802
image classification, image motion analysis, object detection, object recognition, video signal processing, Trajectory BibRef

Mettes, P.S.[Pascal S.], Snoek, C.G.M.[Cees G. M.],
Pointly-Supervised Action Localization,
IJCV(127), No. 3, March 2019, pp. 263-281.
Springer DOI 1903
Localization by finding bounding boxes. BibRef

Mettes, P.S.[Pascal S.], van Gemert, J.C.[Jan C.], Snoek, C.G.M.[Cees G. M.],
Spot On: Action Localization from Pointly-Supervised Proposals,
ECCV16(V: 437-453).
Springer DOI 1611
BibRef

van Gemert, J.C.[Jan C.], Jain, M.[Mihir], Gati, E.[Ella], Snoek, C.G.M.[Cees G.M.],
APT: Action localization proposals from dense trajectories,
BMVC15(xx-yy).
DOI Link 1601
BibRef

Jain, M.[Mihir], van Gemert, J.C.[Jan C.], Mensink, T.[Thomas], Snoek, C.G.M.[Cees G.M.],
Objects2action: Classifying and Localizing Actions without Any Video Example,
ICCV15(4588-4596)
IEEE DOI 1602
Computational modeling BibRef

Soomro, K.[Khurram], Idrees, H.[Haroon], Shah, M.[Mubarak],
Online Localization and Prediction of Actions and Interactions,
PAMI(41), No. 2, February 2019, pp. 459-472.
IEEE DOI 1901
BibRef
Earlier:
Predicting the Where and What of Actors and Actions through Online Action Localization,
CVPR16(2648-2657)
IEEE DOI 1612
BibRef
Earlier:
Action Localization in Videos through Context Walk,
ICCV15(3280-3288)
IEEE DOI 1602
Videos, Support vector machines, Predictive models, Motion segmentation, Visualization, Training, Dynamic programming, structural SVM. Context BibRef

Soomro, K.[Khurram], Shah, M.[Mubarak],
Unsupervised Action Discovery and Localization in Videos,
ICCV17(696-705)
IEEE DOI 1802
directed graphs, feature extraction, image classification, image segmentation, knapsack problems, pattern clustering, Videos BibRef

Song, H., Wu, X., Zhu, B., Wu, Y., Chen, M., Jia, Y.,
Temporal Action Localization in Untrimmed Videos Using Action Pattern Trees,
MultMed(21), No. 3, March 2019, pp. 717-730.
IEEE DOI 1903
data mining, feature extraction, image motion analysis, image segmentation, learning (artificial intelligence), overlap loss function BibRef

Zeng, R.H.[Run-Hao], Gan, C.[Chuang], Chen, P.H.[Pei-Hao], Huang, W.B.[Wen-Bing], Wu, Q.Y.[Qing-Yao], Tan, M.K.[Ming-Kui],
Breaking Winner-Takes-All: Iterative-Winners-Out Networks for Weakly Supervised Temporal Action Localization,
IP(28), No. 12, December 2019, pp. 5797-5808.
IEEE DOI 1909
Videos, Training, Proposals, Image segmentation, Correlation, Object detection, Semantics, Weakly supervised learning, untrimmed video BibRef

Zhang, Y.Q.[Yong-Qiang], Ding, M.L.[Ming-Li], Bai, Y.C.[Yan-Cheng], Liu, D.D.[Dan-Dan], Ghanem, B.[Bernard],
Learning a strong detector for action localization in videos,
PRL(128), 2019, pp. 407-413.
Elsevier DOI 1912
Frame-level object detection, Deformable anchor cuboid, Action localization BibRef

Heilbron, F.C.[Fabian Caba], Lee, J.Y.[Joon-Young], Jin, H.L.[Hai-Lin], Ghanem, B.[Bernard],
What Do I Annotate Next? An Empirical Study of Active Learning for Action Localization,
ECCV18(XI: 212-229).
Springer DOI 1810
BibRef

Heilbron, F.C.[Fabian Caba], Barrios, W., Escorcia, V., Ghanem, B.[Bernard],
SCC: Semantic Context Cascade for Efficient Action Detection,
CVPR17(3175-3184)
IEEE DOI 1711
Computational modeling, Context modeling, Dogs, Legged locomotion, Proposals, Semantics, Video, sequences BibRef

Escorcia, V.[Victor], Heilbron, F.C.[Fabian Caba], Niebles, J.C.[Juan Carlos], Ghanem, B.[Bernard],
DAPs: Deep Action Proposals for Action Understanding,
ECCV16(III: 768-784).
Springer DOI 1611
BibRef

Heilbron, F.C.[Fabian Caba], Thabet, A.[Ali], Niebles, J.C.[Juan Carlos], Ghanem, B.[Bernard],
Camera Motion and Surrounding Scene Appearance as Context for Action Recognition,
ACCV14(IV: 583-597).
Springer DOI 1504
BibRef

Long, F.C.[Fu-Chen], Yao, T.[Ting], Qiu, Z.F.[Zhao-Fan], Tian, X.M.[Xin-Mei], Mei, T.[Tao], Luo, J.B.[Jie-Bo],
Coarse-to-Fine Localization of Temporal Action Proposals,
MultMed(22), No. 6, June 2020, pp. 1577-1590.
IEEE DOI 2005
BibRef
Earlier: A1, A2, A3, A4, A6, A5:
Gaussian Temporal Awareness Networks for Action Localization,
CVPR19(344-353).
IEEE DOI 2002
Proposals, Videos, Painting, Brushes, Task analysis, Feature extraction, Action Proposals, Action Recognition, Video Captioning BibRef

Kumar, N., Sukavanam, N.,
Weakly supervised deep network for spatiotemporal localization and detection of human actions in wild conditions,
VC(36), No. 9, September 2020, pp. 1809-1821.
Springer DOI 2008
BibRef

Yang, L., Peng, H., Zhang, D., Fu, J., Han, J.,
Revisiting Anchor Mechanisms for Temporal Action Localization,
IP(29), 2020, pp. 8535-8548.
IEEE DOI 2008
Temporal action localization, default anchor, anchor free, complementarity BibRef

Xu, W., Yu, J., Miao, Z., Wan, L., Ji, Q.,
Spatio-Temporal Deep Q-Networks for Human Activity Localization,
CirSysVideo(30), No. 9, September 2020, pp. 2984-2999.
IEEE DOI 2009
Proposals, Reinforcement learning, Activity recognition, Context modeling, Electron tubes, seq-to-seq model BibRef

Qin, X.L.[Xiao-Lei], Ge, Y.X.[Yong-Xin], Yu, H.[Hui], Chen, F.Y.[Fei-Yu], Yang, D.[Dan],
Spatial Enhancement and Temporal Constraint for Weakly Supervised Action Localization,
SPLetters(27), 2020, pp. 1520-1524.
IEEE DOI 2009
Training, Proposals, Feature extraction, Entropy, Signal processing, Signal processing algorithms, confidence connectivity enhancement BibRef

Yu, J.R.[Jia-Ruo], Ge, Y.X.[Yong-Xin], Qin, X.L.[Xiao-Lei], Li, Z.Q.[Zi-Qiang], Huang, S.[Sheng], Chen, F.Y.[Fei-Yu],
Deep feature enhancing and selecting network for weakly supervised temporal action localization,
JVCIR(80), 2021, pp. 103276.
Elsevier DOI 2110
Weakly supervised, Temporal action localization, Deep learning BibRef

Ge, Y.X.[Yong-Xin], Qin, X.L.[Xiao-Lei], Yang, D.[Dan], Jagersand, M.[Martin],
Deep snippet selective network for weakly supervised temporal action localization,
PR(110), 2021, pp. 107686.
Elsevier DOI 2011
Weak supervision, Temporal action localization, Erasing branches, Ternary mask, Background suppression branch BibRef

Li, Y.G.[Ye-Guang], Zhang, M.Y.[Ming-Yuan], Hu, L.[Liang], Li, J.[Jun], Wang, D.Q.[De-Qing],
Candidate region correlation for video action detection,
JVCIR(71), 2020, pp. 102818.
Elsevier DOI 2009
Deep learning, Action detection, Region correlation, Self-attention mechanism BibRef

Chen, P., Gan, C., Shen, G., Huang, W., Zeng, R., Tan, M.,
Relation Attention for Temporal Action Localization,
MultMed(22), No. 10, October 2020, pp. 2723-2733.
IEEE DOI 2009
Proposals, Feature extraction, Task analysis, Object detection, Deep learning, Sports, Semantics, Temporal action localization, relation attention BibRef

Zhang, S.W.[Shi-Wei], Song, L.[Lin], Gao, C.X.[Chang-Xin], Sang, N.[Nong],
GLNet: Global Local Network for Weakly Supervised Action Localization,
MultMed(22), No. 10, October 2020, pp. 2610-2622.
IEEE DOI 2009
Annotations, Proposals, Predictive models, Task analysis, Feature extraction, Electron tubes, Training, weakly supervised BibRef

Xu, L.[Liang], Wang, X.G.[Xing-Gang], Liu, W.Y.[Wen-Yu], Feng, B.[Bin],
Cascaded Boundary Network for High-Quality Temporal Action Proposal Generation,
CirSysVideo(30), No. 10, October 2020, pp. 3702-3713.
IEEE DOI 2010
Proposals, Videos, Feature extraction, Task analysis, Object detection, Visualization, Correlation, long short-term memory BibRef

Liu, X.L.[Xiao-Long], Sun, Y.C.[Yu-Chao], Lu, J.H.[Jiang-Hu], Yao, C.[Cong], Zhou, Y.[Yu],
Self-Similarity Action Proposal,
SPLetters(27), 2020, pp. 2064-2068.
IEEE DOI 2012
Proposals, Generators, Image segmentation, Sampling methods, Motion segmentation, Feature extraction, Visualization, temporal action localization BibRef

Su, R.[Rui], Xu, D.[Dong], Sheng, L., Ouyang, W.L.[Wan-Li],
PCG-TAL: Progressive Cross-Granularity Cooperation for Temporal Action Localization,
IP(30), 2021, pp. 2103-2113.
IEEE DOI 2102
image colour analysis, image motion analysis, learning (artificial intelligence), object detection, PCG-TAL, cross-stream cooperation BibRef

Su, R.[Rui], Ouyang, W.L.[Wan-Li], Zhou, L.P.[Lu-Ping], Xu, D.[Dong],
Improving Action Localization by Progressive Cross-Stream Cooperation,
CVPR19(12008-12017).
IEEE DOI 2002
BibRef

Ning, K., Xie, L., Liu, J., Wu, F., Tian, Q.,
Interaction-Integrated Network for Natural Language Moment Localization,
IP(30), 2021, pp. 2538-2548.
IEEE DOI 2102
Visualization, Semantics, Location awareness, Task analysis, Linguistics, Convolution, Data models, vision-language understanding BibRef

Zhang, X.Y.[Xiao-Yu], Shi, H.C.[Hai-Chao], Li, C.S.[Chang-Sheng], Li, P.[Peng], Li, Z.K.[Ze-Kun], Ren, P.[Peng],
Weakly-supervised action localization via embedding-modeling iterative optimization,
PR(113), 2021, pp. 107831.
Elsevier DOI 2103
Action recognition, Temporal action localization, Attention mechanism, Generative adversarial networks, Subspace embedding BibRef

Zhang, X.Y.[Xiao-Yu], Shi, H.C.[Hai-Chao], Li, C.S.[Chang-Sheng], Shi, X.C.[Xin-Chu],
Action Shuffling for Weakly Supervised Temporal Localization,
IP(31), 2022, pp. 4447-4457.
IEEE DOI 2207
Feature extraction, Location awareness, Training, Task analysis, Annotations, Semantics, Network architecture, intra-action BibRef

Shi, H.C.[Hai-Chao], Zhang, X.Y.[Xiao-Yu], Li, C.S.[Chang-Sheng],
StochasticFormer: Stochastic Modeling for Weakly Supervised Temporal Action Localization,
IP(32), 2023, pp. 1379-1389.
IEEE DOI 2303
Location awareness, Stochastic processes, Feature extraction, Videos, Transformers, Training, Annotations, stochastic process BibRef

Wang, B., Yang, L., Zhao, Y.,
POLO: Learning Explicit Cross-Modality Fusion for Temporal Action Localization,
SPLetters(28), 2021, pp. 503-507.
IEEE DOI 2103
Videos, Location awareness, Convolution, Training, Feature extraction, Task analysis, Kernel, Feature fusion, temporal action localization BibRef

Hu, Y.P.[Yu-Peng], Liu, M.[Meng], Su, X.O.[Xia-Obin], Gao, Z.[Zan], Nie, L.Q.[Li-Qiang],
Video Moment Localization via Deep Cross-Modal Hashing,
IP(30), 2021, pp. 4667-4677.
IEEE DOI 2105
BibRef

Huang, L.J.[Lin-Jiang], Huang, Y.[Yan], Ouyang, W.L.[Wan-Li], Wang, L.[Liang],
Modeling Sub-Actions for Weakly Supervised Temporal Action Localization,
IP(30), 2021, pp. 5154-5167.
IEEE DOI 2106
Location awareness, Hidden Markov models, Proposals, Task analysis, Prototypes, Annotations, Deep learning, Weakly supervised learning, sub-action modeling BibRef

Su, R.[Rui], Xu, D.[Dong], Zhou, L.P.[Lu-Ping], Ouyang, W.L.[Wan-Li],
Improving Weakly Supervised Temporal Action Localization by Exploiting Multi-Resolution Information in Temporal Domain,
IP(30), 2021, pp. 6659-6672.
IEEE DOI 2108
Videos, Location awareness, Task analysis, Feature extraction, Reliability, Annotations, two stream fusion BibRef

Huang, L.J.[Lin-Jiang], Wang, L.[Liang], Li, H.S.[Hong-Sheng],
Multi-Modality Self-Distillation for Weakly Supervised Temporal Action Localization,
IP(31), 2022, pp. 1504-1519.
IEEE DOI 2202
Location awareness, Reliability, Noise measurement, Annotations, Training, Head, Task analysis, self-distillation BibRef

Huang, L.J.[Lin-Jiang], Huang, Y.[Yan], Ouyang, W.L.[Wan-Li], Wang, L.A.[Li-Ang],
Two-Branch Relational Prototypical Network for Weakly Supervised Temporal Action Localization,
PAMI(44), No. 9, September 2022, pp. 5729-5746.
IEEE DOI 2208
Prototypes, Task analysis, Location awareness, Feature extraction, Annotations, Computational modeling, Visualization, multi-label clustering loss BibRef

Huang, L.J.[Lin-Jiang], Wang, L.[Liang], Li, H.S.[Hong-Sheng],
Weakly Supervised Temporal Action Localization via Representative Snippet Knowledge Propagation,
CVPR22(3262-3271)
IEEE DOI 2210
BibRef
Earlier:
Foreground-Action Consistency Network for Weakly Supervised Temporal Action Localization,
ICCV21(7982-7991)
IEEE DOI 2203
Location awareness, Codes, Benchmark testing, Pattern recognition, Video analysis and understanding, Action and event recognition. Image annotation, Task analysis, Action and behavior recognition BibRef

Pan, J.T.[Jun-Ting], Chen, S.[Siyu], Shou, M.Z.[Mike Zheng], Liu, Y.[Yu], Shao, J.[Jing], Li, H.S.[Hong-Sheng],
Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization,
CVPR21(464-474)
IEEE DOI 2111
Location awareness, Visualization, Cognition, Pattern recognition, Complexity theory, Task analysis BibRef

Liu, Z.Y.[Zi-Yi], Wang, L.[Le], Zhang, Q.L.[Qi-Lin], Tang, W.[Wei], Zheng, N.N.[Nan-Ning], Hua, G.[Gang],
Weakly Supervised Temporal Action Localization Through Contrast Based Evaluation Networks,
PAMI(44), No. 9, September 2022, pp. 5886-5902.
IEEE DOI 2208
Location awareness, Proposals, Videos, Task analysis, Feature extraction, Training, Action localization, temporal contrast BibRef

Liu, Z.Y.[Zi-Yi], Wang, L.[Le], Zhang, Q.L.[Qi-Lin], Gao, Z.N.[Zhan-Ning], Niu, Z.X.[Zhen-Xing], Zheng, N.N.[Nan-Ning], Hua, G.[Gang],
Weakly Supervised Temporal Action Localization Through Contrast Based Evaluation Networks 1,
ICCV19(3898-3907)
IEEE DOI 2004
image classification, video signal processing, action proposal evaluator, Streaming media BibRef

Zhai, Y.H.[Yuan-Hao], Wang, L.[Le], Tang, W.[Wei], Zhang, Q.L.[Qi-Lin], Yuan, J.S.[Jun-Song], Hua, G.[Gang],
Two-stream Consensus Network for Weakly-supervised Temporal Action Localization,
ECCV20(VI:37-54).
Springer DOI 2011
BibRef

Zhai, Y.H.[Yuan-Hao], Wang, L.[Le], Tang, W.[Wei], Zhang, Q.L.[Qi-Lin], Zheng, N.N.[Nan-Ning], Hua, G.[Gang],
Action Coherence Network for Weakly-Supervised Temporal Action Localization,
MultMed(24), No. 2022, pp. 1857-1870.
IEEE DOI 2204
Proposals, Location awareness, Coherence, Feature extraction, Training, Optical losses, Optical fiber networks, weakly-supervised learning BibRef

Zhai, Y.H.[Yuan-Hao], Wang, L.[Le], Liu, Z.Y.[Zi-Yi], Zhang, Q.L.[Qi-Lin], Hua, G.[Gang], Zheng, N.N.[Nan-Ning],
Action Coherence Network for Weakly Supervised Temporal Action Localization,
ICIP19(3696-3700)
IEEE DOI 1910
weakly-supervised, temporal action lo-calization, coherence loss BibRef

Zhai, Y.H.[Yuan-Hao], Wang, L.[Le], Tang, W.[Wei], Zhang, Q.L.[Qi-Lin], Zheng, N.N.[Nan-Ning], Doermann, D.[David], Yuan, J.S.[Jun-Song], Hua, G.[Gang],
Adaptive Two-Stream Consensus Network for Weakly-Supervised Temporal Action Localization,
PAMI(45), No. 4, April 2023, pp. 4136-4151.
IEEE DOI 2303
Videos, Location awareness, Proposals, Adaptation models, Uncertainty, Training, Optical flow, Temporal action localization, weakly-supervised learning BibRef

Mettes, P.S.[Pascal S.], Thong, W.[William], Snoek, C.G.M.[Cees G. M.],
Object Priors for Classifying and Localizing Unseen Actions,
IJCV(129), No. 6, June 2021, pp. 1954-1971.
Springer DOI 2106
BibRef

Hu, T.[Tao], Thong, W.[William], Mettes, P.S.[Pascal S.], Snoek, C.G.M.[Cees G.M.],
Query by Activity Video in the Wild,
ICIP23(550-554)
IEEE DOI 2312
BibRef

Hu, Y.P.[Yu-Peng], Nie, L.Q.[Li-Qiang], Liu, M.[Meng], Wang, K.[Kun], Wang, Y.L.[Ying-Long], Hua, X.S.[Xian-Sheng],
Coarse-to-Fine Semantic Alignment for Cross-Modal Moment Localization,
IP(30), 2021, pp. 5933-5943.
IEEE DOI 2107
Semantics, Location awareness, Visualization, Context modeling, Proposals, Task analysis, Correlation, hierarchical semantic pruning BibRef

Zhao, T.[Tao], Han, J.W.[Jun-Wei], Yang, L.[Le], Wang, B.L.[Bing-Lu], Zhang, D.W.[Ding-Wen],
SODA: Weakly Supervised Temporal Action Localization Based on Astute Background Response and Self-Distillation Learning,
IJCV(129), No. 8, August 2021, pp. 2474-2498.
Springer DOI 2108
BibRef

Bai, C.[Cong], Li, H.K.[Hong-Kai], Zhang, J.L.[Jing-Lin], Huang, L.[Ling], Zhang, L.[Lu],
Unsupervised Adversarial Instance-Level Image Retrieval,
MultMed(23), 2021, pp. 2199-2207.
IEEE DOI 2108
Retrival of instances from daily life monitoring. Image retrieval, Training, Generators, Generative adversarial networks, Feature extraction, unsupervised training BibRef

Ding, X.P.[Xin-Peng], Wang, N.N.[Nan-Nan], Gao, X.B.[Xin-Bo], Li, J.[Jie], Wang, X.Y.[Xiao-Yu], Liu, T.L.[Tong-Liang],
KFC: An Efficient Framework for Semi-Supervised Temporal Action Localization,
IP(30), 2021, pp. 6869-6878.
IEEE DOI 2108
Perturbation methods, Location awareness, Feature extraction, Training, Annotations, Semisupervised learning, Semantics, video understanding BibRef

Zhang, C.[Can], Cao, M.[Meng], Yang, D.M.[Dong-Ming], Jiang, J.[Ji], Zou, Y.X.[Yue-Xian],
Synergic learning for noise-insensitive webly-supervised temporal action localization,
IVC(113), 2021, pp. 104247.
Elsevier DOI 2108
Temporal action localization, Web supervision, Spatio-temporal representation BibRef

Cao, M.[Meng], Zhang, C.[Can], Chen, L.[Long], Shou, M.Z.[Mike Zheng], Zou, Y.X.[Yue-Xian],
Deep Motion Prior for Weakly-Supervised Temporal Action Localization,
IP(31), 2022, pp. 5203-5213.
IEEE DOI 2208
Optical losses, Videos, Optical imaging, Location awareness, Feature extraction, Adaptive optics, Xenon, motion-guided loss BibRef

Zhang, C.[Can], Cao, M.[Meng], Yang, D.M.[Dong-Ming], Chen, J.[Jie], Zou, Y.X.[Yue-Xian],
CoLA: Weakly-Supervised Temporal Action Localization with Snippet Contrastive Learning,
CVPR21(16005-16014)
IEEE DOI 2111
Location awareness, Annotations, Benchmark testing, Pattern recognition, Videos BibRef

Chen, G.[Guang], Zhang, C.[Can], Zou, Y.X.[Yue-Xian],
AFNet: Temporal Locality-Aware Network With Dual Structure for Accurate and Fast Action Detection,
MultMed(23), 2021, pp. 2672-2682.
IEEE DOI 2109
Proposals, Feature extraction, Task analysis, Training, Object detection, Convolution, AFNet BibRef

Zhang, C.[Can], Zou, Y.X.[Yue-Xian], Chen, G.[Guang], Gan, L.[Lei],
EAR: Efficient action recognition with local-global temporal aggregation,
IVC(116), 2021, pp. 104329.
Elsevier DOI 2112
Efficient action recognition, Local-global temporal aggregation, Motion representation, Persistence of appearance BibRef

Zhang, X.Y.[Xiao-Yu], Zhang, Y.[Yaru], Shi, H.C.[Hai-Chao], Dong, J.[Jing],
SAPS: Self-Attentive Pathway Search for weakly-supervised action localization with background-action augmentation,
CVIU(210), 2021, pp. 103256.
Elsevier DOI 2109
Video understanding, Action localization, Representation learning, Neural architecture search, Background modeling BibRef

Xuan, H.Y.[Han-Yu], Luo, L.[Lei], Zhang, Z.Y.[Zhen-Yu], Yang, J.[Jian], Yan, Y.[Yan],
Discriminative Cross-Modality Attention Network for Temporal Inconsistent Audio-Visual Event Localization,
IP(30), 2021, pp. 7878-7888.
IEEE DOI 2109
Visualization, Location awareness, Semantics, Task analysis, Correlation, Linear programming, Fuses, Multi-modality perception, discriminative representation BibRef

Zhang, Z.J.[Zi-Jian], Zhao, Z.[Zhou], Zhang, Z.[Zhu], Lin, Z.J.[Zhi-Jie], Wang, Q.[Qi], Hong, R.C.[Ri-Chang],
Temporal Textual Localization in Video via Adversarial Bi-Directional Interaction Networks,
MultMed(23), 2021, pp. 3306-3317.
IEEE DOI 2109
Bidirectional control, Semantics, Task analysis, Correlation, Natural languages, Visualization, texual video localization BibRef

Zhang, Z.M.[Zong-Meng], Han, X.J.[Xian-Jing], Song, X.M.[Xue-Meng], Yan, Y.[Yan], Nie, L.Q.[Li-Qiang],
Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos,
IP(30), 2021, pp. 8265-8277.
IEEE DOI 2110
Start and end points of a moment described by a natural language sentence. Videos, Location awareness, Task analysis, Semantics, Syntactics, Convolution, Cognition, Temporal language localization, video and language BibRef

Zhao, P.[Peisen], Xie, L.X.[Ling-Xi], Zhang, Y.[Ya], Tian, Q.[Qi],
Universal-to-Specific Framework for Complex Action Recognition,
MultMed(23), 2021, pp. 3441-3453.
IEEE DOI 2110
Convolution, Task analysis, Feature extraction, Manganese, Solid modeling, Action recognition, neural networks BibRef

Zhao, P.[Peisen], Xie, L.X.[Ling-Xi], Ju, C.[Chen], Zhang, Y.[Ya], Wang, Y.F.[Yan-Feng], Tian, Q.[Qi],
Bottom-up Temporal Action Localization with Mutual Regularization,
ECCV20(VIII:539-555).
Springer DOI 2011
BibRef

Paul, S.[Sudipta], Mithun, N.C.[Niluthpol Chowdhury], Roy-Chowdhury, A.K.[Amit K.],
Text-Based Localization of Moments in a Video Corpus,
IP(30), 2021, pp. 8886-8899.
IEEE DOI 2111
Task analysis, Location awareness, Semantics, Visualization, Image coding, Feature extraction, Annotations, video corpus BibRef

Su, R.[Rui], Xu, D.[Dong], Zhou, L.P.[Lu-Ping], Ouyang, W.L.[Wan-Li],
Progressive Cross-Stream Cooperation in Spatial and Temporal Domain for Action Localization,
PAMI(43), No. 12, December 2021, pp. 4477-4490.
IEEE DOI 2112
Location awareness, Detectors, Feature extraction, Spatial temporal resolution, Training data, Motion segmentation, two-stream cooperation BibRef

Li, G.Z.[Guo-Zhang], Li, J.[Jie], Wang, N.N.[Nan-Nan], Ding, X.P.[Xin-Peng], Li, Z.F.[Zhi-Feng], Gao, X.B.[Xin-Bo],
Multi-Hierarchical Category Supervision for Weakly-Supervised Temporal Action Localization,
IP(30), 2021, pp. 9332-9344.
IEEE DOI 2112
Videos, Location awareness, Training, Feature extraction, Bars, Proposals, Measurement, Weak supervision, multi-hierarchical categories BibRef

Li, G.Z.[Guo-Zhang], Cheng, D.[De], Ding, X.P.[Xin-Peng], Wang, N.N.[Nan-Nan], Wang, X.Y.[Xiao-Yu], Gao, X.B.[Xin-Bo],
Boosting Weakly-Supervised Temporal Action Localization with Text Information,
CVPR23(10648-10657)
IEEE DOI 2309
BibRef

Ng, Y.B.[Yan Bin], Fernando, B.[Basura],
Weakly supervised action segmentation with effective use of attention and self-attention,
CVIU(213), 2021, pp. 103298.
Elsevier DOI 2112
Weakly supervised action segmentation, Self-attention, Sequence-to-sequence models BibRef

Zhou, Y.[Yuan], Wang, R.L.[Ruo-Lin], Li, H.R.[Hong-Ru], Kung, S.Y.[Sun-Yuan],
Temporal Action Localization Using Long Short-Term Dependency,
MultMed(23), 2021, pp. 4363-4375.
IEEE DOI 2112
Videos, Feature extraction, Proposals, Task analysis, Recurrent neural networks, video content analysis BibRef

Zhao, P.[Peisen], Xie, L.X.[Ling-Xi], Zhang, Y.[Ya], Tian, Q.[Qi],
Actionness-Guided Transformer for Anchor-Free Temporal Action Localization,
SPLetters(29), 2022, pp. 194-198.
IEEE DOI 2202
Proposals, Transformers, Videos, Location awareness, Training, Feature extraction, Convolution, Temporal action localization, transformer BibRef

Sun, C.[Che], Song, H.[Hao], Wu, X.X.[Xin-Xiao], Jia, Y.D.[Yun-De], Luo, J.B.[Jie-Bo],
Exploiting Informative Video Segments for Temporal Action Localization,
MultMed(24), 2022, pp. 274-287.
IEEE DOI 2202
Motion segmentation, Location awareness, Proposals, Generators, Aggregates, Image segmentation, Feature extraction, attention mechanism BibRef

Wang, B.L.[Bing-Lu], Zhang, X.[Xun], Zhao, Y.Q.[Yong-Qiang],
Exploring Sub-Action Granularity for Weakly Supervised Temporal Action Localization,
CirSysVideo(32), No. 4, April 2022, pp. 2186-2198.
IEEE DOI 2204
Location awareness, Proposals, Task analysis, Aggregates, Training, Prediction algorithms, Marine vehicles, sub-action granularity BibRef

Xu, J.L.[Jing-Lin], Chen, G.Y.[Guang-Yi], Lu, J.W.[Ji-Wen], Zhou, J.[Jie],
Unintentional Action Localization via Counterfactual Examples,
IP(31), 2022, pp. 3281-3294.
IEEE DOI 2205
Location awareness, Training, Predictive models, Anomaly detection, Correlation, Task analysis, Proposals, intention BibRef

Xu, J.L.[Jing-Lin], Chen, G.Y.[Guang-Yi], Zhou, N.X.[Nuo-Xing], Zheng, W.S.[Wei-Shi], Lu, J.W.[Ji-Wen],
Probabilistic Temporal Modeling for Unintentional Action Localization,
IP(31), 2022, pp. 3081-3094.
IEEE DOI 2205
Probabilistic logic, Location awareness, Videos, Annotations, Uncertainty, Reliability, Anomaly detection, action intention BibRef

Ma, F.[Fan], Zhu, L.C.[Lin-Chao], Yang, Y.[Yi],
Weakly Supervised Moment Localization with Decoupled Consistent Concept Prediction,
IJCV(130), No. 5, May 2022, pp. 1244-1258.
Springer DOI 2205
BibRef

Ma, F.[Fan], Zhu, L.C.[Lin-Chao], Yang, Y.[Yi], Zha, S.X.[Sheng-Xin], Kundu, G.[Gourab], Feiszli, M.[Matt], Shou, Z.[Zheng],
SF-net: Single-frame Supervision for Temporal Action Localization,
ECCV20(IV:420-437).
Springer DOI 2011
BibRef

Fu, H.[Hao], Wang, H.X.[Hong-Xing],
Multiple cross-attention for video-subtitle moment retrieval,
PRL(156), 2022, pp. 7-14.
Elsevier DOI 2205
Video-subtitle moment retrieval, Multi-modal learning, Cross-attention BibRef

Li, X.W.[Xue-Wei], Wu, H.J.[Hong-Jun], Li, M.Z.[Meng-Zhu], Liu, H.Z.[Hong-Zhe],
Multi-label video classification via coupling attentional multiple instance learning with label relation graph,
PRL(156), 2022, pp. 53-59.
Elsevier DOI 2205
Multi-label video classification, Multiple instance learning, Attentional feature learning, Label relation graph BibRef

Rodin, I.[Ivan], Furnari, A.[Antonino], Mavroeidis, D.[Dimitrios], Farinella, G.M.[Giovanni Maria],
Untrimmed Action Anticipation,
CIAP22(III:337-348).
Springer DOI 2205
BibRef

Xia, K.[Kun], Wang, L.[Le], Zhou, S.P.[San-Ping], Hua, G.[Gang], Tang, W.[Wei],
Dual relation network for temporal action localization,
PR(129), 2022, pp. 108725.
Elsevier DOI 2206
Temporal action localization, Relation reasoning BibRef

Cheng, Y.[Yi], Sun, Y.[Ying], Fan, H.[Hehe], Zhuo, T.[Tao], Lim, J.H.[Joo-Hwee], Kankanhalli, M.[Mohan],
Entropy guided attention network for weakly-supervised action localization,
PR(129), 2022, pp. 108718.
Elsevier DOI 2206
Temporal action localization, Weakly-supervised learning, Entropy guided loss, Global similarity loss BibRef

Uslu, G.[Gamze], Baydere, S.[Sebnem],
A Segmentation Scheme for Knowledge Discovery in Human Activity Spotting,
Cyber(52), No. 7, July 2022, pp. 5668-5681.
IEEE DOI 2207
Feature extraction, Time-domain analysis, Training, Knowledge discovery, Training data, Task analysis, Entropy, sliding windowing BibRef

Kim, Y.H.[Young Hwi], Nam, S.[Seonghyeon], Kim, S.J.[Seon Joo],
2PESNet: Towards online processing of temporal action localization,
PR(131), 2022, pp. 108871.
Elsevier DOI 2208
Online video understanding, Temporal action localization BibRef

Zhao, Y.[Yibo], Zhang, H.[Hua], Gao, Z.[Zan], Guan, W.[Weili], Nie, J.[Jie], Liu, A.[Anan], Wang, M.[Meng], Chen, S.Y.[Sheng-Yong],
A Temporal-Aware Relation and Attention Network for Temporal Action Localization,
IP(31), 2022, pp. 4746-4760.
IEEE DOI 2208
Proposals, Feature extraction, Task analysis, Location awareness, Network architecture, Convolution, Aggregates, temporal action localization BibRef

Zhao, Y.[Yibo], Zhang, H.[Hua], Gao, Z.[Zan], Guan, W.[Weili], Wang, M.[Meng], Chen, S.Y.[Sheng-Yong],
A Snippets Relation and Hard-Snippets Mask Network for Weakly-Supervised Temporal Action Localization,
CirSysVideo(34), No. 8, August 2024, pp. 7202-7215.
IEEE DOI 2408
Location awareness, Task analysis, Proposals, Circuits and systems, Uncertainty, Prototypes, Multitasking, snippet enhancement loss BibRef

Zhao, Y.[Yibo], Zhang, H.[Hua], Gao, Z.[Zan], Gao, W.J.[Wen-Jie], Wang, M.[Meng], Chen, S.Y.[Sheng-Yong],
A Novel Action Saliency and Context-Aware Network for Weakly-Supervised Temporal Action Localization,
MultMed(25), 2023, pp. 8253-8266.
IEEE DOI 2312
BibRef

Wang, Q.Y.[Qing-Yun], Song, Y.[Yan], Zou, R.[Rong], Shu, X.B.[Xiang-Bo],
Progressive enhancement network with pseudo labels for weakly supervised temporal action localization,
JVCIR(87), 2022, pp. 103590.
Elsevier DOI 2208
Temporal action localization, Weak supervision, Pseudo label, Video understanding BibRef

Zeng, R.H.[Run-Hao], Huang, W.B.[Wen-Bing], Tan, M.K.[Ming-Kui], Rong, Y.[Yu], Zhao, P.L.[Pei-Lin], Huang, J.Z.[Jun-Zhou], Gan, C.[Chuang],
Graph Convolutional Module for Temporal Action Localization in Videos,
PAMI(44), No. 10, October 2022, pp. 6209-6223.
IEEE DOI 2209
BibRef
Earlier: A1, A2, A7, A3, A4, A5, A6:
Graph Convolutional Networks for Temporal Action Localization,
ICCV19(7093-7102)
IEEE DOI 2004
Location awareness, Videos, Proposals, Semantics, Image edge detection, Sports, Feature extraction, video analysis. convolutional, graph theory, image classification, learning (artificial intelligence), action proposal graph BibRef

Souri, Y.[Yaser], Fayyaz, M.[Mohsen], Minciullo, L.[Luca], Francesca, G.[Gianpiero], Gall, J.[Juergen],
Fast Weakly Supervised Action Segmentation Using Mutual Consistency,
PAMI(44), No. 10, October 2022, pp. 6196-6208.
IEEE DOI 2209
Training, Viterbi algorithm, Streaming media, Artificial neural networks, Task analysis, Predictive models, weakly supervised learning BibRef

Yudistira, N.[Novanto], Kavitha, M.S.[Muthu Subash], Kurita, T.[Takio],
Weakly-Supervised Action Localization, and Action Recognition Using Global-Local Attention of 3D CNN,
IJCV(130), No. 10, October 2022, pp. 2349-2363.
Springer DOI 2209
BibRef

Nawaz, H.S.[Hafiza Sadia], Shi, Z.S.[Zhen-Sheng], Gan, Y.H.[Yan-Hai], Hirpa, A.[Amanuel], Dong, J.Y.[Jun-Yu], Zheng, H.Y.[Hai-Yong],
Temporal Moment Localization via Natural Language by Utilizing Video Question Answers as a Special Variant and Bypassing NLP for Corpora,
CirSysVideo(32), No. 9, September 2022, pp. 6174-6185.
IEEE DOI 2209
Visualization, Location awareness, Natural language processing, Oceans, Transformers, Semantics, Grounding, Moment retrieval, moment localization using language BibRef

Han, T.T.[Ting-Ting], Wang, K.[Kai], Yu, J.[Jun], Fan, J.P.[Jian-Ping],
Weakly supervised moment localization with natural language based on semantic reconstruction,
IVC(126), 2022, pp. 104532.
Elsevier DOI 2209
Cross-modal moment localization, Weakly supervised temporal grounding, Semantic reconstruction BibRef

Zhang, Y.[Yaru], Zhang, X.Y.[Xiao-Yu], Shi, H.C.[Hai-Chao],
OW-TAL: Learning Unknown Human Activities for Open-World Temporal Action Localization,
PR(133), 2023, pp. 109027.
Elsevier DOI 2210
Temporal action localization, Open-world learning, Self-paced learning BibRef

Liu, Y.[Yi], Wang, L.M.[Li-Min], Wang, Y.[Yali], Ma, X.[Xiao], Qiao, Y.[Yu],
FineAction: A Fine-Grained Video Dataset for Temporal Action Localization,
IP(31), 2022, pp. 6937-6950.
IEEE DOI 2212
Annotations, Location awareness, Sports, Benchmark testing, Taxonomy, Task analysis, Hair, Temporal action localization, fine-grained, deep learning BibRef

Zhang, S.Y.[Song-Yang], Peng, H.[Houwen], Fu, J.L.[Jian-Long], Lu, Y.J.[Yi-Juan], Luo, J.B.[Jie-Bo],
Multi-Scale 2D Temporal Adjacency Networks for Moment Localization With Natural Language,
PAMI(44), No. 12, December 2022, pp. 9073-9087.
IEEE DOI 2212
Location awareness, Context modeling, Task analysis, Natural languages, Feature extraction, Rats, Semantics BibRef

Yang, L.[Le], Han, J.W.[Jun-Wei], Zhao, T.[Tao], Lin, T.W.[Tian-Wei], Zhang, D.W.[Ding-Wen], Chen, J.X.[Jian-Xin],
Background-Click Supervision for Temporal Action Localization,
PAMI(44), No. 12, December 2022, pp. 9814-9829.
IEEE DOI 2212
Location awareness, Annotations, Proposals, Task analysis, Costs, Hidden Markov models, Supervised learning, weakly supervised learning BibRef

Fu, J.[Jie], Gao, J.Y.[Jun-Yu], Xu, C.S.[Chang-Sheng],
Compact Representation and Reliable Classification Learning for Point-Level Weakly-Supervised Action Localization,
IP(31), 2022, pp. 7363-7377.
IEEE DOI 2212
Training, Prototypes, Reliability, Representation learning, Annotations, Task analysis, Probabilistic logic, point-level weakly-supervised action localization BibRef

Fu, J.[Jie], Gao, J.Y.[Jun-Yu], Xu, C.S.[Chang-Sheng],
Semantic and Temporal Contextual Correlation Learning for Weakly-Supervised Temporal Action Localization,
PAMI(45), No. 10, October 2023, pp. 12427-12443.
IEEE DOI 2310
BibRef

Chen, M.Y.[Meng-Yuan], Gao, J.Y.[Jun-Yu], Xu, C.S.[Chang-Sheng],
Uncertainty-Aware Dual-Evidential Learning for Weakly-Supervised Temporal Action Localization,
PAMI(45), No. 12, December 2023, pp. 15896-15911.
IEEE DOI 2311
BibRef
Earlier:
Cascade Evidential Learning for Open-world Weakly-supervised Temporal Action Localization,
CVPR23(14741-14750)
IEEE DOI 2309
BibRef

Hu, Y.F.[Yu-Fan], Fu, J.[Jie], Chen, M.Y.[Meng-Yuan], Gao, J.Y.[Jun-Yu], Dong, J.F.[Jian-Feng], Fan, B.[Bin], Liu, H.M.[Hong-Min],
Learning Proposal-Aware Re-Ranking for Weakly-Supervised Temporal Action Localization,
CirSysVideo(34), No. 1, January 2024, pp. 207-220.
IEEE DOI 2401
BibRef

Hu, Y.F.[Yu-Fan], Gao, J.Y.[Jun-Yu], Dong, J.F.[Jian-Feng], Fan, B.[Bin], Liu, H.M.[Hong-Min],
Exploring Rich Semantics for Open-Set Action Recognition,
MultMed(26), 2024, pp. 5410-5421.
IEEE DOI 2404
Semantics, Prototypes, Knowledge graphs, Visualization, Task analysis, Uncertainty, Training, Open-set action recognition, semantic relation modeling BibRef

Gao, J.Y.[Jun-Yu], Chen, M.Y.[Meng-Yuan], Xu, C.S.[Chang-Sheng],
Vectorized Evidential Learning for Weakly-Supervised Temporal Action Localization,
PAMI(45), No. 12, December 2023, pp. 15949-15963.
IEEE DOI 2311
BibRef

Chen, M.Y.[Meng-Yuan], Gao, J.Y.[Jun-Yu], Yang, S.C.[Shi-Cai], Xu, C.S.[Chang-Sheng],
Dual-Evidential Learning for Weakly-supervised Temporal Action Localization,
ECCV22(IV:192-208).
Springer DOI 2211
BibRef

Gao, J.Y.[Jun-Yu], Chen, M.Y.[Meng-Yuan], Xu, C.S.[Chang-Sheng],
Fine-grained Temporal Contrastive Learning for Weakly-supervised Temporal Action Localization,
CVPR22(19967-19977)
IEEE DOI 2210
Location awareness, Training, Codes, Video sequences, Benchmark testing, Pattern recognition, Video analysis and understanding BibRef

Sun, W.Q.[Wei-Qi], Su, R.[Rui], Yu, Q.[Qian], Xu, D.[Dong],
Slow Motion Matters: A Slow Motion Enhanced Network for Weakly Supervised Temporal Action Localization,
CirSysVideo(33), No. 1, January 2023, pp. 354-366.
IEEE DOI 2301
Videos, Location awareness, Task analysis, Motion segmentation, Feature extraction, Training, Sports, Weakly-supervised learning, slow motion BibRef

Vo, K.[Khoa], Truong, S.[Sang], Yamazaki, K.[Kashu], Raj, B.[Bhiksha], Tran, M.T.[Minh-Triet], Le, N.[Ngan],
AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation,
IJCV(131), No. 1, January 2023, pp. 302-323.
Springer DOI 2301

WWW Link. BibRef

Pehlivan, S.[Selen], Laaksonen, J.T.[Jorma T.],
Improved action proposals using fine-grained proposal features with recurrent attention models,
JVCIR(90), 2023, pp. 103709.
Elsevier DOI 2301
Temporal action proposal generation, Untrimmed video understanding, Temporal convolution, Attention BibRef

Liu, M.[Meng], Nie, L.Q.[Li-Qiang], Wang, Y.[Yunxiao], Wang, M.[Meng], Rui, Y.[Yong],
A Survey on Video Moment Localization,
Surveys(55), No. 9, January 2023, pp. xx-yy.
DOI Link 2302
Survey, Action Localization. vision and language, survey, cross-modal retrieval, video moment retrieval, Video moment localization BibRef

Zhang, D.W.[Ding-Wen],
Equivalent Classification Mapping for Weakly Supervised Temporal Action Localization,
PAMI(45), No. 3, March 2023, pp. 3019-3031.
IEEE DOI 2302
Pipelines, Location awareness, Aggregates, Training, Streaming media, Context modeling, Annotations, Equivalent mechanism, temporal action localization BibRef

Xue, C.[Cheng], Zhong, X.[Xionghu], Cai, M.J.[Min-Jie], Chen, H.[Hao], Wang, W.W.[Wen-Wu],
Audio-Visual Event Localization by Learning Spatial and Semantic Co-Attention,
MultMed(25), 2023, pp. 418-429.
IEEE DOI 2302
Visualization, Location awareness, Task analysis, Semantics, Feature extraction, Correlation, Automobiles, Audio-visual, deep learning BibRef

Yang, W.F.[Wen-Fei], Zhang, T.Z.[Tian-Zhu], Zhang, Y.D.[Yong-Dong], Wu, F.[Feng],
Uncertainty Guided Collaborative Training for Weakly Supervised and Unsupervised Temporal Action Localization,
PAMI(45), No. 4, April 2023, pp. 5252-5267.
IEEE DOI 2303
Training, Uncertainty, Location awareness, Testing, Task analysis, Proposals, Noise measurement, Weakly supervised, unsupervised, collaborative training BibRef

Su, Y.T.[Yu-Ting], Wang, W.K.[Wei-Kang], Liu, J.[Jing], Ma, S.[Shuang], Yang, X.K.[Xiao-Kang],
Sequence as a Whole: A Unified Framework for Video Action Localization With Long-Range Text Query,
IP(32), 2023, pp. 1403-1418.
IEEE DOI 2303
Location awareness, Task analysis, Grounding, Transformers, Convolution, Visualization, Spatiotemporal phenomena, vision transformer BibRef

Wang, C.[Chuanxu], Wang, J.[Jing], Liu, P.[Peng],
Complementary adversarial mechanisms for weakly-supervised temporal action localization,
PR(139), 2023, pp. 109426.
Elsevier DOI 2304
Temporal action localization, Boundary regression, Complementary adversarial mechanism, Action recognition BibRef

Zhang, H.P.[Hai-Ping], Ma, C.H.[Cong-Hao], Yu, D.J.[Dong-Jin], Guan, L.M.[Li-Ming], Wang, D.J.[Dong-Jing], Hu, Z.P.[Ze-Peng], Liu, X.[Xu],
MTSCANet: Multi temporal resolution temporal semantic context aggregation network,
IET-CV(17), No. 3, 2023, pp. 366-378.
DOI Link 2305
convolutional neural nets, learning (artificial intelligence), neural net architecture BibRef

Gao, Z.[Zan], Cui, X.[Xinglei], Zhuo, T.[Tao], Cheng, Z.Y.[Zhi-Yong], Liu, A.A.[An-An], Wang, M.[Meng], Chen, S.[Shenyong],
A Multitemporal Scale and Spatial-Temporal Transformer Network for Temporal Action Localization,
HMS(53), No. 3, June 2023, pp. 569-580.
IEEE DOI 2306
Transformers, Semantics, Feature extraction, Proposals, Location awareness, Convolution, Task analysis, temporal action localization (TAL) BibRef

Zhu, Z.X.[Zi-Xin], Wang, L.[Le], Tang, W.[Wei], Zheng, N.N.[Nan-Ning], Hua, G.[Gang],
ContextLoc++: A Unified Context Model for Temporal Action Localization,
PAMI(45), No. 8, August 2023, pp. 9504-9519.
IEEE DOI 2307
BibRef
Earlier: A1, A3, A2, A4, A5:
Enriching Local and Global Contexts for Temporal Action Localization,
ICCV21(13496-13505)
IEEE DOI 2203
Proposals, Location awareness, Context modeling, Visualization, Optical flow, Adaptation models, Task analysis, temporal action localization. Codes, Computational modeling, Network architecture, Video analysis and understanding BibRef

Liu, S.[Shuo], Quan, W.[Weize], Wang, C.[Chaoqun], Liu, Y.[Yuan], Liu, B.[Bin], Yan, D.M.[Dong-Ming],
Dense Modality Interaction Network for Audio-Visual Event Localization,
MultMed(25), 2023, pp. 2734-2748.
IEEE DOI 2307
Visualization, Location awareness, Aircraft, Correlation, Task analysis, Synchronization, Fuses, Attention, Multi-modality BibRef

Sun, L.[Li], Wang, P.[Ping], Wang, L.[Liuan], Sun, J.[Jun], Okatani, T.[Takayuki],
Zero-shot temporal event localisation: Label-free, training-free, domain-free,
IET-CV(17), No. 5, 2023, pp. 599-613.
DOI Link 2309
computer vision, video retrieval BibRef

Raza, M.A.[Muhammad Ahmed], Chen, L.F.[Long-Fei], Nanbo, L.[Li], Fisher, R.B.[Robert B.],
EatSense: Human centric, action recognition and localization dataset for understanding eating behaviors and quality of motion assessment,
IVC(137), 2023, pp. 104762.
Elsevier DOI 2309
EatSense, Eating vision dataset, Atomic-action recognition, Change in movement detection BibRef

Liu, Y.[Yu], Yang, F.[Fan], Ginhac, D.[Dominique],
Accumulated micro-motion representations for lightweight online action detection in real-time,
JVCIR(95), 2023, pp. 103879.
Elsevier DOI 2309
Motion representation, Spatiotemporal action localization, Online action detection, Real-time computing, Embedded system BibRef

Mettes, P.[Pascal],
Universal Prototype Transport for Zero-Shot Action Recognition and Localization,
IJCV(131), No. 1, January 2023, pp. 3060-3073.
Springer DOI 2310
BibRef

Chen, Z.Y.[Zheng-Yan], Liu, H.[Hong], Zhang, L.L.[Lin-Lin], Liao, X.[Xin],
Multi-Dimensional Attention With Similarity Constraint for Weakly-Supervised Temporal Action Localization,
MultMed(25), 2023, pp. 4349-4360.
IEEE DOI 2310
BibRef

Sun, X.[Xin], Gao, J.L.[Jia-Lin], Zhu, Y.Z.[Yi-Zhe], Wang, X.[Xuan], Zhou, X.[Xi],
Video Moment Retrieval via Comprehensive Relation-Aware Network,
CirSysVideo(33), No. 9, September 2023, pp. 5281-5295.
IEEE DOI 2310
BibRef

Wang, Y.X.[Yun-Xiao], Liu, M.[Meng], Wei, Y.W.[Yin-Wei], Cheng, Z.Y.[Zhi-Yong], Wang, Y.L.[Ying-Long], Nie, L.Q.[Li-Qiang],
Siamese Alignment Network for Weakly Supervised Video Moment Retrieval,
MultMed(25), 2023, pp. 3921-3933.
IEEE DOI 2310
BibRef

Ju, C.[Chen], Zhao, P.[Peisen], Chen, S.[Siheng], Zhang, Y.[Ya], Zhang, X.Y.[Xiao-Yun], Wang, Y.F.[Yan-Feng], Tian, Q.[Qi],
Adaptive Mutual Supervision for Weakly-Supervised Temporal Action Localization,
MultMed(25), 2023, pp. 6688-6701.
IEEE DOI 2311
BibRef

Moniruzzaman, M., Yin, Z.Z.[Zhao-Zheng],
Collaborative Foreground, Background, and Action Modeling Network for Weakly Supervised Temporal Action Localization,
CirSysVideo(33), No. 11, November 2023, pp. 6939-6951.
IEEE DOI 2311
BibRef

Fang, X.[Xiang], Liu, D.Z.[Dai-Zong], Zhou, P.[Pan], Hu, Y.C.[Yu-Chong],
Multi-Modal Cross-Domain Alignment Network for Video Moment Retrieval,
MultMed(25), 2023, pp. 7517-7532.
IEEE DOI 2311
BibRef

Liu, Z.H.[Zi-Hao], Yan, D.F.[Dan-Feng], Cai, Y.Q.[Yuan-Qiang], Song, Y.[Yan],
Spatio-temporal human action localization in indoor surveillances,
PR(147), 2024, pp. 110087.
Elsevier DOI 2312
Video analysis, Spatio-temporal action localization dataset, Real-world indoor surveillance BibRef

Xia, K.[Kun], Wang, L.[Le], Shen, Y.C.[Yi-Chao], Zhou, S.[Sanpin], Hua, G.[Gang], Tang, W.[Wei],
Exploring Action Centers for Temporal Action Localization,
MultMed(25), 2023, pp. 9425-9436.
IEEE DOI 2312
BibRef

Wang, S.M.[Shao-Meng], Yan, R.[Rui], Huang, P.[Peng], Dai, G.Z.[Guang-Zhao], Song, Y.[Yan], Shu, X.B.[Xiang-Bo],
Com-STAL: Compositional Spatio-Temporal Action Localization,
CirSysVideo(33), No. 12, December 2023, pp. 7645-7657.
IEEE DOI 2312
BibRef

Moniruzzaman, M.[Md], Yin, Z.Z.[Zhao-Zheng],
Feature Weakening, Contextualization, and Discrimination for Weakly Supervised Temporal Action Localization,
MultMed(26), 2024, pp. 270-283.
IEEE DOI Code:
WWW Link. 2401
BibRef

Sun, Y.Z.[Yun-Zhuo], Xu, Y.F.[Yi-Fang], Xie, Z.[Zien], Shu, Y.K.[Yu-Kun], Du, S.[Sidan],
GPTSee: Enhancing Moment Retrieval and Highlight Detection via Description-Based Similarity Features,
SPLetters(31), 2024, pp. 521-525.
IEEE DOI 2402
Semantics, Visualization, Feature extraction, Task analysis, Decoding, Natural languages, Computational modeling, video highlight detection BibRef

Chen, T.B.[Tong-Bao], Wang, W.[Wenmin], Jiang, Z.[Zhe], Li, R.C.[Ruo-Chen], Wang, B.S.[Bing-Shu],
Cross-Modality Knowledge Calibration Network for Video Corpus Moment Retrieval,
MultMed(26), 2024, pp. 3799-3813.
IEEE DOI 2402
Visualization, Task analysis, Database languages, Semantics, Pipelines, Calibration, Transformers, Cross-modality, calibration, video corpus moment retrieval BibRef

Gan, M.G.[Ming-Gang], Zhang, Y.[Yan],
Content Temporal Relation Network for temporal action proposal generation,
PR(149), 2024, pp. 110245.
Elsevier DOI Code:
WWW Link. 2403
Temporal action proposal generation, Temporal action detection, Untrimmed video analysis, Proposal-proposal relations BibRef

Wang, B.L.[Bing-Lu], Zhao, Y.Q.[Yong-Qiang], Yang, L.[Le], Long, T.[Teng], Li, X.L.[Xue-Long],
Temporal Action Localization in the Deep Learning Era: A Survey,
PAMI(46), No. 4, April 2024, pp. 2171-2190.
IEEE DOI 2403
Location awareness, Videos, Surveys, Task analysis, Prediction algorithms, Training, Supervised learning, weakly supervised learning BibRef

Wang, C.X.[Chuan-Xu], Wang, J.[Jing], Xu, W.T.[Wen-Ting],
Double branch synergies with modal reinforcement for weakly supervised temporal action detection,
JVCIR(99), 2024, pp. 104090.
Elsevier DOI 2403
Multi-branch synergies, Temporal action localization, Modal reinforcement, Weakly supervising learning BibRef

Jiang, Y.Y.[Yuan-Yuan], Yin, J.Q.[Jian-Qin], Dang, Y.H.[Yong-Hao],
Leveraging the Video-Level Semantic Consistency of Event for Audio-Visual Event Localization,
MultMed(26), 2024, pp. 4617-4627.
IEEE DOI 2403
Semantics, Dogs, Visualization, Location awareness, Feature extraction, Task analysis, Encoding, Audio-visual learning, weakly-supervised learning BibRef

Hu, X.J.[Xue-Jiao], Wang, S.J.[Shi-Jie], Li, M.[Ming], Li, Y.[Yang], Du, S.[Sidan],
Distribution-Aware Activity Boundary Representation for Online Detection of Action Start in Untrimmed Videos,
SPLetters(31), 2024, pp. 765-769.
IEEE DOI 2403
Location awareness, Videos, Training, Task analysis, Representation learning, Standards, Uncertainty, distribution-aware activity boundary BibRef

Li, T.T.[Ting-Tian], Sun, Z.X.[Zi-Xun], Xiao, X.Y.[Xin-Yu],
Unsupervised Modality-Transferable Video Highlight Detection With Representation Activation Sequence Learning,
IP(33), 2024, pp. 1911-1922.
IEEE DOI 2403
Visualization, Task analysis, Feature extraction, Vectors, Semantics, Self-supervised learning, Image reconstruction, Unsupervised, representation activation sequence BibRef

Li, G.Z.[Guo-Zhang], Cheng, D.[De], Wang, N.N.[Nan-Nan], Li, J.[Jie], Gao, X.B.[Xin-Bo],
Neighbor-Guided Pseudo-Label Generation and Refinement for Single-Frame Supervised Temporal Action Localization,
IP(33), 2024, pp. 2419-2430.
IEEE DOI 2404
Semantics, Videos, Location awareness, Predictive models, Annotations, Feature extraction, Transformers, single-frame temporal action localization BibRef

Shao, Y.X.[Yu-Xiang], Zhang, F.F.[Fei-Fei], Xu, C.S.[Chang-Sheng],
Snippet-to-Prototype Contrastive Consensus Network for Weakly Supervised Temporal Action Localization,
MultMed(26), 2024, pp. 6717-6729.
IEEE DOI 2404
Location awareness, Prototypes, Semantics, Task analysis, Videos, Self-supervised learning, Annotations, Contrastive learning, weakly-supervised temporal action localization BibRef

Li, Q.[Qiang], Zu, G.[Guang], Xu, H.[Hui], Kong, J.[Jun], Zhang, Y.[Yanni], Wang, J.Z.[Jian-Zhong],
An Adaptive Dual Selective Transformer for Temporal Action Localization,
MultMed(26), 2024, pp. 7398-7412.
IEEE DOI 2405
Transformers, Proposals, Videos, Mixers, Task analysis, Location awareness, Feature extraction, video understanding BibRef

Yang, S.[Shuo], Wu, X.X.[Xin-Xiao], Shang, Z.[Zirui], Luo, J.B.[Jie-Bo],
Dynamic Pathway for Query-Aware Feature Learning in Language-Driven Action Localization,
MultMed(26), 2024, pp. 7451-7461.
IEEE DOI 2405
Semantics, Location awareness, Motion segmentation, Task analysis, Proposals, Encoding, Feature extraction, Dynamic pathway, video moment retrieval BibRef

Mokari, M.[Mozhgan], Sadeghi, K.H.[Khosrow Haj],
Enhancing temporal action localization in an end-to-end network through estimation error incorporation,
IVC(145), 2024, pp. 104994.
Elsevier DOI 2405
Temporal action localization, Activity, Classification, Activity proposal, Action recognition BibRef

Cao, C.Q.[Cong-Qi], Wang, Y.Z.[Yi-Zhe], Zhang, Y.[Yueran], Lu, Y.[Yue], Zhang, X.[Xin], Zhang, Y.N.[Yan-Ning],
Co-Occurrence Matters: Learning Action Relation for Temporal Action Localization,
CirSysVideo(34), No. 5, May 2024, pp. 3327-3339.
IEEE DOI 2405
Semantics, Visualization, Location awareness, Task analysis, Feature extraction, Predictive models, Computational modeling, temporal action localization BibRef

Hu, X.J.[Xue-Jiao], Wang, S.J.[Shi-Jie], Li, M.[Ming], Li, Y.[Yang], Du, S.[Sidan],
Time-attentive fusion network: An efficient model for online detection of action start,
IET-IPR(18), No. 7, 2024, pp. 1892-1902.
DOI Link 2405
feature extraction, image processing, video signal processing BibRef

Yang, J.[Jin], Wei, P.[Ping], Zheng, N.N.[Nan-Ning],
Cross Time-Frequency Transformer for Temporal Action Localization,
CirSysVideo(34), No. 6, June 2024, pp. 4625-4638.
IEEE DOI 2406
Time-frequency analysis, Feature extraction, Transformers, Location awareness, Logic gates, Task analysis, cross time-frequency features BibRef

Huang, Z.[Zhanghao], Ji, Y.[Yi], Li, Y.[Ying], Liu, C.P.[Chun-Ping],
Gazing After Glancing: Edge Information Guided Perception Network for Video Moment Retrieval,
SPLetters(31), 2024, pp. 1535-1539.
IEEE DOI 2406
Feature extraction, Task analysis, Visualization, Location awareness, Convolution, Training, Semantics, vision language task BibRef

Tang, Y.P.[Ye-Peng], Wang, W.N.[Wei-Ning], Zhang, C.J.[Chun-Jie], Liu, J.[Jing], Zhao, Y.[Yao],
Learnable Feature Augmentation Framework for Temporal Action Localization,
IP(33), 2024, pp. 4002-4015.
IEEE DOI 2407
Feature extraction, Task analysis, Semantics, Location awareness, Detectors, Data augmentation, Training, Temporal action detection, feature augmentation BibRef

Han, D.[De], Cheng, X.[Xing], Guo, N.[Nan], Ye, X.C.[Xiao-Chun], Rainer, B.[Benjamin], Priller, P.[Peter],
Momentum Cross-Modal Contrastive Learning for Video Moment Retrieval,
CirSysVideo(34), No. 7, July 2024, pp. 5977-5994.
IEEE DOI 2407
Proposals, Semantics, Task analysis, Visualization, Location awareness, Feature extraction, Computational modeling, attention mechanism BibRef

Vahdani, E.[Elahe], Tian, Y.L.[Ying-Li],
POTLoc: Pseudo-label Oriented Transformer for point-supervised temporal Action Localization,
CVIU(246), 2024, pp. 104044.
Elsevier DOI 2408
Temporal action detection, Point-supervised learning, Self-training BibRef

Zhang, T.Y.[Tian-Yi], Li, R.[Ronglu], Feng, P.M.[Peng-Ming], Zhang, R.[Rubo],
Integration of Global and Local Knowledge for Foreground Enhancing in Weakly Supervised Temporal Action Localization,
MultMed(26), 2024, pp. 8476-8487.
IEEE DOI 2408
Location awareness, Task analysis, Pipelines, Annotations, Training, Convolution, Feature extraction, Weakly supervised learning, video content analysis BibRef

Chen, Z.M.[Zhao-Min], Jin, X.[Xin], Chan, S.X.[Si-Xian],
SiSe: Simultaneous and Sequential Transformers for multi-label activity recognition,
PR(156), 2024, pp. 110844.
Elsevier DOI 2408
Multi-label, Activity recognition, Sequential transformer, Hierarchical structure BibRef

Chen, L.[Lin], Zhang, J.[Jing], Zhang, Y.F.[Yi-Fan], Kang, J.P.[Jun-Peng], Zhuo, L.[Li],
MKP-Net: Memory knowledge propagation network for point-supervised temporal action localization in livestreaming,
CVIU(248), 2024, pp. 104109.
Elsevier DOI 2409
Livestreaming, Point-supervised, Temporal action localization, Memory knowledge propagation, Dual optimization loss BibRef


Zhang, Z.[Zejian], Palmero, C.[Cristina], Escalera, S.[Sergio],
DualH: A Dual Hierarchical Model for Temporal Action Localization,
FG24(1-10)
IEEE DOI 2408
Location awareness, Face recognition, Gesture recognition, Feature extraction, Transformers, Encoding, Videos BibRef

Panta, L.[Love], Shrestha, P.[Prashant], Sapkota, B.[Brabeem], Bhattarai, A.[Amrita], Manandhar, S.[Suresh], Sah, A.K.[Anand Kumar],
Cross-modal Contrastive Learning with Asymmetric Co-attention Network for Video Moment Retrieval,
Pretrain24(617-624)
IEEE DOI 2404
Representation learning, Visualization, Grounding, Self-supervised learning, Computer architecture BibRef

Denize, J.[Julien], Liashuha, M.[Mykola], Rabarisoa, J.[Jaonary], Orcesi, A.[Astrid], Hérault, R.[Romain],
COMEDIAN: Self-Supervised Learning and Knowledge Distillation for Action Spotting Using Transformers,
Pretrain24(518-528)
IEEE DOI Code:
WWW Link. 2404
Costs, Source coding, Pipelines, Self-supervised learning, Transformers, Spatiotemporal phenomena, Labeling BibRef

Luo, D.Z.[De-Zhao], Huang, J.[Jiabo], Gong, S.G.[Shao-Gang], Jin, H.L.[Hai-Lin], Liu, Y.[Yang],
Zero-Shot Video Moment Retrieval from Frozen Vision-Language Models,
WACV24(5452-5461)
IEEE DOI 2404
Vocabulary, Visualization, Correlation, Costs, Annotations, Transfer learning, Algorithms, Video recognition and understanding BibRef

Rahman, M.A.[Md Atiqur], Laganiére, R.[Robert],
Spatio-Temporal Activity Detection via Joint Optimization of Spatial and Temporal Localization,
RWSurvil24(242-250)
IEEE DOI 2404
Location awareness, Deep learning, Benchmark testing, Feature extraction, Spatiotemporal phenomena BibRef

Mondal, A.[Anindya], Nag, S.[Sauradip], Prada, J.M.[Joaquin M.], Zhu, X.T.[Xia-Tian], Dutta, A.[Anjan],
Actor-agnostic Multi-label Action Recognition with Multi-modal Query,
NIVT23(784-794)
IEEE DOI Code:
WWW Link. 2401
BibRef

Warchocki, J.[Jan], Oprescu, T.[Teodor], Wang, Y.H.[Yun-Han], Damacus, A.[Alexandru], Misterka, P.[Paul], Bruintjes, R.J.[Robert-Jan], Lengyel, A.[Attila], Strafforello, O.[Ombretta], van Gemert, J.C.[Jan C.],
Benchmarking Data Efficiency and Computational Efficiency of Temporal Action Localization Models,
CVEU23(3000-3008)
IEEE DOI 2401
BibRef

Heigold, G.[Georg], Keysers, D.[Daniel], Minderer, M.[Matthias], Lucic, M.[Mario], Gritsenko, A.[Alexey], Yu, F.[Fisher], Bewley, A.[Alex], Kipf, T.[Thomas],
Video OWL-ViT: Temporally-consistent open-world localization in video,
ICCV23(13756-13765)
IEEE DOI 2401
BibRef

Shao, J.Y.[Jia-Yi], Wang, X.H.[Xiao-Han], Quan, R.J.[Rui-Jie], Zheng, J.J.[Jun-Jun], Yang, J.[Jiang], Yang, Y.[Yi],
Action Sensitivity Learning for Temporal Action Localization,
ICCV23(13411-13423)
IEEE DOI 2401
BibRef

Barrios, W.[Wayner], Soldan, M.[Mattia], Ceballos-Arroyo, A.M.[Alberto Mario], Heilbron, F.C.[Fabian Caba], Ghanem, B.[Bernard],
Localizing Moments in Long Video Via Multimodal Guidance,
ICCV23(13621-13632)
IEEE DOI Code:
WWW Link. 2401
BibRef

Wang, G.Q.[Gui-Qin], Zhao, P.[Peng], Zhao, C.[Cong], Yang, S.[Shusen], Cheng, J.[Jie], Leng, L.[Luziwei], Liao, J.X.[Jian-Xing], Guo, Q.H.[Qing-Hai],
Weakly-Supervised Action Localization by Hierarchically-structured Latent Attention Modeling,
ICCV23(10169-10179)
IEEE DOI 2401
BibRef

Shah, A.[Anshul], Lundell, B.[Benjamin], Sawhney, H.[Harpreet], Chellappa, R.[Rama],
STEPs: Self-Supervised Key Step Extraction and Localization from Unlabeled Procedural Videos,
ICCV23(10341-10353)
IEEE DOI Code:
WWW Link. 2401
BibRef

Liu, Q.[Qinying], Wang, Z.[Zilei], Rong, S.[Shenghai], Li, J.J.[Jun-Jie], Zhang, Y.X.[Yi-Xin],
Revisiting Foreground and Background Separation in Weakly-supervised Temporal Action Localization: A Clustering-based Approach,
ICCV23(10399-10409)
IEEE DOI Code:
WWW Link. 2401
BibRef

Tang, X.J.[Xiao-Jun], Fan, J.S.[Jun-Song], Luo, C.C.[Chuan-Chen], Zhang, Z.X.[Zhao-Xiang], Zhang, M.[Man], Yang, Z.Y.[Zong-Yuan],
DDG-Net: Discriminability-Driven Graph Network for Weakly-supervised Temporal Action Localization,
ICCV23(6599-6609)
IEEE DOI Code:
WWW Link. 2401
BibRef

Croitoru, I.[Ioana], Bogolin, S.V.[Simion-Vlad], Albanie, S.[Samuel], Liu, Y.[Yang], Wang, Z.W.[Zhao-Wen], Yoon, S.H.[Seung-Hyun], Dernoncourt, F.[Franck], Jin, H.L.[Hai-Lin], Bui, T.[Trung],
Moment Detection in Long Tutorial Videos,
ICCV23(2594-2604)
IEEE DOI Code:
WWW Link. 2401
BibRef

Xia, K.[Kun], Wang, L.[Le], Zhou, S.P.[San-Ping], Hua, G.[Gang], Tang, W.[Wei],
Learning from Noisy Pseudo Labels for Semi-Supervised Temporal Action Localization,
ICCV23(10126-10135)
IEEE DOI 2401
BibRef

Geng, T.T.[Tian-Tian], Wang, T.[Teng], Duan, J.M.[Jin-Ming], Cong, R.[Runmin], Zheng, F.[Feng],
Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline,
CVPR23(22942-22951)
IEEE DOI 2309
BibRef

Zheng, W.R.[Wen-Ru], Yoshihashi, R.[Ryota], Kawakami, R.[Rei], Sato, I.[Ikuro], Kanezaki, A.[Asako],
Multi Event Localization by Audio-Visual Fusion with Omnidirectional Camera and Microphone Array,
MULA23(2566-2574)
IEEE DOI 2309
BibRef

Moon, W.J.[Won-Jun], Hyun, S.[Sangeek], Park, S.U.[Sang-Uk], Park, D.[Dongchan], Heo, J.P.[Jae-Pil],
Query: Dependent Video Representation for Moment Retrieval and Highlight Detection,
CVPR23(23023-23033)
IEEE DOI 2309
BibRef

Luo, D.[Dezhao], Huang, J.[Jiabo], Gong, S.G.[Shao-Gang], Jin, H.L.[Hai-Lin], Liu, Y.[Yang],
Towards Generalisable Video Moment Retrieval: Visual-Dynamic Injection to Image-Text Pre-Training,
CVPR23(23045-23055)
IEEE DOI 2309
BibRef

Cao, S.Q.[Shu-Qiang], Luo, W.X.[Wei-Xin], Wang, B.[Bairui], Zhang, W.[Wei], Ma, L.[Lin],
E2E-LOAD: End-to-End Long-form Online Action Detection,
ICCV23(10388-10398)
IEEE DOI Code:
WWW Link. 2401
BibRef

Shi, D.F.[Ding-Feng], Zhong, Y.J.[Yu-Jie], Cao, Q.[Qiong], Ma, L.[Lin], Lit, J.[Jia], Tao, D.C.[Da-Cheng],
TriDet: Temporal Action Detection with Relative Boundary Modeling,
CVPR23(18857-18866)
IEEE DOI 2309
BibRef

Wang, Y.[Yu], Li, Y.D.[Ya-Dong], Wang, H.B.[Hong-Bin],
Two-Stream Networks for Weakly-Supervised Temporal Action Localization with Semantic-Aware Mechanisms,
CVPR23(18878-18887)
IEEE DOI 2309
BibRef

Zala, A.[Abhay], Cho, J.[Jaemin], Kottur, S.[Satwik], Chen, X.[Xilun], Oguz, B.[Barlas], Mehdad, Y.[Yashar], Bansal, M.[Mohit],
Hierarchical Video-Moment Retrieval and Step-Captioning,
CVPR23(23056-23065)
IEEE DOI 2309
BibRef

Ju, C.[Chen], Zheng, K.[Kunhao], Liu, J.X.[Jin-Xiang], Zhao, P.[Peisen], Zhang, Y.[Ya], Chang, J.L.[Jian-Long], Tian, Q.[Qi], Wang, Y.F.[Yan-Feng],
Distilling Vision-Language Pre-Training to Collaborate with Weakly-Supervised Temporal Action Localization,
CVPR23(14751-14762)
IEEE DOI 2309
BibRef

Chi, H.G.[Hyung-Gun], Lee, K.[Kwonjoon], Agarwal, N.[Nakul], Xu, Y.[Yi], Ramani, K.[Karthik], Choi, C.[Chiho],
AdamsFormer for Spatial Action Localization in the Future,
CVPR23(17885-17895)
IEEE DOI 2309
BibRef

Rizve, M.N.[Mamshad Nayeem], Mittal, G.[Gaurav], Yu, Y.[Ye], Hall, M.[Matthew], Sajeev, S.[Sandra], Shah, M.[Mubarak], Chen, M.[Mei],
PivoTAL: Prior-Driven Supervision for Weakly-Supervised Temporal Action Localization,
CVPR23(22992-23002)
IEEE DOI 2309
BibRef

Zhou, J.Q.[Jing-Qiu], Huang, L.[Linjiang], Wang, L.[Liang], Liu, S.[Si], Li, H.S.[Hong-Sheng],
Improving Weakly Supervised Temporal Action Localization by Bridging Train-Test Gap in Pseudo Labels,
CVPR23(23003-23012)
IEEE DOI 2309
BibRef

Zhao, C.[Chen], Liu, S.M.[Shu-Ming], Mangalam, K.[Karttikeya], Ghanem, B.[Bernard],
Re2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization,
CVPR23(10637-10647)
IEEE DOI 2309
BibRef

Kang, H.[Hyolim], Kim, H.[Hanjung], An, J.[Joungbin], Cho, M.[Minsu], Kim, S.J.[Seon Joo],
Soft-Landing Strategy for Alleviating the Task Discrepancy Problem in Temporal Action Localization Tasks,
CVPR23(6514-6523)
IEEE DOI 2309
BibRef

Seol, M.[Muah], Kim, J.[Jonghee], Moon, J.[Jinyoung],
BMRN: Boundary Matching and Refinement Network for Temporal Moment Localization with Natural Language,
ODRUM23(5571-5579)
IEEE DOI 2309
BibRef

Ren, H.[Huan], Yang, W.F.[Wen-Fei], Zhang, T.Z.[Tian-Zhu], Zhang, Y.D.[Yong-Dong],
Proposal-Based Multiple Instance Learning for Weakly-Supervised Temporal Action Localization,
CVPR23(2394-2404)
IEEE DOI 2309
BibRef

Ren, H.R.[Hao-Ran], Ren, H.[Hao], Lu, H.[Hong], Jin, C.[Cheng],
Weakly-Supervised Temporal Action Localization with Regional Similarity Consistency,
MMMod23(I: 69-81).
Springer DOI 2304
BibRef

Niu, Y.[Yanrui], Yang, J.Y.[Jing-Yao], Liang, C.[Chao], Huang, B.[Baojin], Wang, Z.Y.[Zhong-Yuan],
A Spatio-Temporal Identity Verification Method for Person-Action Instance Search in Movies,
MMMod23(I: 82-94).
Springer DOI 2304
BibRef

Rai, A.K.[Ayush K.], Krishna, T.[Tarun], Dietlmeier, J.[Julia], McGuinness, K.[Kevin], Smeaton, A.F.[Alan F.], O'Connor, N.E.[Noel E.],
Motion Aware Self-Supervision for Generic Event Boundary Detection,
WACV23(2727-2738)
IEEE DOI 2302
Representation learning, Pipelines, Task analysis, Videos, Software development management BibRef

Mahmud, T.[Tanvir], Marculescu, D.[Diana],
AVE-CLIP: AudioCLIP-based Multi-window Temporal Transformer for Audio Visual Event Localization,
WACV23(5147-5156)
IEEE DOI 2302
Location awareness, Training, Visualization, Fuses, Refining, Algorithms: Video recognition and understanding (tracking, Vision + language and/or other modalities BibRef

Zhou, J.X.[Jian-Xiong], Wu, Y.[Ying],
Temporal Feature Enhancement Dilated Convolution Network for Weakly-supervised Temporal Action Localization,
WACV23(6017-6026)
IEEE DOI 2302
Location awareness, Convolution, Feature extraction, Task analysis, Optical flow, Videos BibRef

Kang, T.K.[Tae-Kyung], Lee, G.H.[Gun-Hee], Jin, K.M.[Kyung-Min], Lee, S.W.[Seong-Whan],
Action-aware Masking Network with Group-based Attention for Temporal Action Localization,
WACV23(6047-6056)
IEEE DOI 2302
Location awareness, Computational modeling, Semantics, Benchmark testing, Feature extraction, Task analysis BibRef

Cao, M.[Meng], Yang, T.Y.[Tian-Yu], Weng, J.W.[Jun-Wu], Zhang, C.[Can], Wang, J.[Jue], Zou, Y.X.[Yue-Xian],
LocVTP: Video-Text Pre-training for Temporal Localization,
ECCV22(XXVI:38-56).
Springer DOI 2211
BibRef

Cheng, F.[Feng], Bertasius, G.[Gedas],
TallFormer: Temporal Action Localization with a Long-Memory Transformer,
ECCV22(XXXIV:503-521).
Springer DOI 2211
BibRef

Kim, Y.H.[Young Hwi], Kang, H.[Hyolim], Kim, S.J.[Seon Joo],
A Sliding Window Scheme for Online Temporal Action Localization,
ECCV22(XXXIV:653-669).
Springer DOI 2211
BibRef

Rao, V.[Varshanth], Khalil, M.I.[Md Ibrahim], Li, H.[Haoda], Dai, P.[Peng], Lu, J.W.[Ju-Wei],
Dual Perspective Network for Audio-Visual Event Localization,
ECCV22(XXXIV:689-704).
Springer DOI 2211
BibRef

Huang, J.[Jiabo], Jin, H.L.[Hai-Lin], Gong, S.G.[Shao-Gang], Liu, Y.[Yang],
Video Activity Localisation with Uncertainties in Temporal Boundary,
ECCV22(XXXIV:724-740).
Springer DOI 2211
BibRef

Aakur, S.[Sathyanarayanan], Sarkar, S.[Sudeep],
Actor-Centered Representations for Action Localization in Streaming Videos,
ECCV22(XXXVIII:70-87).
Springer DOI 2211
BibRef

Paul, S.[Sudipta], Mithun, N.C.[Niluthpol Chowdhury], Roy-Chowdhury, A.K.[Amit K.],
Text-Based Temporal Localization of Novel Events,
ECCV22(XIV:567-587).
Springer DOI 2211
BibRef

Zhang, C.L.[Chen-Lin], Wu, J.X.[Jian-Xin], Li, Y.[Yin],
ActionFormer: Localizing Moments of Actions with Transformers,
ECCV22(IV:492-510).
Springer DOI 2211
BibRef

Togashi, R.[Riku], Otani, M.[Mayu], Nakashima, Y.[Yuta], Rahtu, E.[Esa], Heikkilä, J.[Janne], Sakai, T.[Tetsuya],
AxIoU: An Axiomatically Justified Measure for Video Moment Retrieval,
CVPR22(21044-21053)
IEEE DOI 2210
Current measurement, Computational modeling, Stability analysis, Pattern recognition, Reliability, Datasets and evaluation, Video analysis and understanding BibRef

Zhang, Y.H.[Yun-Hua], Doughty, H.[Hazel], Shao, L.[Ling], Snoek, C.G.M.[Cees G. M.],
Audio-Adaptive Activity Recognition Across Video Domains,
CVPR22(13781-13790)
IEEE DOI 2210
Training, Adaptation models, Visualization, Computational modeling, Semantics, Self-supervised learning, Vision+X BibRef

Liu, W.Z.[Wei-Zhe], Tekin, B.[Bugra], Coskun, H.[Huseyin], Vineet, V.[Vibhav], Fua, P.[Pascal], Pollefeys, M.[Marc],
Learning to Align Sequential Actions in the Wild,
CVPR22(2171-2181)
IEEE DOI 2210
Representation learning, Codes, Self-supervised learning, Benchmark testing, Pattern recognition, Behavioral sciences, Video analysis and understanding BibRef

Li, W.[Wei], Chen, S.[Shimin], Gu, J.Y.[Jian-Yang], Wang, N.[Ning], Chen, C.[Chen], Guo, Y.D.[Yan-Dong],
MV-TAL: Mulit-view Temporal Action Localization in Naturalistic Driving,
AICity22(3241-3247)
IEEE DOI 2210
Location awareness, Measurement, Visualization, Aggregates, Gray-scale BibRef

Zhang, C.[Can], Yang, T.Y.[Tian-Yu], Weng, J.[Junwu], Cao, M.[Meng], Wang, J.[Jue], Zou, Y.X.[Yue-Xian],
Unsupervised Pre-training for Temporal Action Localization Tasks,
CVPR22(14011-14021)
IEEE DOI 2210
Location awareness, Representation learning, Bridges, Adaptation models, Codes, Computational modeling, Self- semi- meta- unsupervised learning BibRef

Li, J.J.[Jing-Jing], Yang, T.Y.[Tian-Yu], Ji, W.[Wei], Wang, J.[Jue], Cheng, L.[Li],
Exploring Denoised Cross-video Contrast for Weakly-supervised Temporal Action Localization,
CVPR22(19882-19892)
IEEE DOI 2210
Location awareness, Representation learning, Noise reduction, Pipelines, Memory management, Pattern recognition, retrieval BibRef

He, B.[Bo], Yang, X.[Xitong], Kang, L.[Le], Cheng, Z.[Zhiyu], Zhou, X.[Xin], Shrivastava, A.[Abhinav],
ASM-Loc: Action-aware Segment Modeling for Weakly-Supervised Temporal Action Localization,
CVPR22(13915-13925)
IEEE DOI 2210
Training, Location awareness, Computational modeling, Pipelines, Predictive models, Pattern recognition, retrieval BibRef

Xia, K.[Kun], Wang, L.[Le], Zhou, S.P.[San-Ping], Zheng, N.N.[Nan-Ning], Tang, W.[Wei],
Learning to Refactor Action and Co-occurrence Features for Temporal Action Localization,
CVPR22(13874-13883)
IEEE DOI 2210
Location awareness, Tracking, Detectors, Feature extraction, Pattern recognition, Task analysis, Motion and tracking BibRef

Bao, W.T.[Wen-Tao], Yu, Q.[Qi], Kong, Y.[Yu],
OpenTAL: Towards Open Set Temporal Action Localization,
CVPR22(2969-2979)
IEEE DOI 2210
Location awareness, Deep learning, Uncertainty, Grounding, Supervised learning, Color, Video analysis and understanding, Action and event recognition BibRef

Sridhar, D.[Deepak], Quader, N.[Niamul], Muralidharan, S.[Srikanth], Li, Y.X.[Yao-Xin], Dai, P.[Peng], Lu, J.W.[Ju-Wei],
Class Semantics-based Attention for Action Detection,
ICCV21(13719-13728)
IEEE DOI 2203
Location awareness, Semantics, Transforms, Performance gain, Benchmark testing, Proposals, Action and behavior recognition, Vision applications and systems BibRef

Huang, J.[Jiabo], Liu, Y.[Yang], Gong, S.G.[Shao-Gang], Jin, H.L.[Hai-Lin],
Cross-Sentence Temporal and Semantic Relations in Video Activity Localisation,
ICCV21(7179-7188)
IEEE DOI 2203
Training, Visualization, Image segmentation, Correlation, Annotations, Semantics, Customer relationship management, Vision+language BibRef

Xu, M.M.[Meng-Meng], Pérez-Rúa, J.M.[Juan-Manuel], Escorcia, V.[Victor], Martínez, B.[Brais], Zhu, X.T.[Xia-Tian], Zhang, L.[Li], Ghanem, B.[Bernard], Xiang, T.[Tao],
Boundary-sensitive Pre-training for Temporal Localization in Videos,
ICCV21(7200-7210)
IEEE DOI 2203
Location awareness, Annotations, Computational modeling, Manuals, Complexity theory, Task analysis, Representation learning BibRef

Nam, J.[Jinwoo], Ahn, D.C.[Dae-Chul], Kang, D.Y.[Dong-Yeop], Ha, S.J.[Seong Jong], Choi, J.H.[Jong-Hyun],
Zero-shot Natural Language Video Localization,
ICCV21(1450-1459)
IEEE DOI 2203
Understanding videos to localize moments with natural language. Location awareness, Training, Costs, Annotations, Computational modeling, Natural languages, Detectors, Visual reasoning and logical representation BibRef

Wang, Y.X.[Yu-Xuan], Gao, D.F.[Di-Fei], Yu, L.C.[Li-Cheng], Lei, W.X.[Wei-Xian], Feiszli, M.[Matt], Shou, M.Z.[Mike Zheng],
GEB+: A Benchmark for Generic Event Boundary Captioning, Grounding and Retrieval,
ECCV22(XXXV:709-725).
Springer DOI 2211
BibRef

Shou, M.Z.[Mike Zheng], Lei, S.W.X.[Stan Wei-Xian], Wang, W.Y.[Wei-Yao], Ghadiyaram, D.[Deepti], Feiszli, M.[Matt],
Generic Event Boundary Detection: A Benchmark for Event Segmentation,
ICCV21(8055-8064)
IEEE DOI 2203
Quality assurance, Codes, Annotations, Benchmark testing, Complexity theory, Cognitive science, Action and behavior recognition BibRef

Ju, C.[Chen], Zhao, P.[Peisen], Chen, S.[Siheng], Zhang, Y.[Ya], Wang, Y.F.[Yan-Feng], Tian, Q.[Qi],
Divide and Conquer for Single-frame Temporal Action Localization,
ICCV21(13435-13444)
IEEE DOI 2203
Location awareness, Training, Annotations, Estimation, Benchmark testing, Generators, Action and behavior recognition, Video analysis and understanding BibRef

Narayan, S.[Sanath], Cholakkal, H.[Hisham], Hayat, M.[Munawar], Khan, F.S.[Fahad Shahbaz], Yang, M.H.[Ming-Hsuan], Shao, L.[Ling],
D2-Net: Weakly-Supervised Action Localization via Discriminative Embeddings and Denoised Activations,
ICCV21(13588-13597)
IEEE DOI 2203
Location awareness, Codes, Noise reduction, Benchmark testing, Robustness, Mutual information, Action and behavior recognition, Recognition and classification BibRef

Lee, P.[Pilhyeon], Byun, H.R.[Hye-Ran],
Learning Action Completeness from Points for Weakly-supervised Temporal Action Localization,
ICCV21(13628-13637)
IEEE DOI 2203
Location awareness, Training, Greedy algorithms, Costs, Codes, Annotations, Action and behavior recognition, BibRef

Zhao, C.[Chen], Thabet, A.[Ali], Ghanem, B.[Bernard],
Video Self-Stitching Graph Network for Temporal Action Localization,
ICCV21(13638-13647)
IEEE DOI 2203
Location awareness, Training, Correlation, Codes, Aggregates, Task analysis, Action and behavior recognition, Video analysis and understanding BibRef

Kang, H.[Hyolim], Kim, K.[Kyungmin], Ko, Y.[Yumin], Kim, S.J.[Seon Joo],
CAG-QIL: Context-Aware Actionness Grouping via Q Imitation Learning for Online Temporal Action Localization,
ICCV21(13709-13718)
IEEE DOI 2203
Location awareness, Computational modeling, Streaming media, Proposals, Task analysis, Action and behavior recognition, Vision for robotics and autonomous vehicles BibRef

Zhang, L.Y.[Ling-Yu], Radke, R.J.[Richard J.],
Natural Language Video Moment Localization Through Query-Controlled Temporal Convolution,
WACV22(2524-2532)
IEEE DOI 2202
Location awareness, Heating systems, Visualization, Convolution, Video sequences, Natural languages, Analysis and Understanding BibRef

Trehan, S.[Shubham], Aakur, S.N.[Sathyanarayanan N.],
Towards Active Vision for Action Localization with Reactive Control and Predictive Learning,
WACV22(3391-3400)
IEEE DOI 2202
Location awareness, Training, Visualization, Supervised learning, Training data, Reinforcement learning, Observers, Vision Systems and Applications Vision for Robotics BibRef

Lee, J.T.[Jun-Tae], Jain, M.[Mihir], Yun, S.[Sungrack],
Few-Shot Common Action Localization via Cross-Attentional Fusion of Context and Temporal Dynamics,
ICCV23(10180-10189)
IEEE DOI 2401
BibRef
Earlier: A1, A3, Only:
Multi-Scale Temporal Feature Fusion for Few-Shot Action Recognition,
ICIP23(1785-1789)
IEEE DOI 2312
BibRef

Kim, H.[Hanul], Jain, M.[Mihir], Lee, J.T.[Jun-Tae], Yun, S.[Sungrack], Porikli, F.M.[Fatih M.],
Efficient Action Recognition via Dynamic Knowledge Propagation,
ICCV21(13699-13708)
IEEE DOI 2203
Knowledge engineering, Costs, Computational modeling, Action and behavior recognition, Video analysis and understanding BibRef

Lee, J.T.[Jun-Tae], Yun, S.[Sungrack], Jain, M.[Mihir],
Leaky Gated Cross-Attention for Weakly Supervised Multi-Modal Temporal Action Localization,
WACV22(817-826)
IEEE DOI 2202
Location awareness, Logic gates, Benchmark testing, Multimedia Applications BibRef

Hsieh, H.Y.[He-Yen], Chen, D.J.[Ding-Jie], Liu, T.L.[Tyng-Luh],
Contextual Proposal Network for Action Localization,
WACV22(766-775)
IEEE DOI 2202
Location awareness, Recurrent neural networks, Bidirectional control, Performance gain, Proposals, Task analysis, Multimedia Applications BibRef

Cheng, Y.[Yi], Sun, Y.[Ying], Lin, D.Y.[Dong-Yun], Lim, J.H.[Joo-Hwee],
Action Relational Graph for Weakly-Supervised Temporal Action Localization,
ICIP21(2563-2567)
IEEE DOI 2201
Location awareness, Correlation, Image processing, Task analysis, Videos, Weakly-supervised temporal action localization, Untrimmed video BibRef

Biswas, S.[Sovan], Gall, J.[Juergen],
Multiple Instance Triplet Loss for Weakly Supervised Multi-Label Action Localisation of Interacting Persons,
DYAD21(2159-2167)
IEEE DOI 2112
Training, Costs, Annotations, Task analysis, Videos BibRef

Li, Z.[Zhe], Abu Farha, Y.[Yazan], Gall, J.[Juergen],
Temporal Action Segmentation from Timestamp Supervision,
CVPR21(8361-8370)
IEEE DOI 2111
Training, Annotations, Computational modeling, Predictive models, Pattern recognition, Task analysis BibRef

Ma, J.W.[Jun-Wei], Gorti, S.K.[Satya Krishna], Volkovs, M.[Maksims], Yu, G.[Guangwei],
Weakly Supervised Action Selection Learning in Video,
CVPR21(7583-7592)
IEEE DOI 2111
Location awareness, Codes, Annotations, Predictive models, Benchmark testing, Pattern recognition BibRef

Liu, Y.[Yuan], Chen, J.Y.[Jing-Yuan], Chen, Z.F.[Zhen-Fang], Deng, B.[Bing], Huang, J.Q.[Jian-Qiang], Zhang, H.W.[Han-Wang],
The Blessings of Unlabeled Background in Untrimmed Videos,
CVPR21(6172-6181)
IEEE DOI 2111
Location awareness, Training, Visualization, Smoothing methods, Computational modeling, Pattern recognition BibRef

Li, Z.H.[Zhi-Hui], Yao, L.[Lina],
Three Birds with One Stone: Multi-Task Temporal Action Detection via Recycling Temporal Annotations,
CVPR21(4749-4758)
IEEE DOI 2111
Location awareness, Annotations, Estimation, Object segmentation, Predictive models, Recycling, Pattern recognition BibRef

Gao, S.H.[Shang-Hua], Han, Q.[Qi], Li, Z.Y.[Zhong-Yu], Peng, P.[Pai], Wang, L.[Liang], Cheng, M.M.[Ming-Ming],
Global2Local: Efficient Structure Search for Video Action Segmentation,
CVPR21(16800-16809)
IEEE DOI 2111
Codes, Probabilistic logic, Pattern recognition, Task analysis, Forecasting BibRef

Liu, X.L.[Xiao-Long], Hu, Y.[Yao], Bai, S.[Song], Ding, F.[Fei], Bai, X.[Xiang], Torr, P.H.S.[Philip H.S.],
Multi-shot Temporal Event Localization: a Benchmark,
CVPR21(12591-12601)
IEEE DOI 2111
Location awareness, TV, Codes, Annotations, Benchmark testing, Motion pictures BibRef

Wang, H.[Hao], Zha, Z.J.[Zheng-Jun], Li, L.[Liang], Liu, D.[Dong], Luo, J.B.[Jie-Bo],
Structured Multi-Level Interaction Network for Video Moment Localization via Language Query,
CVPR21(7022-7031)
IEEE DOI 2111
Location awareness, Natural languages, Benchmark testing, Pattern recognition, Proposals, Task analysis BibRef

Lin, C.M.[Chu-Ming], Xu, C.M.[Cheng-Ming], Luo, D.H.[Dong-Hao], Wang, Y.B.[Ya-Biao], Tai, Y.[Ying], Wang, C.J.[Cheng-Jie], Li, J.L.[Ji-Lin], Huang, F.Y.[Fei-Yue], Fu, Y.W.[Yan-Wei],
Learning Salient Boundary Feature for Anchor-free Temporal Action Localization,
CVPR21(3319-3328)
IEEE DOI 2111
Location awareness, Design methodology, Computational modeling, Predictive models, Feature extraction, Pattern recognition BibRef

Tirupattur, P.[Praveen], Duarte, K.[Kevin], Rawat, Y.S.[Yogesh S.], Shah, M.[Mubarak],
Modeling Multi-Label Action Dependencies for Temporal Action Localization,
CVPR21(1460-1470)
IEEE DOI 2111
Measurement, Location awareness, Codes, Network architecture, Benchmark testing, Pattern recognition BibRef

Lópcz-Sastrc, R.J.[Roberto J.], Baptista-Ríos, M.[Marcos], Rodríguez, F.J. .A.[Francisco J. Acevedo-], Martín-Martín, P.[Pilar], Maldonado-Bascón, S.[Saturnino],
Live Video Action Recognition from Unsupervised Action Proposals,
MVA21(1-6)
DOI Link 2109
Pipelines, Object segmentation, Generators, Proposals, Videos BibRef

Tan, R.[Reuben], Xu, H.J.[Hui-Juan], Saenko, K.[Kate], Plummer, B.A.[Bryan A.],
LoGAN: Latent Graph Co-Attention Network for Weakly-Supervised Video Moment Retrieval,
WACV21(2082-2091)
IEEE DOI 2106
Training, Location awareness, Annotations, Semantics, Natural languages BibRef

Rodriguez-Opazo, C.[Cristian], Marrese-Taylor, E.[Edison], Fernando, B.[Basura], Li, H.D.[Hong-Dong], Gould, S.[Stephen],
DORi: Discovering Object Relationships for Moment Localization of a Natural Language Query in a Video,
WACV21(1078-1087)
IEEE DOI 2106
Location awareness, Technological innovation, Natural languages, Benchmark testing, Feature extraction BibRef

Pardo, A.[Alejandro], Alwassel, H.[Humam], Heilbron, F.C.[Fabian Caba], Thabet, A.[Ali], Ghanem, B.[Bernard],
RefineLoc: Iterative Refinement for Weakly-Supervised Action Localization,
WACV21(3318-3327)
IEEE DOI 2106
Location awareness, Training, Detectors, Generators, Iterative methods BibRef

Rotsidis, A.[Alexandros], Lutteroth, C.[Christof], Hall, P.[Peter], Richardt, C.[Christian],
ExMaps: Long-Term Localization in Dynamic Scenes using Exponential Decay,
WACV21(2866-2875)
IEEE DOI 2106
Location awareness, Visualization, Robot vision systems, Cameras, Mobile applications BibRef

Vaudaux-Ruth, G.[Guillaume], Tong, A.C.H.[Adrien Chan-Hon], Achard, C.[Catherine],
SALAD: Self-Assessment Learning for Action Detection,
WACV21(1268-1277)
IEEE DOI 2106
BibRef
And:
ActionSpotter: Deep Reinforcement Learning Framework for Temporal Action Spotting in Videos,
ICPR21(631-638)
IEEE DOI 2105
Location awareness, Machine learning algorithms, Production, Machine learning, Performance gain, Feature extraction, Loss measurement. Measurement, Annotations, Reinforcement learning, Detectors, Streaming media, Pattern recognition BibRef

Lu, C.K.[Chong-Kai], Li, R.M.[Rui-Min], Fu, H.[Hong], Fu, B.[Bin], Wang, Y.H.[Yi-Hao], Lo, W.L.[Wai-Lun], Chi, Z.[Zheru],
Precise Temporal Localization for Complete Actions with Quantified Temporal Structure,
ICPR21(4781-4788)
IEEE DOI 2105
Location awareness, Estimation, Benchmark testing, Predictive models, Prediction algorithms, Detection algorithms BibRef

Lin, Y.B.[Yan-Bo], Wang, Y.C.A.F.[Yu-Chi-Ang Frank],
Audiovisual Transformer with Instance Attention for Audio-visual Event Localization,
ACCV20(VI:274-290).
Springer DOI 2103
BibRef

Long, F.[Fuchen], Yao, T.[Ting], Qiu, Z.F.[Zhao-Fan], Tian, X.M.[Xin-Mei], Luo, J.B.[Jie-Bo], Mei, T.[Tao],
Learning to Localize Actions from Moments,
ECCV20(III:137-154).
Springer DOI 2012
BibRef

Min, K.[Kyle], Corso, J.J.[Jason J.],
Adversarial Background-aware Loss for Weakly-supervised Temporal Activity Localization,
ECCV20(XIV:283-299).
Springer DOI 2011
BibRef

Aakur, S.[Sathyanarayanan], Sarkar, S.[Sudeep],
Action Localization Through Continual Predictive Learning,
ECCV20(XIV:300-317).
Springer DOI 2011
BibRef

Chen, S.X.[Shao-Xiang], Jiang, Y.G.[Yu-Gang],
Hierarchical Visual-textual Graph for Temporal Activity Localization via Language,
ECCV20(XX:601-618).
Springer DOI 2011
BibRef

Yang, P.W.[Peng-Wan], Hu, V.T.[Vincent Tao], Mettes, P.S.[Pascal S.], Snoek, C.G.M.[Cees G. M.],
Localizing the Common Action Among a Few Videos,
ECCV20(VII:505-521).
Springer DOI 2011
BibRef

Toering, M.[Martine], Gatopoulos, I.[Ioannis], Stol, M.[Maarten], Hu, V.T.[Vincent Tao],
Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting,
WACV22(846-856)
IEEE DOI 2202
Representation learning, Visualization, Semantics, Prototypes, Computational efficiency, Task analysis, Action and Behavior Recognition Video analysis and understanding BibRef

Yoon, S.[Sunjae], Hong, J.W.[Ji Woo], Yoon, E.[Eunseop], Kim, D.[Dahyun], Kim, J.Y.[Jun-Yeong], Yoon, H.S.[Hee Suk], Yoo, C.D.[Chang D.],
Selective Query-Guided Debiasing for Video Corpus Moment Retrieval,
ECCV22(XXXVI:185-200).
Springer DOI 2211
BibRef

Yoon, S.[Sunjae], Koo, G.[Gwanhyeong], Kim, D.[Dahyun], Yoo, C.D.[Chang D.],
SCANet: Scene Complexity Aware Network for Weakly-Supervised Video Moment Retrieval,
ICCV23(13530-13540)
IEEE DOI 2401
BibRef

Yoon, S.[Sunjae], Kim, D.[Dahyun], Hong, J.W.[Ji Woo], Kim, J.Y.[Jun-Yeong], Kim, K.[Kookhoi], Yoo, C.D.[Chang D.],
Weakly-Supervised Moment Retrieval Network for Video Corpus Moment Retrieval,
ICIP21(534-538)
IEEE DOI 2201
Training, Image processing, Natural languages, Benchmark testing, Proposals, Multi-modal video corpus moment retrieval, Weakly-supervised learning BibRef

Ma, M.[Minuk], Yoon, S.[Sunjae], Kim, J.Y.[Jun-Yeong], Lee, Y.J.[Young-Joon], Kang, S.H.[Sung-Hun], Yoo, C.D.[Chang D.],
VLANet: Video-language Alignment Network for Weakly-supervised Video Moment Retrieval,
ECCV20(XXVIII:156-171).
Springer DOI 2011
Localize the temporal moment in untrimmed video specified by natural language query. BibRef

Luo, Z.K.[Zhe-Kun], Guillory, D.[Devin], Shi, B.F.[Bai-Feng], Ke, W.[Wei], Wan, F.[Fang], Darrell, T.J.[Trevor J.], Xu, H.J.[Hui-Juan],
Weakly-supervised Action Localization with Expectation-maximization Multi-instance Learning,
ECCV20(XXIX: 729-745).
Springer DOI 2010

See also C-MIL: Continuation Multiple Instance Learning for Weakly Supervised Object Detection.
See also Discrepant Multiple Instance Learning for Weakly Supervised Object Detection. BibRef

Kanth, R.K.[R. Krishna], Ramaswamy, A.[Akshaya], Kumar, A.A.[A. Anil], Gubbi, J.[Jayavardhana], Balamuralidhar, P.,
STP-Net: Spatio-Temporal Polarization Network for action recognition using polarimetric videos,
ComputationalApp22(767-776)
IEEE DOI 2202
Deep learning, Conferences, Activity recognition, Feature extraction, Natural language processing, Sensors BibRef

Ramaswamy, A.[Akshaya], Seemakurthy, K.[Karthik], Gubbi, J.[Jayavardhana], Balamuralidhar, P.,
Video action re-localization using spatio-temporal correlation,
Activity22(192-201)
IEEE DOI 2202
Dimensionality reduction, Correlation, Databases, Convolution, Surveillance, Conferences, Neural networks BibRef

Ramaswamy, A., Seemakurthy, K., Gubbi, J., Purushothaman, B.,
Spatio-temporal action detection and localization using a hierarchical LSTM,
DeepVision20(3303-3312)
IEEE DOI 2008
Feature extraction, Microprocessors, Task analysis, Visualization, Proposals BibRef

Gong, G.Q.[Guo-Qiang], Wang, X.H.[Xing-Han], Mu, Y.D.[Ya-Dong], Tian, Q.[Qi],
Learning Temporal Co-Attention Models for Unsupervised Video Action Localization,
CVPR20(9816-9825)
IEEE DOI 2008
Training, Benchmark testing, Proposals, Task analysis, Noise measurement, Convolution, TV BibRef

Jain, M., Ghodrati, A., Snoek, C.G.M.,
ActionBytes: Learning From Trimmed Videos to Localize Actions,
CVPR20(1168-1177)
IEEE DOI 2008
Videos, Training, Feature extraction, Task analysis, Pipelines, Testing, Semantics BibRef

Zhang, D., Dai, X., Wang, Y.,
METAL: Minimum Effort Temporal Activity Localization in Untrimmed Videos,
CVPR20(3881-3891)
IEEE DOI 2008
Videos, Training, Metals, Testing, Feature extraction, Task analysis, Visualization BibRef

Eun, H.J.[Hyun-Jun], Moon, J.Y.[Jin-Young], Park, J.Y.[Jong-Youl], Jung, C.[Chanho], Kim, C.[Changick],
Learning to Discriminate Information for Online Action Detection,
CVPR20(806-815)
IEEE DOI 2008
Logic gates, Streaming media, Task analysis, Feature extraction, Benchmark testing, Telecommunications, Recurrent neural networks BibRef

Shi, B.F.[Bai-Feng], Dai, Q.[Qi], Mu, Y.D.[Ya-Dong], Wang, J.D.[Jing-Dong],
Weakly-Supervised Action Localization by Generative Attention Modeling,
CVPR20(1006-1016)
IEEE DOI 2008
Feature extraction, Task analysis, Context modeling, Training, Pipelines, Graphical models BibRef

Aliakbarian, S.[Sadegh], Saleh, F.S.[Fatemeh Sadat], Salzmann, M.[Mathieu], Petersson, L.[Lars], Gould, S.[Stephen],
A Stochastic Conditioning Scheme for Diverse Human Motion Prediction,
CVPR20(5222-5231)
IEEE DOI 2008
Perturbation methods, Stochastic processes, Decoding, Training, Predictive models, Task analysis, Diversity reception BibRef

Rodriguez-Opazo, C.[Cristian], Marrese-Taylor, E.[Edison], Saleh, F.S.[Fatemeh Sadat], Li, H.D.[Hong-Dong], Gould, S.[Stephen],
Proposal-free Temporal Moment Localization of a Natural-Language Query in Video using Guided Attention,
WACV20(2453-2462)
IEEE DOI 2006
Proposals, Task analysis, Natural languages, Visualization, Semantics, Robots BibRef

Islam, A., Radke, R.J.,
Weakly Supervised Temporal Action Localization Using Deep Metric Learning,
WACV20(536-545)
IEEE DOI 2006
Feature extraction, Measurement, Training, Task analysis, Machine learning, Feeds, Face BibRef

Rashid, M., Kjellström, H., Lee, Y.J.,
Action Graphs: Weakly-supervised Action Localization with Graph Convolution Networks,
WACV20(604-613)
IEEE DOI 2006
Convolution, Training, Feature extraction, Testing, Motion segmentation, Predictive models BibRef

Miki, D.[Daisuke], Chen, S.[Shi], Demachi, K.[Kazuyuki],
Weakly Supervised Graph Convolutional Neural Network for Human Action Localization,
WACV20(642-650)
IEEE DOI 2006
Feature extraction, Time series analysis, Training, Convolution, Machine learning, Skeleton, Supervised learning BibRef

Kwak, I.S., Guo, J., Hantman, A., Branson, K., Kriegman, D.,
Detecting the Starting Frame of Actions in Video,
WACV20(478-486)
IEEE DOI 2006
Mice, Neuroscience, Optimal matching, Neural activity, Recurrent neural networks, Task analysis, Neurons BibRef

Gleason, J., Schwarcz, S., Ranjan, R., Castillo, C.D., Chen, J., Chellappa, R.,
Activity Detection in Untrimmed Videos Using Chunk-based Classifiers,
WACVWS20(107-116)
IEEE DOI 2006
Videos, Task analysis, Proposals, Machine learning, Standards BibRef

Gleason, J., Castillo, C.D., Chellappa, R.,
Real-time Detection of Activities in Untrimmed Videos,
WACVWS20(117-125)
IEEE DOI 2006
Videos, Proposals, Cameras, Real-time systems, Training, Object detection, Measurement BibRef

Rahman, M.A., Laganičre, R.,
Single-Stage End-to-End Temporal Activity Detection in Untrimmed Videos,
CRV20(206-213)
IEEE DOI 2006
temporal activity detection, activity recognition, single-stage detection, 3D convolutional network BibRef

Narayan, S., Cholakkal, H., Khan, F.S., Shao, L.,
3C-Net: Category Count and Center Loss for Weakly-Supervised Action Localization,
ICCV19(8678-8686)
IEEE DOI 2004
Code, Counting.
WWW Link. feature extraction, image classification, image sequences, video signal processing, Motion pictures BibRef

Wu, W., He, D., Tan, X., Chen, S., Wen, S.,
Multi-Agent Reinforcement Learning Based Frame Sampling for Effective Untrimmed Video Recognition,
ICCV19(6221-6230)
IEEE DOI 2004
image classification, image motion analysis, learning (artificial intelligence), Markov processes BibRef

Gao, M.F.[Ming-Fei], Zhou, Y.B.[Ying-Bo], Xu, R.[Ran], Socher, R.[Richard], Xiong, C.M.[Cai-Ming],
WOAD: Weakly Supervised Online Action Detection in Untrimmed Videos,
CVPR21(1915-1923)
IEEE DOI 2111
Training, Annotations, Scalability, Real-time systems, Generators, Pattern recognition BibRef

Gao, M.F.[Ming-Fei], Xu, M.Z.[Ming-Ze], Davis, L.S.[Larry S.], Socher, R.[Richard], Xiong, C.M.[Cai-Ming],
StartNet: Online Detection of Action Start in Untrimmed Videos,
ICCV19(5541-5550)
IEEE DOI 2004
feature extraction, gesture recognition, image classification, image colour analysis, Training data BibRef

Wehrmann, J., Lopes, M.A., Souza, D., Barros, R.,
Language-Agnostic Visual-Semantic Embeddings,
ICCV19(5803-5812)
IEEE DOI 2004
Code, Visualization.
WWW Link. data visualisation, information retrieval, learning (artificial intelligence), Architecture BibRef

Pramono, R.R.A., Chen, Y., Fang, W.,
Hierarchical Self-Attention Network for Action Localization in Videos,
ICCV19(61-70)
IEEE DOI 2004
cameras, clutter, convolutional neural nets, image capture, image fusion, image motion analysis, image recognition, Training BibRef

Nguyen, P., Ramanan, D., Fowlkes, C.,
Weakly-Supervised Action Localization With Background Modeling,
ICCV19(5501-5510)
IEEE DOI 2004
image motion analysis, image sequences, learning (artificial intelligence), multimedia Web sites, Training data BibRef

Zhai, C.B.[Chang-Bo], Wang, L.[Le], Zhang, Q.L.[Qi-Lin], Gao, Z.N.[Zhan-Ning], Niu, Z.X.[Zhen-Xing], Zheng, N.N.[Nan-Ning], Hua, G.[Gang],
Action Co-localization in an Untrimmed Video by Graph Neural Networks,
MMMod20(I:555-567).
Springer DOI 2003
BibRef

Liu, D.C.[Dao-Chang], Jiang, T.T.[Ting-Ting], Wang, Y.Z.[Yi-Zhou],
Completeness Modeling and Context Separation for Weakly Supervised Temporal Action Localization,
CVPR19(1298-1307).
IEEE DOI 2002
BibRef

Wang, W.N.[Wei-Ning], Huang, Y.[Yan], Wang, L.[Liang],
Language-Driven Temporal Activity Localization: A Semantic Matching Reinforcement Learning Model,
CVPR19(334-343).
IEEE DOI 2002
BibRef

Li, H., Yang, J., Zhou, Y., Li, S.,
Rethinking Temporal Structure Modeling Method for Temporal Action Localization,
ICIP19(3676-3680)
IEEE DOI 1910
Action localization, spatial-temporal feature, video content analysis, supervised learning BibRef

Nguyen, P., Han, B., Liu, T., Prasad, G.,
Weakly Supervised Action Localization by Sparse Temporal Pooling Network,
CVPR18(6752-6761)
IEEE DOI 1812
Videos, Proposals, Feature extraction, Task analysis, Convolutional neural networks, Prediction algorithms BibRef

Vial, R., Zhu, H., Tian, Y., Lu, S.,
Search video action proposal with recurrent and static YOLO,
ICIP17(2035-2039)
IEEE DOI 1803
Clutter, Detectors, Dynamic programming, Labeling, Object detection, Proposals, Training, action detection, action localization, video object proposal BibRef

Shao, D.[Dian], Xiong, Y.[Yu], Zhao, Y.[Yue], Huang, Q.Q.[Qing-Qiu], Qiao, Y.[Yu], Lin, D.[Dahua],
Find and Focus: Retrieve and Localize Video Events with Natural Language Queries,
ECCV18(IX: 202-218).
Springer DOI 1810
BibRef

Sharir, G.[Gilad], Tuytelaars, T.[Tinne],
Action in chains: A chains model for action localization and classification,
WACV14(610-617)
IEEE DOI 1406
Computational modeling BibRef

Lan, T.[Tian], Wang, Y.[Yang], Mori, G.[Greg],
Discriminative figure-centric models for joint action localization and recognition,
ICCV11(2003-2010).
IEEE DOI 1201
BibRef

Ta, A.P.[Anh-Phuong], Wolf, C.[Christian], Lavoue, G.[Guillaume], Baskurt, A.[Atilla], Jolion, J.M.[Jean-Michel],
Pairwise Features for Human Action Recognition,
ICPR10(3224-3227).
IEEE DOI 1008
BibRef
And: A1, A2, A3, A4, Only:
Recognizing and Localizing Individual Activities through Graph Matching,
AVSS10(196-203).
IEEE DOI 1009
BibRef

Chapter on Motion -- Human Motion, Surveillance, Tracking, Surveillance, Activities continues in
Action Segmentation .


Last update:Sep 28, 2024 at 17:47:54