15.3.1.9 Vision-Language Navigation

Chapter Contents (Back)
Navigation. Vision-Language.

Bajcsy, R., and Nagel, H.H.,
Descriptive and Prescriptive Languages for Mobility Tasks: Are They Different?,
AIU96(280-300). BibRef 9600

Zhu, M., Chen, W., Xia, J., Ma, Y., Zhang, Y., Luo, Y., Huang, Z., Liu, L.,
Location2Vec: A Situation-Aware Representation for Visual Exploration of Urban Locations,
ITS(20), No. 10, October 2019, pp. 3981-3990.
IEEE DOI 1910
Trajectory, Visualization, Sociology, Statistics, Vehicle dynamics, Mobile handsets, Natural language processing, Human mobility, visual exploration BibRef

Li, P.[Pei], Li, X.[Xinde], Li, X.H.[Xiang-Hui], Pan, H.[Hong], Khyam, M.O., Noor-A-Rahim, M., Ge, S.S.[Shuzhi Sam],
Place perception from the fusion of different image representation,
PR(110), 2021, pp. 107680.
Elsevier DOI 2011
Indoor place perception, CNN, LSTM, Convolutional auto-encoder, Natural language BibRef


Wang, H.Q.[Han-Qing], Wang, W.[Wenguan], Shu, T.[Tianmin], Liang, W.[Wei], Shen, J.B.[Jian-Bing],
Active Visual Information Gathering for Vision-language Navigation,
ECCV20(XXII:307-322).
Springer DOI 2011
BibRef

Cao, J.[Jize], Gan, Z.[Zhe], Cheng, Y.[Yu], Yu, L.C.[Li-Cheng], Chen, Y.C.[Yen-Chun], Liu, J.J.[Jing-Jing],
Behind the Scene: Revealing the Secrets of Pre-trained Vision-and-language Models,
ECCV20(VI:565-580).
Springer DOI 2011
BibRef

Qi, Y., Wu, Q., Anderson, P., Wang, X., Wang, W.Y., Shen, C., van den Hengel, A.,
REVERIE: Remote Embodied Visual Referring Expression in Real Indoor Environments,
CVPR20(9979-9988)
IEEE DOI 2008
Task analysis, Navigation, Robots, Natural languages, Visualization, Object recognition, Indoor environments BibRef

Krantz, J.[Jacob], Wijmans, E.[Erik], Majumdar, A.[Arjun], Batra, D.[Dhruv], Lee, S.[Stefan],
Beyond the Nav-Graph: Vision-and-Language Navigation in Continuous Environments,
ECCV20(XXVIII:104-120).
Springer DOI 2011
Agents must execute low-level actions to follow natural language navigation directions. BibRef

Wang, H.[Hu], Wu, Q.[Qi], Shen, C.H.[Chun-Hua],
Soft Expert Reward Learning for Vision-and-Language Navigation,
ECCV20(IX:126-141).
Springer DOI 2011
BibRef

Kim, J., Moon, S., Rohrbach, A., Darrell, T.J., Canny, J.,
Advisable Learning for Self-Driving Vehicles by Internalizing Observation-to-Action Rules,
CVPR20(9658-9667)
IEEE DOI 2008
Visualization, Semantics, Natural languages, Image segmentation, Generators, Training, Roads BibRef

Fu, T.J.[Tsu-Jui], Wang, X.E.[Xin Eric], Peterson, M.F.[Matthew F.], Grafton, S.T.[Scott T.], Eckstein, M.P.[Miguel P.], Wang, W.Y.[William Yang],
Counterfactual Vision-and-Language Navigation via Adversarial Path Sampler,
ECCV20(VI:71-86).
Springer DOI 2011
Based on language descriptions, relate them to the environment. BibRef

Qi, Y.K.[Yuan-Kai], Pan, Z.Z.[Zi-Zheng], Zhang, S.P.[Sheng-Ping], van den Hengel, A.[Anton], Wu, Q.[Qi],
Object-and-action Aware Model for Visual Language Navigation,
ECCV20(X:303-317).
Springer DOI 2011
BibRef

Majumdar, A.[Arjun], Shrivastava, A.[Ayush], Lee, S.[Stefan], Anderson, P.[Peter], Parikh, D.[Devi], Batra, D.[Dhruv],
Improving Vision-and-language Navigation with Image-text Pairs from the Web,
ECCV20(VI:259-274).
Springer DOI 2011
BibRef

Zhu, F.D.[Feng-Da], Zhu, Y.[Yi], Chang, X.J.[Xiao-Jun], Liang, X.D.[Xiao-Dan],
Vision-Language Navigation With Self-Supervised Auxiliary Reasoning Tasks,
CVPR20(10009-10019)
IEEE DOI 2008
Task analysis, Navigation, Cognition, Trajectory, Semantics, Training, Natural languages BibRef

Hao, W., Li, C., Li, X., Carin, L., Gao, J.,
Towards Learning a Generic Agent for Vision-and-Language Navigation via Pre-Training,
CVPR20(13134-13143)
IEEE DOI 2008
Task analysis, Navigation, Visualization, Trajectory, Presses, Head, Predictive models BibRef

Yu, F., Deng, Z., Narasimhan, K., Russakovsky, O.,
Take the Scenic Route: Improving Generalization in Vision-and-Language Navigation,
VL3W20(4000-4004)
IEEE DOI 2008
Navigation, Benchmark testing, Task analysis, Natural languages, Visualization, Training data, Markov processes BibRef

Ma, C.Y.[Chih-Yao], Wu, Z.X.[Zu-Xuan], Al Regib, G.[Ghassan], Xiong, C.M.[Cai-Ming], Kira, Z.[Zsolt],
The Regretful Agent: Heuristic-Aided Navigation Through Progress Estimation,
CVPR19(6725-6733).
IEEE DOI 2002
Navigating to a goal purely from language instructions and visual information. BibRef

Ke, L.Y.M.[Li-Yi-Ming], Li, X.J.[Xiu-Jun], Bisk, Y.[Yonatan], Holtzman, A.[Ari], Gan, Z.[Zhe], Liu, J.J.[Jing-Jing], Gao, J.F.[Jian-Feng], Choi, Y.J.[Ye-Jin], Srinivasa, S.[Siddhartha],
Tactical Rewind: Self-Correction via Backtracking in Vision-And-Language Navigation,
CVPR19(6734-6742).
IEEE DOI 2002
BibRef

Wang, X.[Xin], Xiong, W.H.[Wen-Han], Wang, H.M.[Hong-Min], Wang, W.Y.[William Yang],
Look Before You Leap: Bridging Model-Free and Model-Based Reinforcement Learning for Planned-Ahead Vision-and-Language Navigation,
ECCV18(XVI: 38-55).
Springer DOI 1810
BibRef

Deng, C.R.[Chao-Rui], Wu, Q.[Qi], Wu, Q.Y.[Qing-Yao], Hu, F.Y.[Fu-Yuan], Lyu, F.[Fan], Tan, M.K.[Ming-Kui],
Visual Grounding via Accumulated Attention,
CVPR18(7746-7755)
IEEE DOI 1812
Visualization, Feature extraction, Grounding, Natural languages, Redundancy, Task analysis, Computational modeling BibRef

Anderson, P.[Peter], Wu, Q.[Qi], Teney, D.[Damien], Bruce, J.[Jake], Johnson, M.[Mark], Sünderhauf, N.[Niko], Reid, I.D.[Ian D.], Gould, S.[Stephen], van den Hengel, A.J.[Anton J.],
Vision-and-Language Navigation: Interpreting Visually-Grounded Navigation Instructions in Real Environments,
CVPR18(3674-3683)
IEEE DOI 1812
Navigation, Task analysis, Robots, Visualization, Cameras, Natural languages BibRef

Chen, H.[Howard], Suhr, A.[Alane], Misra, D.[Dipendra], Snavely, N.[Noah], Artzi, Y.[Yoav],
TOUCHDOWN: Natural Language Navigation and Spatial Reasoning in Visual Street Environments,
CVPR19(12530-12539).
IEEE DOI 2002
BibRef

Nguyen, K.[Khanh], Dey, D.[Debadeepta], Brockett, C.[Chris], Dolan, B.[Bill],
Vision-Based Navigation With Language-Based Assistance via Imitation Learning With Indirect Intervention,
CVPR19(12519-12529).
IEEE DOI 2002
BibRef

Wang, X.[Xin], Huang, Q.[Qiuyuan], Celikyilmaz, A.[Asli], Gao, J.F.[Jian-Feng], Shen, D.[Dinghan], Wang, Y.F.[Yuan-Fang], Wang, W.Y.[William Yang], Zhang, L.[Lei],
Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation,
CVPR19(6622-6631).
IEEE DOI 2002
BibRef

Khoshelham, K., Díaz-Vilariño, L.,
3D Modelling of Interior Spaces: Learning the Language of Indoor Architecture,
CloseRange14(321-326).
DOI Link 1411
BibRef

van Laere, O.[Olivier], Schockaert, S.[Steven], Dhoedt, B.[Bart],
Finding locations of Flickr resources using language models and similarity search,
ICMR11(48).
DOI Link 1301
estimate where a given photo or video was taken, using only the tags that a user has assigned BibRef

Chapter on Active Vision, Camera Calibration, Mobile Robots, Navigation, Road Following continues in
Visual SLAM: Simultaneous Location and Mapping or Matching .


Last update:Sep 19, 2021 at 21:11:01