The PASCAL Object Recognition Database Collection,
2006.
Dataset, Objects.
HTML Version. Various datasets for object recognition. Pointers to some of the
others.
MSR VTT Dataset,
A Large Video Description Dataset for Bridging Video and Language.
WWW Link.
Dataset, Visual Question Answering.
See also MSR-VTT: A Large Video Description Dataset for Bridging Video and Language.
Video Objects: A Test Database for Video Object Recognition,
2006.
Dataset, Objects.
HTML Version. 180 videos of 15 objects.
Animals with Attributes: A dataset for Attribute Based Classification,
2006.
Dataset, Objects.
WWW Link. 30,000+ images, 40 animal classes.
Image Net, ImageNet Dataset,
2014.
WWW Link.
Dataset, Objects. Large set of images (or sets of datasets) for recognition.
Related to ImageNet Challenges for recognition.
14 million+ images.
Links to Stanford and Princeton.
See also Stanford University, Computer Science Department.
See also Princeton.
Washington Ground Truth Image Database,
CBIR dataset.
Online2004
WWW Link.
Dataset.
Dataset, Retrieval.
BibRef
0400
LHI Object Datasets,
Includes hand segmentations, and annotations.
Online2004
HTML Version.
Dataset.
Dataset, Object Recognition.
Transportation images, Animals, Aerial Images, Objects,
Dataset also includes other data.
See also Lotus Hill Institute.
BibRef
0400
NEC Animal Dataset,
Online2009
WWW Link.
Dataset.
Dataset, Object Recognition. It consists of about 5000 high quality images
from 60 toy animals taken at different poses against a plain
background.
BibRef
0900
Xcavator.Net,
Online2007
WWW Link.
Dataset, Object Recognition. Photo search for professional use. Searches stock databases; you then
purchase the image for use.
Part of CogniSign LLC.
BibRef
0700
The ETH-80 Dataset,
2017
Dataset, Objects.
WWW Link.
The ETH-80 dataset contains visual object images from 8 different
categories including apples, cars, cows, cups, dogs, horses, pears and
tomatoes.
See also Covariance descriptors on a Gaussian manifold and their application to image set classification.
See also Swiss Federal Institute of Technology in Zurich.
15 Scene Dataset,
Dataset, Objects.
HTML Version. The 15 scene categories are office, kitchen, living room, bedroom,
store, industrial, tall building, inside city, street, highway, coast,
open country, mountain, forest, and suburb. Images in the dataset are
about 250*300 resolution, with 210 to 410 images per class.
Video Dataset Overview,
2021
WWW Link.
Dataset, Overview. A good collection of Video datasets for various uses, activity, instruction,
sports, etc.
Multi-Weather 4Seasons Dataset,
2021
Dataset, Driving.
WWW Link.
Vasiljevic, I.,
Kolkin, N.,
Zhang, S.,
Luo, R.,
Wang, H.,
DIODE: A Dense Indoor and Outdoor Depth Dataset,
2019
Dataset, Object Extraction.
WWW Link.
BibRef
Blanco, J.L.[Jose-Luis],
Moreno, F.A.[Francisco-Angel],
Gonzalez, J.[Javier],
A collection of outdoor robotic datasets with centimeter-accuracy
ground truth,
AutRob(27), No. 4, 2009, pp. 327.
Springer DOI
WWW Link.
Dataset, SLAM. Malaga Parking
BibRef
0900
Geusebroek, J.M.[Jan-Mark],
Burghouts, G.J.[Gertjan J.],
Smeulders, A.W.M.[Arnold W.M.],
The Amsterdam Library of Object Images,
IJCV(61), No. 1, January 2005, pp. 103-112.
DOI Link
0410
WWW Link.
Dataset, Objects. 1000 objects, over 100 images per object.
BibRef
Torralba, A.B.[Antonio B.],
Fergus, R.[Rob],
Freeman, W.T.[William T.],
80 Million Tiny Images: A Large Data Set for Nonparametric Object and
Scene Recognition,
PAMI(30), No. 11, November 2008, pp. 1958-1970.
IEEE DOI
WWW Link.
0809
BibRef
And:
CSAIL-TR-2007-024, 2007.
Dataset, Retrieval. Images from the WWW, associated with a noun. Large comprehensive dataset.
Dataset with segmentations.
BibRef
Gong, Y.C.[Yun-Chao],
Pawlowski, M.[Marcin],
Yang, F.[Fei],
Brandy, L.[Louis],
Bourdev, L.[Lubomir],
Fergus, R.[Rob],
Web scale photo hash clustering on a single machine,
CVPR15(19-27)
IEEE DOI
1510
BibRef
Russell, B.[Bryan],
Torralba, A.B.[Antonio B.],
Freeman, W.T.[William T.],
LabelMe: The Open Annotation Tool,
Online2010.
WWW Link.
1108
Dataset, Retrieval.
Code, Annotation. The site for the annotation tool, also the video version.
BibRef
Zhou, B.[Bolei],
Lapedriza, A.[Agata],
Khosla, A.[Aditya],
Oliva, A.[Aude],
Torralba, A.B.[Antonio B.],
Places: A 10 Million Image Database for Scene Recognition,
PAMI(40), No. 6, June 2018, pp. 1452-1464.
IEEE DOI
1805
Dataset, Retrieval. Context, Databases, Image recognition, Semantics, Sun, Training,
Visualization, Scene classification, deep feature, deep learning,
visual recognition
BibRef
Escalante, H.J.[Hugo Jair],
Hernandez, C.A.[Carlos A.],
Gonzalez, J.A.[Jesus A.],
Lopez-Lopez, A.,
Montes-y-Gomez, M.[Manuel],
Morales, E.F.[Eduardo F.],
Sucar, L.E.[L. Enrique],
Villasenor, L.[Luis],
Grubinger, M.[Michael],
The segmented and annotated IAPR TC-12 benchmark,
CVIU(114), No. 4, April 2010, pp. 419-428.
Elsevier DOI
1003
Dataset, Retrieval. Data set creation; Ground truth collection; Evaluation metrics;
Automatic image annotation; Image retrieval
BibRef
Russakovsky, O.[Olga],
Deng, J.[Jia],
Su, H.[Hao],
Krause, J.[Jonathan],
Satheesh, S.[Sanjeev],
Ma, S.[Sean],
Huang, Z.H.[Zhi-Heng],
Karpathy, A.[Andrej],
Khosla, A.[Aditya],
Bernstein, M.[Michael],
Berg, A.C.[Alexander C.],
Fei-Fei, L.[Li],
ImageNet Large Scale Visual Recognition Challenge,
IJCV(115), No. 3, December 2015, pp. 211-252.
Springer DOI
1512
Dataset, Object Category. Object category classification and detection on hundreds of object
categories and millions of images.
BibRef
Loh, Y.P.[Yuen Peng],
Chan, C.S.[Chee Seng],
Getting to know low-light images with the Exclusively Dark dataset,
CVIU(178), 2019, pp. 30-42.
Elsevier DOI
1812
Dataset, Low Light.
BibRef
Rosu, R.A.[Radu Alexandru],
Quenzel, J.[Jan],
Behnke, S.[Sven],
Semi-supervised Semantic Mapping Through Label Propagation with
Semantic Texture Meshes,
IJCV(128), No. 5, May 2020, pp. 1220-1238.
Springer DOI
2005
BibRef
Aizawa, K.,
Fujimoto, A.,
Otsubo, A.,
Ogawa, T.,
Matsui, Y.,
Tsubota, K.,
Ikuta, H.,
Building a Manga Dataset 'Manga109' With Annotations for Multimedia
Applications,
MultMedMag(27), No. 2, April 2020, pp. 8-18.
IEEE DOI
2006
Dataset, Manga. Machine learning, Visualization, Character recognition, Art,
Machine learning algorithms, Task analysis
BibRef
Kuznetsova, A.[Alina],
Rom, H.[Hassan],
Alldrin, N.[Neil],
Uijlings, J.[Jasper],
Krasin, I.[Ivan],
Pont-Tuset, J.[Jordi],
Kamali, S.[Shahab],
Popov, S.[Stefan],
Malloci, M.[Matteo],
Kolesnikov, A.[Alexander],
Duerig, T.[Tom],
Ferrari, V.[Vittorio],
The Open Images Dataset V4,
IJCV(128), No. 7, July 2020, pp. 1956-1981.
Springer DOI
2007
Dataset, Object Detection. 9.2M images with unified annotations.
HTML Version.
BibRef
Maugey, T.,
Toni, L.,
Large Database Compression Based on Perceived Information,
SPLetters(27), 2020, pp. 1735-1739.
IEEE DOI
2010
Covariance matrices, Compression algorithms, Databases,
Measurement, Signal processing algorithms, Image coding, Entropy,
sampling
BibRef
He, Y.[Yue],
Shen, Z.Y.[Zhe-Yan],
Cui, P.[Peng],
Towards Non-I.I.D. image classification: A dataset and baselines,
PR(110), 2021, pp. 107383.
Elsevier DOI
2011
Non-I.I.D, Dataset, Context, Bias, ConvNet, Batch balancing
BibRef
Pang, Y.,
Cao, J.,
Li, Y.,
Xie, J.,
Sun, H.,
Gong, J.,
TJU-DHD: A Diverse High-Resolution Dataset for Object Detection,
IP(30), 2021, pp. 207-219.
IEEE DOI
2011
Object detection, Feature extraction, Image resolution,
Face recognition, Proposals, Training, Face detection, Dataset,
large scale
BibRef
Xu, X.W.[Xiao-Wei],
Zhang, X.Y.[Xin-Yi],
Yu, B.[Bei],
Hu, X.B.S.[Xiao-Bo Sharon],
Rowen, C.[Christopher],
Hu, J.T.[Jing-Tong],
Shi, Y.Y.[Yi-Yu],
DAC-SDC Low Power Object Detection Challenge for UAV Applications,
PAMI(43), No. 2, February 2021, pp. 392-403.
IEEE DOI
2101
More about detection, but generally a dataset and evaluation paper.
This paper presents in detail the dataset and evaluation procedure. It
further discusses the methods developed by some of the entries as well
as representative results.
Object detection, Graphics processing units,
Field programmable gate arrays, Task analysis, low power
BibRef
Thakur, S.[Sanchari],
Bruzzone, L.[Lorenzo],
An Approach to the Generation and Analysis of Databases of Simulated
Radar Sounder Data for Performance Prediction and Target
Interpretation,
GeoRS(59), No. 10, October 2021, pp. 8269-8287.
IEEE DOI
2109
Radar, Databases, Moon, Computational modeling, Solid modeling,
Instruments, Clutter, Feature analysis, geoelectrical modeling,
similarity measure
BibRef
SynthCity:
A Large-Scale Synthetic Point Cloud,
2019.
WWW Link.
Dataset, Point Clouds. Synthetic point clouds and RGB data from a detailed city model.
WHU Datasets,
2020.
WWW Link.
Dataset, Buildings. Several datasets.
See also Wuhan University.
VisDrone Datasets,
2019.
WWW Link.
Dataset, Drone Images. Several datasets related to annual challenges.
Song, D.[Dan],
Nie, W.Z.[Wei-Zhi],
Li, W.H.[Wen-Hui],
Kankanhalli, M.[Mohan],
Liu, A.A.[An-An],
Monocular Image-Based 3-D Model Retrieval: A Benchmark,
Cyber(53), To be published.
IEEE DOI
Dataset, MI3DOR.
WWW Link.
Dataset, 3D Objects. Monocular image based 3D object retrieval
BibRef
0000
Tan, X.[Xin],
Xu, K.[Ke],
Cao, Y.[Ying],
Zhang, Y.H.[Yi-Heng],
Ma, L.Z.[Li-Zhuang],
Lau, R.W.H.[Rynson W. H.],
Night-Time Scene Parsing With a Large Real Dataset,
IP(30), 2021, pp. 9085-9098.
IEEE DOI
2112
Dataset, NightCity. Streaming media, Urban areas, Image segmentation, Annotations,
Semantics, Computer science, Automobiles, Autonomous driving,
adverse conditions
BibRef
Xie, Z.F.[Zhi-Feng],
Wang, S.[Sen],
Xu, K.[Ke],
Zhang, Z.Z.[Zhi-Zhong],
Tan, X.[Xin],
Xie, Y.[Yuan],
Ma, L.Z.[Li-Zhuang],
Boosting Night-Time Scene Parsing With Learnable Frequency,
IP(32), 2023, pp. 2386-2398.
IEEE DOI
2305
Time-frequency analysis, Frequency conversion,
Image segmentation, Context modeling, Image coding, Transformers,
frequency analysis
BibRef
Deschaud, J.E.[Jean-Emmanuel],
Duque, D.[David],
Richa, J.P.[Jean Pierre],
Velasco-Forero, S.[Santiago],
Marcotegui, B.[Beatriz],
Goulette, F.[François],
Paris-CARLA-3D: A Real and Synthetic Outdoor Point Cloud Dataset for
Challenging Tasks in 3D Mapping,
RS(13), No. 22, 2021, pp. xx-yy.
DOI Link
2112
Dataset, Point Cloud.
BibRef
Lopes, A.[Alexandre],
Souza, R.[Roberto],
Pedrini, H.[Helio],
A survey on RGB-D datasets,
CVIU(222), 2022, pp. 103489.
Elsevier DOI
2209
RGB-D data, Monocular depth estimation, Computer vision, Depth datasets
BibRef
Liu, K.[Kang],
Yang, J.[Jian],
Li, S.Y.[Sheng-Yang],
Remote-Sensing Cross-Domain Scene Classification: A Dataset and
Benchmark,
RS(14), No. 18, 2022, pp. xx-yy.
DOI Link
2209
BibRef
Liu, K.,
Wu, A.,
Wan, X.,
Li, S.Y.[Sheng-Yang],
MRSSC: A Benchmark Dataset for Multimodal Remote Sensing Scene
Classification,
ISPRS21(B2-2021: 785-792).
DOI Link
2201
BibRef
Feng, R.T.[Rui-Tao],
Li, X.H.[Xing-Hua],
Bai, J.J.[Jian-Jun],
Ye, Y.X.[Yuan-Xin],
MID: A Novel Mountainous Remote Sensing Imagery Registration Dataset
Assessed by a Coarse-to-Fine Unsupervised Cascading Network,
RS(14), No. 17, 2022, pp. xx-yy.
DOI Link
2209
BibRef
Zimmerer, D.[David],
Full, P.M.[Peter M.],
Isensee, F.[Fabian],
Jäger, P.[Paul],
Adler, T.[Tim],
Petersen, J.[Jens],
Köhler, G.[Gregor],
Ross, T.[Tobias],
Reinke, A.[Annika],
Kascenas, A.[Antanas],
Jensen, B.S.[Bjørn Sand],
O'Neil, A.Q.[Alison Q.],
Tan, J.[Jeremy],
Hou, B.[Benjamin],
Batten, J.[James],
Qiu, H.Q.[Hua-Qi],
Kainz, B.[Bernhard],
Shvetsova, N.[Nina],
Fedulova, I.[Irina],
Dylov, D.V.[Dmitry V.],
Yu, B.L.[Bao-Lun],
Zhai, J.Y.[Jian-Yang],
Hu, J.T.[Jing-Tao],
Si, R.X.[Run-Xuan],
Zhou, S.H.[Si-Hang],
Wang, S.Q.[Si-Qi],
Li, X.Y.[Xin-Yang],
Chen, X.[Xuerun],
Zhao, Y.[Yang],
Marimont, S.N.[Sergio Naval],
Tarroni, G.[Giacomo],
Saase, V.[Victor],
Maier-Hein, L.[Lena],
Maier-Hein, K.[Klaus],
MOOD 2020: A Public Benchmark for Out-of-Distribution Detection and
Localization on Medical Images,
MedImg(41), No. 10, October 2022, pp. 2728-2738.
IEEE DOI
2210
Biomedical imaging, Training, Benchmark testing, Anomaly detection,
Task analysis, Annotations, Prediction algorithms,
out-of-distribution analysis
BibRef
Ding, J.[Jian],
Xue, N.[Nan],
Xia, G.S.[Gui-Song],
Bai, X.[Xiang],
Yang, W.[Wen],
Yang, M.Y.[Michael Ying],
Belongie, S.[Serge],
Luo, J.B.[Jie-Bo],
Datcu, M.[Mihai],
Pelillo, M.[Marcello],
Zhang, L.P.[Liang-Pei],
Object Detection in Aerial Images: A Large-Scale Benchmark and
Challenges,
PAMI(44), No. 11, November 2022, pp. 7778-7796.
IEEE DOI
2210
Object detection, Earth, Libraries, Codes, Task analysis,
Software algorithms, Software, Object detection, remote sensing,
benchmark dataset
BibRef
Zachar, P.[Paulina],
Ostrowski, W.[Wojciech],
Platek-Zak, A.[Anna],
Kurczynski, Z.[Zdzislaw],
The Influence of Point Cloud Accuracy from Image Matching on
Automatic Preparation of Training Datasets for Object Detection in
UAV Images,
IJGI(11), No. 11, 2022, pp. xx-yy.
DOI Link
2212
BibRef
Liu, F.[Fan],
Chen, D.[Delong],
Du, X.Y.[Xiao-Yu],
Gao, R.Z.[Rui-Zhuo],
Xu, F.[Feng],
MEP-3M: A large-scale multi-modal E-commerce product dataset,
PR(140), 2023, pp. 109519.
Elsevier DOI
2305
Dataset, E-commerce product classification,
Fine-grained learning, Hierarchical classification, Automatic Checkout
BibRef
Feng, T.L.[Ting-Lei],
Zhai, Y.J.[Ying-Jie],
Yang, J.F.[Ju-Feng],
Liang, J.[Jie],
Fan, D.P.[Deng-Ping],
Zhang, J.[Jing],
Shao, L.[Ling],
Tao, D.C.[Da-Cheng],
IC9600: A Benchmark Dataset for Automatic Image Complexity Assessment,
PAMI(45), No. 7, July 2023, pp. 8577-8593.
IEEE DOI
2306
Integrated circuits, Complexity theory, Feature extraction,
Integrated circuit modeling, Task analysis, Entropy, Visualization,
large-scale well-annotated dataset
BibRef
Kawano, K.[Keisuke],
Kutsuna, T.[Takuro],
Tokuhisa, R.[Ryoko],
Nakamura, A.[Akihiro],
Esaki, Y.S.[Yasu-Shi],
StyleDiff: Attribute comparison between unlabeled datasets in latent
disentangled space,
IVC(138), 2023, pp. 104808.
Elsevier DOI
2310
Helps developers understand the differences between the datasets with
respect to such latent attribute distributions.
Dataset comparing, StyleSpace, Optimal transport
BibRef
Hu, F.[Fei],
Ma, Y.[Yibo],
Zhong, W.[Wei],
Ye, L.[Long],
Yang, X.[Xinyan],
Fang, L.[Li],
Zhang, Q.[Qin],
A Dataset and Benchmark for 3D Scene Plausibility Assessment,
MultMed(26), 2024, pp. 6529-6541.
IEEE DOI
2404
Image quality, Quality assessment, Task analysis,
Neural networks, Solid modeling, Semantics, plausibility assessment
BibRef
Wan, Z.J.[Zhi-Jing],
Wang, Z.X.[Zhi-Xiang],
Chung, C.[Cheukting],
Wang, Z.[Zheng],
A Survey of Dataset Refinement for Problems in Computer Vision
Datasets,
Surveys(56), No. 7, April 2024, pp. xx-yy.
DOI Link
2405
Dataset refinement, data sampling, subset selection, active learning
BibRef
Zhu, X.F.[Xue-Feng],
Xu, T.Y.[Tian-Yang],
Liu, Z.T.[Zong-Tao],
Tang, Z.Y.[Zhang-Yong],
Wu, X.J.[Xiao-Jun],
Kittler, J.V.[Josef V.],
UniMod1K: Towards a More Universal Large-Scale Dataset and Benchmark
for Multi-modal Learning,
IJCV(132), No. 8, August 2024, pp. 2845-2860.
Springer DOI
2408
BibRef
Hou, Y.J.[Yu-Jun],
Quintana, M.[Matias],
Khomiakov, M.[Maxim],
Yap, W.[Winston],
Ouyang, J.[Jiani],
Ito, K.[Koichi],
Wang, Z.[Zeyu],
Zhao, T.H.[Tian-Hong],
Biljecki, F.[Filip],
Global Streetscapes: A comprehensive dataset of 10 million
street-level images across 688 cities for urban science and analytics,
PandRS(215), 2024, pp. 216-238.
Elsevier DOI Code:
WWW Link.
2408
Urban analytics, Volunteered geographic information,
Data fusion, GeoAI, Machine learning, Spatial data infrastructure
BibRef
Liu, J.W.[Jia-Wei],
Wang, Z.J.[Zhi-Jie],
Ma, L.[Lei],
Fang, C.R.[Chun-Rong],
Bai, T.T.[Tong-Tong],
Zhang, X.F.[Xu-Fan],
Liu, J.[Jia],
Chen, Z.Y.[Zhen-Yu],
Benchmarking Object Detection Robustness against Real-World Corruptions,
IJCV(132), No. 10, October 2024, pp. 4398-4416.
Springer DOI
2410
BibRef
Yang, X.Z.[Xi-Zhong],
Guo, Q.[Qi],
Chen, W.B.[Wen-Bin],
Song, M.[Mofei],
Webly supervised 3D shape recognition,
PR(158), 2025, pp. 110982.
Elsevier DOI Code:
WWW Link.
2411
Need to label lots of data for deep learning.
3D shape dataset, 3D shape recognition, Webly supervised learning
BibRef
Upadhyay, A.[Avinash],
Dhupar, B.[Bhipanshu],
Sharma, M.[Manoj],
Shukla, A.[Ankit],
Abraham, A.[Ajith],
LWIRPOSE: A Novel Long Wave Infrared Thermal Image Pose Dataset and
Benchmark,
ICIP24(186-192)
IEEE DOI Code:
WWW Link.
2411
Legged locomotion, Surveillance, Pose estimation, Lighting,
Medical services, Photothermal effects, Benchmark testing,
Thermal 2D pose estimation
BibRef
Goyal, S.[Sachin],
Maini, P.[Pratyush],
Lipton, Z.C.[Zachary C.],
Raghunathan, A.[Aditi],
Kolter, J.Z.[J. Zico],
Scaling Laws for Data Filtering:
Data Curation Cannot be Compute Agnostic,
CVPR24(22702-22711)
IEEE DOI
2410
Training, Filtering, Computational modeling,
Graphics processing units, Data models, Scaling laws, Data Curation
BibRef
Kent, D.[Daniel],
Alyaqoub, M.[Mohammed],
Lu, X.[Xiaohu],
Khatounabadi, H.[Hamed],
Sung, K.[Kookjin],
Scheller, C.[Cole],
Dalat, A.[Alexander],
Guo, X.W.[Xin-Wei],
Bin Thabit, A.[Asma],
Whitley, R.[Roberto],
Radha, H.[Hayder],
MSU-4S: The Michigan State University Four Seasons Dataset,
CVPR24(22658-22667)
IEEE DOI Code:
WWW Link.
2410
Meteorological radar, Global navigation satellite system,
Laser radar, Spaceborne radar, Object detection, Cameras, dataset,
snow
BibRef
Shao, S.T.[Shi-Tong],
Yin, Z.[Zeyuan],
Zhou, M.[Muxin],
Zhang, X.D.[Xin-Dong],
Shen, Z.Q.[Zhi-Qiang],
Generalized Large-Scale Data Condensation via Various Backbone and
Statistical Matching,
CVPR24(16709-16718)
IEEE DOI
2410
Image segmentation, Synthetic data,
Dataset Condensation, Large-scale Dataset, Generalized Matching
BibRef
Singh, K.[Krishnakant],
Navaratnam, T.[Thanush],
Holmer, J.[Jannik],
Schaub-Meyer, S.[Simone],
Roth, S.[Stefan],
Is Synthetic Data all We Need? Benchmarking the Robustness of Models
Trained with Synthetic Images,
SyntaGen24(2505-2515)
IEEE DOI
2410
Measurement, Analytical models, Shape, Noise, Cloning,
Benchmark testing, Robustness, synthetic models, robustness, benchmarking
BibRef
Wang, Z.Y.[Zi-Yu],
Xu, Y.[Yue],
Lu, C.[Cewu],
Li, Y.L.[Yong-Lu],
Dancing with Still Images:
Video Distillation via Static-Dynamic Disentanglement,
CVPR24(6296-6304)
IEEE DOI Code:
WWW Link.
2410
Image segmentation, Systematics, Image coding, Costs, Dynamics,
Taxonomy, Machine learning
BibRef
Chang, C.[Cheng],
Long, K.Y.[Ke-Yu],
Li, Z.J.[Zi-Jian],
Rai, H.[Himanshu],
Classifier Guided Cluster Density Reduction for Dataset Selection,
VDU24(7338-7347)
IEEE DOI Code:
WWW Link.
2410
Training, Visualization, Codes, Annotations, Filtering, Data Search,
domain Transfer, deep learning
BibRef
Wei, W.[Wei],
de Schepper, T.[Tom],
Mets, K.[Kevin],
Dataset condensation with latent quantile matching,
Distill24(7703-7712)
IEEE DOI
2410
Training, Measurement, Data privacy, Memory management,
Machine learning, Current distribution, Dataset condensation,
Goodness of fit tests
BibRef
Su, D.[Duo],
Hou, J.J.[Jun-Jie],
Gao, W.Z.[Wei-Zhi],
Tian, Y.J.[Ying-Jie],
Tang, B.[Bowen],
D4M: Dataset Distillation via Disentangled Diffusion Model,
CVPR24(5809-5818)
IEEE DOI
2410
Training, Computational modeling, Prototypes,
Computer architecture, Diffusion models, Prototype Learning
BibRef
Sun, P.[Peng],
Shi, B.[Bei],
Yu, D.[Daiwei],
Lin, T.[Tao],
On the Diversity and Realism of Distilled Dataset: An Efficient
Dataset Distillation Paradigm,
CVPR24(9390-9399)
IEEE DOI Code:
WWW Link.
2410
Training, Accuracy, Computational modeling, Neural networks,
Graphics processing units, Machine learning, Dataset Distillation
BibRef
Gu, J.Y.[Jian-Yang],
Vahidian, S.[Saeed],
Kungurtsev, V.[Vyacheslav],
Wang, H.[Haonan],
Jiang, W.[Wei],
You, Y.[Yang],
Chen, Y.[Yiran],
Efficient Dataset Distillation via Minimax Diffusion,
CVPR24(15793-15803)
IEEE DOI Code:
WWW Link.
2410
Training, Image resolution, Source coding, Process control,
Diffusion processes, Diffusion models, dataset distillation,
efficient training
BibRef
Lu, Y.[Yao],
Gu, J.Y.[Jian-Yang],
Chen, X.G.[Xu-Guang],
Vahidian, S.[Saeed],
Xuan, Q.[Qi],
Exploring the Impact of Dataset Bias on Dataset Distillation,
Distill24(7656-7663)
IEEE DOI Code:
WWW Link.
2410
Training, Codes, Prevention and mitigation, Mathematical models,
Dataset Bias, Dataset Distillation
BibRef
Li, L.Z.[Long-Zhen],
Li, G.[Guang],
Togo, R.[Ren],
Maeda, K.[Keisuke],
Ogawa, T.[Takahiro],
Haseyama, M.[Miki],
Generative Dataset Distillation: Balancing Global Structure and Local
Details,
Distill24(7664-7671)
IEEE DOI
2410
Training, Shape, Face recognition, Computational modeling, Semantics,
Dataset distillation, Generative adversarial network, Optimization
BibRef
Khaki, S.[Samir],
Sajedi, A.[Ahmad],
Wang, K.[Kai],
Liu, L.Z.[Lucy Z.],
Lawryshyn, Y.A.[Yuri A.],
Plataniotis, K.N.[Konstantinos N.],
ATOM: Attention Mixer for Efficient Dataset Distillation,
Distill24(7692-7702)
IEEE DOI
2410
Training, Location awareness, Costs, Atomic layer deposition,
Performance gain, Transformers, Dataset Distillation,
Attention Matching
BibRef
He, M.[Muyang],
Yang, S.[Shuo],
Huang, T.J.[Tie-Jun],
Zhao, B.[Bo],
Large-scale Dataset Pruning with Dynamic Uncertainty,
Distill24(7713-7722)
IEEE DOI Code:
WWW Link.
2410
Training, Uncertainty, Computational modeling, Predictive models, Transformers
BibRef
Zhang, X.[Xin],
Du, J.W.[Jia-Wei],
Li, Y.S.[Yun-Song],
Xie, W.Y.[Wei-Ying],
Zhou, J.T.Y.[Joey Tian-Yi],
Spanning Training Progress: Temporal Dual-Depth Scoring (TDDS) for
Enhanced Dataset Pruning,
CVPR24(26213-26222)
IEEE DOI Code:
WWW Link.
2410
Training, Codes, Accuracy, Training data, Dynamic range
BibRef
Ge, Y.H.[Yun-Hao],
Tang, Y.[Yihe],
Xu, J.[Jiashu],
Gokmen, C.[Cem],
Li, C.S.[Cheng-Shu],
Ai, W.[Wensi],
Martinez, B.J.[Benjamin Jose],
Aydin, A.[Arman],
Anvari, M.[Mona],
Chakravarthy, A.K.[Ayush K],
Yu, H.X.[Hong-Xing],
Wong, J.[Josiah],
Srivastava, S.[Sanjana],
Lee, S.[Sharon],
Zha, S.X.[Sheng-Xin],
Itti, L.[Laurent],
Li, Y.Z.[Yun-Zhu],
Martín-Martín, R.[Roberto],
Liu, M.[Miao],
Zhang, P.[Pengchuan],
Zhang, R.[Ruohan],
Fei-Fei, L.[Li],
Wu, J.J.[Jia-Jun],
BEHAVIOR Vision Suite: Customizable Dataset Generation via Simulation,
CVPR24(22401-22412)
IEEE DOI Code:
WWW Link.
2410
Systematics, Computational modeling, Predictive models,
Benchmark testing, Cameras, Robustness, benchmarking, simulation,
embodied AI
BibRef
Mahmoud, A.[Anas],
Elhoushi, M.[Mostafa],
Abbas, A.[Amro],
Yang, Y.[Yu],
Ardalani, N.[Newsha],
Leather, H.[Hugh],
Morcos, A.S.[Ari S.],
Sieve: Multimodal Dataset Pruning Using Image Captioning Models,
CVPR24(22423-22432)
IEEE DOI
2410
Filtering, Computational modeling, Semantics, Benchmark testing,
Transformers, Vision-language, pruning
BibRef
Zhao, X.[Xiting],
Schwertfeger, S.[Sören],
3DRef: 3D Dataset and Benchmark for Reflection Detection in RGB and
Lidar Data,
3DV24(225-234)
IEEE DOI Code:
WWW Link.
2408
Point cloud compression, Image segmentation, Laser radar, Annotations,
Semantics, Supervised learning, reflection detection, benchmark
BibRef
Wu, R.W.[Rou-Wan],
Cheng, X.Y.[Xiao-Ya],
Zhu, J.L.[Jue-Lin],
Liu, Y.X.[Yu-Xiang],
Zhang, M.J.[Mao-Jun],
Yan, S.[Shen],
UAVD4L: A Large-Scale Dataset for UAV 6-DoF Localization,
3DV24(1574-1583)
IEEE DOI Code:
WWW Link.
2408
Location awareness, Visualization, Solid modeling, Target tracking,
Pipelines, Hierarchical systems, uav localization, datasets, 6-DoF localization
BibRef
Schubert, M.[Marius],
Riedlinger, T.[Tobias],
Kahl, K.[Karsten],
Kröll, D.[Daniel],
Schoenen, S.[Sebastian],
Šegvić, S.[Siniša],
Rottmann, M.[Matthias],
Identifying Label Errors in Object Detection Datasets by Loss
Inspection,
WACV24(4570-4579)
IEEE DOI
2404
Training, Reviews, Computational modeling, Object detection,
Benchmark testing, Noise measurement, Object recognition,
Image recognition and understanding
BibRef
Glatt, O.[Ortal],
Ater, Y.[Yotam],
Kim, W.S.[Woo-Shik],
Werman, S.[Shira],
Berby, O.[Oded],
Zini, Y.[Yael],
Zelinger, S.[Shay],
Lee, S.[Sangyoon],
Choi, H.[Heejin],
Soloveichik, E.[Evgeny],
Beyond RGB: A Real World Dataset for Multispectral Imaging in Mobile
Devices,
WACV24(4332-4342)
IEEE DOI
2404
Photography, Performance evaluation, Pipelines, Benchmark testing,
Sensor fusion, Cameras, Algorithms, Datasets and evaluations,
image and video synthesis
BibRef
An, G.Y.[Guo-Yuan],
Kim, W.J.[Woo Jae],
Yang, S.[Saelyne],
Li, R.[Rong],
Huo, Y.[Yuchi],
Yoon, S.E.[Sung-Eui],
Towards Content-based Pixel Retrieval in Revisited Oxford and Paris,
ICCV23(20450-20461)
IEEE DOI
2401
BibRef
Hataya, R.[Ryuichiro],
Bao, H.[Han],
Arai, H.[Hiromi],
Will Large-scale Generative Models Corrupt Future Datasets?,
ICCV23(20498-20508)
IEEE DOI Code:
WWW Link.
2401
BibRef
Niemeijer, J.[Joshua],
Mittal, S.[Sudhanshu],
Brox, T.[Thomas],
Synthetic Dataset Acquisition for a Specific Target Domain,
BRAVO23(4057-4066)
IEEE DOI
2401
BibRef
Barkan, O.[Oren],
Reiss, T.[Tal],
Weill, J.[Jonathan],
Katz, O.[Ori],
Hirsch, R.[Roy],
Malkiel, I.[Itzik],
Koenigstein, N.[Noam],
Efficient Discovery and Effective Evaluation of Visual Perceptual
Similarity: A Benchmark and Beyond,
ICCV23(19950-19961)
IEEE DOI
2401
BibRef
Wang, J.Q.[Jia-Qi],
Zhang, P.[Pan],
Chu, T.[Tao],
Cao, Y.H.[Yu-Hang],
Zhou, Y.J.[Yu-Jie],
Wu, T.[Tong],
Wang, B.[Bin],
He, C.H.[Cong-Hui],
Lin, D.[Dahua],
V3Det: Vast Vocabulary Visual Detection Dataset,
ICCV23(19787-19797)
IEEE DOI Code:
WWW Link.
2401
BibRef
Bastani, F.[Favyen],
Wolters, P.[Piper],
Gupta, R.[Ritwik],
Ferdinando, J.[Joe],
Kembhavi, A.[Aniruddha],
SatlasPretrain: A Large-Scale Dataset for Remote Sensing Image
Understanding,
ICCV23(16726-16736)
IEEE DOI Code:
WWW Link.
2401
BibRef
Xu, J.[Jiacong],
Zhang, Y.[Yi],
Peng, J.W.[Jia-Wei],
Ma, W.[Wufei],
Jesslen, A.[Artur],
Ji, P.L.[Peng-Liang],
Hu, Q.X.[Qi-Xin],
Zhang, J.H.[Jie-Hua],
Liu, Q.H.[Qi-Hao],
Wang, J.H.[Jia-Hao],
Ji, W.[Wei],
Wang, C.[Chen],
Yuan, X.D.[Xiao-Ding],
Kaushik, P.[Prakhar],
Zhang, G.F.[Guo-Feng],
Liu, J.[Jie],
Xie, Y.S.[Yu-Shan],
Cui, Y.W.[Ya-Wen],
Yuille, A.L.[Alan L.],
Kortylewski, A.[Adam],
Animal3D: A Comprehensive Dataset of 3D Animal Pose and Shape,
ICCV23(9065-9075)
IEEE DOI
2401
BibRef
Lu, C.S.[Chong-Shan],
Yin, F.[Fukun],
Chen, X.[Xin],
Liu, W.[Wen],
Chen, T.[Tao],
Yu, G.[Gang],
Fan, J.Y.[Jia-Yuan],
A Large-Scale Outdoor Multi-modal Dataset and Benchmark for Novel
View Synthesis and Implicit Scene Reconstruction,
ICCV23(7523-7533)
IEEE DOI Code:
WWW Link.
2401
BibRef
Lin, H.Z.[Hao-Zhe],
Chen, Z.Q.[Ze-Qun],
Zhang, J.Z.[Jin-Zhi],
Bai, B.[Bing],
Wang, Y.[Yu],
Huang, R.[Ruqi],
Fang, L.[Lu],
RealGraph: A Multiview Dataset for 4D Real-world Context Graph
Generation,
ICCV23(3735-3745)
IEEE DOI Code:
WWW Link.
2401
BibRef
Ypsilantis, N.A.[Nikolaos-Antonios],
Chen, K.[Kaifeng],
Cao, B.[Bingyi],
Lipovský, M.[Mário],
Dogan-Schönberger, P.[Pelin],
Makosa, G.[Grzegorz],
Bluntschli, B.[Boris],
Seyedhosseini, M.[Mojtaba],
Chum, O.[Ondrej],
Araujo, A.[André],
Towards Universal Image Embeddings: A Large-Scale Dataset and
Challenge for Generic Image Representations,
ICCV23(11256-11267)
IEEE DOI Code:
WWW Link.
2401
BibRef
Bafghi, R.A.[Reza Akbarian],
Gurari, D.[Danna],
A New Dataset Based on Images Taken by Blind People for Testing the
Robustness of Image Classification Models Trained for ImageNet
Categories,
CVPR23(16261-16270)
IEEE DOI
2309
BibRef
Kulinski, S.[Sean],
Waytowich, N.R.[Nicholas R.],
Hare, J.Z.[James Z.],
Inouye, D.I.[David I.],
StarCraftImage: A Dataset For Prototyping Spatial Reasoning Methods
For Multi-Agent Environments,
CVPR23(22004-22013)
IEEE DOI
2309
BibRef
Tejero, J.G.[Javier Gamazo],
Zinkernagel, M.S.[Martin S.],
Wolf, S.[Sebastian],
Sznitman, R.[Raphael],
Márquez-Neila, P.[Pablo],
Full or Weak Annotations? An Adaptive Strategy for Budget-Constrained
Annotation Campaigns,
CVPR23(11381-11391)
IEEE DOI
2309
Dataset annotations.
BibRef
Meng, L.C.[Ling-Chen],
Dai, X.Y.[Xi-Yang],
Chen, Y.P.[Yin-Peng],
Zhang, P.C.[Peng-Chuan],
Chen, D.D.[Dong-Dong],
Liu, M.C.[Meng-Chen],
Wang, J.F.[Jian-Feng],
Wu, Z.X.[Zu-Xuan],
Yuan, L.[Lu],
Jiang, Y.G.[Yu-Gang],
Detection Hub: Unifying Object Detection Datasets via Query
Adaptation on Language Embedding,
CVPR23(11402-11411)
IEEE DOI
2309
BibRef
Gao, R.[Ruohan],
Dou, Y.M.[Yi-Ming],
Li, H.[Hao],
Agarwal, T.[Tanmay],
Bohg, J.[Jeannette],
Li, Y.Z.[Yun-Zhu],
Fei-Fei, L.[Li],
Wu, J.J.[Jia-Jun],
The Object Folder Benchmark:
Multisensory Learning with Neural and Real Objects,
CVPR23(17276-17286)
IEEE DOI
2309
BibRef
Bravo, M.A.[María A.],
Mittal, S.[Sudhanshu],
Ging, S.[Simon],
Brox, T.[Thomas],
Open-vocabulary Attribute Detection,
CVPR23(7041-7050)
IEEE DOI
2309
Task and benchmark.
BibRef
Low, S.[Spencer],
Nina, O.[Oliver],
Sappa, A.D.[Angel D.],
Blasch, E.[Erik],
Inkawhich, N.[Nathan],
Multi-modal Aerial View Object Classification Challenge Results -
PBVS 2023,
PBVS23(412-421)
IEEE DOI
2309
BibRef
Li, K.[Kejie],
Bian, J.W.[Jia-Wang],
Castle, R.[Robert],
Torr, P.H.S.[Philip H.S.],
Prisacariu, V.A.[Victor Adrian],
MobileBrick: Building LEGO for 3D Reconstruction on Mobile Devices,
CVPR23(4892-4901)
IEEE DOI
2309
BibRef
Zhao, G.[Ganlong],
Li, G.B.[Guan-Bin],
Qin, Y.P.[Yi-Peng],
Yu, Y.Z.[Yi-Zhou],
Improved Distribution Matching for Dataset Condensation,
CVPR23(7856-7865)
IEEE DOI
2309
BibRef
Xu, A.[Austin],
Vasileva, M.I.[Mariya I.],
Dave, A.[Achal],
Seshadri, A.[Arjun],
HandsOff: Labeled Dataset Generation With No Additional Human
Annotations,
CVPR23(7991-8000)
IEEE DOI
2309
BibRef
Liu, S.[Songhua],
Ye, J.W.[Jing-Wen],
Yu, R.[Runpeng],
Wang, X.C.[Xin-Chao],
Slimmable Dataset Condensation,
CVPR23(3759-3768)
IEEE DOI
2309
BibRef
Sadasivan, V.S.[Vinu Sankar],
Soltanolkotabi, M.[Mahdi],
Feizi, S.[Soheil],
CUDA: Convolution-Based Unlearnable Datasets,
CVPR23(3862-3871)
IEEE DOI
2309
BibRef
Riz, L.[Luigi],
Caraffa, A.[Andrea],
Bortolon, M.[Matteo],
Mekhalfi, M.L.[Mohamed Lamine],
Boscaini, D.[Davide],
Moura, A.[André],
Antunes, J.[José],
Dias, A.[André],
Silva, H.[Hugo],
Leonidou, A.[Andreas],
Constantinides, C.[Christos],
Keleshis, C.[Christos],
Abate, D.[Dante],
Poiesi, F.[Fabio],
The MONET dataset: Multimodal drone thermal dataset recorded in rural
scenarios,
MULA23(2546-2554)
IEEE DOI
2309
BibRef
Deitke, M.[Matt],
Schwenk, D.[Dustin],
Salvador, J.[Jordi],
Weihs, L.[Luca],
Michel, O.[Oscar],
VanderBilt, E.[Eli],
Schmidt, L.[Ludwig],
Ehsani, K.[Kiana],
Kembhavi, A.[Aniruddha],
Farhadi, A.[Ali],
Objaverse: A Universe of Annotated 3D Objects,
CVPR23(13142-13153)
IEEE DOI
2309
BibRef
Yu, X.G.[Xiang-Gang],
Xu, M.[Mutian],
Zhang, Y.[Yidan],
Liu, H.L.[Hao-Lin],
Ye, C.J.[Chong-Jie],
Wu, Y.S.[Yu-Shuang],
Yan, Z.Z.[Zi-Zheng],
Zhu, C.M.[Chen-Ming],
Xiong, Z.Y.[Zhang-Yang],
Liang, T.Y.[Tian-You],
Chen, G.Y.[Guan-Ying],
Cui, S.G.[Shu-Guang],
Han, X.G.[Xiao-Guang],
MVImgNet: A Large-scale Dataset of Multi-view Images,
CVPR23(9150-9161)
IEEE DOI
2309
BibRef
Xiong, Z.Y.[Zhang-Yang],
Li, C.[Chenghong],
Liu, K.[Kenkun],
Liao, H.J.[Hong-Jie],
Hu, J.Q.[Jian-Qiao],
Zhu, J.[Junyi],
Ning, S.[Shuliang],
Qiu, L.[Lingteng],
Wang, C.[Chongjie],
Wang, S.J.[Shi-Jie],
Cui, S.G.[Shu-Guang],
Han, X.G.[Xiao-Guang],
MVHumanNet: A Large-Scale Dataset of Multi-View Daily Dressing Human
Captures,
CVPR24(19801-19811)
IEEE DOI
2410
Visualization, Technological innovation, Solid modeling,
Annotations, Text to image, Neural radiance field
BibRef
Gochoo, M.[Munkhjargal],
Otgonbold, M.E.[Munkh-Erdene],
Ganbold, E.[Erkhembayar],
Hsieh, J.W.[Jun-Wei],
Chang, M.C.[Ming-Ching],
Chen, P.Y.[Ping-Yang],
Dorj, B.[Byambaa],
Jassmi, H.A.[Hamad Al],
Batnasan, G.[Ganzorig],
Alnajjar, F.[Fady],
Abduljabbar, M.[Mohammed],
Lin, F.P.[Fang-Pang],
FishEye8K: A Benchmark and Dataset for Fisheye Camera Object
Detection,
AICity23(5305-5313)
IEEE DOI
2309
BibRef
Lamb, N.[Nikolas],
Palmer, C.[Cameron],
Molloy, B.[Benjamin],
Banerjee, S.[Sean],
Banerjee, N.K.[Natasha Kholgade],
Fantastic Breaks: A Dataset of Paired 3D Scans of Real-World Broken
Objects and Their Complete Counterparts,
CVPR23(4681-4691)
IEEE DOI
2309
BibRef
Jung, H.J.[Hyun-Jun],
Ruhkamp, P.[Patrick],
Zhai, G.Y.[Guang-Yao],
Brasch, N.[Nikolas],
Li, Y.T.[Yi-Tong],
Verdie, Y.[Yannick],
Song, J.F.[Ji-Fei],
Zhou, Y.[Yiren],
Armagan, A.[Anil],
Ilic, S.[Slobodan],
Leonardis, A.[Ales],
Navab, N.[Nassir],
Busam, B.[Benjamin],
On the Importance of Accurate Geometry Data for Dense 3D Vision Tasks,
CVPR23(780-791)
IEEE DOI
2309
BibRef
Wu, T.[Tong],
Zhang, J.R.[Jia-Rui],
Fu, X.[Xiao],
Wang, Y.X.[Yu-Xin],
Ren, J.W.[Jia-Wei],
Pan, L.[Liang],
Wu, W.[Wayne],
Yang, L.[Lei],
Wang, J.Q.[Jia-Qi],
Qian, C.[Chen],
Lin, D.[Dahua],
Liu, Z.W.[Zi-Wei],
OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic
Perception, Reconstruction and Generation,
CVPR23(803-814)
IEEE DOI
2309
BibRef
Mehl, L.[Lukas],
Schmalfuss, J.[Jenny],
Jahedi, A.[Azin],
Nalivayko, Y.[Yaroslava],
Bruhn, A.[Andrés],
Spring: A High-Resolution High-Detail Dataset and Benchmark for Scene
Flow, Optical Flow and Stereo,
CVPR23(4981-4991)
IEEE DOI
2309
BibRef
Bonfiglioli, L.[Luca],
Toschi, M.[Marco],
Silvestri, D.[Davide],
Fioraio, N.[Nicola],
de Gregorio, D.[Daniele],
The Eyecandies Dataset for Unsupervised Multimodal Anomaly Detection
and Localization,
ACCV22(V:459-475).
Springer DOI
2307
BibRef
Bailer, W.[Werner],
Fassold, H.[Hannes],
People@Places and ToDY: Two Datasets for Scene Classification in Media
Production and Archiving,
MMMod23(I: 489-501).
Springer DOI
2304
Adds annotations to the Places365 dataset.
BibRef
Truong, Q.T.[Quang-Trung],
Vu, T.A.[Tuan-Anh],
Ha, T.S.[Tan-Sang],
Lokoc, J.[Jakub],
Wong, Y.H.[Yue-Him],
Joneja, A.[Ajay],
Yeung, S.K.[Sai-Kit],
Marine Video Kit:
A New Marine Video Dataset for Content-Based Analysis and Retrieval,
MMMod23(I: 539-550).
Springer DOI
2304
BibRef
Zhao, B.[Bo],
Bilen, H.[Hakan],
Dataset Condensation with Distribution Matching,
WACV23(6503-6512)
IEEE DOI
2302
Training, Costs, Computational modeling, Computational efficiency,
Task analysis, Algorithms: Machine learning architectures,
and algorithms (including transfer)
BibRef
Athar, A.[Ali],
Luiten, J.[Jonathon],
Voigtlaender, P.[Paul],
Khurana, T.[Tarasha],
Dave, A.[Achal],
Leibe, B.[Bastian],
Ramanan, D.[Deva],
BURST: A Benchmark for Unifying Object Recognition, Segmentation and
Tracking in Video,
WACV23(1674-1683)
IEEE DOI
2302
Measurement, Training, Vocabulary, Target tracking, Annotations,
Taxonomy, Benchmark testing
BibRef
Zhang, Y.H.[Yuan-Han],
Yin, Z.F.[Zhen-Fei],
Shao, J.[Jing],
Liu, Z.W.[Zi-Wei],
Benchmarking Omni-Vision Representation Through the Lens of Visual
Realms,
ECCV22(VII:594-611).
Springer DOI
2211
WWW Link. OmniBenchmark using 21 datasets.
Open Set analysis.
BibRef
Chun, S.[Sanghyuk],
Kim, W.[Wonjae],
Park, S.[Song],
Chang, M.[Minsuk],
Oh, S.J.[Seong Joon],
ECCV Caption: Correcting False Negatives by Collecting
Machine-and-Human-verified Image-Caption Associations for MS-COCO,
ECCV22(VIII:1-19).
Springer DOI
2211
BibRef
Cai, Z.A.[Zhong-Ang],
Ren, D.[Daxuan],
Zeng, A.[Ailing],
Lin, Z.Y.[Zheng-Yu],
Yu, T.[Tao],
Wang, W.J.[Wen-Jia],
Fan, X.Y.[Xiang-Yu],
Gao, Y.[Yang],
Yu, Y.F.[Yi-Fan],
Pan, L.[Liang],
Hong, F.Z.[Fang-Zhou],
Zhang, M.Y.[Ming-Yuan],
Loy, C.C.[Chen Change],
Yang, L.[Lei],
Liu, Z.W.[Zi-Wei],
HuMMan: Multi-modal 4D Human Dataset for Versatile Sensing and Modeling,
ECCV22(VII:557-577).
Springer DOI
2211
BibRef
Shrestha, R.[Rakesh],
Hu, S.Q.[Si-Qi],
Gou, M.H.[Ming-Hao],
Liu, Z.Y.[Zi-Yuan],
Tan, P.[Ping],
A Real World Dataset for Multi-view 3D Reconstruction,
ECCV22(VIII:56-73).
Springer DOI
2211
BibRef
Lin, L.Q.[Li-Qiang],
Liu, Y.L.[Yi-Lin],
Hu, Y.[Yue],
Yan, X.G.[Xing-Guang],
Xie, K.[Ke],
Huang, H.[Hui],
Capturing, Reconstructing, and Simulating: The UrbanScene3D Dataset,
ECCV22(VIII:93-109).
Springer DOI
2211
BibRef
He, J.[Ju],
Yang, S.[Shuo],
Yang, S.K.[Shao-Kang],
Kortylewski, A.[Adam],
Yuan, X.D.[Xiao-Ding],
Chen, J.N.[Jie-Neng],
Liu, S.[Shuai],
Yang, C.[Cheng],
Yu, Q.H.[Qi-Hang],
Yuille, A.L.[Alan L.],
PartImageNet: A Large, High-Quality Dataset of Parts,
ECCV22(VIII:128-145).
Springer DOI
2211
BibRef
Xu, H.[Hang],
Zhao, Q.[Qiang],
Ma, Y.[Yike],
Li, X.D.[Xiao-Dong],
Yuan, P.[Peng],
Feng, B.[Bailan],
Yan, C.G.[Cheng-Gang],
Dai, F.[Feng],
PANDORA: A Panoramic Detection Dataset for Object with Orientation,
ECCV22(VIII:237-252).
Springer DOI
2211
BibRef
Xu, H.[Hang],
Liu, X.Y.[Xin-Yuan],
Zhao, Q.[Qiang],
Ma, Y.[Yike],
Yan, C.G.[Cheng-Gang],
Dai, F.[Feng],
Gaussian Label Distribution Learning for Spherical Image Object
Detection,
CVPR23(1033-1042)
IEEE DOI
2309
BibRef
Uijlings, J.[Jasper],
Mensink, T.[Thomas],
Ferrari, V.[Vittorio],
The Missing Link: Finding Label Relations Across Datasets,
ECCV22(VIII:540-556).
Springer DOI
2211
BibRef
Lin, X.T.[Xiao-Tian],
Xu, L.[Leiyang],
Wang, Q.[Qiang],
Automatic Dataset Generation for Specific Object Detection,
ICIP22(3076-3080)
IEEE DOI
2211
Image segmentation, Buildings, Morphology, Object detection,
Visual systems, Approximation algorithms, Data models,
Image processing
BibRef
Beghdadi, A.[Ayman],
Mallem, M.[Malik],
Beji, L.[Lotfi],
Benchmarking Performance of Object Detection Under Image Distortions
in an Uncontrolled Environment,
ICIP22(2071-2075)
IEEE DOI
2211
Training, Performance evaluation, Codes, Databases, Object detection,
Benchmark testing, Deep learning, Object detection, Distortion,
Benchmarking
BibRef
Helm, D.[Daniel],
Jogl, F.[Fabian],
Kampel, M.[Martin],
Historian: A Large-Scale Historical Film Dataset with Cinematographic
Annotation,
ICIP22(2087-2091)
IEEE DOI
2211
Annotations, Pipelines, Object detection, Benchmark testing, Cameras,
Historical Film Dataset, Film Archives, Deep Learning, Cinematographic Data
BibRef
Li, X.K.[Xin-Ke],
Ding, H.H.[Heng-Hui],
Tong, Z.K.[Ze-Kun],
Wu, Y.W.[Yu-Wei],
Chee, Y.M.[Yeow Meng],
Primitive3D: 3D Object Dataset Synthesis from Randomly Assembled
Primitives,
CVPR22(15926-15936)
IEEE DOI
2210
Training, Solid modeling, Annotations, Computational modeling,
Soft sensors, 3D from multi-view and sensors, Representation learning
BibRef
Batalo, B.[Bojan],
Souza, L.S.[Lincon S.],
Gatto, B.B.[Bernardo B.],
Sogi, N.[Naoya],
Fukui, K.[Kazuhiro],
Analysis of Temporal Tensor Datasets on Product Grassmann Manifold,
VDU22(4868-4876)
IEEE DOI
2210
Manifolds, Measurement, Geometry, Tensors, Data visualization,
Gyroscopes
BibRef
Gao, R.[Ruohan],
Si, Z.[Zilin],
Chang, Y.Y.[Yen-Yu],
Clarke, S.[Samuel],
Bohg, J.[Jeannette],
Fei-Fei, L.[Li],
Yuan, W.Z.[Wen-Zhen],
Wu, J.J.[Jia-Jun],
ObjectFolder 2.0: A Multisensory Object Dataset for Sim2Real Transfer,
CVPR22(10588-10598)
IEEE DOI
2210
Location awareness, Visualization, Shape, Computational modeling,
Estimation, Rendering (computer graphics), Vision + X, Vision + graphics
BibRef
Wang, Y.[Ye],
Mu, N.[Norman],
Grandi, D.[Daniele],
Savva, N.[Nicolas],
Steinhardt, J.[Jacob],
A3D: Studying Pretrained Representations with Programmable Datasets,
VDU22(4877-4885)
IEEE DOI
2210
Image synthesis, Computational modeling, Pipelines,
Transfer learning, Consumer products, Data visualization
BibRef
Bennequin, E.[Etienne],
Tami, M.[Myriam],
Toubhans, A.[Antoine],
Hudelot, C.[Céline],
Few-Shot Image Classification Benchmarks are Too Far From Reality:
Build Back Better with Semantic Task Sampling,
VDU22(4766-4775)
IEEE DOI
2210
Fungi, Training, Limiting, Shape, Semantics, Benchmark testing,
Pattern recognition
BibRef
Jiang, H.[Han],
Li, Z.[Zeqian],
Whitehill, J.[Jacob],
Can the Mathematical Correctness of Object Configurations Affect the
Accuracy of Their Perception?,
VDU22(4759-4765)
IEEE DOI
2210
Training, Computational modeling, Semantics, Symbols,
Object detection, Mathematical models
BibRef
Ma, J.X.[Jia-Xin],
Ushiku, Y.[Yoshitaka],
Sagara, M.[Miori],
The Effect of Improving Annotation Quality on Object Detection
Datasets: A Preliminary Study,
VDU22(4849-4858)
IEEE DOI
2210
Annotations, Object detection, Machine learning,
Benchmark testing, Internet
BibRef
Collins, J.[Jasmine],
Goel, S.[Shubham],
Deng, K.[Kenan],
Luthra, A.[Achleshwar],
Xu, L.[Leon],
Gundogdu, E.[Erhan],
Zhang, X.[Xi],
Vicente, T.F.Y.[Tomas F. Yago],
Dideriksen, T.[Thomas],
Arora, H.[Himanshu],
Guillaumin, M.[Matthieu],
Malik, J.[Jitendra],
ABO: Dataset and Benchmarks for Real-World 3D Object Understanding,
CVPR22(21094-21104)
IEEE DOI
2210
Training, Geometry, Solid modeling, Azimuth, Navigation, Estimation,
Datasets and evaluation, 3D from single images
BibRef
Jung, H.J.[Hyun-Jun],
Wu, S.C.[Shun-Cheng],
Ruhkamp, P.[Patrick],
Zhai, G.Y.[Guang-Yao],
Schieber, H.[Hannah],
Rizzoli, G.[Giulia],
Wang, P.Y.[Peng-Yuan],
Zhao, H.C.[Hong-Cheng],
Garattoni, L.[Lorenzo],
Roth, D.[Daniel],
Meier, S.[Sven],
Navab, N.[Nassir],
Busam, B.[Benjamin],
HouseCat6D: A Large-Scale Multi-Modal Category Level 6D Object
Perception Dataset with Household Objects in Realistic Scenarios,
CVPR24(22498-22508)
IEEE DOI
2410
Annotations, Pose estimation, Robot vision systems, Buildings,
Grasping, Category Level 6D Pose Estimation, Robotic Grasping
BibRef
Wang, P.Y.[Peng-Yuan],
Jung, H.J.[Hyun-Jun],
Li, Y.T.[Yi-Tong],
Shen, S.Y.[Si-Yuan],
Srikanth, R.P.[Rahul Parthasarathy],
Garattoni, L.[Lorenzo],
Meier, S.[Sven],
Navab, N.[Nassir],
Busam, B.[Benjamin],
PhoCaL: A Multi-Modal Dataset for Category-Level Object Pose
Estimation with Photometrically Challenging Objects,
CVPR22(21190-21199)
IEEE DOI
2210
Solid modeling, Annotations, Shape, Pose estimation, Pipelines,
Robot vision systems, Datasets and evaluation,
Pose estimation and tracking
BibRef
Kataoka, H.[Hirokatsu],
Hayamizu, R.[Ryo],
Yamada, R.[Ryosuke],
Nakashima, K.[Kodai],
Takashima, S.[Sora],
Zhang, X.Y.[Xin-Yu],
Martinez-Noriega, E.J.[Edgar Josafat],
Inoue, N.[Nakamasa],
Yokota, R.[Rio],
Replacing Labeled Real-image Datasets with Auto-generated Contours,
CVPR22(21200-21209)
IEEE DOI
2210
Costs, Shape, Supervised learning, Transformers, Fractals,
Datasets and evaluation,
Self-, semi-, meta-, and unsupervised learning
BibRef
Li, D.[Daiqing],
Ling, H.[Huan],
Kim, S.W.[Seung Wook],
Kreis, K.[Karsten],
Fidler, S.[Sanja],
Torralba, A.[Antonio],
BigDatasetGAN: Synthesizing ImageNet with Pixel-wise Annotations,
CVPR22(21298-21308)
IEEE DOI
2210
Training, Image segmentation, Benchmark testing,
Generative adversarial networks, Generators,
Self- semi- meta- Transfer/low-shot/long-tail learning
BibRef
Yu, H.[Haibao],
Luo, Y.Z.[Yi-Zhen],
Shu, M.[Mao],
Huo, Y.[Yiyi],
Yang, Z.[Zebang],
Shi, Y.F.[Yi-Feng],
Guo, Z.L.[Zheng-Long],
Li, H.Y.[Han-Yu],
Hu, X.[Xing],
Yuan, J.[Jirui],
Nie, Z.[Zaiqing],
DAIR-V2X: A Large-Scale Dataset for Vehicle-Infrastructure
Cooperative 3D Object Detection,
CVPR22(21329-21338)
IEEE DOI
2210
Costs, Annotations, Object detection, Benchmark testing, Sensors,
Datasets and evaluation, 3D from multi-view and sensors,
Vision applications and systems
BibRef
Greff, K.[Klaus],
Belletti, F.[Francois],
Beyer, L.[Lucas],
Doersch, C.[Carl],
Du, Y.L.[Yi-Lun],
Duckworth, D.[Daniel],
Fleet, D.J.[David J.],
Gnanapragasam, D.[Dan],
Golemo, F.[Florian],
Herrmann, C.[Charles],
Kipf, T.[Thomas],
Kundu, A.[Abhijit],
Lagun, D.[Dmitry],
Laradji, I.[Issam],
Liu, H.T.[Hsueh-Ti],
Meyer, H.[Henning],
Miao, Y.[Yishu],
Nowrouzezahrai, D.[Derek],
Oztireli, C.[Cengiz],
Pot, E.[Etienne],
Radwan, N.[Noha],
Rebain, D.[Daniel],
Sabour, S.[Sara],
Sajjadi, M.S.M.[Mehdi S. M.],
Sela, M.[Matan],
Sitzmann, V.[Vincent],
Stone, A.[Austin],
Sun, D.Q.[De-Qing],
Vora, S.[Suhani],
Wang, Z.Y.[Zi-Yu],
Wu, T.H.[Tian-Hao],
Yi, K.M.[Kwang Moo],
Zhong, F.C.[Fang-Cheng],
Tagliasacchi, A.[Andrea],
Kubric: A scalable dataset generator,
CVPR22(3739-3751)
IEEE DOI
2210
Training, Data privacy, Solid modeling, Annotations, Pipelines,
Training data, Image and video synthesis and generation,
Self- semi- meta- Video analysis and understanding
BibRef
Rangnekar, A.[Aneesh],
Mulhollan, Z.[Zachary],
Vodacek, A.[Anthony],
Hoffman, M.[Matthew],
Sappa, A.[Angel],
Blasch, E.[Erik],
Yu, J.[Jun],
Zhang, L.W.[Li-Wen],
Du, S.S.[Shen-Shen],
Chang, H.[Hao],
Lu, K.[Keda],
Zhang, Z.[Zhong],
Gao, F.[Fang],
Yu, Y.[Ye],
Shuang, F.[Feng],
Wang, L.[Lei],
Ling, Q.[Qiang],
Shyam, P.[Pranjay],
Yoon, K.J.[Kuk-Jin],
Kim, K.S.[Kyung-Soo],
Semi-Supervised Hyperspectral Object Detection Challenge Results:
PBVS 2022,
PBVS22(389-397)
IEEE DOI
2210
Training, Training data, Object detection,
Semisupervised learning, Transformers
BibRef
Eulig, E.[Elias],
Saranrittichai, P.[Piyapat],
Mummadi, C.K.[Chaithanya Kumar],
Rambach, K.[Kilian],
Beluch, W.[William],
Shi, X.[Xiahan],
Fischer, V.[Volker],
DiagViB-6: A Diagnostic Benchmark Suite for Vision Models in the
Presence of Shortcut and Generalization Opportunities,
ICCV21(10635-10644)
IEEE DOI
2203
Measurement, Deep learning, Visualization, Correlation, Shape,
Image color analysis, Neural networks, Datasets and evaluation,
Recognition and classification
BibRef
Eftekhar, A.[Ainaz],
Sax, A.[Alexander],
Malik, J.[Jitendra],
Zamir, A.[Amir],
Omnidata: A Scalable Pipeline for Making Multi-Task Mid-Level Vision
Datasets from 3D Scans,
ICCV21(10766-10776)
IEEE DOI
2203
Computational modeling, Pipelines, Estimation, Benchmark testing,
Multitasking, Datasets and evaluation,
Vision for robotics and autonomous vehicles
BibRef
Peres, V.M.X.[Vitor Miguel Xavier],
Musse, S.R.[Soraia Raupp],
Towards the Creation of Spontaneous Datasets Based on Youtube Reaction
Videos,
ISVC21(II:203-215).
Springer DOI
2112
BibRef
Kriegler, A.[Andreas],
Steininger, D.[Daniel],
Wöber, W.[Wilfried],
Visual Semantic Context Encoding for Aerial Data Introspection and
Domain Prediction,
IbPRIA22(433-446).
Springer DOI
2205
BibRef
Steininger, D.[Daniel],
Widhalm, V.[Verena],
Simon, J.[Julia],
Kriegler, A.[Andreas],
Sulzbacher, C.[Christoph],
The Aircraft Context Dataset: Understanding and Optimizing Data
Variability in Aerial Domains,
AOTW21(3816-3825)
IEEE DOI
2112
Training, Annotations, Atmospheric modeling, Semantics,
Pose estimation, Focusing, Data models
BibRef
LeBauer, D.[David],
Burnette, M.[Max],
Fahlgren, N.[Noah],
Kooper, R.[Rob],
McHenry, K.[Kenton],
Stylianou, A.[Abby],
What Does TERRA-REF's High Resolution, Multi Sensor Plant Sensing
Public Domain Data Offer the Computer Vision Community?,
CVPPA21(1409-1415)
IEEE DOI
2112
Wavelength measurement, Measurement by laser beam, Cameras
BibRef
Pham, K.[Khoi],
Kafle, K.[Kushal],
Lin, Z.[Zhe],
Ding, Z.H.[Zhi-Hong],
Cohen, S.[Scott],
Tran, Q.[Quan],
Shrivastava, A.[Abhinav],
Learning to Predict Visual Attributes in the Wild,
CVPR21(13013-13023)
IEEE DOI
2111
WWW Link.
Dataset, VAW. Geometry, Visualization, Shape,
Image color analysis, Annotations, Prediction algorithms
BibRef
Zhou, Q.[Qiang],
Wang, S.Y.[Shi-Yin],
Wang, Y.T.[Yi-Tong],
Huang, Z.L.[Zi-Long],
Wang, X.G.[Xing-Gang],
Human De-occlusion: Invisible Perception and Recovery for Humans,
CVPR21(3690-3700)
IEEE DOI
2111
WWW Link.
Dataset, Human Occlusion. Annotations, Aggregates, Refining,
Predictive models, Task analysis
BibRef
Changpinyo, S.[Soravit],
Sharma, P.[Piyush],
Ding, N.[Nan],
Soricut, R.[Radu],
Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To
Recognize Long-Tail Visual Concepts,
CVPR21(3557-3567)
IEEE DOI
2111
Dataset, Image Captioning. Conceptual 12M (CC12M), a dataset with 12 million image-text pairs.
Visualization, Image recognition, Pipelines,
Benchmark testing, Data collection, Knowledge discovery
BibRef
Miao, J.[Jiaxu],
Wei, Y.C.[Yun-Chao],
Wu, Y.[Yu],
Liang, C.[Chen],
Li, G.R.[Guang-Rui],
Yang, Y.[Yi],
VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild,
CVPR21(4131-4141)
IEEE DOI
2111
Image segmentation, Annotations,
Task analysis, Spatial resolution, Videos
BibRef
Ahmadyan, A.[Adel],
Zhang, L.[Liangkai],
Ablavatski, A.[Artsiom],
Wei, J.N.[Jia-Ning],
Grundmann, M.[Matthias],
Objectron: A Large Scale Dataset of Object-Centric Videos in the Wild
with Pose Annotations,
CVPR21(7818-7827)
IEEE DOI
2111
Measurement, Solid modeling,
Annotations, Shape, Object detection
BibRef
van Horn, G.[Grant],
Cole, E.[Elijah],
Beery, S.[Sara],
Wilber, K.[Kimberly],
Belongie, S.[Serge],
MacAodha, O.[Oisin],
Benchmarking Representation Learning for Natural World Image
Collections,
CVPR21(12879-12888)
IEEE DOI
2111
Training, Learning systems, Visualization, Wildlife,
Transfer learning, Benchmark testing, Feature extraction
BibRef
Wenzel, P.[Patrick],
Wang, R.[Rui],
Yang, N.[Nan],
Cheng, Q.[Qing],
Khan, Q.[Qadeer],
von Stumberg, L.[Lukas],
Zeller, N.[Niclas],
Cremers, D.[Daniel],
4Seasons: A Cross-Season Dataset for Multi-Weather SLAM in Autonomous
Driving,
GCPR20(404-417).
Springer DOI
2110
BibRef
Pan, Y.C.[Yan-Cheng],
Gao, B.[Biao],
Mei, J.L.[Ji-Lin],
Geng, S.[Sibo],
Li, C.K.[Cheng-Kun],
Zhao, H.J.[Hui-Jing],
SemanticPOSS:
A Point Cloud Dataset with Large Quantity of Dynamic Instances,
Conference Intelligent Vehicles, 2020.
WWW Link.
HTML Version.
BibRef
2000
Dung, H.A.[Hoang Anh],
Chen, B.[Bo],
Chin, T.J.[Tat-Jun],
A Spacecraft Dataset for Detection, Segmentation and Parts
Recognition,
AI4Space21(2012-2019)
IEEE DOI
2109
Space vehicles, Deep learning, Image segmentation,
Satellites, Service robots, Object detection
BibRef
Anderson, C.[Connor],
Teuscher, A.[Adam],
Anderson, E.[Elizabeth],
Larsen, A.[Alysia],
Shirley, J.[Josh],
Farrell, R.[Ryan],
Have Fun Storming the Castle(s)!,
WACV21(3702-3711)
IEEE DOI
WWW Link.
2106
Dataset, Castles. 2400 individual castles, palaces and fortresses from more than 90
countries; contains more than 770K images.
Visualization, Image recognition, Geology,
Computational modeling, Image retrieval
BibRef
Birhane, A.[Abeba],
Prabhu, V.U.[Vinay Uday],
Large image datasets: A pyrrhic win for computer vision?,
WACV21(1536-1546)
IEEE DOI
2106
Faces
BibRef
Figueiredo, A.[Augusto],
Brayan, J.[Johnata],
Reis, R.O.[Renan Oliveira],
Prates, R.[Raphael],
Schwartz, W.R.[William Robson],
MoRe: A Large-Scale Motorcycle Re-Identification Dataset,
WACV21(4033-4042)
IEEE DOI
WWW Link.
2106
Dataset, Vehicles. Training, Deep learning, Computational modeling,
Surveillance, Motorcycles, Traffic control
BibRef
Le, H.A.[Hoang-An],
Mensink, T.[Thomas],
Das, P.[Partha],
Karaoglu, S.[Sezer],
Gevers, T.[Theo],
EDEN: Multimodal Synthetic Dataset of Enclosed GarDEN Scenes,
WACV21(1578-1588)
IEEE DOI
WWW Link.
2106
Dataset, Outdoor Scenes. Deep learning, Image segmentation,
Image color analysis, Computational modeling, Semantics
BibRef
Kim, S.P.[Sang-Pil],
Chi, H.G.[Hyung-Gun],
Hu, X.[Xiao],
Huang, Q.X.[Qi-Xing],
Ramani, K.[Karthik],
A Large-scale Annotated Mechanical Components Benchmark for
Classification and Retrieval Tasks with Deep Neural Networks,
ECCV20(XVIII:175-191).
Springer DOI
2012
BibRef
Duan, J.,
Yu, S.,
Tan, H.L.,
Tan, C.,
Actionet: An Interactive End-To-End Platform For Task-Based Data
Collection And Augmentation In 3D Environment,
ICIP20(1566-1570)
IEEE DOI
2011
Collecting the data.
Task analysis, Videos,
Graphical user interfaces, Planning, Data collection, Robots,
3D environment
BibRef
Zhang, Y.,
Zhang, L.,
Hamidouche, W.,
Deforges, O.,
A Fixation-Based 360° Benchmark Dataset For Salient Object Detection,
ICIP20(3458-3462)
IEEE DOI
2011
Benchmark testing, Visualization,
Object detection, Head, Measurement, Training, VR,
benchmark
BibRef
Hsu, T.M.H.[Tzu-Ming Harry],
Qi, H.[Hang],
Brown, M.[Matthew],
Federated Visual Classification with Real-World Data Distribution,
ECCV20(X:76-92).
Springer DOI
2011
Species and landmark classification.
BibRef
Zheng, J.[Jia],
Zhang, J.F.[Jun-Fei],
Li, J.[Jing],
Tang, R.[Rui],
Gao, S.H.[Sheng-Hua],
Zhou, Z.H.[Zi-Han],
Structured3D:
A Large Photo-realistic Dataset for Structured 3D Modeling,
ECCV20(IX:519-535).
Springer DOI
2011
BibRef
Song, J.M.[Jia-Ming],
Dauphin, Y.[Yann],
Auli, M.[Michael],
Ma, T.Y.[Teng-Yu],
Robust and On-the-Fly Dataset Denoising for Image Classification,
ECCV20(XXIX: 556-572).
Springer DOI
2010
BibRef
Wang, X.,
Zhang, X.,
Zhu, Y.,
Guo, Y.,
Yuan, X.,
Xiang, L.,
Wang, Z.,
Ding, G.,
Brady, D.,
Dai, Q.,
Fang, L.,
PANDA: A Gigapixel-Level Human-Centric Video Dataset,
CVPR20(3265-3275)
IEEE DOI
2008
Task analysis, Spatial resolution, Trajectory, Cameras,
Benchmark testing, Visualization, Head
BibRef
Warburg, F.[Frederik],
Hauberg, S.[Søren],
López-Antequera, M.[Manuel],
Gargallo, P.[Pau],
Kuang, Y.[Yubin],
Civera, J.[Javier],
Mapillary Street-Level Sequences:
A Dataset for Lifelong Place Recognition,
CVPR20(2623-2632)
IEEE DOI
2008
Dataset, Mapillary mapping platform.
Urban areas, Cameras, Image recognition, Meteorology, Task analysis,
Image sequences, Benchmark testing
BibRef
Li, X.,
Wei, T.,
Chen, Y.P.,
Tai, Y.,
Tang, C.,
FSS-1000: A 1000-Class Dataset for Few-Shot Segmentation,
CVPR20(2866-2875)
IEEE DOI
2008
Image segmentation, Training, Animals, Task analysis, Semantics,
Tools
BibRef
Scheck, T.[Tobias],
Seidel, R.[Roman],
Hirtz, G.[Gangolf],
Learning from THEODORE: A Synthetic Omnidirectional Top-View Indoor
Dataset for Deep Transfer Learning,
WACV20(932-941)
IEEE DOI
2006
Dataset, Fisheye Images. Cameras, Image segmentation, Object detection,
Semantics, Solid modeling, Rendering (computer graphics)
BibRef
Chou, S.H.[Shih-Han],
Sun, C.[Cheng],
Chang, W.Y.[Wen-Yen],
Hsu, W.T.[Wan-Ting],
Sun, M.[Min],
Fu, J.L.[Jian-Long],
360-Indoor: Towards Learning Real-World Objects in 360° Indoor
Equirectangular Images,
WACV20(834-842)
IEEE DOI
2006
Object detection, Videos, Distortion, Automobiles,
Task analysis
BibRef
Behley, J.,
Garbade, M.,
Milioto, A.,
Quenzel, J.,
Behnke, S.,
Stachniss, C.,
Gall, J.,
SemanticKITTI:
A Dataset for Semantic Scene Understanding of LiDAR Sequences,
ICCV19(9296-9306)
IEEE DOI
2004
Dataset, LiDAR. distance measurement, image segmentation,
optical radar, stereo image processing, LiDAR sequences, Lasers
BibRef
Wang, X.,
Wu, J.,
Chen, J.,
Li, L.,
Wang, Y.,
Wang, W.Y.,
VaTeX: A Large-Scale, High-Quality Multilingual Dataset for
Video-and-Language Research,
ICCV19(4580-4590)
IEEE DOI
WWW Link.
2004
Dataset, VaTeX. language translation, linguistics, natural language processing,
video signal processing, unified multilingual model, Social network services
BibRef
Gu, S.,
Lugmayr, A.,
Danelljan, M.,
Fritsche, M.,
Lamour, J.,
Timofte, R.,
DIV8K: DIVerse 8K Resolution Image Dataset,
AIM19(3512-3516)
IEEE DOI
2004
Dataset, High Resolution. convolutional neural nets, image resolution,
learning (artificial intelligence), CNN, image processing
BibRef
Mauceri, C.[Cecilia],
Palmer, M.[Martha],
Heckman, C.[Christoffer],
SUN-Spot: An RGB-D Dataset With Spatial Referring Expressions,
CLVL19(1883-1886)
IEEE DOI
2004
Dataset, Recognition. image colour analysis, object detection, SLAM (robots),
spatial referring expressions, SUN-Spot, objects localization,
multimodal
BibRef
Sølund, T.[Thomas],
Buch, A.G.[Anders Glent],
Krüger, N.[Norbert],
Aanæs, H.[Henrik],
A Large-Scale 3D Object Recognition Dataset,
3DV16(73-82)
IEEE DOI
1701
Dataset, Object Recognition.
WWW Link. object recognition
BibRef
Hua, B.S.[Binh-Son],
Pham, Q.H.[Quang-Hieu],
Nguyen, D.T.[Duc Thanh],
Tran, M.K.[Minh-Khoi],
Yu, L.F.[Lap-Fai],
Yeung, S.K.[Sai-Kit],
SceneNN: A Scene Meshes Dataset with aNNotations,
3DV16(92-101)
IEEE DOI
1701
Dataset, RGB-D.
WWW Link. Cameras
BibRef
Rotman, D.[Daniel],
Gilboa, G.[Guy],
A Depth Restoration Occlusionless Temporal Dataset,
3DV16(176-184)
IEEE DOI
1701
Dataset, RGB-D.
BibRef
Zhang, J.J.[Jun-Jie],
Zhang, J.[Jian],
Lu, J.F.[Jian-Feng],
Shen, C.H.[Chun-Hua],
Curr, K.[Kate],
Phua, R.[Robin],
Neville, R.[Richard],
Edmonds, E.[Elise],
SLNSW-UTS:
A Historical Image Dataset for Image Multi-Labeling and Retrieval,
DICTA16(1-6)
IEEE DOI
1701
Dataset, Object Recognition. 29713 images, 119 labels.
BibRef
Xiang, Y.[Yu],
Kim, W.[Wonhui],
Chen, W.[Wei],
Ji, J.W.[Jing-Wei],
Choy, C.[Christopher],
Su, H.[Hao],
Mottaghi, R.[Roozbeh],
Guibas, L.J.[Leonidas J.],
Savarese, S.[Silvio],
ObjectNet3D: A Large Scale Database for 3D Object Recognition,
ECCV16(VIII: 160-176).
Springer DOI
1611
Dataset, Object Recognition.
WWW Link.
BibRef
Lin, T.Y.[Tsung-Yi],
Maire, M.[Michael],
Belongie, S.J.[Serge J.],
Hays, J.[James],
Perona, P.[Pietro],
Ramanan, D.[Deva],
Dollár, P.[Piotr],
Zitnick, C.L.[C. Lawrence],
Microsoft COCO: Common Objects in Context,
ECCV14(V: 740-755).
Springer DOI
1408
Dataset, Objects.
WWW Link.
BibRef
Flickr30k Dataset,
From image descriptions to visual denotations.
WWW Link.
Dataset, Visual Question Answering. Extension of Flickr 8k dataset.
Plummer, B.A.[Bryan A.],
Wang, L.W.[Li-Wei],
Cervantes, C.M.[Chris M.],
Caicedo, J.C.[Juan C.],
Hockenmaier, J.[Julia],
Lazebnik, S.[Svetlana],
Flickr30k Entities: Collecting Region-to-Phrase Correspondences for
Richer Image-to-Sentence Models,
IJCV(123), No. 1, May 2017, pp. 74-93.
Springer DOI
1705
BibRef
Earlier:
ICCV15(2641-2649)
IEEE DOI
1602
Dataset, Object Recognition. Benchmark testing
BibRef
Fanello, S.R.[Sean Ryan],
Ciliberto, C.[Carlo],
Santoro, M.[Matteo],
Natale, L.[Lorenzo],
Metta, G.[Giorgio],
Rosasco, L.[Lorenzo],
Odone, F.[Francesca],
iCub World: Friendly Robots Help Building Good Vision Data-Sets,
GT13(700-705)
IEEE DOI
1309
Dataset, Object Recognition. Human Robot Interaction; Object Categorization Dataset; iCub
BibRef
Ponomarenko, N.[Nikolay],
Ieremeiev, O.[Oleg],
Lukin, V.[Vladimir],
Jin, L.[Lina],
Egiazarian, K.O.[Karen O.],
A New Color Image Database TID2013: Innovations and Results,
ACIVS13(402-413).
Springer DOI
1311
Dataset, Color Images.
BibRef
Ponce, J.,
Berg, T.L.,
Everingham, M.R.,
Forsyth, D.A.,
Hebert, M.,
Lazebnik, S.[Svetlana],
Marszalek, M.,
Schmid, C.,
Russell, B.C.,
Torralba, A.,
Williams, C.K.I.,
Zhang, J.,
Zisserman, A.,
Dataset Issues in Object Recognition,
CLOR06(29-48).
Springer DOI
0711
Dataset, Discussion.
BibRef
Campbell, R., and
Flynn, P.J.,
A WWW-Accessible 3D Image and Model Database for
Computer Vision Research,
EEMCV98(148-154).
BibRef
9800
And:
EEMTV98(xx)
Dataset, 3-D Data.
HTML Version.
BibRef
Nene, S.A.,
Nayar, S.K.[Shree K.],
Murase, H.[Hiroshi],
Columbia Object Image Library (COIL-100),
Columbia Technical Report CUCS-006-96, February 1996.
PS File. Also:
WWW Link. Also the COIL-20 database.
WWW Link.
Dataset, Objects.
BibRef
9602