13.4.6 Object Recognition, Retrieval Datasets

Chapter Contents (Back)
Evaluation, Recognition. Dataset, Objects. Dataset, Retrieval.
See also Visual Question Answering, Query, VQA, Visual Dialog.

The PASCAL Object Recognition Database Collection,
2006. Dataset, Objects.
HTML Version. Various datasets for object recognition. Pointers to some of the others.

MSR VTT Dataset,
A Large Video Description Dataset for Bridging Video and Language. WWW Link.
Dataset, Visual Question Answering.
See also MSR-VTT: A Large Video Description Dataset for Bridging Video and Language.

Video Objects: A Test Database for Video Object Recognition,
2006. Dataset, Objects.
HTML Version. 180 videos of 15 objects.

Animals with Attributes: A dataset for Attribute Based Classification,
2006. Dataset, Objects.
WWW Link. 30,000+ images, 40 animal classes.

Image Net, ImageNet Dataset,
2014.
WWW Link. Dataset, Objects. Large set of images (or sets of datasets) for recognition. Related to ImageNet Challanges for recognition. 14Million+ images. Links to Stanford
See also Stanford University, Computer Science Departent. and Princeton.
See also Princeton.

Washington Ground Truth Image Database,
CBIR dataset. Online2004
WWW Link. Dataset. Dataset, Retrieval. BibRef 0400

LHI Object Datasets,
Includes hand segmentations, and annotations. Online2004
HTML Version. Dataset. Dataset, Object Recognition. Transportation images, Animals, Aerial Images, Objects, Dataset also includes other data.
See also Lotus Hill Institute. BibRef 0400

NEC Animal Dataset,
Online2009
WWW Link. Dataset. Dataset, Object Recognition. It consists of about 5000 high quality images from 60 toy animals taken at different poses against a plain background. BibRef 0900

Xcavator.Net,
Online2007
WWW Link. Dataset, Object Recognition. Photo search for professional use. Searches stock databases, you then purchase the image for use. Part of CogniSign LLC. BibRef 0700

The ETH-80 Dataset,
2017 Dataset, Objects.
WWW Link. The ETH-80 dataset contains visual object images from 8 different categories including apples, cars, cows,cups, dogs, horses, pears and tomatoes.
See also Covariance descriptors on a Gaussian manifold and their application to image set classification.
See also Swiss Federal Institute of Technology in Zurich.

15 Scene Dataset,
Dataset, Objects.
HTML Version. The 15 scene categories are office, kitchen, living room, bedroom, store, industrial, tall building, inside cite, street, highway, coast, open country, mountain, forest, and suburb. Images in the dataset are about 250*300 resolution, with 210 to 410 images per class.

Video Dataset Overview,
2021
WWW Link. Dataset, Overview. A good collection of Video datasets for various uses, activity, instruction, sports, etc..

Multi-Weather 4Seasons Dataset,
2021 Dataset, Driving.
WWW Link.

Vasiljevic, I., Kolkin, N., Zhang, S., Luo, R., Wang, H.,
DIODE: A Dense Indoor and Outdoor Depth Dataset,
2019 Dataset, Object Extraction.
WWW Link. BibRef

Blanco, J.L.[Jose-Luis], Moreno, F.A.[Francisco-Angel], Gonzalez, J.[Javier],
A collection of outdoor robotic datasets with centimeter-accuracy ground truth,
AutRob(27), No. 4, 2009, pp. 327.
Springer DOI
WWW Link. Dataset, SLAM. Malaga Parking BibRef 0900

Geusebroek, J.M.[Jan-Mark], Burghouts, G.J.[Gertjan J.], Smeulders, A.W.M.[Arnold W.M.],
The Amsterdam Library of Object Images,
IJCV(61), No. 1, January 2005, pp. 103-112.
DOI Link 0410

WWW Link. Dataset, Objects. 1000 objects over 100 images per object. BibRef

Torralba, A.B.[Antonio B.], Fergus, R.[Rob], Freeman, W.T.[William T.],
80 Million Tiny Images: A Large Data Set for Nonparametric Object and Scene Recognition,
PAMI(30), No. 11, November 2008, pp. 1958-1970.
IEEE DOI
WWW Link. 0809
BibRef
And: CSAIL-TR-2007-024, 2007. Dataset, Retrieval. Images from the WWW, associated with a noun. Large comprehensive dataset. Dataset with segmentations. BibRef

Gong, Y.C.[Yun-Chao], Pawlowski, M.[Marcin], Yang, F.[Fei], Brandy, L.[Louis], Boundev, L.[Lubomir], Fergus, R.[Rob],
Web scale photo hash clustering on a single machine,
CVPR15(19-27)
IEEE DOI 1510
BibRef

Russell, B.[Bryan], Torralba, A.B.[Antonio B.], Freeman, W.T.[William T.],
LableMe: The Open Annotation Tool,
Online2010.
WWW Link. 1108
Dataset, Retrieval. Code, Annotation. The site for the annotation tool, also the video version. BibRef

Zhou, B.[Bolei], Lapedriza, A.[Agata], Khosla, A.[Aditya], Oliva, A.[Aude], Torralba, A.B.[Antonio B.],
Places: A 10 Million Image Database for Scene Recognition,
PAMI(40), No. 6, June 2018, pp. 1452-1464.
IEEE DOI 1805
Dataset, Retrieval. Context, Databases, Image recognition, Semantics, Sun, Training, Visualization, Scene classification, deep feature, deep learning, visual recognition BibRef

Escalante, H.J.[Hugo Jair], Hernandez, C.A.[Carlos A.], Gonzalez, J.A.[Jesus A.], Lopez-Lopez, A., Montes-y-Gomez, M.[Manuel], Morales, E.F.[Eduardo F.], Sucar, L.E.[L. Enrique], Villasenor, L.[Luis], Grubinger, M.[Michael],
The segmented and annotated IAPR TC-12 benchmark,
CVIU(114), No. 4, April 2010, pp. 419-428.
Elsevier DOI 1003
Dataset, Retrieval. Data set creation; Ground truth collection; Evaluation metrics; Automatic image annotation; Image retrieval BibRef

Russakovsky, O.[Olga], Deng, J.[Jia], Su, H.[Hao], Krause, J.[Jonathan], Satheesh, S.[Sanjeev], Ma, S.[Sean], Huang, Z.H.[Zhi-Heng], Karpathy, A.[Andrej], Khosla, A.[Aditya], Bernstein, M.[Michael], Berg, A.C.[Alexander C.], Fei-Fei, L.[Li],
ImageNet Large Scale Visual Recognition Challenge,
IJCV(115), No. 3, December 2015, pp. 211-252.
Springer DOI 1512
Dataset, Object Category. Object category classification and detection on hundreds of object categories and millions of images. BibRef

Loh, Y.P.[Yuen Peng], Chan, C.S.[Chee Seng],
Getting to know low-light images with the Exclusively Dark dataset,
CVIU(178), 2019, pp. 30-42.
Elsevier DOI 1812
Dataset, Low Light. BibRef

Rosu, R.A.[Radu Alexandru], Quenzel, J.[Jan], Behnke, S.[Sven],
Semi-supervised Semantic Mapping Through Label Propagation with Semantic Texture Meshes,
IJCV(128), No. 5, May 2020, pp. 1220-1238.
Springer DOI 2005
BibRef

Aizawa, K., Fujimoto, A., Otsubo, A., Ogawa, T., Matsui, Y., Tsubota, K., Ikuta, H.,
Building a Manga Dataset 'Manga109' With Annotations for Multimedia Applications,
MultMedMag(27), No. 2, April 2020, pp. 8-18.
IEEE DOI 2006
Dataset, Manga. Machine learning, Visualization, Character recognition, Art, Machine learning algorithms, Task analysis BibRef

Kuznetsova, A.[Alina], Rom, H.[Hassan], Alldrin, N.[Neil], Uijlings, J.[Jasper], Krasin, I.[Ivan], Pont-Tuset, J.[Jordi], Kamali, S.[Shahab], Popov, S.[Stefan], Malloci, M.[Matteo], Kolesnikov, A.[Alexander], Duerig, T.[Tom], Ferrari, V.[Vittorio],
The Open Images Dataset V4,
IJCV(128), No. 7, July 2020, pp. 1956-1981.
Springer DOI 2007
Dataset, Object Detection. 9.2M images with unified annotations.
HTML Version. BibRef

Maugey, T., Toni, L.,
Large Database Compression Based on Perceived Information,
SPLetters(27), 2020, pp. 1735-1739.
IEEE DOI 2010
Covariance matrices, Compression algorithms, Databases, Measurement, Signal processing algorithms, Image coding, Entropy, sampling BibRef

He, Y.[Yue], Shen, Z.Y.[Zhe-Yan], Cui, P.[Peng],
Towards Non-I.I.D. image classification: A dataset and baselines,
PR(110), 2021, pp. 107383.
Elsevier DOI 2011
Non-I.I.D, Dataset, Context, Bias, ConvNet, Batch balancing BibRef

Pang, Y., Cao, J., Li, Y., Xie, J., Sun, H., Gong, J.,
TJU-DHD: A Diverse High-Resolution Dataset for Object Detection,
IP(30), 2021, pp. 207-219.
IEEE DOI 2011
Object detection, Feature extraction, Image resolution, Face recognition, Proposals, Training, Face detection, Dataset, large scale BibRef

Xu, X.W.[Xiao-Wei], Zhang, X.Y.[Xin-Yi], Yu, B.[Bei], Hu, X.B.S.[Xiao-Bo Sharon], Rowen, C.[Christopher], Hu, J.T.[Jing-Tong], Shi, Y.Y.[Yi-Yu],
DAC-SDC Low Power Object Detection Challenge for UAV Applications,
PAMI(43), No. 2, February 2021, pp. 392-403.
IEEE DOI 2101
More for detection, but generally a dataset, evaluation paper. This paper presents in detail the dataset and evaluation procedure. It further discusses the methods developed by some of the entries as well as representative results. Object detection, Graphics processing units, Field programmable gate arrays, Task analysis, low power BibRef

Thakur, S.[Sanchari], Bruzzone, L.[Lorenzo],
An Approach to the Generation and Analysis of Databases of Simulated Radar Sounder Data for Performance Prediction and Target Interpretation,
GeoRS(59), No. 10, October 2021, pp. 8269-8287.
IEEE DOI 2109
Radar, Databases, Moon, Computational modeling, Solid modeling, Instruments, Clutter, Feature analysis, geoelectrical modeling, similarity measure BibRef

SynthCity: A Large-Scale Synthetic Point Cloud,
2019.
WWW Link. Dataset, Point Clouds. Synthetic point clouds and RGB data from a detailed city model.

WHU Datasets,
2020.
WWW Link. Dataset, Buildings. Several datasets.
See also Whuan University.


Wenzel, P.[Patrick], Wang, R.[Rui], Yang, N.[Nan], Cheng, Q.[Qing], Khan, Q.[Qadeer], von Stumberg, L.[Lukas], Zeller, N.[Niclas], Cremers, D.[Daniel],
4Seasons: A Cross-Season Dataset for Multi-Weather SLAM in Autonomous Driving,
GCPR20(404-417).
Springer DOI 2110
BibRef

Pan, Y.C.[Yan-Cheng], Gao, B.[Biao], Mei, J.L.[Ji-Lin], Geng, S.[Sibo], Li, C.K.[Cheng-Kun], Zhao, H.J.[Hui-Jing],
SemanticPOSS: A Point Cloud Dataset with Large Quantity of Dynamic Instances,
ConferenceIntelligent Vehicles, 2020.
WWW Link.
HTML Version. BibRef 2000

Dung, H.A.[Hoang Anh], Chen, B.[Bo], Chin, T.J.[Tat-Jun],
A Spacecraft Dataset for Detection, Segmentation and Parts Recognition,
AI4Space21(2012-2019)
IEEE DOI 2109
Space vehicles, Deep learning, Image segmentation, Satellites, Service robots, Object detection BibRef

Anderson, C.[Connor], Teuscher, A.[Adam], Anderson, E.[Elizabeth], Larsen, A.[Alysia], Shirley, J.[Josh], Farrell, R.[Ryan],
Have Fun Storming the Castle(s)!,
WACV21(3702-3711)
IEEE DOI
WWW Link. 2106
Dataset, Castles. 2400 individual castles, palaces and fortresses from more than 90 countries, contains more than 770K images. Visualization, Image recognition, Geology, Computational modeling, Image retrieval BibRef

Birhane, A.[Abeba], Prabhu, V.U.[Vinay Uday],
Large image datasets: A pyrrhic win for computer vision?,
WACV21(1536-1546)
IEEE DOI 2106
Faces BibRef

Figueiredo, A.[Augusto], Brayan, J.[Johnata], Reis, R.O.[Renan Oliveira], Prates, R.[Raphael], Schwartz, W.R.[William Robson],
MoRe: A Large-Scale Motorcycle Re-Identification Dataset,
WACV21(4033-4042)
IEEE DOI
WWW Link. 2106
Dataset, Vehicles. Training, Deep learning, Computational modeling, Surveillance, Motorcycles, Traffic control BibRef

Le, H.A.[Hoang-An], Mensink, T.[Thomas], Das, P.[Partha], Karaoglu, S.[Sezer], Gevers, T.[Theo],
EDEN: Multimodal Synthetic Dataset of Enclosed GarDEN Scenes,
WACV21(1578-1588)
IEEE DOI
WWW Link. 2106
Dataset, Outdoor Scenes. Deep learning, Image segmentation, Image color analysis, Computational modeling, Semantics BibRef

Kim, S.P.[Sang-Pil], Chi, H.G.[Hyung-Gun], Hu, X.[Xiao], Huang, Q.X.[Qi-Xing], Ramani, K.[Karthik],
A Large-scale Annotated Mechanical Components Benchmark for Classification and Retrieval Tasks with Deep Neural Networks,
ECCV20(XVIII:175-191).
Springer DOI 2012
BibRef

Duan, J., Yu, S., Tan, H.L., Tan, C.,
Actionet: An Interactive End-To-End Platform For Task-Based Data Collection And Augmentation In 3D Environment,
ICIP20(1566-1570)
IEEE DOI 2011
Collecting the data. Task analysis, Videos, Graphical user interfaces, Planning, Data collection, Robots, 3D environment BibRef

Zhang, Y., Zhang, L., Hamidouche, W., Deforges, O.,
A Fixation-Based 360° Benchmark Dataset For Salient Object Detection,
ICIP20(3458-3462)
IEEE DOI 2011
Benchmark testing, Visualization, Object detection, Head, Measurement, Training, VR, benchmark BibRef

Hsu, T.M.H.[Tzu-Ming Harry], Qi, H.[Hang], Brown, M.[Matthew],
Federated Visual Classification with Real-World Data Distribution,
ECCV20(X:76-92).
Springer DOI 2011
Species and landmark classification. BibRef

Zheng, J.[Jia], Zhang, J.F.[Jun-Fei], Li, J.[Jing], Tang, R.[Rui], Gao, S.H.[Sheng-Hua], Zhou, Z.[Zihan],
Structured3D: A Large Photo-realistic Dataset for Structured 3d Modeling,
ECCV20(IX:519-535).
Springer DOI 2011
BibRef

Song, J.M.[Jia-Ming], Dauphin, Y.[Yann], Auli, M.[Michael], Ma, T.Y.[Teng-Yu],
Robust and On-the-Fly Dataset Denoising for Image Classification,
ECCV20(XXIX: 556-572).
Springer DOI 2010
BibRef

Wang, X., Zhang, X., Zhu, Y., Guo, Y., Yuan, X., Xiang, L., Wang, Z., Ding, G., Brady, D., Dai, Q., Fang, L.,
PANDA: A Gigapixel-Level Human-Centric Video Dataset,
CVPR20(3265-3275)
IEEE DOI 2008
Task analysis, Spatial resolution, Trajectory, Cameras, Benchmark testing, Visualization, Head BibRef

Warburg, F.[Frederik], Hauberg, S.[Sųren], López-Antequera, M.[Manuel], Gargallo, P.[Pau], Kuang, Y.[Yubin], Civera, J.[Javier],
Mapillary Street-Level Sequences: A Dataset for Lifelong Place Recognition,
CVPR20(2623-2632)
IEEE DOI 2008
Dataset, Mapillary mapping platform. Urban areas, Cameras, Image recognition, Meteorology, Task analysis, Image sequences, Benchmark testing BibRef

Li, X., Wei, T., Chen, Y.P., Tai, Y., Tang, C.,
FSS-1000: A 1000-Class Dataset for Few-Shot Segmentation,
CVPR20(2866-2875)
IEEE DOI 2008
Image segmentation, Training, Animals, Task analysis, Semantics, Computer vision, Tools BibRef

Scheck, T.[Tobias], Seidel, R.[Roman], Hirtz, G.[Gangolf],
Learning from THEODORE: A Synthetic Omnidirectional Top-View Indoor Dataset for Deep Transfer Learning,
WACV20(932-941)
IEEE DOI 2006
Dataset, Fisheye Images. Cameras, Image segmentation, Object detection, Semantics, Solid modeling, Rendering (computer graphics) BibRef

Chou, S.H.[Shih-Han], Sun, C.[Cheng], Chang, W.Y.[Wen-Yen], Hsu, W.T.[Wan-Ting], Sun, M.[Min], Fu, J.L.[Jian-Long],
360-Indoor: Towards Learning Real-World Objects in 360° Indoor Equirectangular Images,
WACV20(834-842)
IEEE DOI 2006
Object detection, Videos, Distortion, Automobiles, Computer vision, Task analysis BibRef

Behley, J., Garbade, M., Milioto, A., Quenzel, J., Behnke, S., Stachniss, C., Gall, J.,
SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences,
ICCV19(9296-9306)
IEEE DOI 2004
Dataset, LiDAR. distance measurement, image segmentation, optical radar, stereo image processing, LiDAR sequences, Lasers BibRef

Wang, X., Wu, J., Chen, J., Li, L., Wang, Y., Wang, W.Y.,
VaTeX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research,
ICCV19(4580-4590)
IEEE DOI
WWW Link. 2004
Dataset, . language translation, linguistics, natural language processing, video signal processing, unified multilingual model, Social network services BibRef

Gu, S., Lugmayr, A., Danelljan, M., Fritsche, M., Lamour, J., Timofte, R.,
DIV8K: DIVerse 8K Resolution Image Dataset,
AIM19(3512-3516)
IEEE DOI 2004
Dataset, High Resolution. convolutional neural nets, image resolution, learning (artificial intelligence), CNN, image processing BibRef

Mauceri, C.[Cecilia], Palmer, M.[Martha], Heckman, C.[Christoffer],
SUN-Spot: An RGB-D Dataset With Spatial Referring Expressions,
CLVL19(1883-1886)
IEEE DOI 2004
Dataset, Recognition. image colour analysis, object detection, SLAM (robots), spatial referring expressions, SUN-Spot, objects localization, multimodal BibRef

Sųlund, T.[Thomas], Buch, A.G.[Anders Glent], Krüger, N.[Norbert], Aanęs, H.[Henrik],
A Large-Scale 3D Object Recognition Dataset,
3DV16(73-82)
IEEE DOI 1701
Dataset, Object Recognition.
WWW Link. object recognition BibRef

Hua, B.S.[Binh-Son], Pham, Q.H.[Quang-Hieu], Nguyen, D.T.[Duc Thanh], Tran, M.K.[Minh-Khoi], Yu, L.F.[Lap-Fai], Yeung, S.K.[Sai-Kit],
SceneNN: A Scene Meshes Dataset with aNNotations,
3DV16(92-101)
IEEE DOI 1701
Dataset, RGB-D.
WWW Link. Cameras BibRef

Rotman, D.[Daniel], Gilboa, G.[Guy],
A Depth Restoration Occlusionless Temporal Dataset,
3DV16(176-184)
IEEE DOI 1701
Dataset, RGB-D. BibRef

Zhang, J.J.[Jun-Jie], Zhang, J.[Jian], Lu, J.F.[Jian-Feng], Shen, C.H.[Chun-Hua], Curr, K.[Kate], Phua, R.[Robin], Neville, R.[Richard], Edmonds, E.[Elise],
SLNSW-UTS: A Historical Image Dataset for Image Multi-Labeling and Retrieval,
DICTA16(1-6)
IEEE DOI 1701
Dataset, Object Recognition. 29713 images, 119 labels. BibRef

Xiang, Y.[Yu], Kim, W.[Wonhui], Chen, W.[Wei], Ji, J.W.[Jing-Wei], Choy, C.[Christopher], Su, H.[Hao], Mottaghi, R.[Roozbeh], Guibas, L.J.[Leonidas J.], Savarese, S.[Silvio],
ObjectNet3D: A Large Scale Database for 3D Object Recognition,
ECCV16(VIII: 160-176).
Springer DOI 1611
Dataset, Object Recognition.
WWW Link. BibRef

Lin, T.Y.[Tsung-Yi], Maire, M.[Michael], Belongie, S.J.[Serge J.], Hays, J.[James], Perona, P.[Pietro], Ramanan, D.[Deva], Dollįr, P.[Piotr], Zitnick, C.L.[C. Lawrence],
Microsoft COCO: Common Objects in Context,
ECCV14(V: 740-755).
Springer DOI 1408
Dataset, Objects.
WWW Link. BibRef

Flickr30k Dataset,
From image descriptions to visual denotations. WWW Link.
Dataset, Visual Question Answering. Extension of Flickr 8k dataset.

Plummer, B.A.[Bryan A.], Wang, L.[Liwei], Cervantes, C.M.[Chris M.], Caicedo, J.C.[Juan C.], Hockenmaier, J.[Julia], Lazebnik, S.[Svetlana],
Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models,
IJCV(123), No. 1, May 2017, pp. 74-93.
Springer DOI 1705
BibRef
Earlier: ICCV15(2641-2649)
IEEE DOI 1602
Dataset, Object Recognition. Benchmark testing BibRef

Fanello, S.R.[Sean Ryan], Ciliberto, C.[Carlo], Santoro, M.[Matteo], Natale, L.[Lorenzo], Metta, G.[Giorgio], Rosasco, L.[Lorenzo], Odone, F.[Francesca],
iCub World: Friendly Robots Help Building Good Vision Data-Sets,
GT13(700-705)
IEEE DOI 1309
Dataset, Object Recognition. Human Robot Interaction; Object Categorization Dataset; iCub BibRef

Ponomarenko, N.[Nikolay], Ieremeiev, O.[Oleg], Lukin, V.[Vladimir], Jin, L.[Lina], Egiazarian, K.O.[Karen O.],
A New Color Image Database TID2013: Innovations and Results,
ACIVS13(402-413).
Springer DOI 1311
Dataset, Color Images. BibRef

Ponce, J., Berg, T.L., Everingham, M.R., Forsyth, D.A., Hebert, M., Lazebnik, S.[Svetlana], Marszalek, M., Schmid, C., Russell, B.C., Torralba, A., Williams, C.K.I., Zhang, J., Zisserman, A.,
Dataset Issues in Object Recognition,
CLOR06(29-48).
Springer DOI 0711
Dataset, Discussion. BibRef

Campbell, R., and Flynn, P.J.,
A WWW-Accessible 3D Image and Model Database for Computer Vision Research,
EEMCV98(148-154). BibRef 9800
And: EEMTV98(xx) Dataset, 3-D Data.
HTML Version. BibRef

Nene, S.A., Nayar, S.K.[Shree K.], Murase, H.[Hiroshi],
Columbia Object Image Library (COIL-100),
ColumbiaTechnical Report CUCS-006-96, February 1996.
PS File. Also:
WWW Link. Also the COIL-20 database.
WWW Link. Dataset, Objects. BibRef 9602

Chapter on Matching and Recognition Using Volumes, High Level Vision Techniques, Invariants continues in
General Spatial Reasoning and Geometric Reasoning Issues, Visual Relations .


Last update:Oct 20, 2021 at 09:45:26