Skip to content

COCO

Back to AI datasets

Field Value
Description COCO is a large-scale object detection, segmentation, and captioning dataset. COCO has several features:
Object segmentation
Recognition in context
Superpixel stuff segmentation
330K images (>200K labeled)
1.5 million object instances
80 object categories
91 stuff categories
5 captions per image
250,000 people with keypoints

This dataset includes train/
Folder /datasets/ai/coco
Discipline AI / Object Segmentation / Object Detection
DOI 10.48550/arXiv.1405.0312
Link Access Data
Public True
Publication Date 2017
Downloaded 2025-11-09
Data Type LMDB, SquashFS
Extracted files on Ceph
Dataset Size 30G (extracted)
Number of Files 410544 (extracted)
Usage
$ module avail
$ module load datasets
$ module load ai/coco/2017
Usage Policy Link https://creativecommons.org/licenses/by/4.0/legalcode
Usage Policy
Citation Lin, T.-Y., Maire, M., Belongie, S., Bourdev, L., Girshick, R., Hays, J., Perona, P., Ramanan, D., Zitnick, C. L., & Dollár, P. (2015). Microsoft COCO: Common objects in context. arXiv preprint arXiv:1405.0312. https://arxiv.org/abs/1405.0312
BibTeX
📜 View BibTeX citation
@misc{lin2015microsoftcococommonobjects,
title={Microsoft COCO: Common Objects in Context},
author={Tsung-Yi Lin and Michael Maire and Serge Belongie and Lubomir Bourdev and Ross Girshick and James Hays and Pietro Perona and Deva Ramanan and C. Lawrence Zitnick and Piotr Dollár},
year={2015},
eprint={1405.0312},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/1405.0312},
}