PhysicalAI-SmartSpaces¶

Back to AI datasets

Field	Value
Description	Comprehensive, annotated dataset for multi-camera tracking and 2D/3D object detection. This dataset is synthetically generated with Omniverse. This dataset consists of over 250 hours of video from across nearly 1,500 cameras from indoor scenes in warehouses, hospitals, retail, and more. The dataset is time synchronized for tracking humans across multiple cameras using feature representation and no personal data. Dataset Description Dataset Owner(s) NVIDIA Dataset Creation Date We started to create this dataset in December, 2023. First version was completed and released as part of 8th AI City Challenge in conjunction with CVPR 2024. Dataset Characterization Data Collection Method: Synthetic Labeling Method: Automatic with IsaacSim Video Format Video Standard: MP4 (H.264) Video Resolution: 1080p Video Frame rate: 30 FPS
Folder	`/datasets/ai/huggingface/nvidia/PhysicalAI-SmartSpaces`
Discipline	AI / Computer Vision / PhysicalAI
DOI	10.48550/arXiv.2412.00692
Link	Access Data
Public	`True`
Publication Date	2024
Downloaded	2025-11-09
Data Type	LMDB, SquashFS Extracted MP4 files on Ceph
Dataset Size	6.7M (extracted)
Number of Files	3192 (extracted)
Usage	$ module avail $ module load datasets $ module load ai/huggingface/nvidia/PhysicalAI-SmartSpaces/2024
Usage Policy Link	https://choosealicense.com/licenses/cc-by-4.0/
Usage Policy
Citation	Tang, Z., Wang, S., Anastasiu, D. C., Chang, M.-C., Sharma, A., Kong, Q., Kobori, N., Gochoo, M., Batnasan, G., Otgonbold, M.-E., Alnajjar, F., Hsieh, J.-W., Kornuta, T., Li, X., Zhao, Y., Zhang, H., Radhakrishnan, S., Jain, A., Kumar, R., Murali, V. N., Wang, Y., Pusegaonkar, S. S., Wang, Y., Biswas, S., Wu, X., Zheng, Z., Chakraborty, P., & Chellappa, R. (2025). The 9th AI City Challenge. In Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops (ICCVW) (pp. 5467–5476). Honolulu, HI, USA. Wang, S., Anastasiu, D. C., Tang, Z., Chang, M.-C., Yao, Y., Zheng, L., Rahman, M. S., Arya, M. S., Sharma, A., Chakraborty, P., Prajapati, S., Kong, Q., Kobori, N., Gochoo, M., Otgonbold, M.-E., Batnasan, G., Alnajjar, F., Chen, P.-Y., Hsieh, J.-W., Wu, X., Pusegaonkar, S. S., Wang, Y., Biswas, S., & Chellappa, R. (2024). The 8th AI City Challenge. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) (pp. 7261–7272). Seattle, WA, USA. Wang, Y., Meinhardt, T., Cetintas, O., Yang, C.-Y., Pusegaonkar, S. S., Missaoui, B., Biswas, S., Tang, Z., & Leal-Taixé, L. (2024). MCBLT: Multi-camera multi-object 3D tracking in long videos. arXiv preprint arXiv:2412.00692.
BibTeX	📜 View BibTeX citation @InProceedings{Tang25AICity25, author = {Zheng Tang and Shuo Wang and David C. Anastasiu and Ming-Ching Chang and Anuj Sharma and Quan Kong and Norimasa Kobori and Munkhjargal Gochoo and Ganzorig Batnasan and Munkh-Erdene Otgonbold and Fady Alnajjar and Jun-Wei Hsieh and Tomasz Kornuta and Xiaolong Li and Yilin Zhao and Han Zhang and Subhashree Radhakrishnan and Arihant Jain and Ratnesh Kumar and Vidya N. Murali and Yuxing Wang and Sameer Satish Pusegaonkar and Yizhou Wang and Sujit Biswas and Xunlei Wu and Zhedong Zheng and Pranamesh Chakraborty and Rama Chellappa}, title = {The 9th AI City Challenge}, booktitle = {Proc. ICCV Workshops}, pages = {5467--5476}, address = {Honolulu, HI, USA}, year = {2025} } @inproceedings{Wang24AICity24, author = {Shuo Wang and David C. Anastasiu and Zheng Tang and Ming-Ching Chang and Yue Yao and Liang Zheng and Mohammed Shaiqur Rahman and Meenakshi S. Arya and Anuj Sharma and Pranamesh Chakraborty and Sanjita Prajapati and Quan Kong and Norimasa Kobori and Munkhjargal Gochoo and Munkh-Erdene Otgonbold and Ganzorig Batnasan and Fady Alnajjar and Ping-Yang Chen and Jun-Wei Hsieh and Xunlei Wu and Sameer Satish Pusegaonkar and Yizhou Wang and Sujit Biswas and Rama Chellappa}, title = {The 8th AI City Challenge}, booktitle = {Proc. CVPR Workshops}, pages = {7261--7272}, address = {Seattle, WA, USA}, year = {2024} } @misc{Wang24MCBLT, author = {Yizhou Wang and Tim Meinhardt and Orcun Cetintas and Cheng-Yen Yang and Sameer Satish Pusegaonkar and Benjamin Missaoui and Sujit Biswas and Zheng Tang and Laura Leal-Taix{\'e}}, title = {MCBLT: Multi-Camera Multi-Object 3D Tracking in Long Videos}, note = {arXiv:2412.00692}, year = {2024} }