yfcc100m_dataset_short holds the first 300K objects in yfcc100m_dataset. It is used to test. To process the whole dataset, do a manual download of the compressed file.