Musk (Version 1) Dataset (Free)

jsaiyyp 17 2021-08-31 Machine Learning

Resource Description

Shan-Hung Wu
Associate Professor, CS, National Tsing Hua University (NTHU)
Email: shwu [AT] cs.nthu.edu.tw

Data Set Information:

This dataset was collected by Shan-Hung Wu and DataLab members at National Tsing Hua University, Taiwan. It contains 180 images randomly sampled from the NUS-WIDE image database. Each image is represented by 500 features: a bag of words built from SIFT descriptors. A series of experiments on the Amazon Mechanical Turk platform produced 325 user-perceived clusters from 100 users, together with the users' descriptions of those clusters.

Dataset spec 1:

- #Image: 180
- #Cluster: 325 (may be created by different users)
- #User: 100
- |Vocabulary of supervision|: 108

- cluster_data.csv : 325 clusters x 180 images
- 'cluster_data.csv' is an indicator matrix. M_(i,j) = 1 if image_j belongs to cluster_i. Note: Clusters may be created by different users.

- cluster_userIndex.csv : 325 clusters x 1 userIndex (0-99)
- 'cluster_userIndex.csv' is a vector where V_i = k if cluster_i was created by user_k.

- data_feature.csv : 180 images x 500 features
- Each row is a 500-dimensional feature vector: a bag of words built from SIFT descriptors. All 180 images are sampled from the NUS-WIDE dataset.
- Reference: [Web Link]

- supervision_cluster_matrix.csv : 108 bag of words x 183 clusters
- We parse the raw supervision text and merge similar words into 108 dimensions. Each row is a description of the corresponding cluster.

- perception_words.csv : 108 perception words
- Vocabulary of perception words.
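
As an illustration of how the indicator matrix and the user index vector described above fit together, here is a minimal Python sketch. It assumes both files are comma-separated with no header row, which the description does not confirm, and it uses tiny made-up stand-in data rather than the real 325 x 180 files. The helper names (read_int_matrix, images_per_cluster, clusters_per_user) are hypothetical.

```python
import csv
import io
from collections import defaultdict


def read_int_matrix(text):
    """Parse CSV text into a list of integer rows (assumes no header row)."""
    reader = csv.reader(io.StringIO(text))
    return [[int(float(cell)) for cell in row] for row in reader if row]


def images_per_cluster(indicator):
    """For each cluster (row), list the image indices j where M[i][j] == 1."""
    return [[j for j, flag in enumerate(row) if flag == 1] for row in indicator]


def clusters_per_user(user_index):
    """Group cluster indices by the user who created them (V_i = k)."""
    groups = defaultdict(list)
    for cluster_id, user_id in enumerate(user_index):
        groups[user_id].append(cluster_id)
    return dict(groups)


# Made-up stand-ins for cluster_data.csv (3 clusters x 4 images)
# and cluster_userIndex.csv (3 clusters x 1 user index).
toy_cluster_data = "1,0,1,0\n0,1,0,1\n1,1,0,0\n"
toy_user_index = "0\n0\n1\n"

indicator = read_int_matrix(toy_cluster_data)
user_index = [row[0] for row in read_int_matrix(toy_user_index)]

members = images_per_cluster(indicator)
by_user = clusters_per_user(user_index)

print(members)  # [[0, 2], [1, 3], [0, 1]]
print(by_user)  # {0: [0, 1], 1: [2]}
```

With the real files, the text passed to read_int_matrix would come from reading cluster_data.csv and cluster_userIndex.csv; if the files turn out to have a header row or a different delimiter, the parsing would need to be adjusted.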

Dataset spec 2 (Raw data):

- cluster_list.csv:
- Field names: ['UserId'], ['ImageId Cluster'], ['Description']
- ['UserId']: The user who created the cluster.
- ['ImageId Cluster']: Ids of the images in the cluster, separated by ';'.
- ['Description']: A sentence or a few keywords written by the user to describe the images in the cluster.
- 325 records (clusters) in total.
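
The raw records can be turned back into an indicator matrix like the one in spec 1. The sketch below shows one way to do that in Python. It assumes cluster_list.csv is comma-separated and has a header row with exactly the three field names listed above. The toy rows, the image id format ("img1", ...), and the helper names (parse_cluster_list, to_indicator) are made up for illustration.

```python
import csv
import io


def parse_cluster_list(text):
    """Parse raw records into dicts with user, image id list and description."""
    records = []
    for row in csv.DictReader(io.StringIO(text)):
        # Image ids are separated by ';'; drop empty tokens from trailing ';'.
        ids = [tok.strip() for tok in row["ImageId Cluster"].split(";") if tok.strip()]
        records.append({
            "user": row["UserId"].strip(),
            "images": ids,
            "description": row["Description"].strip(),
        })
    return records


def to_indicator(records):
    """Build a clusters x images 0/1 matrix; columns follow sorted image ids."""
    image_ids = sorted({img for rec in records for img in rec["images"]})
    column = {img: j for j, img in enumerate(image_ids)}
    matrix = [[0] * len(image_ids) for _ in records]
    for i, rec in enumerate(records):
        for img in rec["images"]:
            matrix[i][column[img]] = 1
    return image_ids, matrix


# Made-up stand-in for cluster_list.csv (the real file has 325 records).
toy_cluster_list = (
    "UserId,ImageId Cluster,Description\n"
    'u1,img3;img1,"sunny, outdoor"\n'
    "u2,img2;img3;,animals\n"
)

records = parse_cluster_list(toy_cluster_list)
image_ids, matrix = to_indicator(records)

print(image_ids)  # ['img1', 'img2', 'img3']
print(matrix)     # [[1, 0, 1], [0, 1, 1]]
```

The column order here is simply the sorted order of the ids found in the records, so it will not necessarily match the column order of the published cluster_data.csv.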

Attribute Information:

As described above.

Relevant Papers:

Learning User Perceived Clusters with Feature-Level Supervision
Ting-Yu Cheng; Kuan-Hua Lin; Xinyang Gong, Baidu Inc.; Kang-Jun Liu; Shan-Hung Wu*, National Tsing Hua University

Citation Request:

Please include this citation if you use this dataset.

Learning User Perceived Clusters with Feature-Level Supervision
Ting-Yu Cheng; Kuan-Hua Lin; Xinyang Gong, Baidu Inc.; Kang-Jun Liu; Shan-Hung Wu*, National Tsing Hua University
