南京大学学报(自然科学版) ›› 2019, Vol. 55 ›› Issue (1): 41–48.doi: 10.13232/j.cnki.jnju.2019.01.004

• • 上一篇    下一篇

基于语义相似度的无监督图像哈希方法

王伯伟1,聂秀山1*,马林元2,尹义龙3   

  1. 1.山东财经大学计算机科学与技术学院,济南,250014;2.山东财经大学实验教学中心,济南,250014;3.山东大学软件学院,济南,250100
  • 接受日期:2018-09-01 出版日期:2019-02-01 发布日期:2019-01-26
  • 通讯作者: 聂秀山,E-mail:niexsh@sdufe.edu.cn E-mail:niexsh@sdufe.edu.cn
  • 基金资助:
    国家自然科学基金(61671274),山东省高等学校科技计划(J17KB161),山东省高校优势学科人才团队培育计划

Unsupervised image hash method based on semantic similarity

Wang Bowei1,Nie Xiushan1*,Ma Linyuan2,Yin Yilong3   

  1. (1.School of Computer Science and Technology,Shandong University of Finance and Economics,Ji’nan,250014,China; 2.Experimental Teaching Center,Shandong University of Finance and Economics,Ji’nan,250014,China; 3.School of Software Engineering, Shandong University,Ji’nan,250100,China
  • Accepted:2018-09-01 Online:2019-02-01 Published:2019-01-26
  • Contact: Nie Xiushan,E-mail:niexsh@sdufe.edu.cn E-mail:niexsh@sdufe.edu.cn

摘要: 哈希方法作为最近邻搜索中的一个重要算法,具有快速及低内存的优良特性,能够较好地解决现实图像数据库中存在的样本标签信息缺失、人工标注成本过高等问题,因此在图像检索领域得到广泛使用. 提出一种基于语义相似度的无监督图像哈希方法. 首先对原始图像进行语义聚类,然后基于图像的语义相似性,把原始图像特征映射到汉明空间;同时,为了增强哈希学习的鲁棒性,在所得到的目标函数中,采用了2,p范数(0

关键词: 语义相似度, 无监督哈希, 离散哈希, 相似性学习

Abstract: As one of the most important algorithms in nearest neighbor search,the hash method has the advantages of fast speed and low memory. It can solve the problems of the lack of sample label information and high cost of manual labeling in the real image databases. Therefore,it is widely used in the field of image retrieval. This paper proposes an unsupervised image hashing method based on semantic similarity. This proposed method first performs semantic clustering on the original image,and then maps the original image features to the hamming space based on the semantic similarity of the image. In addition,in order to enhance the robustness of hash learning,the proposed method adopts 2,p-norm (0

Key words: semantic similarity, unsupervised hashing, discrete hashing, similarity learning

中图分类号: 

  • TP37
[1] Kang W C,Li W J,Zhou Z H. Column sampling based discrete supervised hashing ∥ Proceedings of the 30th AAAI Conference on Artificial Intelligence. Phoenix,AZ,USA:AAAI Press,2016:1230-1236.
[2] Zhu X F,Huang Z,Shen H T,et al. Linear cross-modal hashing for efficient multimedia search ∥ Proceedings of the 21st ACM international conference on Multimedia. New York,NY,USA:ACM,2013:143-152.
[3] Yu Z,Wu F,Yang Y,et al. Discriminative coupled dictionary hashing for fast cross-media retrieval ∥ Proceedings of the 37th International ACM SIGIR Conference on Research & Development in Information Retrieval. New York,NY,USA:ACM,2014:395-404.
[4] Dasgupta A,Kumar R,Sarlos T. Fast locality-sensitive hashing ∥ Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York,NY,USA:ACM,2011:1073-1081.
[5] Har-Peled S,Indyk P,Motwani R. Approximate nearest neighbor:Towards removing the curse of dimensionality. Theory of Computing,2012,8(1):321-350.
[6] Ji J Q,Li J M,Yan S C,et al. Min-max hash for jaccard similarity ∥ 2013 IEEE 13th International Conference on Data Mining. Dallas,TX,USA:IEEE,2013:301-309.
[7] Liu W,Wang J,Ji R R,et al. Supervised hashing with kernels ∥ Conference on Computer Vision and Pattern Recognition. Providence,RI,USA:IEEE,2012:2074-2081.
[8] Lin G S,Shen C H,Van Den Hengel A. Supervised hashing using graph cuts and boosted decision trees. IEEE Transactions on Pattern Analysis and Machine Intelligence,2015,37(11):2317-2331.
[9] Xia R,Pan Y,Lai H,et al. Supervised hashing for image retrieval via image representation learning ∥ Proceedings of the 28th AAAI Conference on Artificial Intelligence. Québec City,Canada:AAAI,2014.
[10] Gong Y C,Lazebnik S. Iterative quantization:A procrustean approach to learning binary codes ∥ Internaltional Conference on Computer Vision and Pattern Recogintion,CVPR 2011. Colorado Springs,CO,USA:IEEE,2011:817-824.
[11] Salakhutdinov R,Hinton G. Semantic hashing. International Journal of Approximate Reasoning,2009,50(7):969-978.
[12] Lin J,Morère O,Petta J,et al. Tiny descriptors for image retrieval with unsupervised triplet hashing ∥ 2016 Data Compression Conference. Snowbird,UT,USA:IEEE,2016:397-406.
[13] Lin K,Lu J W,Chen C S,et al. Learning compact binary descriptors with unsupervised deep neural networks ∥ 2016 IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas,NV,USA:IEEE,2016:1183-1192.
[14] Shen F M,Shen C H,Shi Q F,et al. Hashing on nonlinear manifolds. IEEE Transactions on Image Processing,2015,24(6):1839-1851.
[15] Von Luxburg U. Clustering stability:An overview. Foundations and Trends in Machine Learning,2010,2(3):235-274.
[16] Lin G S,Shen C H,Shi Q F,et al. Fast supervised hashing with decision trees for high-dimensional data ∥ 2014 IEEE Conference on Computer Vision and Pattern Recognition. Columbus,OH,USA:IEEE,2014:1971-1978.
[17] Lin G S,Shen C H,Suter D,et al. A general two-step approach to learning-based hashing ∥ IEEE International Conference on Computer Vision. Sydney,Australia:IEEE,2014:2552-2559.
[18] Leng C,Cheng J,Wu J X,et al. Supervised hashing with soft constraints ∥ Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management. New York,NY,USA:ACM,2014:1851-1854.
[19] Zhang D Q,Li W J. Large-scale supervised multimodal hashing with semantic correlation maximization ∥ Proceedings of the 28th AAAI Conference on Artificial Intelligence. Québec City,Canada:AAAI Press,2014:2177-2183.
[20] Yang Y,Ma Z G,Yang Y,et al. Multitask spectral clustering by exploring intertask correlation. IEEE Transactions on Cybernetics,2015,45(5):1083-1094.
[21] Yang Y,Zha Z J,Gao Y,et al. Exploiting web images for semantic video indexing via robust sample-specific loss. IEEE Transactions on Multimedia,2014,16(6):1677-1689.
[22] Yan T K,Xu X S,Guo S,et al. Supervised robust discrete multimodal hashing for cross-media retrieval ∥ Proceedings of the 25th ACM International on Conference on Information and Knowledge Management. New York,NY,USA:ACM,2016:1271-1280.
[23] Shen F M,Gao X,Liu L,et al. Deep asymmetric pairwise hashing ∥ Proceedings of the 25th ACM International Conference on Multimedia. New York,NY,USA:ACM,2017:1522-1530.
[24] Jiang Q Y,Li W J. Deep cross-modal hashing. 2016,arXiv:1602.02255.
[25] Shen F M,Zhou X,Song J,et al. A fast optimization method for general binary code learning. IEEE Transactions on Image Processing,2016,25(12):5610-5621.
[26] Attouch H,Bolte J,Svaiter B F. Convergence of descent methods for semi-algebraic and tame problems:Proximal algorithms,forward-backward splitting,and regularized Gauss-Seidel methods. Mathematical Programming,2013,137(1-2):91-129.
[27] Weiss Y,Torralba A,Fergus R. Spectral hashing ∥ Proceedings of the 21st International Conference on Neural Information Processing Systems. Vancouver,Canada:Curran Associates Inc.,2008:1753-1760.
[28] Wang J,Kumar S,Chang S F. Semi-supervised hashing for scalable image retrieval ∥ 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. San Francisco,CA,USA:IEEE,2010:3424-3431.
[29] Liu W,Wang J,Kumar S,et al. Hashing with graphs ∥ Proceedings of the 28th International Conference on Machine Learning,ICML-11. Bellevue,WA,USA,2011:1-8.
[30] Liong V E,Lu J W,Wang G,et al. Deep hashing for compact binary codes learning ∥ 2015 IEEE Conference on Computer Vision and Pattern Recognition. Boston,MA,USA:IEEE,2015:2475-2483.
[31] Lei Z,Shen J L,Liang X,et al. Unsupervised topic hypergraph hashing for efficient mobile image retrieval. IEEE Transactions on Cybernetics,2017,47(11):3941-3954.
[1] 孔 颉, 孙权森, 纪则轩, 刘亚洲. 基于仿射不变离散哈希的遥感图像快速目标检测新方法[J]. 南京大学学报(自然科学版), 2019, 55(1): 49-60.
[2] 李 钝,薛昊原,李 伦,郑志蕴*. 面向教学资源的实体链接算法[J]. 南京大学学报(自然科学版), 2015, 51(4): 901-908.
[3] 吴秦,梁久祯,白玉昭. 一种基于语义词典的局部查询扩展方法[J]. 南京大学学报(自然科学版), 2014, 50(4): 526-.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
[1] 段新春,施 斌,孙梦雅,魏广庆,顾 凯,冯晨曦. FBG蒸发式湿度计研制及其响应特性研究[J]. 南京大学学报(自然科学版), 2018, 54(6): 1075 -1084 .
[2] 汪 勇,刘 瑾*,宋泽卓,白玉霞,王琼亚,祁长青,孙少锐. 高分子稳定剂加固河道边坡表层砂土室内试验研究[J]. 南京大学学报(自然科学版), 2018, 54(6): 1095 -1104 .
[3] 卢 毅,于 军,龚绪龙,王宝军,魏广庆,季峻峰. 基于DFOS的连云港第四纪地层地面沉降监测分析[J]. 南京大学学报(自然科学版), 2018, 54(6): 1114 -1123 .
[4] 孙 玫,张 森,聂培尧,聂秀山. 基于朴素贝叶斯的网络查询日志session划分方法研究[J]. 南京大学学报(自然科学版), 2018, 54(6): 1132 -1140 .
[5] 胡 淼,王开军,李海超,陈黎飞. 模糊树节点的随机森林与异常点检测[J]. 南京大学学报(自然科学版), 2018, 54(6): 1141 -1151 .
[6] 周星星,张海平,吉根林. 具有时空特性的区域移动模式挖掘算法[J]. 南京大学学报(自然科学版), 2018, 54(6): 1171 -1182 .
[7] 赵天龙,刘 峥,韩慧健,张彩明. 基于二分图的个性化图像标签推荐算法[J]. 南京大学学报(自然科学版), 2018, 54(6): 1193 -1205 .
[8] 韩明鸣, 郭虎升, 王文剑. 面向非平衡多分类问题的二次合成QSMOTE方法[J]. 南京大学学报(自然科学版), 2019, 55(1): 1 -13 .
[9] 刘 素, 刘惊雷. 基于特征选择的CP-nets结构学习[J]. 南京大学学报(自然科学版), 2019, 55(1): 14 -28 .
[10] 孔 颉, 孙权森, 纪则轩, 刘亚洲. 基于仿射不变离散哈希的遥感图像快速目标检测新方法[J]. 南京大学学报(自然科学版), 2019, 55(1): 49 -60 .