Open Access Open Access  Restricted Access Subscription or Fee Access

Multikeyword Ranked Search Based on Hierarchical Clustering Index

V Lalitha, K Gandhimathi

Abstract


Outsourcing of data into cloud becomes an effective trend in modern day computing due to its ability to provide low-cost, pay-as-you-go IT services. Cloud data owners prefer to outsource documents in an encrypted form for the purpose of privacy preserving. Therefore, it is necessary to develop effective and reliable cipher text search techniques. One challenge is that the relationship between documents will be normally concealed in the process of encryption, which will lead to significant search accuracy performance degradation. Also the volume of data in data centers has experienced a dramatic growth. This will make it even more challenging to design cipher text search schemes that can provide efficient and reliable online information retrieval on large volume of encrypted data. In this paper, a hierarchical clustering method is planned to support more search semantics and also to meet the demand for fast cipher text search within a big data environment. The proposed hierarchical approach clusters the documents based on the minimum relevance threshold, and then partitions the resulting clusters into sub-clusters until the constraint on the maximum size of cluster is reached. In order to confirm the authenticity of search results, a structure called minimum hash sub-tree is designed in this paper. The results show that with a sharp increase of documents in the dataset the search time of the proposed method increases linearly whereas the search time of the traditional method increases exponentially. Furthermore, the implemented method has an advantage over the traditional method in the rank privacy and relevance of retrieved documents.

Full Text:

PDF

References


C. Wang, N. Cao, K. Ren, W.J. Lou. Enabling secure and efficient ranked keyword search over outsourced cloud data, IEEE Trans Parallel Distrib Syst. 2012; 23(8): 1467–79p.

C. Wang, N. Cao, J. Li, K. Ren, W.J. Lou. Secure ranked key-word search over encrypted cloud data, In: Proc. IEEE 30th Int. Conf. Distrib. Comput. Syst. Genova, Italy, 2010, 253–62p.

N. Cao, C. Wang, M. Li, K. Ren, W.J. Lou. Privacy-preserving multi-keyword ranked search over encrypted cloud data, In: Proc. IEEE INFOCOM. Shanghai, China, 2011, 829–37p.

W. Sun, B. Wang, N. Cao, M. Li, W. Lou, Y. T. Hou, H. Li. Privacy-preserving multi-keyword text search in the cloud supporting similarity-based ranking, In: Proc. 8th ACM SIGSAC Symp. Inform., Comput. Commun. Security. Hangzhou, China, 2013, 71–82p.

C. Chen, X.J. Zhu, P.S. Shen, J.K. Hu. A hierarchical clustering method for big data oriented ciphertext search, In: Proc. IEEE INFOCOM, Workshop on Security and Privacy in Big Data. Toronto, Canada, 2014, 559–64p.

R.X. Li, Z.Y. Xu, W.S. Kang, K.C. Yow, C.Z. Xu. Efficient Multi-keyword ranked query over encrypted data in cloud com-puting, Futur. Gener. Comp. Syst. 2014; 30: 179–90p.

D. Cash, J. Jaeger, S. Jarecki, C. Jutla, H. Krawczyk, M.C. Rosu, M. Steiner. Dynamic searchable encryption in very large databases: data structures and implementation, In: Proc. Netw. Distrib. Syst. Security Symp. 2014; 14. doi: http://dx.doi.org/10.14722/ndss.2014.23264.

S. Jarecki, C. Jutla, H. Krawczyk, M. Rosu, M. Steiner. Outsourced symmetric private information retrieval, In: Proc. ACM SIGSAC Conf. Comput. Commun. Secur. 2013, 875–88p.


Refbacks

  • There are currently no refbacks.