With the advent of cloud computing, data owners are motivated to outsource their complex data management systems from local sites to the commercial public cloud for great flexibility and economic savings. But for protecting data privacy, sensitive data have to be encrypted before outsourcing, which obsoletes traditional data utilization based on plaintext keyword search. Thus, enabling an encrypted cloud data search service is of paramount importance. Considering the large number of data users and documents in the cloud, it is necessary to allow multiple keywords in the search request and return documents in the order of their relevance to these keywords. Related works on searchable encryption focus on single keyword search or Boolean keyword search, and rarely sort the search results. In this research paper, for the first time, we define and solve the challenging problem of privacy-preserving multi-keyword ranked search over encrypted data in cloud computing (MRSE). We establish a set of strict privacy requirements for such a secure cloud data utilization system. Among various multi-keyword semantics, we choose the efficient similarity measure of “coordinate matching,” i.e., as many matches as possible, to capture the relevance of data documents to the search query. We further use “inner product similarity” to quantitatively evaluate such similarity measure. We first propose a basic idea for the MRSE based on secure inner product computation, and then give two significantly improved MRSE schemes to achieve various stringent privacy requirements in two different threat models. To improve search experience of the data search service, we further extend these two schemes to support more search semantics. Thorough analysis investigating privacy and efficiency guarantees of proposed schemes is given. Experiments on the real-world data set further show proposed schemes indeed introduce low overhead on computation and communication.
Keywords: Distributed Computing; Multi Keyword Ranked Search; Infrastructure-as-a-Service (IaaS); Platform-as-a-Service (PaaS); Software-as-a-Service (SaaS); National Institute of Standards and Terminology (NIST)
[1]
Rafail Ostrovsky,et al.
Searchable symmetric encryption: improved definitions and efficient constructions
,
2006,
CCS '06.
[2]
Mihir Bellare,et al.
Deterministic and Efficiently Searchable Encryption
,
2007,
CRYPTO.
[3]
Tri Van Le.
Efficient Provably Secure Public Key Steganography
,
2003,
IACR Cryptol. ePrint Arch..
[4]
Cong Wang,et al.
Privacy-preserving multi-keyword ranked search over encrypted cloud data
,
2011,
2011 Proceedings IEEE INFOCOM.
[5]
Ian H. Witten,et al.
Managing Gigabytes: Compressing and Indexing Documents and Images
,
1999
.
[6]
Rafail Ostrovsky,et al.
Public Key Encryption with Keyword Search
,
2004,
EUROCRYPT.
[7]
Cong Wang,et al.
Efficient verifiable fuzzy keyword search over encrypted data in cloud computing
,
2013,
Comput. Sci. Inf. Syst..
[8]
Dawn Xiaodong Song,et al.
Practical techniques for searches on encrypted data
,
2000,
Proceeding 2000 IEEE Symposium on Security and Privacy. S&P 2000.
[9]
Luis Rodero-Merino,et al.
A break in the clouds: towards a cloud definition
,
2008,
CCRV.
[10]
Michael Mitzenmacher,et al.
Privacy Preserving Keyword Searches on Remote Encrypted Data
,
2005,
ACNS.
[11]
Zhenyu Yang,et al.
LT codes-based secure and reliable cloud storage service
,
2012,
2012 Proceedings IEEE INFOCOM.
[12]
Kristin E. Lauter,et al.
Cryptographic Cloud Storage
,
2010,
Financial Cryptography Workshops.
[13]
Rafail Ostrovsky,et al.
Public Key Encryption That Allows PIR Queries
,
2007,
CRYPTO.