Privacy-Preserving Sequential Pattern Release

Jin, Huidong; Chen, Jie; He, Hongxing; O’Keefe, Christine M.

doi:10.1007/978-3-540-71701-0_57

Huidong Jin¹,
Jie Chen¹,
Hongxing He¹ &
…
Christine M. O’Keefe²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4426))

Included in the following conference series:

Pacific-Asia Conference on Knowledge Discovery and Data Mining

1878 Accesses
2 Citations

Abstract

We investigate situations where releasing frequent sequential patterns can compromise individual’s privacy. We propose two concrete objectives for privacy protection: k-anonymity and α-dissociation. The first addresses the problem of inferring patterns with very low support, say, in [1,k). These inferred patterns can become quasi-identifiers in linking attacks. We show that, for all but one definition of support, it is impossible to reliably infer support values for patterns with two or more negative items (items which do not occur in a pattern) solely based on frequent sequential patterns. For the remaining definition, we formulate privacy inference channels. α-dissociation handles the problem of high certainty of inferring sensitive attribute values. In order to remove privacy threats w.r.t. the two objectives, we show that we only need to examine pairs of sequential patterns with length difference of 1. We then establish a Privacy Inference Channels Sanitisation (PICS) algorithm. It can, as illustrated by experiments, reduce the privacy disclosure risk carried by frequent sequential patterns with a small computation overhead.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Privately vertically mining of sequential patterns based on differential privacy with high efficiency and utility

Article Open access 19 October 2023

Are Sequential Patterns Shareable? Ensuring Individuals’ Privacy

Frequent Sequence Pattern Mining with Differential Privacy

References

Vaidya, J., Clifton, C.: Privacy-preserving data mining: Why, how, and when. IEEE Security & Privacy 2, 19–27 (2004)
Article Google Scholar
Wong, R., et al. (alpha,k)-anonymity: An enhanced k-anonymity model for privacy-preserving data publishing. In: KDD’06, pp. 754–759 (2006)
Google Scholar
Atzori, M., et al.: Blocking anonymity threats raised by frequent itemset mining. In: ICDM’05, pp. 561–564 (2005)
Google Scholar
Oliveira, S.R.M., Zaïane, O.R., Saygin, Y.: Secure association rule sharing. In: Dai, H., Srikant, R., Zhang, C. (eds.) PAKDD 2004. LNCS (LNAI), vol. 3056, pp. 74–85. Springer, Heidelberg (2004)
Google Scholar
Kantarcioglu, M., Jin, J., Clifton, C.: When do data mining results violate privacy? In: KDD’04, pp. 599–604. ACM Press, New York (2004)
Google Scholar
Jin, H., et al.: Mining unexpected associations for signalling potential adverse drug reactions from administrative health databases. In: Ng, W.-K., et al. (eds.) PAKDD 2006. LNCS (LNAI), vol. 3918, pp. 867–876. Springer, Heidelberg (2006)
Chapter Google Scholar
Ayres, J., et al.: Sequential PAttern Mining using a bitmap representation. In: KDD’02, pp. 215–224 (2002)
Google Scholar
Sweeney, L.: k-anonymity: a model for protecting privacy. International Journal on Uncertainty, Fuzziness and Knowledge-based Systems 10, 557–570 (2002)
Article MATH MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

CSIRO Mathematical and Information Sciences, GPO Box 664, Canberra ACT 2601, Australia
Huidong Jin, Jie Chen & Hongxing He
CSIRO Preventative Health National Research Flagship, Canberra ACT 2601, Australia
Christine M. O’Keefe

Authors

Huidong Jin
View author publications
You can also search for this author in PubMed Google Scholar
Jie Chen
View author publications
You can also search for this author in PubMed Google Scholar
Hongxing He
View author publications
You can also search for this author in PubMed Google Scholar
Christine M. O’Keefe
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Zhi-Hua Zhou Hang Li Qiang Yang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jin, H., Chen, J., He, H., O’Keefe, C.M. (2007). Privacy-Preserving Sequential Pattern Release. In: Zhou, ZH., Li, H., Yang, Q. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2007. Lecture Notes in Computer Science(), vol 4426. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71701-0_57

Download citation

DOI: https://doi.org/10.1007/978-3-540-71701-0_57
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-71700-3
Online ISBN: 978-3-540-71701-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Privacy-Preserving Sequential Pattern Release

Abstract

Access this chapter

Preview

Similar content being viewed by others

Privately vertically mining of sequential patterns based on differential privacy with high efficiency and utility

Are Sequential Patterns Shareable? Ensuring Individuals’ Privacy

Frequent Sequence Pattern Mining with Differential Privacy

References

Author information

Authors and Affiliations

Editor information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Privacy-Preserving Sequential Pattern Release

Abstract

Access this chapter

Preview

Similar content being viewed by others

Privately vertically mining of sequential patterns based on differential privacy with high efficiency and utility

Are Sequential Patterns Shareable? Ensuring Individuals’ Privacy

Frequent Sequence Pattern Mining with Differential Privacy

References

Author information

Authors and Affiliations

Editor information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation