Privacy-Preserving Data mining
Theses
User Profiling and Classification for Fraud Detection in Mobile Communications Networks
(Jaakko Hollmén, HUT 2000)
The Third International Knowledge Discovery and Data Mining Tools Competition, 1999
(competition task was to build a network intrusion detector)
Overviews
Privacy Preserving Data Mining: Challenges and Opportunities
(Srikant, 2002)
@
Oblivious Transfer and Private Information Retrieval
Publications
Security and Privacy Implications of Data Mining
(Clifton, 1996)
2000
Data Mining Approaches for Intrusion Detection
(Wenke Lee, Salvatore J. Stolfo, USENIX Security Symposium 2000)
Privacy Preserving Data Mining
(Yehuda Lindell and Benny Pinkas 2000)
Privacy-Preserving Data Mining
( Rakesh Agrawal, Ramakrishnan Srikant, SIGMOD 2000)
Auditing Boolean Attributes
(Jon Kleinberg, Christos Papadimitriou, Prabhakar Raghavan, PODC 2000)
Protocols For Secure Remote database Access With Approximate Matching
(Wenliang Du, Mikhail J. Atallah, ACM SPE 2000)
2001
Privacy Preserving Distributed Data Mining
(Chris Clifton, 2001)
Selective Private Function Evaluation with Applications to Private Statistics
(Ran Canetti, Yuval Ishai, Ravi Kumar, Michael K. Reiter, Ronitt Rubinfeld, Rebecca N. Wright, 2001)
On the Design and Quantification of Privacy Preserving Data Mining Algorithms
(Dakshi Agrawal, Charu C. Aggarwal, SIGMOD 2001)
Privacy-Preserving Cooperative Statistical Analysis
(Wenliang Du and Mikhail J. Atallah, ACSAC 2001)
Protocols for Secure Remote Database Access with Approximate Matching
(Wenliang Du, Mikahil J. Atallah, 2001)
Protecting Respondent's Privacy in Microdata Release
(Pierangela Samarati, IEEE Transactions on Knowledge and Data Engineering 2001)
2002
KDD 2002
Privacy Preserving Mining of Association Rules
(Evfimievski, R. Srikant, R. Agrawal and J. Gehrke, KDD 2002)
Maintaining Data Privacy in Association Rule Mining
( Shariq Rizvi, Jayant R. Haritsa, VLDB 2002)
Privacy-preserving Distributed Mining of Association Rules on Horizontally Partitioned Data
(Murat Kantarcioglu, Chris Clifton, 2002)
Privacy Preserving Association Rule Mining in Vertically Partitioned Data
(Jaideep Vaidya, Chris Clifton, 2002)
Vulnerabilities in Similarity Search Based Systems
(Ali Saman Tosun, Hakan Ferhatosmanoglu, CIKM 2002)
Collaborative Filtering with Privacy
(John Canny, IEEE S&P 2002)
Collaborative Filtering with Privacy via Factor Analysis
(John Canny, ACM SIGIR 2002)
ACM SIGKDD Explorations Newsletter
Cryptographic Techniques for Privacy-Preserving Data Mining
(Benny Pinkas, Newsletter of the ACM Special Interest Group on Knowledge Discovery and Data Mining, January 2003)
Randomization in privacy preserving data mining
(Alexandre Evfimievski)
Achieving k-anonymity privacy protection using generalization and suppression
(Latanya Sweeney, 2002)
2003
Cryptographic Randomized Response Techniques
(Andris Ambainis, Markus Jakobsson and Helger Lipmaa, eprint 2003/027)
KDD 2003
Using Randomized Response Techniques for Privacy-Preserving Data Mining
(Wenliang Du and Zhijun Zhan, SIGKDD 2003)
Privacy-Preserving K-Means Clustering over Vertically Partitioned Data
(Jaideep Vaidya, Chris Clifton)
SIGMOD 2003, San Diego
Rights protection for relational data
(Radu Sion, Mikhail Atallah, Sunil Prabhakar)
Information sharing across private databases
(Rakesh Agrawal, Alexandre Evfievski, Ramakrishnan Srikant)
Limiting privacy breaches in privacy preserving data mining
(Alexandre Evfimievski, Johannes Gehrke, Ramakrishnan Srikant)
Privacy in data systems
(Rakesh Agrawal)
Revealing information while preserving privacy
(Irit Dinur, Kobbi Nissim)
ICDM 2003
On Inverse Frequent Set Mining
(Taneli Mielikäinen, PPDM 2003)
Random Data Perturbation Techniques and Privacy Preserving Data Mining
(H. Kargupta, S. Datta, Q. Wang, and K. Sivakumar, ICDM 2003)
Privacy-Preserving Collaborative Filtering using Randomized Perturbation Techniques
(Huseyin Polat and Wenliang Du)
Random projection and privacy preserving correlation computation from distributed data
(H. Kargupta, K. Liu, and J. Ryan, TR, 2003)
2004
SIAM ICDM
Privacy-Preserving Multivariate Statistical Analysis: Linear Regression and Classification
(Wenliang Du, Yunghsiang S. Han and Shigang Chen, 2004)
Privacy Preserving Keyword Searches on Remote Encrypted Data
(Yan-Cheng Chang and Michael Mitzenmacher, eprint 2004/051)
Efficient Private Matching and Set Intersection
(Freedman, Nissim, Pinkas, Eurocrypt 2004)
Privacy-Enhanced Searches Using Encrypted Bloom Filters
(Steven M. Bellovin, William R. Cheswick, 2004)
Secure Indexes
(Eu-Jin Goh, 2004)
Private Inference Control
(David Woodruff and Jessica Staddon, eprint 2004/130)
On Private Similarity Search Protocols
(Sven Laur, Helger Lipmaa, NordSec 2004)
On Private Scalar Product Computation for Privacy-Preserving Data Mining
(Bart Goethals, Sven Laur, Helger Lipmaa and Taneli Mielikäinen, ICISC 2004)
Private and Threshold Set-Intersection
(Lea Kissner, Dawn Song, CRYPTO 2005 (full version))
Conference version
Experimental Analysis of Privacy-Preserving Statistics Computation
(Hiranmayee Subramaniam, Rebecca N. Wright, Zhiqiang Yang, Secure Data Management 2004)
KDD 2004
kTTP: A New Privacy Model for Large Scale Distributed Environments
(Bobi Gilburd, Assaf Schuster, Ran Wolff, SIGKDD 2004)
Privacy-Preserving Bayesian Network Structure Computation on Distributed Heterogeneous Data
(Rebecca M. Wright, Zhiqiang Yang, KDD 2004)
Private and Threshold Set-Intersection
(Lea Kissner, Dawn Song, CMU TR, 2004)
Bottom-Up Generalization: A Data Mining Solution to Privacy Protection
(Ke Wang, Philip S. Yu, Sourav Chakraborty, IEEE ICDM 2004)
2005
Privacy-Preserving Classification of Customer Data without Loss of Accuracy
(Zhiqiang Yang, Sheng Zhong, Rebecca N. Wright, SDM 2005)
Testing Disjointness of Private Datasets
(Aggelos Kiayias, Antonina Mitrofanova, FC 2005)
Improved Privacy-Preserving Bayesian Network Parameter Learning
(Zhiqiang Yang, Rebecca N. Wright, PDM 05)
KDD 2005
Privacy-Preserving Distributed k-Means Clustering over Arbitrarily Partitioned Data
(Geetha Jagannathan, Rebecca N. Wright, KDD 2005)
Anonymity-preserving data collection
(Zhiqiang Yang, Sheng Zhong, Rebecca N. Wright, KDD 2005)
A new scheme on privacy-preserving data classification
(Nan Zhang, Shengquan Wang, Wei Zhao, KDD 2005)
Secure Computation Over Distributed Databases
(Chunhua Su, Kouchi Sakurai, 2005)
Secure Computation of Constant Depth Circuits with Applications to Database Search Problems
(Omer Barkol, Yuval Ishai, Crypto 2005)
Private Searching On Streaming Data
(Rafail Ostrovsky, William Skeith III, eprint 2005/242)
Private Itemset Support Counting
(Sven Laur, Helger Lipmaa, ICICS 2005)
Privacy-Preserving SVM using Nonlinear Kernels on Horizontally Partitioned Data
( Hwanjo Yu, Xiaoqian Jiang, Jaideep Vaidya, 2005)
Privacy-Preserving Polling using Playing Cards
(Sid Stamm and Markus Jakobsson, eprint 2005/444)
Private Approximation of Search Problems
(Amos Beimel, Paz Carmi, Kobbi Nissim, Enav Weinreb, ECCC TR05-141)
Template-Based Privacy Preservation in Classification Problems
(Ke Wang, Benjamin C. M. Fung, Philip S. Yu, IEEE ICDM 2005)
Top-Down Specialization for Information and Privacy Preservation
(Benjamin C. M. Fung, Ke Wang, Philip S. Yu, IEEE ICDE 2005)
Integrating Private Databases for Data Analysis
(Ke Wang, Benjamin C. M. Fung, Guozhu Dong, IEEE ISI 2005)
2006
Polylogarithmic Private Approximations and Efficient Matching
(Piotr Indyk, David Woodruff, TCC 2006)
New Constructions and Practical Applications for Private Stream Searching
(John Bethencourt, Dawn Song, Brent Waters, IEEE SP 2006)
A New Privacy-Preserving Distributed k-Clustering Algorithm
(Geetha Jagannathan, Krishnan Pillaipakkamnatt, Rebecca N. Wright, SDM 206)
Experimental Analysis of a Privacy-Preserving Scalar Product Protocol
(Zhiqiang Yang, Rebecca N. Wright, Hiranmayee Subramaniam, CSSE 2006)
Privacy Preserving SVM Classification On Vertically Partitioned Data
(Hwanjo Yu, Jaideep Vaidya, Xiaoqian Jiang, PAKDD 2006)
Privacy Preserving SVM Using Secure Set Intersection Cardinality
(Hwanjo Yu, Xiaoqian Jiang, Jaideep Vaidya, ACM SAC 2006)
Handicapping Attacker's Confidence
( An Alternative to k-Anonymization)
New Techniques for Private Stream Searching
(J Bethencourt, Dawn Song, Brent Waters, TR 2006)
New Constructions and Practical Applications for Private Stream Searching (Extended Abstract)
(J Bethencourt, Dawn Song, Brent Waters, IEEE SP 2006)
Searchable symmetric encryption: improved definitions and efficient constructions
(R Curtmola, J Garay, S Kamara, R Ostrovsky, ACM CCS 2006)
Improving the Decoding Efficiency of Private Search
(George Danezis, Claudia Diaz, eprint 2006/024)
Privacy Preserving Nearest Neighbor Search
(Mark Shaneck, Yongdae Kim, Vipin Kumar, PADM 2006)
Privacy Preserving Nearest Neighbor Search
(Mark Shaneck, Yongdae Kim, Vipin Kumar, U Minnesota TR 06-014)
KDD 2006
Cryptographically Private Support Vector Machines
(Sven Laur, Helger Lipmaa and Taneli Mielikäinen, KDD 2006)
Efficient Anonymity-Preserving Data Collection
(Justin Brickell, Vitaly Shmatikov, KDD 2006)
Workload-Aware Anonymization
(Kristen LeFevre, David DeWitt, Raghu Ramakrishnan, KDD 2006)
(alpha, k)-Anonymity
( An Enhanced k-Anonymity Model for Privacy-Preserving Data Publishing)
On Privacy Preservation against Adversarial Data Mining
(Charu Aggarwal, Jian Pei, Bo Zhang, KDD 2006)
Utility-Based Anonymization Using Local Recodings
(Jian Xu, Wei Wang, Jian Pei, Xiaoyuan Wang, Baile Shi, Ada Fu, KDD 2006)
Anonymization for Sequential Releases
(Ke Wang, Benjamin C. M. Fung, KDD 2006)
2007
Algebraic Lower Bounds for Computing on Encrypted Data
(R Ostrovsky, W Skeith, eprint 2007/064)
Public-key encryption that allows PIR queries D Boneh, E Kushilevitz, R Ostrovsky, W Skeith, eprint 2007/073
(29.04.07)
Conjunctive, Subset, and Range Queries on Encrypted Data
(Dan Boneh, Brent Waters, eprint 2006/287)
Secure Two-Party k-Means Clustering
(Paul Bunn and Rafail Ostrovsky, eprint 2007/231)
Private Multiparty Sampling and Approximation of Vector Combinations
(Yuval Ishai, Tal Malkin, Martin J. Strauss, Rebecca Wright, ICALP 2007)
Privacy-Preservation for Gradient Descent Methods
(Li Wan, Wee Keong Ng, Shuguo Han, Vincent C. S. Lee, KDD 2007)
Privacy-Preserving Self-Organizing Map
(Shuguo Han, Wee Keong Ng, 2007)
AC-Framework for Privacy-Preserving Collaboration
(Wei Jiang, Chris Clifton)
Computing Join Aggregates over Private Tables
(Rong She, Ke Wang, Ada Waichee Fu, Yabo Xu, 2007)
2008
Avrim Blum and Katrina Ligett and Aaron Roth, STOC 2008
(05.02.08)
On Some Open Questions in Communication-Efficient Cryptocomputing
(Helger Lipmaa, eprint 2008/107)
Link farms
The Privacy, Security and Data Mining Site
(U Alberta)
Privacy Preserving Data Mining Papers
(@UMBC.EDU)
Projects, working groups, ...
Privacy-Sensitive Data Mining from Multi-Party Distributed Data
(Hillol Kargupta, UMBC.Edu)
Courses, seminars
Privacy-Preserving Data Mining
(Seminar at Helsinki University of Technology (leader)
Some PPDM-ers
Jaideep S. Vaidya
(Purdue)
Vassilios S. Verykios
(Drexel)
Data-mining in general
Organizations
ACM SIGKDD
National Center for Data Mining
Some data miners
Rakesh Agrawal
(IBM Almaden)
Usama M. Fayyad
(Microsoft)
Heikki Mannila
(Helsinki UT)
Raghu Ramakrishnan
(Wisconsin)
Padhraic Smyth
(Irvine)
Ramakrishnan Srikant
(IBM Almaden)
Journals
Journal of privacy technology
IEEE Transactions on Knowledge and Data Engineering
Knowledge and Information Systems: An International Journal
Data Mining and Knowledge Discovery
IEEE Transactions on Pattern Analysis and Machine Intelligence
Workshops
DIMACS Summer School Tutorial on New Frontiers in Data Mining
(13-17.08.2001, Piscataway, NJ, USA)
Workshop on Data Mining for Security Applications
(08.11.2001, Philadelphia, PA, USA)
Workshop on Privacy, Security, and Data Mining
(2002)
2nd Workshop on Privacy Preserving Data Mining
(Melbourne, Florida, USA, November 19, 2003)
PSDM 2004
(01.11.2004, Brighton, UK)
Deadlines for conferences
Bart Goethals's collection
Conferences
IEEE International Conference on Data Mining
IEEE ICDM 2002
(9-12.12.2002, Maebashi City, Japan)
IEEE ICDM 2003
(19-22.11.2003, Melbourne, FL, USA)
IEEE ICDM 2004
(1-4.11.2004, Brighton, UK)
IEEE ICDM 2005
(27-30.10.2004, New Orleans, Louisiana, USA)
ACM KDD 2002
(23-26.07.2002, Edmonton, Alberta, Canada)
DMKD workshop
DMKD 2004
(13.06.2004, Maison de la Chimie, Paris, France)
DIMACS/PORTIA Workshop on Privacy-Preserving Data Mining
(15-16.03.04)
Collections
Meetings and Conferences in Data Mining, Knowledge Discovery, Genomic Mining, and Web Mining
Past Data Mining and Knowledge Discovery Meetings in 2002
PDM 2006
(08.04.2006, Atlanta, USA)
MSc Degree in Data Mining
(CCSU)
Data Mining, Knowledge Discovery, Genomic Mining, Web Mining
(KD Nuggets directory)
Media coverage
To share is human
(GCN, 2004)
Cryptology Pointers
by
Helger Lipmaa
Got any suggestions or additional links? Mail to
<lipmaa>
research.cyber.ee
NB! If you find any broken links, please be kind and report them to me together with their current location!
(C) Helger Lipmaa 1997-2009.