KDD Cup  

Home Page
KDD Cup 2008
KDD Cup 2007
KDD Cup 2006
KDD Cup 2005
KDD Cup 2004
KDD Cup 2003
KDD Cup 2002
KDD Cup 2001
KDD Cup 2000
KDD Cup 1999
KDD Cup 1998
KDD Cup 1997
SIGKDD

KDD Cup 2000: Datasets

Real Datasets for Association Rule Discovery (updated Oct 2002)

Three real-world datasets are available. You are required to sign a simple non-disclosure agreement in order to receive a password to access the data. Basically, any use of the data is allowed as long as the proper acknowledgment to Blue Martini Software is provided and a copy of the work is sent (e-mail is fine). For reference, please reference the following article instead of the KDD Cup paper:

Zijian Zheng, Ron Kohavi, and Llew Mason, Real World Performance of Association Rule Algorithms, KDD 2001.

The bibtex entry is:

@inproceedings{ zheng-kohavi-mason-real-assoc,
author = "Zijian Zheng and Ron Kohavi and Llew Mason",
title = "Real World Performance of Association Rule Algorithms",
booktitle = "Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge
Discovery and Data Mining",
editor={Foster Provost and Ramakrishnan Srikant},
pages={401--406},
year = 2001,
url = {http://robotics.Stanford.EDU/users/ronnyk/realWorldAssoc.pdf}}

  • BMS-WebView-1
  • BMS-POS

  • Note, a long version of the oroginal paper is available as well as the slides.
    Please remember the restrictions on the data.