Reading List 1 (Course Reader) 2 (Online)

    Introduction and Overview

  1. Statistical themes and lessons for data mining
    Daryl Pregibon, Clark Glymour, David Madigan and Padhraic Smyth
    In Proc. Second International Conference on Knowldege Discovery and Data Mining, pp 25-42, 1996
  2. An Overview of Predictive Learning and Function Approximation
    J. H. Friedman
    In V. Cherkassky, J.H. Friedman, and H. Wechsler, editors, From Statistics to Neural Networks, Proc. NATO/ASI Workshop, pp 1-61, Springer Verlag, 1994

    Exploratory Data Analysis

  3. The New Jersey Data Reduction Report
    Daniel Barbara, William DuMouchel, Christos Faloutsos, Peter J. Haas, Joseph M. Hellerstein, Yannis Ioannisdis, H. V. Jagadish, Theodore Johnson, Raymond Ng, Viswanath Poosala, Kenneth A. Ross, and Kenneth C. Servcik
    IEEE Bulletin of the Technical Committee on Data Engineering, 20(4), Dec, 1997, pp 3-45

    Clustering/Segmentation

  4. Chameleon: Hierarchical Clustering Using Dynamic Modeling
    George Karypis, Eui-Hong (Sam) Han, Vipin Kumar
    IEEE Computer, 32(8), 1999 Aug, pp. 68-75

    Association Rules

  5. Mining the Most Interesting Rules
    Roberto J. Bayardo Jr., Rakesh Agrawal
    See 1999 ACM SIGMOD Workshop on Research Issues in DMKD
    Proc. 5th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-99), Aug 1999, pp 145-154

    Classification

  6. Applying Classification Algorithms in Practice
    C. E. Brodley and P. Smyth
    Statistics and Computing, 7, 1997

    Combining Multiple Models

  7. Combining Predictors
    Leo Breiman
    in COMBINING ARTIFICIAL NEURAL NETS: Ensemble and Modular Multi-Net Systems
    Edited: Amanda Sharkey Publisher: Springer-Verlag London Ltd 1999

    Scalability Issues

  8. Mining Very Large Databases
    Venkatesh Ganti, Johannes Gehrke, Raghu Ramakrishnan
    IEEE Computer, 32(8), Aug, 1999, pp. 38-45

    Web Mining & Information Retrieval

  9. A Framework for Collaborative, Content-Based and Demographic Filtering
    Michael J. Pazzani
    Artificial Intelligence (in press)
  10. Mining the Web's Link Structure
    Soumen Chakrabarti, Byron E. Dom, S. Ravi Kumar, Prabhakar Raghavan, Sridhar Rajagopalan, Andrew Tomkins, David Gibson, and Jon Kleinberg
    IEEE Computer, 32(8), Aug, 1999, pp. 60-67