NEWS

1 Course

Regular
ESE (Option III)
Descriptor Descriptor
Topics/Schedule Topics/Schedule
TA: Alex Strehl TA: Kui-yu Chang
Schedule of student presentations
Secured Website
Login with your last name in ALL CAPS and your social as the password.
  • Secured Website
    Login:surname(lowercase)
    Password:9-digit student #
    If surname fails, prepend first letter of firstname. e.g. the TA would use kchang if chang fails
  • Homework
    Homework

    2 References

    1. Reading List 1 (Course Reader)
    2. Reading List 2 (Online)
    3. Enterprise Miner Manual (Online)
    4. SAS Online Tutorial
    5. Introduction to OLE DB for Data Mining
    6. PageRank Paper
    7. Citeseer at NEC
    3 Term Paper 4 News/Media

    5 Tools

    1. Enterprise Miner
      1. Detailed Remote Access Instructions
      2. Frequently Asked Questions (FAQ) by students
    2. UT WNT page w/screenshots
    3. Notes on connecting to published Applications.
    4. Aladdin Expander (.gz)
    5. Ghostview (.ps)
    6. Acrobat Reader (.pdf)

    7 Topics

    Visualization
  • Introduction to Visualization: Vis '96 Tutorial
  • GTM (Generative Topographical Mapping)
  • Hierachical Probabilistic PCA
  • CVIZ - IBM SurfAid
  • Data Reduction
  • Principal Component Analysis (PCA)
  • Synopsis 1
  • Synopsis 2 
  • Synopsis 3 (Theoretical)
  • Notes on Spectral Decomposition
  • MATLAB procedure comparing SVD and PCA
  • Principal Curves (nonlinear)
  • Web-based statistical analyzer
  • Classification
  • ROC curve notes (with figures)
    roc.ps roc.ps.gz roc.zip
  • Support Vector Machines
  • Bayesian Networks
  • Tutorial on Learning Bayesian Networks
  • Bayesian Belief Networks
  • Clustering
    METIS
    Miscellaneous
  • EM Algorithm
  • 6 Free Software/Libraries

  • KDNuggets Software

  •  
    Software Libraries (Open-source)
  • BayesBuilder (Win95/NT)
  • DBMiner (Win 95/NT)
  • EXPO SE (Win 95/NT)
  • MLC++ (C++)
  • MOBAL (currently down)
  • TOOLDIAG (C)
  • WEKA (Java)
  • 8 Datasets

    Links (original) Local copy
    1. UCI KDD Archive
    2. UCI Machine Learning databse
    3. Delve
    4. ELENA
    5. PRNN
    6. PROBEN1
    7. StatLib
    8. Statlog
    LANS benchmarks Datasets
    KDD Sisyphus I Yes
    KDNuggets Datasets
    Financial Time Series Data
    The Data Mine