CrossMine

Efficient Classification Across Multiple Database Relations

Authors

Introduction

CrossMine is a tool for classification in relational databases. The training set is a relational database that contains one target relation, in which every tuple is associated with a class label. The test set is a database of the same schema (and usually all relations remain same except the target relation), but the tuples in the target relation are not labeled. The target relation contains objects to be classified, and the other relations provide useful knowledge for classification. Given a training set, CrossMine builds a rule-based classifier, which can be applied on the test set to predict the class labels of unseen tuples.

Usage Agreement

Note

User Manual

1. Data Format

The training set contains the following files:

2. Usage Instructions

  1. Run CrossMine.exe.
  2. Click "Operation→Read Trainset" and read in "relations_train.des". Now all training data are read in, and you may navigate the data by clicking on relations.
  3. Click "Operation→Build Rules" to build rules. Rules are output to "rule.txt".
  4. Click "Operation→Read Testset" to read in the testing data.
  5. Click "Operation→Classify All Tuples" to perform classification.
  6. Click on the ID of any target tuple (either in training or testing set) to see the rule used for classifying this tuple.

Download CrossMine

Downloading is for internal research usage only.  Redistribution and commercial usage are not permitted. Download

References