For a first feature of a dataset having a plurality of features, training
a classifier to predict the first feature in terms of other features in
the data set to obtain a trained classifier; scrambling the values of a
second feature in the data set to obtain a scrambled data set, executing
the trained classifier on the scrambled data set, determining predictive
importance of the seconds feature in predicting the first feature based
at least in part on the accuracy of the trained classifier in predicting
the first feature when executed with the scrambled data set and creating
a graph of the data set in which each of the first and the second
features is a node of the graph and a label on an edge between the first
node and the second node is based at least in part on the predictive
importance of the first feature in terms of the second feature.