Search code examples
python-3.xerror-handlinggraphvizdecision-tree

How to graph a tree with graphviz?


I can't reproduce a simple example. Here is how it goes:

import pandas as pd
import numpy as np
import sklearn as skl
from sklearn import tree
from sklearn.cross_validation import train_test_split as tts



#   import data and give a little overview
sample = pd.read_stata('sample_data.dta')
s = sample

#   Let's learn
X = np.array((s.crisis, s.cash, s.industry, s.current_debt, s.activity)).reshape(1000, 5)
y = np.array(s.wc_measure)

X_train, X_test, y_train, y_test = tts(X, y, test_size = .8)

my_tree = tree.DecisionTreeClassifier()
clf = my_tree.fit(X_train, y_train)
predictions = my_tree.predict(X_test)

#   Graph the tree
from sklearn.externals.six import StringIO  
import pydotplus 
dotfile = StringIO() 
tree.export_graphviz(clf, out_file=dotfile) 
pydotplus.graph_from_dot_data(dotfile.getvalue()).write_png("my_tree.png")

When I try to graph the tree, I get the following error:

pydotplus.graphviz.InvocationException: GraphViz's executables not found

Any idea what the problem is? I neatly follow graphviz's documentation.


Solution

  • It seem you didn't install the graphviz software at all. You should download and install it before using the module.