Search code examples
pythongraphvizdecision-treepydot

Plotting a decision tree with pydot


I have trained a decision tree (Python dictionary) as below. Now I am trying to plot it using pydot. In defining each node of the tree (pydot graph), I appoint it a unique (and verbose) name and a brief label.

My problem is that in the resulting figure that I get by writing to a .png, I see the verbosenode names and not the node labels.

I have followed the answer by @Martijn Pieters here. I do not know what I am missing, any ideas?

import pydot

tree= {'salary': {'41k-45k': 'junior', '46k-50k': {'department': {'marketing': 'senior', 'sales': 'senior', 'systems': 'junior'}}, '36k-40k': 'senior', '26k-30k': 'junior', '31k-35k': 'junior', '66k-70k': 'senior'}}

def walk_dictionaryv2(graph, dictionary, parent_node=None):
    '''
    Recursive plotting function for the decision tree stored as a dictionary
    '''

    for k in dictionary.keys():

        if parent_node is not None:

            from_name = parent_node.get_name().replace("\"", "") + '_' + str(k)
            from_label = str(k)

            node_from = pydot.Node(from_name, label=from_label)

            graph.add_edge( pydot.Edge(parent_node, node_from) )

            if isinstance(dictionary[k], dict): # if interim node


                walk_dictionaryv2(graph, dictionary[k], node_from)

            else: # if leaf node
                to_name = str(k) + '_' + str(dictionary[k]) # unique name
                to_label = str(dictionary[k])

                node_to = pydot.Node(to_name, label=to_label, shape='box')
                graph.add_edge(pydot.Edge(node_from, node_to))

                #node_from.set_name(to_name)

        else:

            from_name =  str(k)
            from_label = str(k)

            node_from = pydot.Node(from_name, label=from_label)
            walk_dictionaryv2(graph, dictionary[k], node_from)


def plot_tree(tree, name):

    # first you create a new graph, you do that with pydot.Dot()
    graph = pydot.Dot(graph_type='graph')

    walk_dictionaryv2(graph, tree)

    graph.write_png(name+'.png')


plot_tree(tree,'name')

This is the (undesired) output I get with the code above:

enter image description here


Solution

  • You need to explicitly add the nodes you create to the graph:

    node_from = pydot.Node(from_name, label=from_label)
    graph.add_node(node_from)
    

    and

    node_to = pydot.Node(to_name, label=to_label, shape='box')
    graph.add_node(node_to)
    

    otherwise the renderer won't see the names. graph.add_node() includes the node metadata in the generated .dot file.

    With those graph.add_node() lines added, the result is:

    name.png