Trees and Graphs

In this lesson, you'll explore key data structures in computer science, learning to represent them with Python tools like dictionaries and lists. You'll implement depth-first and breadth-first search traversals and apply these skills to a maze path-finding project in VS Code.

Learning Goals Learning Outcomes Teacher Notes

Warning: Your screen size is quite small for viewing this. Rotate your device into landscape mode if you can.

1 Introduction
2 Understanding Trees
3 Representing Trees in Python
4 Understanding Graphs
5 Depth-First Search (DFS)
6 Implementing DFS
7 Breadth-First Search (BFS)
8 Implementing BFS
9 Friend Finder
10 DFS Solution
11 BFS Solution
12 Challenge
13 Wrap Up
Quiz

1 - Introduction

In this lesson, you'll learn about trees and graphs, which are important data structures in computer science. You'll represent them using simple Python structures like dictionaries and lists, implement depth-first search (DFS) and breadth-first search (BFS) traversals, and apply them to a maze path-finding project in VS Code.

By the end of this lesson, you will be able to:

Explain the concepts of trees and graphs.
Represent trees and graphs in Python.
Implement DFS and BFS traversals.
Apply traversals to solve a path-finding problem in a maze.

You'll need VS Code or a similar IDE with the Python extension. Create a new folder called 'TreesGraphs' and work inside it. We'll create Python files as we go.

2 - Understanding Trees

A tree is a hierarchical data structure consisting of nodes connected by edges, with no cycles. It has a root node, and each node can have child nodes. Trees are used in file systems, organisation charts, and decision-making algorithms.

Key terms:

Root: The top node with no parent.
Leaf: A node with no children.
Edge: Connection between nodes.
Height: The longest path from root to leaf.

For example, a family tree starts with a root (grandparent) and branches to children and grandchildren. This branching structure allows trees to represent relationships where one item can lead to multiple others, like folders containing subfolders in a computer file system.

Trees are acyclic (no loops) and connected, making them efficient for searching and organising data.

3 - Representing Trees in Python

We can represent a tree using a dictionary where keys are nodes and values are lists of children. This is a simple way to model the hierarchical structure of a tree in Python, where each key represents a parent node, and the list shows its direct children. For example, if 'A' has children 'B' and 'C', it means 'A' is the parent, and 'B' and 'C' branch out from it.

Imagine the tree looking like this: 'A' at the top, connected down to 'B' and 'C'. From 'B', it branches to 'D' and 'E', and from 'C' to 'F'. 'D', 'E', and 'F' are leaves since they have no children.

Create a file named 'tree.py' in your 'TreesGraphs' folder.

Add the following code to represent a simple tree:

tree = {
    'A': ['B', 'C'],
    'B': ['D', 'E'],
    'C': ['F'],
    'D': [],
    'E': [],
    'F': []
}

def print_tree(node, tree, level=0, prefix=""):
    print(prefix + node)
    for child in tree[node]:
        print_tree(child, tree, level + 1, prefix + "--")

print_tree('A', tree)

Run the code. You should see the dictionary printed, representing a tree with root 'A', children 'B' and 'C', and so on.

This is called an adjacency list representation because it lists the adjacent (connected) nodes for each node. Leaves like 'D', 'E', 'F' have empty lists, meaning no further branches. This structure makes it easy to traverse the tree later, as we'll see in upcoming steps.

4 - Understanding Graphs

A graph is a collection of nodes (also called vertices) connected by edges. Unlike trees, which are strictly hierarchical and have no cycles, graphs are more flexible. They can have cycles, meaning you can start at a node, follow edges, and return to the starting node, and they don't have to follow a top-down structure. This makes graphs ideal for modelling complex networks, such as social media connections (where friends are linked), road maps, or even the internet with linked web pages.

Key terms:

Vertex: A node in the graph, like a city in a map.
Edge: A connection between vertices, which can be directed (one-way) or undirected (two-way).
Cycle: A path that starts and ends at the same vertex, forming a loop.
Directed Graph: Edges have a direction, like one-way streets or following someone on social media (A follows B, but not necessarily vice versa).
Undirected Graph: Edges are bidirectional, like mutual friendships or two-way roads.

For example, think of a road network as an undirected graph: cities are vertices, and roads are edges connecting them. If there's a loop where you can drive from city A to B to C and back to A, that's a cycle. This is different from a tree, where such loops aren't possible.

Graphs are incredibly versatile for representing real-world relationships and are used to solve problems like finding the shortest path between two points, detecting connections, or even recommending friends on social networks.

Next, we'll represent a graph in Python using a simple structure.

5 - Depth-First Search (DFS)

DFS is a traversal algorithm that explores as far as possible along each branch before backtracking. It uses a stack (or recursion) and is useful for path-finding or detecting cycles.

In DFS, you start at a node, visit an unvisited neighbour, and repeat recursively. If no unvisited neighbours, backtrack.

Imagine you're exploring a maze: you pick one path and follow it as deep as you can until you hit a dead end, then you backtrack to the last junction and try a different path. This 'depth-first' approach means you explore one branch thoroughly before moving to the next, which is great for finding any path quickly, even if it's not the shortest one.

DFS can find paths in mazes by exploring deep paths first. To make this clearer, think of DFS as diving deep into the structure, like following a single thread in a story until it ends, before picking up another thread. This method is efficient for tasks where you need to check if a path exists, as it can reach far nodes quickly without exploring everything at once.

Next, you'll implement DFS on the graph.

Unlock the Full Learning Experience

Get ready to embark on an incredible learning journey! Get access to this lesson and hundreds more in our Digital Skills Curriculum.

Get started

Copyright Notice
This lesson is copyright of DigitalSkills.org 2017 - 2025. Unauthorised use, copying or distribution is not allowed.

Trees and Graphs