Lec 05 - Priority Queue

Definition

In a priority queue, each element has a "priority" and an element with higher priority is served before an element with lower priority.

In the later implementation using binary max heap, we may notice that this "priority" is related to the magnitude of the number, but not exactly everywhere, but on the same level.

And in Java, the priority queue is implemented using Binary Min Heap. So, here we will introduce the definition of binary heap also.

Binary Max Heap

A Binary (Max) Heap is a complete binary tree that maintains the Max Heap property.

Complete Binary Tree: Every level in the binary tree, except possibly the last/lowest level, is completely filled, and all vertices in the last level are as far left as possible
Binary Heap property: It has two versions, max and min.
- Binary Max Heap Property: The value of a vertex — except the leaf/leaves — must be greater than (or equal to) $\geq$ the value of its one (or two) child(ren).
- Binary Min Heap Property: Just the opposite of the above. The value of a vertex — except the leaf/leaves — must be less than (or equal to) $\leq$ the value of its one (or two) child(ren).

Below is an example of a binary max heap from visualgo. And remember this general shape of a complete binary tree as it will be useful when we analyze the basic operation inside ths binary max heap.

Additional Information

Height: the height of a tree is the number of edges from the top-most node to the farthest node in a subtree. For example, in the figure above, the height of this complete binary tree is 3.

Conversion between Binary Max Heap and Binary Min Heap

As we will see later, in Java, the implementation of priority queue uses the binary min heap. So, to do the conversion, if we only deal with numbers (including this visualization that is restricted to integers only), it is easy. Why?

We can re-create a Binary Heap with the negation of every integer in the original Binary Heap. If we start with a Binary Max Heap, the resulting Binary Heap is a Binary Min Heap (if we ignore the negative symbols — see the picture above), and vice versa.

Basic Manipulation

A complete binary tree can be stored efficiently as a compact array A as there is no gap between vertices of a complete binary tree/elements of a compact array. (Go back to visualgo to visualize this process).

This way, we can implement basic binary tree traversal operations with simple index manipulations (with help of bit shift manipulation):

parent(i) = i>>1, index i divided by 2 (integer division),
left(i) = i<<1, index i multiplied by 2,
right(i) = (i<<1)+1, index i multiplied by 2 and added by 1.

Insert

Insertion of a new item v into a Binary Max Heap can only be done at the last index N plus 1 to maintain the compact array. Why?

This is to maintain the complete binary tree property.

However, the Max Heap property may still be violated. This operation then fixes Max Heap property from the insertion point upwards (if necessary) and stop when there is no more Max Heap property violation. (See the visualization on visualgo to understand it better)

ExtractMax

The method returns and deletes the root vertex, then replace the root with the last index N. This is also to maintain the complete binary tree property.

But after the replacement, it will very likely violates the Max Heap property. This operation then fixes Binary Max Heap property from the root downwards by comparing the current value with the its child/the larger of its two children (if necessary). (Again, see the visualization on visualgo to understand it better)

Create

To create a binary max heap, one trivial way is to call the Insert N times. Thus, its time complexity will be $O(n\log n)$ . However, to make it faster, there is the second way to create a binary max heap,

insert each vertex sequentially, don't care about the max heap property
ignore all the bottom leaves, and start fixing the max heap property (by calling shiftDown()) from the second-last level of vertices, one-by-one and do this all the way until the root vertex.

Analysis of $O(n)$ Create

The height of a full binary tree of size $N$ is $\log_2N$ .
The cost to run shiftDown(i) operation is not the gross upper bound $O(\log N)$ , but $O(h)$ where $h$ is the height of the subtree rooted at $i$ . And thus, we can write $O(h)=c\times h$ , where $c$ is a constant.
There are $\lceil\frac{N}{2^{h+1}}\rceil$ vertices at height $h$ in a full binary tree.

On the example full binary tree above with $N=7$ and $h=2$ ,

There are, $\lceil\frac{7}{2^{(0+1)}}\rceil=4$ vertices at height $h=0$ (the bottom level).

After knowing the above three points and using the idea to start from $h=1$ all the way to the root, we can derive the formula to calculate the analysis as follows,

\begin{align*} \sum_{h=0}^{\lfloor \lg n \rfloor} \left \lceil\frac{n}{2^{h+1}}\right \rceil \cdot O(h) &= \sum_{h=0}^{\lfloor \lg n \rfloor} \left\lceil \frac{n}{2^{h+1}}\right\rceil \cdot (c\cdot h) \\ &= O \left( n \sum_{h=0}^{\lfloor \lg n \rfloor} \frac{h}{2^h} \right) \\ &= O \left( n \sum_{h=0}^{\infty} \frac{h}{2^h} \right) \\ &= O(2n) \\ &= O(n) \\ \end{align*}

Equation Explanation

From formula 2 to 3, it suffices to calculate $h\to\infty$ .
From formula 3 to 4, it uses the formula that $\sum_0^\infty kx^k\to2$ , and here we just let $x=\frac{1}{2}$ .

Take away from two creates

So, the take away from these two different creates is that,

If our starting point is different, we may get different results.

In this case, if we start fixing the binary max heap property from the root and down until the height = 1 level of vertices, we get $O(n\log n)$ complexity. But if we start fixing the binary max heap property from the height = 1 level of vertices and up until the root, we get $O(n)$ complexity. Thus, try changing start point when solving some problems☺️.

Update

To update the priority of an existing value that is already inserted into a Binary (Max) Heap, if the index i of the value is known, we can do the following simple strategy:

Simply update A[i] = newv and then
call both shiftUp(i) and shiftDown(i).

Only at most one of this Max Heap property restoration operation will be successful, i.e., shiftUp(i)/shiftDown(i) will be triggered if newv > or < old value of A[parent(i)]/A[larger of the two children of i], respectively.

Delete

To delete an existing value that is already inserted into a Binary Max Heap and we already know the index of the value to be deleted, we can just

update A[i]=A[1]+1 (a larger number greater than the current root),
call shiftUp(i) (technically, UpdateKey(i, A[1]+1)))
then call ExtractMax() once to remove it.

But what if we don't know the index of the value to be updated / deleted?

Wait for the Hash Table next week!

HeapSort

HeapSort is just simply calling the ExtractMax operation N times. Thus, its time complexity is obviously $O(n\log n)$ . (See the visualization on visualgo!)

One advantage of Heapsort is that we can use it to achieve partial sort! (Its real world application includes the searching result you get from Google). And below is an application from LeetCode 😂

Kth Largest Element in an Array - LeetCodeLeetCode

For the explanation, please see from here!

More Binary Heap

This part mainly comes from the Week 6's tutorial content.

Number of comparisons

The orginal question is,

What is the minimum and maximum number of comparisons between Binary Heap elements required to construct a Binary (Max) Heap of arbitrary n elements using the $O(n)$ Create(array)?

The minimum case is when everything is in order, the number of comparions is just the number of edges in the bianry tree.

The maximum case needs manual computation starting from the second-last layer, but the idea is "try to bubble/shift every node to the bottom layer"

Find the all numbers greater than k in Binary Max Heap in $O(k)$

This problem is a bit different from the leetcode one, as it explicitly specifies that the algorithm should be within $O(k)$ time. And it needs to print out all the vertices that are greater than k.

To solve it, we need to use the recursive/wishful thinking (we have learned in CS1010 or maybe CS1101S). So, we can break a bigger Binary Heap into three parts,

The left Binary Heap,
The root,
The right Binary Heap

In each breakdown, we compare the root with k, if it's bigger than k, we print the root and recursively call the function on the left binary heap and the right binary heap. Else, we return from the current function call, meaning this below this level, there isn't any value that is bigger than k.

So, the pseudocode will look like,

findVerticesBiggerThanX(vertex v, int x)
	if (v.value > x)
		output v.value
		findVerticesBiggerThanX(v.left, x)
		findVerticesBiggerThanX(v.right, x)

Change from Binary Max Heap to Binary Min Heap — Practical

This can be done by changing the order of Comparator.comparingInt(). For example,

PriorityQueue<Moose> pq = new PriorityQueue<>(
    Comparator.comparingInt((Moose m) -> m.power).reversed()
);

Moose is a class with an integer member called power. This Comparator also defines how the priority in the priority queue should be implemented!

PreviousLec 04 - List NextLec 06 - Hash Table

Last updated 4 months ago

hashtagDefinition

hashtagBinary Max Heap

hashtagAdditional Information

hashtagBasic Manipulation

hashtagInsert

hashtagExtractMax

hashtagCreate

hashtagAnalysis of O(n)O(n)O(n) Create

hashtagEquation Explanation

hashtagTake away from two creates

hashtagUpdate

hashtagDelete

hashtagHeapSort

hashtagMore Binary Heap

hashtagNumber of comparisons

hashtagFind the all numbers greater than k in Binary Max Heap in O(k)O(k)O(k)

hashtagChange from Binary Max Heap to Binary Min Heap — Practical