Algorithms Review Cheat Sheet

Sat May 13 2017

Basic Concepts

Complexity

An algorithm is efficient if it runs in polynomial time or better.

Big-O

Denotes worst case running time, a bound on the largest possible run time given input of size N. Theta is lower and upper bound

1 < log n < n < n log n < n^2 < n^3 < a^n < n!

Amortized

Average running time per operation over a worst-case sequence of operation

Ex. Size-doubling array: 2^n operation every 2^ith step, but constant step

otherwise. Hence, O(n) with constant amortized time

Stable Matching

Given a set of preferences among hospitals and medical school students, design a self-reinforcing admissions process

Unstable paier: x and y are unstable if
- x prefers y and y prefers x to their assigned groupign
Stable Assignment: Assignment with no unstable pairs
Perfect matching: Everyone has a single unique match

Propose-and-reject algorithm

Gale-Shapely
Guarantees a stable matching for any problem
Gives a proposing-optimal assignment

while some man is free and hasn't proposed to every woman
	choose such a man m
	w = first woman on m's list he has not proposed to
	if w is free
		assign m and w together
	else if w prefers m to her fiance m'
		assign m and w to be engaged, m' is free
	else
		w rejects m

Data Structures

Linked List

Each data node is linked to the next
Access: O(n)
Insert: O(1)
head - first node in list
tail - last node in list

Stack

Last in, First out
Access: O(1)
Insert: O(1)
push(), pop()
Implement with linked list

Queue

First in, First out
Access: O(1)
Insert: O(1) if doubly linked list

Searching

A linear search can be performed on a sorted sequence O(n)
Binary Search: Divide-and-conquer on sorted sequence O(log n)

Sorting

Bubble Sort: Arranges items by iterating over the sequence, putting larger

values on the top piece by piece, swap if left > right adjacent. O(n^2)

Selection Sort: Repeatedly select the next smallest item from unsorted

items O(n^2)

Insertion Sort: Iterates over sorted sequence, inserts the next

unsorted item into its proper place O(n^2)

Heap Sort: Build heap and extract keys in sorted sequence O(n log n)

Heap Trees

A heap is a complete binary tree in which the nodes are organied based on their data values

Max heap: The value is greater than its two children
Min heap: The value is smaller than its two children
Can only remove from the root node
Inserts in next available left/top location, and sift upwards
can be stored in a dynamic array
- parent = i // 2
- left = 2 * i
- right = 2 * i + 1
Insertion: O(log n)
Extraction: O(log n)
Building
- Top down: Heaps constructed by inserting into an empty heap O(n log n)
- Bottom up: Put everything in array and then fix O(n)

BST

Tree such that all keys less than current key is in the left subtree and all keys greater than the key are stored in the right subtree

Searching is trivial
Can be built recursively or iteratively
Removing
- Remove interior node ->
- Successor: Minimum key in right subtree
Search, insert, delete: O(n)

AVL Trees

Developed by Adel’son-Velskhii and Landis to guarantee that the tree is height balanced

Master Theorem

Algorithm Subcategories

Divide and Conquer

Break up problem into several parts
solve each part recursively
combine solutions to sub-problems into overall solution
n^2 -> n log n

Mergesort

Divide array into two halves
recursively sort each half
merge two halves to make a sorted whole

Convex Hull

Given a set of points, find the smallest convex polygon containing all the points

Graham-scan

start at bottom-left point
sort points by relative polar angle
begin
	add point to stack
	pop off if it creates a left turn

Counting Inversion

Find number of inversions made to match two lists

if list L has one element
	return 0 and list L
Divide L into two halves A and B
(ra, A) = function(A)
(rb, B) = function(B)
(r, L)  = merge-and-count(A,B)
return r = ra + rb + r and sorted list L

Greedy Algorithms

Build up solutions in small steps
Make local decisions
Previous decisions do not factor into the next consideration

Coin changing

Given currency denominations, devise a method to pay amount to customer using fewest number of coins

At each iteration, add coin of largest value that does not take us past the

amount to be paid * Optimal for US coinage * Suboptimal for other situations (1, 10, 21, 34, 70, 100):(140)

Interval Scheduling

Find the maximal subset of mutually compatible jobs

Consider ordering
- Earliest Start time
- Earliest finish time
- Shortest interval
- fewest conflicts
- Typically Finish Time is Optimal
Unweighted solution: Take the next valid job based on finish time and no

conflicts

O(n log(n))

Interval Partitioning

Find minimum number of classrooms to schedule all lectures so that no two occur at the same time

number of classrooms needed >= depth
Sort by start time
Assign lecture to any compatible classroom
O(n log(n))

Dynamic Programming

Develop a solution as a solution to several subproblems, and get benefits in performance as a result of the division of computation.

Weighted Interval Scheduling

Knapsack

Given:

A collection of n items
Each item has a weight w
each item has a value c
The knapsack has a total weight W

Task:

Determine set S of items of max value that can be contained in knapsack

Divisible:

Items do not have to be included in entirety

sort items in decresing order of c/w
i = 1
currentW = 0
while currentW + w_i < W
	take item of weight w_i and cost c_i
	currentW += w_i
	i++
take W-currentW portion of item i

Indivisible:

function(n,c,w,W)
if n <= 0
	return 0
if W < w_n
	withLastItem = -1
else
	withLastItem = c_n + function(n-1,c,w,W-w_n)
withoutLastItem = function(n-1,c,w,W)
return max{withLast,withoutLast}

Create knapsack with subproblems
let S[k][v] := the solution with first k items and available weight v

init S[0][v] = 0
init S[k][0] = 0
for v from 1 to W
	for k from 1 to n
		S[k][v] = S[k-1][v]
		if w_k <= v and S[k-1][v-w_k] + c_k > S[k][v]
			S[k][v] = S[k-1][v-w_k] + c_k
return S[n][W]

Longest Increasing Subsequence

Graph Algorithms

A graph is usually defined as having a number of vertices V and a number of edges E with or without weights.

Prim’s Algorithm

Finds Min-Span Tree
Greedy
O(V^2)

1. Choose an arbitrary starting vertex
2. Of edges that connect to the tree, add the minimum-weight one
3. Repeat until all vertices are accounted for

Dijkstra’s Algorithm

Shortest path between nodes with non-negative edges
O(V^2)

1. Choose starting vertex
2. Assign each node dist: 0 for starting, inf otherwise
3. For current, consider all neighbors and calc tentative dists.
		Compare with previous value and assign smaller
4. Mark the current node as visited and remove from unvisited set
5. if destination has been reached, terminate
	   else take 'shortest' unvisited node path, set as new current, and 3)

Floyd Warshall

Compares all possible paths through the graph between each pair of vertices
O(V^3)

let dist be a |V| x |V| array of minimum dists at infinity
for each vertex v
	dist[v][v] = 0
for each edge (u,v)
	dist[u][v] = w(u,v) // the weight of the edge (u,v)
for k from 1 to |V|
	for i from 1 to |V|
		for j from 1 to |V|
			if dist[i]][j] > dist[i][k] + dist[k][j]
				dist[i][j] = dist[i][k] + dist[k][j]
			end if

Bellman-Ford

Computes shortest path from source vertex to all other vertices
Slower than Dijkstras
but can handle negative values
Runs in O(|V||E|)

for each vertex v in vertices:
	dist[v]    = inf
	predecessor[v] = null;
dist[source] = 0

for i from 1 to size(vertices)-1:
	for each edge (u,v) with weight w in edges:
		if dist[u] + w < dist[v]:
			dist[v] = dist[u] + w
			predecessor[v] = u

for each edge (u,v) with weight w in edges:
	if dist[u] + w < dist[v]:
		error "Graph contains a negative-weight cycle"
return dist,predecessor

Kruskal

Ford-Fulkerson

Greedy Algorithm
Computes maximum flow in a flow network
Complexity: O(Ef); E edges, f maximum flow

given G = (V,E) with flow capacity c, source s, and sink t
f(u,v) = 0 for all edges(u,v)
while there is a path from s to t in G_f, with c_f(u,v) > 0 for all edges:
	find c_f(p) = min(c_f(u,v):(u,v))
	for each edge (u,v)
		f(u,v) += c_f(p)
		f(v,u) -= c_f(p)

Return to main blog page