COMP3010: Algorithm Theory and Design
Daniel Sutantyo, Department of Computing, Macquarie University
5.2 - Greedy Algorithm - Activity-Selection Problem
Problem description
5.2 - Activity-Selection Problem
- You have a list of n activities you can perform, but all of these activities use the same resource, so you cannot perform two of these activities at the same time
- Examples:
- one classroom for multiple classes
- one doctor, multiple appointments
- Examples:
- You want to do as many activities as you can, which ones should you choose?
Problem description
5.2 - Activity-Selection Problem
0
2
4
6
8
10
12
14
16
time
Problem description
5.2 - Activity-Selection Problem
0
2
4
6
8
10
12
14
16
- In problem-solving, try to generate examples that will invalidate your trivial solutions, so you don't spend too much time working on a wrong approach
Problem description
5.2 - Activity-Selection Problem
0
2
4
6
8
10
12
14
16
activity
start
finish
4 5 6 7 9 9 10 11 12 14 16
1 3 0 5 3 5 6 8 8 2 12
1 2 3 4 5 6 7 8 9 10 11
Problem description
5.2 - Activity-Selection Problem
0
2
4
6
8
10
12
14
16
activity
start
finish
4 5 6 7 9 9 10 11 12 14 16
1 3 0 5 3 5 6 8 8 2 12
1 2 3 4 5 6 7 8 9 10 11
Problem description
5.2 - Activity-Selection Problem
0
2
4
6
8
10
12
14
16
activity
start
finish
4 5 6 7 9 9 10 11 12 14 16
1 3 0 5 3 5 6 8 8 2 12
1 2 3 4 5 6 7 8 9 10 11
Problem description
5.2 - Activity-Selection Problem
- Let S={a1,a2,…,an} be the set of n activities ordered by their finish times (in increasing order)
- Each activity ai has start time si and finish time fi, 0≤si<fi
- ai takes place during half-open time interval [si,fi)
- x∈[si,fi) means si≤x<fi
- activities ai and aj are compatible if [si,fi) and [sj,fj) do not overlap
- i.e. fi<sj or si≥fj
- ai takes place during half-open time interval [si,fi)
Problem description (formal)
5.2 - Activity-Selection Problem
- Input:
- The set S={a1,a2,…,an} of n activities, sorted in increasing order of finish time
- The start time si and finish time fi for each activity ai
-
Output:
- The maximum subset A={aℓ1,aℓ2,…,aℓm}⊆S of compatible activities, where two activities ai and aj are compatible if [si,fi) and [sj,fj) do not overlap, i.e. fi<sj or si≥fj
Problems and Subproblems?
5.2 - Activity-Selection Problem
0
2
4
6
8
10
12
14
16
activity
start
finish
4 5 6 7 9 9 10 11 12 14 16
1 3 0 5 3 5 6 8 8 2 12
1 2 3 4 5 6 7 8 9 10 11
Problems and Subproblems
5.2 - Activity-Selection Problem
[ ]
[a1]
[a1,a2]
[a1,a2,a3]
[ ]
[a1]
[a2]
[ ]
[a1,a2]
[a1,a3]
[a1]
[a2,a3]
[a2]
[ a3]
[ ]
pick a1?
pick a2?
pick a3?
Problems and Subproblems
5.2 - Activity-Selection Problem
select(S)
pick a1?
pick a2?
- Let's call the problem, select(S) where S is a set of activities (remember that we want to return the set of compatible activities)
select(S∖{a1})
select(S∖{a1,a2})
select(S∖{a1})
select(S∖{a1})
select(S∖{a1})
select(S)
select(S)
Problems and Subproblems
5.2 - Activity-Selection Problem
- We can do better: if we pick ak, then we cannot pick another activity that runs on the same time, i.e. whatever activity we choose next must either finish before ak starts or starts after ak finishes
- Let Si,j be the set of activities that start after activity ai finishes, and finishes before activity aj starts
0
2
4
6
8
10
12
14
16
S1,11
a1
a11
Problems and Subproblems
5.2 - Activity-Selection Problem
- Let Si,j be the set of activities that start after activity ai finishes, and finishes before activity aj starts
- So once we pick an activity ak, we can only pick S?,k and Sk,?
- hmm, how does this notation work?
0
2
4
6
8
10
12
14
16
S4,?
a4
S?,4
Problems and Subproblems
5.2 - Activity-Selection Problem
- Here's the problem, what is the value of i and j for Si,j that denotes all the activities before or after ak,
- or more simply: how do you denote the set of all activities?
- we have 11 activities, so S1,11 denotes the set of activities that starts after activity 1 finishes and finishes before activity 11 starts
- solution: make a couple of dummy/phantom activities:
- a0, finishes before a1 starts
- a12, starts after a11 finishes
- so
- S0,k denotes all the activities the finishes before ak starts (but after this phantom activity finishes)
- Sk,12 denotes all the activities that starts after ak finishes (but before this phantom activity starts)
- S0,12 denotes the set of all activities
- or more simply: how do you denote the set of all activities?
Problems and Subproblems
5.2 - Activity-Selection Problem
0
2
4
6
8
10
12
14
16
S4,12
a4
S0,4
- Make sure you understand the notation, it may be a little unusual, e.g. S0,2 is NOT {a1}, it's actually empty (because nothing finishes before a2 starts)
- S0,1 and S11,12 are empty sets
Problems and Subproblems
5.2 - Activity-Selection Problem
select(S0,12)
select(S0,1)+select(S1,12)
select(S0,11)+select(S11,12)
select(S0,2)+select(S2,12)
select(S0,10)+select(S10,12)
pick a1
…
- The search tree can be written more nicely now:
pick a2
pick a10
pick a11
Problems and Subproblems
5.2 - Activity-Selection Problem
- Of course you can generalise this:
- if we start with Si,j, then if we pick ak, we have the subproblems Si,k and Sk,j
select(Si,j)
select(Si,i+1)+select(Si+1,j)
select(Si,j−1)+select(Sj−1,j)
select(Si,i+2)+select(Si+2,j)
select(Si,k)+select(Sk,j)
pick ai+1
pick aj−1
pick ai+2
…
…
pick ak
Problems and Subproblems
5.2 - Activity-Selection Problem
- Finally, let Ai,j be the optimal solution select(Si,j), that is, Ai,j is the maximum set of compatible activities in Si,j
0
2
4
6
8
10
12
14
16
S2,11
a4
a8
A2,11
Optimal Substructure
5.2 - Activity-Selection Problem
- Let Ai,j be the optimal solution for Si,j and suppose that ak∈Ai,j
- we picked ak, so we have the solutions for the subproblems Si,k and Sk,j
- let Ai,k=Ai,j∩Si,k and Ak,j=Ai,j∩Sk,j be the solutions to these subproblems
select(Si,j)
select(Si,k)
select(Sk,j)
optimal solution: Ai,j
optimal solution: Ai,k
optimal solution: Ak,j
- i.e. Ai,j=Ai,k∪{ak}∪Ak,j and the optimal solution is ∣Ai,j∣=∣Ai,k∣+1+ ∣Ak,j∣
Optimal Substructure
5.2 - Activity-Selection Problem
- Suppose there is a better solution for the subproblem Si,k, say Ai,k′
-
This means that ∣Ai,k′∣>∣Ai,k∣, so ∣Ai,k′∣+1+ ∣Ak,j∣ >∣Ai,j∣
- this contradicts the assumption that Ai,j is the optimal solution to the problem
- We can use a symmetric argument for the solution to the subproblem Sk,j
select(Si,j)
select(Si,k)
select(Sk,j)
optimal solution: Ai,j
solution: Ai,k
optimal solution: Ak,j
optimal solution: Ai,k′
Recursive Relation
5.2 - Activity-Selection Problem
select(Si,j)
select(Si,i+1)+select(Si+1,j)
select(Si,j−1)+select(Sj−1,j)
select(Si,i+2)+select(Si+2,j)
select(Si,k)+select(Sk,j)
pick ai+1
pick aj−1
pick ai+2
…
…
pick ak
∣Ai,j∣=⎩⎨⎧ak∈Si,jmax(∣Ai,k∣+1+∣Ak,j∣) 0 if Si,j is not emptyif Si,j is empty
Greedy Choice
5.2 - Activity-Selection Problem
- What is the greedy choice?
- we want to maximise the number of activities
- which activity, if chosen, leaves the as much resource as possible so we can fit in more activities?
Greedy Choice
5.2 - Activity-Selection Problem
0
2
4
6
8
10
12
14
16
- Pick the one that finishes earliest
- Remember that the activities are sorted according to which one finishes first, so a1 will finish before a2
Greedy Choice
5.2 - Activity-Selection Problem
0
2
4
6
8
10
12
14
16
- Alternatively, pick the one that starts latest
Greedy Choice
5.2 - Activity-Selection Problem
- In dynamic programming, the hard part is in working out what the subproblems are, and then defining the recursive relationship
- overlapping subproblems are usually easy to spot
- proof for optimal substructure are all very similar
- In greedy algorithm, the hard part is in working out what is the greedy choice
- can you think of a bad greedy choice for the activity selection problem?
- we mentioned this at the start
Greedy Choice
5.2 - Activity-Selection Problem
- Remember the steps (this is the first approach)
- Show that there is an optimal substructure
- Show the recursive relation that gives optimal solution
- Show that if we make the greedy choice, only one subproblem remains
- Show that it is safe to make the greedy choice
- Find the value of the optimal solution (run the algorithm)
- Develop the recursive solution to an iterative one
Greedy Choice
5.2 - Activity-Selection Problem
- Alternatively:
- Show that it is safe to make a greedy choice such that our problem only has one subproblem to solve
- Show that the problem has an optimal substructure after we make the greedy choice
- Find the value of the optimal solution (run the algorithm)
- Develop the recursive solution to an iterative one
Greedy Choice
5.2 - Activity-Selection Problem
- Show that if we make the greedy choice then only one subproblem remains:
- if we pick activity ak, then we need to find the solutions to the subproblem Si,k and Sk,j
- the greedy choice is aℓ, the activity in Si,k that will finish first
- Si,i+1=∅
- therefore there is only one subproblem to solve
Greedy Choice
5.2 - Activity-Selection Problem
- Intuition:
- pick the activity that finishes first, i.e. a1
- recall that the activities are sorted according to finish time, in increasing order
- suppose a1 is not in the optimal solution
- there must be another activity, say ak that is the first activity in the optimal solution
- ak finishes later than a1
- so we can substitute a1 in for ak and still get the optimal number of activities
- pick the activity that finishes first, i.e. a1
Greedy Choice
5.2 - Activity-Selection Problem
- Theorem:
- If am is an activity in Sk,j with the earliest finish time, then am is in a maximum-size subset of compatible activities of Sk,j (i.e. am∈Ak,j)
- Proof:
- let aℓ be the activity in Ak,j with the earliest finish time.
- if aℓ=am, then we are done
- if aℓ=am, then replace aℓ with am
- i.e. let Ak,j′=Ak,j−{aℓ}+{am}
- Since the activities in Ak,j are compatible, and aℓ is the first activity in Ak,j to finish and fm≤fℓ,
- then it follows that the activities in Ak,j′ are also compatible
- Since ∣Ak,j′∣=∣Ak,j∣
- it follows that Ak,j′ is also a maximum-size subset of compatible activities of Sk,j
Recursive Solution
5.2 - Activity-Selection Problem
// int s[] is the array containing start times // int f[] is the array containing finish times // remember that the arrays are sorted by finishing time // so f[m] < f[m+1], i.e. activity m finishes before activity m+1 // calling select(k,n), select only activities that starts after k finishes // activity n is the phantom activity HashSet<Integer> select(int k, int n){}{ int m = k+1; // find the next activity that starts after activity k finishes while (m <= n && s[m] < f[k]) m++: // add the activity to the set of solution if (m <= n){ HashSet<Integer> ans = new HashSet<>(select(m,n)); ans.add(m); return ans; } }
Iterative Solution
5.2 - Activity-Selection Problem
// int s[] is the array containing start times // int f[] is the array containing finish times HashSet<Integer> select(){ int n = s.length; HashSet<Integer> answer = new HashSet<Integer>(); answer.add(1); k = 1; for(int m = 2; m < n; m++){ if (s[m] >= f[k]){ answer.add(m); k = m; } } return answer; }
COMP3010 - 5.2 - Greedy Algorithm - Activity Selection
By Daniel Sutantyo
COMP3010 - 5.2 - Greedy Algorithm - Activity Selection
Worked example for activity selection problem
- 269