A Target Problem: RCPSP

The Resource Constrained Project Scheduling Problem:

We have a set of $n$ activities
Each activity $i$ has fixed duration $d_i$
Activities are connected by end-to-start precedence relations
There are $m$ resources
Each resource $k$ has fixed capacity $c_k$
Each activity requires an amount $r_{i,k}$ of resource $k$
requires = $r_{i,k}$ resource units are locked while the activity runs

Let's see a sample instance...

A Target Problem: RCPSP

The network of activities/precedences is called Project Graph

Typically: fake start/end activities
Fake = 0 duration, 0 requirements
They can be disregarded in CP models

Goal:

Build a schedule
- Assign a start time to all activities
- Satisfy all constraints
Minimize the project completion time (makespan)

A Target Problem: RCPSP

Many practical applications:

Large scale construction projects
Research/development projects
Production planning
Parallel software optimization
Code optimization (compile time optimization)
...

Can we tackle this problem using CP?

A CP Model for the RCPSP

Which variables (i.e. how to model decisions)?

Natural approach: a start time variable for each activity

$s_i \in \{0..eoh\}$

$eoh$ is a safe "End Of Horizon":

$eoh = \sum_{i=0}^{n-1} d_i$

There is always a schedule with makespan $\leq eoh$
Unless the resource constraints are trivially infeasible

A CP Model for the RCPSP

How to model the problem objective?

Makespan = project completion time = largest end time:

$\min z = \max_{i = 0..n-1} (s_i + d_i)$

How to model the precedence constraints?

If there is a precedence between activities $i$ and $j$ :

$s_i + d_i \leq s_j$

A CP Model for the RCPSP

How to model the resource constraints?

If $c_k = 1$ , then activities should not overlap

Formally, for each pair of activities $i,j$ s.t. $r_{i,k} = r_{j,k} = 1$ :

$(s_i + d_i \leq s_j) \vee (s_j + d_j \leq s_i)$

A resource with unary capacity is called "disjunctive"
We have seen this on the Job Shop Scheduling Problem

But what if $c_k > 1$ ?

A CP Model for the RCPSP

If $c_k > 1$ finding a good model is difficult

Some possibilities

A sum constraint for each time point
A sum constraint for each activity start

Both are complicated and lead to weak propagation

This is one of the reason why MILP is not good for the RCPSP
A notable exception: the approach works for SAT based solvers

Is there an alternative? We can use a global constraint!

Constraint Systems

Constraint Based Scheduling:
The ${\rm\scriptsize CUMULATIVE}$ Constraint

The ${\rm\scriptsize CUMULATIVE}$ Constraint

We can use a new global constraint!

Basic idea: one global constraint for each resource

${\rm\scriptsize CUMULATIVE}(s, d, r, c)$

$s$ is a vector of start time variables $s_i$
$d$ is a vector of durations $d_i$
$r$ is a vector of requirements $r_i$
$c$ is the capacity

The durations and the requirements can be either scalars of variables

The ${\rm\scriptsize CUMULATIVE}$ Constraint

The cumulative constraint enforces consistency on:

$\sum_{\substack{i = 0..n-1,\\ s_i \leq t < s_i + d_i}} r_i \leq c, \quad \forall t = 0..\max\{s_i + d_i\}$

In brief: the resource capacity is never exceeded

Which kind of consistency?

Feasibility checking is easy when all $s_i$ are fixed (as usual):

Check the resource usage only at the activity starts
Rationale: resource usage can increase only at the start times

Unfortunately, filtering is NP-hard!

The ${\rm\scriptsize CUMULATIVE}$ Constraint

Cumulative is an NP-hard constraint

Proof (just an idea):

If we could enforce GAC on $s_i$ ...
...Then we could solve the decision version the bin-packing problem...
...The bin-packing problem is NP-hard

Practical consequences:

All filtering algorithms are suboptimal
Typically: weak, bound-based, forms of consistency

The ${\rm\scriptsize CUMULATIVE}$ Constraint

Some filtering algorithms for ${\rm\scriptsize CUMULATIVE}$ :

Disjunctive filtering
Timetable filtering
Edge-finder
Not-first/not-last rules
Timetable edge-finding
Energetic reasoning
...

Why so many?

Filtering is always incomplete
The ${\rm\scriptsize CUMULATIVE}$ constraint is very important!

Timetable Filtering

As an example, we will describe timetable filtering

One of the weakest algorithms
But also one of the fastest ones

80% of the times, this is all you need

Key idea #1: rely on a minimum usage profile

Min. usage profile = guaranteed min. consumption per time point
Use the profile to determine bounds for the $s_i$ variables

Before presenting the algorithm we need some preliminary notions...

Timetable Filtering

Some notable time point for each activity:

Earliest Start Time: $EST_i = \underline{s}_i$

Earliest End Time: $EET_i = \underline{s}_i + d_i$

Timetable Filtering

Some notable time point for each activity:

Latest Start Time: $LST_i = \overline{s}_i$

Latest End Time: $LET_i = \overline{s}_i + d_i$

Timetable Filtering - Compulsory Parts

If we have $LST_i < EET_i$ , then:

In the interval $LST_i, EET_i$ , activity $i$ will certainly be executing
Therefore, $r_i$ units of the resource will be locked

We say that the activity has a compulsory part

Timetable Filtering - Min. Usage Profile

By aggregating all compulsory parts we get the usage profile:

For each time instant: minimum guaranteed resource usage

Key idea #2: For each activity $i$ :

Sweep the timeline (SWEEP is also the propagator name)
Search for a suitable start time
Update the domain of $s_i$ accordingly

Timetable Filtering

Timetable filtering for a single activity $a_i$

We keep a timeline cursor
The initial position of the cursor is $\underline{s}_i$

Timetable Filtering

Timetable filtering for a single activity $a_i$

We check whether there is enough capacity available
In case there is, the cursor switches to checking mode
We store the current cursor position into a variable $s^*$

Timetable Filtering

Timetable filtering for a single activity $a_i$

In checking mode, we test whether $s^*$ is a valid start time
This is true iff there is enough capacity in the interval $[s^*, s^* + d_i[$
Thus, we keep on checking until we reach $s^* + d_i$

Timetable Filtering

Timetable filtering for a single activity $a_i$

In checking mode, we move only between Latest Start Times
Rationale: compulsory parts begin only at LSTs
Hence, the resource usage can increase only at LSTs

Timetable Filtering

Timetable filtering for a single activity $a_i$

If there is not enough capacity, we switch to seeking mode
In seeking mode, we have concluded that $s^*$ is not a valid start
Hence, we search for another candidate start time

Timetable Filtering

Timetable filtering for a single activity $a_i$

In seeking mode, we move only between Earliest End Times
Rationale: compulsory parts end at EETs
Hence, the resource usage can decrease only at EETs

Timetable Filtering

Timetable filtering for a single activity $a_i$

If there is enough capacity, we switch to checking mode
We store the current cursor position in $s^*$
We start checking the interval $[s^*, s^* + d_i[$

Timetable Filtering

Timetable filtering for a single activity $a_i$

In checking mode, sweeping can proceed up to $\overline{s}_i + d_i$

Timetable Filtering

Timetable filtering for a single activity $a_i$

If at some point we reach $s^* + d_i$ while in checking mode...

...We can prune $D(s_i)$ , setting $s^*$ as the new $EST_i$

This is the case in our example

Timetable Filtering

Timetable filtering for a single activity $a_i$

If at some point we surpass $\overline{s}_i$ while in seeking mode...

...We can immediately fail

Timetable Filtering

Some final considerations:

Upper bounds on the start variables can be computed similarly
The profile can be computed in $O(n \log n)$
- Approach: sort and then scan
Sweeping has complexity $O(n)$
We need to filter $n$ activities

Overall, the algorithm has complexity $O(n^2)$

Other Forms of Cumulative Filtering

A few other propagators for ${\rm\scriptsize CUMULATIVE}$ deserve a mention:

Edge Finder

Considers pairs $(\Omega, i)$
- $\Omega =$ a set of activities
- $i =$ the activities to be filtered
Detects if activity $i$ cannot precede any activity in $\Omega$
Updates $D(s_i)$ based on that information
Complexity $O(k\, n^2)$ ( $k =$ num. distinct requirements)

A very effective approach in some cases (typically: tight time windows)

Time window $= [\underline{s}_i, \overline{s}_i]$ (in this context)

Other Forms of Cumulative Filtering

A few other propagators for ${\rm\scriptsize CUMULATIVE}$ deserve a mention:

Energetic Reasoning

Energy = required resource $\times$ time
Reason on the required energy in certain time intervals
Detect overusage $\Rightarrow$ fail
Detect potential overusage $\Rightarrow$ prune

An interesting, but seldom useful approach:

PRO: Subsumes both timetabling at edge
CON: Complexity $O(n^3)$ (too high in many cases)

Other Forms of Cumulative Filtering

A few other propagators for cumulative deserve a mention:

Timetable Edge Finding

A more modern approach
Mixes ideas from timetabling and edge finder
Stronger than Edge Finder
Complexity $O(n^2)$
- Convergence is reached in multiple iterations
Does not dominate timetabling

Constraint Systems

Constraint Based Scheduling:
Search Strategies for scheduling problems

Search Strategies for Scheduling Problems

How do we search for a solution for the RCPSP?

Several design decisions to take:

Which variable shall we pick?
Which value shall we assign?
How should we backtrack?
...

Simple things first, so we start from...

Value Selection for the RCPSP

How should we select the values for the $s_i$ variables?

The objective is to minimize the makespan
Increasing a $s_i$ (others $s_j$ untouched) cannot improve the makespan

Consequence: selecting $\underline{s}_i$ seems a good idea

This is true not only for the RCPSP:

Many scheduling problems have so-called regular cost metrics
Regular = increasing a single $s_i$ cannot improve the cost

Variable Selection for the RCPSP

How should we select the branching variable?

It's easier to reason on an example:

The fake source/sink activities have been removed

Variable Selection for the RCPSP

After propagating the precendence constraints, we get:

Notation: $[EST_i..LST_i] / [EET_i..LET_i]$

Variable Selection for the RCPSP

We now need to pick a variable for branching:

A sensible criterion: minimum $\underline{s}_i$

Variable Selection for the RCPSP

How to break ties:

Smallest deadlines, i.e. minimum $LET_i$

Variable Selection for the RCPSP

We now now schedule the selected activity at $EST_i$ :

The whole duration of the activity becomes a compulsory part