7th ed. chapter 06

(Latest Revision: Sep 27, 2007 )

Chapter Six -- Process Synchronization -- Lecture Notes

6.0 Objectives
- Introduce the Critical Section (CS) Problem
- Present both hardware and software solutions to the CS problem
- Concept of atomic transaction and mechanisms to insure atomicity.

6.1 Background

Time = 1	producer: register₁ = counter	register₁ == 5
Time = 2	producer: register₁ = register₁ + 1	register₁ == 6
Time = 3	consumer: register₂ = counter	register₂ == 5
Time = 4	consumer: register₂ = register₂ - 1	register₂ == 4
Time = 5	producer: counter = register₁	counter == 6
Time = 6	consumer: counter = register₂	counter == 4

The example in section 6.1 shows that when two processes attempt to write to the same location in memory concurrently the outcome can be incorrect, even if each individual program is "error free."

Time = 1 producer: register₁ = counter register₁ == 5 Time = 2 producer: register₁ = register₁ + 1 register₁ == 6 Time = 3 consumer: register₂ = counter register₂ == 5 Time = 4 consumer: register₂ = register₂ - 1 register₂ == 4 Time = 5 producer: counter = register₁ counter == 6 Time = 6 consumer: counter = register₂ counter == 4

The example reminds us of these two essential facts:
1. In the typical system an interrupt can arrive during the execution of any instruction, and
2. After servicing an interrupt the OS is free to give the CPU to any ready process.
As a result there is potential for corruption of shared data due to the interleaved execution of instructions of different processes. (In the table above we see the instructions of the producer and consumer interleaved.)
The example of the shared counter is not very significant by itself. However the same kind of problem must be solved repeatedly by operating systems programmers: "... different parts of the system manipulate [shared] resources and we want the changes not to interfere with one another."
Often programmers take an approach to the problem like this: write each individual process so it is "correct" by itself, and synchronize the processes so that each one operates with exclusive access to the data at the "critical" times when a potential for error exists.

6.2 The Critical Section Problem
- Suppose that a set {P₁, P₂, ..., P_n} of two or more processes share a datum D.
  
  Let the text sections (programs) of {P₁, P₂, ... , P_n} be {T₁, T₂, ... , T_n}
- A programmer can designate certain sections of the T_i to be critical sections. The property that defines a critical section is this: "when one process is executing in its critical section, no other process is allowed to execute concurrently in its critical section."
- A typical example would be to find sections of the T_i that write to D, and designate those sections of code as critical sections. (for example, the producer instruction that writes to the shared counter). If process X attempts to read D while process Y is writing to it, that can cause errors too. D could be in a transitional, inconsistent state. Therefore if any process might write D, normally all sections of code that access D (either to read or write) have to be designated as critical sections.
- You have to keep in mind that the need for critical sections stems from the nature of the architecture of the computer. For example, it may conceivably be safe for a process to read a simple shared variable like a character or integer, even though another process might try to write the variable concurrently. If the hardware implements the operations used on the datum so that processes get exclusive access, then the operations are safe. In that case, they don't need to be protected with critical sections.
- When we protect a datum D with critical sections, in all the T_i we place entry code before each critical section and exit code after each critical section.
- A process executes the entry code to acquire the exclusive right to execute a critical section of code.
- A process executes the exit code to release the exclusive right to execute a critical section of code.
- So for example we would change the code of the producer and consumer like this:
```
--------------------------------
(producer code)

while (1)
{
  nextProduced=makeItem();
   /* while buffer full */
  while (counter==BUFFER_SIZE) 
    ; /* do nothing */
  buffer[in]=nextProduced;
  in=(in+1)%BUFFER_SIZE;  
  ENTRY SECTION OF PRODUCER CODE;
  counter++; 
  EXIT SECTION OF PRODUCER CODE;
}

--------------------------------
(consumer code)

while (1)
{
    /* while buffer empty */
  while (counter==0) 
    ; /* do nothing */
  nextConsumed=buffer[out];
  out=(out+1)%BUFFER_SIZE; 
  ENTRY SECTION OF CONSUMER CODE;
  counter--; 
  EXIT SECTION OF CONSUMER CODE;
  consume(nextConsumed);
}
```
- If the entry and exit code operate correctly then the producer cannot begin to change the counter while the consumer is in the act of changing the counter, and vice-versa. It is impossible for these portions of the producer and consumer to be interleaved, and so it is impossible for the counter to receive an incorrect value.
- The "trick" is to design the entry and exit code so that it works the way it's supposed to work.
- We want the entry and exit code to enforce exclusive access, but that's not really enough. The protocol implemented by the entry and exit code should be "fair" to the set of processes, and should allow them to operate as efficiently as possible.
- The entry and exit code we create are called a solution to the critical section problem. A solution to the critical section problem must satisfy these requirements:
  1. Mutual Exclusion: If process Pi is executing in its critical section, then no other processes can be executing [concurrently] in their critical sections.
  2. Progress: If no process is executing in its critical section and some processes wish to enter their critical sections, then only those processes that are not executing in their remainder section [in other words, only processes executing entry code or exit code] can participate in the decision on which process will enter its critical section next, and this selection cannot be postponed indefinitely.
  3. Bounded Waiting: There exists a bound (a priori) on the number of times that other processes are allowed to enter their critical sections after a process has made a request to enter its critical section and before that request is granted. [In other words, individual processes do not experience indefinite postponement waiting to enter their critical sections.]
- Historical Note: Edsger Dijkstra did much of the early work on the critical section problem. Dijkstra died in 2002. Dijkstra made many important contributions to computer science. You can read about Dijkstra here:
It is important to be aware that kernel code in many operating systems can and does employ special, privileged techniques for maintaining exclusive access to data. An OS can mask interrupts and refuse to relinquish its CPU while executing critical code. On a uniprocessor this would allow the OS to have the assurance of exclusive access to data. However doing this also might make the system unresponsive and/or prevent real-time processes from meeting deadlines. Besides, the technique does not provide a full and general solution to the Critical Section Problem for arbitrary sets of processes.

6.3 Peterson's Solution

Let's start out with a simplification: there are two processes P0 and P1.
The computer we are using is capable of performing atomic reads and writes to a simple integer or Boolean variable.

Algorithm 1: What if we try to solve the problem this way?

(We use an infinite loop construct - think of it as a way to perform many trials of the synchronization code, or imagine that the processes are long-running servers that have to perform some particular set of actions over and over as long as the system is up.)


[additional] shared variable: turn = 0; // Initialized to zero 
--------------------------------
(P0 code)

do
{
  while (turn != 0) //  Atomically test 'turn' 
     no-op; 
  critical section of P0 ; 
  turn = 1 ;   // Atomically write 'turn'  
  remainder section of P0
} while (1) ;

--------------------------------
(P1 code)

do
{
  while (turn != 1) //  Atomically test 'turn' 
     no-op; 
  critical section of P1 ;
  turn = 0 ;   // Atomically write 'turn'  
  remainder section of P1
} while (1) ;

Algorithm 1 satisfies the mutual exclusion and bounded waiting requirements, but violates the progress requirement.

Algorithm 2: What if we try to solve the problem this way?


shared boolean flag[2]={false, false};
-------------------------------
(P0 code)

do
{ 
  flag[0] = true ;
  while (flag[1]) 
    /* do nothing */ ; 
  critical section of P0 ; 
  flag[0]=false ; 
  remainder section of P0
} while (1) ;

--------------------------------
(P1 code)

do
{ 
  flag[1] = true ;
  while (flag[0]) 
    /* do nothing */ ; 
  critical section of P1 ; 
  flag[1]=false ; 
  remainder section of P1 ;
} while (1) ;

Algorithm 2 gives us mutual exclusion but not conformance with the progress requirement. If we interchange the first two lines of the code of each process, we are assured progress, but not mutual exclusion.

Algorithm 3 (Peterson's algorithm): What if we try to solve the problem this way?


shared boolean flag[2]={false, false};
shared int turn = 0 ;
--------------------------------
(P0 code)

do
{ 
  flag[0] = true ;
  turn = 1 ; 
  while (flag[1] && turn == 1) 
    /* do nothing */ ; 
  critical section of P0 ; 
  flag[0]=false ; 
  remainder section of P0
} while (1) ;

--------------------------------
(P1 code)

do
{
  flag[1] = true ;
  turn = 0  
  while (flag[0] && turn == 0) 
    /* do nothing */ ; 
  critical section of P1 ; 
  flag[1]=false ; 
  remainder section of P1 ;
} while (1) ;

Peterson's algorithm actually works.
- If the processes set their flags at about the same time, then only one of them will win the race to set the value of turn. The loser will set the value of turn that allows the winner to drop out of the inner while-loop and enter its critical section (CS).
- If P1 is in its remainder section and P0 tries to get into its CS, it will not be delayed because flag[1] will be false.
- If P1 then tries to enter its CS while P0 is in its CS then P1 will be delayed because flag[0] will be true and turn will be zero.
- Later when P0 leaves its CS it will set flag[0] to false. This will allow P1 to drop out of the inner while-loop and enter its CS.
- Under these circumstances P0 cannot enter its CS again before P1 enters its CS. If P0 moves into its entry code immediately after exiting its CS, then P0 will be delayed in its inner while-loop because flag[1] will be true and turn will be 1.
Multiple-Process Solutions
- Lamport's bakery algorithm is a correct solution to the n-process critical section problem. The bakery algorithm goes like this:
```
shared boolean choosing[n] = {false, ..., false} ;
shared int num[n] = {0, ..., 0};
--------------------------------
(Pi code -- Pi executes "Customer(i)")

void Customer (int me)
{
  do
  {  
       /* entry code */
    choosing[me]=true ;
    num[me]=1+max(num[0],...,num[n-1]);
    choosing[me]=false ;

    for (him=0;him<n;him++)
      {
        while (choosing[him])  /* do nothing */ ;
        while ( num[him]!=0 
                 && ( (num[him],him) < (num[me],me) ) 
          /* do nothing */ ;
      } 
    criticalSection(me) ; 
       /* exit code */
    num[me]=0 ; 
    remainderSection(me) ;
  } while (1) ;
}
```
- A process P executing "Customer" picks the "next highest" number. Note however that we don't guarantee that different processes always pick different numbers. (Basically we'd have to solve another critical section problem to be able to guarantee that.)
- After choosing its number, process P examines all the other processes (actually including itself too). P waits for each process Q to finish choosing. P then waits until either P's number is not higher than Q's, or until Q's number is 0. (Throughout this discussion it must be remembered that a tie between the numbers is broken by comparing the id numbers of P and Q -- low number wins.)
- After P does all that waiting, it is P's turn, and P enters its critical section.
- P then sets its number to 0 to indicate it is no longer inside its critical section (and also to indicate it is not currently interested in re-entering its critical section). This allows the process with the next highest number (if any) to enter its critical section.
- If a process is executing in its critical section and other processes come along later and try to enter their critical sections then they will be delayed -- all those processes that came along later will have higher numbers than the process executing its critical section.
- If no processes are executing in their critical sections and two or more processes then attempt to enter their critical sections concurrently, the "lowest" process will be allowed to enter its critical section and the rest will be "higher" and they will be delayed in one of the while-loops within the for-loop.
- The bakery algorithm works, but you should try to think of ways that it might fail, because it will help you to understand it better. Many people have had a difficult time coming to a complete understanding of such algorithms -- including the people who designed them!
- As an example, think about the algorithm that would result if we removed this line:
```
while (choosing[him])  /* do nothing */ ;
```
  If we removed the line, would we get another algorithm that is simpler, but just as correct? It may not be obvious, but with that change the algorithm would no longer satisfy the mutual exclusion requirement. Think about what could happen if P0 and P1 both choose numbers concurrently, and then if P1 started executing very quickly relative to P0.

6.4 Synchronization Hardware

Peterson's algorithm and Lamport's bakery algorithm work correctly. They are considered "software solutions" because the only dependency they have on the hardware is that simple load and store instructions are atomic. However, they have drawbacks. These algorithms are rather complex and difficult to understand. Besides, they do wasteful "busy waiting."

Let's try to develop something that has fewer disadvantages. One thing to notice is that we can get simpler solution to the critical section problem if we have a "fancy" instruction implemented by the computer's hardware. Our text gives the examples of what can be done with an atomic test-and-set instruction or an atomic swap instruction.

DEFINITION OF THE TestAndSet INSTRUCTION
This must be implemented as an atomic operation.


boolean TestAndSet(boolean &target)
{
  boolean rv=target; /* make copy */
  target=true; /* set */
  return rv ; /* return copy */
}

Code as simple as this:

---------------------
shared boolean locked=false;
---------------------
do
{
  while (TestAndSet(locked)) 
    /* do nothing */ ;
  criticalSection(me) ;
  locked=false ;
  remainderSection(me) ;
} while(1) ;
---------------------

implements mutual exclusion for a a set of n processes. (Each process P_i executes the code above with me == i.)

We can satisfy all requirements for a solution to the critical section problem with code like this:

---------------------
shared boolean waiting[n]={false,...,false};
shared boolean locked=false;
---------------------

void SolveCS(int me)
{
  local boolean wasLocked ;
  local int you;
  do
  {
    waiting[me]=true;
    wasLocked=true;
    while( waiting[me] && wasLocked )
      wasLocked = TestAndSet(locked) ;
    waiting[me]=false;

    criticalSection(me) ;

    you=(me+1)%n ;
    while ( (you!=me) && (!waiting[you]) )
       you = (you+1)%n;
    if   (you==me) locked=false
    else waiting[you]=false ;
    
    remainderSection(me) ;
    
  } while(1) ;
}
---------------------

The code above is not shorter than the bakery algorithm, but it is somewhat easier to understand.

Unfortunately, it is not easy to implement TestAndSet (or AtomicSwap) on a multiprocessor architecture.
Another problem with the solutions examined in this section: busy waiting.

6.5 Semaphores

A semaphore is a variable that supports two simple operations: wait() and signal().
Semaphores are a convenience for the programmer of critical sections. One just employs wait() and signal() as entry and exit code, instead of the complex instructions we saw above in Peterson's solution, Lamport's Bakery Algorithm, or even the code used with the TestAndSet() instruction.
Of course, the system designers have to implement wait() and signal() with the requisite atomicity properties, which may require use of, for example, the Bakery Algorithm in the implementation code.
To avoid excessive busy-waiting is an important design goal.
With a queuing semaphore we can create a simple solution to a critical section problem - one that does not require busy waiting.

The queuing semaphore is a special data structure. The data part consists of an integer value and a list. It might be represented this way:
```
typedef struct
{
  int value ;
  struct process *L ;
} semaphore ;
```
The semaphore data structure requires two operations, wait() and signal(), which must be implemented atomically. The following pseudo code describes what the operations do, but does not give any clue about how to implement the operations atomically:
```
void wait(semaphore S)
{
  S.value--;
  if (S.value<0)
  {
    add this process to S.L;
    block() ;
  }
}

void signal(semaphore S)
{
  S.value++;
  if (S.value<=0)
  {
    remove a process P from S.L;
    wakeup(P);
  }
}
```
One can implement block() and wakeup(P) as system calls. A call to block() would put the calling process to sleep. The OS would get control of the CPU and put the process that called block() into a special sleep queue. The sleep queue is a data structure that the OS maintains. A process is not runnable while in the sleep queue. A call to wakeup(P) would cause the OS to get control of the CPU and to remove P from the sleep queue and return it to the ready queue.

On a uniprocessor, one can implement wait() and signal() (without any busy waiting) as system calls. The OS can guarantee the atomic execution of wait() and signal() if it does two things while executing the code of wait() or signal():
- refuse to relinquish the CPU, and
- mask interrupts
Under those circumstances nothing can "sneak in" and run in the CPU until after the wait() or signal() has completed.

Unfortunately the method described above is hard to generalize to a multiprocessor platform. We would have to guarantee that no code running on any of the other CPU's would do anything to "conflict" with the critical section of code running the wait() or the signal().

However on a multiprocessor, we could implement wait() and signal() using one of the software solutions we examined earlier in the chapter. For example, wait() could be implemented like this:
```
typedef struct
{
  boolean waiting[n] ;
  boolean lock ;
  int value ;
  struct process *L ;
} semaphore ;

void wait(semaphore S, int me)
{
  boolean willBlock=false ;
  int wasLocked ;

     /* Entry Code for making wait() atomic */
  S.waiting[me]=true;
  wasLocked=true;
  while(S.waiting[me] && wasLocked) wasLocked=TestAndSet(S.lock);
  S.waiting[me]=false;

  S.value--;
  if (S.value<0)
  {
    add this process to S.L;
    willBlock=true ;
  }

     /* Exit Code for making wait() atomic */
  int you=(me+1)%n ;
  while ( (you != me) && !S.waiting[you] ) you=(you+1)%n ;
  if (you==me) 
  {
    if (willBlock) block(S.lock,false); 
    else S.lock=false ;
  }
  else 
  {
    if (willBlock) block(S.waiting[you],false); 
    else S.waiting[you]=false ;
  }
}
```
Note that the code above employs a modified version of the block() system call. The meaning of block(x,v) is "block the process making this call and then set the variable x equal to the value v."

Why do we have to change the form of block()?

Basically it is due to a problem that comes up if a process P executing a wait() needs to block. In that case P needs to block and perform the exit code. Unfortunately no matter what order P tries to perform these actions, it will do something wrong.

If P blocks it can't do anything next, so it can't execute the exit code. Consider that if P does not set one of the flags to false -- S.lock or S.waiting[you] -- then none of the other processes using the semaphore will be able to perform a signal() or a wait(). All progress of the group of processes will stop. In particular, no process will ever wake P up.

On the other hand it is not acceptable for P to set one of the flags to false first and then block. The problem is that another process Q might execute a signal() and a wakeup(P) before P is able to block.

Therefore, depending on exactly how wakeup() works on the system, P could "miss" its wakeup. P might wake up later when some other process executes a signal(), or it might never wake up. Either way, a lost wakeup can cause processes to malfunction.

The solution we employ here is to take the responsibility away from the process P and place it with the OS. The OS sets the flag to false after blocking P.

Note that the solution we posed for the multiprocessor does require some busy waiting. However generally the amount of time spent doing this busy waiting will be negligible. There are only a few instructions involved in the wait and signal code, and processes do their busy waiting only when waiting to perform those short sequences of instructions.

Contrast that with the case of such code as that below. Here some of the critical sections could be very long. There is the potential, for example, that one process will executes a very long time in its critical section and that several other processes busy wait the whole time.
```
void SolveCS(int me)
{
  local boolean wasLocked ;
  local int you;
  do
  {
    waiting[me]=true;
    wasLocked=true;
    while( waiting[me] && wasLocked )
      wasLocked = TestAndSet(lock) ;
    waiting[me]=false;

    criticalSection(me) ; /* could be very long */

    you=(me+1)%n ;
    while ( (you!=me) && (!waiting[you]) )
       you = (you+1)%n;
    if   (you==me) lock=false
    else waiting[you]=false ;
    
    remainderSection(me) ;
    
  } while(1) ;
}
```
In the version of the code below, implementing the wait and signal as described above for the multiprocessor case, the processes are blocked most of the time while waiting to enter their critical section. They only do busy waiting for a brief time while executing wait() and signal().

As a result there is no significant busy waiting in this solution.
```
---------------------
shared semaphore mutex ;
---------------------
void SolveCS(int me)
{
  do
  {
    wait (mutex) ;
    criticalSection(me) ; /* could be very long */
    signal (mutex) ;
    remainderSection(me) ;
  } while(1) ;
}
```

Deadlocks and Starvation


---------------------

  /* Declare two shared variables and separate semaphores 
     to protect each variable. */

shared int s_count, int q_count ;
shared semaphore S, Q ;
---------------------
P0:
wait(S)
wait(Q)
 /* P0 accesses s_count and q_count */
  ...

signal(Q)
signal(S)
---------------------
P1
wait(Q)
wait(S)
  /* P1 accesses s_count and q_count */
  ...

signal(S)
signal(Q)
---------------------

The code above can lead to deadlock -- a situation wherein each process waits for an event that will never happen.

6.6 Classic Problems of Synchronization

We can solve the bounded buffer problem by encapsulating the functionality of the counter in semaphores, as illustrated by the code below.


---------------------
#define BUFFER_SIZE 10
typedef struct 
{ 
   /* declare desired fields for the buffer item type */
} item ;
int in=0,                  /* in == next position to add an item */
out=0;                     /* out == next position to remove an item */
shared semaphore full(0),  /* full.value == the number of full buffers */
empty(BUFFER_SIZE) ;       /* empty.value == the number of empty buffers */
---------------------
Producer's Code
do
{
   /* produce an item in nextp */
   wait (empty) ;
   buffer[in]= nextp ;
   in = (in + 1) mod BUFFER_SIZE ;
   signal(full) ;
} while (TRUE) ;
---------------------
Consumer's Code
do
{
   wait (full) ;
   nextc = buffer[out] ;
   out = (out + 1) mod BUFFER_SIZE ;
   signal(empty) ;
   /* consume nextc */
} while (TRUE) ;
---------------------

The Readers-Writers Problem: Several processes share access to a file. Some of the processes (readers) never do anything but read the file. Others (writers) may perform writes on the file when they access it. The problem is to synchronize the processes so that readers can share the file concurrently but writers get exclusive access. There are different versions of the problem, for example:
- The first readers-writers problem: no reader will be kept waiting unless a writer has already obtained permission to access the file.
- The second readers-writers problem: once a writer is ready, that writer performs its access as soon as possible.
A solution to either of the two problems above may result in starvation.

A compromise might be a protocol that:
- allows all processes to be served in FIFO order, and
- allows groups of readers between writers to access the file concurrently.
Many operating systems now make reader-writer locks available to system programmers. Processes that want only to read from a file can acquire a (shared) read-lock. Processes that want to write request an exclusive writer's lock.
The Dining Philosopher's Problem: One way to understand this problem is to think of five processes and five disk drives arranged in a circle, with the drives and the processes alternating. Each process occasionally needs to get exclusive access to the disk drives on its 'left' and 'right' in order to transfer some data from one to the other. The problem is to synchronize the processes so they get the access they need without experiencing indefinite postponement. The protocol has to resolve competition between pairs of processes that have to share the disk drive between them.

The example protocol proposed in this section allows deadlock to occur, an extreme form of starvation.

There are several ways to solve the problem. Resource ordering provides a very simple solution. A satisfactory solution to the problem would
1. make sure only one process has possession of any particular disk drive at any one time.
2. allow only processes that are vying for disk drive D or are in possession of D to participate in the decision as to who gets possession of D next, and make sure this decision is not postponed indefinitely.
3. make sure there exists a bound on the number of times that other processes are allowed to make a data transfer after one process P has made a request to begin a transfer and before P is allowed to perform the transfer.

6.7 Monitors

One may solve critical section problems, and many other types of synchronization problems, simply and easily through the use of semaphores. However programmers can make errors even when performing the simplest coding tasks.
Through the use of monitors the programmer can get a compiler's help in preventing coding errors.
Students who have used an object-oriented language will see that monitors are basically objects that are rigged so that only one process at a time can access the object. (The monitor concept dates back to the work of Brinch-Hansen in 1973. The idea of object-oriented computing goes back as far as 1967: [Simula 67, 1967, Dahl & Nygaard]).
The programmer writes code for the monitor. The programmer also writes the "client code" that uses the monitor. The compiler takes care of, in effect, generating the entry code and exit code that guards the monitor so that only one process at a time can execute monitor code.
We can examine how the authors of our text employ a monitor and some client code to create a partial solution to the dining philosophers problem. (Mutual exclusion is assured, and deadlock is impossible, but starvation can occur. What is deadlock? What is starvation? What, if any, is the relation between the two?)
Deadlock can't happen in the example protocol because there is no "hold and wait."
However, starvation can occur. In order to eat, a philosopher P has to wait for this event: E = [both neighbors of P are not eating]. There is no built-in, up-front limit to how many times a philosopher different from P will eat before E happens. In other words, P can be postponed indefinitely.

6.8 Synchronization Examples

Synchronization in Solaris 2

For synchronizing concurrent processes, Solaris 2 provides these tools:
- adaptive mutexes
  - used to protect short code segments accessing shared data
  - will busy wait if the thread holding the lock is executing
  - will block otherwise
  - on a uni-processor the only option will be to block
- condition variables and semaphores -- used to protect long code segments
- reader-writer locks
  - protect long sections of code accessing data that is read frequently but not written frequently
  - allows multiple concurrent reads
  - relatively expensive to implement
- turnstiles
  - a queue for threads blocked on a lock
  - design twist:
    - synchronization objects don't have turnstiles
    - threads "carry around" a turnstile
    - first thread that has to block on a lock "donates" its turnstile for the queuing on that lock. When the thread is awakened it will get another turnstile from a pool maintained by the OS.
  - organized so that when a high priority thread H blocks on a lock held by a low priority thread L, L "inherits" the priority of H until L releases the lock. This helps H get the lock as soon as possible.
All the synchronization primitives listed above are available to user level threads. However, priority inheritance is implemented only for kernel level threads.
Synchronization in Windows XP
- The Windows XP kernel is multithreaded. It supports real-time applications and multiple processors.
- On a uni-processor it relies to some extent on masking interrupts to insure exclusive access to global shared data.
- On a multiprocessor, it uses spinlocks to guard short sections of code accessing global resources. The OS ensures that a thread holding a spinlock will not be preempted.
- Threads outside the kernel can use dispatcher objects to synchronize.
  - synchronization can take the form of mutex, semaphore, event, or timer.
  - Events are used much like conditions -- to wait for a desired condition to occur.
  - Timers are to notify thread(s) that a specific amount of time has expired.
Synchronization in Linux
- The Linux kernel was nonpreemptive through much of its development (like many versions of unix) but since version 2.6 the kernel is said to be preemptive.
- On uni-processors, however, Linux enforces exclusive access by temporarily disabling preemption of the kernel.
- Linux has spinlocks for waits of short duration, and semaphores for longer waits.
- Reader-writer versions of both spinlocks and semaphores are available.
Synchronization in Pthreads
- The Pthreads API offers mutex locks, condition variables, and read-write locks
- Semaphores and spinlocks are often available as extensions to the standard package.

6.9 Atomic Transactions

Definitions

A Transaction is:

A collection of instructions that perform a single logical function

"We can think of a transaction as a program unit that accesses and perhaps updates various data items that reside on a disk within some files. From our point of view, such a transaction is simply a sequence of read and write operations terminated by either a commit operation or an abort operation."

An Atomic Transaction is:

performed in its entirety or not at all.
The solutions to synchronization problems we have worked out so far do not take into account the possibility that the process may fail spontaneously while executing in its critical section -- for example due to a system crash or hardware failure.
Log-Based Recovery
- How can we insure that critical sections are executed atomically when there is non-zero probability that processes will abort without warning?
  - With write ahead logging and stable storage we can operate with very high confidence that "all transactions will be atomic" -- e.g. all critical sections will be executed atomically.
  - For each write operation to a data field the system logs, on stable storage, the unique ID of the operation, unique ID of the data field, old value of the data field, and new value of the data field to be written.
  - A process writes the log entry before attempting the actual change to the data.
  - In effect the log entry is of the form "Transaction T intends to change cell C from value X to value Y."
  - A process writes a <T_i starts> entry to the log when it first enters the code of the transaction.
  - It writes a <T_i commits> record after it has completed the transaction successfully.
  - If a transaction T_i aborts and there is a <T_i starts> entry in the log with no matching <T_i commits> record then in a recovery phase the OS can undo what T_i did after T_i wrote the <T_i starts> record. The OS just looks at the log and changes all the values of the data fields back to what they were. (The OS has to be careful in the manner it does this because another failure could occur in the midst of the recovery.)
  - When the OS undoes an aborted transaction, that preserves atomicity.
  - Similarly a transaction that fails after writing a <T_i commits> may be redone after the failure.
Checkpoints
- A process may flush all log records and data changes to stable storage, and then write a <checkpoint> record to the log.
- The <checkpoint> can speed recovery because the OS knows that if a <T_i commits> appears before a <checkpoint>, the OS does not need to redo T_i.
Serializability
- An operation O_i of transaction T_i conflicts with operation O_j of T_j if both O_i and O_j access the same data item and at least one of the two does a write to the item.
- If one schedule can be transformed into another just by swapping operations that don't conflict, then the two schedules are equivalent -- they have the same effect on the data.
Locking Protocol
- When we want to insure atomicity of some transaction, we may not have to treat the whole section as a single critical section to be protected by a single lock or semaphore.
- It is enough to insure serializability -- to insure that when two transactions execute concurrently the effect on the data is the same as if one transaction was carried out completely first and then the other.
- If we use a lock for each data item and require transactions to follow a locking protocol, we can ensure serializability.
- The so-called two-phase locking protocol may be used.
  - The transaction may obtain but not release locks during the growing phase.
  - The transaction may release locks but not obtain any new locks during the shrinking phase.
  - The two-phase locking protocol ensures conflict serializability but does not ensure freedom from deadlock.
  - There are conflict-serializable schedules that cannot be obtained through two-phase locking.
Timestamp-Based Protocols
- We can give each transaction T_i a unique "timestamp" and require that when TS(T_i) < TS(T_j) the system must "ensure that the produced schedule is equivalent to a serial schedule in which T_i is executed before T_j.
- The timestamps are "birthdates" and "older" transactions are supposed to (at least in effect) "go first."
- We keep track of the youngest transaction that has done a write or read on every data item Q.
- If T_i is about to read data item Q and T_i is older than the youngest writer of Q (YW(Q)) then T_i is rolled back and it has to be run again (with a new timestamp).
- If T_i is about to read data item Q and T_i is younger than the YW(Q) then T_i performs the read and the age of T_i is compared with the youngest reader of Q (YR(Q)). If T_i is younger then the YR(Q) is updated.
- If T_i is about to write data item Q and T_i is older than YR (Q) then T_i is rolled back and it has to be run again (with a new timestamp).
- Else if T_i is about to write data item Q and T_i is older than YW(Q) then T_i is attempting to write an obsolete value and T_i is rolled back and it has to be run again (with a new timestamp).
- Else the write operation is carried out.
- Some schedules that can be produced by this timestamp-based protocol cannot be produced by two-phase locking, and vice versa.
- This timestamp-based protocol insures that conflicting operations are executed in time-stamp order. Therefore the protocol ensures conflict serializability.
- The protocol never results in a wait. Processes either go ahead with their actions or are rolled back. Therefore deadlock cannot occur.
- Since there is no guarantee that a process that has been rolled back will not be rolled back again, there is no guarantee that processes will not starve.