@yinquan – 2015 – September

Computer System Design
- Bus
  - Transfer data to certain department
  - Because the transferring speed is different, you must distinguish different bus (north bridge and south bridge)
- CPU
  - From SMP (Symmetrical Multi-Processing) to Multi-core Processor.
What problem the Computer Operating System want to solve?
- CPU
  The CPU is as the resource to Operating System. The system can allocate and control this resource. If the computer needs to do something, it will need this resource. Otherwise, the task cannot be run.
  - The Operating System handle the CPU allocation. It controls every process’s time.
  - The Operating System can force the CPU resource into the process needed. This is called Preemptive
- Device
  - The Operating System will hide the detail of hardware. The invoker only uses the simple object to operate the various hardware.
  - The Operating System classify the various hardware and unify the operation.
- Memory
  - The memory is the important resource. The process can be run only to get CPU and memory.
  - The memory usage should be considered as effectiveness, safe, hiding detail.
    Virtual Memory can implement these features. However, the memory only is expressed as the physical memory. The virtual memory must convert to the physical memory by the special way.
  - segmentation:
    
    The segmentation can solve the safe and hiding detail, but it isn’t effective. The one segmentation is for a whole process. Finding the suitable size is important for the segmentation.
  - pages:
    
    The page is that memory can be divided into the fix size page. The size of every page is decided by hardware. Or the selection provides by hardware, and the operation system can choose.
    
    As the general, the 4K is normal size for a page.
    
    The page can be defined as Virtual Page in the virtual memory, Physical Page in the physical memory and Disk Page in the Disk.
    
    If you want to share a physical page, you only let the virtual pages map to that physical page. The data in that physical page will be shared with these virtual pages.
    
    Safe
    
    A page can be set a security level only by the operating system.
    
    The MMU, the physical department, will convert the virtual memory provided by a user to the physical memory managed by the operating system.
- Thread
  - The Basic of Thread:
    - Lightweight Process. The minimum unit of the program running.
    - The thread contain, TID (thread id), PC, register, heap
    - Sharing the resource of the process, code segmentation, data segmentation and process heap, file opened, signals
    The situation used thread:
    - The things needed to wait or calculate
    - The program need to run in multithread
    - The multi-thread can show the maximum power of the multi-core process
    - Sharing the data of the process
    The private resource of the thread:
    - stack
    - Thread Local Storage
    - register
      
      Thread Private Thread Share
      
      Local Variable Global Variable
      
      Function Argument Data in Heap
      
      Static Variable in Function
      
      Code
      
      File opened
  - Thread Schedule:
    - Running: The current thread is running
    - Ready: The current thread can run, but the resource of CPU already has been used.
    - Waiting: The current thread is waiting for happening an event ( I/O or async) and can’t run
      Priority Schedule:
      
      Deciding the order of the thread. Every thread has priority itself.
      
      Round Robin:
      
      Every Thread only own the time slice of the CPU for a while. If the time slice is over, the Thread will give up running.
      - Thread Automatic Schedule:
        IO Bound Thread
        The time it takes to complete a computation is determined principally by the period spent waiting for input/output operations to be completed.
        CPU Bound Thread
        The time for it to complete a task is determined principally by the speed of the central processor: processor utilization is high, perhaps at 100% usage for many seconds or minutes. Interrupts generated by peripherals may be processed slowly, or indefinitely delayed.
        IO Bound Thread is easy to get the high priority than CPU Bound Thread
        
        Starvation
        The CPU always run the high priority thread, the low priority thread never chances to run.
        For solving starvation, the low priority thread will run as long as this thread waits for a long time.
        Preemption
        When a thread consumes its time slice, it will be forced to give up running and get into ready status.
      - Linux Thread:
        Task
        In Linux, the process and the thread all is as the task. The task can share the memory. The thread
        Process : fork + exec
        Thread : clone + exec
  - The Thread Safe:
    - Competition and Atomic
      If the operation of a system isn’t atomic, it is easy to happen mistake when applying a resource competed (like memory). When facing this situation, we must use other ways to ensure the operation is atomic.
    - Lock and Async
      
      The lock is a way that ensure the operation must be atomic.
      Before accessing the resource, the thread should acquire a lock and release this lock after accessing.
      
      Binary Semaphore
      
      The semaphores which are restricted to the values 0 and 1 ( or locked/unlocked, unavailable/available) are called binary semaphores
      semaphore is multiple. It allows the resource is occurred N times.
      
      Mutex
      
      A mutex is essentially the same thing as a binary semaphore and sometimes uses the same basic implementation. The differences between them are in how they are used. While a binary semaphore may be used as a mutex, a mutex is a more specific use-case, which allows extra guarantees:
      1. Mutexes have a concept of an owner. Only the process that locked the mutex is supposed to unlock it. If the owner is stored by the mutex this can be verified at runtime.
      2. Mutexes may provide priority inversion safety. If the mutex knows its current owner, it is possible to promote the priority of the owner whenever a higher priority task starts waiting on the mutex.
      3. Mutexes may also provide deletion safety, where the process holding the mutex cannot be accidentally deleted.
      Critical Section
      In concurrent programming, a critical section is a part of a multi-process program that may not be concurrently executed by more than one of the program’s processes/threads; in other words, it is a piece of program that requires mutual exclusion of access. Typically, the critical section accesses a shared resource (data structure or device)
      
      A critical section may consist of multiple discontiguous parts of the program’s code. For example, one part of a program might read from a file that another part wishes to modify.
      
      How critical sections are implemented varies among operating systems.
      - The simplest method is to prevent any change of processor control inside section. On uni-processor systems, this can be done by disabling interrupts on entry into the critical section, avoiding system calls that can cause a context switch while inside the section, and restoring interrupts to their previous state on exit.Any thread of execution entering any critical section anywhere in the system will, with this implementation, prevent any other thread, including an interrupt, from being granted processing time one the CPU.
      - This brute-force approach can be improved upon by using semaphore. To enter a critical section, a thread must obtain a semaphore, which it release on leaving the section. Other threads are prevented from entering the critical section at the same time as the original thread, but are free to gain control of the CPU and execute other code, including other critical sections that are protected by different semaphores.
      Read-Write Lock
      To thread reading data more time and writing data less time, the general locks don’t have efficiency.
      The read-write lock has two modes to one lock, Shared or Exclusive:
      
      Lock Status Shared Exclusive
      
      Free Success Success
      
      Share Success Wait
      
      Exclusive Wait Wait
      
      This table shows another locks’ status when a lock acquires a read-write lock.
      Condition Variable
      It’s usage is like a fence.
      1. Threads can wait one condition variable.
      2. These threads waiting can be awaked by a thread with setting this condition variable.
      Making many threads waiting an event, then all threads resume to run when event happens
    - Reentrant and Thread Safe
      The routine is called reentrant if it can be interrupted in the middle of its execution and then safely called again (“re-entered”) before its previous invocations complete execution.
      The interruption could be caused by an internal action such as a jump or call, or by an external action as a hardware interrupt or signal. Once the reentered invocation completes, the previous invocations will resume correct execution.
      The condition becoming a reentrant:
      - Not using any (local) static variable or global non-static variable
      - Not returning any (local) static variable or global non-static variable
      - Only depending the argument provided by invoker
      - Not depending any special resource lock (mutex)
      - Not invoking any routine isn’t reentrant
      The reentrant is threadsafe’s strong guarantee.
    - Excessive Optimize
      volatile
      1. Preventing the compile from temporarily storing into the register for a variable to increasing the speed and not writing back.
      2. Preventing the compile from arranging the order of the operation operating volatile variable.
      barrier
      The volatile can solve wrong order caused by compile, but it doesn’t solve that wrong order cause by CPU. The CPU can dynamically change the order of the code.
      For example, in the following code, The step creating object will have the chance to change the order.
```
volatile T* pInst = 0;
T* GetInstance()
{
  if (pInst == NULL)
  {
    lock();
    if (pInst == NULL)
       pInst = new T;
    unlock();
  }
 return pInst;
}
```
      Two if statement can invoking work load of lock descreases to smaller.
      
      Now, the barrier isn’t cross platform. Every CPU have itself implementation.
      It likes a fence prevent the mechanism changed the order in CPU working.
```
volatile T* pInst = 0;
T* GetInstance()
{
  if (pInst == NULL)
  {
    lock();
    if (!pInst)
    {
       T* temp = new T;
       barrier();
       pInst = temp;
    }
    unlock();
  }
 return pInst;
}
```
  - The Model of the multi-thread:
    In the operation system, the thread in the kernel is different the thread in the user space.
    For that, the thread in the kernel has three kinds of relation with the thread in the user space.
    - One Vs. One
      One thread in the kernel is associated with one thread in the user space.
      Disadvantage:
      - The number of kernel thread has limitation, so the user space thread also has limitation
      - The changing context between kernel threads has more cost.
    - One Vs. Multiple
      One thread in the kernel is associated with multiple threads in the user space.
      This situation decreases the cost switched the context between kernel threads.
      Disadvantage:
      - If the thread in the user space is blocked, it will block all user space threads in that kernel thread.
      - This situation doesn’t outstanding increase the performance.
    - Multiple Vs. Multiple
      Multiple threads in the user space are associated less multiple threads in the kernel
      This can solve two below situation’s problem. But It’s performance isn’t better than One Vs. One

Thread Private	Thread Share
Local Variable	Global Variable
Function Argument	Data in Heap
	Static Variable in Function
	Code
	File opened

Lock Status	Shared	Exclusive
Free	`Success`	`Success`
Share	`Success`	`Wait`
Exclusive	`Wait`	`Wait`

The Computer Basic Constructure