Commits · 30f6e9ddfc4823f608fe6a7b93f639900b7c25a0 · Verlässliche Systemsoftware / projects / osv

Jun 05, 2013

Avi Kivity authored 11 years ago

When a tracepoint is disabled, we want it to have no impact on running code.

This patch changes the fast path to be a single 5-byte nop instruction. When
a tracepoint is enabled, the nop is patched to a jump instruction to the
out-of-line slow path.
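
A minimal sketch of the byte-level patching described above, under stated assumptions: it works on a plain byte buffer rather than live code, and the names `nop5`, `encode_jmp` and `set_tracepoint` are hypothetical, not taken from the patch. The encodings themselves are standard x86: `0f 1f 44 00 00` is a 5-byte nop, and `e9` followed by a 32-bit little-endian displacement is `jmp rel32`, with the displacement measured from the end of the instruction.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>

// Both encodings occupy exactly five bytes, so one can replace the other in place.
constexpr std::size_t patch_size = 5;

// 5-byte x86 nop: nopl 0x0(%rax,%rax,1).
constexpr std::array<std::uint8_t, patch_size> nop5 = {0x0f, 0x1f, 0x44, 0x00, 0x00};

// Encode "jmp rel32" for an instruction located at address `site` that should
// land on `target`. The displacement is relative to the end of the instruction.
std::array<std::uint8_t, patch_size> encode_jmp(std::uint64_t site, std::uint64_t target)
{
    std::int64_t disp = static_cast<std::int64_t>(target)
                      - static_cast<std::int64_t>(site + patch_size);
    assert(disp >= INT32_MIN && disp <= INT32_MAX);  // must fit in rel32
    auto rel = static_cast<std::uint32_t>(static_cast<std::int32_t>(disp));
    return {0xe9,
            std::uint8_t(rel), std::uint8_t(rel >> 8),
            std::uint8_t(rel >> 16), std::uint8_t(rel >> 24)};
}

// Hypothetical helper: write either the nop (disabled) or the jump (enabled)
// into a buffer that stands in for the tracepoint site.
void set_tracepoint(std::uint8_t* site_bytes, std::uint64_t site,
                    std::uint64_t slow_path, bool enabled)
{
    auto insn = enabled ? encode_jmp(site, slow_path) : nop5;
    std::memcpy(site_bytes, insn.data(), patch_size);
}
```

Patching code that other CPUs may be executing needs extra care that this sketch does not attempt to show.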

b03979d9

May 27, 2013

Add "memory clobber" to STI and CLI instructions · a200bb7a

Nadav Har'El authored 11 years ago

When some code section happens to be called from both thread context and
interrupt context, and we need mutual exclusion (we don't want the interrupt
context to start while the critical section is in the middle of running in
thread context), we surround the critical code section with CLI and STI.

But we need the compiler to assure us that writes to memory done between
the calls to CLI and STI stay between them. For example, if we have

    thread context:                 interrupt handler:

      CLI;                          a--;
      a++;
      STI;

We don't want the a++ to be moved by the compiler before the CLI. We also
don't want the compiler to save a's value in a register and only actually
write it back to the memory location 'a' after the STI (when an interrupt
handler might be concurrently writing). We also don't want the compiler
to remember a's last value in a register and use it again after the next
CLI.

To ensure these things, we need the "memory clobber" option on both the CLI
and STI instructions. The "volatile" keyword is not enough - it guarantees
that the instruction isn't deleted or moved, but not that stuff that
should have been in memory isn't just in registers.
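
As a rough illustration of the clobber syntax (GCC/Clang extended asm), assuming hypothetical wrapper names `irq_disable()` and `irq_enable()`: the asm strings are left empty here because CLI and STI are privileged and would fault in user space; in kernel code they would be "cli" and "sti". The part that matters is the "memory" entry in the clobber list.

```cpp
#include <cassert>

// Stand-ins for the CLI/STI wrappers. "volatile" keeps the statement from
// being deleted; the "memory" clobber additionally tells the compiler that
// memory may be read or written here, so it must not cache values in
// registers across the statement or move memory accesses over it.
inline void irq_disable() { asm volatile("" ::: "memory"); }  // would be "cli"
inline void irq_enable()  { asm volatile("" ::: "memory"); }  // would be "sti"

int a;  // shared with a (hypothetical) interrupt handler that does a--

void increment_in_critical_section()
{
    irq_disable();
    a++;           // load, modify and store all stay between the two barriers
    irq_enable();
}
```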

Note that Linux also has these memory clobbers on sti() and cli().
Linus Torvalds explains in a post from 1996 why these were necessary:
http://lkml.indiana.edu/hypermail/linux/kernel/9605/0214.html

All that being said, we never noticed a bug caused by the missing
"memory" clobbers. But better safe than sorry....

a200bb7a

May 26, 2013

- x64: use wrfsbase for faster context switching, when available · 3c9ba28d
  Avi Kivity authored 11 years ago
  
  Drops context switch time by ~80ns.
  3c9ba28d
- x64: add wrfsbase accessor · bb33c998
  Avi Kivity authored 11 years ago
  
  Faster way to write fsbase on newer processors.
  bb33c998

signal handling: fix FPU clobbering bug · 94a7015e

Nadav Har'El authored 11 years ago

This patch adds missing FPU-state saving when calling signal handlers.
The state is saved on the stack, to allow nesting of signal handling
(delivery of a second signal while a first signal's handler is running).

In Linux calling conventions, the FPU state is caller-saved, i.e., a
called function can use FPU at will because the caller is assumed to have
saved it if needed. However, signal handlers are called asynchronously,
possibly in the middle of some FPU computation without that computation
getting a chance to save its state. So we must save this state before calling
the signal handling function.

Without this fix, we had problems even if the signal handlers themselves
did not use the FPU. A typical scenario - which we encountered in the
"sunflow" benchmark - is that the signal handler does something which uses
a mutex (e.g., malloc()) and causes a reschedule. The reschedule, not a
preempt(), thinks it does not need to save the FPU state, and the thread
we switch to clobbers this state.

94a7015e

May 18, 2013
- math: add extern "C" to __isnan · 147bd7a1
  Avi Kivity authored 11 years ago
  
  musl doesn't define __isnan, so we must mark it as a C function.
  147bd7a1
May 07, 2013

sched: adjust stack deleter signature · 0fe76c0f

Avi Kivity authored 11 years ago

We want the size as well, to be able to munmap() pthread stacks.  Pass the
entire stack_info so we have this information.
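
A minimal sketch of why the size matters, assuming a hypothetical `stack_info` layout (begin pointer, size, deleter) and hypothetical helper names; the real structure may differ. munmap() needs the mapping length, so a deleter that only receives the base pointer cannot release an mmap()ed stack.

```cpp
#include <sys/mman.h>
#include <cassert>
#include <cstddef>

// Hypothetical layout: the deleter receives the whole descriptor, so it has
// both the base address and the size available.
struct stack_info {
    void* begin = nullptr;
    std::size_t size = 0;
    void (*deleter)(stack_info) = nullptr;
};

// Deleter for mmap()-allocated stacks; munmap() requires the length.
static void munmap_stack(stack_info si)
{
    munmap(si.begin, si.size);
}

// Allocate an anonymous mapping to serve as a stack. Returns an empty
// descriptor (begin == nullptr) if the mapping fails.
stack_info make_mmap_stack(std::size_t size)
{
    stack_info si;
    void* p = mmap(nullptr, size, PROT_READ | PROT_WRITE,
                   MAP_PRIVATE | MAP_ANONYMOUS, -1, 0);
    if (p == MAP_FAILED) {
        return si;
    }
    si.begin = p;
    si.size = size;
    si.deleter = munmap_stack;
    return si;
}

// Called when the thread is destroyed.
void release_stack(stack_info si)
{
    if (si.deleter) {
        si.deleter(si);
    }
}
```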

0fe76c0f

May 06, 2013

boot: adjust boot loader for loaded payload size dynamically · 53fa5724

Avi Kivity authored 11 years ago

Since we're packing the entire file system into the boot image, it has
overflowed the 128MB limit that was set for it.

Adjust the boot loader during build time to account for the actual loaded size.

Fixes weird corruption during startup.

53fa5724

May 01, 2013

Unify "mutex_t" and "mutex" types · 3c692eaa

Nadav Har'El authored 11 years ago

Previously we had two different mutex types - "mutex_t" defined by
<osv/mutex.h> for use in C code, and "mutex" defined by <mutex.hh>
for use in C++ code. This difference is unnecessary, and causes
a mess for functions that need to accept either type, so they work
for both C++ and C code (e.g., consider condvar_wait()).

So after this commit, we have just one include file, <osv/mutex.h>
which works both in C and C++ code. This results in the same type
and same functions being defined, plus some additional conveniences
when in C++, such as method variants of the functions (e.g.,
m.lock() in addition to mutex_lock(m)), and the "with_lock" function.

The mutex type is now called either "mutex_t" or "struct mutex" in
C code, or can also be called just "mutex" in C++ code (all three
names refer to an identical type - there's no longer a different
mutex_t and mutex type).
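
The shape of such a shared header could look roughly like the sketch below. This is a toy under stated assumptions: the lock body is a one-byte spin flag built on the GCC/Clang `__atomic` builtins, chosen only so the example is self-contained, and it is not the real mutex implementation. What it is meant to show is one struct, one set of C functions, and C++-only conveniences guarded by `__cplusplus`.

```cpp
#include <assert.h>
#ifdef __cplusplus
#include <type_traits>
#else
#include <stdbool.h>
#endif

struct mutex {
    bool locked;            /* toy spin flag (assumption for illustration) */
#ifdef __cplusplus
    void lock();            // method variants, visible to C++ only
    void unlock();
#endif
};
typedef struct mutex mutex_t;

#ifdef __cplusplus
extern "C" {
#endif

/* The C API; C++ code can call these too. */
static inline void mutex_lock(mutex_t* m)
{
    while (__atomic_test_and_set(&m->locked, __ATOMIC_ACQUIRE)) {
        /* spin until the flag was previously clear */
    }
}

static inline void mutex_unlock(mutex_t* m)
{
    __atomic_clear(&m->locked, __ATOMIC_RELEASE);
}

#ifdef __cplusplus
}

inline void mutex::lock() { mutex_lock(this); }
inline void mutex::unlock() { mutex_unlock(this); }

// "mutex_t", "struct mutex" and "mutex" all name the same type.
static_assert(std::is_same<mutex_t, mutex>::value, "one mutex type");

// Run f() with the mutex held, releasing it even if f() throws.
template <typename Func>
auto with_lock(mutex& m, Func f) -> decltype(f())
{
    m.lock();
    struct unlocker { mutex& m; ~unlocker() { m.unlock(); } } u{m};
    return f();
}
#endif
```

With this layout, a function such as condvar_wait() can take a `mutex_t*` and be called unchanged from both languages.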

This commit also modifies all the includers of <mutex.hh> to use
<osv/mutex.h>, and fixes a few miscellaneous compilation issues
that were discovered in the process.

3c692eaa

Apr 28, 2013
- cpuid: parse kvm cpuid · 6a227bd0
  Avi Kivity authored 11 years ago
  
  6a227bd0
- mmu: convert hardcoded pt_index() uses to calls to pt_index() · f7ed5c0a
  Avi Kivity authored 11 years ago
  
  f7ed5c0a
- build: collect elf notes in the output object · 1f198825
  Avi Kivity authored 11 years ago
  
  Needed to pass metadata to Xen.
  1f198825
Apr 24, 2013
- runtime: make abort() crash other processors immediately · 17e62a34
  Avi Kivity authored 11 years ago
  
  17e62a34
- apic: add a method to send NMIs · 0acf0b40
  Avi Kivity authored 11 years ago
  
  0acf0b40
- mmu: change linear_map() to accept a void* for the virtual address · 2d7a1b1d
  Avi Kivity authored 11 years ago
  
  Easier for most users.
  2d7a1b1d
- x64: switch phys_mem to 0xffffc00000000000 · de6ee8d0
  Avi Kivity authored 11 years ago
  
  0xffff800000000000 is used by Xen, so avoid it.
  de6ee8d0
Apr 23, 2013
- mmu: consolidate all appearances of 0xffff800000000000 into a variable · a0e8888c
  Avi Kivity authored 11 years ago
  
  Eliminate duplication.
  a0e8888c
- Add apic->ipi_allbutself · 7e0cac31
  Nadav Har'El authored 11 years ago
  
  Sorry, also forgot to commit this earlier! Add a new function to send an IPI to all processors except this one.
  7e0cac31
Apr 22, 2013

Allow creation of a new sched::thread pinned to a specific CPU. · 9e7ee944

Nadav Har'El authored 11 years ago

Previously, we had the option to create a pinned thread, but it always
runs on the same CPU as the current thread, which is kind of odd. Changed
the boolean attribute "pinned" to a cpu* attribute specifying the cpu to
pin to.

Example code to start a new thread pinned on cpu 1:
new sched::thread([&]{...}, sched::thread::attr(sched::cpus[1]));

I need this feature to test the cross-CPU TLB flushing feature - I need
to be able to run two threads on two different CPUs.

9e7ee944

Apr 14, 2013
- x64: add ioapic support · 3fd77ee5
  Avi Kivity authored 11 years ago
  
  Edge-triggered interrupts only, at present.
  3fd77ee5
Apr 11, 2013
- debug: convert yet another debug variant · 01a7ea60
  Avi Kivity authored 11 years ago
  
  01a7ea60
- debug: remove 'lf' parameter · ce9df682
  Avi Kivity authored 11 years ago
  
  This is only causing confusion; change all callers to add '\n' explicitly and drop the optional argument.
  ce9df682
Apr 08, 2013

Fix memory leak in thread creation · 68284de3

Nadav Har'El authored 11 years ago

Thread object creation used to leak one page for the FPU state (thanks
Avi for spotting this). Fix this (add a destructor which frees the page)
and add to the test-suite a test for checking that thread creation doesn't
leak memory - and while we're at it, also check that alloc_page() and
malloc() at various sizes do not leak memory.

68284de3

Free TCB (and thread-local storage) area on thread destruction. · 1f68d677

Nadav Har'El authored 11 years ago

We forgot to free the TCB allocated buffer on thread destruction, causing
a leak of around 1000 bytes per thread creation. We still have another
leak (exactly one page per thread object creation) that I'm trying to find :(

1f68d677

Apr 04, 2013
- interrupt: simplify unregister_handler() · 450e2851
  Avi Kivity authored 11 years ago
  
  No need to explicitly construct a temporary function object.
  450e2851
Apr 03, 2013

Handle #DE (divide error) exception · 372c4a73

Nadav Har'El authored 11 years ago

In the existing code, #PF was handled correctly (generating a SIGSEGV),
but on most other x86 hardware exceptions, we just abort()ed the kernel.

The #DE (divide error) exception should, like #PF, generate a signal
(the inappropriately-named SIGFPE), and this patch does this. Strangely,
the SPECjvm2008 benchmark depends on this behavior (I didn't check its
source code to figure out why).

To make it easier to generate other signals in the future, I abstracted
the existing function handle_segmentation_fault() into a more general
generate_signal() which is used in both #PF and #DE handling.
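
The mapping this implies can be sketched as a small lookup, assuming a hypothetical helper `signal_for_exception()` that is not part of the patch. The vector numbers are the architectural x86 ones (#DE is vector 0, #PF is vector 14); returning -1 stands in for "no signal, abort the kernel".

```cpp
#include <cassert>
#include <csignal>

// Architectural x86 exception vector numbers.
constexpr unsigned vector_de = 0;   // #DE, divide error
constexpr unsigned vector_pf = 14;  // #PF, page fault

// Hypothetical helper: which signal should a hardware exception generate?
// Returns -1 for exceptions that are not turned into a signal.
int signal_for_exception(unsigned vector)
{
    switch (vector) {
    case vector_de:
        return SIGFPE;    // also raised for integer division, despite the name
    case vector_pf:
        return SIGSEGV;
    default:
        return -1;        // caller falls back to abort()
    }
}
```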

372c4a73

Mar 21, 2013
- x64: qualify newer cr4 features on cpuid · 323764c0
  Avi Kivity authored 12 years ago
  
  Breaks on older processors.
  323764c0
- x64: fix cpuid xsave parsing · c2aba2e0
  Avi Kivity authored 12 years ago
  
  Used the wrong bit.
  c2aba2e0
Mar 19, 2013
- x64: avoid stack instructions in context switch code · 5384840a
  Avi Kivity authored 12 years ago
  
  Due to the "red zone", stack operations may corrupt local variables. Use ordinary moves and jumps instead.
  5384840a
- sched: save/restore fpu when preempting · 118e49b8
  Avi Kivity authored 12 years ago
  
  Normal scheduling does not need to save or restore the fpu when switching threads, since all fpu registers are caller-saved (so calling schedule() may clobber the fpu). However, this does not hold on preemption, so we need to save and restore the fpu state explicitly.
  118e49b8
- x64: set up cr4 for fpu save/restore · 03f81acc
  Avi Kivity authored 12 years ago
  
  03f81acc
- x64: add cpuid parser · d4adc432
  Avi Kivity authored 12 years ago
  
  Parse cpuid flag bits relevant to osv.
  d4adc432
- x64: add cpuid accessors · 01fa9ddf
  Avi Kivity authored 12 years ago
  
  01fa9ddf
- x64: add definitions for cr0/cr4 register bits · 08cf2de5
  Avi Kivity authored 12 years ago
  
  08cf2de5
- x64: add accessors for xsave/xsaveopt/xrstor/fxsave/fxrstor · eb3d0b69
  Avi Kivity authored 12 years ago
  
  eb3d0b69
Mar 07, 2013

sched: copy the tls image to a new thread, instead of zeroing it · f136f328

Avi Kivity authored 12 years ago

With the current memset(), every thread starts out with zero-initialized
tls variables.

Switch to memcpy(), so it gets the proper static initializer.
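
A rough sketch of the difference, assuming a hypothetical `tls_template` description of the image (the names and the filesz/memsz split are assumptions modelled on how ELF describes TLS, not taken from the patch): the initialized part is copied from the image, and only the remainder is left zeroed.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>
#include <vector>

// Hypothetical description of the TLS initialization image.
struct tls_template {
    const void* image;    // initialized data for thread-local variables
    std::size_t filesz;   // number of initialized bytes in the image
    std::size_t memsz;    // total TLS size, including zero-initialized variables
};

// Build the TLS block for a new thread.
std::vector<unsigned char> make_thread_tls(const tls_template& t)
{
    // The vector value-initializes its elements, so the whole block starts
    // out zeroed (this is all the old memset() approach provided).
    std::vector<unsigned char> block(t.memsz);
    // Copy the static initializers over the start of the block.
    if (t.filesz != 0) {
        std::memcpy(block.data(), t.image, t.filesz);
    }
    return block;
}
```

With the memset()-only behaviour, a variable declared as `thread_local int x = 7;` would read as 0 in every new thread.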

Fixes conf-preempt=0.

f136f328

Mar 06, 2013
- processor.hh: unindent · 295f8de8
  Avi Kivity authored 12 years ago
  
  295f8de8
- signals: implement SIGSEGV delivery · 5835902e
  Avi Kivity authored 12 years ago
  
  This only implements SIGSEGV delivery to the thread that triggered it (i.e., it must be unblocked).
  5835902e
Mar 04, 2013
- smp: add missing 'break' · 32432315
  Avi Kivity authored 12 years ago
  
  Found by eclipse.
  32432315
Mar 03, 2013

sched: preemption · 378e8aa1

Avi Kivity authored 12 years ago

Split the core scheduler into a function for calling from interrupts, and
a wrapper for calling it from normal paths.  Call the preemptible path from
interrupt handlers.

378e8aa1