Explain Linux commit message that patches/secures POP SS followed by a #BP interrupt (INT3)

Question

This is in reference to CVE-2018-8897 (which appears related to CVE-2018-1087), described as follows: A statement in the System Programming Guide of the Intel 64 and IA-32 Architectures Software Developer's Manual (SDM) was mishandled in the development of some or all operating-system kernels, resulting in unexpected behavior for #DB exceptions that are deferred by MOV SS or POP SS, as

Accepted Answer

usergs is referring to the x86-64 swapgs instruction, which exchanges gs with an internal saved GS value for the kernel to find the kernel stack from a syscall entry point.  The swaps also swap the cached gsbase segment info, rather than reloading from the GDT based on the gs value itself.  (wrgsbase can change the GS base independently of the GDT/LDT)AMD&#8217;s design is that syscall doesn&#8217;t change RSP to point to the kernel stack, and doesn&#8217;t read/write any memory, so syscall itself can be fast.  But then you enter the kernel with all registers holding their user-space values.  See Why does Windows64 use a different calling convention from all other OSes on x86-64? for some links to mailing list discussions between kernel devs and AMD architects in ~2000, tweaking the design of syscall and swapgs to make it usable before any AMD64 CPUs were sold.Apparently keeping track of whether GS is currently the kernel or user value is tricky for error handling: There&#8217;s no way to say &#8220;I want kernelgs now&#8221;; you have to know whether to run swapgs or not in any error-handling path.  The only instruction is a swap, not a set it to one vs. the other.Read comments in arch/x86/entry/entry_64.S e.g. https://github.com/torvalds/linux/blob/9fb71c2f230df44bdd237e9a4457849a3909017d/arch/x86/entry/entry_64.S#L1267 (from current Linux) which mentions usergs, and the next block of comments describes doing a swapgs before jumping to some error handling code with kernel gsbase.IIRC, the Linux kernel [gs:0] holds a thread info block, at the lowest addresses of the kernel stack for that thread.  The block includes the kernel stack pointer (as an absolute address, not relative to gs).I wouldn&#8217;t be surprised if this bug is basically tricking the kernel to loading kernel rsp from a user-controlled gsbase, or otherwise screwing up the dead-reckoning of swapgs so it has the wrong gs at some point.

Advertisement

Answer