x86/fpu: Simplify fpstate_sanitize_xstate() calls
author Ingo Molnar <mingo@kernel.org>
Tue, 28 Apr 2015 09:17:55 +0000 (11:17 +0200)
committer Ingo Molnar <mingo@kernel.org>
Tue, 19 May 2015 13:48:00 +0000 (15:48 +0200)
Remove the extra layer of __fpstate_sanitize_xstate():

    if (!use_xsaveopt())
        return;
    __fpstate_sanitize_xstate(tsk);

and move the check for use_xsaveopt() into fpstate_sanitize_xstate().

In general we optimize for the presence of CPU features, not for
the absence of them. Furthermore there's little point in this inlining,
as the call sites are not super hot code paths.
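
As a rough illustration of the resulting shape (not the kernel code itself), here is a minimal user-space sketch: a single out-of-line function that performs the feature check as its first step. The names demo_task, demo_use_xsaveopt(), demo_sanitize_xstate() and have_xsaveopt are hypothetical stand-ins, and zeroing a component is a simplification of resetting it to its init state.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical stand-in for the CPU feature flag behind use_xsaveopt(). */
static bool have_xsaveopt;

/* Hypothetical, heavily simplified stand-in for the per-task xstate buffer. */
struct demo_task {
	uint64_t xfeatures;        /* header bits: set if the component holds live state */
	uint8_t  component[2][8];  /* two toy state components */
};

static bool demo_use_xsaveopt(void)
{
	return have_xsaveopt;
}

/*
 * Single out-of-line entry point: the feature check is the first statement
 * of the function instead of living in an inline wrapper in a header.
 */
void demo_sanitize_xstate(struct demo_task *t)
{
	assert(t != NULL);

	if (!demo_use_xsaveopt())
		return;

	/* Clear every component whose header bit is zero (simplified to zeroing). */
	for (int bit = 0; bit < 2; bit++) {
		if (!(t->xfeatures & (1ULL << bit)))
			memset(t->component[bit], 0, sizeof(t->component[bit]));
	}
}
```

Callers just call demo_sanitize_xstate(&task) unconditionally; on a CPU without the feature the only cost is one function call and an early return, which is acceptable on paths that are not hot.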

Doing this uninlining shrinks the kernel text by 124 bytes:

       text       data        bss        dec        hex    filename
   14108751    2573624    1634304   18316679    1177d87    vmlinux.before
   14108627    2573624    1634304   18316555    1177d0b    vmlinux.after

Also remove a pointless '!fx' check from fpstate_sanitize_xstate():
'fx' is the address of a field embedded in the task struct, so it can
never be NULL.

Cc: Andy Lutomirski <luto@amacapital.net>
Cc: Borislav Petkov <bp@alien8.de>
Cc: Dave Hansen <dave.hansen@linux.intel.com>
Cc: Fenghua Yu <fenghua.yu@intel.com>
Cc: H. Peter Anvin <hpa@zytor.com>
Cc: Linus Torvalds <torvalds@linux-foundation.org>
Cc: Oleg Nesterov <oleg@redhat.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Thomas Gleixner <tglx@linutronix.de>
Signed-off-by: Ingo Molnar <mingo@kernel.org>
arch/x86/include/asm/fpu/internal.h
arch/x86/kernel/fpu/xstate.c

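diff --git a/arch/x86/include/asm/fpu/internal.h b/arch/x86/include/asm/fpu/internal.h
index 6b6fa46037f87e795939224a8cc0d202c009c4df..88fec3f108de16c597595f55435289d16fc54920 100644
--- a/arch/x86/include/asm/fpu/internal.h
+++ b/arch/x86/include/asm/fpu/internal.h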
@@ -139,14 +139,7 @@ static inline void fx_finit(struct i387_fxsave_struct *fx)
        fx->mxcsr = MXCSR_DEFAULT;
 }
 
-extern void __fpstate_sanitize_xstate(struct task_struct *);
-
-static inline void fpstate_sanitize_xstate(struct task_struct *tsk)
-{
-       if (!use_xsaveopt())
-               return;
-       __fpstate_sanitize_xstate(tsk);
-}
+extern void fpstate_sanitize_xstate(struct task_struct *);
 
 #define user_insn(insn, output, input...)                              \
 ({                                                                     \
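diff --git a/arch/x86/kernel/fpu/xstate.c b/arch/x86/kernel/fpu/xstate.c
index fc2ff1239fea604267986b32b5bf1256ef0ddeea..47b9591947e1e3bde36ad98751aec70875f66151 100644
--- a/arch/x86/kernel/fpu/xstate.c
+++ b/arch/x86/kernel/fpu/xstate.c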
@@ -92,13 +92,13 @@ EXPORT_SYMBOL_GPL(cpu_has_xfeatures);
  * if the corresponding header bit is zero. This is to ensure that user-space doesn't
  * see some stale state in the memory layout during signal handling, debugging etc.
  */
-void __fpstate_sanitize_xstate(struct task_struct *tsk)
+void fpstate_sanitize_xstate(struct task_struct *tsk)
 {
        struct i387_fxsave_struct *fx = &tsk->thread.fpu.state.fxsave;
        int feature_bit;
        u64 xfeatures;
 
-       if (!fx)
+       if (!use_xsaveopt())
                return;
 
        xfeatures = tsk->thread.fpu.state.xsave.header.xfeatures;