author | Martin Schwidefsky <schwidefsky@de.ibm.com> | 2014-04-03 13:55:01 +0200
---|---|---
committer | Martin Schwidefsky <schwidefsky@de.ibm.com> | 2014-04-03 14:31:00 +0200
commit | 1b948d6caec4f28e3524244ca0f77c6ae8ddceef (patch) |
tree | bc7e1d5800f10c39979d3f47872ba7047568f8a4 /arch/s390/include/asm/mmu_context.h |
parent | 02a8f3abb708919149cb657a5202f4603f0c38e2 (diff) |
s390/mm,tlb: optimize TLB flushing for zEC12
The zEC12 machines introduced the local-clearing control for the IDTE
and IPTE instructions. If the control is set, only the TLB of the
local CPU is cleared of entries: all entries of a single address
space for IDTE, or the entry for a single page for IPTE. Without the
local-clearing control the TLB flush is broadcast to all CPUs in the
configuration, which is expensive.
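
To make the trade-off concrete, here is a minimal sketch (not part of
this patch) of how a flush routine can exploit the facility. The
helpers __tlb_flush_local() and __tlb_flush_global() are assumed
stand-ins for the IDTE variants with and without the local-clearing
control; MACHINE_HAS_TLB_LC is the facility test that appears in the
diff below.

```c
#include <linux/cpumask.h>
#include <linux/mm_types.h>
#include <linux/preempt.h>
#include <linux/smp.h>
#include <asm/setup.h>

/* assumed primitives: IDTE with and without the local-clearing bit */
void __tlb_flush_local(void);
void __tlb_flush_global(void);

/*
 * Sketch only: flush all TLB entries for an address space. If the
 * zEC12 local-clearing facility is available and the mm is attached
 * to no CPU but the current one, a local IDTE suffices; otherwise
 * the flush has to be broadcast to every CPU in the configuration.
 */
static inline void tlb_flush_mm_sketch(struct mm_struct *mm)
{
	preempt_disable();
	if (MACHINE_HAS_TLB_LC &&
	    cpumask_equal(mm_cpumask(mm), cpumask_of(smp_processor_id())))
		__tlb_flush_local();	/* IDTE with local-clearing set */
	else
		__tlb_flush_global();	/* broadcast IDTE: expensive */
	preempt_enable();
}
```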
Resetting the bit mask of CPUs that need a flush after a non-local
IDTE is tricky: TLB entries for an address space remain in the TLB
even after the address space has been detached, so a second bit field
is required to distinguish the CPUs that have the mm attached from
the CPUs in need of a flush. After a non-local flush with IDTE, the
bit field of attached CPUs is copied to the bit field of CPUs in need
of a flush. The operations on cpu_attach_mask, attach_count and
mm_cpumask(mm) are ordered such that an underindication in
mm_cpumask(mm) (a CPU that needs a flush but is not marked) is
prevented, while an overindication (a CPU marked although it no
longer needs a flush, costing at worst a spurious flush) is possible.
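
The reset step can be sketched as follows, modeled on the
__tlb_flush_full() logic this series adds to tlbflush.h (outside the
diffstat shown below), so treat the exact shape as an approximation;
the same assumed __tlb_flush_local()/__tlb_flush_global() primitives
are used as in the sketch above.

```c
#include <linux/atomic.h>
#include <linux/cpumask.h>
#include <linux/mm_types.h>
#include <linux/preempt.h>
#include <linux/smp.h>
#include <asm/setup.h>

/*
 * Sketch of the flush-mask reset. After a broadcast IDTE every TLB is
 * clean, so only CPUs that currently have the mm attached can build
 * up stale entries again: copy cpu_attach_mask over mm_cpumask(mm).
 * A concurrently attaching CPU may leave an extra bit behind
 * (overindication, at worst a spurious flush later); the ordering
 * against attach_count prevents a missing bit (underindication).
 */
static inline void tlb_flush_full_sketch(struct mm_struct *mm)
{
	preempt_disable();
	/* make concurrent switch_mm() wait, cf. the ">> 16" test below */
	atomic_add(0x10000, &mm->context.attach_count);
	if (cpumask_equal(mm_cpumask(mm), cpumask_of(smp_processor_id()))) {
		__tlb_flush_local();
	} else {
		__tlb_flush_global();
		/* attached CPUs become the new set in need of a flush */
		if (MACHINE_HAS_TLB_LC)
			cpumask_copy(mm_cpumask(mm),
				     &mm->context.cpu_attach_mask);
	}
	atomic_sub(0x10000, &mm->context.attach_count);
	preempt_enable();
}
```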
Signed-off-by: Martin Schwidefsky <schwidefsky@de.ibm.com>
Diffstat (limited to 'arch/s390/include/asm/mmu_context.h')
-rw-r--r-- | arch/s390/include/asm/mmu_context.h | 5 |
1 file changed, 5 insertions(+), 0 deletions(-)
```diff
diff --git a/arch/s390/include/asm/mmu_context.h b/arch/s390/include/asm/mmu_context.h
index 7abf318b1522..71a258839039 100644
--- a/arch/s390/include/asm/mmu_context.h
+++ b/arch/s390/include/asm/mmu_context.h
@@ -15,6 +15,7 @@ static inline int init_new_context(struct task_struct *tsk,
 				   struct mm_struct *mm)
 {
+	cpumask_clear(&mm->context.cpu_attach_mask);
 	atomic_set(&mm->context.attach_count, 0);
 	mm->context.flush_mm = 0;
 	mm->context.asce_bits = _ASCE_TABLE_LENGTH | _ASCE_USER_BITS;
 #ifdef CONFIG_64BIT
@@ -59,6 +60,8 @@ static inline void switch_mm(struct mm_struct *prev, struct mm_struct *next,
 
 	if (prev == next)
 		return;
+	if (MACHINE_HAS_TLB_LC)
+		cpumask_set_cpu(cpu, &next->context.cpu_attach_mask);
 	if (atomic_inc_return(&next->context.attach_count) >> 16) {
 		/* Delay update_user_asce until all TLB flushes are done. */
 		set_tsk_thread_flag(tsk, TIF_TLB_WAIT);
@@ -73,6 +76,8 @@ static inline void switch_mm(struct mm_struct *prev, struct mm_struct *next,
 	}
 	atomic_dec(&prev->context.attach_count);
 	WARN_ON(atomic_read(&prev->context.attach_count) < 0);
+	if (MACHINE_HAS_TLB_LC)
+		cpumask_clear_cpu(cpu, &prev->context.cpu_attach_mask);
 }
 
 #define finish_arch_post_lock_switch finish_arch_post_lock_switch
```
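
Note the ordering in switch_mm() above: a CPU is added to
next->context.cpu_attach_mask before attach_count is incremented, and
removed from prev->context.cpu_attach_mask only after the decrement.
Combined with the copy of cpu_attach_mask after a non-local IDTE, this
is what rules out an underindication in mm_cpumask(mm) while
tolerating an overindication.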