bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[OpenMP] NFC: Fix trivial typos in comments	Kazuaki Ishizaki	2020-01-07	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \|	Reviewers: jdoerfert, Jim Reviewed By: Jim Subscribers: Jim, mgorny, guansong, jfb, openmp-commits Tags: #openmp Differential Revision: https://reviews.llvm.org/D72285
*	[OpenMP] NFC: Fix trivial typos in comments	Kelvin Li	2020-01-03	1	-2/+2
\| \| \| \| \| \|	Submitted by: kiszk Differential Revision: https://reviews.llvm.org/D72171
*	[OpenMP] Fix linkage issue on FreeBSD	David Carlier	2019-12-06	1	-1/+1
\| \| \| \|	needs kmp_set_thread_affinity_mask_initial implementation.
*	[OpenMP] Enable thread affinity on FreeBSD	David Carlier	2019-10-08	1	-2/+2
\| \| \| \| \| \| \| \| \| \|	Reviewers: chandlerc, jlpeyton, jdoerfert, dim Reviewed-By: dim Differential Revision: https://reviews.llvm.org/D68580 llvm-svn: 374118
*	[OpenMP] Remove OMP spec versioning	Jonathan Peyton	2019-07-12	1	-43/+4
\| \| \| \| \| \| \| \| \| \|	Remove all older OMP spec versioning from the runtime and build system. Patch by Terry Wilmarth Differential Revision: https://reviews.llvm.org/D64534 llvm-svn: 365963
*	Fixed build warning with -DLIBOMP_USE_HWLOC=1	Andrey Churbanov	2019-06-03	1	-1/+2
\| \| \| \| \| \| \| \| \| \|	Made type of depth of hwloc object to correapond with change from unsigned in hwloc 1,x to int in hwloc 2.x. This eliminates the warning on signed-unsigned comparison. Differential Revision: https://reviews.llvm.org/D62332 llvm-svn: 362401
*	Fix hwloc topology traversal code unable to handle situation where L2 cache ↵	Paul Osmialowski	2019-05-16	1	-2/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	is common for the packages Currently cores within package that share the same L2 cache are grouped together. The current logic behind this assumes that the L2 cache is always at deeper (or the same) level than the package itself. In case when L2 cache is common for all packages (and the packages are at deeper level than L2 cache) the whole of the further topology discovery fails to find any computational units resulting in following assertion: Assertion failure at kmp_affinity.cpp(715): nActiveThreads == __kmp_avail_proc. OMP: Error #13: Assertion failure at kmp_affinity.cpp(715). This patch adds a bit of a logic that prevents such situation from occurring. Differential Revision: https://reviews.llvm.org/D61796 llvm-svn: 360890
*	[OpenMP] Eliminate some compiler warnings	Jonathan Peyton	2019-04-30	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \|	* Remove accidental == for = * Assign values to variables to appease compiler * Surround debug code with KMP_DEBUG * Remove unused local typedefs Differential Revision: https://reviews.llvm.org/D60983 llvm-svn: 359599
*	Update more file headers across all of the LLVM projects in the monorepo	Chandler Carruth	2019-01-19	1	-4/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	to reflect the new license. These used slightly different spellings that defeated my regular expressions. We understand that people may be surprised that we're moving the header entirely to discuss the new license. We checked this carefully with the Foundation's lawyer and we believe this is the correct approach. Essentially, all code in the project is now made available by the LLVM project under our new license, so you will see that the license headers include that license only. Some of our contributors have contributed code under our old license, and accordingly, we have retained a copy of our old license notice in the top-level files in each project and repository. llvm-svn: 351648
*	[OpenMP] Remove compiler warning about unused value	Joachim Protze	2019-01-16	1	-6/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The compiler warns about an unused variable/statement: runtime/src/kmp_affinity.cpp:4958:18: warning: statement has no effect [-Wunused-value] KA_TRACE(1000, ; { ^ runtime/src/kmp_debug.h:84:24: note: in definition of macro 'KA_TRACE' __kmp_debug_printf x; \ ^ Instead of the unused reference to this function, this patch now calls the function with an empty string. The call to this function should have no effect. Patch provided by joachim.protze Reviewers: jlpeyton, hbae, AndreyChurbanov Reviewed By: AndreyChurbanov Tags: #openmp, #ompt Differential Revision: https://reviews.llvm.org/D56775 llvm-svn: 351323
*	[OpenMP] Fix for nested proc_bind affinity bug	Jonathan Peyton	2019-01-15	1	-6/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Using proc_bind clause on a nested #pragma omp parallel region with KMP_AFFINITY set causes an assertion error. This assertion occurs because the place-partition-var is not properly initialized in the nested master threads. Trying to get an intuitive result with KMP_AFFINITY + proc_bind is difficult because of how the KMP_AFFINITY gtid-to-place mapping occurs. This patch creates an initial place list no matter what affinity mechanism is used. For KMP_AFFINITY, the place-partition-var is initialized to all the places. Differential Revision: https://reviews.llvm.org/D55795 llvm-svn: 351227
*	[OpenMP] Implement OpenMP 5.0 affinity format functionality	Jonathan Peyton	2018-12-13	1	-30/+110
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch adds the affinity format functionality introduced in OpenMP 5.0. This patch adds: Two new environment variables: OMP_DISPLAY_AFFINITY=TRUE\|FALSE OMP_AFFINITY_FORMAT=<string> and Four new API: 1) omp_set_affinity_format() 2) omp_get_affinity_format() 3) omp_display_affinity() 4) omp_capture_affinity() The affinity format functionality has two ICV's associated with it: affinity-display-var (bool) and affinity-format-var (string). The affinity-display-var enables/disables the functionality through the envirable OMP_DISPLAY_AFFINITY. The affinity-format-var is a formatted string with the special field types beginning with a '%' character similar to printf For example, the affinity-format-var could be: "OMP: host:%H pid:%P OStid:%i num_threads:%N thread_num:%n affinity:{%A}" The affinity-format-var is displayed by every thread implicitly at the beginning of a parallel region when any thread's affinity has changed (including a brand new thread being spawned), or explicitly using the omp_display_affinity() API. The omp_capture_affinity() function can capture the affinity-format-var in a char buffer. And omp_set\|get_affinity_format() allow the user to set\|get the affinity-format-var explicitly at runtime. omp_capture_affinity() and omp_get_affinity_format() both return the number of characters needed to hold the entire string it tried to make (not including NULL character). If not enough buffer space is available, both these functions truncate their output. Differential Revision: https://reviews.llvm.org/D55148 llvm-svn: 349089
*	[OpenMP] Fix balanced affinity so thread's private affinity mask is updated	Jonathan Peyton	2018-09-26	1	-7/+5
\| \| \| \| \| \| \| \| \| \| \|	Balanced affinity only updated the thread's affinity with the operating system. This change also has the thread's private mask reflect that change as well so that any API that probes the thread's affinity mask will report the correct mask value. Differential Revision: https://reviews.llvm.org/D52379 llvm-svn: 343142
*	[OpenMP] Fixed affinity verbose double printing for balanced type.	Jonathan Peyton	2018-08-24	1	-1/+2
\| \| \| \|	llvm-svn: 340647
*	[OpenMP] Cleanup code	Jonathan Peyton	2018-08-09	1	-38/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch cleans up unused functions, variables, sign compare issues, and addresses some -Warning flags which are now enabled including -Wcast-qual. Not all the warning flags in LibompHandleFlags.cmake are enabled, but some are with this patch. Some __kmp_gtid_from_* macros in kmp.h are switched to static inline functions which allows us to remove the awkward definition of KMP_DEBUG_ASSERT() and KMP_ASSERT() macros which used the comma operator. This had to be done for the innumerable -Wunused-value warnings related to KMP_DEBUG_ASSERT() Differential Revision: https://reviews.llvm.org/D49105 llvm-svn: 339393
*	[OpenMP] Introduce hierarchical scheduling	Jonathan Peyton	2018-07-09	1	-0/+87
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch introduces the logic implementing hierarchical scheduling. First and foremost, hierarchical scheduling is off by default To enable, use -DLIBOMP_USE_HIER_SCHED=On during CMake's configure stage. This work is based off if the IWOMP paper: "Workstealing and Nested Parallelism in SMP Systems" Hierarchical scheduling is the layering of OpenMP schedules for different layers of the memory hierarchy. One can have multiple layers between the threads and the global iterations space. The threads will go up the hierarchy to grab iterations, using possibly a different schedule & chunk for each layer. [ Global iteration space (0-999) ] (use static) [ L1 \| L1 \| L1 \| L1 ] (use dynamic,1) [ T0 T1 \| T2 T3 \| T4 T5 \| T6 T7 ] In the example shown above, there are 8 threads and 4 L1 caches begin targeted. If the topology indicates that there are two threads per core, then two consecutive threads will share the data of one L1 cache unit. This example would have the iteration space (0-999) split statically across the four L1 caches (so the first L1 would get (0-249), the second would get (250-499), etc). Then the threads will use a dynamic,1 schedule to grab iterations from the L1 cache units. There are currently four supported layers: L1, L2, L3, NUMA OMP_SCHEDULE can now read a hierarchical schedule with this syntax: OMP_SCHEDULE='EXPERIMENTAL LAYER,SCHED[,CHUNK][:LAYER,SCHED[,CHUNK]...]:SCHED,CHUNK And OMP_SCHEDULE can still read the normal SCHED,CHUNK syntax from before I've kept most of the hierarchical scheduling logic inside kmp_dispatch_hier.h to try to keep it separate from the rest of the code. Differential Revision: https://reviews.llvm.org/D47962 llvm-svn: 336571
*	[OpenMP] Fix affinity API for KMP_AFFINITY=none\|compact\|scatter	Jonathan Peyton	2018-04-18	1	-2/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Currently, the affinity API reports garbage for the initial place list and any thread's place lists when using KMP_AFFINITY=none\|compact\|scatter. This patch does two things: for KMP_AFFINITY=none, Creates a one entry table for the places, this way, the initial place list is just a single place with all the proc ids in it. We also set the initial place of any thread to 0 instead of KMP_PLACE_ALL so that the thread reports that single place (place 0) instead of garbage (-1) when using the affinity API. When non-OMP_PROC_BIND affinity is used (including KMP_AFFINITY=compact\|scatter), a thread's place list is populated correctly. We assume that each thread is assigned to a single place. This is implemented in two of the affinity API functions Differential Revision: https://reviews.llvm.org/D45527 llvm-svn: 330283
*	[AArch64] fix an issue with older /proc/cpuinfo layout	Paul Osmialowski	2017-12-13	1	-0/+8
\| \| \| \| \| \| \| \| \| \|	There are two /proc/cpuinfo layots in use for AArch64: old and new. The old one has all 'processor : n' lines in one section, hence checking for duplications does not make sense. Differential Revision: https://reviews.llvm.org/D41000 llvm-svn: 320593
*	Fix thread affinity on non-x86 Linux	Jonas Hahnfeld	2017-12-08	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	To make thread affinity work according to the OpenMP spec, the runtime needs information about the hardware topology. On Linux the default way is to parse /proc/cpuinfo which contains this information for x86 machines but (at least) not for AArch64 and Power architectures. Fortunately, there is a different code path which is able to get that data from sysfs. The needed patch has landed in 2006 for Linux 2.6.16 which is safe to assume nowadays (even RHEL 5 had a kernel version derived from 2.6.18, and we are now at RHEL 7!). Differential Revision: https://reviews.llvm.org/D40357 llvm-svn: 320151
*	Eliminate double printing of verbose affinity settings	Jonathan Peyton	2017-12-06	1	-1/+3
\| \| \| \| \| \| \| \| \|	Redundant extra verbose output of binding to full mask in case affinity=balanced or OMP_PLACES=<any> or OMP_PROC_BIND=<any> Differential Revision: https://reviews.llvm.org/D40624 llvm-svn: 319960
*	Extension of HWLOC topology discovery with NUMA nodes and tiles	Andrey Churbanov	2017-11-30	1	-172/+345
\| \| \| \| \| \| \| \|	Patch by Olga Malysheva Differential Revision: https://reviews.llvm.org/D40309 llvm-svn: 319422
*	Warning is emitted when tiles are requested but cannot be used	Jonathan Peyton	2017-11-29	1	-0/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Added two warnings: 1) Before building the topology map check if tiles are requested but the topo method is not hwloc; 2) After building the topology map check if tiles are requested but not detected by the library. Patch by Olga Malysheva Differential Revision: https://reviews.llvm.org/D40340 llvm-svn: 319374
*	Apply formatting changes	Jonathan Peyton	2017-10-20	1	-3/+0
\| \| \| \| \| \| \| \| \| \|	.clang-format's comments are removed and a (hopefully) final set of formatting changes are applied. Differential Revision: https://reviews.llvm.org/D38837 Differential Revision: https://reviews.llvm.org/D38920 llvm-svn: 316227
*	Remove unnecessary semicolons	Jonathan Peyton	2017-09-27	1	-1/+1
\| \| \| \| \| \| \| \|	Removes semicolons after if {} blocks, function definitions, etc. I was able to apply the large OMPT patch cleanly on top of this one with no conflicts. llvm-svn: 314340
*	Minor code cleanup of Klocwork issues	Jonathan Peyton	2017-09-05	1	-6/+5
\| \| \| \| \| \| \| \| \| \| \|	Minor code cleanup of Klocwork issues. Fatal messages are given no return attribute. Define and use KMP_NORETURN to work for multiple C++ versions. Patch by Terry Wilmarth Differential Revision: https://reviews.llvm.org/D37275 llvm-svn: 312538
*	OpenMP RTL cleanup: eliminated warnings with -Wcast-qual, patch 2.	Andrey Churbanov	2017-07-17	1	-7/+5
\| \| \| \| \| \| \| \| \|	Changes are: got all atomics to accept volatile pointers that allowed to simplify many type conversions. Windows specific code fixed correspondingly. Differential Revision: https://reviews.llvm.org/D35417 llvm-svn: 308164
*	OpenMP RTL cleanup: eliminated warnings with -Wcast-qual.	Andrey Churbanov	2017-07-03	1	-76/+79
\| \| \| \| \| \| \| \| \| \| \|	Changes are: replaced C-style casts with cons_cast and reinterpret_cast; type of several counters changed to signed; type of parameters of 32-bit and 64-bit AND and OR intrinsics changes to unsigned; changed files formatted using clang-format version 3.8.1. Differential Revision: https://reviews.llvm.org/D34759 llvm-svn: 307020
*	Fix minor formatting issues	Jonathan Peyton	2017-06-01	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \|	Some code was restructured to move it under KMP_DEBUG. The rest is formatting changes to fix some things broken by clang-format Patch by Terry Wilmarth Differential Revision: https://reviews.llvm.org/D33744 llvm-svn: 304438
*	Fix for KMP_AFFINITY=disabled and KMP_TOPOLOGY_METHOD=hwloc	Jonathan Peyton	2017-05-31	1	-1/+4
\| \| \| \| \| \| \| \| \| \| \| \|	With these settings, the create_hwloc_map() method was being called causing an assert(). After some consideration, it was determined that disabling affinity explicitly should just disable hwloc as well. i.e., KMP_AFFINITY overrides KMP_TOPOLOGY_METHOD. This lets the user know that the Hwloc mechanism is being ignored when KMP_AFFINITY=disabled. Differential Revision: https://reviews.llvm.org/D33208 llvm-svn: 304344
*	Fix for KMP_AFFINITY=respect with multiple processor groups	Jonathan Peyton	2017-05-15	1	-3/+2
\| \| \| \| \| \| \| \| \|	An assert() was being tripped when KMP_AFFINITY=respect + Multiple Processor Groups. Let __kmp_affinity_create_proc_group_map() function be able to create address2os object which contains a single group by deleting restriction that process affinity mask must span multiple groups. llvm-svn: 303101
*	Clang-format and whitespace cleanup of source code	Jonathan Peyton	2017-05-12	1	-4711/+4265
\| \| \| \| \| \| \| \| \| \| \| \| \|	This patch contains the clang-format and cleanup of the entire code base. Some of clang-formats changes made the code look worse in places. A best effort was made to resolve the bulk of these problems, but many remain. Most of the problems were mangling line-breaks and tabbing of comments. Patch by Terry Wilmarth Differential Revision: https://reviews.llvm.org/D32659 llvm-svn: 302929
*	Fix Hwloc API Incompatibility	Jonathan Peyton	2017-04-25	1	-4/+4
\| \| \| \| \| \| \| \| \| \| \|	Older Hwloc libraries (< 1.10.0) don't offer the HWLOC_OBJ_NUMANODE nor HWLOC_OBJ_PACKAGE types. Instead they are named HWLOC_OBJ_NODE and HWLOC_OBJ_SOCKET instead. This patch just defines the newer names based on the older names when using an older Hwloc. Differential Revision: https://reviews.llvm.org/D32496 llvm-svn: 301349
*	KMP_HW_SUBSET extended with NUMA support when HWLOC enabled	Andrey Churbanov	2017-04-13	1	-75/+638
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D31600 llvm-svn: 300220
*	Fix incorrect initial value of __kmp_affinity_type.	Jonathan Peyton	2017-03-20	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \|	Affinity initialization code expects __kmp_affinity_type has the value affinity_default by default, but the cleanup code does not properly set the value back to affinity_default. This may introduce some issues when multiple roots are trying to initialize/uninitialize the runtime successively. Patch by Hansang Bae Differential Revision: https://reviews.llvm.org/D31012 llvm-svn: 298313
*	Printing OS thread id, when KMP_AFFINITY is set.	Jonathan Peyton	2017-01-27	1	-6/+4
\| \| \| \| \| \| \| \|	Patch by Vishakha Agrawal Differential Revision: https://reviews.llvm.org/D28873 llvm-svn: 293315
*	kmp_affinity: Fix check if specific bit is set	Jonas Hahnfeld	2017-01-12	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Clang 4.0 trunk warns: warning: logical not is only applied to the left hand side of this bitwise operator [-Wlogical-not-parentheses] This points to a potential bug if the code really wants to check if the single bit is not set: If for example (buf.edx >> 9) = 2 (has any bit set except the least significant one), 'logical not' will return 0 which stays 0 after the 'bitwise and'. To do this correctly we first need to evaluate the 'bitwise and'. In that case it returns 2 & 1 = 0 which after the 'logical not' evaluates to 1. Differential Revision: https://reviews.llvm.org/D28599 llvm-svn: 291764
*	Introduce dynamic affinity dispatch capabilities	Jonathan Peyton	2016-11-14	1	-56/+42
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This set of changes enables the affinity interface (Either the preexisting native operating system or HWLOC) to be dynamically set at runtime initialization. The point of this change is that we were seeing performance degradations when using HWLOC. This allows the user to use the old affinity mechanisms which on large machines (>64 cores) makes a large difference in initialization time. These changes mostly move affinity code under a small class hierarchy: KMPAffinity class Mask {} KMPNativeAffinity : public KMPAffinity class Mask : public KMPAffinity::Mask KMPHwlocAffinity class Mask : public KMPAffinity::Mask Since all interface functions (for both affinity and the mask implementation) are virtual, the implementation can be chosen at runtime initialization. Differential Revision: https://reviews.llvm.org/D26356 llvm-svn: 286890
*	Fix bitmask upper bounds check	Jonathan Peyton	2016-09-12	1	-15/+16
\| \| \| \| \| \| \| \| \| \| \| \|	Rather than checking KMP_CPU_SETSIZE, which doesn't exist when using Hwloc, we use the get_max_proc() function which can vary based on the operating system. For example on Windows with multiple processor groups, it might be the case that the highest bit possible in the bitmask is not equal to the number of hardware threads on the machine but something higher than that. Differential Revision: https://reviews.llvm.org/D24206 llvm-svn: 281245
*	Move function into cpp file under KMP_AFFINITY_SUPPORTED guard.	Jonathan Peyton	2016-09-02	1	-0/+25
\| \| \| \| \| \| \| \| \| \| \| \|	When affinity isn't supported, __kmp_affinity_compact doesn't exist. The problem is that in kmp_affinity.h there is a function which uses it without the proper KMP_AFFINITY_SUPPORTED guard around it. The compiler was smart enough to ignore it and the function __kmp_affinity_cmp_Address_child_num which relies on it, but I think it is cleaner to have it under the proper guard. Since the function is only used in the kmp_affinity.cpp file and there aren't any plans to have it elsewhere. I have moved it there. llvm-svn: 280542
*	Replace a bad instance of __kmp_free() with KMP_CPU_FREE_ARRAY() macro.	Jonathan Peyton	2016-09-02	1	-1/+1
\| \| \| \|	llvm-svn: 280530
*	Fixed x2APIC discovery for 256-processor architectures.	Andrey Churbanov	2016-08-05	1	-3/+3
\| \| \| \| \| \| \| \|	Mask for value read from ebx register returned by CPUID expanded to 0xFFFF. Differential Revision: https://reviews.llvm.org/D23203 llvm-svn: 277825
*	Make balanced affinity work on AArch64.	Paul Osmialowski	2016-07-29	1	-57/+141
\| \| \| \| \| \| \| \| \| \| \|	This patch enables balanced affinity on machines that do not have hardware threads and have cores clustered into packages. In facts, balacing algorithm could be generalized for any arrangement with at least two levels of hierarchy (depth > 1). Differential Revision: https://reviews.llvm.org/D22365 llvm-svn: 277212
*	D22136: Memory leaks fixed by adding missed __kmp_free() calls	Andrey Churbanov	2016-07-08	1	-0/+2
\| \| \| \|	llvm-svn: 274850
*	Improvements to process affinity mask setting	Jonathan Peyton	2016-06-21	1	-51/+102
\| \| \| \| \| \| \| \| \| \| \| \|	A couple improvements: 1) Add ability to limit fullMask size when KMP_HW_SUBSET limits resources. 2) Make KMP_HW_SUBSET work for affinity_none, and only limit fullMask in this case. Patch by Andrey Churbanov. Differential Revision: http://reviews.llvm.org/D21528 llvm-svn: 273278
*	Change hwloc discovery algorithm to print topology only for accessible resources	Jonathan Peyton	2016-06-16	1	-17/+29
\| \| \| \| \| \| \| \| \| \| \| \| \|	Change hwloc discovery algorithm to print topology for only accessible resources, and report uniformity correspondingly, similar to what other topology discovery algorithms do. Fixes minor inconsistency in total topology reported and resources used for threads binding in case hwloc used. Patch by Andrey Churbanov. Differential Revision: http://reviews.llvm.org/D21389 llvm-svn: 272952
*	Fixed missing memory cleanup in __kmp_affinity_create_hwloc_map()	Jonathan Peyton	2016-06-16	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \|	Cleanup: fixed missing memory cleanup in couple of corner cases. Fixes possible memory leak in some corner cases Patch by Andrey Churbanov Differential Revision: http://reviews.llvm.org/D21355 llvm-svn: 272946
*	Deprecate KMP_PLACE_THREADS and rename as KMP_HW_SUBSET	Jonathan Peyton	2016-06-16	1	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \|	Deprecate KMP_PLACE_THREADS and rename it to KMP_HW_SUBSET due to confusion about its purpose and function among users. KMP_HW_SUBSET is an environment variable which allows users to easily pick a subset of the hardware topology to use. e.g., KMP_HW_SUBSET=30c,2t means use 30 cores, 2 threads per core. Patch by Andrey Churbanov Differential Revision: http://reviews.llvm.org/D21340 llvm-svn: 272937
*	Affinity mask processing improvements	Jonathan Peyton	2016-06-13	1	-49/+44
\| \| \| \| \| \| \| \| \| \| \| \|	Remove static specifier from var fullMask and remove kmp_get_fullMask() routine. When iterating through procs in a mask, always check if proc is in fullMask (this check was missing in a few places). Patch by Brian Bliss. Differential Revision: http://reviews.llvm.org/D21300 llvm-svn: 272589
*	Hwloc refactoring patch	Jonathan Peyton	2016-06-13	1	-106/+122
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	These changes remove the hwloc_topology_ignore_type function which doesn't exist in the hwloc 2.0 API. In the existing code, the topology extracted from hwloc has the cache levels stripped out and then assumes the final stripped topology follows the typical three-level topology: packages -> cores -> HW threads. But the code is doing unclean manipulations to determine at what level those resources are located and also assumes too much about what hwloc is detecting (there could be intermediate levels in between socket and core for instance). This new way of extracting the topology doesn't strip out any hardware objects that hwloc detects. It does not assume the three level topology, and instead searches for the relevant three levels within the topology for each bit of information using hwloc interface functions. i.e., the three level topology subset that our affinity code is interested in is extracted from the hwloc topology tree directly. For example, the new __kmp_hwloc_get_nobjs_under_obj function gives the user the number of cores under a socket reliably without worrying if there are unexpected objects between the socket object and core object in the hwloc topology structure. Also, now that all topology information is kept, there are also possibilities of using the caches/numa nodes to determine more sophisticated affinity settings in the future. There is also some cleanup code added for the destruction of the __kmp_hwloc_topology object. Differential Revision: http://reviews.llvm.org/D21195 llvm-svn: 272565
*	Remove architecture dependent Hwloc DEBUG section	Jonathan Peyton	2016-04-25	1	-30/+0
\| \| \| \| \| \| \|	This debug sections's functionality can be replicated using the environment variable KMP_TOPOLOGY_METHOD with different values and KMP_AFFINITY=verbose llvm-svn: 267472