bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[tsan] Add support for C++ exceptions into TSan (call __tsan_func_exit ↵	Kuba Brecka	2016-11-14	7	-178/+195
\| \| \| \| \| \| \| \| \| \|	during unwinding), LLVM part This adds support for TSan C++ exception handling, where we need to add extra calls to __tsan_func_exit when a function is exitted via exception mechanisms. Otherwise the shadow stack gets corrupted (leaked). This patch moves and enhances the existing implementation of EscapeEnumerator that finds all possible function exit points, and adds extra EH cleanup blocks where needed. Differential Revision: https://reviews.llvm.org/D26177 llvm-svn: 286893
*	Add a checkSymbolTable() method to the MachOObjectFile class.	Kevin Enderby	2016-11-14	1	-0/+68
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The philosophy of the error checking in libObject for Mach-O files is that the constructor will check the load commands so for their tables the offsets and sizes are properly contained in the file. But there is no checking of the entries of any of the tables. For the contents of the tables themselves the methods accessing the contents of the entries return errors as needed. In some cases this however makes it difficult or cumbersome to produce a good error message which would include the tool name, file name, archive member, and name of the architecture of a slice of a universal file the error occurred in. So idea is that there will be a method to check a table which can be called up front before using it allowing a good error message to be produced before a table is used. And if only verification of the Mach-O file and its tables are wanted a new possible method checkAllTables() could be added to call all of the methods to check all the tables at some time when such methods exist. The checkSymbolTable() is the first of such methods to check one of the Mach-O file tables. This method initially will used in llvm-objdump’s DisassembleMachO() routine before it gets the section and symbol information. As if there are problems with the symbol table currently the error is first encountered by the bool operator() in the SymbolSorter() struct which passed to std::sort(). In this case there is no context as to the file name the symbol which results a poor error message: LLVM ERROR: truncated or malformed object (bad string index: 22 for symbol at index 1) with the added call to the checkSymbolTable() method the error message includes the tool name and file name: llvm-objdump: 'macho-invalid-symbol-strx': truncated or malformed object (bad string table index: 22 past the end of string table, for symbol at index 1) llvm-svn: 286887
*	[Hexagon] Give a predicate function a more meaningful name	Krzysztof Parzyszek	2016-11-14	2	-18/+18
\| \| \| \| \| \| \|	Change "orisadd" to "IsOrAdd" to follow the naming conventions, and change "isOrAdd" in the C++ code to "isOrEquivalentToAdd". llvm-svn: 286886
*	ARM: try to fix GCC 4.8 compilation again after r286881.	Tim Northover	2016-11-14	1	-1/+2
\| \| \| \|	llvm-svn: 286882
*	Recommit: ARM: sort register lists by encoding in push/pop instructions.	Tim Northover	2016-11-14	3	-2/+28
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	For example we were producing push {r8, r10, r11, r4, r5, r7, lr} This is misleading (r4, r5 and r7 are actually pushed before the rest), and other components (stack folding recently) often forget to deal with the extra complexity coming from the different order, leading to miscompiles. Finally, we warn about our own code in -no-integrated-as mode without this, which is really not a good idea. Fixed usage of std::sort so that we (hopefully) use instantiations that actually exist in GCC 4.8. llvm-svn: 286881
*	[AArch64] Change some pointers to references. NFC.	Geoff Berry	2016-11-14	1	-16/+16
\| \| \| \| \| \|	Follow-up change to r286875. llvm-svn: 286879
*	[AArch64] Split 0 vector stores into scalar store pairs.	Geoff Berry	2016-11-14	1	-4/+67
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Replace a splat of zeros to a vector store by scalar stores of WZR/XZR. The load store optimizer pass will merge them to store pair stores. This should be better than a movi to create the vector zero followed by a vector store if the zero constant is not re-used, since one instructions and one register live range will be removed. For example, the final generated code should be: stp xzr, xzr, [x0] instead of: movi v0.2d, #0 str q0, [x0] Reviewers: t.p.northover, mcrosier, MatzeB, jmolloy Subscribers: aemerson, rengolin, llvm-commits Differential Revision: https://reviews.llvm.org/D26561 llvm-svn: 286875
*	[AArch64] Factor out transform code from split16BStore. NFC.	Geoff Berry	2016-11-14	1	-24/+31
\| \| \| \|	llvm-svn: 286874
*	[ThinLTO] Only promote exported locals as marked in index	Teresa Johnson	2016-11-14	3	-16/+44
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: We have always speculatively promoted all renamable local values (except const non-address taken variables) for both the exporting and importing module. We would then internalize them back based on the ThinLink results if they weren't actually exported. This is inefficient, and results in unnecessary renames. It also meant we had to check the non-renamability of a value in the summary, which was already checked during function importing analysis in the ThinLink. Made renameModuleForThinLTO (which does the promotion/renaming) instead use the index when exporting, to avoid unnecessary renames/promotions. For importing modules, we can simply promoted all values as any local we import by definition is exported and needs promotion. This required changes to the method used by the FunctionImport pass (only invoked from 'opt' for testing) and when invoked from llvm-link, since neither does a ThinLink. We simply conservatively mark all locals in the index as promoted, which preserves the current aggressive promotion behavior. I also needed to change an llvm-lto based test where we had previously been aggressively promoting values that weren't importable (aliasees), but now will not promote. Reviewers: mehdi_amini Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D26467 llvm-svn: 286871
*	[libFuzzer] replace 'auto' with 'auto *' to better follow the LLVM style	Kostya Serebryany	2016-11-14	1	-3/+3
\| \| \| \|	llvm-svn: 286870
*	Revert: r286868 - Test commit	Daniel Sanders	2016-11-14	1	-1/+0
\| \| \| \|	llvm-svn: 286869
*	Test commit	Daniel Sanders	2016-11-14	1	-0/+1
\| \| \| \|	llvm-svn: 286868
*	Revert "ARM: sort register lists by encoding in push/pop instructions."	Tim Northover	2016-11-14	3	-28/+2
\| \| \| \| \| \| \|	This reverts commit 286866. It broke a bot, something to do with exactly which templates std::sort accepts. llvm-svn: 286867
*	ARM: sort register lists by encoding in push/pop instructions.	Tim Northover	2016-11-14	3	-2/+28
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	For example we were producing push {r8, r10, r11, r4, r5, r7, lr} This is misleading (r4, r5 and r7 are actually pushed before the rest), and other components (stack folding recently) often forget to deal with the extra complexity coming from the different order, leading to miscompiles. Finally, we warn about our own code in -no-integrated-as mode without this, which is really not a good idea. llvm-svn: 286866
*	[PPC] Add intrinsic mapping to the xscvhpsp instruction	Sean Fertile	2016-11-14	1	-0/+9
\| \| \| \| \| \| \| \| \|	add an intrinsic to expose the 'VSX Scalar Convert Half-Precision to Single-Precision' instruction. Differential review: https://reviews.llvm.org/D26536 llvm-svn: 286862
*	AMDGPU/SI: Support data types other than V4f32 in image intrinsics	Changpeng Fang	2016-11-14	2	-63/+73
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Extend image intrinsics to support data types of V1F32 and V2F32. TODO: we should define a mapping table to change the opcode for data type of V2F32 but just one channel is active, even though such case should be very rare. Reviewers: tstellarAMD Differential Revision: http://reviews.llvm.org/D26472 llvm-svn: 286860
*	Use _Unwind_Backtrace on Apple platforms.	Bob Wilson	2016-11-14	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \|	Darwin's backtrace() function does not work with sigaltstack (which was enabled when available with r270395) — it does a sanity check to make sure that the current frame pointer is within the expected stack area (which it is not when using an alternate stack) and gives up otherwise. The alternative of _Unwind_Backtrace seems to work fine on macOS, so use that when backtrace() fails. Note that we then use backtrace_symbols_fd() with the addresses from _Unwind_Backtrace, but I’ve tested that and it also seems to work fine. rdar://problem/28646552 llvm-svn: 286851
*	Typo	Adrian Prantl	2016-11-14	1	-1/+1
\| \| \| \|	llvm-svn: 286845
*	Restore "[ThinLTO] Prevent exporting of locals used/defined in module level asm"	Teresa Johnson	2016-11-14	4	-9/+64
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This restores the rest of r286297 (part was restored in r286475). Specifically, it restores the part requiring adding a dependency from the Analysis to Object library (downstream use changed to correctly model split BitReader vs BitWriter libraries). Original description of this part of patch follows: Module level asm may also contain defs of values. We need to prevent export of any refs to local values defined in module level asm (e.g. a ref in normal IR), since that also requires renaming/promotion of the local. To do that, the summary index builder looks at all values in the module level asm string that are not marked Weak or Global, which is exactly the set of locals that are defined. A summary is created for each of these local defs and flagged as NoRename. This required adding handling to the BitcodeWriter to look at GV declarations to see if they have a summary (rather than skipping them all). Finally, added an assert to IRObjectFile::CollectAsmUndefinedRefs to ensure that an MCAsmParser is available, otherwise the module asm parse would silently fail. Initialized the asm parser in the opt tool for use in testing this fix. Fixes PR30610. llvm-svn: 286844
*	[Hexagon] Remove unsafe load instructions that affect Stack Slot Coloring	Sumanth Gundapaneni	2016-11-14	1	-12/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The Stack slot coloring pass removes a store that is followed by a load that deal with the same stack slot. The function isLoadFromStackSlot is supposed to consider the loads that have no side-effects. This patch fixed the issue by removing the unsafe loads from this function Eg: %vreg0<def> = L2_loadruh_io <fi#15>, 0 S2_storeri_io <fi#15>, 0, %vreg0 In this case, we load an unsigned extended half word and store this in to the same stack slot. The Stack slot coloring pass considers safe to remove the store. This patch marked all the non-vector byte and half word loads as unsafe. llvm-svn: 286843
*	[ThinLTO] Make inline assembly handling more efficient in summary	Teresa Johnson	2016-11-14	4	-9/+20
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The change in r285513 to prevent exporting of locals used in inline asm added all locals in the llvm.used set to the reference set of functions containing inline asm. Since these locals were marked NoRename, this automatically prevented importing of the function. Unfortunately, this caused an explosion in the summary reference lists in some cases. In my particular example, it happened for a large protocol buffer generated C++ file, where many of the generated functions contained an inline asm call. It was exacerbated when doing a ThinLTO PGO instrumentation build, where the PGO instrumentation included thousands of private __profd_* values that were added to llvm.used. We really only need to include a single llvm.used local (NoRename) value in the reference list of a function containing inline asm to block it being imported. However, it seems cleaner to add a flag to the summary that explicitly describes this situation, which is what this patch does. Reviewers: mehdi_amini Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D26402 llvm-svn: 286840
*	[CostModel][X86] Added mul costs for vXi8 vectors	Simon Pilgrim	2016-11-14	1	-5/+21
\| \| \| \| \| \|	More realistic v16i8/v32i8/v64i8 MUL costs - we have to extend to vXi16, use PMULLW and then truncate the result llvm-svn: 286838
*	[X86][AVX] Fixed v16i16/v32i8 ADD/SUB costs on AVX1 subtargets	Simon Pilgrim	2016-11-14	1	-0/+4
\| \| \| \| \| \| \| \|	Add explicit v16i16/v32i8 ADD/SUB costs, matching the costs of v4i64/v8i32 - they were missing for some reason. This has side effects on the LV max bandwidth tests (AVX1 now prefers 128-bit vectors vs AVX2 which still prefers 256-bit) llvm-svn: 286832
*	[PPC] add intrinsics for vec extract exp/significand and vec test data class.	Sean Fertile	2016-11-14	1	-6/+18
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D26272 llvm-svn: 286829
*	Remove redundant condition (PR28352) NFCI.	Simon Pilgrim	2016-11-14	1	-2/+3
\| \| \| \| \| \|	We were already testing is the op was not a leaf, so need to then test if it was a leaf (added it to the assert instead). llvm-svn: 286817
*	[InlineCost] Remove skew when calculating call costs	James Molloy	2016-11-14	1	-1/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When calculating the cost of a call instruction we were applying a heuristic penalty as well as the cost of the instruction itself. However, when calculating the benefit from inlining we weren't discounting the equivalent penalty for the call instruction that would be removed! This caused skew in the calculation and meant we wouldn't inline in the following, trivial case: int g() { h(); } int f() { g(); } llvm-svn: 286814
*	Remove redundant condition (PR28800) NFCI.	Simon Pilgrim	2016-11-14	1	-2/+1
\| \| \| \| \| \| \| \| \| \|	'A \|\| (!A && B)' is equivalent to 'A \|\| B': (LoopCycle > DefCycle) \|\| (LoopCycle <= DefCycle && LoopStage <= DefStage) --> (LoopCycle > DefCycle) \|\| (LoopStage <= DefStage) llvm-svn: 286811
*	GlobalISel: Fix indentation. NFC	Diana Picus	2016-11-14	2	-4/+4
\| \| \| \|	llvm-svn: 286808
*	[JumpThreading] Prevent non-deterministic use lists	Pablo Barrio	2016-11-14	1	-8/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Unfolding selects was previously done with the help of a vector of pointers that was then sorted to be able to remove duplicates. As this sorting depends on the memory addresses, it was non-deterministic. A SetVector is used now so that duplicates are removed without the need of sorting first. Reviewers: mgrang, efriedma Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D26450 llvm-svn: 286807
*	Add explicit (void) cast to unused unique_ptr::release() results	Eric Fiselier	2016-11-14	1	-1/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch adds explicit `(void)` casts to discarded `release()` calls to suppress -Wunused-result. This patch fixes all warnings are generated as a result of [applying `[[nodiscard]]` within libc++](https://reviews.llvm.org/D26596). Similar fixes were applied to Clang in r286796. Reviewers: chandlerc, dberris Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D26598 llvm-svn: 286797
*	Demangle: only demangle mangled symbols	Saleem Abdulrasool	2016-11-14	1	-1/+10
\| \| \| \| \| \| \| \| \| \| \| \|	Only attempt to demangle symbols which have the itanium C++ prefix of `_Z`. This ensures that we do not treat any symbol name as a managled named. We would previously treat a C function `f` as a mangled name and decode that to `float` incorrectly. While it is easy to add tests for this, Mehdi recommended against introducing tests for the demangler as libc++abi should cover the testing. llvm-svn: 286795
*	[AVX-512] Add suffixless aliases for EVEX encoded ↵	Craig Topper	2016-11-14	1	-0/+10
\| \| \| \| \| \| \| \|	vcvtsi2ss/vcvtsi2sd/vcvtusi2ss/vcvtusi2sd. This matches the VEX behavior. Fixes another problem from PR28850. llvm-svn: 286790
*	[X86] Cleanup 'x' and 'y' mnemonic suffixes for ↵	Craig Topper	2016-11-14	3	-23/+71
\| \| \| \| \| \| \| \| \| \| \| \| \|	vcvtpd2dq/vcvttpd2dq/vcvtpd2ps and similar instructions. -Don't print the 'x' suffix for the 128-bit reg/mem VEX encoded instructions in Intel syntax. This is consistent with the EVEX versions. -Don't print the 'y' suffix for the 256-bit reg/reg VEX encoded instructions in Intel or AT&T syntax. This is consistent with the EVEX versions. -Allow the 'x' and 'y' suffixes to be used for the reg/mem forms when we're assembling using Intel syntax. -Allow the 'x' and 'y' suffixes on the reg/reg EVEX encoded instructions in Intel or AT&T syntax. This is consistent with what VEX was already allowing. This should fix at least some of PR28850. llvm-svn: 286787
*	[AVX-512] Remove and autoupgrade masked dword/qword variable shift ↵	Craig Topper	2016-11-14	2	-32/+35
\| \| \| \| \| \|	intrinsics to the new unmasked versions and selects. llvm-svn: 286786
*	[ValueTracking] recognize even more variants of smin/smax	Sanjay Patel	2016-11-13	1	-0/+20
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Similar to: https://reviews.llvm.org/rL285499 https://reviews.llvm.org/rL286318 We can't minimally expose this in IR tests because we don't have min/max intrinsics, but the difference is visible in codegen because SelectionDAGBuilder::visitSelect() uses matchSelectPattern(). We're not canonicalizing these patterns in IR (yet), so I don't expect there to be any regressions as noted here: http://lists.llvm.org/pipermail/llvm-dev/2016-November/106868.html llvm-svn: 286776
*	[AVX-512] Fix a disassembler failure for AVX-512 vcmpss/vcmpsd with an ↵	Craig Topper	2016-11-13	1	-4/+14
\| \| \| \| \| \| \| \|	immediate larger than 32. Fix the same bug with VLX vcmpps/vcmppd. Fixes PR24941. llvm-svn: 286775
*	[ValueTracking] move min/max matching to helper function; NFCI	Sanjay Patel	2016-11-13	1	-46/+59
\| \| \| \|	llvm-svn: 286772
*	[X86][IR] Reduce the number of full string comparisons in the code that ↵	Craig Topper	2016-11-13	1	-156/+173
\| \| \| \| \| \|	autoupgrades masked shift intrinsics. llvm-svn: 286768
*	AMDGPU: Implement SGPR spilling with scalar stores	Matt Arsenault	2016-11-13	3	-10/+153
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	nThis avoids the nasty problems caused by using memory instructions that read the exec mask while spilling / restoring registers used for control flow masking, but only for VI when these were added. This always uses the scalar stores when enabled currently, but it may be better to still try to spill to a VGPR and use this on the fallback memory path. The cache also needs to be flushed before wave termination if a scalar store is used. llvm-svn: 286766
*	revert commit r286761, some builds failed on Win platforms	Igor Breger	2016-11-13	2	-17/+4
\| \| \| \|	llvm-svn: 286765
*	[X86][AVX512] Removing llvm x86 intrinsics for _mm_mask_move_{ss\|sd} intrinsics.	Ayman Musa	2016-11-13	2	-4/+17
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D26128 llvm-svn: 286761
*	[X86][AVX512] Add patterns for all variants of VMOVSS/VMOVSD instructions.	Ayman Musa	2016-11-13	2	-0/+91
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D26022 llvm-svn: 286758
*	[InstCombine][AVX-512] Teach InstCombineCalls to handle the new unmasked ↵	Craig Topper	2016-11-13	1	-4/+18
\| \| \| \| \| \|	AVX-512 variable shift intrinsics. llvm-svn: 286755
*	[AVX-512] Add unmasked intrinsics for variable shifts of dwords and qwords.	Craig Topper	2016-11-13	1	-0/+8
\| \| \| \| \| \|	These will be used to replace the masked intrinsics so that InstCombineCalls can optimize the AVX-512 variable shifts the same way it does for AVX2. llvm-svn: 286754
*	[AMDGPU] Add f16 support (VI+)	Konstantin Zhuravlyov	2016-11-13	18	-238/+617
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D25975 llvm-svn: 286753
*	Bitcode: Change module reader functions to return an llvm::Expected.	Peter Collingbourne	2016-11-13	10	-103/+77
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D26562 llvm-svn: 286752
*	Analysis: Simplify the ScalarEvolution::getGEPExpr() interface. NFCI.	Peter Collingbourne	2016-11-13	3	-14/+10
\| \| \| \| \| \| \| \|	All existing callers were manually extracting information out of an existing GEP instruction and passing it to getGEPExpr(). Simplify the interface by changing it to take a GEPOperator instead. llvm-svn: 286751
*	Bitcode: More precise casting. NFCI.	Peter Collingbourne	2016-11-13	1	-3/+3
\| \| \| \|	llvm-svn: 286750
*	IR: Change the Type::get{Array,Vector,Pointer}ElementType() functions to ↵	Peter Collingbourne	2016-11-13	1	-1/+2
\| \| \| \| \| \| \| \|	perform the correct type assertion. Previously we were only asserting that the type was a sequential type. llvm-svn: 286749
*	[InstCombine][AVX-512] Expand vector shift handling to work on the AVX-512 ↵	Craig Topper	2016-11-13	1	-1/+45
\| \| \| \| \| \| \| \|	shift by immediate and shift by single value. This does not include support for the AVX-512 variable shifts. That will be coming in a future patch. llvm-svn: 286739