bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[X86] Don't allow commuting to form phsub operations.	Craig Topper	2017-01-21	1	-2/+2
\| \| \| \| \| \|	Fixes PR31714. llvm-svn: 292713
*	Add missing dependency to "Module Summary Analysis" pass	Mehdi Amini	2017-01-21	1	-0/+1
\| \| \| \| \| \|	This is fixing a clang crash when running `clang -flto=thin -save-temps` llvm-svn: 292711
*	[PM] Sink an LCSSA preservation assert from the LoopSimplify pass into	Chandler Carruth	2017-01-21	2	-14/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	the library routine shared with the new PM and other code. This assert checks that when LCSSA preservation is requested we start in LCSSA form. Without this early assert, given very complex test cases we can hit an assert or crash much later on when trying to preserve LCSSA. The new PM's loop simplify doesn't need to (and indeed can't) preserve LCSSA as the new PM doesn't deal in transforms in the dependency graph. But we asked the library to and shockingly, this didn't work very well! Stop doing that. Now the assert will tell us immediately with existing test cases. Before this, it took a pretty convoluted input to trigger this. However, sinking the assert also found a bug in LoopUnroll where we asked simplifyLoop to preserve LCSSA right before we reform it. That's kinda silly and unsurprising that it wasn't available. =D Stop doing that too. We also would assert that the unrolled loop was in LCSSA even if preserving LCSSA was never requested! I don't have a test case or anything here. I spotted it by inspection and it seems quite obvious. No logic change anyways, that's just avoiding a spurrious assert. llvm-svn: 292710
*	LiveRegUnits: Add accumulateBackward() function	Matthias Braun	2017-01-21	2	-31/+42
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Re-Commit r292543 with a fix for the situation when the chain end is MBB.end(). This function can be used to accumulate the set of all read and modified register in a sequence of instructions. Use this code in AArch64A57FPLoadBalancing::scavengeRegister() to prove the concept. - The AArch64A57LoadBalancing code is using a backwards analysis now which is irrespective of kill flags. This is the main motivation for this change. Differential Revision: http://reviews.llvm.org/D22082 llvm-svn: 292705
*	[SLP] Make ReductionOpcode have the right (enum) type. NFC.	Michael Kuperstein	2017-01-21	1	-13/+10
\| \| \| \|	llvm-svn: 292703
*	MergeFunctions: Preserve debug info in thunks, under option ↵	Anmol P. Paralkar	2017-01-21	1	-22/+248
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	-mergefunc-preserve-debug-info Summary: Under option -mergefunc-preserve-debug-info we: - Do not create a new function for a thunk. - Retain the debug info for a thunk's parameters (and associated instructions for the debug info) from the entry block. Note: -debug will display the algorithm at work. - Create debug-info for the call (to the shared implementation) made by a thunk and its return value. - Erase the rest of the function, retaining the (minimally sized) entry block to create a thunk. - Preserve a thunk's call site to point to the thunk even when both occur within the same translation unit, to aid debugability. Note that this behaviour differs from the underlying -mergefunc implementation which modifies the thunk's call site to point to the shared implementation when both occur within the same translation unit. Reviewers: echristo, eeckstein, dblaikie, aprantl, friss Reviewed By: aprantl Subscribers: davide, fhahn, jfb, mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D28075 llvm-svn: 292702
*	LowerTypeTests: Fix use-after-free. Found by asan/msan.	Peter Collingbourne	2017-01-21	1	-2/+5
\| \| \| \|	llvm-svn: 292700
*	[AMDGPU] Fix build broken in r292688.	Eugene Zelenko	2017-01-21	1	-0/+1
\| \| \| \|	llvm-svn: 292699
*	[SLP] Delete useless helper. NFC.	Michael Kuperstein	2017-01-21	1	-14/+10
\| \| \| \| \| \| \|	The helper contained a branch for a special case that is unnecessary, and a cast. llvm-svn: 292698
*	[libFuzzer] Use CXX to set compiler to use	Vitaly Buka	2017-01-21	1	-1/+2
\| \| \| \| \| \| \| \| \| \|	Reviewers: kcc Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28976 llvm-svn: 292697
*	[libFuzzer] fix gcc build	Kostya Serebryany	2017-01-21	1	-1/+0
\| \| \| \|	llvm-svn: 292695
*	[NVPTX] Auto-upgrade some NVPTX intrinsics to LLVM target-generic code.	Justin Lebar	2017-01-21	2	-49/+86
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Specifically, we upgrade llvm.nvvm.: * brev{32,64} * clz.{i,ll} * popc.{i,ll} * abs.{i,ll} * {min,max}.{i,ll,u,ull} * h2f These either map directly to an existing LLVM target-generic intrinsic or map to a simple LLVM target-generic idiom. In all cases, we check that the code we generate is lowered to PTX as we expect. These builtins don't need to be backfilled in clang: They're not accessible to user code from nvcc. Reviewers: tra Subscribers: majnemer, cfe-commits, llvm-commits, jholewinski Differential Revision: https://reviews.llvm.org/D28793 llvm-svn: 292694
*	[NVPTX] Move getDivF32Level, usePrecSqrtF32, and useF32FTZ into out of ↵	Justin Lebar	2017-01-21	3	-46/+75
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	DAGToDAG and into TargetLowering. Summary: DADToDAG has access to TargetLowering, but not vice versa, so this is the more general location for these functions. NFC Reviewers: tra Subscribers: jholewinski, llvm-commits Differential Revision: https://reviews.llvm.org/D28795 llvm-svn: 292693
*	[ConstantFolding] Constant-fold llvm.sqrt(x) like other intrinsics.	Justin Lebar	2017-01-21	1	-13/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Currently we return undef, but we're in the process of changing the LangRef so that llvm.sqrt behaves like the other math intrinsics, matching the return value of the standard libcall but not setting errno. This change is legal even without the LangRef change because currently calling llvm.sqrt(x) where x is negative is spec'ed to be UB. But in practice it's also safe because we're simply constant-folding fewer inputs: Inputs >= -0 get constant-folded as before, but inputs < -0 now aren't constant-folded, because ConstantFoldFP aborts if the host math function raises an fp exception. Reviewers: hfinkel, efriedma, sanjoy Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28929 llvm-svn: 292692
*	[AMDGPU] Fix some Clang-tidy modernize and Include What You Use warnings; ↵	Eugene Zelenko	2017-01-21	8	-198/+270
\| \| \| \| \| \|	other minor fixes (NFC). llvm-svn: 292688
*	[libFuzzer] use print+exit(1) instead of assert to report an error	Kostya Serebryany	2017-01-21	2	-3/+5
\| \| \| \|	llvm-svn: 292685
*	[libFuzzer] re-enable LLVMFuzzer-RepeatedMemcmp test, cleanup the test ↵	Kostya Serebryany	2017-01-21	2	-20/+15
\| \| \| \| \| \|	runner nearby llvm-svn: 292683
*	[libFuzzer] call __sanitizer_dump_coverage via EF	Kostya Serebryany	2017-01-20	2	-1/+4
\| \| \| \|	llvm-svn: 292681
*	[PPC] Give unaligned memory access lower cost on processor that supports it	Guozhi Wei	2017-01-20	1	-0/+4
\| \| \| \| \| \| \| \| \| \|	Newer ppc supports unaligned memory access, it reduces the cost of unaligned memory access significantly. This patch handles this case in PPCTTIImpl::getMemoryOpCost. This patch fixes pr31492. Differential Revision: https://reviews.llvm.org/D28630 llvm-svn: 292680
*	[ThinLTO] Don't perform computeDeadSymbols during O0 link, as the result is ↵	Mehdi Amini	2017-01-20	1	-13/+13
\| \| \| \| \| \|	never used (NFC) llvm-svn: 292679
*	[NewGVN] Optimize processing for instructions found trivially dead.	Davide Italiano	2017-01-20	1	-3/+19
\| \| \| \| \| \| \| \| \| \| \|	Don't call `isTriviallyDeadInstructions()` once we discover that an instruction is dead. Instead, set DFS number zero (as suggested by Danny) and forget about it (this also speeds up things as we won't try to reprocess that block). Differential Revision: https://reviews.llvm.org/D28930 llvm-svn: 292676
*	GlobalISel: prevent heap use-after-free when looking up VReg.	Tim Northover	2017-01-20	1	-1/+3
\| \| \| \| \| \| \| \|	Translating the constant can create more VRegs, which can invalidate the reference into the DenseMap. So we have to look up the value again after all that's happened. llvm-svn: 292675
*	LowerTypeTests: Simplify; always create SizeM1 with type IntPtrTy, move ↵	Peter Collingbourne	2017-01-20	1	-7/+2
\| \| \| \| \| \|	initialization out of if statement. llvm-svn: 292674
*	Add indirect call promotion to SamplePGO	Dehao Chen	2017-01-20	1	-7/+49
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch adds metadata for indirect call promotion in the sample profile loader. Reviewers: xur, davidxl, dnovillo Reviewed By: davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28923 llvm-svn: 292672
*	[libFuzzer] Don't use `#ifdef` for defined macros, instead use `#if`.	Marcos Pividori	2017-01-20	1	-1/+1
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D28972 llvm-svn: 292670
*	[libFuzzer] Use clang as linker on Windows, to properly include sanitizer ↵	Marcos Pividori	2017-01-20	1	-0/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	libraries. In order to use sanitizers on Windows, we need to link against many runtime libraries which will depend on the target being created (executable or dll) and the c runtime library used (MT/MD). By default, cmake uses link.exe for linking, which fails because we don't specify the appropiate dependencies. As we don't want to consider all of that possible situations which depends on the implementation of the compiler-rt, the simplest option is to change the rules for linking executables and shared libraries, using the compiler instead of link.exe. Clang driver will consider the sanitizer flags, and automatically provide the required libraries to the linker. Differential Revision: https://reviews.llvm.org/D27869 llvm-svn: 292669
*	[libFuzzer] Properly use compiler options supported on Windows.	Marcos Pividori	2017-01-20	1	-1/+1
\| \| \| \| \| \| \| \|	Replace "-g" by "-gline-tables-only". "-g" is not supported by clang-cl. Differential Revision: https://reviews.llvm.org/D27868 llvm-svn: 292668
*	[ThinLTO] The "codegen only" path didn't honor the recently added file-based API	Mehdi Amini	2017-01-20	1	-15/+19
\| \| \| \|	llvm-svn: 292667
*	Improve PGO support for the new inliner	Easwaran Raman	2017-01-20	4	-27/+123
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This adds the following to the new PM based inliner in PGO mode: * Use block frequency analysis to derive callsite's profile count and use that to adjust thresholds of hot and cold callsites. * Incrementally update the BFI of the caller after a callee gets inlined into it. This incremental update is only within an invocation of the run method - BFI is not preserved across calls to run. Update the function entry count of the callee after inlining it into a caller. * I've tuned the thresholds for the hot and cold callsites using a hacked up version of the old inliner that explicitly computes BFI on a set of internal benchmarks and spec. Once the new PM based pipeline stabilizes (IIRC Chandler mentioned there are known issues) I'll benchmark this again and adjust the thresholds if required. Inliner PGO support. Differential revision: https://reviews.llvm.org/D28331 llvm-svn: 292666
*	[pdb] Write the Named Stream mapping to Yaml and binary.	Zachary Turner	2017-01-20	4	-12/+47
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D28919 llvm-svn: 292665
*	[pdb] Merge NamedStreamMapBuilder and NamedStreamMap.	Zachary Turner	2017-01-20	6	-71/+69
\| \| \| \| \| \| \| \| \| \| \|	While the builder pattern has proven useful for certain other larger types, in this case it was hampering the ability to use the data structure, as for runtime access we need a map that we can efficiently read from and write to. So the two are merged into a single data structure that can efficiently be read to, written from, deserialized from bytes, and serialized to bytes. llvm-svn: 292664
*	[PDB] Rename some files to be more intuitive.	Zachary Turner	2017-01-20	8	-44/+44
\| \| \| \|	llvm-svn: 292663
*	IPO, LTO: Plumb the summary from the LTO API into the pass manager.	Peter Collingbourne	2017-01-20	3	-9/+17
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D28840 llvm-svn: 292661
*	[ValueTracking] recognize variations of 'clamp' to improve codegen (PR31693)	Sanjay Patel	2017-01-20	1	-1/+39
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	By enhancing value tracking, we allow an existing min/max canonicalization to kick in and improve codegen for several targets that have min/max instructions. Unfortunately, recognizing min/max in value tracking may cause us to hit a hack in InstCombiner::visitICmpInst() more often: http://lists.llvm.org/pipermail/llvm-dev/2017-January/109340.html ...but I'm hoping we can remove that soon. Correctness proofs based on Alive: Name: smaxmin Pre: C1 < C2 %cmp2 = icmp slt i8 %x, C2 %min = select i1 %cmp2, i8 %x, i8 C2 %cmp3 = icmp slt i8 %x, C1 %r = select i1 %cmp3, i8 C1, i8 %min => %cmp2 = icmp slt i8 %x, C2 %min = select i1 %cmp2, i8 %x, i8 C2 %cmp1 = icmp sgt i8 %min, C1 %r = select i1 %cmp1, i8 %min, i8 C1 Name: sminmax Pre: C1 > C2 %cmp2 = icmp sgt i8 %x, C2 %max = select i1 %cmp2, i8 %x, i8 C2 %cmp3 = icmp sgt i8 %x, C1 %r = select i1 %cmp3, i8 C1, i8 %max => %cmp2 = icmp sgt i8 %x, C2 %max = select i1 %cmp2, i8 %x, i8 C2 %cmp1 = icmp slt i8 %max, C1 %r = select i1 %cmp1, i8 %max, i8 C1 ---------------------------------------- Optimization: smaxmin Done: 1 Optimization is correct! ---------------------------------------- Optimization: sminmax Done: 1 Optimization is correct! Name: umaxmin Pre: C1 u< C2 %cmp2 = icmp ult i8 %x, C2 %min = select i1 %cmp2, i8 %x, i8 C2 %cmp3 = icmp ult i8 %x, C1 %r = select i1 %cmp3, i8 C1, i8 %min => %cmp2 = icmp ult i8 %x, C2 %min = select i1 %cmp2, i8 %x, i8 C2 %cmp1 = icmp ugt i8 %min, C1 %r = select i1 %cmp1, i8 %min, i8 C1 Name: uminmax Pre: C1 u> C2 %cmp2 = icmp ugt i8 %x, C2 %max = select i1 %cmp2, i8 %x, i8 C2 %cmp3 = icmp ugt i8 %x, C1 %r = select i1 %cmp3, i8 C1, i8 %max => %cmp2 = icmp ugt i8 %x, C2 %max = select i1 %cmp2, i8 %x, i8 C2 %cmp1 = icmp ult i8 %max, C1 %r = select i1 %cmp1, i8 %max, i8 C1 ---------------------------------------- Optimization: umaxmin Done: 1 Optimization is correct! ---------------------------------------- Optimization: uminmax Done: 1 Optimization is correct! llvm-svn: 292660
*	[ThinLTO] Drop non-prevailing non-ODR weak to declarations	Teresa Johnson	2017-01-20	2	-11/+33
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Allow non-ODR weak/linkonce non-prevailing copies to be marked as available_externally in the index. Add support for dropping these to declarations in the backend. Reviewers: mehdi_amini, pcc Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28806 llvm-svn: 292656
*	LowerTypeTests: Implement importing of type identifiers.	Peter Collingbourne	2017-01-20	1	-2/+84
\| \| \| \| \| \| \| \| \|	To import a type identifier we read the summary and create external references to the symbols defined when exporting. Differential Revision: https://reviews.llvm.org/D28546 llvm-svn: 292654
*	[libFuzzer] add an assert to protect against LLVMFuzzerInitialize changing ↵	Kostya Serebryany	2017-01-20	4	-0/+22
\| \| \| \| \| \|	argv[0] llvm-svn: 292652
*	AMDGPU/R600: Serialize vector trunc stores to private AS	Jan Vesely	2017-01-20	5	-3/+44
\| \| \| \| \| \| \| \| \| \| \|	Add DUMMY_CHAIN SDNode to denote stores of interest Bugzilla: https://llvm.org/bugs/show_bug.cgi?id=28915 Bugzilla: https://llvm.org/bugs/show_bug.cgi?id=30411 Differential Revision: https://reviews.llvm.org/D27964 llvm-svn: 292651
*	NewGVN: Fix PR 31686 and PR 31698 by rewriting store leader handling.	Daniel Berlin	2017-01-20	1	-36/+41
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This rewrites store expression/leader handling. We no longer use the value operand as the leader, instead, we store it separately. We also now store the stored value as part of the expression, and compare it when comparing stores for equality. This enables us to get rid of a bunch of our previous hacks and machinations, as the existing machinery takes care of everything except updating the stored value on classes. The only time we have to update it is if the storecount goes to 0, and when we do, we destroy it. Since we no longer use the value operand as the leader, during elimination, we have to use the value operand. Doing this also fixes a bunch of store forwarding cases we were missing. Any value operand we use is guaranteed to either be updated by previous eliminations, or minimized by future ones. (IE the fact that we don't use the most dominating value operand when it's not a constant does not affect anything). Sadly, this change also exposes that we didn't pay attention to the output of the pr31594.ll test, as it also very clearly exposes the same store leader bug we are fixing here. (I added pr31682.ll anyway, but maybe we think that's too large to be useful) On the plus side, propagate-ir-flags.ll now passes due to the corrected store forwarding. This change was 3 stage'd on darwin and linux, with the full test-suite. Reviewers: davide Subscribers: llvm-commits llvm-svn: 292648
*	LowerTypeTests: Compute SizeM1BitWidth in exportTypeId. NFCI.	Peter Collingbourne	2017-01-20	1	-7/+6
\| \| \| \| \| \|	This avoids needing to store it in a separate field in TypeIdLowering. llvm-svn: 292647
*	[libFuzzer] experimental support for 'equivalance fuzzing'	Kostya Serebryany	2017-01-20	10	-7/+243
\| \| \| \|	llvm-svn: 292646
*	[WebAssembly] Don't create bitcast-wrappers for varargs.	Dan Gohman	2017-01-20	1	-0/+5
\| \| \| \| \| \| \| \| \|	WebAssembly varargs functions use a significantly different ABI than non-varargs functions, and the current code in WebAssemblyFixFunctionBitcasts doesn't handle that difference. For now, just avoid creating wrapper functions in the presence of varargs. llvm-svn: 292645
*	[ThinLTO] Fix lazy-loading of MDString instruction attachments	Mehdi Amini	2017-01-20	3	-6/+19
\| \| \| \| \| \| \| \| \|	CFI is using intrinsics that takes MDString as arguments, and this was broken during lazy-loading of metadata. Differential Revision: https://reviews.llvm.org/D28916 llvm-svn: 292641
*	[DWARF] [ObjectYAML] Adding APIs for unittesting	Chris Bieneman	2017-01-20	2	-34/+85
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch adds some new APIs to enable using the YAML DWARF representation in unit tests. The most basic new API is DWARFYAML::EmitDebugSections which converts a YAML string into a series of owned MemoryBuffer objects stored in a StringMap. The string map can then be used to construct a DWARFContext for parsing in place of an ObjectFile. Reviewers: dblaikie, clayborg Subscribers: mgorny, fhahn, jgosnell, aprantl, llvm-commits Differential Revision: https://reviews.llvm.org/D28828 llvm-svn: 292634
*	Recommit "[InlineCost] Use TTI to check if GEP is free." #3	Haicheng Wu	2017-01-20	1	-2/+18
\| \| \| \| \| \| \| \| \| \| \| \|	This is the third attemp to recommit r292526. The original summary: Currently, a GEP is considered free only if its indices are all constant. TTI::getGEPCost() can give target-specific more accurate analysis. TTI is already used for the cost of many other instructions. llvm-svn: 292633
*	ARM: tLDR_postidx should be marked mayLoad	Matthias Braun	2017-01-20	1	-1/+1
\| \| \| \| \| \|	This fixes -verify-machineinstrs complaints. llvm-svn: 292629
*	[InstCombine][X86] Add MULDQ/MULUDQ undef handling	Simon Pilgrim	2017-01-20	2	-0/+21
\| \| \| \|	llvm-svn: 292627
*	AArch64LoadStoreOptimizer: Update kill flags when merging stores	Matthias Braun	2017-01-20	1	-2/+23
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Kill flags need to be updated correctly when moving stores up/down to form store pair instructions. Those invalid flags have been ignored before but as of r290014 they are recognized when using -mllvm -verify-machineinstrs. Also simplifies test/CodeGen/AArch64/ldst-opt-dbg-limit.mir, renames it to ldst-opt.mir test and adds a new tests for this change. Differential Revision: https://reviews.llvm.org/D28875 llvm-svn: 292625
*	[mips] Fix debug information for __thread variable	Petar Jovanovic	2017-01-20	6	-1/+37
\| \| \| \| \| \| \| \| \| \| \|	This patch fixes debug information for __thread variable on Mips using .dtprelword and .dtpreldword directives. Patch by Aleksandar Beserminji. Differential Revision: http://reviews.llvm.org/D28770 llvm-svn: 292624
*	[AMDGPU] Fix some Clang-tidy modernize and Include What You Use warnings; ↵	Eugene Zelenko	2017-01-20	11	-200/+300
\| \| \| \| \| \|	other minor fixes (NFC). llvm-svn: 292623