bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[ARM] Update test to use CHECK-LABEL. NFCI.	Diana Picus	2016-07-18	1	-6/+8
\| \| \| \|	llvm-svn: 275777
*	[ARM] Skip inline asm memory operands in DAGToDAGISel	Diana Picus	2016-07-18	1	-0/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The current logic for handling inline asm operands in DAGToDAGISel interprets the operands by looking for constants, which should represent the flags describing the kind of operand we're dealing with (immediate, memory, register def etc). The operands representing actual data are skipped only if they are non-const, with the exception of immediate operands which are skipped explicitly when a flag describing an immediate is found. The oversight is that memory operands may be const too (e.g. for device drivers reading a fixed address), so we should explicitly skip the operand following a flag describing a memory operand. If we don't, we risk interpreting that constant as a flag, which is definitely not intended. Fixes PR26038 Differential Revision: https://reviews.llvm.org/D22103 llvm-svn: 275776
*	[AVX512] Add EVEX versions of scalar ADD/SUB/MUL/DIV to load folding tables.	Craig Topper	2016-07-18	1	-0/+137
\| \| \| \|	llvm-svn: 275775
*	[X86] Fix test checks to include leading 'v' on avx mnemonic names.	Craig Topper	2016-07-18	1	-13/+13
\| \| \| \|	llvm-svn: 275774
*	[ARM] Honour ABI for rem under -O0 for EABI, GNUEABI, Android and Musl	Diana Picus	2016-07-18	1	-12/+105
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	At higher optimization levels, we generate the libcall for DIVREM_Ix, which is fine: aeabi_{u\|i}divmod. At -O0 we generate the one for REM_Ix, which is the default {u}mod{q\|h\|s\|d}i3. This commit makes sure that we don't generate REM_Ix calls for ABIs that don't support them (i.e. where we need to use DIVREM_Ix instead). This is achieved by bailing out of FastISel, which can't handle non-double multi-reg returns, and letting the legalization infrastructure expand the REM_Ix calls. It also updates the divmod-eabi.ll test to run under -O0 as well, and adds some Windows checks to it to make sure we don't break things for it. Fixes PR27068 Differential Revision: https://reviews.llvm.org/D21926 llvm-svn: 275773
*	[X86] Add VPADD instructions to X86InstrInfo::isAssociativeAndCommutative.	Craig Topper	2016-07-18	8	-436/+458
\| \| \| \|	llvm-svn: 275769
*	[X86] Add floating point packed logical ops to ↵	Craig Topper	2016-07-18	2	-3/+3
\| \| \| \| \| \|	X86InstrInfo::isAssociativeAndCommutative. llvm-svn: 275768
*	[X86] Add AVX512 instructions to X86InstrInfo::isAssociativeAndCommutative.	Craig Topper	2016-07-18	1	-276/+276
\| \| \| \|	llvm-svn: 275767
*	[X86] Add AVX512 load opcodes and a couple AVX load opcodes to ↵	Craig Topper	2016-07-18	2	-22/+22
\| \| \| \| \| \|	X86InstrInfo::areLoadsFromSameBasePtr. llvm-svn: 275765
*	[X86] Add more opcodes to isFrameLoadOpcode/isFrameStoreOpcode. Mainly ↵	Craig Topper	2016-07-18	6	-144/+144
\| \| \| \| \| \|	AVX-512 related. llvm-svn: 275764
*	[AVX512] Use VMOVAPSZ128rr/VMOVAPS256rr for VR128X/VR256X physreg moves when ↵	Craig Topper	2016-07-18	13	-354/+381
\| \| \| \| \| \| \| \|	VLX is supported. Ideally we would use VEX encoded moves instead of EVEX if the high 16 registers aren't referenced, but this a good first step. llvm-svn: 275763
*	[GVNHoist] Change the key for VNtoInsns to a pair	David Majnemer	2016-07-18	1	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	While debugging GVNHoist, I found it confusing that the entries in a VNtoInsns were not always value numbers. They _usually_ were except for StoreInst in which case they were a hash of two different value numbers. This leads to two observations: - It is more difficult to debug things when the semantic contents of VNtoInsns changes over time. - Using a single value number is not much cheaper, the value of VNtoInsns is a SmallVector. - It is not immediately clear what the algorithm would do if there were hash collisions in the StoreInst case. Using a DenseMap of std::pair sidesteps all of this. N.B. The changes in the test were due their sensitivity to the iteration order of VNtoInsns which has changed. llvm-svn: 275761
*	[llvm-cov] Attempt to fix a test failure on Windows	Vedant Kumar	2016-07-18	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Don't make the test/tools/llvm-cov/demangle.test depend on the order in which symbols are seen, or on the exact formatting llvm-cov emits after a symbol is printed. This is an attempt to fix a Windows bot failure: http://lab.llvm.org:8011/builders/clang-x86-win2008-selfhost/builds/9141 I don't know what the root cause of the failure is, or why the showTemplateInstantiations test doesn't fail in the same way on the Windows bots. However, this measure can't hurt, and it'll at least get me on the blamelists again. llvm-svn: 275758
*	Revert r275678, "Revert "Revert r275027 - Let FuncAttrs infer the 'returned' ↵	NAKAMURA Takumi	2016-07-18	3	-6/+6
\| \| \| \| \| \| \| \| \| \|	argument attribute"" This reverts also r275029, "Update Clang tests after adding inference for the returned argument attribute" It broke LTO build. Seems miscompilation. llvm-svn: 275756
*	[GVN] Move other PRE tests to a subdirectory.	Davide Italiano	2016-07-17	5	-0/+0
\| \| \| \|	llvm-svn: 275742
*	[GVN] Move the PRE/LOADPRE test in a subdirectory.	Davide Italiano	2016-07-17	18	-0/+0
\| \| \| \|	llvm-svn: 275741
*	[GVN] Use FileCheck instead of grep for tests.	Davide Italiano	2016-07-17	18	-19/+135
\| \| \| \|	llvm-svn: 275739
*	[X86] Add CTPOP/CTLZ/CTTZ scalar cost tests	Simon Pilgrim	2016-07-17	1	-6/+171
\| \| \| \|	llvm-svn: 275725
*	[X86][AVX] Added VBROADCASTF128/VBROADCASTI128 tests	Simon Pilgrim	2016-07-17	2	-0/+240
\| \| \| \|	llvm-svn: 275713
*	[X86] Regenerated ctlz/cttz scalar tests for 32/64-bit targets with/without ↵	Simon Pilgrim	2016-07-17	1	-170/+640
\| \| \| \| \| \|	LZCNT/TZCNT support llvm-svn: 275710
*	[X86] Regenerated popcnt scalar tests for 32/64-bit targets with/without ↵	Simon Pilgrim	2016-07-17	1	-13/+230
\| \| \| \| \| \|	POPCNT support llvm-svn: 275709
*	[ThinLTO] Perform profile-guided indirect call promotion	Teresa Johnson	2016-07-17	2	-0/+39
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: To enable profile-guided indirect call promotion in ThinLTO mode, we simply add call graph edges for each profitable target from the profile to the summaries, then the summary-guided importing will consider the callee for importing as usual. Also we need to enable the indirect call promotion pass creation in the PassManagerBuilder when PerformThinLTO=true (we are in the ThinLTO backend), so that the newly imported functions are considered for promotion in the backends. The IC promotion profiles refer to callees by GUID, which required adding GUIDs to the per-module VST in bitcode (and assigning them valueIds similar to how they are assigned valueIds in the combined index). Reviewers: mehdi_amini, xur Subscribers: mehdi_amini, davidxl, llvm-commits Differential Revision: http://reviews.llvm.org/D21932 llvm-svn: 275707
*	X86: Updated a test file. NFC.	Elena Demikhovsky	2016-07-17	1	-79/+452
\| \| \| \| \| \| \|	This test shows subotimal code generated for AVX-512 vs PENTIUM4. The issue will be fixed in an upcomming commit. llvm-svn: 275702
*	[PM] Convert IVUsers analysis to new pass manager.	Dehao Chen	2016-07-16	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: Convert IVUsers analysis to new pass manager. Reviewers: davidxl, silvas Subscribers: junbuml, sanjoy, llvm-commits, mzolotukhin Differential Revision: https://reviews.llvm.org/D22434 llvm-svn: 275698
*	[InstCombine] allow X + signbit --> X ^ signbit for vector splats	Sanjay Patel	2016-07-16	1	-1/+1
\| \| \| \|	llvm-svn: 275691
*	add vector test to show missing transform	Sanjay Patel	2016-07-16	1	-0/+10
\| \| \| \|	llvm-svn: 275690
*	update tests to use FileCheck, consolidate tests, fix comments	Sanjay Patel	2016-07-16	3	-80/+106
\| \| \| \|	llvm-svn: 275688
*	update test to use FileCheck	Sanjay Patel	2016-07-16	1	-5/+10
\| \| \| \|	llvm-svn: 275687
*	auto-generate checks	Sanjay Patel	2016-07-16	1	-47/+54
\| \| \| \|	llvm-svn: 275686
*	auto-ggenerate checks	Sanjay Patel	2016-07-16	1	-42/+47
\| \| \| \|	llvm-svn: 275685
*	[InstCombine] reassociate logic ops with constants separated by a zext	Sanjay Patel	2016-07-16	1	-15/+10
\| \| \| \| \| \| \| \| \| \| \| \|	This is a partial implementation of a general fold for associative+commutative operators: (op (cast (op X, C2)), C1) --> (cast (op X, op (C1, C2))) (op (cast (op X, C2)), C1) --> (op (cast X), op (C1, C2)) There are 7 associative operators and 13 cast types, so this could potentially go a lot further. Differential Revision: https://reviews.llvm.org/D22421 llvm-svn: 275684
*	Revert "Revert r275027 - Let FuncAttrs infer the 'returned' argument attribute"	Hal Finkel	2016-07-16	3	-6/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This reverts commit r275042; the initial commit triggered self-hosting failures on ARM/AArch64. James Molloy identified the problematic backend code, which has been disabled in r275677. Trying again... Original commit message: Let FuncAttrs infer the 'returned' argument attribute A function can have one argument with the 'returned' attribute, indicating that the associated argument is always the return value of the function. Add FuncAttrs inference logic. llvm-svn: 275678
*	Disable this-return argument forwarding on ARM/AArch64	Hal Finkel	2016-07-16	3	-5/+5
\| \| \| \| \| \| \| \| \| \| \|	r275042 reverted function-attribute inference for the 'returned' attribute because the feature triggered self-hosting failures on ARM and AArch64. James Molloy determined that the this-return argument forwarding feature, which directly ties the returned input argument to the returned value, was the cause. It seems likely that this forwarding code contains, or triggers, a subtle bug. Disabling for now until we can track that down. llvm-svn: 275677
*	Re-commit [AMDGPU] Add metadata for runtime	Yaxun Liu	2016-07-16	1	-0/+848
\| \| \| \| \| \|	Attempting to fix lit test failure on ppc. llvm-svn: 275676
*	llc: Add support for -run-pass none	Matthias Braun	2016-07-16	174	-174/+174
\| \| \| \| \| \| \| \| \| \|	This does not schedule any passes besides the ones necessary to construct and print the machine function. This is useful to test .mir file reading and printing. Differential Revision: http://reviews.llvm.org/D22432 llvm-svn: 275664
*	ARM/MIR: Move test from MIR to CodeGen/ARM directory	Matthias Braun	2016-07-16	1	-1/+1
\| \| \| \| \| \| \| \| \| \|	test/CodeGen/MIR/ARM/ARMLoadStoreDBG.mir is an actual test for the ARM load store optimization pass and not a test of the mir parser/printer. It belongs to test/CodeGen/ARM; This also updates the test to use the new -run-pass llc syntax. llvm-svn: 275662
*	MIParser: reject subregister indexes on physregs	Matthias Braun	2016-07-16	1	-0/+12
\| \| \| \|	llvm-svn: 275658
*	[llvm-cov] Optionally use a symbol demangler when preparing reports	Vedant Kumar	2016-07-15	1	-0/+4
\| \| \| \| \| \| \| \| \| \|	Add an option to specify a symbol demangler (as well as options to the demangler). This can be used to make reports more human-readable. This option is especially useful in -output-dir mode, since it isn't as easy to manually pipe reports into a demangler in this mode. llvm-svn: 275640
*	AMDGPU: Fix verifier error from partially undef copy	Matt Arsenault	2016-07-15	1	-3/+67
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	In this situation: %VGPR2<def> = BUFFER_LOAD_DWORD_OFFSET %SGPR8_SGPR9_SGPR10_SGPR11, %VGPR7<def,tied3> = V_MAC_F32_e32 %VGPR0<undef>, %VGPR1<kill>, %VGPR7<kill,tied0>, %EXEC<imp-use> %VGPR3_VGPR4_VGPR5_VGPR6<def> = COPY %VGPR0_VGPR1_VGPR2_VGPR3 %VGPR4<def> = COPY %VGPR2 The copy for VGPR1 -> VGPR4 was an error from reading undefined VGPR1, but VGPR4 is defined immediately after this copy. llvm-svn: 275635
*	ExpandPostRAPseudos should transfer implicit uses, not only implicit defs	Michael Kuperstein	2016-07-15	2	-0/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Previously, we would expand: %BL<def> = COPY %DL<kill>, %EBX<imp-use,kill>, %EBX<imp-def> Into: %BL<def> = MOV8rr %DL<kill>, %EBX<imp-def> Dropping the imp-use on the floor. That confused CriticalAntiDepBreaker, which (correctly) assumes that if an instruction defs but doesn't use a register, that register is dead immediately before the instruction - while in this case, the high lanes of EBX can be very much alive. This fixes PR28560. Differential Revision: https://reviews.llvm.org/D22425 llvm-svn: 275634
*	[pdb] Teach MsfBuilder and other classes about the Free Page Map.	Zachary Turner	2016-07-15	2	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Block 1 and 2 of an MSF file are bit vectors that represent the list of blocks allocated and free in the file. We had been using these blocks to write stream data and other data, so we mark them as the free page map now. We don't yet serialize these pages to the disk, but at least we make a note of what it is, and avoid writing random data to them. Doing this also necessitated cleaning up some of the tests to be more general and hardcode fewer values, which is nice. llvm-svn: 275629
*	[pdb] Round trip the NameMap data structure to YAML.	Zachary Turner	2016-07-15	1	-0/+7
\| \| \| \|	llvm-svn: 275628
*	[pdb] Use MsfBuilder to handle the writing PDBs.	Zachary Turner	2016-07-15	3	-7/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Previously we would read a PDB, then write some of it back out, but write the directory, super block, and other pertinent metadata back out unchanged. This generates incorrect PDBs since the amount of data written was not always the same as the amount of data read. This patch changes things to use the newly introduced `MsfBuilder` class to write out a correct and accurate set of Msf metadata for the data actually written, which opens up the door for adding and removing type records, symbol records, and other types of data to an existing PDB. llvm-svn: 275627
*	StructurizeCFG: Fix inverting constantexpr conditions	Matt Arsenault	2016-07-15	1	-0/+30
\| \| \| \|	llvm-svn: 275626
*	AMDGPU: Remove brev intrinsic	Matt Arsenault	2016-07-15	1	-10/+0
\| \| \| \|	llvm-svn: 275620
*	AMDGPU: Fix TargetPrefix for remaining r600 intrinsics	Matt Arsenault	2016-07-15	36	-93/+93
\| \| \| \|	llvm-svn: 275619
*	AMDGPU: Remove AMDGPU.ldexp	Matt Arsenault	2016-07-15	1	-21/+0
\| \| \| \|	llvm-svn: 275618
*	AMDGPU: Remove legacy rsq.clamped intrinsic	Matt Arsenault	2016-07-15	7	-99/+59
\| \| \| \| \| \| \| \|	Mesa still has a use of llvm.AMDGPU.rsq.f64 remaining. Also fix mismatch with non-IEEE rsq selecting to IEEE rsq. llvm-svn: 275617
*	CodeGen: avoid emitting unnecessary CFI	Saleem Abdulrasool	2016-07-15	2	-4/+5
\| \| \| \| \| \| \| \| \| \| \| \|	Remove unnecessary clutter in assembly output. When using SjLj EH, the CFI is not actually used for anything. Do not emit the CFI needlessly. The minor test adjustments are interesting. The prologue test was just overzealous matcching. The interesting case is the LSDA change. It was originally added to ensure that various compilations did not mangle the name (it explicitly checked the name!). However, subsequent cleanups made it more reliant on the CFI to find the name. Parse the generated code flow to generically find the label still. llvm-svn: 275614
*	Teach fast isel about the win64 calling convention.	Nico Weber	2016-07-15	2	-4/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This mostly just works. Vectorcall rets are still not supported. The win64_eh test change is because fast isel doesn't use rsi for temporary computations, so it doesn't need to be pushed. The test case I'm changing was originally added to test pushes, but by now there are other test cases in that file exercising that code path. https://reviews.llvm.org/D22422 llvm-svn: 275607