bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[MergeICmps] Enable the MergeICmps Pass by default.	Clement Courbet	2018-02-06	3	-41/+16
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: Now that PR33325 is fixed, this should always improve the generated code. Reviewers: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D42793 llvm-svn: 324317
*	[X86] Modify a few tests to not use icmps that are provably false.	Craig Topper	2018-02-06	3	-17/+15
\| \| \| \| \| \| \| \|	These used things like unsigned less than zero, which is always false because there is no unsigned number less than zero. I plan to teach DAG combine to optimize these so need to stop using them. llvm-svn: 324315
*	AMDGPU/MemoryModel: Fix monotonic atomic loads	Konstantin Zhuravlyov	2018-02-06	1	-2/+2
\| \| \| \| \| \|	Those should have glc bit set for system and agent synchronization scopes llvm-svn: 324314
*	[WebAssembly] Fix test expectations after r324274	Derek Schuff	2018-02-06	2	-80/+33
\| \| \| \| \| \| \|	Wasm uses the expand action for several FP compare ops, and that behavior changed. llvm-svn: 324305
*	Update test expectations after reverting PLT change	Reid Kleckner	2018-02-06	2	-15/+15
\| \| \| \|	llvm-svn: 324304
*	Revert "Don't assume a null GV is local for ELF and MachO."	Reid Kleckner	2018-02-06	6	-20/+20
\| \| \| \| \| \| \| \|	This reverts r323297. It breaks building grub. llvm-svn: 324301
*	[X86] Auto-generate complete checks. NFC	Craig Topper	2018-02-05	1	-21/+54
\| \| \| \|	llvm-svn: 324295
*	[X86] Relax restrictions on what setcc condition codes can be folded with a ↵	Craig Topper	2018-02-05	2	-6/+6
\| \| \| \| \| \| \| \|	sext when AVX512 is enabled. We now allow all signed comparisons and not equal. The complement that needs to be added for this is no worse than the extend. And the vector output forms of pcmpeq/pcmpgt have better latency than the k-register version on SKX. llvm-svn: 324294
*	[LoopStrengthReduce, x86] don't add cost for a cmp that will be macro-fused ↵	Sanjay Patel	2018-02-05	1	-12/+18
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	(PR35681) In the motivating case from PR35681 and represented by the macro-fuse-cmp test: https://bugs.llvm.org/show_bug.cgi?id=35681 ...there's a 37 -> 31 byte size win for the loop because we eliminate the big base address offsets. SPEC2017 on Ryzen shows no significant perf difference. Differential Revision: https://reviews.llvm.org/D42607 llvm-svn: 324289
*	[PEI] Fix failing test caused by r324283	Francis Visoiu Mistrih	2018-02-05	1	-0/+6
\| \| \| \| \| \|	X86FrameLowering sets stack size to 0 if redzone is enabled. llvm-svn: 324285
*	[DWARF] Regularize dumping strings from line tables.	Paul Robinson	2018-02-05	3	-6/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The major visible difference here is that in line-table dumps, directory and file names are wrapped in double-quotes; previously, directory names got single quotes and file names were not quoted at all. The improvement in this patch is that when a DWARF v5 line table header has indirect strings, in a verbose dump these will all have their section[offset] printed as well as the name itself. This matches the format used for dumping strings in the .debug_info section. Differential Revision: https://reviews.llvm.org/D42802 llvm-svn: 324270
*	[X86] Teach DAG unfoldMemoryOperand to reconvert CMPs to tests	Nirav Dave	2018-02-05	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Copy MI-level cmp->test conversion to SelectionDAG-level memory unfold. This fixes a regression from upcoming D41293 change. Reviewers: craig.topper, RKSimon Reviewed By: craig.topper Subscribers: llvm-commits, hiraditya Differential Revision: https://reviews.llvm.org/D42808 llvm-svn: 324261
*	[X86] Artificially lower the complexity of the scalar ANDN patterns so that ↵	Craig Topper	2018-02-05	3	-20/+24
\| \| \| \| \| \| \| \| \| \| \| \|	AND with immediate will match first. This allows the immediate to folded into the and instead of being forced to move into a register. This can sometimes result in shorter encodings since the and can sign extend an immediate. This also allows us to match an and to a movzx after a not. This can cause an extra move if the input to the separate NOT has an additional user which requires a copy before the NOT. llvm-svn: 324260
*	[X86] Teach X86DAGToDAGISel::shrinkAndImmediate to preserve upper 32 zeroes ↵	Craig Topper	2018-02-05	2	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \|	of a 64 bit mask. If the upper 32 bits of a 64 bit mask are all zeros, we have special isel patterns to use a 32-bit and instead of a 64-bit and by relying on the impliciting zeroing of 32 bit ops. This patch teachs shrinkAndImmediate not to break that optimization. Differential Revision: https://reviews.llvm.org/D42899 llvm-svn: 324249
*	[Hexagon] Use V6_vmpyih for halfword multiplication	Krzysztof Parzyszek	2018-02-05	1	-4/+2
\| \| \| \| \| \| \|	Unlike V6_vmpyhv, it produces the result in the exact form that is expected without the need for a shuffle. llvm-svn: 324241
*	[PowerPC] Check hot loop exit edge in PPCCTRLoops	Hiroshi Inoue	2018-02-05	1	-0/+187
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	PPCCTRLoops transform loops using mtctr/bdnz instructions if loop trip count is known and big enough to compensate for the cost of mtctr. But if there is a loop exit edge which is known to be frequently taken (by builtin_expect or by PGO), we should not transform the loop to avoid the cost of mtctr instruction. Here is an example of a loop with hot exit edge: for (unsigned i = 0; i < TripCount; i++) { // do something if (__builtin_expect(check(), 1)) break; // do something } Differential Revision: https://reviews.llvm.org/D42637 llvm-svn: 324229
*	[X86] Add isel patterns for selecting masked SUBV_BROADCAST with bitcasts. ↵	Craig Topper	2018-02-05	1	-24/+194
\| \| \| \| \| \| \| \|	Remove combineBitcastForMaskedOp. Add test cases for the merge masked versions to make sure we have all those covered. llvm-svn: 324210
*	[X86] Remove X86ISD::SHUF128 from combineBitcastForMaskedOp. Use isel ↵	Craig Topper	2018-02-05	4	-28/+28
\| \| \| \| \| \| \| \| \| \|	patterns instead. We always created X86ISD::SHUF128 with a 64-bit element type so we can use isel patterns to detect a bitconvert to 32-bit to handle masking. The test changes are because we also match the bitconvert even if there is no masking. This leads to unnecessary isel pattern, but it requires more multiclass hackery in tablegen to get rid of it. llvm-svn: 324205
*	[X86] Auto-generate full checks. NFC	Craig Topper	2018-02-04	1	-6/+39
\| \| \| \|	llvm-svn: 324202
*	X86 Tests: Add shuffle that can be improved by widening elements. NFC	Zvi Rackover	2018-02-04	1	-0/+34
\| \| \| \| \| \|	To be improved by D42044 llvm-svn: 324200
*	[X86] Add DAG combine to turn (bitcast (and/or/xor (bitcast X), Y)) -> ↵	Craig Topper	2018-02-04	16	-1387/+1260
\| \| \| \| \| \| \| \| \| \|	(and/or/xor X, (bitcast Y)) when casting between GPRs and mask operations. This reduces the number of transitions between k-registers and GPRs, reducing the number of instructions. There's still some room for improvement to remove more transitions, but this is a good start. llvm-svn: 324184
*	[MIPS] Regenerate vector tests with update script	Simon Pilgrim	2018-02-03	1	-1316/+6782
\| \| \| \| \| \|	Hopefully help make this a lot more maintainable llvm-svn: 324180
*	[X86][SSE] Don't chain shuffles together in schedule tests	Simon Pilgrim	2018-02-03	4	-128/+201
\| \| \| \| \| \|	This is necessary to prevent the shuffles from being combined/simplified in an upcoming patch. llvm-svn: 324178
*	[X86] Remove and autoupgrade kand/kandn/kor/kxor/kxnor/knot intrinsics.	Craig Topper	2018-02-03	3	-111/+110
\| \| \| \| \| \| \| \|	Clang already stopped using these a couple months ago. The test cases aren't great as there is nothing forcing the operations to stay in k-registers so some of them moved back to scalar ops due to the bitcasts being moved around. llvm-svn: 324177
*	[RISCV] Update two RISCV codegen tests after rL323991	Alex Bradbury	2018-02-03	2	-4/+4
\| \| \| \| \| \| \|	From the discussion in D41835 it looks possible the change will be backed out, but for now let's fix the RISCV tests. llvm-svn: 324172
*	[X86] Add avx512 command line to ptest.ll to demonstrate that 512-bit ↵	Craig Topper	2018-02-02	1	-30/+111
\| \| \| \| \| \|	vectors are not handled by LowerVectorAllZeroTest. llvm-svn: 324130
*	Partially revert r324124 [X86] Add tests for missed opportunities to use ↵	Craig Topper	2018-02-02	1	-269/+0
\| \| \| \| \| \| \| \| \| \|	ptest for all ones comparison. Turns out I misunderstood the flag behavior of PTEST because I read the documentation for KORTEST which is different than PTEST/KTEST and made a bad assumption. Keep the test rename though cause that's useful. llvm-svn: 324129
*	[X86] Add tests for missed opportunities to use ptest for all ones comparison.	Craig Topper	2018-02-02	2	-243/+511
\| \| \| \| \| \|	Also rename the test from pr12312.ll to ptest.ll so its more recognizable. llvm-svn: 324124
*	[AMDGPU] Switch to the new addr space mapping by default	Yaxun Liu	2018-02-02	82	-2466/+2466
\| \| \| \| \| \| \| \|	This requires corresponding clang change. Differential Revision: https://reviews.llvm.org/D40955 llvm-svn: 324101
*	Add llc tests for comparison chains.	Clement Courbet	2018-02-02	2	-0/+116
\| \| \| \| \| \|	See https://reviews.llvm.org/D42793#996098 for context. llvm-svn: 324099
*	[X86][SSE] Force double domain for SHUFPD stack folding tests	Simon Pilgrim	2018-02-02	2	-3/+9
\| \| \| \|	llvm-svn: 324094
*	[ARM] fixed some tabs/whitespaces in test. NFC.	Sjoerd Meijer	2018-02-02	1	-6/+6
\| \| \| \|	llvm-svn: 324074
*	[SelectionDAG] Consider endianness in scalarizeVectorStore().	Jonas Paulsson	2018-02-02	2	-64/+66
\| \| \| \| \| \| \| \| \| \| \| \|	When handling vectors with non byte-sized elements, reverse the order of the elements in the built integer if the target is Big-Endian. SystemZ tests updated. Review: Eli Friedman, Ulrich Weigand. https://reviews.llvm.org/D42786 llvm-svn: 324063
*	[SystemZ] Update test case (NFC)	Jonas Paulsson	2018-02-02	1	-2/+2
\| \| \| \| \| \| \| \| \| \|	test/CodeGen/SystemZ/vec-trunc-to-i1.ll was marked as a temporary FAIL when it was previously updated when it needed one more COPY. This was however wrong, since the loop body had been reduced significantly, and it was actually an improvement. Review: Ulrich Weigand. llvm-svn: 324060
*	[X86] Legalize (v64i1 (bitcast (i64 X))) on 32-bit targets by extracting ↵	Craig Topper	2018-02-02	2	-3232/+116
\| \| \| \| \| \| \| \|	32-bit halves from i32, bitcasting each to v32i1, and concatenating. This prevents the scalarization that would otherwise occur. llvm-svn: 324057
*	[X86] Legalize (i64 (bitcast (v64i1 X))) on 32-bit targets by extracting to ↵	Craig Topper	2018-02-02	3	-490/+471
\| \| \| \| \| \| \| \|	v32i1 and bitcasting to i32. This saves a trip through memory and seems to open up other combining opportunities. llvm-svn: 324056
*	[RISCV] Define getSetCCResultType for setting vector setCC type	Shiva Chen	2018-02-02	1	-0/+35
\| \| \| \| \| \| \| \|	To avoid trigger "No default SetCC type for vectors!" Assertion Differential Revision: https://reviews.llvm.org/D42675 llvm-svn: 324054
*	[AArch64][GlobalISel] Fix old use of % sigil in test.	Amara Emerson	2018-02-02	1	-2/+2
\| \| \| \| \| \|	My rebase had missed the new $ sigil we're using. llvm-svn: 324051
*	[GlobalISel] Constrain the dest reg of IMPLICT_DEF.	Amara Emerson	2018-02-02	1	-0/+19
\| \| \| \| \| \| \| \| \| \|	This fixes a crash where the user is a COPY, which deliberately does not constrain its source operands, resulting in a vreg without a reg class escaping selection. Differential Revision: https://reviews.llvm.org/D42697 llvm-svn: 324047
*	SplitKit: Fix liveness recomputation in some remat cases.	Matthias Braun	2018-02-02	1	-0/+245
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Example situation: ``` BB0: %0 = ... use %0 ; ... condjump BB1 jmp BB2 BB1: %0 = ... ; rematerialized def from above (from earlier split step) jmp BB2 BB2: ; ... use %0 ``` %0 will have a live interval with 3 value numbers (for the BB0, BB1 and BB2 parts). Now SplitKit tries and succeeds in rematerializing the value number in BB2 (This only works because it is a secondary split so SplitKit is can trace this back to a single original def). We need to recompute all live ranges affected by a value number that we rematerialize. The case that we missed before is that when the value that is rematerialized is at a join (Phi VNI) then we also have to recompute liveness for the predecessor VNIs. rdar://35699130 Differential Revision: https://reviews.llvm.org/D42667 llvm-svn: 324039
*	Fix check-prefixes typo and line endings.	Simon Pilgrim	2018-02-01	1	-2/+2
\| \| \| \|	llvm-svn: 324024
*	[X86][SSE] Add SSE41 to variable permute tests	Simon Pilgrim	2018-02-01	1	-15/+144
\| \| \| \|	llvm-svn: 324017
*	[X86][XOP] Add XOP to variable permute tests	Simon Pilgrim	2018-02-01	2	-0/+598
\| \| \| \|	llvm-svn: 324015
*	[PowerPC] Tell VSX swap removal that scalar conversions are lane-sensitive	Nemanja Ivanovic	2018-02-01	1	-0/+18
\| \| \| \| \| \| \| \|	This is a rather non-controversial change. We were missing these instructions from the list of instructions that are lane-sensitive. These two put the result into lane 0 (BE) or 3 (LE) regardless of the input. This patch fixes PR36068. llvm-svn: 324005
*	[DAGCombiner] When folding (insert_subvector undef, (bitcast ↵	Craig Topper	2018-02-01	1	-0/+22
\| \| \| \| \| \| \| \| \| \| \| \|	(extract_subvector N1, Idx)), Idx) -> (bitcast N1) make sure that N1 has the same total size as the original output We were only checking the element count, but not the total width. This could cause illegal bitcasts to be created if for example the output was 512-bits, but N1 is 256 bits, and the extraction size was 128-bits. Fixes PR36199 Differential Revision: https://reviews.llvm.org/D42809 llvm-svn: 324002
*	[GlobalISel] Fix assert failure when legalizing non-power-2 loads.	Amara Emerson	2018-02-01	1	-0/+10
\| \| \| \| \| \| \|	Until we support extending loads properly we're going to fall back for these. We already handle stores in the same way, so this is just being consistent. llvm-svn: 324001
*	[MachineCopyPropagation] Extend pass to do COPY source forwarding	Geoff Berry	2018-02-01	115	-492/+583
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This change extends MachineCopyPropagation to do COPY source forwarding and adds an additional run of the pass to the default pass pipeline just after register allocation. This version of this patch uses the newly added MachineOperand::isRenamable bit to avoid forwarding registers is such a way as to violate constraints that aren't captured in the Machine IR (e.g. ABI or ISA constraints). This change is a continuation of the work started in D30751. Reviewers: qcolombet, javed.absar, MatzeB, jonpa, tstellar Subscribers: tpr, mgorny, mcrosier, nhaehnle, nemanjai, jyknight, hfinkel, arsenm, inouehrs, eraman, sdardis, guyblank, fedor.sergeev, aheejin, dschuff, jfb, myatsina, llvm-commits Differential Revision: https://reviews.llvm.org/D41835 llvm-svn: 323991
*	AMDGPU/SI: Adjust the encoding family for D16 buffer instructions when the ↵	Changpeng Fang	2018-02-01	4	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \|	target has UnpackedD16VMem feature. Reviewers: Matt and Brian Differential Revision: https://reviews.llvm.org/D42548 llvm-svn: 323988
*	[X86][SSE] LowerBUILD_VECTORAsVariablePermute - add support for scaling ↵	Simon Pilgrim	2018-02-01	2	-332/+150
\| \| \| \| \| \| \| \| \| \|	index vectors This allows us to use PSHUFB for v8i16/v4i32 and VPERMD/PERMPS for v4i64/v4f64 variable shuffles. Differential Revision: https://reviews.llvm.org/D42487 llvm-svn: 323987
*	[AArch64] add tests with sqrt estimate and ieee denorms; NFC	Sanjay Patel	2018-02-01	1	-0/+50
\| \| \| \| \| \|	As noted in D42323, we're not checking for denorms as we should. llvm-svn: 323985