bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[X86] Tests for unsigned saturation downconvert detection.	Artur Gainullin	2018-04-14	1	-0/+310
\| \| \| \|	llvm-svn: 330088
*	[X86][MMX] Set PAVG/PHADD/PMIN/PMAX/PSIGN instructions to use same scheduler ↵	Simon Pilgrim	2018-04-14	1	-78/+78
\| \| \| \| \| \|	classes as SSE/AVX llvm-svn: 330085
*	MachO: trap unreachable instructions	Tim Northover	2018-04-13	10	-7/+35
\| \| \| \| \| \| \|	Debugability is more important than saving 4 bytes to let us to fall through to nonense. llvm-svn: 330073
*	Revert r329956, "AArch64: Introduce a DAG combine for folding offsets into ↵	Peter Collingbourne	2018-04-13	6	-172/+68
\| \| \| \| \| \| \| \| \| \|	addresses." Caused a hang and eventually an assertion failure in LTO builds of 7zip-benchmark on aarch64 iOS targets. http://green.lab.llvm.org/green/job/lnt-ctmark-aarch64-O3-flto/2024/ llvm-svn: 330063
*	[mips] Materialize constants for multiplication	Simon Dardis	2018-04-13	1	-223/+64
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Previously, the MIPS backend would alwyas break down constant multiplications into a series of shifts, adds, and subs. This patch changes that so the cost of doing so is estimated. The cost is estimated against worst case constant materialization and retrieving the results from the HI/LO registers. For cases where the value type of the multiplication is not legal, the cost of legalization is estimated and is accounted for before performing the optimization of breaking down the constant This resolves PR36884. Thanks to npl for reporting the issue! Reviewers: abeserminji, smaksimovic Differential Revision: https://reviews.llvm.org/D45316 llvm-svn: 330037
*	[ARM] FP16 vmaxnm/vminnm scalar instructions	Sjoerd Meijer	2018-04-13	3	-1/+790
\| \| \| \| \| \| \| \| \|	This adds code generation support for the FP16 vmaxnm/vminnm scalar instructions. Differential Revision: https://reviews.llvm.org/D44675 llvm-svn: 330034
*	[X86][AVX512] UNPCKL/H PS and PD should be scheduled with WriteFShuffle not ↵	Simon Pilgrim	2018-04-13	1	-192/+192
\| \| \| \| \| \|	WriteFAdd llvm-svn: 330023
*	[PostRASink]Add register dependency check for implicit operands	Jun Bum Lim	2018-04-13	1	-0/+63
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This change extend the register dependency check for implicit operands in Copy instructions. Fixes PR36902. Reviewers: thegameg, sebpop, uweigand, jnspaulsson, gberry, mcrosier, qcolombet, MatzeB Reviewed By: thegameg Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D44958 llvm-svn: 330018
*	[NEON] Support intrinsic for scalar and vector versions of the VRINTN ↵	Ivan A. Kosarev	2018-04-13	1	-0/+11
\| \| \| \| \| \| \| \|	instruction Differential Revision: https://reviews.llvm.org/D45514 llvm-svn: 330011
*	[NFC] fix trivial typos in comments	Hiroshi Inoue	2018-04-13	2	-2/+2
\| \| \| \| \| \|	"the the" -> "the", "we we" -> "we", etc llvm-svn: 330006
*	[X86] Introduce cldemote instruction	Gabor Buella	2018-04-13	1	-0/+21
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Hint to hardware to move the cache line containing the address to a more distant level of the cache without writing back to memory. Reviewers: craig.topper, zvi Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D45256 llvm-svn: 329992
*	[X86] Remove the pmuldq/pmuldq intrinsics and replace with native IR.	Craig Topper	2018-04-13	11	-1643/+2553
\| \| \| \| \| \| \| \|	This completes the work started in r329604 and r329605 when we changed clang to no longer use the intrinsics. We lost some InstCombine SimplifyDemandedBit optimizations through this change as we aren't able to fold 'and', bitcast, shuffle very well. llvm-svn: 329990
*	[PowerPC] add fsub-fneg test; NFC	Sanjay Patel	2018-04-12	1	-0/+21
\| \| \| \| \| \| \| \| \|	This is a test for a transform that was suggested in the post-commit mailing list thread for rL329821. The target in question is not in trunk, so PPC gets to stand in for it because it's the only in-tree target that sets 'isFPExtFree()' to 'true'. llvm-svn: 329963
*	AArch64: Introduce a DAG combine for folding offsets into addresses.	Peter Collingbourne	2018-04-12	6	-68/+172
\| \| \| \| \| \| \| \| \| \| \|	This is a code size win in code that takes offseted addresses frequently, such as C++ constructors that typically need to compute an offseted address of a vtable. This reduces the size of Chromium for Android's .text section by 108KB. Differential Revision: https://reviews.llvm.org/D45199 llvm-svn: 329956
*	[X86] Introduce LLVM wbinvd intrinsic	Gabor Buella	2018-04-12	1	-0/+19
\| \| \| \| \| \| \| \| \| \| \| \|	A previously missing intrinsic for an old instruction. Reviewers: craig.topper, echristo Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D45312 llvm-svn: 329936
*	[Power9]Legalize and emit code for converting (Un)Signed DWord to Quad-Precision	Lei Huang	2018-04-12	1	-0/+140
\| \| \| \| \| \| \| \| \| \| \|	Legalize and emit code for: * xscvsdqp * xscvudqp Differential Revision: https://reviews.llvm.org/D45230 llvm-svn: 329931
*	[AArch64] Move AFI->setRedZone(false) to top of emitPrologue	Jessica Paquette	2018-04-12	1	-5/+35
\| \| \| \| \| \| \| \| \| \| \| \| \|	AFI->setRedZone(false) was put in the wrong place before, and so it only fired on functions that didn't have stack frames. This moves that to the top of emitPrologue to make sure that every function without a redzone has it set correctly. This also adds a function representing one of the early exit cases (GHC calling convention) to the MachineOutliner noredzone test to ensure that we can outline from functions like these, where we never use a redzone. llvm-svn: 329922
*	revert r328921 - [DAGCombine] (float)((int) f) --> ftrunc (PR36617)	Sanjay Patel	2018-04-12	8	-83/+330
\| \| \| \| \| \| \|	This change is exposing UB in source code - as was warned/predicted. :) See D44909 for discussion. Reverting while we figure out how to fix things. llvm-svn: 329920
*	[RISCV] Change function alignment to 4 bytes, and 2 bytes for RVC	Shiva Chen	2018-04-12	1	-0/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: According RISC-V ELF psABI specification, base RV32 and RV64 ISAs only allow 32-bit instruction alignment, but instruction allow to be aligned to 16-bit boundaries for C-extension. So we just align to 4 bytes and 2 bytes for C-extension is enough. Reviewers: asb, apazos Differential Revision: https://reviews.llvm.org/D45560 Patch by Kito Cheng. llvm-svn: 329899
*	[MIPS GlobalISel] minor update to MIR tests added in r329819	Petar Jovanovic	2018-04-12	3	-12/+0
\| \| \| \| \| \| \| \|	Remove 'registers' section, as suggested (D. Sanders) at code review https://reviews.llvm.org/D44304 llvm-svn: 329888
*	[NFC] fix trivial typos in documents and comments	Hiroshi Inoue	2018-04-12	1	-1/+1
\| \| \| \| \| \|	"is is" -> "is", "if if" -> "if", "or or" -> "or" llvm-svn: 329878
*	[RISCV] Codegen support for RV32D floating point comparison operations	Alex Bradbury	2018-04-12	4	-0/+1327
\| \| \| \| \| \| \| \|	Also add double-prevoius-failure.ll which captures a test case that at one point triggered a compiler crash, while developing calling convention support for f64 on RV32D with soft-float ABI. llvm-svn: 329877
*	[RISCV] Codegen support for RV32D floating point conversion operations	Alex Bradbury	2018-04-12	2	-0/+106
\| \| \| \| \| \| \|	This also includes support and a test for truncating stores, which are now possible thanks to the fpround pattern. llvm-svn: 329876
*	[RISCV] Add codegen support for RV32D floating point arithmetic operations	Alex Bradbury	2018-04-12	1	-0/+256
\| \| \| \|	llvm-svn: 329874
*	[RISCV] Add tests missed in r329871	Alex Bradbury	2018-04-12	6	-0/+464
\| \| \| \|	llvm-svn: 329872
*	[PowerPC] Fix condition for 64-bit rotate when replacing r+r instr with r+i	Nemanja Ivanovic	2018-04-11	1	-0/+64
\| \| \| \| \| \| \| \| \| \|	This patch fixes https://bugs.llvm.org/show_bug.cgi?id=37039 The condition only covers one of the two 64-bit rotate instructions. This just adds the second (RLDICLo). Patch by Josh Stone. llvm-svn: 329852
*	Attempting to work around a non-determinism issue.	Puyan Lotfi	2018-04-11	1	-2/+0
\| \| \| \| \| \| \|	The main thing that matters with this test is that the COPYs are moved together not where the REG_SEQUENCES are. llvm-svn: 329850
*	[X86] Describe wbnoinvd instruction	Gabor Buella	2018-04-11	1	-0/+18
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Similar to the wbinvd instruction, except this one does not invalidate caches. Ring 0 only. The encoding matches a wbinvd instruction with an F3 prefix. Reviewers: craig.topper, zvi, ashlykov Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D43816 llvm-svn: 329847
*	[X86][Atom] Convert Atom scheduler model to SchedRW (PR32431)	Simon Pilgrim	2018-04-11	11	-159/+172
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Atom is the only x86 target that still uses schedule itineraries, if we can remove this then we can begin the work on removing x86 itineraries. I've also found that it will help with PR36550. I've focussed on matching the existing model as closely as possible (relying on the schedule tests), PR36895 indicated a lot of these were incorrect but we can just as easily fix these after this patch as before. Hopefully we can get llvm-exegesis to help here, There are a few instructions that rely on itinerary scheduling (mainly push/pop/return) of multiple resource stages, but I don't think any of these are show stoppers. There are also a few codegen changes that seem related to the post-ra scheduler acting a little differently, I haven't tracked these down but they don't seem critical. NOTE: I don't have access to any Atom hardware, so this hasn't been tested in the wild. Differential Revision: https://reviews.llvm.org/D45486 llvm-svn: 329837
*	[AMDGPU] Ensure there are enough registers for wave dispatch	Tim Renouf	2018-04-11	1	-0/+18
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This fixes the number of SGPRs and VGPRs in the *_RSRC1 register to allow for registers set up in wave dispatch, even if those registers are not used in the shader. Re-landed after noticing that the buildbot failure from 329808 seemed to be unrelated. Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D45503 Change-Id: I6575f0e0d2a528d1319d0b289f0ebe4510fa5771 llvm-svn: 329826
*	[FastISel] Disable local value sinking by default	Reid Kleckner	2018-04-11	26	-64/+64
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is causing compilation timeouts on code with long sequences of local values and calls (i.e. foo(1); foo(2); foo(3); ...). It turns out that code coverage instrumentation is a great way to create sequences like this, which how our users ran into the issue in practice. Intel has a tool that detects these kinds of non-linear compile time issues, and Andy Kaylor reported it as PR37010. The current sinking code scans the whole basic block once per local value sink, which happens before emitting each call. In theory, local values should only be introduced to be used by instructions between the current flush point and the last flush point, so we should only need to scan those instructions. llvm-svn: 329822
*	[DWARFv5] Fuss with asm syntax for conveying MD5 checksum.	Paul Robinson	2018-04-11	1	-3/+3
\| \| \| \| \| \| \| \| \| \|	Previously the MD5 option of the .file directive provided the checksum as a quoted hex string; now it's a normal hex number with 0x prefix, same as the .octa directive accepts. Differential Revision: https://reviews.llvm.org/D45459 llvm-svn: 329820
*	[MIPS GlobalISel] Select add i32, i32	Petar Jovanovic	2018-04-11	5	-0/+131
\| \| \| \| \| \| \| \| \| \| \| \| \|	Add the minimal support necessary to lower a function that returns the sum of two i32 values. Support argument/return lowering of i32 values through registers only. Add tablegen for regbankselect and instructionselect. Patch by Petar Avramovic. Differential Revision: https://reviews.llvm.org/D44304 llvm-svn: 329819
*	[AMDGPU] Fix lowering enqueue_kernel	Yaxun Liu	2018-04-11	1	-4/+17
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Two issues were fixed: runtime has difficulty to allocate memory for an external symbol of a kernel and set the address of the external symbol, therefore make the runtime handle of an enqueued kernel an ordinary global variable. Runtime only needs to store the address of the loaded kernel to the handle and has verified that this approach works. handle the situation where __enqueue_kernel* gets inlined therefore the enqueued kernel may be used through a constant expr instead of an instruction. Differential Revision: https://reviews.llvm.org/D45187 llvm-svn: 329815
*	Revert "[AMDGPU] Ensure there are enough registers for wave dispatch"	Tim Renouf	2018-04-11	1	-18/+0
\| \| \| \| \| \| \| \| \|	This reverts 329808. That change caused a report of a failure in test/CodeGen/MIR/AMDGPU/mir-canon-multi.mir that I didn't see. I suspect it is an expensive-check-only error. Change-Id: I8133f26f15e7d5ec2b09c687c12cd70e918461b0 llvm-svn: 329811
*	[AMDGPU] Ensure there are enough registers for wave dispatch	Tim Renouf	2018-04-11	1	-0/+18
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This fixes the number of SGPRs and VGPRs in the *_RSRC1 register to allow for registers set up in wave dispatch, even if those registers are not used in the shader. Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D45503 Change-Id: I6575f0e0d2a528d1319d0b289f0ebe4510fa5771 llvm-svn: 329808
*	[X86] Add variable shuffle schedule classes	Simon Pilgrim	2018-04-11	4	-62/+62
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Split variable index shuffles from immediate index shuffles WriteFVarShuffle - variable 'in-lane' shuffles (VPERMILPS/VPERMIL2PS etc.) WriteVarShuffle - variable 'in-lane' shuffles (PSHUFB/VPPERM etc.) WriteFVarShuffle256 - variable 'cross-lane' shuffles (VPERMPS etc.) WriteVarShuffle256 - variable 'cross-lane' shuffles (VPERMD etc.) Differential Revision: https://reviews.llvm.org/D45404 llvm-svn: 329806
*	[AArch64] Add test case for r329797	Francis Visoiu Mistrih	2018-04-11	1	-0/+17
\| \| \| \| \| \|	Forgot to add a test case in the previous commit. llvm-svn: 329805
*	[X86][SSE] Tweak cmpps schedule test so that it works properly with just sse1	Simon Pilgrim	2018-04-11	1	-4/+5
\| \| \| \| \| \|	movhps/movlps test are still broken so we can't disable sse2 yet llvm-svn: 329802
*	[ARM] FP16 VSEL codegen	Sjoerd Meijer	2018-04-11	1	-12/+142
\| \| \| \| \| \| \| \| \| \| \| \| \|	This is a follow up of rL327695 to instruction select more variants of VSELGT and VSELGE, for which it is necessary to custom lower SELECT. More work is required in this area, which will be addressed soon: - more variants need to be regression tested, but this depends on the next point. - first LowerConstantFP need to be adjusted for fp16 values. Differential Revision: https://reviews.llvm.org/D45205 llvm-svn: 329788
*	[X86] Remove 128/256-bit masked pmaddubsw and pmaddwd intrinsics. Replace ↵	Craig Topper	2018-04-11	4	-46/+171
\| \| \| \| \| \| \| \|	512-bit masked intrinsic with unmasked intrinsic and a select. The 128/256-bit versions were no longer used by clang. It uses the legacy SSE/AVX2 version and a select. The 512-bit was changed to the same for consistency. llvm-svn: 329774
*	[X86] In X86FlagsCopyLowering, when rewriting a memory setcc we need to emit ↵	Craig Topper	2018-04-11	1	-2/+1
\| \| \| \| \| \| \| \| \| \|	an explicit MOV8mr instruction. Previously the code only knew how to handle setcc to a register. This should fix a crash in the chromium build. llvm-svn: 329771
*	[X86] Switch a test from grep to FileCheck. NFC	Craig Topper	2018-04-11	1	-1/+11
\| \| \| \|	llvm-svn: 329769
*	Simplification of libcall like printf->puts must check for RtLibUseGOT metadata.	Sriraman Tallam	2018-04-10	1	-0/+19
\| \| \| \| \| \| \| \| \| \|	With -fno-plt, for example, calls to printf when getting converted to puts still use the PLT. This patch checks for the metadata "RtLibUseGOT" and annotates the declaration with the right attributes. Differential Revision: https://reviews.llvm.org/D45180 llvm-svn: 329768
*	GOTPCREL references must always use RIP.	Sriraman Tallam	2018-04-10	1	-0/+6
\| \| \| \| \| \| \| \|	With -fno-plt, global value references can use GOTPCREL and RIP must be used. Differential Revision: https://reviews.llvm.org/D45460 llvm-svn: 329765
*	AMDGPU: enable 128-bit for local addr space under an option	Marek Olsak	2018-04-10	6	-14/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Author: Samuel Pitoiset ds_read_b128 and ds_write_b128 have been recently enabled under the amdgpu-ds128 option because the performance benefit is unclear. Though, using 128-bit loads/stores for the local address space appears to introduce regressions in tessellation shaders. Not sure what is broken, but as ds_read_b128/ds_write_b128 are not enabled by default, just introduce a global option and enable 128-bit only if requested (until it's fixed/used correctly). v2: - fix regressions in merge-stores.ll and multiple_tails.ll Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=105464 llvm-svn: 329764
*	Disable flaky tests till they get fixed.	Galina Kistanova	2018-04-10	2	-1/+2
\| \| \| \|	llvm-svn: 329763
*	[AArch64][Falkor] Fix bug in Falkor HWPF collision avoidance pass.	Geoff Berry	2018-04-10	1	-0/+25
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: When inserting MOVs to avoid Falkor HWPF collisions, the non-base register operand of load instructions (e.g. a register offset) was not being considered live, so it could potentially have been used as a scratch register, clobbering the actual offset value. Reviewers: mcrosier Subscribers: rengolin, javed.absar, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D45502 llvm-svn: 329761
*	[MachO] Emit Weak ReadOnlyWithRel to ConstDataSection	Steven Wu	2018-04-10	1	-0/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Darwin dynamic linker can handle weak symbols in ConstDataSection. ReadonReadOnlyWithRel symbols should be emitted in ConstDataSection instead of normal DataSection. rdar://problem/39298457 Reviewers: dexonsmith, kledzik Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D45472 llvm-svn: 329752
*	[AArch64] Fix isel failure when BUILD_PAIR nodes are left over.	Amara Emerson	2018-04-10	1	-0/+13
\| \| \| \| \| \|	rdar://39175175 llvm-svn: 329743