bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message	Author	Age	Files	Lines
*	In the below scenario, we must be able to skip a DBG_VALUE instruction and remove the dead store.	Sumanth Gundapaneni	2017-01-09	1	-3/+8
	%vreg0<def> = L2_loadri_io <fi#15>, 0; mem:LD4[%dataF](align=4)
	DBG_VALUE %vreg0, %noreg, !"dataF", <!184>; IntRegs:%vreg0
	S2_storeri_io <fi#15>, 0, %vreg0; mem:ST4[%dataF]
	In reality, stores of this kind are eliminated before the Stack Slot Coloring pass, possibly in instruction lowering.
	Differential Revision: https://reviews.llvm.org/D26616
	llvm-svn: 291455
*	[X86][AVX512] Enable v16i8/v32i8 vector shifts to use an extend+shift+truncate pattern.	Simon Pilgrim	2017-01-09	7	-332/+296
	Use the existing AVX2 v8i16 vector shift lowering for v16i8 (extending to v16i32) on AVX512 targets and v32i8 (extending to v32i16) on AVX512BW targets. Cost model updates to follow. llvm-svn: 291451
*	fix comment typos; NFC	Sanjay Patel	2017-01-09	2	-7/+7
	llvm-svn: 291447
*	[X86][AVX512DQ] Enable v16i16 vector shifts to use an extend+shift+truncate pattern.	Simon Pilgrim	2017-01-09	7	-148/+60
	Use the existing AVX2 v8i16 vector shift lowering for v16i16 on AVX512 targets (AVX512BW will already have lowered with vpsravw). Cost model updates to follow. llvm-svn: 291445
*	[X86][AVX512DQ] Added AVX512DQ to 128/256 bit vector shift tests	Simon Pilgrim	2017-01-09	6	-84/+215
	llvm-svn: 291444
*	[IR] Adding const_value_op_iterator for IR/User.h	Mohammed Agabaria	2017-01-09	2	-0/+45
	const value op iterator is missing from User.h class. Differential Revision: https://reviews.llvm.org/D28464 llvm-svn: 291443
*	Some formatting in TargetMachineC. NFC	Amaury Sechet	2017-01-09	1	-2/+2
	llvm-svn: 291442
*	[SelectionDAG] Fix in legalization of UMAX/SMAX/UMIN/SMIN. Solves PR31486.	Bjorn Pettersson	2017-01-09	2	-2/+18
	Summary: Originally
	  i64 = umax t8, Constant:i64<4>
	was expanded into
	  i32,i32 = umax Constant:i32<0>, Constant:i32<0>
	  i32,i32 = umax t7, Constant:i32<4>
	Now the two produced umax nodes return i32 instead of i32, i32.
	Thanks to Jan Vesely for help with the test case.
	Patch by mikael.holmen at ericsson.com
	Reviewers: bogner, jvesely, tstellarAMD, arsenm
	Subscribers: test, wdng, RKSimon, arsenm, nhaehnle, llvm-commits
	Differential Revision: https://reviews.llvm.org/D28135
	llvm-svn: 291441
*	RuntimeDyldELF: add missing test cases for AArch64	Eugene Leviant	2017-01-09	2	-3/+43
	llvm-svn: 291438
*	Fix MSVC build failure introduced in r291431	Pavel Labath	2017-01-09	1	-4/+3
	MSVC does not like to reinterpret_cast to a uint64_t. Use a different cast instead. llvm-svn: 291435
*	RuntimeDyldELF: don't create thunk if not needed	Eugene Leviant	2017-01-09	3	-1/+61
	This patch doesn't create a thunk for a branch operation when the following conditions are met:
	- Architecture is AArch64
	- Relocation target is in the same object file
	- Relocation target is close enough to be encoded in the immediate offset
	In that case we branch directly to the target instead of branching to a thunk.
	Differential revision: https://reviews.llvm.org/D28108
	llvm-svn: 291431
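The "close enough" condition can be illustrated with a range check. This is a minimal sketch and not the code from the patch: it assumes the condition means that the target fits the AArch64 B/BL encoding, whose 26-bit immediate is scaled by 4 and therefore reaches roughly +/-128 MiB. The names `fitsInBranch26`, `branchOffset` and `kBranchRange` are hypothetical.

```cpp
#include <cstdint>

// Assumption: B/BL encode a signed 26-bit word offset, i.e. a byte offset in
// [-2^27, 2^27) that is a multiple of 4.
constexpr std::int64_t kBranchRange = std::int64_t{1} << 27; // 128 MiB

// Signed distance from the branch instruction to its target.
constexpr std::int64_t branchOffset(std::uint64_t source, std::uint64_t target) {
  return static_cast<std::int64_t>(target - source);
}

// True when a direct branch at `source` can reach `target` without a thunk.
constexpr bool fitsInBranch26(std::uint64_t source, std::uint64_t target) {
  return branchOffset(source, target) % 4 == 0 &&
         branchOffset(source, target) >= -kBranchRange &&
         branchOffset(source, target) < kBranchRange;
}
```

When the check fails, a loader would still need the thunk, since the branch immediate cannot hold the full distance.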
*	[PM] Teach SCEV to invalidate itself when its dependencies become invalid.	Chandler Carruth	2017-01-09	3	-0/+84
	This fixes use-after-free bugs that will arise with any interesting use of SCEV. I've added a dedicated test that works diligently to trigger these kinds of bugs in the new pass manager and also checks for them explicitly as well as triggering ASan failures when things go squirrelly. llvm-svn: 291426
*	[WebAssembly] Fix the opcode values for i64.eq and i64.ne.	Dan Gohman	2017-01-09	1	-2/+2
	llvm-svn: 291424
*	Remove unused method in LoopVectorize.cpp.	Jonas Paulsson	2017-01-09	1	-7/+0
	computeInterleaveCount() is not defined/used and is therefore removed. Review: Davide Italiano llvm-svn: 291423
*	NewGVN: Fix PR 31573, a failure to verify memory congruency due to not excluding ourselves when checking if any equivalent stores exist.	Daniel Berlin	2017-01-09	2	-1/+56
	llvm-svn: 291421
*	NewGVN: Change a std::vector to SmallVector and cleanup naming.	Daniel Berlin	2017-01-09	1	-10/+11
	llvm-svn: 291420
*	[AVX-512] Change another pattern that was using BLENDM to use masked moves.	Craig Topper	2017-01-09	3	-38/+47
	A future patch will convert it back to BLENDM if it's beneficial to register allocation. llvm-svn: 291419
*	[AVX-512] Add patterns to use a zero masked VPTERNLOG instruction for vselects of all ones and all zeros.	Craig Topper	2017-01-09	13	-217/+184
	Previously we emitted a VPTERNLOG and a separate masked move. llvm-svn: 291415
*	Define sys::path::convert_to_slash	Rui Ueyama	2017-01-09	3	-10/+20
	This patch moves convertToUnixPathSeparator from LLD to LLVM. Differential Revision: https://reviews.llvm.org/D28444 llvm-svn: 291414
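What a separator conversion of this kind does can be sketched in a few lines. This is an illustration of the idea only, not the LLVM implementation: the helper name `toSlash` is hypothetical, and the real `sys::path::convert_to_slash` may behave differently per host platform, whereas this sketch always replaces backslashes.

```cpp
#include <algorithm>
#include <cassert>
#include <string>

// Hypothetical stand-in: return a copy of `path` in which every backslash is
// replaced by a forward slash.
inline std::string toSlash(std::string path) {
  std::replace(path.begin(), path.end(), '\\', '/');
  return path;
}

// Usage: a Windows-style path and its normalized form.
inline void toSlashExample() {
  assert(toSlash("lib\\Support\\Path.cpp") == "lib/Support/Path.cpp");
}
```

Taking the argument by value lets the function modify and return its own copy, so callers keep their original string.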
*	CommandLine option: Relax the assertion introduced in r290467 to allow for an empty string	Mehdi Amini	2017-01-08	1	-1/+1
	This is used in LDC for custom boolean command-line options: setArgStr is called with an empty string before using AddLiteralOption. llvm-svn: 291406
*	[MemDep] NFC walk invariant.group graph only down	Piotr Padlewski	2017-01-08	3	-26/+120
	Summary: By using stripPointerCasts we can get to the root value and then walk down the bitcast graph. Reviewers: reames Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28181 llvm-svn: 291405
*	[LCSSA] Fix some typos. NFCI.	Davide Italiano	2017-01-08	1	-3/+3
	llvm-svn: 291404
*	[AVX-512] If avx512dq is available use vpmovm2d/vpmovm2q instead of vselect of zeroes/ones when handling sign extends of i1 without VLX.	Craig Topper	2017-01-08	2	-37/+101
	llvm-svn: 291402
*	[X86] Add avx512bw and avx512dq command lines to the vector compare results test.	Craig Topper	2017-01-08	1	-1498/+4602
	This is preparation for improving a case with avx512dq. llvm-svn: 291401
*	[SCCP] Unknown instructions are sent to overdefined anyway. NFCI.	Davide Italiano	2017-01-08	1	-18/+0
	llvm-svn: 291400
*	[Orc][RPC] Lock the pending results data structure when installing new result handlers, make abandonPendingResults public API.	Lang Hames	2017-01-08	1	-22/+51
	This should make installing asynchronous result handlers thread safe. The abandonPendingResults method is made public so that clients can disconnect from a remote even if they have asynchronous handlers awaiting results from that remote. The asynchronous handlers will all receive "abandoned result" errors as their argument. llvm-svn: 291399
*	llvm-objdump: speed up -objc-meta-data	Saleem Abdulrasool	2017-01-08	2	-26/+13
	Running a Debug build of objdump -objc-meta-data with a large Mach-O file is currently unnecessarily slow. With some local test input, this change reduces the run time from 75-85s down to 15-20s. The two changes are:
	- Assert on pointer equality, not array equality
	- Replace vector<pair<address, symbol>> with DenseMap<address, symbol>
	Additionally, use a std::unique_ptr rather than handling the memory manually.
	Patch by Dave Lee!
	llvm-svn: 291398
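The container change described above can be sketched as follows. This is an illustration under stated assumptions, not the code from the patch: `std::unordered_map` stands in for `llvm::DenseMap`, symbols are plain strings, and all function names and sample data are hypothetical.

```cpp
#include <cassert>
#include <cstdint>
#include <string>
#include <unordered_map>
#include <utility>
#include <vector>

using Address = std::uint64_t;
using SymbolList = std::vector<std::pair<Address, std::string>>;
using SymbolIndex = std::unordered_map<Address, std::string>;

// Before: every lookup scans the whole vector, O(n) per query.
inline const std::string *findLinear(const SymbolList &symbols, Address addr) {
  for (const auto &entry : symbols)
    if (entry.first == addr)
      return &entry.second;
  return nullptr;
}

// After: build the hash map once, then each query is expected O(1).
inline SymbolIndex buildIndex(const SymbolList &symbols) {
  return SymbolIndex(symbols.begin(), symbols.end());
}

inline const std::string *findHashed(const SymbolIndex &index, Address addr) {
  const auto it = index.find(addr);
  return it == index.end() ? nullptr : &it->second;
}

// Made-up sample data for the usage example.
inline SymbolList sampleSymbols() {
  return {{0x1000, "_main"}, {0x2000, "_helper"}};
}

// Usage: both lookups agree; only their cost differs.
inline void lookupExample() {
  const SymbolList list = sampleSymbols();
  const SymbolIndex index = buildIndex(list);
  assert(*findLinear(list, 0x2000) == *findHashed(index, 0x2000));
}
```

With many lookups against a large symbol table, the one-time cost of building the index is quickly repaid, which is consistent with the speedup the commit reports.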
*	Strip trailing whitespace.	Simon Pilgrim	2017-01-08	1	-1/+1
	llvm-svn: 291395
*	unittest: remove extraneous ';'	Saleem Abdulrasool	2017-01-08	1	-1/+1
	Silences a warning from gcc:6. NFC llvm-svn: 291394
*	Fix line endings and strip trailing whitespace.	Simon Pilgrim	2017-01-08	1	-71/+71
	llvm-svn: 291393
*	[x86] fix usage of stale operands when lowering select	Sanjay Patel	2017-01-08	2	-7/+10
	I noticed this problem as part of the ongoing attempt to canonicalize min/max ops in IR. The debug output shows nodes like this:
	  t4: i32 = xor t2, Constant:i32<-1>
	  t21: i8 = setcc t4, Constant:i32<0>, setlt:ch
	  t14: i32 = select t21, t4, Constant:i32<-1>
	And because the select is holding onto the t4 (xor) node while EmitTest creates a new x86-specific xor node, the lowering results in:
	  t4: i32 = xor t2, Constant:i32<-1>
	  t25: i32,i32 = X86ISD::XOR t2, Constant:i32<-1>
	  t28: i32,glue = X86ISD::CMOV Constant:i32<-1>, t4, Constant:i8<15>, t25:1
	Differential Revision: https://reviews.llvm.org/D28374
	llvm-svn: 291392
*	[CostModel][X86] Fixed vXi8 uniform shift costs.	Simon Pilgrim	2017-01-08	6	-45/+61
	The 'fast' costs should only work for shifts by uniform constants (uniform non-constant are lowered using the slow default implementation). Logical shifts were not taking into account that we must mask the psrlw result, so the costs needed to be doubled. Added missing AVX2/AVX512BW costs as well. llvm-svn: 291391
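Why the psrlw result needs a mask can be seen in a scalar model. The sketch below is an illustration only, not LLVM code: it packs two 8-bit lanes into one 16-bit word, shifts the word once (as psrlw would shift each 16-bit element), and then masks off the bits that leaked from the high byte into the low byte. The function name `lshrBytePair` is hypothetical.

```cpp
#include <cassert>
#include <cstdint>

// Logical right shift of two packed bytes by k bits, done as one 16-bit shift
// followed by a mask. The mask is the second operation the cost must include.
inline std::uint16_t lshrBytePair(std::uint16_t lanes, unsigned k) {
  assert(k < 8 && "shift amount must be smaller than the lane width");
  // One wide shift: the low k bits of the high byte spill into the low byte.
  const unsigned shifted = static_cast<unsigned>(lanes) >> k;
  // Keep only the low (8 - k) bits of each lane.
  const unsigned laneMask = 0xFFu >> k;
  const unsigned mask = laneMask | (laneMask << 8);
  return static_cast<std::uint16_t>(shifted & mask);
}
```

For example, shifting the byte pair 0x81, 0xFF right by one gives 0x40FF after the wide shift; masking with 0x7F7F yields the per-byte result 0x40, 0x7F.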
*	[CostModel][X86] Moved legal uniform shift costs earlier.	Simon Pilgrim	2017-01-08	3	-32/+44
	XOP was prematurely matching, doubling the cost of ashr/lshr uniform shifts. llvm-svn: 291390
*	[AVX-512] Remove redundant patterns that select unaligned moves with zero masking for patterns that already use the aligned form. NFC	Craig Topper	2017-01-08	1	-1/+1
	llvm-svn: 291383
*	[Orc][RPC] Fix typo.	Lang Hames	2017-01-08	1	-1/+1
	llvm-svn: 291381
*	[Orc][RPC] Add an APICalls utility for grouping RPC functions for registration.	Lang Hames	2017-01-08	2	-32/+150
	APICalls allows groups of functions to be composed into an API that can be registered as a unit with an RPC endpoint. Doing registration on a whole-API basis (rather than per-function) allows missing API functions to be detected early. APICalls also allows Function membership to be tested at compile-time. This allows clients to write static assertions that functions to be called are members of registered APIs. llvm-svn: 291380
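The compile-time membership test can be sketched with a variadic template. This is a minimal illustration of the technique, not the APICalls implementation: the names `ApiGroup`, `Contains`, `OpenFile`, `CloseFile`, `Ping` and `FileApi` are all hypothetical.

```cpp
#include <type_traits>

// A group of function tag types that can be queried at compile time.
template <typename... Funcs> struct ApiGroup;

// Empty group: contains nothing.
template <> struct ApiGroup<> {
  template <typename F> struct Contains : std::false_type {};
};

// Non-empty group: F is a member if it is the head or is in the tail.
template <typename Head, typename... Tail> struct ApiGroup<Head, Tail...> {
  template <typename F>
  struct Contains
      : std::integral_constant<
            bool, std::is_same<F, Head>::value ||
                      ApiGroup<Tail...>::template Contains<F>::value> {};
};

// Hypothetical RPC function tags and an API built from two of them.
struct OpenFile {};
struct CloseFile {};
struct Ping {};
using FileApi = ApiGroup<OpenFile, CloseFile>;

// A client can assert membership before issuing a call.
static_assert(FileApi::Contains<OpenFile>::value, "OpenFile is part of FileApi");
static_assert(!FileApi::Contains<Ping>::value, "Ping is not part of FileApi");
```

Because the check is a type-level computation, a call to a function outside the registered API fails at compile time rather than at run time.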
*	[ThinLTO] Fix lazy-loading of Metadata attachment, which left some Fwd ref behind	Mehdi Amini	2017-01-08	1	-1/+2
	The change in r291362 was too aggressive. We still need to flush at the end of the block because function local metadata can introduce fwd ref as well. (Bootstrap with ThinLTO was broken) llvm-svn: 291379
*	[ThinLTO] Expected<> return values need to be handled to avoid an assertion	Mehdi Amini	2017-01-08	1	-1/+8
	llvm-svn: 291377
*	[Orc][RPC] Add a class-method version of addHandler to MultiThreadedRPCEndpoint.	Lang Hames	2017-01-08	1	-0/+9
	This brings MultiThreadedRPCEndpoint's addHandler API in line with SingleThreadedRPCEndpoint's. This will be tested in an upcoming unit test for MultiThreadedRPCEndpoint. llvm-svn: 291376
*	[AVR] Implement TargetLowering::getRegisterByName	Dylan McKay	2017-01-07	3	-0/+61
	This allows the use of the 'read_register' intrinsics used by clang's named register globals features. llvm-svn: 291375
*	[Orc][RPC] Rename Single/MultiThreadedRPC to Single/MultithreadedRPCEndpoint.	Lang Hames	2017-01-07	3	-21/+22
	llvm-svn: 291374
*	[Orc][RPC] Remove a redundant 'if' statement.	Lang Hames	2017-01-07	1	-3/+1
	llvm-svn: 291373
*	[CostModel][X86] Update SSE41/AVX1 vXi32 SHL costs	Simon Pilgrim	2017-01-07	2	-12/+14
	SSE41 provides pmulld, which allows a simpler pslld/paddd/cvttps2dq/pmulld pattern than SSE2's use of pmuludq. llvm-svn: 291372
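The pslld/paddd/cvttps2dq/pmulld pattern can be modelled per lane in scalar code. The sketch below is one reading of that pattern and not LLVM's lowering code: it builds 2^n by writing n into the exponent field of a float, converts it back to an integer, and multiplies. The function name `shlViaMul` is hypothetical, and the conversion goes to an unsigned type (unlike cvttps2dq, which is signed) so that n == 31 stays well defined in C++.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Compute x << n for one 32-bit lane without a variable shift instruction.
inline std::uint32_t shlViaMul(std::uint32_t x, std::uint32_t n) {
  assert(n < 32 && "shift amount must be smaller than the lane width");
  // pslld + paddd: move n into the exponent field and add the bias (127 << 23).
  const std::uint32_t bits = (n << 23) + 0x3F800000u;
  // Reinterpret the bits as a float; its value is exactly 2^n.
  float pow2;
  std::memcpy(&pow2, &bits, sizeof pow2);
  // cvttps2dq step (unsigned here, see the note above): 2^n as an integer.
  const std::uint32_t factor = static_cast<std::uint32_t>(pow2);
  // pmulld: multiply and keep the low 32 bits, which equals x << n.
  return x * factor;
}
```

The multiply keeps only the low 32 bits of the product, which matches the wrap-around behaviour of a left shift.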
*	[AVX-512] Remove patterns from the other VBLENDM instructions. They are all redundant with masked move instructions.	Craig Topper	2017-01-07	13	-184/+305
	We should probably teach the two address instruction pass to turn masked moves into BLENDM when it's beneficial to the register allocator. llvm-svn: 291371
*	[X86] Regenerate a test to remove tab characters.	Craig Topper	2017-01-07	1	-4/+4
	llvm-svn: 291370
*	[AVX-512] Remove patterns from masked broadcast versions of BLENDM instructions.	Craig Topper	2017-01-07	1	-6/+3
	All but (v2f64 broadcast f64) are handled with VBROADCAST instructions. The v2f64 version can be handled with VMOVDDUP. We may want to consider converting to BLENDM instructions in the two address instruction pass if it's beneficial to register allocation. llvm-svn: 291369
*	[AVX-512] Add masked forms of the alternate MOVDDUP patterns.	Craig Topper	2017-01-07	2	-0/+52
	I'm not too sure how to get isel to select even all of the unmasked forms, but at least we have a consistent set now. llvm-svn: 291368
*	[CostModel][X86] Fix AVX2 v16i16 shift 'splat' costs.	Simon Pilgrim	2017-01-07	3	-14/+31
	llvm-svn: 291366
*	[CostModel][X86] Match 256-bit vector shift 'splat' costs for AVX2 and above	Simon Pilgrim	2017-01-07	4	-65/+64
	We were matching against general vector shift costs before the uniform splat costs. llvm-svn: 291365
*	[CostModel][X86] Generalized cost calculation of SHL by constant -> MUL conversion.	Simon Pilgrim	2017-01-07	1	-21/+10
	llvm-svn: 291364