bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[LoopDist] Split main class. NFC	Adam Nemet	2016-04-27	1	-86/+96
\| \| \| \| \| \| \| \| \| \|	This splits out the per-loop functionality from the Pass class. With this the fact whether the loop is forced-distribute with the new metadata/pragma can be cached in the per-loop class rather than passed around. llvm-svn: 267643
*	[LVI] Reduce compile time by lazily scanning blocks if needed	Philip Reames	2016-04-27	1	-26/+26
\| \| \| \| \| \| \| \| \| \|	When encountering a non-local pointer, LVI would eagerly scan the block for dereferences of the given object to prove the pointer to be non null. That's all well and good, but then we'd go recurse through our input blocks. As a result, we could end up scanning each and every block we traverse, even if the final definition was obviously non null or we found a constant value somewhere up the chain. The previous code papered over this by using the isKnownNonNull routine from value tracking. This made the duplication less painful in the common case. Instead, we know do the block scan only after we've gotten the recursive results back. This lets us stop scanning individual blocks as soon as we've determined it to be non-null in any predecessor block and use our usual merge rules to propagate that information cheaply through successor blocks. For a pointer which can be found non-null, this does strictly less work and sometimes substaintially so. Note that the case where we can't prove something non-null is still the really expensive case. We end up scanning each and every block looking for a dereference and never end up finding one. llvm-svn: 267642
*	[MachineInstrBundle] Actually set the PartialDeadDef flag only when the register	Quentin Colombet	2016-04-27	1	-1/+1
\| \| \| \| \| \| \| \| \|	is defined! The users were checking the proper thing (Defined + PartialDeadDef), but the information may have been wrong for other use cases, so fix that. llvm-svn: 267641
*	Add optimization bisect opt-in calls for SystemZ passes	Andrew Kaylor	2016-04-26	3	-0/+9
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D19562 llvm-svn: 267636
*	Add optimization bisect opt-in calls for NVPTX passes	Andrew Kaylor	2016-04-26	5	-1/+13
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D19518 llvm-svn: 267635
*	[X86] Make sure it is safe to clobber EFLAGS, if need be, when choosing	Quentin Colombet	2016-04-26	2	-0/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	the prologue. Do not use basic blocks that have EFLAGS live-in as prologue if we need to realign the stack. Realigning the stack uses AND instruction and this clobbers EFLAGS. An other alternative would have been to save and restore EFLAGS around the stack realignment code, but this is likely inefficient. Fixes PR27531. llvm-svn: 267634
*	PM: Port Reassociate to the new pass manager	Justin Bogner	2016-04-26	4	-140/+104
\| \| \| \|	llvm-svn: 267631
*	Reassociate: Convert another functor into a lambda. NFC	Justin Bogner	2016-04-26	1	-15/+13
\| \| \| \| \| \|	Also move the explanatory comment with it. llvm-svn: 267628
*	[LVI] Cut short search if we know we can't return a useful result	Philip Reames	2016-04-26	1	-9/+44
\| \| \| \| \| \|	Previously we were recursing on our operands for unary and binary operators regardless of whether we knew how to reason about the operator in question. This has the effect of doing a potentially large amount of work, only to throw it away. By checking whether the operation is one LVI can handle, we can cut short the search and return the (overdefined) answer more quickly. The quality of the results produced should not change. llvm-svn: 267626
*	[SimplifyCFG] propagate branch metadata when creating select	Sanjay Patel	2016-04-26	1	-1/+1
\| \| \| \|	llvm-svn: 267624
*	[X86] Teach the expansion of copy instructions how to do proper liveness.	Quentin Colombet	2016-04-26	1	-15/+22
\| \| \| \| \| \| \|	When the simple analysis provided by MachineBasicBlock::computeRegisterLiveness fails, fall back on the LivePhysReg utility. llvm-svn: 267623
*	[MachineBasicBlock] Take advantage of the partially dead information.	Quentin Colombet	2016-04-26	1	-2/+9
\| \| \| \| \| \| \|	Thanks to that information we wouldn't lie on a register being live whereas it is not. llvm-svn: 267622
*	[MachineInstrBundle] Improvement the recognition of dead definitions.	Quentin Colombet	2016-04-26	1	-3/+7
\| \| \| \| \| \| \|	Now, it is possible to know that partial definitions are dead definitions and recognize that clobbered registers are also dead. llvm-svn: 267621
*	[LVI] Apply transfer rule for overdefine inputs for binary operators	Philip Reames	2016-04-26	1	-11/+16
\| \| \| \| \| \| \| \|	As pointed out by John Regehr over in http://reviews.llvm.org/D19485, LVI was being incredibly stupid about applying its transfer rules. Rather than gathering local facts from the expression itself, it was simply giving up entirely if one of the inputs was overdefined. This greatly impacts the precision of the overall analysis and makes it far more fragile as well. This patch builds on 267609 which did the same thing for unary casts. llvm-svn: 267620
*	[NVPTX] Fix some usages of CodeGenOpt::None.	Jingyue Wu	2016-04-26	1	-5/+9
\| \| \| \| \| \| \| \| \| \|	NVPTXLowerKernelArgs is required for correctness, so it should not be guarded by CodeGenOpt::None. NVPTXPeephole is optimization only, so it should be skipped when CodeGenOpt::None. llvm-svn: 267619
*	[LVI] A better fix for the assertion error introduced by 267609	Philip Reames	2016-04-26	1	-10/+11
\| \| \| \| \| \|	Essentially, I was using the wrong size function. For types which were sized, but not primitive, I wasn't getting a useful size for the operand and failed an assert. I fixed this, and also added a guard that the input is a sized type. Test case is for the original mistake. I'm not sure how to actually exercise the sized type check. llvm-svn: 267618
*	[LVI] Speculative fix for assertion seen in clang bots	Philip Reames	2016-04-26	1	-0/+6
\| \| \| \| \| \|	I'll clean this up and add a test case shortly. I want to make sure this does actually fix the bots; if not, I'll revert. llvm-svn: 267617
*	[LowerExpectIntrinsic] make default likely/unlikely ratio bigger	Sanjay Patel	2016-04-26	1	-6/+18
\| \| \| \| \| \| \| \| \| \|	We need the default ratio to be sufficiently large that it triggers transforms based on block frequency info (BFI) and plays well with the recently introduced BranchProbability used by CGP. Differential Revision: http://reviews.llvm.org/D19435 llvm-svn: 267615
*	Reassociate: Simplify using lambdas. NFC	Justin Bogner	2016-04-26	1	-17/+7
\| \| \| \|	llvm-svn: 267614
*	[LVI] Infer local facts from unary expressions	Philip Reames	2016-04-26	1	-16/+20
\| \| \| \| \| \| \| \| \| \|	As pointed out by John Regehr over in http://reviews.llvm.org/D19485, LVI was being incredibly stupid about applying its transfer rules. Rather than gathering local facts from the expression itself, it was simply giving up entirely if one of the inputs was overdefined. This greatly impacts the precision of the overall analysis and makes it far more fragile as well. This patch implements only the unary operation case. Once this is in, I'll implement the same for the binary operations. Differential Revision: http://reviews.llvm.org/D19492 llvm-svn: 267609
*	Optimization bisect support in X86-specific passes	Andrew Kaylor	2016-04-26	5	-3/+13
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D19439 llvm-svn: 267608
*	[CodeGen] Add getBuildVector and getSplatBuildVector helpers. NFCI.	Ahmed Bougacha	2016-04-26	12	-178/+141
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D17176 llvm-svn: 267606
*	Revert "[SimplifyLibCalls] sprintf doesn't copy null bytes"	David Majnemer	2016-04-26	1	-4/+3
\| \| \| \| \| \| \| \| \| \|	The destination buffer that sprintf uses is restrict qualified, we do not need to worry about derived pointers referenced via format specifiers. This reverts commit r267580. llvm-svn: 267605
*	Remove more unused variables.	Zachary Turner	2016-04-26	1	-3/+0
\| \| \| \|	llvm-svn: 267598
*	Masked Store in Loop Vectorizer - bugfix	Elena Demikhovsky	2016-04-26	1	-13/+9
\| \| \| \| \| \| \| \|	Fixed a bug in loop vectorization with conditional store. Differential Revision: http://reviews.llvm.org/D19532 llvm-svn: 267597
*	PM: Port Internalize to the new pass manager	Justin Bogner	2016-04-26	5	-100/+77
\| \| \| \|	llvm-svn: 267596
*	[llvm-pdbdump] Fix version reading on big endian systems.	Zachary Turner	2016-04-26	1	-1/+2
\| \| \| \|	llvm-svn: 267595
*	Add optimization bisect opt-in calls for Hexagon passes	Andrew Kaylor	2016-04-26	18	-1/+52
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D19509 llvm-svn: 267593
*	Fix warnings and -Werror build on clang.	Zachary Turner	2016-04-26	1	-1/+1
\| \| \| \|	llvm-svn: 267589
*	Parse and dump PDB DBI Stream Header Information	Zachary Turner	2016-04-26	5	-1/+191
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The DBI stream contains a lot of bookkeeping information for other streams. In particular it contains information about section contributions and linked modules. This patch is a first attempt at parsing some of the information out of the DBI stream. It currently only parses and dumps the headers of the DBI stream, so none of the module data or section contribution data is pulled out. This is just a proof of concept that we understand the basic properties of the DBI stream's metadata, and followup patches will try to extract more detailed information out. Differential Revision: http://reviews.llvm.org/D19500 Reviewed By: majnemer, ruiu llvm-svn: 267585
*	[Tail duplication] Handle source registers with subregisters	Krzysztof Parzyszek	2016-04-26	1	-34/+80
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	When a block is tail-duplicated, the PHI nodes from that block are replaced with appropriate COPY instructions. When those PHI nodes contained use operands with subregisters, the subregisters were dropped from the COPY instructions, resulting in incorrect code. Keep track of the subregister information and use this information when remapping instructions from the duplicated block. Differential Revision: http://reviews.llvm.org/D19337 llvm-svn: 267583
*	Reapply: "ARM: put correct symbol index on indirect pointers in __thread_ptr.""	Tim Northover	2016-04-26	1	-1/+2
\| \| \| \| \| \| \|	A latent bug in llvm-objdump used the wrong format specifier on 32-bit targets, causing the test to fail. This fixes the issue. llvm-svn: 267582
*	[SimplifyLibCalls] sprintf doesn't copy null bytes	David Majnemer	2016-04-26	1	-3/+4
\| \| \| \| \| \| \| \| \| \|	sprintf doesn't read or copy the terminating null byte from it's string operands. sprintf will append it's own after processing all of the format specifiers. This fixes PR27526. llvm-svn: 267580
*	Swift Calling Convention: use %RAX for sret.	Manman Ren	2016-04-26	3	-1/+16
\| \| \| \| \| \| \|	We don't need to copy the sret argument into %rax upon return. rdar://25671494 llvm-svn: 267579
*	[AMDGPU] Move reserved vgpr count for trap handler usage to ↵	Konstantin Zhuravlyov	2016-04-26	6	-9/+20
\| \| \| \| \| \| \| \|	SIMachineFunctionInfo + minor commenting changes Differential Revision: http://reviews.llvm.org/D19537 llvm-svn: 267573
*	[CodeGenPrepare] use branch weight metadata to decide if a select should be ↵	Sanjay Patel	2016-04-26	4	-33/+60
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	turned into a branch This is part of solving PR27344: https://llvm.org/bugs/show_bug.cgi?id=27344 CGP should undo the SimplifyCFG transform for the same reason that earlier patches have used this same mechanism: it's possible that passes between SimplifyCFG and CGP may be able to optimize the IR further with a select in place. For the TLI hook default, >99% taken or not taken is chosen as the default threshold for a highly predictable branch. Even the most limited HW branch predictors will be correct on this branch almost all the time, so even a massive mispredict penalty perf loss would be overcome by the win from all the times the branch was predicted correctly. As a follow-up, we could make the default target hook less conservative by using the SchedMachineModel's MispredictPenalty. Or we could just let targets override the default by implementing the hook with that and other target-specific options. Note that trying to statically determine mispredict rates for close-to-balanced profile weight data is generally impossible if the HW is sufficiently advanced. Ie, 50/50 taken/not-taken might still be 100% predictable. Finally, note that this patch as-is will not solve PR27344 because the current __builtin_unpredictable() branch weight default values are 4 and 64. A proposal to change that is in D19435. Differential Revision: http://reviews.llvm.org/D19488 llvm-svn: 267572
*	Fix build broken due to order of initialization problem.	Zachary Turner	2016-04-26	1	-2/+1
\| \| \| \|	llvm-svn: 267571
*	Refactor some more PDB reading code into DebugInfoPDB.	Zachary Turner	2016-04-26	3	-0/+164
\| \| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D19445 Reviewed By: David Majnemer llvm-svn: 267564
*	[AMDGPU] Reserve VGPRs for trap handler usage if instructed	Konstantin Zhuravlyov	2016-04-26	6	-1/+48
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D19235 llvm-svn: 267563
*	Use gcc's rules for parsing gcc-style response files	Nico Weber	2016-04-26	1	-7/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In gcc, \ escapes every character in response files. It is true that this makes it harder to mention Windows files in rsp files, but not doing this means clang disagrees with gcc, and also disagrees with the shell (on non-Windows) which rsp file quoting is supposed to match. clang isn't free to choose what to do here. In general, the idea for response files is to take bits of your command line and write them to a file unchanged, and have things work the same way. Since the command line would've been interpreted by the shell, things in the rsp file need to be subject to the same shell quoting rules. People who want to put Windows-style paths in their response files either need to do any of: * escape their backslashes * or use clang-cl which uses cl.exe/cmd.exe quoting rules * pass --rsp-quoting=windows to clang to tell it to use cl.exe/cmd.exe quoting rules for response files. Fixes PR27464. http://reviews.llvm.org/D19417 llvm-svn: 267556
*	[AMDGPU] Assembler: basic support for SDWA instructions	Sam Kolton	2016-04-26	8	-58/+414
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Support for SDWA instructions for VOP1 and VOP2 encoding. Not done yet: - converters for support optional operands and modifiers - VOPC - sext() modifier - intrinsics - VOP2b (see vop_dpp.s) - V_MAC_F32 (see vop_dpp.s) Differential Revision: http://reviews.llvm.org/D19360 llvm-svn: 267553
*	[X86] PR27502: Fix the LEA optimization pass.	Andrey Turetskiy	2016-04-26	1	-2/+6
\| \| \| \| \| \| \| \|	Handle MachineBasicBlock as a memory displacement operand in the LEA optimization pass. Differential Revision: http://reviews.llvm.org/D19409 llvm-svn: 267551
*	[Sparc] Fix build error introduced by rL267545.	Marcin Koscielnicki	2016-04-26	1	-1/+1
\| \| \| \|	llvm-svn: 267549
*	[PowerPC] Add support for llvm.thread.pointer	Marcin Koscielnicki	2016-04-26	1	-0/+10
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D19304 llvm-svn: 267546
*	[SPARC] [SSP] Add support for LOAD_STACK_GUARD.	Marcin Koscielnicki	2016-04-26	6	-1/+40
\| \| \| \| \| \| \| \|	This fixes PR22248 on sparc. Differential Revision: http://reviews.llvm.org/D19386 llvm-svn: 267545
*	[SPARC] Add support for llvm.thread.pointer.	Marcin Koscielnicki	2016-04-26	2	-0/+18
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D19387 llvm-svn: 267544
*	ThinLTOCodeGenerator: preserve linkonce when in "MustPreserved" set	Mehdi Amini	2016-04-26	1	-4/+10
\| \| \| \| \| \| \| \| \|	If the linker specifically requested for a linkonce to be preserved, we need to make sure we won't drop it even if all the uses in the current module disappear. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 267543
*	Revert "ARM: put correct symbol index on indirect pointers in __thread_ptr."	Renato Golin	2016-04-26	1	-2/+1
\| \| \| \| \| \|	This reverts commit r267488, as it broke some ARM buildbots. llvm-svn: 267541
*	[ppc64] Reenable sibling call optimization on ppc64 since fixed tsan library ↵	Chuang-Yu Cheng	2016-04-26	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \|	tail-call issue print-stack-trace.cc test failure of compiler-rt has been fixed by r266869 (http://reviews.llvm.org/D19148), so reenable sibling call optimization on ppc64 Reviewers: nemanjai kbarton llvm-svn: 267527
*	[AArch64] Expand v1i64 and v2i64 ctlz.	Craig Topper	2016-04-26	1	-0/+3
\| \| \| \| \| \|	The default is legal, which results in 'Cannot select' errors. llvm-svn: 267522