bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	fix typos; NFC	Sanjay Patel	2016-04-04	1	-2/+2
\| \| \| \|	llvm-svn: 265356
*	Swift Calling Convention: add swifterror attribute.	Manman Ren	2016-04-01	3	-0/+9
\| \| \| \| \| \| \| \| \| \| \| \|	A ``swifterror`` attribute can be applied to a function parameter or an AllocaInst. This commit does not include any target-specific change. The target-specific optimization will come as a follow-up patch. Differential Revision: http://reviews.llvm.org/D18092 llvm-svn: 265189
*	Don't use an i64 return type with webkit_jscc	Sanjoy Das	2016-04-01	1	-6/+7
\| \| \| \| \| \| \| \| \|	Re-enable an assertion enabled by Justin Lebar in rL265092. rL265092 was breaking test/CodeGen/X86/deopt-intrinsic.ll because webkit_jscc does not like non-i64 return types. Change the test case to not do that. llvm-svn: 265099
*	Revert "Protect some assertions with NDEBUG rather than DEBUG()."	Justin Lebar	2016-04-01	1	-7/+6
\| \| \| \| \| \|	This reverts r265092, because it breaks CodeGen/X86/deopt-intrinsic.ll. llvm-svn: 265093
*	Protect some assertions with NDEBUG rather than DEBUG().	Justin Lebar	2016-04-01	1	-6/+7
\| \| \| \| \| \| \|	DEBUG() only runs if you pass -debug, but these assertions are generally useful. llvm-svn: 265092
*	fix typo; NFC	Sanjay Patel	2016-03-31	1	-2/+1
\| \| \| \|	llvm-svn: 265054
*	Prevent X86ISelLowering from merging volatile loads	Nirav Dave	2016-03-31	2	-14/+9
\| \| \| \| \| \| \| \| \| \| \| \| \|	Change isConsecutiveLoads to check that loads are non-volatile as this is a requirement for any load merges. Propagate change to two callers. Reviewers: RKSimon Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D18546 llvm-svn: 265013
*	LegalizeDAG: Don't replace vector store with integer if not legal	Matt Arsenault	2016-03-30	3	-41/+87
\| \| \| \| \| \| \| \| \| \| \|	For the same reason as the corresponding load change. Note that ExpandStore is completely broken for non-byte sized element vector stores, but preserve the current broken behavior which has tests for it. The behavior should be the same, but now introduces a new typed store that is incorrectly split later rather than doing it directly. llvm-svn: 264928
*	LegalizeDAG: Don't replace vector load with integer unless legal	Matt Arsenault	2016-03-30	3	-28/+71
\| \| \| \| \| \| \| \| \| \| \| \| \|	On AMDGPU we want to be able to promote i64/f64 loads to v2i32. If the access is unaligned, this would conclude that since i64 is legal, it would convert it back to i64 and there is an endless legalization loop. Extract the logic for scalarizing the load into a new TargetLowering function, where this can also replace the custom function AMDGPU has for this. llvm-svn: 264927
*	Add support for no-jump-tables	Nirav Dave	2016-03-29	1	-2/+7
\| \| \| \| \| \| \| \| \| \| \| \| \|	Add function soft attribute to the generation of Jump Tables in CodeGen as initial step towards clang support of gcc's no-jump-table support Reviewers: hans, echristo Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D18321 llvm-svn: 264756
*	Swift Calling Convention: add swiftself attribute.	Manman Ren	2016-03-29	3	-0/+9
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D17866 llvm-svn: 264754
*	[Codegen] Decrease minimum jump table density.	Kyle Butt	2016-03-29	2	-8/+23
\| \| \| \| \| \| \| \| \| \| \|	Minimum density for both optsize and non optsize are now options -sparse-jump-table-density (default 10) for non optsize functions -dense-jump-table-density (default 40) for optsize functions, which matches the current default. This improves several benchmarks at google at the cost of a small codesize increase. For code compiled with -Os, the old behavior continues llvm-svn: 264689
*	Minor code cleanup. NFC.	Junmo Park	2016-03-26	1	-1/+1
\| \| \| \|	llvm-svn: 264505
*	Prevent construction of cycle in DAG store merge	Nirav Dave	2016-03-25	3	-40/+44
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When merging stores in DAGCombiner, add check to ensure that no dependenices exist that would cause the construction of a cycle in our DAG. This may happen if one store has a data dependence on another instruction (e.g. a load) which itself has a (chain) dependence on another store being merged. These stores cannot be merged safely and doing so results in a cycle that is discovered in LegalizeDAG. This test is only done in cases where Antialias analysis is used (UseAA) as non-AA store merge candidates will be merged logically after all loads which have been checked to not alias. Reviewers: ahatanak, spatel, niravd, arsenm, hfinkel, tstellarAMD, jyknight Subscribers: llvm-commits, tberghammer, danalbert, srhines Differential Revision: http://reviews.llvm.org/D18336 llvm-svn: 264461
*	CXX TLS: collect return blocks after SelectAllBasicBlocks.	Manman Ren	2016-03-24	1	-7/+15
\| \| \| \| \| \| \| \| \| \|	It is incorrect to get the corresponding MBB for a ReturnInst before SelectAllBasicBlocks since SelectAllBasicBlocks can change the correspondence between a ReturnInst and the MBB it is in. PR27062 llvm-svn: 264358
*	Reduce code duplication by extracting out a helper function; NFC	Sanjoy Das	2016-03-24	2	-30/+21
\| \| \| \|	llvm-svn: 264355
*	Lower varargs correctly in deopt bundle lowering	Sanjoy Das	2016-03-24	1	-0/+1
\| \| \| \| \| \| \|	Earlier we were ignoring varargs in LowerCallSiteWithDeoptBundle because populateCallLoweringInfo does not set CallLoweringInfo::IsVarArg. llvm-svn: 264354
*	Add lowering support for llvm.experimental.deoptimize	Sanjoy Das	2016-03-24	3	-0/+40
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Only adds support for "naked" calls to llvm.experimental.deoptimize. Support for round-tripping through RewriteStatepointsForGC will come as a separate patch (should be simpler than this one). Reviewers: reames Subscribers: sanjoy, mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D18429 llvm-svn: 264329
*	[Statepoints] Fix yet another issue around gc pointer uniqueing	Sanjoy Das	2016-03-24	2	-19/+22
\| \| \| \| \| \| \| \| \| \| \| \| \|	Given that StatepointLowering now uniques derived pointers before putting them in the per-statepoint spill map, we may end up with missing entries for derived pointers when we visit a gc.relocate on a pointer that was de-duplicated away. Fix this by keeping two maps, one mapping gc pointers to their de-duplicated values, and one mapping a de-duplicated value to the slot it is spilled in. llvm-svn: 264320
*	Minor cosmestic changes (NFC)	Sanjoy Das	2016-03-24	1	-7/+7
\| \| \| \| \| \| \|	- Reflow comments - Rename function llvm-svn: 264319
*	CodeGen: extend RHS when splitting ATOMIC_CMP_SWAP_WITH_SUCCESS.	Tim Northover	2016-03-24	1	-3/+18
\| \| \| \| \| \| \| \| \| \| \| \| \|	If the operation's type has been promoted during type legalization, we need to account for the fact that the high bits of the comparison operand are likely unspecified. The LHS is usually zero-extended, but MIPS sign extends it, so we have to be slightly careful. Patch by Simon Dardis. llvm-svn: 264296
*	Remove unsafe AssertZext after promoting result of FP_TO_FP16	Pirama Arumuga Nainar	2016-03-24	1	-4/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Some target lowerings of FP_TO_FP16, for instance ARM's vcvtb.f16.f32 instruction, do not guarantee that the top 16 bits are zeroed out. Remove the unsafe AssertZext and add tests to exercise this. Reviewers: jmolloy, sbaranga, kristof.beyls, aadg Subscribers: llvm-commits, srhines, aemerson Differential Revision: http://reviews.llvm.org/D18426 llvm-svn: 264285
*	SelectionDAG: Remove a tautological dyn_cast. NFC	Justin Bogner	2016-03-23	1	-3/+2
\| \| \| \| \| \|	Index is already a StoreSDNode, so this dyn_cast doesn't do anything. llvm-svn: 264177
*	Remove stale comment	Sanjoy Das	2016-03-23	1	-2/+1
\| \| \| \|	llvm-svn: 264131
*	[StatepointLowering] Don't do two DenseMap lookups; nfci	Sanjoy Das	2016-03-23	1	-2/+3
\| \| \| \|	llvm-svn: 264130
*	[StatepointLowering] Minor NFC cleanups	Sanjoy Das	2016-03-23	1	-11/+12
\| \| \| \| \| \| \| \| \|	- Use auto - Name variables in LLVM style - Use llvm::find instead of std::find - Blank lines between declarations llvm-svn: 264129
*	[StatepointLowering] Minor nfc refactoring	Sanjoy Das	2016-03-23	1	-29/+6
\| \| \| \| \| \| \| \| \| \|	Now that StatepointLoweringInfo represents base pointers, derived pointers and gc relocates as SmallVectors and not ArrayRefs, we no longer need to allocate "backing storage" on stack in LowerStatepoint. So elide the backing storage, and inline the trivial body of getIncomingStatepointGCValues. llvm-svn: 264128
*	[StatepointLowering] Schedule gc relocates before uniqueing them	Sanjoy Das	2016-03-23	2	-9/+13
\| \| \| \| \| \|	Otherwise we can see an "unexpected" gc.relocate that we uniqued away. llvm-svn: 264127
*	[SelectionDAG] Ensure constant folded legalized vector element types are ↵	Simon Pilgrim	2016-03-22	1	-1/+1
\| \| \| \| \| \| \| \|	compatible with the BUILD_VECTOR type Found during fuzz testing - 32-bit x86 targets were legalizing a <2 x i1> compare result to <2 x i32> when <2 x i64> was expected. llvm-svn: 264085
*	CodeGen: check return types match when emitting tail call to builtin.	Tim Northover	2016-03-22	1	-2/+5
\| \| \| \| \| \| \| \| \| \| \|	We were just completely ignoring the types when determining whether we could safely emit a libcall as a tail call. This is clearly wrong. Theoretically, we could dig deeper looking for incidental matches (much like the generic code in Analysis.cpp does), but it's probably not worth it for the few libcalls that exist. llvm-svn: 264084
*	Allow lowering call sites with both funclets and deopt state	Sanjoy Das	2016-03-22	1	-5/+1
\| \| \| \| \| \| \|	Lowering funclets is a no-op, so we can just go ahead and lower the deopt state. llvm-svn: 264078
*	Add a hasOperandBundlesOtherThan helper, and use it; NFC	Sanjoy Das	2016-03-22	1	-12/+6
\| \| \| \|	llvm-svn: 264072
*	[X86][SSE] Reapplied: Simplify vector LOAD + EXTEND on pre-SSE41 hardware	Simon Pilgrim	2016-03-22	2	-0/+102
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Improve vector extension of vectors on hardware without dedicated VSEXT/VZEXT instructions. We already convert these to SIGN_EXTEND_VECTOR_INREG/ZERO_EXTEND_VECTOR_INREG but can further improve this by using the legalizer instead of prematurely splitting into legal vectors in the combine as this only properly helps for lowering to VSEXT/VZEXT. Removes a lot of unnecessary any_extend + mask pattern - (Fix for PR25718). Reapplied with a fix for PR26953 (missing vector widening legalization). Differential Revision: http://reviews.llvm.org/D17932 llvm-svn: 264062
*	Add "first class" lowering for deopt operand bundles	Sanjoy Das	2016-03-22	4	-24/+99
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: After this change, deopt operand bundles can be lowered directly by SelectionDAG into STATEPOINT instructions (which are then lowered to a call or sequence of nop, with an associated __llvm_stackmaps entry0. This obviates the need to round-trip deoptimization state through gc.statepoint via RewriteStatepointsForGC. Reviewers: reames, atrick, majnemer, JosephTremoulet, pgavlin Subscribers: sanjoy, mcrosier, majnemer, llvm-commits Differential Revision: http://reviews.llvm.org/D18257 llvm-svn: 264015
*	[DAGCombine] Catch the case where extract_vector_elt can cause an any_ext ↵	Silviu Baranga	2016-03-21	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	while processing AND SDNodes Summary: extract_vector_elt can cause an implicit any_ext if the types don't match. When processing the following pattern: (and (extract_vector_elt (load ([non_ext\|any_ext\|zero_ext] V))), c) DAGCombine was ignoring the possible extend, and sometimes removing the AND even though it was required to maintain some of the bits in the result to 0, resulting in a miscompile. This change fixes the issue by limiting the transformation only to cases where the extract_vector_elt doesn't perform the implicit extend. Reviewers: t.p.northover, jmolloy Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D18247 llvm-svn: 263935
*	[CXX_FAST_TLS] fix issues with O0 on ARM, AArch64 and X86.	Manman Ren	2016-03-18	1	-1/+1
\| \| \| \| \| \| \|	Since at O0, explicit copies via SplitCSR may not be removed even if they are unnecessary, we choose not to use SplitCSR at O0. llvm-svn: 263855
*	[SelectionDAG] Remove visitStatepoint; NFC	Sanjoy Das	2016-03-17	3	-11/+2
\| \| \| \| \| \| \|	This way we have a single entry point into StatepointLowering. The method was a direct dispatch to LowerStatepoint anyway. llvm-svn: 263682
*	Fix indentation; NFC	Sanjoy Das	2016-03-16	1	-3/+2
\| \| \| \|	llvm-svn: 263672
*	Extract out a SelectionDAGBuilder::LowerAsStatepoint; NFC	Sanjoy Das	2016-03-16	2	-144/+197
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This is a step towards implementing "direct" lowering of calls and invokes with deopt operand bundles into STATEPOINT nodes (as opposed to having them mandatorily pass through RewriteStatepointsForGC, which is the case today). This change extracts out a `SelectionDAGBuilder::LowerAsStatepoint` helper function that is able to lower a "statepoint like thing", and uses it to lower `gc.statepoint` calls. This is an NFC now, but in a later change we will use `LowerAsStatepoint` to directly lower calls and invokes with operand bundles without going through an intermediate `gc.statepoint` IR representation. FYI: I expect `SelectionDAGBuilder::StatepointInfo` will evolve as I add support for lowering non gc.statepoints, right now it is fairly tightly coupled with an IR level `gc.statepoint`. Reviewers: reames, pgavlin, JosephTremoulet Subscribers: sanjoy, mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D18106 llvm-svn: 263671
*	Tweak some atomics functions in preparation for larger changes; NFC.	James Y Knight	2016-03-16	2	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	- Rename getATOMIC to getSYNC, as llvm will soon be able to emit both '__sync' libcalls and '__atomic' libcalls, and this function is for the '__sync' ones. - getInsertFencesForAtomic() has been replaced with shouldInsertFencesForAtomic(Instruction), so that the decision can be made per-instruction. This functionality will be used soon. - emitLeadingFence/emitTrailingFence are no longer called if shouldInsertFencesForAtomic returns false, and thus don't need to check the condition themselves. llvm-svn: 263665
*	[SelectionDAG] Extract out populateCallLoweringInfo; NFC	Sanjoy Das	2016-03-16	3	-30/+31
\| \| \| \| \| \| \| \| \|	SelectionDAGBuilder::populateCallLoweringInfo is now used instead of SelectionDAGBuilder::lowerCallOperands. The populateCallLoweringInfo interface is more composable in face of design changes like http://reviews.llvm.org/D18106 llvm-svn: 263663
*	Removed trailing whitespace	Simon Pilgrim	2016-03-16	1	-12/+12
\| \| \| \|	llvm-svn: 263650
*	[StatepointLowering] Move an assertion; NFCI	Sanjoy Das	2016-03-15	1	-6/+4
\| \| \| \| \| \| \| \|	Instead of running an explicit loop over `gc.relocate` calls hanging off of a `gc.statepoint`, assert the validity of the type of the value being relocated in `visitRelocate`. llvm-svn: 263516
*	Temporarily Revert "[X86][SSE] Simplify vector LOAD + EXTEND on	Eric Christopher	2016-03-14	2	-40/+0
\| \| \| \| \| \| \| \| \|	pre-SSE41 hardware" as it seems to be causing crashes during code generation in halide. PR forthcoming. This reverts commit r263303. llvm-svn: 263512
*	[DAG] use !isUndef() ; NFCI	Sanjay Patel	2016-03-14	4	-13/+11
\| \| \| \|	llvm-svn: 263453
*	[DAG] use isUndef() ; NFCI	Sanjay Patel	2016-03-14	5	-98/+94
\| \| \| \|	llvm-svn: 263448
*	Make gc relocates more strongly typed; NFC	Sanjoy Das	2016-03-12	1	-10/+13
\| \| \| \| \| \| \|	Don't use a `Value ` where we can use a stronger `GCRelocateInst ` type. llvm-svn: 263327
*	[X86][SSE] Simplify vector LOAD + EXTEND on pre-SSE41 hardware	Simon Pilgrim	2016-03-11	2	-0/+40
\| \| \| \| \| \| \| \| \| \| \| \|	Improve vector extension of vectors on hardware without dedicated VSEXT/VZEXT instructions. We already convert these to SIGN_EXTEND_VECTOR_INREG/ZERO_EXTEND_VECTOR_INREG but can further improve this by using the legalizer instead of prematurely splitting into legal vectors in the combine as this only properly helps for lowering to VSEXT/VZEXT. Removes a lot of unnecessary any_extend + mask pattern - (Fix for PR25718). Differential Revision: http://reviews.llvm.org/D17932 llvm-svn: 263303
*	[X86][SSE] Reapplied: Improve vector ZERO_EXTEND by combining to ↵	Simon Pilgrim	2016-03-10	2	-2/+19
\| \| \| \| \| \| \| \| \| \| \| \|	ZERO_EXTEND_VECTOR_INREG Generalise the existing SIGN_EXTEND to SIGN_EXTEND_VECTOR_INREG combine to support zero extension as well and get rid of a lot of unnecessary ANY_EXTEND + mask patterns. Reapplied with a fix for PR26870 (avoid premature use of TargetConstant in ZERO_EXTEND_VECTOR_INREG expansion). Differential Revision: http://reviews.llvm.org/D17691 llvm-svn: 263159
*	SelectionDAG: Fix a crash on inline asm when output register supports ↵	Tom Stellard	2016-03-09	1	-3/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	multiple types Summary: The code in SelectionDAG did not handle the case where the register type and output types were different, but had the same size. Reviewers: arsenm, echristo Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D17940 llvm-svn: 263022