bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[X86] Update SelectionDAGDumper to print the extension type and expanding ↵	Craig Topper	2019-01-24	1	-0/+30
\| \| \| \| \| \|	flag for masked loads. Add truncating and compressing for masked stores. llvm-svn: 352029
*	DebugInfo: Use assembly label arithmetic for address pool size for easier ↵	David Blaikie	2019-01-24	2	-9/+17
\| \| \| \| \| \| \| \| \|	reading/editing Recommits 350048, 350050 That broke buildbots because of some typos in the test case. llvm-svn: 352019
*	[ADT] Notify ilist traits about in-list transfers	Reid Kleckner	2019-01-23	1	-2/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Previously no client of ilist traits has needed to know about transfers of nodes within the same list, so as an optimization, ilist doesn't call transferNodesFromList in that case. However, now there are clients that want to use ilist traits to cache instruction ordering information to optimize dominance queries of instructions in the same basic block. This change updates the existing ilist traits users to detect in-list transfers and do nothing in that case. After this change, we can start caching instruction ordering information in LLVM IR data structures. There are two main ways to do that: - by putting an order integer into the Instruction class - by maintaining order integers in a hash table on BasicBlock I plan to implement and measure both, but I wanted to commit this change first to enable other out of tree ilist clients to implement this optimization as well. Reviewers: lattner, hfinkel, chandlerc Subscribers: hiraditya, dexonsmith, llvm-commits Differential Revision: https://reviews.llvm.org/D57120 llvm-svn: 351992
*	[MC][X86] Correctly model additional operand latency caused by transfer ↵	Andrea Di Biagio	2019-01-23	1	-2/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	delays from the integer to the floating point unit. This patch adds a new ReadAdvance definition named ReadInt2Fpu. ReadInt2Fpu allows x86 scheduling models to accurately describe delays caused by data transfers from the integer unit to the floating point unit. ReadInt2Fpu currently defaults to a delay of zero cycles (i.e. no delay) for all x86 models excluding BtVer2. That means, this patch is only a functional change for the Jaguar cpu model only. Tablegen definitions for instructions (V)PINSR* have been updated to account for the new ReadInt2Fpu. That read is mapped to the the GPR input operand. On Jaguar, int-to-fpu transfers are modeled as a +6cy delay. Before this patch, that extra delay was added to the opcode latency. In practice, the insert opcode only executes for 1cy. Most of the actual latency is actually contributed by the so-called operand-latency. According to the AMD SOG for family 16h, (V)PINSR* latency is defined by expression f+1, where f is defined as a forwarding delay from the integer unit to the fpu. When printing instruction latency from MCA (see InstructionInfoView.cpp) and LLC (only when flag -print-schedule is speified), we now need to account for any extra forwarding delays. We do this by checking if scheduling classes declare any negative ReadAdvance entries. Quoting a code comment in TargetSchedule.td: "A negative advance effectively increases latency, which may be used for cross-domain stalls". When computing the instruction latency for the purpose of our scheduling tests, we now add any extra delay to the formula. This avoids regressing existing codegen and mca schedule tests. It comes with the cost of an extra (but very simple) hook in MCSchedModel. Differential Revision: https://reviews.llvm.org/D57056 llvm-svn: 351965
*	[DAGCombine] Enable more pre-indexed stores	Sam Parker	2019-01-23	1	-1/+7
\| \| \| \| \| \| \| \| \| \| \|	The current check in CombineToPreIndexedLoadStore is too conversative, preventing a pre-indexed store when the base pointer is a predecessor of the value being stored. Instead, we should check the pointer operand of the store. Differential Revision: https://reviews.llvm.org/D56719 llvm-svn: 351933
*	[Pipeliner] Add two pragmas to control software pipelining optimization	Brendon Cahoon	2019-01-23	1	-7/+76
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	#pragma clang loop pipeline(disable) Disable SWP optimization for the next loop. “disable” is the only possible value. #pragma clang loop pipeline_initiation_interval(number) Set value of initiation interval for SWP optimization to specified number value for the next loop. Number is the positive value greater than 0. These pragmas could be used for debugging or reducing compile time purposes. It is possible to disable SWP for concrete loops to save compilation time or to find bugs by not doing SWP to certain loops. It is possible to set value of initiation interval to concrete number to save compilation time by not doing extra pipeliner passes or to check created schedule for specific initiation interval. That is llvm part of the fix Clang part of fix: https://reviews.llvm.org/D55710 Patch by Alexey Lapshin! Differential Revision: https://reviews.llvm.org/D56403 llvm-svn: 351923
*	[CodeView] Allow empty types in member functions	Josh Stone	2019-01-23	1	-1/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: `CodeViewDebug::lowerTypeMemberFunction` used to default to a `Void` return type if the function's type array was empty. After D54667, it started blindly indexing the 0th item for the return type, which fails in `getOperand` for empty arrays if assertions are enabled. This patch restores the `Void` return type for empty type arrays, and adds a test generated by Rust in line-only debuginfo mode. Reviewers: zturner, rnk Reviewed By: rnk Subscribers: hiraditya, JDevlieghere, llvm-commits Differential Revision: https://reviews.llvm.org/D57070 llvm-svn: 351910
*	[LegalizeTypes] Add debug prints to the top of PromoteFloatOperand and ↵	Craig Topper	2019-01-22	1	-0/+12
\| \| \| \| \| \| \| \| \| \|	PromoteFloatResult. Also add debug prints in the default case of the switches in these routines. Most if not all of the type legalization handlers already do this so this makes promoting floats consistent llvm-svn: 351890
*	GlobalISel: Allow shift amount to be a different type	Matt Arsenault	2019-01-22	1	-17/+47
\| \| \| \| \| \| \| \| \|	For AMDGPU the shift amount is never 64-bit, and this needs to use a 32-bit shift. X86 uses i8, but seemed to be hacking around this before. llvm-svn: 351882
*	GlobalISel: Make buildConstant handle vectors	Matt Arsenault	2019-01-22	1	-4/+38
\| \| \| \| \| \| \|	Produce a splat build_vector similar to how SelectionDAG::getConstant does. llvm-svn: 351880
*	GlobalISel: Implement widen for extract_vector_elt elt type	Matt Arsenault	2019-01-22	1	-1/+16
\| \| \| \|	llvm-svn: 351871
*	GlobalISel: Implement fewerElementsVector for basic FP ops	Matt Arsenault	2019-01-22	1	-7/+37
\| \| \| \|	llvm-svn: 351866
*	GlobalISel: Support narrowing zextload/sextload	Matt Arsenault	2019-01-22	1	-0/+27
\| \| \| \|	llvm-svn: 351856
*	[SelectionDAGBuilder] Defer C_Register Assignments to be in line with	Nirav Dave	2019-01-22	1	-13/+3
\| \| \| \| \| \|	those of C_RegisterClass. NFCI. llvm-svn: 351854
*	GlobalISel: Disallow vectors for G_CONSTANT/G_FCONSTANT	Matt Arsenault	2019-01-22	2	-4/+12
\| \| \| \|	llvm-svn: 351853
*	Codegen support for atomicrmw fadd/fsub	Matt Arsenault	2019-01-22	3	-0/+5
\| \| \| \|	llvm-svn: 351851
*	Reapply "IR: Add fp operations to atomicrmw"	Matt Arsenault	2019-01-22	1	-0/+6
\| \| \| \| \| \| \|	This reapplies commits r351778 and r351782 with RISCV test fixes. llvm-svn: 351850
*	[DEBUG_INFO, NVPTX] Fix relocation info.	Alexey Bataev	2019-01-22	3	-22/+54
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: Initial function labels must follow the debug location for the correct relocation info generation. Reviewers: tra, jlebar, echristo Subscribers: jholewinski, llvm-commits Differential Revision: https://reviews.llvm.org/D45784 llvm-svn: 351843
*	[DAGCombiner] narrow vector binop with 2 insert subvector operands	Sanjay Patel	2019-01-22	1	-1/+24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	vecbo (insertsubv undef, X, Z), (insertsubv undef, Y, Z) --> insertsubv VecC, (vecbo X, Y), Z This is another step in generic vector narrowing. It's also a step towards more horizontal op formation specifically for x86 (although we still failed to match those in the affected tests). The scalarization cases are also not optimal (we should be scalarizing those), but it's still an improvement to use a narrower vector op when we know part of the result must be constant because both inputs are undef in some vector lanes. I think a similar match but checking for a constant operand might help some of the cases in D51553. Differential Revision: https://reviews.llvm.org/D56875 llvm-svn: 351825
*	Revert r351778: IR: Add fp operations to atomicrmw	Chandler Carruth	2019-01-22	1	-6/+0
\| \| \| \| \| \| \| \| \| \| \| \| \|	This broke the RISCV build, and even with that fixed, one of the RISCV tests behaves surprisingly differently with asserts than without, leaving there no clear test pattern to use. Generally it seems bad for hte IR to differ substantially due to asserts (as in, an alloca is used with asserts that isn't needed without!) and nothing I did simply would fix it so I'm reverting back to green. This also required reverting the RISCV build fix in r351782. llvm-svn: 351796
*	IR: Add fp operations to atomicrmw	Matt Arsenault	2019-01-22	1	-0/+6
\| \| \| \| \| \|	Add just fadd/fsub for now. llvm-svn: 351778
*	GlobalISel: Fix out of bounds crashes in verifier	Matt Arsenault	2019-01-22	1	-3/+8
\| \| \| \|	llvm-svn: 351769
*	[DAGCombiner] fix crash when converting build vector to shuffle	Sanjay Patel	2019-01-21	1	-5/+11
\| \| \| \| \| \| \| \| \| \|	The regression test is reduced from the example shown in D56281. This does raise a question as noted in the test file: do we want to handle this pattern? I don't have a motivating example for that on x86 yet, but it seems like we could have that pattern there too, so we could avoid the back-and-forth using a shuffle. llvm-svn: 351753
*	GlobalISel: Add isPointer legality predicates	Matt Arsenault	2019-01-20	1	-0/+14
\| \| \| \|	llvm-svn: 351699
*	GlobalISel: Implement widenScalar for basic FP ops	Matt Arsenault	2019-01-20	1	-4/+13
\| \| \| \|	llvm-svn: 351696
*	Update the file headers across all of the LLVM projects in the monorepo	Chandler Carruth	2019-01-19	283	-1132/+849
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	to reflect the new license. We understand that people may be surprised that we're moving the header entirely to discuss the new license. We checked this carefully with the Foundation's lawyer and we believe this is the correct approach. Essentially, all code in the project is now made available by the LLVM project under our new license, so you will see that the license headers include that license only. Some of our contributors have contributed code under our old license, and accordingly, we have retained a copy of our old license notice in the top-level files in each project and repository. llvm-svn: 351636
*	Reapply "[CGP] Check for existing inttotpr before creating new one"	Roman Tereshin	2019-01-19	1	-4/+18
\| \| \| \| \| \|	Original commit: r351582 llvm-svn: 351626
*	Revert "Reapply "[CGP] Check for existing inttotpr before creating new one""	Roman Tereshin	2019-01-19	1	-17/+4
\| \| \| \| \| \| \| \| \| \|	This reverts commit r351618. Compiler RT + ASAN tests are failing for PowerPC. Not sure how would I reproduce these on macOS, so reverting (again) until I do. llvm-svn: 351619
*	Reapply "[CGP] Check for existing inttotpr before creating new one"	Roman Tereshin	2019-01-19	1	-4/+17
\| \| \| \| \| \|	Original commit: r351582 llvm-svn: 351618
*	Revert r351584: "GlobalISel: Verify g_zextload and g_sextload"	Amara Emerson	2019-01-19	1	-14/+1
\| \| \| \| \| \|	This new assertion triggered on the AArch64 GlobalISel bots. Reverting while it's being investigated. llvm-svn: 351617
*	Revert "[CGP] Check for existing inttotpr before creating new one"	Roman Tereshin	2019-01-18	1	-13/+4
\| \| \| \| \| \| \| \|	This reverts commit r351582. Bots are failing. Reverting this to fix and re-commit later. llvm-svn: 351598
*	GlobalISel: Verify G_BITCAST	Matt Arsenault	2019-01-18	1	-0/+13
\| \| \| \|	llvm-svn: 351594
*	GlobalISel: Verify G_ICMP/G_FCMP vector types	Matt Arsenault	2019-01-18	1	-0/+11
\| \| \| \|	llvm-svn: 351591
*	GlobalISel: Verify g_zextload and g_sextload	Matt Arsenault	2019-01-18	1	-1/+14
\| \| \| \|	llvm-svn: 351584
*	[CGP] Check for existing inttotpr before creating new one	Roman Tereshin	2019-01-18	1	-4/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Make sure CodeGenPrepare doesn't emit multiple inttoptr instructions of the same integer value while sinking address computations, but rather CSEs them on the fly: excessive inttoptr's confuse SCEV into thinking that related pointers have nothing to do with each other. This problem blocks LoadStoreVectorizer from vectorizing some of the loads / stores in a downstream target. Reviewed By: hfinkel Differential Revision: https://reviews.llvm.org/D56838 llvm-svn: 351582
*	[SelectionDAG] Updates for -dag-dump-verbose	Bjorn Pettersson	2019-01-18	2	-35/+68
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch makes some changes related to -dag-dump-verbose. Main use case has been when debugging how SelectionDAG is dealing with debug info (SDDbgValue nodes). 1) We now print the number of DbgValues that are mapped to each SDNode. 2) Removed duplicated printing of DebugLoc (nowadays DebugLoc is printed also when not using -dag-dump-verbose). 3) Renamed SDDbgValue::dump to SDDbgValue::print, and added a new SDDbgValue::dump that will start a new line after calling print. 4) SDDbgValue::print now prints "Order", and it also prints some additional information when kind is CONST/FRAMEIX/VREG. 5) SelectionDAG::dump() now dumps all SDDbgValue nodes after the list of SDNodes (both "regular" and "ByVal" SDDbgValue:s). Invalidated nodes are not printed. 6) Prohibit inline printing of SDNode operands that has SDDbgValue nodes associated to them. Reviewers: jmorse, aprantl Reviewed By: aprantl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D56793 llvm-svn: 351581
*	[GlobalISel] Change to range-based invocation of llvm::sort	Mandeep Singh Grang	2019-01-18	1	-6/+3
\| \| \| \|	llvm-svn: 351574
*	[SelectionDAG] Split very large token factors for chained stores to 64k chunks.	Florian Hahn	2019-01-18	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Similar to D55073. Without this change, the DAG combiner crashes on code with more than 64k of stores in a single basic block that form parallelizable chains. No test case, as it would be very IR file. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D56740 llvm-svn: 351571
*	[SelectionDAGBuilder] Cleanup InlineAsm Output generation. NFCI.	Nirav Dave	2019-01-18	1	-115/+104
\| \| \| \| \| \| \| \|	Defer inline asm's output fixup work until after we've generated the inline asm node itself. Remove StoresToEmit, IndirectStoresToEmit, and RetValRegs in favor of using ConstraintOperands. llvm-svn: 351558
*	[SelectionDAG] Add getTokenFactor, which splits nodes with > 64k operands.	Florian Hahn	2019-01-18	2	-13/+14
\| \| \| \| \| \| \| \| \|	This functionality is required at multiple places which potentially create large operand lists, like SelectionDAGBuilder or DAGCombiner. Differential Revision: https://reviews.llvm.org/D56739 llvm-svn: 351552
*	[SelectionDAG] Add static getMaxNumOperands function to SDNode.	Florian Hahn	2019-01-18	2	-3/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Use this helper to make sure we use the same value at various places. This will likely be needed at more places were we currently crash because we use more operands than possible. Also makes it easier to change in the future. Reviewers: RKSimon, craig.topper, efriedma, aemerson Reviewed By: RKSimon Subscribers: hiraditya, arsenm, llvm-commits Differential Revision: https://reviews.llvm.org/D56859 llvm-svn: 351537
*	[ScheduleDAGRRList] Do not preschedule the node has ADJCALLSTACKDOWN parent	Shiva Chen	2019-01-18	1	-0/+23
\| \| \| \| \| \| \| \| \| \| \| \|	We should not pre-scheduled the node has ADJCALLSTACKDOWN parent, or else, when bottom-up scheduling, ADJCALLSTACKDOWN and ADJCALLSTACKUP may hold CallResource too long and make other calls can't be scheduled. If there's no other available node to schedule, the scheduler will try to rename the register by creating copy to avoid the conflict which will fail because CallResource is not a real physical register. llvm-svn: 351527
*	[CodeGen] Fix bugs in LiveDebugVariables when debug labels are generated.	Hsiangkai Wang	2019-01-18	1	-13/+125
\| \| \| \| \| \| \| \| \| \| \| \|	Remove DBG_LABELs in LiveDebugVariables and generate them in VirtRegRewriter. This bug is reported in https://bugs.chromium.org/p/chromium/issues/detail?id=898152. Differential Revision: https://reviews.llvm.org/D54465 llvm-svn: 351525
*	[mips] Emit .reloc R_{MICRO}MIPS_JALR along with j(al)r(c) $25	Vladimir Stefanovic	2019-01-17	1	-2/+3
\| \| \| \| \| \| \| \| \| \| \| \|	The callee address is added as an optional operand (MCSymbol) in AdjustInstrPostInstrSelection() and then used by asm printer to insert: '.reloc tmplabel, R_MIPS_JALR, symbol tmplabel:'. Controlled with '-mips-jalr-reloc', default is true. Differential revision: https://reviews.llvm.org/D56694 llvm-svn: 351485
*	Allow FP types for atomicrmw xchg	Matt Arsenault	2019-01-17	5	-1/+70
\| \| \| \|	llvm-svn: 351427
*	[AsmPrinter] Collapse .loc 0 0 directives	Jonas Devlieghere	2019-01-16	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Currently we do not always collapse subsequent .loc 0 0 directives. The reason is that we were checking for a PrevInstLoc which is not set when we emit a line-0 record. We should only check the LastAsmLine, which seems to be created exactly for this purpose. // When we emit a line-0 record, we don't update PrevInstLoc; so look at // the last line number actually emitted, to see if it was line 0. unsigned LastAsmLine = Asm->OutStreamer->getContext().getCurrentDwarfLoc().getLine(); Differential revision: https://reviews.llvm.org/D56767 llvm-svn: 351395
*	[COFF, ARM64] Implement support for SEH extensions __try/__except/__finally	Mandeep Singh Grang	2019-01-16	2	-9/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch supports MS SEH extensions __try/__except/__finally. The intrinsics localescape and localrecover are responsible for communicating escaped static allocas from the try block to the handler. We need to preserve frame pointers for SEH. So we create a new function/property HasLocalEscape. Reviewers: rnk, compnerd, mstorsjo, TomTan, efriedma, ssijaric Reviewed By: rnk, efriedma Subscribers: smeenai, jrmuizel, alex, majnemer, ssijaric, ehsan, dmajor, kristina, javed.absar, kristof.beyls, chrib, llvm-commits Differential Revision: https://reviews.llvm.org/D53540 llvm-svn: 351370
*	[DebugInfo] Allow creation of DBG_VALUEs in blocks where the operand is not used	Jeremy Morse	2019-01-16	1	-5/+6
\| \| \| \| \| \| \| \| \| \| \| \| \|	dbg.value intrinsics can appear in blocks where their operand is not used, meaning the operand never receives an SDNode, and thus no DBG_VALUE will be created. Get around this by looking to see whether the operand has already been allocated a virtual register. This allows dbg.values of Phi node and Values that are used across basic blocks to successfully be translated into DBG_VALUEs. Differential Revision: https://reviews.llvm.org/D56678 llvm-svn: 351358
*	[SelectionDAG] Update check in createOperands to reflect max() is a valid value.	Florian Hahn	2019-01-16	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The value returned by max() is the last valid value, adjust the comparison accordingly. The code added in D55073 creates TokenFactors with max() operands. Reviewers: aemerson, efriedma, RKSimon, craig.topper Reviewed By: aemerson Differential Revision: https://reviews.llvm.org/D56738 llvm-svn: 351318
*	[DAGCombine] Fix ReduceLoadWidth for shifted offsets	Sam Parker	2019-01-16	1	-12/+8
\| \| \| \| \| \| \| \| \| \| \| \|	ReduceLoadWidth can trigger using a shifted mask is used and this requires that the function return a shl node to correct for the offset. However, the way that this was implemented meant that the returned result could be an existing node, which would be incorrect. This fixes the method of inserting the new node and replacing uses. Differential Revision: https://reviews.llvm.org/D50432 llvm-svn: 351310