bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[MCP] Add stats for backward copy propagation. NFC.	Kai Luo	2019-12-30	1	-1/+5
\|
*	[SelectionDAT] Simplify SelectionDAGBuilder::visitInlineAsm	Fangrui Song	2019-12-29	1	-11/+3
\| \| \| \| \| \|	Indirect C_Immediate or C_Other constraints have been excluded. Also simplify an unneeded change to indirect 'X' by D60942.
*	[SelectionDAG] Disallow indirect "i" constraint	Fangrui Song	2019-12-29	1	-0/+6
\| \| \| \| \| \| \| \| \|	This allows us to delete InlineAsm::Constraint_i workarounds in SelectionDAGISel::SelectInlineAsmMemoryOperand overrides and TargetLowering::getInlineAsmMemConstraint overrides. They were introduced to X86 in r237517 to prevent crashes for constraints like "=*imr". They were later copied to other targets.
*	SimplifyDemandedBits - Remove duplicate getOperand() call. NFC.	Simon Pilgrim	2019-12-28	1	-9/+7
\| \| \| \|	Pulled out from D56387 - cleanup variable names, move shift amount legalization inside if() of its only user and remove duplicate getOperand() call.
*	[TargetLowering] Update comment to reference the correct compiler-rt ↵	Craig Topper	2019-12-27	1	-1/+1
\| \| \| \|	function the code is based on. NFC
*	Delete setjmp_undefined_for_msvc workaround after llvm.setjmp was removed	Fangrui Song	2019-12-27	2	-16/+0
\|
*	TailDuplication: Clear NoPHIs property	Matt Arsenault	2019-12-27	1	-0/+5
\| \| \| \| \| \|	The early tail duplicator pass introduces new ones, so a MIR test that infers no phis since there were none on the input would fail the verifier after running.
*	Delete llvm.{sig,}{setjmp,longjmp} remnant after r136821	Fangrui Song	2019-12-27	3	-36/+0
\| \| \| \| \| \| \|	Intrinsic has incorrect argument type! i32 (i32) @llvm.setjmp wipes tear
*	[X86][FPEnv] Promote some float strictfp operations to double on ↵	Craig Topper	2019-12-26	1	-0/+31
\| \| \| \| \| \| \| \|	i686-pc-windows-msvc to match what we do for non-strict. The float libcalls are inlined in MSVC's math header where they just cast to double and use the double libcall. Do the same when we emit libcalls.
*	[DebugInfo][SelectionDAG] Change order while transferring SDDbgValue to ↵	Kristina Bessonova	2019-12-26	1	-3/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	another node SelectionDAG::transferDbgValues() can 'reattach' SDDbgValue from one to another node, but doesn't change its source order. If the destination node has the order greater than the SDDbgValue, there are two possible issues revealed later: * If debug info is attached to an instruction that is the first definition of a register, this ends up with a def-after-use and the debug info gets 'undef' later. * If MIR has another definition of a register above the debug info, the debug info may represent a source variable incorrectly because it appears (significantly) before an instruction corresponded to this debug info. So, the patch changes the order of an SDDbgValue when it is moved to a node with greater order. Reviewers: dblaikie, jmorse, aprantl Reviewed By: aprantl Subscribers: aprantl, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D71175
*	[X86] Enable STRICT_SINT_TO_FP/STRICT_UINT_TO_FP on X86 backend	Wang, Pengfei	2019-12-26	1	-3/+12
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: Enable STRICT_SINT_TO_FP/STRICT_UINT_TO_FP on X86 backend Reviewers: craig.topper, RKSimon, LiuChen3, uweigand, andrew.w.kaylor Subscribers: hiraditya, llvm-commits, LuoYuanke Tags: #llvm Differential Revision: https://reviews.llvm.org/D71871
*	GlobalISel: Update syntax in debug printing	Matt Arsenault	2019-12-24	1	-1/+1
\| \| \| \|	Physical register names now start with $, not %
*	GlobalISel: Fix naming variables "brank" instead of "bank"	Matt Arsenault	2019-12-24	1	-7/+7
\|
*	[TypePromotion] Make TypeSize a class member	Sam Parker	2019-12-24	1	-87/+98
\| \| \| \| \| \| \| \| \|	Having TypeSize as a static class variable was causing problems with multi-threading. Several static functions have now been converted into methods of TypePromotion and a few other members of TypePromotion and IRPromoter have been added or removed. Differential Revision: https://reviews.llvm.org/D71832
*	DebugInfo: Correct the form of DW_AT_macro_info in .dwo files (sec_offset, ↵	David Blaikie	2019-12-24	1	-1/+1
\| \| \| \|	rather than data4)
*	DebugInfo: Add {} to address -Wdangling-else warning.	David Blaikie	2019-12-24	1	-1/+2
\|
*	[DebugInfo] Fix v4 macinfo for dwo files.	Sourabh Singh Tomar	2019-12-24	1	-4/+9
\| \| \| \| \|	Dwo files must contain have DW_AT_macro_info attribute, when macro information is emitted. Adjusted the test case for the same.
*	[SelectionDAG] Change SelectionDAGISel::{funcInfo,SDB} to use unique_ptr	Fangrui Song	2019-12-23	2	-24/+24
\| \| \| \| \|	CurDAG is referenced more than 2000 times and used in many gerated .cpp files. Don't touch it for now.
*	[SelectionDAG] Don't repeatedly add a node to the worklist in ↵	Fangrui Song	2019-12-23	1	-6/+3
\| \| \| \| \| \|	ComputeLiveOutVRegInfo. NFC For sqlite3 amalgram, this decreases the number of Worklist.push_back calls (603084) by 10%.
*	[FPEnv][X86] More strict int <-> FP conversion fixes	Ulrich Weigand	2019-12-23	2	-25/+55
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Fix several several additional problems with the int <-> FP conversion logic both in common code and in the X86 target. In particular: - The STRICT_FP_TO_UINT expansion emits a floating-point compare. This compare can raise exceptions and therefore needs to be a strict compare. I've made it signaling (even though quiet would also be correct) as signaling is the more usual default for an LT. This code exists both in common code and in the X86 target. - The STRICT_UINT_TO_FP expansion algorithm was incorrect for strict mode: it emitted two STRICT_SINT_TO_FP nodes and then used a select to choose one of the results. This can cause spurious exceptions by the STRICT_SINT_TO_FP that ends up not chosen. I've fixed the algorithm to use only a single STRICT_SINT_TO_FP instead. - The !isStrictFPEnabled logic in DoInstructionSelection would sometimes do the wrong thing because it calls getOperationAction using the result VT. But for some opcodes, incuding [SU]INT_TO_FP, getOperationAction needs to be called using the operand VT. - Remove some (obsolete) code in X86DAGToDAGISel::Select that would mutate STRICT_FP_TO_[SU]INT to non-strict versions unnecessarily. Reviewed by: craig.topper Differential Revision: https://reviews.llvm.org/D71840
*	[DAGCombine] visitEXTRACT_SUBVECTOR - 'little to big' ↵	Sanjay Patel	2019-12-23	1	-1/+17
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	extract_subvector(bitcast()) support This moves the X86 specific transform from rL364407 into DAGCombiner to generically handle 'little to big' cases (for example: extract_subvector(v2i64 bitcast(v16i8))). This allows us to remove both the x86 implementation and the aarch64 bitcast(extract_subvector(bitcast())) combine. Earlier patches that dealt with regressions initially exposed by this patch: rG5e5e99c041e4 rG0b38af89e2c0 Patch by: @RKSimon (Simon Pilgrim) Differential Revision: https://reviews.llvm.org/D63815
*	[AArch64] [Windows] Use COFF stubs for calls to extern_weak functions	Martin Storsjö	2019-12-23	1	-1/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	As the extern_weak target might be missing, resolving to the absolute address zero, we can't use the normal direct PC-relative branch instructions (as that would result in relocations out of range). Improve the classifyGlobalFunctionReference method to set MO_DLLIMPORT/MO_COFFSTUB, and simplify the existing code in AArch64TargetLowering::LowerCall to use the return value from classifyGlobalFunctionReference for these cases. Add code in both AArch64FastISel and GlobalISel/IRTranslator to bail out for function calls to extern weak functions on windows, to let SelectionDAG handle them. This matches what was done for X86 in 6bf108d77a3c. Differential Revision: https://reviews.llvm.org/D71721
*	[DAGCombiner] Check term use before applying aggressive FSUB optimisations	Carl Ritson	2019-12-23	1	-3/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Without this check unnecessary FMA instructions are generated when the FSUB terms are reused. This also has the side-effect that the same value is computed to different levels of precision, which can create undesirable effects if the results are used together in subsequent computation. Reviewers: arsenm, nhaehnle, foad, tpr, dstuttard, spatel Reviewed By: arsenm Subscribers: jvesely, wdng, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D71656
*	[SelectionDAG] Copy FP flags when visiting a binary instruction.	Valentin Churavy	2019-12-22	1	-0/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: We noticed in Julia that the sequence below no longer turned into a sequence of FMA instructions in LLVM 7+, but it did in LLVM 6. ``` %29 = fmul contract <4 x double> %wide.load, %wide.load16 %30 = fmul contract <4 x double> %wide.load13, %wide.load17 %31 = fmul contract <4 x double> %wide.load14, %wide.load18 %32 = fmul contract <4 x double> %wide.load15, %wide.load19 %33 = fadd fast <4 x double> %vec.phi, %29 %34 = fadd fast <4 x double> %vec.phi10, %30 %35 = fadd fast <4 x double> %vec.phi11, %31 %36 = fadd fast <4 x double> %vec.phi12, %32 ``` Unlike Clang, Julia doesn't set the `unsafe-fp-math=true` function attribute, but rather emits more local instruction flags. This partially undoes https://reviews.llvm.org/D46854 and if required I can try to minimize the test further. Reviewers: spatel, mcberg2017 Reviewed By: spatel Subscribers: chriselrod, merge_guards_bot, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D71495
*	Revert "[ARM][TypePromotion] Enable by default"	Reid Kleckner	2019-12-22	1	-21/+3
\| \| \| \| \| \| \| \|	This reverts commit ee7579409b7d940c4e1314d126e900db30c4edff. It causes crashes during ThinLTO. I suspect the issue is related to races on the global TypeSize variable, which is 80 at the time of the crash.
*	[ms] [X86] Use "P" modifier on operands to call instructions in inline X86 ↵	Eric Astor	2019-12-22	1	-4/+33
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	assembly. Summary: This is documented as the appropriate template modifier for call operands. Fixes PR44272, and adds a regression test. Also adds support for operand modifiers in Intel-style inline assembly. Reviewers: rnk Reviewed By: rnk Subscribers: merge_guards_bot, hiraditya, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D71677
*	DebugInfo: Remove out of date comment	David Blaikie	2019-12-21	1	-4/+0
\|
*	[NFC][MachineOutliner] Rewrite setSuffixIndices to be iterative	Jessica Paquette	2019-12-20	1	-18/+25
\| \| \| \| \| \| \| \|	Having this function be recursive could use up way too much stack space. Rewrite it as an iterative traversal in the tree instead to prevent this. Fixes PR44344.
*	[DWARF] Defer creating declaration DIEs until we prepare call site info	Vedant Kumar	2019-12-20	1	-7/+0
\| \| \| \| \| \| \| \|	It isn't necessary to create DIEs for all of the declaration subprograms in a CU's retainedTypes list. We can defer creating these subprograms until we need to prepare a call site tag that refers to one. This cleanup was mentioned in passing in D70350.
*	Reland: [DWARF] Allow cross-CU references of subprogram definitions	Vedant Kumar	2019-12-20	4	-7/+30
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This allows a call site tag in CU A to reference a callee DIE in CU B without resorting to creating an incomplete duplicate DIE for the callee inside of CU A. We already allow cross-CU references of subprogram declarations, so it doesn't seem like definitions ought to be special. This improves entry value evaluation and tail call frame synthesis in the LTO setting. During LTO, it's common for cross-module inlining to produce a call in some CU A where the callee resides in a different CU, and there is no declaration subprogram for the callee anywhere. In this case llvm would (unnecessarily, I think) emit an empty DW_TAG_subprogram in order to fill in the call site tag. That empty 'definition' defeats entry value evaluation etc., because the debugger can't figure out what it means. As a follow-up, maybe we could add a DWARF verifier check that a DW_TAG_subprogram at least has a DW_AT_name attribute. Update: Reland with a fix to create a declaration DIE when the declaration is missing from the CU's retainedTypes list. The declaration is left out of the retainedTypes list in two cases: 1) Re-compiling pre-r266445 bitcode (in which declarations weren't added to the retainedTypes list), and 2) Doing LTO function importing (which doesn't update the retainedTypes list). It's possible to handle (1) and (2) by modifying the retainedTypes list (in AutoUpgrade, or in the LTO importing logic resp.), but I don't see an advantage to doing it this way, as it would cause more DWARF to be emitted compared to creating the declaration DIEs lazily. Tested with a stage2 ThinLTO+RelWithDebInfo build of clang, and with a ReleaseLTO-g build of the test suite. rdar://46577651, rdar://57855316, rdar://57840415 Differential Revision: https://reviews.llvm.org/D70350
*	[WebAssembly] Use TargetIndex operands in DbgValue to track WebAssembly ↵	Yury Delendik	2019-12-20	5	-3/+61
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	operands locations Extends DWARF expression language to express locals/globals locations. (via target-index operands atm) (possible variants are: non-virtual registers or address spaces) The WebAssemblyExplicitLocals can replace virtual registers to targertindex operand type at the time when WebAssembly backend introduces {get,set,tee}_local instead of corresponding virtual registers. Reviewed By: aprantl, dschuff Tags: #debug-info, #llvm Differential Revision: https://reviews.llvm.org/D52634
*	Rename DW_AT_LLVM_isysroot to DW_AT_LLVM_sysroot	Adrian Prantl	2019-12-20	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \|	This is a purely cosmetic change that is NFC in terms of the binary output. I bugs me that I called the attribute DW_AT_LLVM_isysroot since the "i" is an artifact of GCC command line option syntax (-isysroot is in the category of -i options) and doesn't carry any useful information otherwise. This attribute only appears in Clang module debug info. Differential Revision: https://reviews.llvm.org/D71722
*	[OPT-DBG] Teach DbgEntityHistoryCalculator about meta-instructions.	Tom Weaver	2019-12-20	1	-1/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	The calculator was considering instructions such as KILLs as clobbers of a physical address. This is wrong as meta instructions such as KILLs produce no output in the final program and thus don't clobber or change any physical location's value. As a result they're safe to ignore whilst calculating location list ranges. reviewers: aprantl, vsk diff revision: https://reviews.llvm.org/D70497 fixes: https://bugs.llvm.org/show_bug.cgi?id=38753
*	[ARM][MVE] Fixes for tail predication.	Sam Parker	2019-12-20	1	-5/+41
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	1) Fix an issue with the incorrect value being used for the number of elements being passed to [d\|w]lstp. We were trying to check that the value was available at LoopStart, but this doesn't consider that the last instruction in the block could also define the register. Two helpers have been added to RDA for this. 2) Insert some code to now try to move the element count def or the insertion point so that we can perform more tail predication. 3) Related to (1), the same off-by-one could prevent us from generating a low-overhead loop when a mov lr could have been the last instruction in the block. 4) Fix up some instruction attributes so that not all the low-overhead loop instructions are labelled as branches and terminators - as this is not true for dls/dlstp. Differential Revision: https://reviews.llvm.org/D71609
*	[StackMaps] Be explicit about label formation [NFC] (try 2)	Philip Reames	2019-12-19	1	-11/+11
\| \| \| \| \| \|	Recommit after making the same API change in non-x86 targets. This has been build for all targets, and tested for effected ones. Why the difference? Because my disk filled up when I tried make check for all. For auto-padding assembler support, we'll need to bundle the label with the instructions (nops or call sequences) so that they don't get separated. This just rearranges the code to make the upcoming change more obvious.
*	Temporarily Revert "[StackMaps] Be explicit about label formation [NFC]"	Eric Christopher	2019-12-19	1	-11/+11
\| \| \| \| \| \|	as it broke the aarch64 build. This reverts commit bc7595d934b958ab481288d7b8e768fe5310be8f.
*	[StackMaps] Be explicit about label formation [NFC]	Philip Reames	2019-12-19	1	-11/+11
\| \| \| \|	For auto-padding assembler support, we'll need to bundle the label with the instructions (nops or call sequences) so that they don't get separated. This just rearranges the code to make the upcoming change more obvious.
*	[FaultMaps] Make label formation a bit more explicit [NFC]	Philip Reames	2019-12-19	1	-3/+1
\| \| \| \|	This is in advance of assembler padding directives support where we'll need to bundle the label w/the corresponding faulting instruction to avoid padding being inserted between.
*	[LegalizeDAG] Add return to the strict node handling in ↵	Craig Topper	2019-12-19	1	-0/+1
\| \| \| \|	PromoteLegalINT_TO_FP to prevent an invalid strict fp node from being created by falling into non-strict code path.
*	Make more use of MachineInstr::mayLoadOrStore.	Jay Foad	2019-12-19	3	-4/+4
\|
*	Enable STRICT_FP_TO_SINT/UINT on X86 backend	Liu, Chen3	2019-12-19	1	-5/+26
\| \| \| \| \| \|	This patch is mainly for custom lowering the vector operation. Differential Revision: https://reviews.llvm.org/D71592
*	DebugInfo: Include DW_AT_base_addr even in gmlt with no inline functions	David Blaikie	2019-12-18	1	-1/+1
\| \| \| \| \| \| \| \| \|	Since the address pool doesn't get populated in this case (due to the lack of inlining, no child DIEs are added to the CU - so no addresses are needed for the DIEs themselves) until the range list is emitted - at the time the attributes are added to the CU, the address pool is empty. So check whether the address pool will be used for the range lists & add an addr_base if that's the case.
*	Reapply "NFC: DebugInfo: Refactor RangeSpanList to be a struct, like ↵	David Blaikie	2019-12-18	4	-19/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	DebugLocStream::List" Move these data structures closer together so their emission code can eventually share more of its implementation. Was an egregious bug (completely untested, evidently) where I hadn't inverted a DWARFv5 test as needed, so it was doing the exact opposite of what was required & thus tried to emit a DWARFv5 range list header in DWARFv4. Reapply 8e04896288d22ed8bef7ac367923374f96b753d6 which was reverted in a8154e5e0c83d2f0f65f3b4fb1a0bc68785bd975.
*	[FPEnv] Strict versions of llvm.minimum/llvm.maximum	Ulrich Weigand	2019-12-18	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \|	Add new intrinsics llvm.experimental.constrained.minimum llvm.experimental.constrained.maximum as strict versions of llvm.minimum and llvm.maximum. Includes SystemZ back-end support. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D71624
*	[SelectionDAGBuilder] Use getConstant instead of getTargetConstant to build ↵	Craig Topper	2019-12-18	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \|	the offset for struct types in getUniformBase. getTargetConstant prevents any optimizations from operating on the value and basically says its already been iseled. But since we want the index to be in a register, this isn't true. Prior to this we were generating a vbroadcast with an immediate argument which is illegal and was flagged by the expensive checks bot.
*	Reapply: [DebugInfo] Correctly handle salvaged casts and split fragments at ISel	stozer	2019-12-18	1	-2/+19
\| \| \| \| \| \| \| \|	This reverts commit 1f3dd83cc1f2b8f72b9d59e2b4221b12fb7f9a95, reapplying commit bb1b0bc4e57428ce364d3d6c075ff03cb8973462. The original commit failed on some builds seemingly due to the use of a bracketed constructor with an std::array, i.e. `std::array<> arr({...})`.
*	[gicombiner] Import tryCombineIndexedLoadStore()	Daniel Sanders	2019-12-18	1	-12/+27
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Now that arbitrary data is supported, import tryCombineIndexedLoadStore() Depends on D69147 Reviewers: bogner, volkan Reviewed By: volkan Subscribers: hiraditya, arphaman, Petar.Avramovic, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69151
*	Revert "[DebugInfo] Correctly handle salvaged casts and split fragments at ISel"	stozer	2019-12-18	1	-19/+2
\| \| \| \| \| \|	Reverted due to build failure on windows bots. This reverts commit bb1b0bc4e57428ce364d3d6c075ff03cb8973462.
*	[DebugInfo] Correctly handle salvaged casts and split fragments at ISel	stozer	2019-12-18	1	-2/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Previously, LLVM had no functional way of performing casts inside of a DIExpression(), which made salvaging cast instructions other than Noop casts impossible. This patch enables the salvaging of casts by using the DW_OP_LLVM_convert operator for SExt and Trunc instructions. There is another issue which is exposed by this fix, in which fragment DIExpressions (which are preserved more readily by this patch) for values that must be split across registers in ISel trigger an assertion, as the 'split' fragments extend beyond the bounds of the fragment DIExpression causing an error. This patch also fixes this issue by checking the fragment status of DIExpressions which are to be split, and dropping fragments that are invalid.
*	[AArch64] Enable clustering memory accesses to fixed stack objects	Jay Foad	2019-12-18	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: r347747 added support for clustering mem ops with FI base operands including support for fixed stack objects in shouldClusterFI, but apparently this was never tested. This patch fixes shouldClusterFI to work with scaled as well as unscaled load/store instructions, and fixes the ordering of memory ops in MemOpInfo::operator< to ensure that memory addresses always increase, regardless of which direction the stack grows. Subscribers: MatzeB, kristof.beyls, hiraditya, javed.absar, arphaman, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D71334