| Commit message | Author | Age | Files | Lines |
| |
chain in SoftenFloatRes_LOAD.
I believe this is a leftover from when fp128 was softened to fp128
on X86-64. In that case type legalization must have been able to
create a load that was the same as N which would make this
replacement fail or assert. Since we no longer do that, this
check should be unneeded.
| |
This is a rebase of the change over D70376, which fixes an LVI cache
invalidation issue that also affected this patch.
-----
Related to D69686. As noted there, LVI currently behaves differently
for integer and pointer values: For integers, the block value is always
valid inside the basic block, while for pointers it is only valid at
the end of the basic block. I believe the integer behavior is the
correct one, and CVP relies on it via its getConstantRange() uses.
The reason for the special pointer behavior is that LVI checks whether
a pointer is dereferenced in a given basic block and marks it as
non-null in that case. Of course, this information is valid only after
the dereferencing instruction, or in conservative approximation,
at the end of the block.
This patch changes the treatment of dereferenceability: instead of
including it inside the block value, we treat it as something
similar to an assume (it essentially is a non-nullness assume) and
incorporate this information in intersectAssumeOrGuardBlockValueConstantRange()
if the context instruction is the terminator of the basic block.
This happens either when determining an edge-value internally in LVI,
or when a terminator was explicitly passed to getValueAt(). The latter
case makes this change not fully NFC, because we can now fold
terminator icmps based on the dereferenceability information in the
same block. This is why I changed one JumpThreading test
(it would optimize the condition away without the change).
Of course, we do not want to recompute dereferenceability on each
intersectAssume call, so we need a new cache for this. The
dereferenceability analysis requires walking the entire basic block
and computing underlying objects of all memory operands. This was
previously done separately for each queried pointer value. In the
new implementation (both because this makes the caching simpler,
and because it is faster), I instead only walk the full BB once and
cache all the dereferenced pointers. So the traversal is now performed
only once per BB, instead of once per queried pointer value.
I think the overall model now makes more sense than before, and there
will be no more pitfalls due to differing integer/pointer behavior.
Differential Revision: https://reviews.llvm.org/D69914
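To illustrate the single-walk caching idea in isolation, here is a minimal C++ sketch of collecting every pointer that loads and stores dereference in a basic block in one pass; the helper name and the plain pointer set are illustrative assumptions, not the actual LVI cache (which also accounts for when the information is valid).

#include "llvm/ADT/SmallPtrSet.h"
#include "llvm/IR/BasicBlock.h"
#include "llvm/IR/Instructions.h"
using namespace llvm;

// Walk the block once and record the (cast-stripped) pointers that loads and
// stores dereference, so later non-null queries can be answered from the set.
static void collectDereferencedPointers(const BasicBlock &BB,
                                        SmallPtrSetImpl<const Value *> &Ptrs) {
  for (const Instruction &I : BB) {
    if (const auto *LI = dyn_cast<LoadInst>(&I))
      Ptrs.insert(LI->getPointerOperand()->stripPointerCasts());
    else if (const auto *SI = dyn_cast<StoreInst>(&I))
      Ptrs.insert(SI->getPointerOperand()->stripPointerCasts());
  }
}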
| |
Summary: Coaleascer should be coalescer.
Reviewers: qcolombet, Jim
Reviewed By: Jim
Subscribers: Jim, kristof.beyls, hiraditya, llvm-commits
Tags: #llvm
Differential Revision: https://reviews.llvm.org/D70731
| |
like DebugLocStream::List"
as it was causing bot and build failures.
This reverts commit 8e04896288d22ed8bef7ac367923374f96b753d6.
| |
Move these data structures closer together so their emission code can
eventually share more of its implementation.
| |
(except for v4 loclists, which are sufficiently different to not fit
well in this generic implementation)
In subsequent patches I intend to refactor the DebugLoc and ranges data
structures to be more similar so I can common more of the implementation
here.
| |
Summary:
Support alloca-referencing dbg.value in hwasan instrumentation.
Update AsmPrinter to emit DW_AT_LLVM_tag_offset when the location is in
loclist format.
Reviewers: pcc
Subscribers: srhines, aprantl, hiraditya, llvm-commits
Tags: #llvm
Differential Revision: https://reviews.llvm.org/D70753
| |
Summary:
Add pattern matching for the following instructions:
- add, sub, subr, sqadd, sqsub, uqadd, uqsub
This patch required complex patterns to match the immediate with an optional left shift.
I re-used the Select function from the other SVE repo to implement the complex pattern.
I plan on doing another patch to also match constant vectors of the same immediate.
Reviewers: sdesmalen, huntergr, rengolin, efriedma, c-rhodes, mgudim, kmclaughlin
Subscribers: tschuett, kristof.beyls, hiraditya, rkruppe, psnobl, llvm-commits, amehsan
Tags: #llvm
Differential Revision: https://reviews.llvm.org/D71370
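As a rough illustration of what "immediate with an optional left shift" means for these instructions, here is a standalone C++ sketch of the encoding check (an unsigned 8-bit value, optionally shifted left by 8); the helper is hypothetical and not the complex-pattern Select code itself.

#include <cstdint>
#include <optional>

struct ShiftedImm {
  uint8_t Imm;    // 8-bit payload
  unsigned Shift; // 0 or 8
};

// Returns the (imm8, shift) pair if Val is encodable, std::nullopt otherwise.
static std::optional<ShiftedImm> matchAddSubImm(uint64_t Val) {
  if (Val <= 0xff)
    return ShiftedImm{static_cast<uint8_t>(Val), 0};
  if ((Val & 0xff) == 0 && (Val >> 8) <= 0xff)
    return ShiftedImm{static_cast<uint8_t>(Val >> 8), 8};
  return std::nullopt;
}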
| |
When we reason about the pointer argument that is byval we actually
reason about a local copy of the value passed at the call site. This was
not the case before and we wrongly introduced attributes based on the
surrounding function.
AAMemoryBehaviorArgument, AAMemoryBehaviorCallSiteArgument and
AANoCaptureCallSiteArgument are made aware of byval now. The code
to skip "subsuming positions" for reasoning follows a common pattern and
we should refactor it. A TODO was added.
Discovered by @efriedma as part of D69748.
| |
Removed code duplication in ThreadCmpOverSelect and broke it
into several smaller functions so they can be reused.
Differential Revision: https://reviews.llvm.org/D71158
| |
This reverts commit 8963332c3327daa652ba3e26d35f9109b6991985.
There was a typo causing a logic bug in this code, but it wasn't visible in the asm for the tests.
| |
This fold is done in IR by instcombine, and we have a special
form of it already here in DAGCombiner, but we want the more
general transform too:
https://rise4fun.com/Alive/3jZm
Name: general
Pre: (C1 + zext(C2) < 64)
%s = lshr i64 %x, C1
%t = trunc i64 %s to i16
%r = lshr i16 %t, C2
=>
%s2 = lshr i64 %x, C1 + zext(C2)
%a = and i64 %s2, zext((1 << (16 - C2)) - 1)
%r = trunc %a to i16
Name: special
Pre: C1 == 48
%s = lshr i64 %x, C1
%t = trunc i64 %s to i16
%r = lshr i16 %t, C2
=>
%s2 = lshr i64 %x, C1 + zext(C2)
%r = trunc %s2 to i16
...because D58017 exposes a regression without this fold.
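As a quick standalone sanity check of the general transform's arithmetic (not the DAGCombiner code; the constants are arbitrary values satisfying the precondition):

#include <cassert>
#include <cstdint>

int main() {
  uint64_t X = 0x123456789abcdef0ULL;
  unsigned C1 = 20, C2 = 3; // precondition: C1 + zext(C2) < 64
  // Original pattern: lshr i64, trunc to i16, lshr i16.
  uint16_t Before = static_cast<uint16_t>(static_cast<uint16_t>(X >> C1) >> C2);
  // Transformed pattern: single wide lshr, mask, then trunc.
  uint64_t Mask = (1ULL << (16 - C2)) - 1; // zext((1 << (16 - C2)) - 1)
  uint16_t After = static_cast<uint16_t>((X >> (C1 + C2)) & Mask);
  assert(Before == After);
  return 0;
}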
| |
Summary:
This adds support for embedding bitcode in a binary during LTO. libLTO gains support for the `-lto-embed-bitcode` flag. The option allows users of the LTO library to embed a bitcode section. For example, LLD can pass the option via `ld.lld -mllvm=-lto-embed-bitcode`.
This feature allows doing something comparable to `clang -c -fembed-bitcode`, but at the (LTO) linker level. Having bitcode alongside native code has many use cases. To give an example, the macOS linker can create a `-bitcode_bundle` section containing bitcode. Also, having this feature built into LLVM is an alternative to third-party tools such as [[ https://github.com/travitch/whole-program-llvm | wllvm ]] or [[ https://github.com/SRI-CSL/gllvm | gllvm ]]. As with these tools, this feature simplifies creating "whole-program" llvm bitcode files, but in contrast to wllvm/gllvm it does not rely on a specific llvm frontend/driver.
Patch by Josef Eisl <josef.eisl@oracle.com>
Reviewers: #llvm, #clang, rsmith, pcc, alexshap, tejohnson
Reviewed By: tejohnson
Subscribers: tejohnson, mehdi_amini, inglorion, hiraditya, aheejin, steven_wu, dexonsmith, dang, cfe-commits, llvm-commits, #llvm, #clang
Tags: #clang, #llvm
Differential Revision: https://reviews.llvm.org/D68213
| |
This patch renames the LoopInfo::isRotated() method to LoopInfo::isRotatedForm()
to make it clear that the method checks whether the loop is in rotated form, not
whether the loop has been rotated by the LoopRotation pass.
| |
Any llvm function with the "packed-stack" attribute will be compiled to use
the packed stack layout which reuses unused parts of the incoming register
save area. This is needed for building the Linux kernel.
Review: Ulrich Weigand
https://reviews.llvm.org/D70821
| |
This is not quite NFC because I changed the SDLoc to use the more
standard 'N' (the starting node for the fold).
This transform is a special case of a more general fold that we
do in IR, but it seems like the general fold is needed here too
to avoid a potential regression seen in D58017.
https://rise4fun.com/Alive/3jZm
| |
In order to use assumptions, computeKnownBits needs a context
instruction. We can use the GEP, if it is an instruction. We already
pass the assumption cache, but it cannot be used without a context
instruction.
Reviewers: anemet, asbirlea, hfinkel, spatel
Reviewed By: asbirlea
Differential Revision: https://reviews.llvm.org/D71264
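A minimal sketch of the idea, assuming the usual computeKnownBits overload from llvm/Analysis/ValueTracking.h; the helper itself is hypothetical and not the code touched by this patch.

#include "llvm/Analysis/AssumptionCache.h"
#include "llvm/Analysis/ValueTracking.h"
#include "llvm/IR/DataLayout.h"
#include "llvm/IR/Instruction.h"
using namespace llvm;

// Use the GEP as the context instruction so assumptions can be applied; a
// constant-expression GEP is not an Instruction, so the context stays null.
static KnownBits knownBitsForIndex(const Value *Index, const Value *GEP,
                                   const DataLayout &DL, AssumptionCache *AC) {
  const Instruction *CtxI = dyn_cast<Instruction>(GEP);
  return computeKnownBits(Index, DL, /*Depth=*/0, AC, CtxI);
}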
| |
This is the first patch adding an initial set of matrix intrinsics and a
corresponding lowering pass. This has been discussed on llvm-dev:
http://lists.llvm.org/pipermail/llvm-dev/2019-October/136240.html
The first patch introduces four new intrinsics (transpose, multiply,
columnwise load and store) and a LowerMatrixIntrinsics pass, that
lowers those intrinsics to vector operations.
Matrices are embedded in a 'flat' vector (e.g. a 4 x 4 float matrix
embedded in a <16 x float> vector) and the intrinsics take the dimension
information as parameters. Those parameters need to be ConstantInt.
For the memory layout, we initially assume column-major, but in the RFC
we also described how to extend the intrinsics to support row-major as
well.
For the initial lowering, we split the input of the intrinsics into a
set of column vectors, transform those column vectors and concatenate
the result columns to a flat result vector.
This allows us to lower the intrinsics without any shape propagation, as
mentioned in the RFC. In follow-up patches, we plan to submit the
following improvements:
* Shape propagation to eliminate the embedding/splitting for each
intrinsic.
* Fused & tiled lowering of multiply and other operations.
* Optimization remarks highlighting matrix expressions and costs.
* Generate loops for operations on large matrices.
* More general block processing for operations on large vectors,
exploiting shape information.
We would like to add dedicated transpose, columnwise load and store
intrinsics, even though they are not strictly necessary. For example, we
could emit a large shufflevector instruction instead of the
transpose. But we expect that to
(1) become unwieldy for larger matrices (even for 16x16 matrices,
the resulting shufflevector masks would be huge),
(2) risk instcombine making small changes, causing us to fail to
detect the transpose, preventing better lowerings.
For the load/store, we are additionally planning on exploiting the
intrinsics for better alias analysis.
Reviewers: anemet, Gerolf, reames, hfinkel, andrew.w.kaylor, efriedma, rengolin
Reviewed By: anemet
Differential Revision: https://reviews.llvm.org/D70456
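To make the column-major flat-vector embedding concrete, here is a small illustrative C++ sketch of the indexing and of splitting the flat vector into column vectors; this is plain scalar code for exposition, not the actual lowering.

#include <cassert>
#include <cstddef>
#include <vector>

// Element (Row, Col) of an R x C column-major matrix embedded in a flat vector.
static std::size_t flatIndex(std::size_t Row, std::size_t Col, std::size_t R) {
  return Col * R + Row;
}

// Split the flat embedding into C column vectors of length R.
static std::vector<std::vector<float>>
splitColumns(const std::vector<float> &Flat, std::size_t R, std::size_t C) {
  assert(Flat.size() == R * C);
  std::vector<std::vector<float>> Cols(C, std::vector<float>(R));
  for (std::size_t Col = 0; Col < C; ++Col)
    for (std::size_t Row = 0; Row < R; ++Row)
      Cols[Col][Row] = Flat[flatIndex(Row, Col, R)];
  return Cols;
}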
| |
This adds ReachingDefAnalysis (RDA) to the VPTBlock pass, so that we can
reimplement findVCMPToFoldIntoVPS with just a few calls to RDA.
Differential Revision: https://reviews.llvm.org/D71330
| |
Summary:
This patch is part of a series to introduce an Alignment type.
See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html
See this patch for the introduction of the type: https://reviews.llvm.org/D64790
Reviewers: courbet
Subscribers: hiraditya, cfe-commits, llvm-commits
Tags: #clang, #llvm
Differential Revision: https://reviews.llvm.org/D71420
| |
Reviewers: arsenm, nhaehnle
Reviewed By: arsenm
Subscribers: merge_guards_bot, kzhuravl, jvesely, wdng, yaxunl, dstuttard, tpr, t-tye, hiraditya, llvm-commits
Tags: #llvm
Differential Revision: https://reviews.llvm.org/D71044
| |
Recommit e0b966643fc2. sub instructions were being generated for the
negated value, and for some reason they were the register-only ones.
I think the problem was that I was grabbing the 'zero' from
vmovimm, which is a target constant. Now I'm generating a new
Constant zero instead, so rsb instructions are generated.
Original commit message:
The shift amount operand can be provided in a general purpose
register so sink it. Flip the vdup and negate so the existing
patterns can be used for matching.
Differential Revision: https://reviews.llvm.org/D70841
| |
This helps delineate it from later tables or other output.
Reviewed by: JDevlieghere
Differential Revision: https://reviews.llvm.org/D71344
| |
`AAHeapToStackImpl::updateImpl`
Summary: Remove `Worklist` iteration and make use of `checkForAllUses`. There is no test change.
Reviewers: sstefan1, jdoerfert
Reviewed By: jdoerfert
Subscribers: hiraditya, llvm-commits
Tags: #llvm
Differential Revision: https://reviews.llvm.org/D71352
| |
Summary: Refactoring `AANoFreeArgument::updateImpl`. There is no test change.
Reviewers: sstefan1, jdoerfert
Reviewed By: sstefan1
Subscribers: hiraditya, llvm-commits
Tags: #llvm
Differential Revision: https://reviews.llvm.org/D71349
| |
During SelectionDAG, if a value which is associated with a DBG_VALUE
needs to be split across multiple registers, the DBG_VALUE will be split
into a set of fragment expressions to recreate the original value.
If one or more of these fragments cannot be created, they would
previously be silently dropped, causing the old debug value to live past
its expiry date. This patch fixes this issue by keeping invalid
fragments while setting their value as Undef.
Differential revision: https://reviews.llvm.org/D70248
| |
http://lab.llvm.org:8011/builders/clang-ppc64be-linux/builds/41755
| |
This makes TimeTraceProfilerInstance thread local. Added
timeTraceProfilerFinishThread() which moves the thread local instance to
a global vector of instances. timeTraceProfilerWrite() then writes
recorded data from all instances.
Threads are identified based on their thread ids. Totals are reported
with artificial thread ids higher than the real ones.
Replaced raw pointer for TimeTraceProfilerInstance with unique_ptr.
Differential Revision: https://reviews.llvm.org/D71059
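A hedged usage sketch of the per-thread flow described above; the exact function signatures (in particular the ProcName argument to timeTraceProfilerInitialize) are assumptions based on llvm/Support/TimeProfiler.h and may differ across versions.

#include "llvm/Support/TimeProfiler.h"
#include "llvm/Support/raw_ostream.h"
#include <system_error>
#include <thread>
using namespace llvm;

static void workerThread() {
  // Each thread gets its own thread-local profiler instance.
  timeTraceProfilerInitialize(/*TimeTraceGranularity=*/500, "worker");
  {
    TimeTraceScope Scope("do work");
    // ... thread work ...
  }
  // Move this thread's data into the global list of instances.
  timeTraceProfilerFinishThread();
}

int main() {
  timeTraceProfilerInitialize(/*TimeTraceGranularity=*/500, "main");
  std::thread T(workerThread);
  T.join();
  std::error_code EC;
  raw_fd_ostream OS("trace.json", EC);
  timeTraceProfilerWrite(OS); // Writes data recorded by all instances.
  timeTraceProfilerCleanup();
}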
| |
In order to properly implement these atomics we need one more register than for
other binary atomics. It is used to store the result of comparing the values,
in addition to the one that is used for the actual result of the operation.
https://reviews.llvm.org/D71028
| |
pointers are assumed to be the same."
This reverts commit 5f6208778ff92567c57d7c1e2e740c284d7e69a5.
This caused failures in Transforms/PhaseOrdering/scev-custom-dl.ll
const: Assertion `getBitWidth() == CR.getBitWidth() && "ConstantRange types don't agree!"' failed.
| |
be the same.
The GEP index size can be specified in the DataLayout, introduced in D42123. However, there were still places
in which getIndexSizeInBits was used interchangeably with getPointerSizeInBits. This notably caused issues
with InstCombine's visitPtrToInt, but the unit test was incorrect, so this remained undiscovered.
Differential Revision: https://reviews.llvm.org/D68328
Patch by Joseph Faulls!
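A minimal sketch of the distinction, assuming the DataLayout accessors named above; the helper is illustrative only.

#include "llvm/IR/DataLayout.h"
#include "llvm/IR/Type.h"
using namespace llvm;

// The width of GEP offset arithmetic for a pointer's address space comes from
// getIndexSizeInBits, which may be narrower than getPointerSizeInBits on some
// targets.
static unsigned gepOffsetWidth(const DataLayout &DL, Type *PtrTy) {
  unsigned AS = PtrTy->getPointerAddressSpace();
  return DL.getIndexSizeInBits(AS); // not DL.getPointerSizeInBits(AS)
}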
| |
Summary: Also cleans up ZPR register class definition.
Reviewers: sdesmalen, cameron.mcinally, efriedma
Reviewed By: efriedma
Subscribers: tschuett, kristof.beyls, hiraditya, rkruppe, psnobl,
llvm-commits
Tags: #llvm
Differential Revision: https://reviews.llvm.org/D71351
| |
Making all externally unused methods private in MIRVRegNamerUtils.h.
Moving or deleting a couple of other methods.
| |
children.
This patch adds a check to DWARFVerifier that a skeleton
compilation unit does not have children.
Differential Revision: https://reviews.llvm.org/D71244
| |
This reverts commit e0b966643fc2030442ffbae9b677247be697673b.
Instruction selection is failing with expensive checks.
| |
The shift amount operand can be provided in a general purpose
register so sink it. Flip the vdup and negate so the existing
patterns can be used for matching.
Differential Revision: https://reviews.llvm.org/D70841
| |
Summary: AutoFDO compilation has two places that do inlining: the sample profile loader, which does inlining with a context-sensitive profile, and the regular inliner as a CGSCC pass. Ideally we want most inlining to come from the sample profile loader, as that is driven by the context-sensitive profile and also retains context sensitivity after inlining. However, the reality is that most of the inlining actually happens in the regular inliner. To track the number of inline instances from the sample profile loader and help move more inlining to it, I'm adding statistics and optimization remarks for the sample profile loader's inlining.
Reviewers: wmi, davidxl
Subscribers: hiraditya, llvm-commits
Tags: #llvm
Differential Revision: https://reviews.llvm.org/D70584
| |
No more hash collisions for memoperands. Now the MIRCanonicalization
pass shouldn't hit hash collisions when dealing with nearly identical
memory accessing instructions when their memoperands are in fact different.
Differential Revision: https://reviews.llvm.org/D71328
| |
This patch matches scalable vector selects to predicated move instructions.
Differential Revision: https://reviews.llvm.org/D71298
| |
This has two main effects:
- Optimizes debug info size by saving 221.86 MB of obj file size in a
Windows optimized+debug build of 'all'. This is 3.03% of 7,332.7MB of
object file size.
- Incremental step towards decoupling target intrinsics.
The enums are still compact, so adding and removing a single
target-specific intrinsic will trigger a rebuild of all of LLVM.
Assigning distinct target id spaces is potential future work.
Part of PR34259
Reviewers: efriedma, echristo, MaskRay
Reviewed By: echristo, MaskRay
Differential Revision: https://reviews.llvm.org/D71320
| |
Soon Intrinsic::ID will be a plain integer, so this overload will not be
possible.
Rename both overloads to ensure that downstream targets observe this as
a build failure instead of a runtime failure.
Split off from D71320
Reviewers: efriedma
Differential Revision: https://reviews.llvm.org/D71381
| |
This reverts commit 30038da15b18ac4e34b9ea7a648382ae481e4770. It causes
the stage2 thinLTO bot to fail with:
Assertion failed: (CU.getDIE(CalleeSP) && "Expected declaration subprogram DIE for callee")
rdar://57840415
| |
from getNegatedExpression()"
This reverts commit d1f0bdf2d2df9bdf11ee2ddfff3df50e53f2f042.
The patch can cause infinite loops in DAGCombiner.
| |
already being promoted above the check for fp16 converting to something other than fp32.
An fp16 conversion to something larger than fp32 inserts an extend that
needs to be re-legalized if fp16 is promoted. But if we check for fp16
promotion first, we can avoid emitting the fp_extend altogether.
| |
This is the initial patch for the OpenMP-IR-Builder, as discussed on the
mailing list ([1] and later) and at the US Dev Meeting'19.
The design is similar to D61953 but:
- in a non-WIP status, with proper documentation and working.
- using an OpenMPKinds.def file to manage lists of directives, runtime
functions, types, ..., similar to the current Clang implementation.
- restricted to handle only (simple) barriers, to implement most
`#pragma omp barrier` directives and most implicit barriers.
- properly hooked into Clang to be used if possible (D69922).
- compatible with the remaining code generation.
Parts have been extracted into D69853.
The plan is to have multiple people working on moving logic from Clang
here once the initial scaffolding (=this patch) landed.
[1] http://lists.flang-compiler.org/pipermail/flang-dev_lists.flang-compiler.org/2019-May/000197.html
Reviewers: kiranchandramohan, ABataev, RaviNarayanaswamy, gtbercea, grokos, sdmitriev, JonChesterfield, hfinkel, fghanim
Subscribers: mgorny, hiraditya, bollu, guansong, jfb, cfe-commits, llvm-commits, penzn, ppenzin
Tags: #clang, #llvm
Differential Revision: https://reviews.llvm.org/D69785
| |
export names
This is equivalent to the existing `import_name` and `import_module`
attributes which control the import names in the final wasm binary
produced by lld.
This maps the existing
This attribute currently requires a string rather than using the
symbol name for a couple of reasons:
1. Avoids confusion with static and dynamic linking, which is
based on symbol name. Exporting a function from a wasm module using
this directive is orthogonal to both static and dynamic linking.
2. Avoids name mangling.
Differential Revision: https://reviews.llvm.org/D70520
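For reference, a minimal usage sketch of the attribute described above when compiling for a WebAssembly target; the function and export names are made up.

// Exported from the final wasm binary under the name "answer", independent of
// the C/C++ symbol name used for static or dynamic linking.
__attribute__((export_name("answer")))
int get_answer(void) { return 42; }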