bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[SanitizerCoverage] Don't duplicate code to get section pointers	Jonathan Metzman	2018-10-16	1	-33/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Merge code used to get section start and section end pointers for SanitizerCoverage constructors. This includes code that handles getting the start pointers when targeting MSVC. Reviewers: kcc, morehouse Reviewed By: morehouse Subscribers: kcc, hiraditya Differential Revision: https://reviews.llvm.org/D53211 llvm-svn: 344657
*	[X86] Match (cmp (and (shr X, C), mask), 0) to BEXTR+TEST.	Craig Topper	2018-10-16	1	-15/+32
\| \| \| \| \| \| \| \| \| \|	Without this we match the CMP+AND to a TEST and then match the SHR separately. I'm trusting analyzeCompare to remove the TEST during the peephole pass. Otherwise we need to check the flag users to see if they only use the Z flag. This recovers a case lost by r344270. Differential Revision: https://reviews.llvm.org/D53310 llvm-svn: 344649
*	[InstCombine] Cleanup libfunc attribute inferring	David Bolvansky	2018-10-16	4	-56/+74
\| \| \| \| \| \| \| \| \| \| \| \|	Reviewers: efriedma Reviewed By: efriedma Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D53338 llvm-svn: 344645
*	[ORC] Make the VModuleKey optional, propagate it via MaterializationUnit and	Lang Hames	2018-10-16	10	-71/+70
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	MaterializationResponsibility. VModuleKeys are intended to enable selective removal of modules from a JIT session, however for a wide variety of use cases selective removal is not needed and introduces unnecessary overhead. As of this commit, the default constructed VModuleKey value is reserved as a "do not track" value, and becomes the default when adding a new module to the JIT. This commit also changes the propagation of VModuleKeys. They were passed alongside the MaterializationResponsibity instance in XXLayer::emit methods, but are now propagated as part of the MaterializationResponsibility instance itself (and as part of MaterializationUnit when stored in a JITDylib). Associating VModuleKeys with MaterializationUnits in this way should allow for a thread-safe module removal mechanism in the future, even when a module is in the process of being compiled, by having the MaterializationResponsibility object check in on its VModuleKey's state before commiting its results to the JITDylib. llvm-svn: 344643
*	Revert "[WebAssembly] LSDA info generation"	Krasimir Georgiev	2018-10-16	16	-264/+62
\| \| \| \| \| \| \| \|	This reverts commit r344575. Newly introduced test eh-lsda.ll.test fails with use-after-free under ASAN build. llvm-svn: 344639
*	[PATCH] [NFC][AArch64] Fix refactoring of macro fusion	Evandro Menezes	2018-10-16	1	-8/+4
\| \| \| \| \| \|	Fix compiler error. llvm-svn: 344632
*	[Intrinsic] Signed Saturation Addition Intrinsic	Leonard Chan	2018-10-16	10	-0/+114
\| \| \| \| \| \| \| \| \| \| \|	Add an intrinsic that takes 2 integers and perform saturation addition on them. This is a part of implementing fixed point arithmetic in clang where some of the more complex operations will be implemented as intrinsics. Differential Revision: https://reviews.llvm.org/D53053 llvm-svn: 344629
*	[NFC][ARM] Refactor macro fusion	Evandro Menezes	2018-10-16	1	-19/+5
\| \| \| \| \| \|	Simplify code for wildcards. llvm-svn: 344625
*	[NFC][AArch64] Refactor macro fusion	Evandro Menezes	2018-10-16	1	-76/+90
\| \| \| \| \| \|	Simplify API of checking functions. llvm-svn: 344624
*	[LV] Teach vectorizer about variant value store into uniform address	Anna Thomas	2018-10-16	3	-14/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Teach vectorizer about vectorizing variant value stores to uniform address. Similar to rL343028, we do not allow vectorization if we have multiple stores to the same uniform address. Cost model already has the change for considering the extract instruction cost for a variant value store. See added test cases for how vectorization is done. The patch also contains changes to the ORE messages. Reviewers: Ayal, mkuper, anemet, hsaito Subscribers: rkruppe, llvm-commits Differential Revision: https://reviews.llvm.org/D52656 llvm-svn: 344613
*	revert rL344609: [InstCombine] try harder to form select from logic ops	Sanjay Patel	2018-10-16	2	-41/+29
\| \| \| \| \| \| \| \|	I noticed a missing check and added it at rL344610, but there actually are codegen tests that will fail without that, so I'll edit those and submit a fixed patch with more tests. llvm-svn: 344612
*	[InstCombine] make sure type is integer before calling ComputeNumSignBits	Sanjay Patel	2018-10-16	1	-1/+2
\| \| \| \|	llvm-svn: 344610
*	[InstCombine] try harder to form select from logic ops	Sanjay Patel	2018-10-16	2	-29/+40
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is part of solving PR37549: https://bugs.llvm.org/show_bug.cgi?id=37549 The patterns shown here are a special case of something that we already convert to select. Using ComputeNumSignBits() catches that case (but not the more complicated motivating patterns yet). The backend has hooks/logic to convert back to logic ops if that's better for the target. llvm-svn: 344609
*	[LegalizeDAG] ExpandLegalINT_TO_FP - cleanup UINT_TO_FP i64 -> f64 expansion.	Simon Pilgrim	2018-10-16	1	-20/+17
\| \| \| \| \| \| \| \|	Use SrcVT/DestVT types, correct shift type and AND instead of ZERO_EXTEND_IN_REG. Part of prep work for D52965 llvm-svn: 344602
*	[NFC] Introduce ICFLoopSafetyInfo	Max Kazantsev	2018-10-16	1	-0/+31
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is an alternative implementation of LoopSafetyInfo that uses the implicit control flow tracking to give precise answers on queries "whether or not this block contains throwing instructions". This rules out false-positive answers on LoopSafetyInfo's queries. This patch only introduces the new implementation. It is not currently used in any pass. The enabling patches will go separately, through review. The plan is to completely replace all uses of LoopSafetyInfo with ICFLoopSafetyInfo in the future, but to avoid introducing functional problems, we will do it pass by pass. llvm-svn: 344601
*	[X86] Fix Skylake ReadAfterLd for PADDrm etc.	Simon Pilgrim	2018-10-16	2	-4/+8
\| \| \| \| \| \|	Missed in rL343868 as due to their custom InstrRW. llvm-svn: 344600
*	[NFC] Remove obsolete method headerMayThrow	Max Kazantsev	2018-10-16	1	-13/+2
\| \| \| \|	llvm-svn: 344596
*	[NFC] Make LoopSafetyInfo abstract to allow alternative implementations	Max Kazantsev	2018-10-16	5	-12/+12
\| \| \| \|	llvm-svn: 344592
*	[mips][micromips] Fix how values in .gcc_except_table are calculated	Aleksandar Beserminji	2018-10-16	3	-0/+15
\| \| \| \| \| \| \| \| \| \| \| \| \|	When a landing pad is calculated in a program that is compiled for micromips, it will point to an even address. Such an error will cause a segmentation fault, as the instructions in micromips are aligned on odd addresses. This patch sets the last bit of the offset where a landing pad is, to 1, which will effectively be an odd address and point to the instruction exactly. Differential Revision: https://reviews.llvm.org/D52985 llvm-svn: 344591
*	[NFC] Encapsulate work with BlockColors in LoopSafetyInfo	Max Kazantsev	2018-10-16	2	-10/+22
\| \| \| \|	llvm-svn: 344590
*	[DebugInfo][LCSSA] Rewrite pre-existing debug values outside loop	David Stenberg	2018-10-16	2	-0/+21
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Extend LCSSA so that debug values outside loops are rewritten to use the PHI nodes that the pass creates. This fixes PR39019. In that case, we ran LCSSA on a loop that was later on vectorized, which left us with something like this: for.cond.cleanup: %add.lcssa = phi i32 [ %add, %for.body ], [ %34, %middle.block ] call void @llvm.dbg.value(metadata i32 %add, ret i32 %add.lcssa for.body: %add = [...] br i1 %exitcond, label %for.cond.cleanup, label %for.body which later resulted in the debug.value becoming undef when removing the scalar loop (and the location would have probably been wrong for the vectorized case otherwise). As we now may need to query the AvailableVals cache more than once for a basic block, FindAvailableVals() in SSAUpdaterImpl is changed so that it updates the cache for blocks that we do not create a PHI node for, regardless of the block's number of predecessors. The debug value in the attached IR reproducer would not be properly rewritten without this. Debug values residing in blocks where we have not inserted any PHI nodes are currently left as-is by this patch. I'm not sure what should be done with those uses. Reviewers: mattd, aprantl, vsk, probinson Reviewed By: mattd, aprantl Subscribers: jmorse, gbedwell, JDevlieghere, llvm-commits Differential Revision: https://reviews.llvm.org/D53130 llvm-svn: 344589
*	[NFC] Move block throw check inside allLoopPathsLeadToBlock	Max Kazantsev	2018-10-16	1	-6/+10
\| \| \| \|	llvm-svn: 344588
*	[NFC] Turn isGuaranteedToExecute into a method	Max Kazantsev	2018-10-16	3	-12/+12
\| \| \| \|	llvm-svn: 344587
*	[SCEV] Limit AddRec "simplifications" to avoid combinatorial explosions	Max Kazantsev	2018-10-16	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	SCEV's transform that turns `{A1,+,A2,+,...,+,An}<L> * {B1,+,B2,+,...,+,Bn}<L>` into a single AddRec of size `2n+1` with complex combinatorial coefficients can easily trigger exponential growth of the SCEV (in case if nothing gets folded and simplified). We tried to restrain this transform using the option `scalar-evolution-max-add-rec-size`, but its default value seems to be insufficiently small: the test attached to this patch with default value of this option `16` has a SCEV of >3M symbols (when printed out). This patch reduces the simplification limit. It is not a cure to combinatorial explosions, but at least it reduces this corner case to something more or less reasonable. Differential Revision: https://reviews.llvm.org/D53282 Reviewed By: sanjoy llvm-svn: 344584
*	[WebAssembly] LSDA info generation	Heejin Ahn	2018-10-16	16	-62/+264
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This adds support for LSDA (exception table) generation for wasm EH. Wasm EH mostly follows the structure of Itanium-style exception tables, with one exception: a call site table entry in wasm EH corresponds to not a call site but a landing pad. In wasm EH, the VM is responsible for stack unwinding. After an exception occurs and the stack is unwound, the control flow is transferred to wasm 'catch' instruction by the VM, after which the personality function is called from the compiler-generated code. (Refer to WasmEHPrepare pass for more information on this part.) This patch: - Changes wasm.landingpad.index intrinsic to take a token argument, to make this 1:1 match with a catchpad instruction - Stores landingpad index info and catch type info MachineFunction in before instruction selection - Lowers wasm.lsda intrinsic to an MCSymbol pointing to the start of an exception table - Adds WasmException class with overridden methods for table generation - Adds support for LSDA section in Wasm object writer Reviewers: dschuff, sbc100, rnk Subscribers: mgorny, jgravelle-google, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D52748 llvm-svn: 344575
*	[X86] Remove some isel patterns that shouldn't be possible.	Craig Topper	2018-10-15	2	-6/+0
\| \| \| \| \| \|	These included a bitcast of a load from v4f32 to v2f64, but DAG combine should have already changed the type of the load to remove the cast. llvm-svn: 344573
*	[ORC] Rename ORC layers to make the "new" ORC layers the default.	Lang Hames	2018-10-15	10	-44/+44
\| \| \| \| \| \| \| \| \| \| \| \| \|	This commit adds a 'Legacy' prefix to old ORC layers and utilities, and removes the '2' suffix from the new ORC layers. If you wish to continue using the old ORC layers you will need to add a 'Legacy' prefix to your classes. If you were already using the new ORC layers you will need to drop the '2' suffix. The legacy layers will remain in-tree until the new layers reach feature parity with them. This will involve adding support for removing code from the new layers, and ensuring that performance is comperable. llvm-svn: 344572
*	[ORC] Rename MultiThreadedSimpleCompiler to ConcurrentIRCompiler.	Lang Hames	2018-10-15	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \|	The new name is a better fit: This class does not actually spawn any new threads for compilation, it is just safe to call from multiple threads concurrently. The "Simple" part of the name did not convey much either, so it was dropped. llvm-svn: 344567
*	Change a TerminatorInst* to an Instruction* in HotColdSplitting.cpp.	Lang Hames	2018-10-15	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \|	r344558 added an assignment to a TerminatorInst* from BasicBlock::getTerminatorInst(), but BasicBlock::getTerminatorInst() returns an Instruction* rather than a TerminatorInst* since r344504 so this fails to compile. Changing the variable to an Instruction* should get the bots building again. llvm-svn: 344566
*	[ORC] Switch to DenseMap/DenseSet for ORC symbol map/set types.	Lang Hames	2018-10-15	2	-29/+38
\| \| \| \|	llvm-svn: 344565
*	NFC: Fix a -Wsign-conversion warning	Erik Pilkington	2018-10-15	1	-5/+11
\| \| \| \|	llvm-svn: 344564
*	[X86] Fix a bad bitcast in the load form of vXi16 uniform shift patterns for ↵	Craig Topper	2018-10-15	1	-9/+10
\| \| \| \| \| \|	EVEX encoded instructions. llvm-svn: 344563
*	[hot-cold-split] fix static analysis of cold regions	Sebastian Pop	2018-10-15	1	-7/+41
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Make the code of blockEndsInUnreachable to match the function blockEndsInUnreachable in CodeGen/BranchFolding.cpp. I also have added a note to make sure the code of this function will not be modified unless the back-end version is also modified. An early return before outlining has been added to avoid outlining the full function body when the first block in the function is marked cold. The static analysis of cold code has been amended to avoid marking the whole function as cold by back-propagation because the back-propagation would mark blocks with return statements as cold. The patch adds debug statements to help discover these problems. Differential Revision: https://reviews.llvm.org/D52904 llvm-svn: 344558
*	[AARCH64] Improve vector popcnt lowering with ADDLP	Simon Pilgrim	2018-10-15	1	-12/+36
\| \| \| \| \| \| \| \| \| \|	AARCH64 equivalent to D53257 - uses widening pairwise adds on vXi8 CTPOP to support i16/i32/i64 vectors. This is a blocker for generic vector CTPOP expansion (P32655) - this will remove the aarch64 diff from D53258. Differential Revision: https://reviews.llvm.org/D53259 llvm-svn: 344554
*	AMDGPU: Generate .amdgcn_target for object code v3	Konstantin Zhuravlyov	2018-10-15	1	-3/+10
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D53221 llvm-svn: 344552
*	[CodeExtractor] Erase debug intrinsics in outlined thunks (fix PR22900)	Vedant Kumar	2018-10-15	1	-0/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Variable updates within the outlined function are invisible to debuggers. This could be improved by defining a DISubprogram for the new function. For the moment, simply erase the debug intrinsics instead. This fixes verifier failures about function-local metadata being used in the wrong function, seen while testing the hot/cold splitting pass. rdar://45142482 Differential Revision: https://reviews.llvm.org/D53267 llvm-svn: 344545
*	[SelectionDAG] allow FP binops in SimplifyDemandedVectorElts	Sanjay Patel	2018-10-15	1	-1/+6
\| \| \| \| \| \| \| \| \| \| \| \|	This is intended to make the backend on par with functionality that was added to the IR version of SimplifyDemandedVectorElts in: rL343727 ...and the original motivation is that we need to improve demanded-vector-elements in several ways to avoid problems that would be exposed in D51553. Differential Revision: https://reviews.llvm.org/D52912 llvm-svn: 344541
*	[DAGCombiner] allow undef elts in vector fmul matching	Sanjay Patel	2018-10-15	1	-1/+1
\| \| \| \|	llvm-svn: 344534
*	[DAGCombiner] refactor folds for fadd (fmul X, -2.0), Y; NFCI	Sanjay Patel	2018-10-15	1	-16/+18
\| \| \| \| \| \|	The transform doesn't work if the vector constant has undef elements. llvm-svn: 344532
*	[DAGCombiner] allow undef elts in vector fma matching	Sanjay Patel	2018-10-15	1	-21/+22
\| \| \| \|	llvm-svn: 344528
*	[DAGCombiner] allow undef elts in vector fma matching	Sanjay Patel	2018-10-15	1	-9/+10
\| \| \| \|	llvm-svn: 344525
*	Revert "[NewPM] teach -passes= to emit meaningful error messages"	Fedor Sergeev	2018-10-15	2	-218/+162
\| \| \| \| \| \|	This reverts r344519 due to failures in pipeline-parsing test. llvm-svn: 344524
*	[NewPM] teach -passes= to emit meaningful error messages	Fedor Sergeev	2018-10-15	2	-162/+218
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: All the PassBuilder::parse interfaces now return descriptive StringError instead of a plain bool. It allows to make -passes/aa-pipeline parsing errors context-specific and thus less confusing. TODO: ideally we should also make suggestions for misspelled pass names, but that requires some extensions to PassBuilder. Reviewed By: philip.pfaffe, chandlerc Differential Revision: https://reviews.llvm.org/D53246 llvm-svn: 344519
*	[mips][micromips] Fix overlaping FDEs error	Aleksandar Beserminji	2018-10-15	2	-0/+24
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	When compiling static executable for micromips, CFI symbols are incorrectly labeled as MICROMIPS, which cause ".eh_frame_hdr refers to overlapping FDEs." error. This patch does not label CFI symbols as MICROMIPS, and FDEs do not overlap anymore. This patch also exposes another bug, which is fixed here: https://reviews.llvm.org/D52985 Differential Revision: https://reviews.llvm.org/D52987 llvm-svn: 344516
*	[mips][micromips] Revert "Fix overlaping FDEs error"	Aleksandar Beserminji	2018-10-15	2	-24/+0
\| \| \| \| \| \|	This reverts r344511. llvm-svn: 344515
*	[ARM][NEON] Improve vector popcnt lowering with PADDL (PR39281)	Simon Pilgrim	2018-10-15	1	-130/+26
\| \| \| \| \| \| \| \| \| \|	As I suggested on PR39281, this patch uses PADDL pairwise addition to widen from the vXi8 CTPOP result to the target vector type. This is a blocker for moving more x86 code to generic vector CTPOP expansion (P32655 + D53258) - ARM's vXi64 CTPOP currently expands, which would generate a vXi64 MUL but ARM's custom lowering expands the general MUL case and vectors aren't well handled in LegalizeDAG - improving the CTPOP lowering was a lot easier than fixing the MUL lowering for this one case...... Differential Revision: https://reviews.llvm.org/D53257 llvm-svn: 344512
*	[mips][micromips] Fix overlaping FDEs error	Aleksandar Beserminji	2018-10-15	2	-0/+24
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	When compiling static executable for micromips, CFI symbols are incorrectly labeled as MICROMIPS, which cause ".eh_frame_hdr refers to overlapping FDEs." error. This patch does not label CFI symbols as MICROMIPS, and FDEs do not overlap anymore. This patch also exposes another bug, which is fixed here: https://reviews.llvm.org/D52985 Differential Revision: https://reviews.llvm.org/D52987 llvm-svn: 344511
*	[NewPM] implement SCC printing for -print-before-all/-print-after-all	Fedor Sergeev	2018-10-15	1	-4/+28
\| \| \| \| \| \| \| \| \| \| \| \|	Removing deficiency of initial implementation of -print-before-all/-after-all - it was effectively skipping IR printing for all the SCC passes. Now LazyCallGraph:SCC gets its IR printed. Reviewed By: skatkov Differential Revision: https://reviews.llvm.org/D53270 llvm-svn: 344505
*	[TI removal] Make `getTerminator()` return a generic `Instruction`.	Chandler Carruth	2018-10-15	4	-14/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This removes the primary remaining API producing `TerminatorInst` which will reduce the rate at which code is introduced trying to use it and generally make it much easier to remove the remaining APIs across the codebase. Also clean up some of the stragglers that the previous mechanical update of variables missed. Users of LLVM and out-of-tree code generally will need to update any explicit variable types to handle this. Replacing `TerminatorInst` with `Instruction` (or `auto`) almost always works. Most of these edits were made in prior commits using the perl one-liner: ``` perl -i -ple 's/TerminatorInst(\b.* = .*getTerminator\(\))/Instruction\1/g' ``` This also my break some rare use cases where people overload for both `Instruction` and `TerminatorInst`, but these should be easily fixed by removing the `TerminatorInst` overload. llvm-svn: 344504
*	[TI removal] Rework `InstVisitor` to support visiting instructions that	Chandler Carruth	2018-10-15	2	-24/+24
\| \| \| \| \| \| \| \| \|	are terminators without relying on the specific `TerminatorInst` type. This required cleaning up two users of `InstVisitor`s usage of `TerminatorInst` as well. llvm-svn: 344503