bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[llvm-readobj] Add experimental support for SHT_RELR sections	Jake Ehrlich	2018-06-28	2	-0/+129
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This change adds experimental support for SHT_RELR sections, proposed here: https://groups.google.com/forum/#!topic/generic-abi/bX460iggiKg Definitions for the new ELF section type and dynamic array tags, as well as the encoding used in the new section are all under discussion and are subject to change. Use with caution! Author: rahulchaudhry Differential Revision: https://reviews.llvm.org/D47919 llvm-svn: 335922
*	[InstCombine] fix opcode check in shuffle fold	Sanjay Patel	2018-06-28	1	-1/+1
\| \| \| \| \| \| \| \| \|	There's no way to expose this difference currently, but we should use the updated variable because the original opcodes can go stale if we transform into something new. llvm-svn: 335920
*	[COFF] Fix constant sharing regression for MinGW	Martin Storsjo	2018-06-28	1	-1/+4
\| \| \| \| \| \| \| \| \| \|	This fixes a regression since SVN r334523, where the object files built targeting MinGW were rejected by GNU binutils tools. Prior to that commit, we only put constants in comdat for MSVC configurations. Differential Revision: https://reviews.llvm.org/D48567 llvm-svn: 335918
*	[ThinLTO] Port InlinerFunctionImportStats handling to new PM	Teresa Johnson	2018-06-28	1	-0/+18
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The InlinerFunctionImportStats will collect and dump stats regarding how many function inlined into the module were imported by ThinLTO. Reviewers: wmi, dexonsmith Subscribers: mehdi_amini, inglorion, llvm-commits, eraman Differential Revision: https://reviews.llvm.org/D48729 llvm-svn: 335914
*	[NVPTX] Delete dead code	Benjamin Kramer	2018-06-28	5	-63/+0
\| \| \| \| \| \|	No functionality change. llvm-svn: 335913
*	[ARM] Add missing Thumb2 assembler diagnostics.	Eli Friedman	2018-06-28	1	-62/+116
\| \| \| \| \| \| \| \| \| \|	Mostly just adding checks for Thumb2 instructions which correspond to ARM instructions which already had diagnostics. While I'm here, also fix ARM-mode strd to check the input registers correctly. Differential Revision: https://reviews.llvm.org/D48610 llvm-svn: 335909
*	[SROA] Preserve DebugLoc when rewriting alloca partitions	Anastasis Grammenos	2018-06-28	1	-0/+2
\| \| \| \| \| \| \| \| \|	When rewriting an alloca partition copy the DL from the old alloca over the the new one. Differential Revision: https://reviews.llvm.org/D48640 llvm-svn: 335904
*	Add a flag to FileOutputBuffer that allows modification.	Zachary Turner	2018-06-28	2	-23/+57
\| \| \| \| \| \| \| \| \| \| \| \| \|	FileOutputBuffer creates a temp file and on commit atomically renames the temp file to the destination file. Sometimes we want to modify an existing file in place, but still have the atomicity guarantee. To do this we can initialize the contents of the temp file from the destination file (if it exists), that way the resulting FileOutputBuffer can have only selective bytes modified. Committing will then atomically replace the destination file as desired. llvm-svn: 335902
*	Remove unnecessary semicolon. NFCI.	Simon Pilgrim	2018-06-28	1	-2/+2
\| \| \| \| \| \|	Fixes -Wpedantic warning. llvm-svn: 335901
*	[X86] Suppress load folding into and/or/xor if it will prevent matching ↵	Craig Topper	2018-06-28	1	-0/+29
\| \| \| \| \| \| \| \| \| \|	btr/bts/btc. This is a follow up to r335753. At the time I forgot about isProfitableToFold which makes this pretty easy. Differential Revision: https://reviews.llvm.org/D48706 llvm-svn: 335895
*	Revert "Re-land r335297 "[X86] Implement more of x86-64 large and medium PIC ↵	Jonas Devlieghere	2018-06-28	6	-121/+30
\| \| \| \| \| \| \| \| \| \| \| \| \|	code models"" Reverting because this is causing failures in the LLDB test suite on GreenDragon. LLVM ERROR: unsupported relocation with subtraction expression, symbol '__GLOBAL_OFFSET_TABLE_' can not be undefined in a subtraction expression llvm-svn: 335894
*	[InstCombine] allow shl+mul combos with shuffle (select) fold (PR37806)	Sanjay Patel	2018-06-28	1	-5/+29
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is an enhancement to D48401 that was discussed in: https://bugs.llvm.org/show_bug.cgi?id=37806 We can convert a shift-left-by-constant into a multiply (we canonicalize IR in the other direction because that's generally better of course). This allows us to remove the shuffle as we do in the regular opcodes-are-the-same cases. This requires a small hack to make sure we don't introduce any extra poison: https://rise4fun.com/Alive/ZGv Other examples of opcodes where this would work are add+sub and fadd+fsub, but we already canonicalize those subs into adds, so there's nothing to do for those cases AFAICT. There are planned enhancements for opcode transforms such or -> add. Note that there's a different fold needed if we've already managed to simplify away a binop as seen in the test based on PR37806, but we manage to get that one case here because this fold is positioned above the demanded elements fold currently. Differential Revision: https://reviews.llvm.org/D48485 llvm-svn: 335888
*	[MachineOutliner] Define MachineOutliner support in TargetOptions	Jessica Paquette	2018-06-28	6	-15/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Targets should be able to define whether or not they support the outliner without the outliner being added to the pass pipeline. Before this, the outliner pass would be added, and ask the target whether or not it supports the outliner. After this, it's possible to query the target in TargetPassConfig, before the outliner pass is created. This ensures that passing -enable-machine-outliner will not modify the pass pipeline of any target that does not support it. https://reviews.llvm.org/D48683 llvm-svn: 335887
*	[DAGCombiner] Ensure we use the correct CC result type in visitSDIV (REAPPLIED)	Simon Pilgrim	2018-06-28	1	-5/+6
\| \| \| \| \| \| \| \| \| \|	We could get away with it for constant folded cases, but not for rL335719. Thanks to Krzysztof Parzyszek for noticing. Reapply original commit rL335821 which was reverted at rL335871 due to a WebAssembly bug that was fixed at rL335884. llvm-svn: 335886
*	[WebAssembly] Add getSetCCResultType placeholder override to handle vector ↵	Simon Pilgrim	2018-06-28	2	-0/+12
\| \| \| \| \| \| \| \|	compare results. Necessary to get the rL335821 bugfix (which was reverted at rL335871) un-reverted. llvm-svn: 335884
*	Revert "[MachineOutliner] Add always and never options to ↵	Jessica Paquette	2018-06-28	1	-12/+4
\| \| \| \| \| \| \| \| \|	-enable-machine-outliner" I accidentally committed this instead of D48683 because I haven't had coffee yet. llvm-svn: 335883
*	Revert "[MachineOutliner] Never add the outliner in -O0"	Jessica Paquette	2018-06-28	1	-2/+1
\| \| \| \| \| \| \| \| \| \| \|	This reverts commit 9c7c10e4073a0bc6a759ce5cd33afbac74930091. It relies on r335872 since that introduces the machine outliner flags test. I meant to commit D48683 in that commit, but got mixed up and committed D48682 instead. So, I'm reverting this and r335872, since D48682 hasn't made it through review yet. llvm-svn: 335882
*	[MachineOutliner] Never add the outliner in -O0	Jessica Paquette	2018-06-28	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \|	We shouldn't add the outliner when compiling at -O0 even if -enable-machine-outliner is passed in. This makes sure that we don't add it in this case. This also updates machine-outliner-flags to reflect the change and improves the comment describing what that test does. llvm-svn: 335879
*	SelectionDAGBuilder, mach-o: Skip trap after noreturn call (for Mach-O)	Matthias Braun	2018-06-28	4	-6/+26
\| \| \| \| \| \| \| \| \| \| \| \| \|	Add NoTrapAfterNoreturn target option which skips emission of traps behind noreturn calls even if TrapUnreachable is enabled. Enable the feature on Mach-O to save code size; Comments suggest it is not possible to enable it for the other users of TrapUnreachable. rdar://41530228 DifferentialRevision: https://reviews.llvm.org/D48674 llvm-svn: 335877
*	[MachineOutliner] Add always and never options to -enable-machine-outliner	Jessica Paquette	2018-06-28	1	-4/+12
\| \| \| \| \| \| \| \| \| \| \| \| \|	To enable the MachineOutliner by default on AArch64, we need to be able to disable the MachineOutliner and also provide an option to "always" enable the outliner. This adds that capability. It allows the user to still use the old -enable-machine-outliner option, which defaults to "always". This is building up to allowing the user to specify "always" versus the target-default outlining behaviour. llvm-svn: 335872
*	Revert "[DAGCombiner] Ensure we use the correct CC result type in visitSDIV"	Haojian Wu	2018-06-28	1	-6/+5
\| \| \| \| \| \| \| \|	This reverts commit r335821. This crashes the webassembly test, run "ninja check-llvm-codegen-webassembly" to reproduce. llvm-svn: 335871
*	[AMDGPU] Early expansion of 32 bit udiv/urem	Stanislav Mekhanoshin	2018-06-28	1	-4/+316
\| \| \| \| \| \| \| \| \| \| \| \|	This allows hoisting of a common code, for instance if denominator is loop invariant. Current change is expansion only, adding licm to the target pass list going to be a separate patch. Given this patch changes to codegen are minor as the expansion is similar to that on DAG. DAG expansion still must remain for R600. Differential Revision: https://reviews.llvm.org/D48586 llvm-svn: 335868
*	[AMDGPU] Overload llvm.amdgcn.fmad.ftz to support f16	Stanislav Mekhanoshin	2018-06-28	1	-5/+9
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D48677 llvm-svn: 335866
*	Add a PhiValuesAnalysis pass to calculate the underlying values of phis	John Brawn	2018-06-28	5	-0/+201
\| \| \| \| \| \| \| \| \| \| \| \|	This pass is being added in order to make the information available to BasicAA, which can't do caching of this information itself, but possibly this information may be useful for other passes. Incorporates code based on Daniel Berlin's implementation of Tarjan's algorithm. Differential Revision: https://reviews.llvm.org/D47893 llvm-svn: 335857
*	Revert "Add support for generating a call graph profile from Branch ↵	Benjamin Kramer	2018-06-28	6	-172/+7
\| \| \| \| \| \| \| \|	Frequency Info." This reverts commits r335794 and r335797. Breaks ThinLTO+FDO selfhost. llvm-svn: 335851
*	[ARM] Parallel DSP Pass	Sjoerd Meijer	2018-06-28	5	-1/+624
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Armv6 introduced instructions to perform 32-bit SIMD operations. The purpose of this pass is to do some straightforward IR pattern matching to create ACLE DSP intrinsics, which map on these 32-bit SIMD operations. Currently, only the SMLAD instruction gets recognised. This instruction performs two multiplications with 16-bit operands, and stores the result in an accumulator. We will follow this up with patches to recognise SMLAD in more cases, and also to generate other DSP instructions (like e.g. SADD16). Patch by: Sam Parker and Sjoerd Meijer Differential Revision: https://reviews.llvm.org/D48128 llvm-svn: 335850
*	Comment change to verify commit rights. NFC.	Jesper Antonsson	2018-06-28	1	-1/+1
\| \| \| \| \| \| \| \| \| \|	Summary: Just a silly one-character correction. Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D48709 llvm-svn: 335832
*	s/TablesChecked/TableChecked/ after r335823	Hans Wennborg	2018-06-28	2	-2/+2
\| \| \| \|	llvm-svn: 335831
*	AMDGPU: Remove MFI::ABIArgOffset	Matt Arsenault	2018-06-28	7	-33/+22
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	We have too many mechanisms for tracking the various offsets used for kernel arguments, so remove one. There's still a lot of confusion with these because there are two different "implicit" argument areas located at the beginning and end of the kernarg segment. Additionally, the offset was determined based on the memory size of the split element types. This would break in a future commit where v3i32 is decomposed into separate i32 pieces. llvm-svn: 335830
*	AMDGPU: Error on calls from graphics shaders	Matt Arsenault	2018-06-28	1	-0/+7
\| \| \| \| \| \| \| \|	In principle nothing should stop these from working, but work is necessary to create an ABI for dealing with the stack related registers. llvm-svn: 335829
*	AMDGPU: Fix AMDGPUCodeGenPrepare using uninitialized AMDGPUAS struct	Matt Arsenault	2018-06-28	1	-1/+2
\| \| \| \| \| \|	Not sure how this wasn't noticed before. llvm-svn: 335828
*	AMDGPU: Fix assert on aggregate type kernel arguments	Matt Arsenault	2018-06-28	1	-2/+4
\| \| \| \| \| \| \| \| \| \|	Just fix the crash for now by not doing the optimization since figuring out how to properly convert the bits for an arbitrary struct is a pain. Also fix a crash when there is only an empty struct argument. llvm-svn: 335827
*	Unify sorted asserts to use the existing atomic pattern	Benjamin Kramer	2018-06-28	3	-9/+10
\| \| \| \| \| \| \|	These are all benign races and only visible in !NDEBUG. tsan complains about it, but a simple atomic bool is sufficient to make it happy. llvm-svn: 335823
*	[DAGCombiner] Ensure we use the correct CC result type in visitSDIV	Simon Pilgrim	2018-06-28	1	-5/+6
\| \| \| \| \| \| \| \|	We could get away with it for constant folded cases, but not for rL335719. Thanks to Krzysztof Parzyszek for noticing. llvm-svn: 335821
*	[SCCP] Mark CFG as preserved.	Florian Hahn	2018-06-28	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \|	SCCP does not change the CFG, so we can mark it as preserved. Reviewers: dberlin, efriedma, davide Reviewed By: davide Differential Revision: https://reviews.llvm.org/D47149 llvm-svn: 335820
*	[DAGCombiner] Remove unused variable. NFCI.	Simon Pilgrim	2018-06-28	1	-2/+0
\| \| \| \| \| \|	Noticed in D45806 review. llvm-svn: 335817
*	[IndVarSimplify] Ignore unreachable users of truncs	Max Kazantsev	2018-06-28	1	-0/+4
\| \| \| \| \| \| \|	If a trunc has a user in a block which is not reachable from entry, we can safely perform trunc elimination as if this user didn't exist. llvm-svn: 335816
*	[DwarfDebug] Remove unused argument (NFC)	Petar Jovanovic	2018-06-28	1	-3/+2
\| \| \| \| \| \| \| \| \| \|	Remove unused ByteStreamer argument from function emitDebugLocValue. Patch by Nikola Prica. Differential Revision: https://reviews.llvm.org/D48590 llvm-svn: 335811
*	[X86] Use PatFrag with hardcoded numbers for FROUND_NO_EXC/FROUND_CURRENT ↵	Craig Topper	2018-06-28	1	-4/+2
\| \| \| \| \| \| \| \| \| \|	instead of ImmLeafs with predicates where one of the two numbers was hardcoded. This more efficient for the isel table generator since we can use CheckChildInteger instead of MoveChild, CheckPredicate, MoveParent. This reduced the table size by 1-2K. I wish there was a way to share the values with X86BaseInfo.h and still use a PatFrag like this. These numbers are fixed by the X86 intrinsic spec going back many years and we should never need to change them. So we shouldn't waste table bytes to support sharing. llvm-svn: 335806
*	[X86] Change how we prefer shift by immediate over folding a load into a shift.	Craig Topper	2018-06-28	3	-57/+68
\| \| \| \| \| \| \| \| \| \| \| \|	BMI2 added new shift by register instructions that have the ability to fold a load. Normally without doing anything special isel would prefer folding a load over folding an immediate because the load folding pattern has higher "complexity". This would require an instruction to move the immediate into a register. We would rather fold the immediate instead and have a separate instruction for the load. We used to enforce this priority by artificially lowering the complexity of the load pattern. This patch changes this to instead reject the load fold in isProfitableToFoldLoad if there is an immediate. This is more consistent with other binops and feels less hacky. llvm-svn: 335804
*	[CGProfile] Fix unused variable warning.	Michael J. Spencer	2018-06-28	1	-1/+1
\| \| \| \|	llvm-svn: 335797
*	Add support for generating a call graph profile from Branch Frequency Info.	Michael J. Spencer	2018-06-27	6	-7/+172
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	=== Generating the CG Profile === The CGProfile module pass simply gets the block profile count for each BB and scans for call instructions. For each call instruction it adds an edge from the current function to the called function with the current BB block profile count as the weight. After scanning all the functions, it generates an appending module flag containing the data. The format looks like: ``` !llvm.module.flags = !{!0} !0 = !{i32 5, !"CG Profile", !1} !1 = !{!2, !3, !4} ; List of edges !2 = !{void ()* @a, void ()* @b, i64 32} ; Edge from a to b with a weight of 32 !3 = !{void (i1)* @freq, void ()* @a, i64 11} !4 = !{void (i1)* @freq, void ()* @b, i64 20} ``` Differential Revision: https://reviews.llvm.org/D48105 llvm-svn: 335794
*	Move some code from PDBFileBuilder to MSFBuilder.	Zachary Turner	2018-06-27	2	-73/+91
\| \| \| \| \| \| \| \|	The code to emit the pieces of the MSF file were actually in PDBFileBuilder. Move this to MSFBuilder so that we can theoretically emit an MSF without having a PDB file. llvm-svn: 335789
*	[X86] Make folding table checking threadsafe	Benjamin Kramer	2018-06-27	1	-4/+3
\| \| \| \| \| \| \|	This is a benign race, but tsan likes to complain about it. Just make it happy. llvm-svn: 335788
*	[X86] In X86DAGToDAGISel::PreprocessISelDAG, make sure we don't access N ↵	Craig Topper	2018-06-27	1	-0/+1
\| \| \| \| \| \| \| \| \| \|	after we delete it. If we turn X86ISD::AND into ISD::AND, we delete N. But we were continuing onto the next block of code even though N no longer existed. Just happened to notice it. I assume asan didn't notice it because we explicitly unpoison deleted nodes and give them a DELETE_NODE opcode. llvm-svn: 335787
*	[RISCV] Add machine function pass to merge base + offset	Sameer AbuAsal	2018-06-27	5	-208/+296
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: In r333455 we added a peephole to fix the corner cases that result from separating base + offset lowering of global address.The peephole didn't handle some of the cases because it only has a basic block view instead of a function level view. This patch replaces that logic with a machine function pass. In addition to handling the original cases it handles uses of the global address across blocks in function and folding an offset from LW\SW instruction. This pass won't run for OptNone compilation, so there will be a negative impact overall vs the old approach at O0. Reviewers: asb, apazos, mgrang Reviewed By: asb Subscribers: MartinMosbeck, brucehoult, the_o, rogfer01, mgorny, rbar, johnrusso, simoncook, niosHD, kito-cheng, shiva0217, zzheng, llvm-commits, edward-jones Differential Revision: https://reviews.llvm.org/D47857 llvm-svn: 335786
*	[DAGCombine] Disable TokenFactor simplifications when optnone.	Nirav Dave	2018-06-27	1	-0/+4
\| \| \| \|	llvm-svn: 335773
*	[X86] Fix unmatched parenthesis in r335768	Fangrui Song	2018-06-27	1	-1/+1
\| \| \| \|	llvm-svn: 335769
*	[X86] Teach the disassembler to use %eiz/%riz instead of NoRegister when the ↵	Craig Topper	2018-06-27	1	-5/+20
\| \| \| \| \| \| \| \| \| \|	SIB byte is present, but doesn't encode an index register and there was another shorter encoding that would achieve the same result. The %eiz/%riz are dummy registers that force the encoder to emit a SIB byte when it normally wouldn't. By emitting them in the disassembly output we ensure that assembling the disassembler output would also produce a SIB byte. This should match the behavior of objdump from binutils. llvm-svn: 335768
*	[globalisel][legalizer] Add AtomicOrdering to LegalityQuery and use it in ↵	Daniel Sanders	2018-06-27	3	-8/+18
\| \| \| \| \| \| \| \| \| \| \| \| \|	AArch64 Now that we have the ability to legalize based on MMO's. Add support for legalizing based on AtomicOrdering and use it to correct the legalization of the atomic instructions. Also extend all() to be a variadic template as this ruleset now requires 3 and 4 argument versions. llvm-svn: 335767