bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	unused variable warning fix.	Simon Pilgrim	2015-08-12	1	-1/+1
\| \| \| \|	llvm-svn: 244725
*	[InstCombine] Move SSE/AVX vector blend folding to instcombiner	Simon Pilgrim	2015-08-12	2	-60/+17
\| \| \| \| \| \| \| \| \| \| \| \|	As discussed in D11886, this patch moves the SSE/AVX vector blend folding to instcombiner from PerformINTRINSIC_WO_CHAINCombine (which allows us to remove this completely). InstCombiner already had partial support for this, I just had to add support for zero (ConstantAggregateZero) masks and also the case where both selection inputs were the same (allowing us to ignore the mask). I also moved all the relevant combine tests into InstCombine/blend_x86.ll Differential Revision: http://reviews.llvm.org/D11934 llvm-svn: 244723
*	X86: hoist a condition into a variable (NFC)	Saleem Abdulrasool	2015-08-12	1	-7/+8
\| \| \| \| \| \| \| \|	The same value is used multiple times through the function. Hoist the condition into a variable. This should fix a silly static analysis warning where the conditions flip around. No functional change intended. llvm-svn: 244713
*	[libFuzzer] add two flags, -tbm_depth and -tbm_width to control how the ↵	Kostya Serebryany	2015-08-12	7	-12/+31
\| \| \| \| \| \|	trace-based-mutations are applied llvm-svn: 244712
*	[libFuzzer] add colons to the stats output to avoid confusion	Kostya Serebryany	2015-08-12	1	-2/+3
\| \| \| \|	llvm-svn: 244708
*	[libFuzzer] use raw C IO to reduce the risk of a deadlock in a signal handler.	Kostya Serebryany	2015-08-12	1	-2/+5
\| \| \| \|	llvm-svn: 244707
*	[x86] enable machine combiner reassociations for 256-bit vector FP mul/add	Sanjay Patel	2015-08-12	1	-0/+4
\| \| \| \|	llvm-svn: 244705
*	PseudoSourceValue: Transform the mips subclass to target independent subclasses	Alex Lorenz	2015-08-11	3	-82/+51
\| \| \| \| \| \| \| \| \| \| \| \|	This commit transforms the mips-specific 'MipsCallEntry' subclass of the 'PseudoSourceValue' class into two, target-independent subclasses named 'GlobalValuePseudoSourceValue' and 'ExternalSymbolPseudoSourceValue'. This change makes it easier to serialize the pseudo source values by removing target-specific pseudo source values. Reviewers: Akira Hatanaka llvm-svn: 244698
*	PseudoSourceValue: Replace global manager with a manager in a machine function.	Alex Lorenz	2015-08-11	48	-532/+558
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This commit removes the global manager variable which is responsible for storing and allocating pseudo source values and instead it introduces a new manager class named 'PseudoSourceValueManager'. Machine functions now own an instance of the pseudo source value manager class. This commit also modifies the 'get...' methods in the 'MachinePointerInfo' class to construct pseudo source values using the instance of the pseudo source value manager object from the machine function. This commit updates calls to the 'get...' methods from the 'MachinePointerInfo' class in a lot of different files because those calls now need to pass in a reference to a machine function to those methods. This change will make it easier to serialize pseudo source values as it will enable me to transform the mips specific MipsCallEntry PseudoSourceValue subclass into two target independent subclasses. Reviewers: Akira Hatanaka llvm-svn: 244693
*	PseudoSourceValue: Introduce a 'PSVKind' enumerator.	Alex Lorenz	2015-08-11	2	-19/+22
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This commit introduces a new enumerator named 'PSVKind' in the 'PseudoSourceValue' class. This enumerator is now used to distinguish between the various kinds of pseudo source values. This change is done in preparation for the changes to the pseudo source value object management and to the PseudoSourceValue's class hierarchy - the next two PseudoSourceValue commits will get rid of the global variable that manages the pseudo source values and the mips specific MipsCallEntry subclass. Reviewers: Akira Hatanaka llvm-svn: 244687
*	PseudoSourceValue: Update comments and fix lowercase variable names. NFC.	Alex Lorenz	2015-08-11	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \|	This commit updates the documentation comments in PseudoSourceValue.cpp and PseudoSourceValue.h based on the LLVM's documentation style. It also fixes several instances of variable names that started with a lowercase letter. This change is done in preparation for the changes to the pseudo source value object management and to the PseudoSourceValue's class hierarchy. llvm-svn: 244686
*	Reformat PseudoSourceValue.cpp and PseudoSourceValue.h. NFC.	Alex Lorenz	2015-08-11	1	-29/+26
\| \| \| \| \| \| \| \| \|	This commit reformats the files lib/CodeGen/PseudoSourceValue.cpp and include/llvm/CodeGen/PseudoSourceValue.h using clang-format. This change is done in preparation for the changes to the pseudo source value object management and to the PseudoSourceValue's class hierarchy. llvm-svn: 244685
*	Use 32-bit divides instead of 64-bit divides where possible.	Mark Heffernan	2015-08-11	1	-0/+4
\| \| \| \| \| \| \| \| \|	For NVPTX, try to use 32-bit division instead of 64-bit division when the dividend and divisor fit in 32 bits. This speeds up some internal benchmarks significantly. The underlying reason is that many index computations are carried out in 64-bits but never actually exceed the capacity of a 32-bit word. llvm-svn: 244684
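The idea in the commit message above can be sketched at source level as follows. This is only an illustration, not the NVPTX change itself: the helper name `divide_narrow_if_possible` is hypothetical, and the check is written as an explicit runtime test on unsigned operands, which is an assumption about how such a bypass could look.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper (not LLVM code): divide two unsigned 64-bit values,
// taking a 32-bit division path when both operands fit in 32 bits.
uint64_t divide_narrow_if_possible(uint64_t dividend, uint64_t divisor) {
    assert(divisor != 0 && "division by zero");
    // If neither operand has any bit set above bit 31, the quotient is the
    // same whether it is computed in 32 or 64 bits.
    if (((dividend | divisor) >> 32) == 0) {
        return static_cast<uint32_t>(dividend) / static_cast<uint32_t>(divisor);
    }
    // Otherwise fall back to the full 64-bit division.
    return dividend / divisor;
}
```

The single OR-and-shift test covers both operands at once, so the extra cost on the slow path is small compared with a 64-bit divide.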
*	Make DW_AT_[MIPS_]linkage_name optional, and off by default for SCE.	Paul Robinson	2015-08-11	3	-1/+21
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Mangled "linkage" names can be huge, and if the debugger (or other tools) have no use for them, the size savings can be very impressive (on the order of 40%). Add one test for controlling behavior, and modify a number of tests to either stop using linkage names, or make llc emit them (so these tests will still run when the default triple is for PS4). Differential Revision: http://reviews.llvm.org/D11374 llvm-svn: 244678
*	Fix PR24354.	Sanjoy Das	2015-08-11	1	-3/+2
\| \| \| \| \| \| \| \| \| \|	`InstCombiner::OptimizeOverflowCheck` was asserting an invariant (operands to binary operations are ordered by decreasing complexity) that wasn't really an invariant. Fix this by instead having `InstCombiner::OptimizeOverflowCheck` establish the invariant if it does not hold. llvm-svn: 244676
*	don't repeat function names in comments; NFC	Sanjay Patel	2015-08-11	1	-39/+34
\| \| \| \|	llvm-svn: 244672
*	fix 80-cols; NFC	Sanjay Patel	2015-08-11	1	-19/+22
\| \| \| \|	llvm-svn: 244668
*	NFC SelectionDAGDumper: fix typo	JF Bastien	2015-08-11	1	-1/+1
\| \| \| \| \| \| \| \|	Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11959 llvm-svn: 244667
*	WebAssembly: implement comparison.	JF Bastien	2015-08-11	4	-25/+36
\| \| \| \| \| \| \| \| \| \| \| \|	Some of the FP comparisons (ueq, one, ult, ule, ugt, uge) are currently broken, I'll fix them in a follow-up. Reviewers: sunfish Subscribers: llvm-commits, jfb Differential Revision: http://reviews.llvm.org/D11924 llvm-svn: 244665
*	[x86] enable machine combiner reassociations for 128-bit vector ↵	Sanjay Patel	2015-08-11	1	-2/+6
\| \| \| \| \| \|	single/double multiplies llvm-svn: 244657
*	[LowerSwitch] Skip dead blocks for processSwitchInst()	Chen Li	2015-08-11	1	-4/+10
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch adds a check for dead blocks and skips them in processSwitchInst(). This will help reduce compilation time. Reviewers: reames, hans Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11953 llvm-svn: 244656
*	WebAssembly: implement WebAssemblyTargetLowering::getTargetNodeName	JF Bastien	2015-08-11	2	-1/+13
\| \| \| \| \| \| \| \| \| \|	Summary: Implementation is the same as in AArch64. Subscribers: aemerson, jfb, llvm-commits, sunfish Differential Revision: http://reviews.llvm.org/D11956 llvm-svn: 244655
*	fix minsize detection: minsize attribute implies optimizing for size	Sanjay Patel	2015-08-11	1	-2/+1
\| \| \| \| \| \|	Also, add a test for optsize because this was not part of any existing regression test. llvm-svn: 244651
*	SelectionDAG: Prefer to combine multiplication with less uses for fma	Jingyue Wu	2015-08-11	1	-0/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: For example: s6 = s0*s5; s2 = s6*s6 + s6; ... s4 = s6*s3; We notice that it is possible for s2 to be folded to fma (s0, s5, fmul (s6 s6)). This only happens when Aggressive is true; otherwise the hasOneUse() check already prevents folding the multiplication with more uses. Test Plan: test/CodeGen/NVPTX/fma-assoc.ll Patch by Xuetian Weng Reviewers: hfinkel, apazos, jingyue, ohsallen, arsenm Subscribers: arsenm, jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D11855 llvm-svn: 244649
*	[LowerSwitch] Fix a bug when LowerSwitch deletes the default block	Chen Li	2015-08-11	1	-5/+10
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: LowerSwitch crashed with the attached test case after deleting the default block. This happened because the current implementation of deleting dead blocks is wrong. After the default block is deleted, it contains no instruction or terminator, and it should not be traversed anymore. However, since the iterator is advanced before the processSwitchInst() function is executed, the block advanced to could be deleted inside processSwitchInst(). The deleted block would then be visited next and crash dyn_cast<SwitchInst>(Cur->getTerminator()) because Cur->getTerminator() returns a nullptr. This patch fixes this problem by recording dead default blocks into a list, and deleting them after all processSwitchInst() calls have been done. It is still possible to visit dead default blocks and waste time processing them. But it is a compile time issue, and I plan to have another patch to add support to skip dead blocks. Reviewers: kariddi, resistor, hans, reames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11852 llvm-svn: 244642
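The fix described above follows a general pattern: record what should be deleted during the traversal and delete only after the traversal is over. A minimal sketch of that pattern, outside of LLVM, might look like this. The `Block` type, its `kills` field and `process_all` are hypothetical stand-ins, not the LowerSwitch code, and the skip check models the follow-up the message says is planned.

```cpp
#include <cstddef>
#include <set>
#include <vector>

// Hypothetical stand-in for a basic block. `kills` is the index of a block
// that becomes dead when this block is processed, or -1 if there is none.
struct Block {
    int id;
    int kills;
};

// Processes every block in order and returns the ids of the survivors.
// Nothing is removed while the loop is running; dead blocks are only
// recorded and are dropped after the traversal has finished.
std::vector<int> process_all(const std::vector<Block>& blocks) {
    std::set<std::size_t> dead;  // indices recorded for deferred deletion
    for (std::size_t i = 0; i < blocks.size(); ++i) {
        if (dead.count(i) != 0) {
            continue;  // already known to be dead: skip instead of processing
        }
        if (blocks[i].kills >= 0) {
            dead.insert(static_cast<std::size_t>(blocks[i].kills));
        }
    }
    // Deferred deletion: build the result without the recorded blocks.
    std::vector<int> survivors;
    for (std::size_t i = 0; i < blocks.size(); ++i) {
        if (dead.count(i) == 0) {
            survivors.push_back(blocks[i].id);
        }
    }
    return survivors;
}
```

Because the container is never modified inside the first loop, the loop index can never end up pointing at a block that has already been destroyed.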
*	Use llvm::make_unique to fix the MSVC build.	Rafael Espindola	2015-08-11	1	-1/+1
\| \| \| \|	llvm-svn: 244641
*	fix minsize detection: minsize attribute implies optimizing for size	Sanjay Patel	2015-08-11	1	-4/+3
\| \| \| \|	llvm-svn: 244631
*	Enable EliminateAvailableExternally pass in the LTO pipeline.	Teresa Johnson	2015-08-11	1	-0/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: For LTO we need to enable this pass in the LTO pipeline, as it is skipped during the "-flto -c" compile step (when PrepareForLTO is set). Reviewers: rnk Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11919 llvm-svn: 244622
*	Variable names should start with an upper case letter; NFC	Sanjay Patel	2015-08-11	1	-9/+9
\| \| \| \|	llvm-svn: 244618
*	fix minsize detection: minsize attribute implies optimizing for size	Sanjay Patel	2015-08-11	1	-7/+7
\| \| \| \|	llvm-svn: 244617
*	[GlobalMerge] Use private linkage for MergedGlobals variables	John Brawn	2015-08-11	1	-25/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Other objects can never reference the MergedGlobals symbol so external linkage is never needed. Using private instead of internal linkage means the object is more similar to what it looks like when global merging is not enabled, with the only difference being that the merged variables are addressed indirectly relative to the start of the section they are in. Also add aliases for merged variables with internal linkage, as this also makes the object be more like what it is when they are not merged. Differential Revision: http://reviews.llvm.org/D11942 llvm-svn: 244615
*	fix code that was accidentally commented out in previous commit	Sanjay Patel	2015-08-11	1	-2/+2
\| \| \| \|	llvm-svn: 244610
*	fix typos in comments; NFC	Sanjay Patel	2015-08-11	1	-5/+5
\| \| \| \|	llvm-svn: 244609
*	fix typo in comment; NFC	Sanjay Patel	2015-08-11	1	-1/+1
\| \| \| \|	llvm-svn: 244607
*	fix minsize detection: minsize attribute implies optimizing for size	Sanjay Patel	2015-08-11	1	-3/+1
\| \| \| \|	llvm-svn: 244604
*	[X86] Allow merging of immediates within a basic block for code size savings	Michael Kuperstein	2015-08-11	3	-7/+117
\| \| \| \| \| \| \| \| \| \| \|	First step in preventing immediates that occur more than once within a single basic block from being pulled into their users, in order to prevent unnecessarily large instruction encoding. Currently enabled only when optimizing for size. Patch by: zia.ansari@intel.com Differential Revision: http://reviews.llvm.org/D11363 llvm-svn: 244601
*	[AArch64] Match fminnum/fmaxnum for vector fminnm/fmaxnm instead of an ↵	James Molloy	2015-08-11	2	-8/+17
\| \| \| \| \| \| \| \| \| \| \|	intrinsic. Lower Intrinsic::aarch64_neon_fmin/fmax to fminnum/fmaxnum and match that instead. Minimal functional change: - Extra tests added because coverage of scalar fminnm/fmaxnm instructions was nonexistent. - f16 test updated because now we actually generate scalar fminnm/fmaxnm we no longer need to bail out to a libcall! llvm-svn: 244595
*	[AArch64] Replace the custom AArch64ISD::FMIN/MAX nodes with ISD::FMINNAN/MAXNAN	James Molloy	2015-08-11	3	-19/+15
\| \| \| \| \| \|	NFCI. This just removes custom ISDNodes that are no longer needed. llvm-svn: 244594
*	[ARM] Match fminnan/fmaxnan for vector vmin/vmax instead of an intrinsic	James Molloy	2015-08-11	2	-4/+20
\| \| \| \| \| \| \| \|	Lower Intrinsic::arm_neon_vmins/vmaxs to fminnan/fmaxnan and match that instead. This is important because SDAG will soon be able to select FMINNAN itself, so we need a unified lowering path for intrinsics and SDAG. NFCI. llvm-svn: 244593
*	[ARM] Match fminnum/fmaxnum for vector vminnm/vmaxnm instead of an intrinsic	James Molloy	2015-08-11	2	-4/+16
\| \| \| \| \| \| \| \|	Lower the intrinsic to a FMINNUM/FMAXNUM node and select that instead. This is important because soon SDAG will be able to select FMINNUM/FMAXNUM itself, so we need an integrated lowering path between SDAG and intrinsics. NFCI. llvm-svn: 244592
*	[ARM] Replace ARMISD::VMINNM/VMAXNM with ISD::FMINNUM/FMAXNUM	James Molloy	2015-08-11	4	-18/+10
\| \| \| \| \| \|	NFCI. This replaces another custom ISDNode with a generic equivalent. llvm-svn: 244591
*	[ARM] Replace ARMISD::FMIN/FMAX with the shiny new ISD::FMINNAN/FMAXNAN.	James Molloy	2015-08-11	3	-13/+12
\| \| \| \| \| \|	NFCI. This removes a custom ISDNode. llvm-svn: 244590
*	[X86] Add SAL mnemonics for Intel syntax	Marina Yatsina	2015-08-11	1	-0/+1
\| \| \| \| \| \| \| \|	SAL and SHL instructions perform the same operation Differential Revision: http://reviews.llvm.org/D11882 llvm-svn: 244588
*	[X86] Fix REPE, REPZ, REPNZ for intel syntax	Marina Yatsina	2015-08-11	1	-3/+3
\| \| \| \| \| \| \| \| \|	REPE, REPZ, REPNZ, REPNE should have mnemonics for Intel syntax as well. Currently using these instructions causes compilation errors for Intel syntax. Differential Revision: http://reviews.llvm.org/D11794 llvm-svn: 244584
*	[X86] Fix imul alias for intel syntax	Marina Yatsina	2015-08-11	1	-6/+6
\| \| \| \| \| \| \| \| \|	The "imul reg, imm" alias is not defined for intel syntax. In intel syntax there is no w/l/q suffix for the imul instruction. Differential Revision: http://reviews.llvm.org/D11887 llvm-svn: 244582
*	Add new ISD nodes: ISD::FMINNAN and ISD::FMAXNAN	James Molloy	2015-08-11	4	-0/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The intention of these is to be a corollary to ISD::FMINNUM/FMAXNUM, differing only on how NaNs are treated. FMINNUM returns the non-NaN input (when given one NaN and one non-NaN), FMINNAN returns the NaN input instead. This patch includes support for scalarizing, widening and splitting vectors, but not expansion or softening. The reason is that these should never be needed - FMINNAN nodes are only going to be created in one place (SDAGBuilder::visitSelect) and there we'll check if the node is legal or custom. I could preemptively add expand and soften code, but I'm fairly opposed to adding code I can't test. It's bad enough I can't create tests with this patch, but at least this code will be exercised by the ARM and AArch64 backends fairly shortly. llvm-svn: 244581
*	Add support for floating-point minnum and maxnum	James Molloy	2015-08-11	5	-45/+170
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The select pattern recognition in ValueTracking (as used by InstCombine and SelectionDAGBuilder) only knew about integer patterns. This teaches it about minimum and maximum operations. matchSelectPattern() has been extended to return a struct containing the existing Flavor and a new enum defining the pattern's behavior when given one NaN operand. C minnum() is defined to return the non-NaN operand in this case, but the idiomatic C "a < b ? a : b" would return the NaN operand. ARM and AArch64 at least have different instructions for these different cases. llvm-svn: 244580
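The NaN distinction drawn above can be observed directly in C++. The sketch below is an illustration only and uses the standard library's `std::fmin` as a stand-in for the minnum behaviour; the function names `select_min` and `num_min` are hypothetical. Note that for the select form the result depends on operand order, because every ordered comparison with NaN is false.

```cpp
#include <cmath>

// The idiomatic select form. If either operand is NaN, `a < b` is false,
// so `b` is returned. With NaN as the second operand, the result is NaN.
double select_min(double a, double b) { return a < b ? a : b; }

// std::fmin returns the non-NaN operand when exactly one operand is NaN,
// regardless of which position the NaN is in.
double num_min(double a, double b) { return std::fmin(a, b); }
```

For example, `select_min(1.0, NAN)` yields NaN while `num_min(1.0, NAN)` yields `1.0`, which is why a pattern matcher has to track the NaN behaviour separately from the min/max flavor.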
*	[mips] Remap move as or.	Vasileios Kalintiris	2015-08-11	8	-10/+23
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch remaps the assembly idiom 'move' to 'or' instead of 'daddu' or 'addu'. The use of addu/daddu instead of or as move was highlighted as a performance issue during the analysis of a recent 64bit design. Originally move was encoded as 'or' by binutils but was changed for the r10k cpu family due to their pipeline which had 2 arithmetic units and a single logical unit, and so could issue multiple (d)addu based moves at the same time but only 1 logical move. This patch preserves the disassembly behaviour so that disassembling an old-style (d)addu move still appears as move, but assembling move always gives an or. Patch by Simon Dardis. Reviewers: vkalintiris Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11796 llvm-svn: 244579
*	[X86] When optimizing for minsize, use POP for small post-call stack clean-up	Michael Kuperstein	2015-08-11	2	-1/+73
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When optimizing for size, replace "addl $4, %esp" and "addl $8, %esp" following a call by one or two pops, respectively. We don't try to do it in general, but only when the stack adjustment immediately follows a call - which is the most common case. That allows taking a short-cut when trying to find a free register to pop into, instead of a full-blown liveness check. If the adjustment immediately follows a call, then every register the call clobbers but doesn't define should be dead at that point, and can be used. Differential Revision: http://reviews.llvm.org/D11749 llvm-svn: 244578
*	Allow PeepholeOptimizer to fold a few more cases	Michael Kuperstein	2015-08-11	1	-5/+4
\| \| \| \| \| \| \| \| \| \|	The condition for clearing the folding candidate list was clamped together with the "uninteresting instruction" condition. This is too conservative, e.g. we don't need to clear the list when encountering an IMPLICIT_DEF. Differential Revision: http://reviews.llvm.org/D11591 llvm-svn: 244577