bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[X86] Allow merging of immediates within a basic block for code size savings	Michael Kuperstein	2015-08-11	3	-7/+117
\| \| \| \| \| \| \| \| \| \| \|	First step in preventing immediates that occur more than once within a single basic block from being pulled into their users, in order to prevent unnecessary large instruction encoding .Currently enabled only when optimizing for size. Patch by: zia.ansari@intel.com Differential Revision: http://reviews.llvm.org/D11363 llvm-svn: 244601
*	[AArch64] Match fminnum/fmaxnum for vector fminnm/fmaxnm instead of an ↵	James Molloy	2015-08-11	2	-8/+17
\| \| \| \| \| \| \| \| \| \| \|	intrinsic. Lower Intrinsic::aarch64_neon_fmin/fmax to fminnum/fmannum and match that instead. Minimal functional change: - Extra tests added because coverage of scalar fminnm/fmaxnm instructions was nonexistant. - f16 test updated because now we actually generate scalar fminnm/fmaxnm we no longer need to bail out to a libcall! llvm-svn: 244595
*	[AArch64] Replace the custom AArch64ISD::FMIN/MAX nodes with ISD::FMINNAN/MAXNAN	James Molloy	2015-08-11	3	-19/+15
\| \| \| \| \| \|	NFCI. This just removes custom ISDNodes that are no longer needed. llvm-svn: 244594
*	[ARM] Match fminnan/fmaxnan for vector vmin/vmax instead of an intrinsic	James Molloy	2015-08-11	2	-4/+20
\| \| \| \| \| \| \| \|	Lower Intrinsic::arm_neon_vmins/vmaxs to fminnan/fmaxnan and match that instead. This is important because SDAG will soon be able to select FMINNAN itself, so we need a unified lowering path for intrinsics and SDAG. NFCI. llvm-svn: 244593
*	[ARM] Match fminnum/fmaxnum for vector vminnm/vmaxnm instead of an intrinsic	James Molloy	2015-08-11	2	-4/+16
\| \| \| \| \| \| \| \|	Lower the intrinsic to a FMINNUM/FMAXNUM node and select that instead. This is important because soon SDAG will be able to select FMINNUM/FMAXNUM itself, so we need an integrated lowering path between SDAG and intrinsics. NFCI. llvm-svn: 244592
*	[ARM] Replace ARMISD::VMINNM/VMAXNM with ISD::FMINNUM/FMAXNUM	James Molloy	2015-08-11	4	-18/+10
\| \| \| \| \| \|	NFCI. This replaces another custom ISDNode with a generic equivalent. llvm-svn: 244591
*	[ARM] Replace ARMISD::FMIN/FMAX with the shiny new ISD::FMINNAN/FMAXNAN.	James Molloy	2015-08-11	3	-13/+12
\| \| \| \| \| \|	NFCI. This removes a custom ISDNode. llvm-svn: 244590
*	[X86] Add SAL mnemonics for Intel syntax	Marina Yatsina	2015-08-11	1	-0/+1
\| \| \| \| \| \| \| \|	SAL and SHL instructions perform the same operation Differential Revision: http://reviews.llvm.org/D11882 llvm-svn: 244588
*	[X86] Fix REPE, REPZ, REPNZ for intel syntax	Marina Yatsina	2015-08-11	1	-3/+3
\| \| \| \| \| \| \| \| \|	REPE, REPZ, REPNZ, REPNE should have mnemonics for Intel syntax as well. Currently using these instructions causes compilation errors for Intel syntax. Differential Revision: http://reviews.llvm.org/D11794 llvm-svn: 244584
*	[X86] Fix imul alias for intel syntax	Marina Yatsina	2015-08-11	1	-6/+6
\| \| \| \| \| \| \| \| \|	The "imul reg, imm" alias is not defined for intel syntax. In intel syntax there is no w/l/q suffix for the imul instruction. Differential Revision: http://reviews.llvm.org/D11887 llvm-svn: 244582
*	Add new ISD nodes: ISD::FMINNAN and ISD::FMAXNAN	James Molloy	2015-08-11	4	-0/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The intention of these is to be a corollary to ISD::FMINNUM/FMAXNUM, differing only on how NaNs are treated. FMINNUM returns the non-NaN input (when given one NaN and one non-NaN), FMINNAN returns the NaN input instead. This patch includes support for scalarizing, widening and splitting vectors, but not expansion or softening. The reason is that these should never be needed - FMINNAN nodes are only going to be created in one place (SDAGBuilder::visitSelect) and there we'll check if the node is legal or custom. I could preemptively add expand and soften code, but I'm fairly opposed to adding code I can't test. It's bad enough I can't create tests with this patch, but at least this code will be exercised by the ARM and AArch64 backends fairly shortly. llvm-svn: 244581
*	Add support for floating-point minnum and maxnum	James Molloy	2015-08-11	5	-45/+170
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The select pattern recognition in ValueTracking (as used by InstCombine and SelectionDAGBuilder) only knew about integer patterns. This teaches it about minimum and maximum operations. matchSelectPattern() has been extended to return a struct containing the existing Flavor and a new enum defining the pattern's behavior when given one NaN operand. C minnum() is defined to return the non-NaN operand in this case, but the idiomatic C "a < b ? a : b" would return the NaN operand. ARM and AArch64 at least have different instructions for these different cases. llvm-svn: 244580
*	[mips] Remap move as or.	Vasileios Kalintiris	2015-08-11	8	-10/+23
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch remaps the assembly idiom 'move' to 'or' instead of 'daddu' or 'addu'. The use of addu/daddu instead of or as move was highlighted as a performance issue during the analysis of a recent 64bit design. Originally move was encoded as 'or' by binutils but was changed for the r10k cpu family due to their pipeline which had 2 arithmetic units and a single logical unit, and so could issue multiple (d)addu based moves at the same time but only 1 logical move. This patch preserves the disassembly behaviour so that disassembling a old style (d)addu move still appears as move, but assembling move always gives an or Patch by Simon Dardis. Reviewers: vkalintiris Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11796 llvm-svn: 244579
*	[X86] When optimizing for minsize, use POP for small post-call stack clean-up	Michael Kuperstein	2015-08-11	2	-1/+73
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When optimizing for size, replace "addl $4, %esp" and "addl $8, %esp" following a call by one or two pops, respectively. We don't try to do it in general, but only when the stack adjustment immediately follows a call - which is the most common case. That allows taking a short-cut when trying to find a free register to pop into, instead of a full-blown liveness check. If the adjustment immediately follows a call, then every register the call clobbers but doesn't define should be dead at that point, and can be used. Differential Revision: http://reviews.llvm.org/D11749 llvm-svn: 244578
*	Allow PeepholeOptimizer to fold a few more cases	Michael Kuperstein	2015-08-11	1	-5/+4
\| \| \| \| \| \| \| \| \| \|	The condition for clearing the folding candidate list was clamped together with the "uninteresting instruction" condition. This is too conservative, e.g. we don't need to clear the list when encountering an IMPLICIT_DEF. Differential Revision: http://reviews.llvm.org/D11591 llvm-svn: 244577
*	[GMR] Be a bit smarter about which globals don't alias when doing recursive ↵	Michael Kuperstein	2015-08-11	1	-7/+23
\| \| \| \| \| \| \| \| \| \|	lookups Should hopefully fix the remainder of PR24288. Differential Revision: http://reviews.llvm.org/D11900 llvm-svn: 244575
*	[RuntimeDyld][AArch64] Add explicit addends before calling relocationValueRef.	Lang Hames	2015-08-11	1	-5/+4
\| \| \| \| \| \|	relocationValueRef uses the addend, so it has to be set before the call. llvm-svn: 244574
*	Fix unused variable 'X' in release builds.	Nick Lewycky	2015-08-11	1	-0/+2
\| \| \| \|	llvm-svn: 244571
*	WebAssembly: NFC fix release build break, unused variable.	JF Bastien	2015-08-11	1	-0/+1
\| \| \| \| \| \| \| \| \| \|	Summary: Caused by D11914, pointed out by blaikie. Subscribers: llvm-commits, jfb, dblaikie Differential Revision: http://reviews.llvm.org/D11929 llvm-svn: 244570
*	[IR] Verify EH pad predecessors	David Majnemer	2015-08-11	1	-14/+51
\| \| \| \| \| \| \|	Make sure that an EH pad's predecessors are using their unwind edge to transfer control to the EH pad. llvm-svn: 244563
*	WebAssembly: add basic floating-point tests	JF Bastien	2015-08-11	1	-4/+8
\| \| \| \| \| \| \| \| \| \|	Summary: I somehow forgot to add these when I added the basic floating-point opcodes. Also remove ceil/floor/trunc/nearestint for now, and add them only when properly tested. Subscribers: llvm-commits, sunfish, jfb Differential Revision: http://reviews.llvm.org/D11927 llvm-svn: 244562
*	[libFuzzer] add -only_ascii flag	Kostya Serebryany	2015-08-11	5	-2/+28
\| \| \| \|	llvm-svn: 244559
*	[WinEHPrepare] Add rudimentary support for the new EH instructions	David Majnemer	2015-08-11	2	-9/+374
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This adds somewhat basic preparation functionality including: - Formation of funclets via coloring basic blocks. - Cloning of polychromatic blocks to ensure that funclets have unique program counters. - Demotion of values used between different funclets. - Some amount of cleanup once we have removed predecessors from basic blocks. - Verification that we are left with a CFG that makes some amount of sense. N.B. Arguments and numbering still need to be done. Differential Revision: http://reviews.llvm.org/D11750 llvm-svn: 244558
*	Explicitly clear the MI operand list when getInstruction() is called. Call ↵	Cameron Esfahani	2015-08-11	4	-22/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	MI.clear() within MCD::OPC_Decode case and inside of translateInstruction() for the X86 target. Remove now unnecessary MI.clear() from ARMDisassembler. Summary: Explicitly clear the MI operand list when getInstruction() is called. Reviewers: hfinkel, t.p.northover, hvarga, kparzysz, jyknight, qcolombet, uweigand Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11665 llvm-svn: 244557
*	Print vectorization analysis when loop hint is specified.	Tyler Nowicki	2015-08-11	2	-18/+39
\| \| \| \| \| \|	This patch and a relatec clang patch solve the problem of having to explicitly enable analysis when specifying a loop hint pragma to get the diagnostics. Passing AlwasyPrint as the pass name (see below) causes the front-end to print the diagnostic if the user has specified '-Rpass-analysis' without an '=<target-pass>’. Users of loop hints can pass that compiler option without having to specify the pass and they will get diagnostics for only those loops with loop hints. llvm-svn: 244555
*	Moved LoopVectorizeHints and related functions before ↵	Tyler Nowicki	2015-08-11	1	-270/+270
\| \| \| \| \| \|	LoopVectorizationLegality and LoopVectorizationCostModel. llvm-svn: 244552
*	WebAssembly: simply assert on SNaN and NaNs with payloads	JF Bastien	2015-08-11	1	-4/+5
\| \| \| \| \| \| \| \| \| \|	Summary: convertToHexString doesn't represent them correctly at this point in time. This is a follow-up to sunfish's suggestion in D11914. Subscribers: llvm-commits, sunfish, jfb Differential Revision: http://reviews.llvm.org/D11925 llvm-svn: 244551
*	Simplify processLoop() by moving loop hint verification into ↵	Tyler Nowicki	2015-08-11	1	-26/+35
\| \| \| \| \| \|	Hints::allowVectorization(). llvm-svn: 244550
*	MIR Serialization: Serialize UsedPhysRegMask from the machine register info.	Alex Lorenz	2015-08-11	2	-0/+46
\| \| \| \| \| \| \| \| \| \| \| \|	This commit serializes the UsedPhysRegMask register mask from the machine register information class. The mask is serialized as an inverted 'calleeSavedRegisters' mask to keep the output minimal. This commit also allows the MIR parser to infer this mask from the register mask operands if the machine function doesn't specify it. Reviewers: Duncan P. N. Exon Smith llvm-svn: 244548
*	use range-based for loops; NFCI	Sanjay Patel	2015-08-11	1	-8/+2
\| \| \| \|	llvm-svn: 244545
*	[libFuzzer] don't crash if the condition in a switch has unusual type (e.g. i72)	Kostya Serebryany	2015-08-11	1	-0/+3
\| \| \| \|	llvm-svn: 244544
*	[LAA] Change name from addRuntimeCheck to addRuntimeChecks, NFC	Adam Nemet	2015-08-11	3	-6/+6
\| \| \| \| \| \|	This was requested by Hal in D11205. llvm-svn: 244540
*	MIR Parser: Report an error when a stack object is redefined.	Alex Lorenz	2015-08-10	1	-2/+5
\| \| \| \|	llvm-svn: 244536
*	Add lduw and lwua aliases for SPARCv9.	Joerg Sonnenberger	2015-08-10	1	-0/+3
\| \| \| \|	llvm-svn: 244535
*	MIR Parser: Report an error when a fixed stack object is redefined.	Alex Lorenz	2015-08-10	1	-2/+6
\| \| \| \|	llvm-svn: 244534
*	Load/store for float registers from/to alternate space.	Joerg Sonnenberger	2015-08-10	1	-6/+6
\| \| \| \|	llvm-svn: 244532
*	use range-based for loop; NFCI	Sanjay Patel	2015-08-10	1	-5/+5
\| \| \| \|	llvm-svn: 244531
*	MIR Serialization: Serialize the liveout register mask machine operands.	Alex Lorenz	2015-08-10	4	-0/+47
\| \| \| \|	llvm-svn: 244529
*	fix minsize detection: minsize attribute implies optimizing for size	Sanjay Patel	2015-08-10	1	-3/+1
\| \| \| \|	llvm-svn: 244528
*	[LoopVer] Remove unused pointer partition argument, NFC.	Adam Nemet	2015-08-10	1	-2/+1
\| \| \| \|	llvm-svn: 244527
*	Extend late diagnostics to include late test for runtime pointer checks.	Tyler Nowicki	2015-08-10	2	-14/+38
\| \| \| \| \| \|	This patch moves checking the threshold of runtime pointer checks to the vectorization requirements (late diagnostics) and emits a diagnostic that infroms the user the loop would be vectorized if not for exceeding the pointer-check threshold. Clang will also append the options that can be used to allow vectorization. llvm-svn: 244523
*	WebAssembly: print immediates	JF Bastien	2015-08-10	3	-20/+42
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: For now output using C99's hexadecimal floating-point representation. This patch also cleans up how machine operands are printed: instead of special-casing per type of machine instruction, the code now handles operands generically. Reviewers: sunfish Subscribers: llvm-commits, jfb Differential Revision: http://reviews.llvm.org/D11914 llvm-svn: 244520
*	Add support for the signx instrution alias of SPARCv9.	Joerg Sonnenberger	2015-08-10	1	-0/+5
\| \| \| \|	llvm-svn: 244519
*	NFC. Fix some format issues in lib/CodeGen/MachineBasicBlock.cpp.	Cong Hou	2015-08-10	1	-11/+13
\| \| \| \|	llvm-svn: 244518
*	MachineVerifier: Handle the optional def operand in a PATCHPOINT instruction.	Alex Lorenz	2015-08-10	1	-1/+4
\| \| \| \| \| \| \| \| \| \| \| \|	The PATCHPOINT instructions have a single optional defined register operand, but the machine verifier can't verify the optional defined register operands. This commit makes sure that the machine verifier won't report an error when a PATCHPOINT instruction doesn't have its optional defined register operand. This change will allow us to enable the machine verifier for the code generation tests for the patchpoint intrinsics. Reviewers: Juergen Ributzka llvm-svn: 244513
*	remove function names from comments; NFC	Sanjay Patel	2015-08-10	1	-22/+20
\| \| \| \|	llvm-svn: 244509
*	StackMap: FastISel: Add an appropriate number of immediate operands to the	Alex Lorenz	2015-08-10	1	-3/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	frame setup instruction. This commit ensures that the stack map lowering code in FastISel adds an appropriate number of immediate operands to the frame setup instruction. The previous code added just one immediate operand, which was fine for a target like AArch64, but on X86 the ADJCALLSTACKDOWN64 instruction needs two explicit operands. This caused the machine verifier to report an error when the old code added just one. Reviewers: Juergen Ributzka Differential Revision: http://reviews.llvm.org/D11853 llvm-svn: 244508
*	x86: Emit LAHF/SAHF instead of PUSHF/POPF	JF Bastien	2015-08-10	2	-27/+52
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	NaCl's sandbox doesn't allow PUSHF/POPF out of security concerns (priviledged emulators have forgotten to mask system bits in the past, and EFLAGS's DF bit is a constant source of hilarity). Commit r220529 fixed PR20376 by saving cmpxchg's flags result using EFLAGS, this commit now generated LAHF/SAHF instead, for all of x86 (not just NaCl) because it leads to an overall performance gain over PUSHF/POPF. As with the previous patch this code generation is pretty bad because it occurs very later, after register allocation, and in many cases it rematerializes flags which were already available (e.g. already in a register through SETE). Fortunately it's somewhat rare that this code needs to fire. I did [[ https://github.com/jfbastien/benchmark-x86-flags \| a bit of benchmarking ]], the results on an Intel Haswell E5-2690 CPU at 2.9GHz are: \| Time per call (ms) \| Runtime (ms) \| Benchmark \| \| 0.000012514 \| 6257 \| sete.i386 \| \| 0.000012810 \| 6405 \| sete.i386-fast \| \| 0.000010456 \| 5228 \| sete.x86-64 \| \| 0.000010496 \| 5248 \| sete.x86-64-fast \| \| 0.000012906 \| 6453 \| lahf-sahf.i386 \| \| 0.000013236 \| 6618 \| lahf-sahf.i386-fast \| \| 0.000010580 \| 5290 \| lahf-sahf.x86-64 \| \| 0.000010304 \| 5152 \| lahf-sahf.x86-64-fast \| \| 0.000028056 \| 14028 \| pushf-popf.i386 \| \| 0.000027160 \| 13580 \| pushf-popf.i386-fast \| \| 0.000023810 \| 11905 \| pushf-popf.x86-64 \| \| 0.000026468 \| 13234 \| pushf-popf.x86-64-fast \| Clearly `PUSHF`/`POPF` are suboptimal. It doesn't really seems to be worth teaching LLVM about individual flags, at least not for this purpose. Reviewers: rnk, jvoung, t.p.northover Subscribers: llvm-commits Differential revision: http://reviews.llvm.org/D6629 llvm-svn: 244503
*	fix minsize detection: minsize attribute implies optimizing for size	Sanjay Patel	2015-08-10	1	-5/+2
\| \| \| \|	llvm-svn: 244499
*	[InstCombine] Move SSE2/AVX2 arithmetic vector shift folding to instcombiner	Simon Pilgrim	2015-08-10	2	-51/+31
\| \| \| \| \| \| \| \|	As discussed in D11760, this patch moves the (V)PSRA(WD) arithmetic shift-by-constant folding to InstCombine to match the logical shift implementations. Differential Revision: http://reviews.llvm.org/D11886 llvm-svn: 244495