bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[X86][SSE] Lower 128-bit MOVDDUP with existing VBROADCAST mechanisms	Simon Pilgrim	2016-03-02	1	-39/+51
\| \| \| \| \| \| \| \| \| \| \| \|	We have a number of useful lowering strategies for VBROADCAST instructions (both from memory and register element 0) which the 128-bit form of the MOVDDUP instruction can make use of. This patch tweaks lowerVectorShuffleAsBroadcast to enable it to broadcast 2f64 args using MOVDDUP as well. It does require a slight tweak to the lowerVectorShuffleAsBroadcast mechanism as the existing MOVDDUP lowering uses isShuffleEquivalent which can match binary shuffles that can lower to (unary) broadcasts. Differential Revision: http://reviews.llvm.org/D17680 llvm-svn: 262478
*	Revert "[AMDGPU] table-driven parser/printer for amd_kernel_code_t structure ↵	Nikolay Haustov	2016-03-02	4	-370/+0
\| \| \| \| \| \| \| \|	fields" Build failure with clang. llvm-svn: 262477
*	Revert "[AMDGPU] Using table-driven amd_kernel_code_t field parser in ↵	Nikolay Haustov	2016-03-02	2	-8/+157
\| \| \| \| \| \| \| \|	assembler." Build failure with clang. llvm-svn: 262475
*	[AMDGPU] Using table-driven amd_kernel_code_t field parser in assembler.	Nikolay Haustov	2016-03-02	2	-157/+8
\| \| \| \| \| \| \| \| \| \|	complementary patch to table-driven amd_kernel_code_t field parser/printer utility. lit tests passed. Patch by: Valery Pykhtin Differential Revision: http://reviews.llvm.org/D17151 llvm-svn: 262474
*	[AMDGPU] table-driven parser/printer for amd_kernel_code_t structure fields	Nikolay Haustov	2016-03-02	4	-0/+370
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is going to be used in .hsatext disassembler and can be used in current assembler parser (lit tests passed on parsing). Code using this helpers isn't included in this patch. Benefits: unified approach fast field name lookup on parsing Later I would like to enhance some of the field naming/syntax using this code. Patch by: Valery Pykhtin Differential Revision: http://reviews.llvm.org/D17150 llvm-svn: 262473
*	libfuzzer: fix compiler warnings	Dmitry Vyukov	2016-03-02	2	-6/+12
\| \| \| \| \| \| \| \|	- unused sigaction/setitimer result (used in assert) - unchecked fscanf return value - signed/unsigned comparison llvm-svn: 262472
*	[X86] Remove unnecessary call to isReg from emitter's DestMem handling for ↵	Craig Topper	2016-03-02	1	-7/+5
\| \| \| \| \| \|	VEX prefix. The operand is always a register. NFC llvm-svn: 262468
*	[X86] Make X86MCCodeEmitter::DetermineREXPrefix locate operands more like ↵	Craig Topper	2016-03-02	1	-54/+50
\| \| \| \| \| \|	how VEX prefix handling does. llvm-svn: 262467
*	[X86] Permit reading of the FLAGS register without it being previously defined	David Majnemer	2016-03-02	2	-3/+8
\| \| \| \| \| \| \| \| \| \| \|	We modeled the RDFLAGS{32,64} operations as "using" {E,R}FLAGS. While technically correct, this is not be desirable for folks who want to examine aspects of the FLAGS register which are not related to computation like whether or not CPUID is a valid instruction. Differential Revision: http://reviews.llvm.org/D17782 llvm-svn: 262465
*	[X86] Remove assertion I accidentally left in.	Craig Topper	2016-03-02	1	-1/+0
\| \| \| \|	llvm-svn: 262464
*	[X86] Be more structured about how we capture the register number when it is ↵	Craig Topper	2016-03-02	1	-41/+39
\| \| \| \| \| \| \| \| \| \|	encoded in bits 7:4 of the immediate. For some instructions the register is not the last operand and the immediate handling had to detect this and hardcode the index to find it. It also required CurOp to be pointing at the last operand handled in the Form switch whereas for any instruction it would be pointing at the next operand. Now we just capture the value in the Form switch when we know exactly where it is and the CurOp pointer can behave normally. llvm-svn: 262462
*	[SCEV] Minor naming, braces cleanup; NFC	Sanjoy Das	2016-03-02	1	-5/+4
\| \| \| \|	llvm-svn: 262459
*	[X86] Use MCPhysReg and uint16_t for static arrays of registers and opcodes ↵	Craig Topper	2016-03-02	5	-16/+16
\| \| \| \| \| \|	respectively should reduce size tiny bit. NFC llvm-svn: 262458
*	AMDGPU: Fix bug 26659.	Matt Arsenault	2016-03-02	1	-1/+1
\| \| \| \| \| \| \| \|	Fix checking the same instruction twice instead of the second branch that uses vccz. I don't think this matters currently because s_branch_vccnz is always used currently. llvm-svn: 262457
*	AMDGPU: Cleanup suggested in bug 23960	Matt Arsenault	2016-03-02	1	-6/+3
\| \| \| \|	llvm-svn: 262456
*	Bug 20810: Use report_fatal_error instead of unreachable	Matt Arsenault	2016-03-02	1	-6/+6
\| \| \| \|	llvm-svn: 262455
*	Add a comment with a rational for the unusual code structure	Sanjoy Das	2016-03-02	1	-0/+3
\| \| \| \|	llvm-svn: 262454
*	Qualify getRangeForAffineAR with this-> for MSVC	Sanjoy Das	2016-03-02	1	-2/+2
\| \| \| \|	llvm-svn: 262453
*	Attempt to fix ASAN failure in a MemorySSA test.	George Burgess IV	2016-03-02	1	-4/+4
\| \| \| \|	llvm-svn: 262452
*	Perturb code in an attempt to appease MSVC	Sanjoy Das	2016-03-02	1	-9/+9
\| \| \| \| \| \| \| \|	For some reason MSVC seems to think I'm calling getConstant() from a static context. Try to avoid this issue by explicitly specifying 'this->' (though I'm not confident that this will actually work). llvm-svn: 262451
*	More code permutation to appease MSVC	Sanjoy Das	2016-03-02	1	-4/+7
\| \| \| \|	llvm-svn: 262449
*	Remove "auto" to appease the MSVC bots	Sanjoy Das	2016-03-02	1	-2/+2
\| \| \| \|	llvm-svn: 262448
*	DAGCombiner: Make sure an integer is being truncated	Matt Arsenault	2016-03-02	1	-1/+1
\| \| \| \|	llvm-svn: 262446
*	revert r262424 because there's a clang test for AArch64 that checks -O3 ↵	Sanjay Patel	2016-03-02	1	-17/+5
\| \| \| \| \| \| \| \|	asm output that is broken by this change llvm-svn: 262440
*	[SCEV] Make getRange smarter around selects	Sanjoy Das	2016-03-02	1	-0/+83
\| \| \| \| \| \| \| \| \| \| \| \|	Have ScalarEvolution::getRange re-consider cases like "{C?A:B,+,C?P:Q}" by factoring out "C" and computing RangeOf{A,+,P} union RangeOf({B,+,Q}) instead. The latter can be easier to compute precisely in cases like "{C?0:N,+,C?1:-1}" N is the backedge taken count of the loop; since in such cases the latter form simplifies to [0,N+1) union [0,N+1). llvm-svn: 262438
*	[SCEV] Extract out a getRangeForAffineAR; NFC	Sanjoy Das	2016-03-02	1	-57/+71
\| \| \| \| \| \|	Pure code-motion change. Will be used later in making getRange more clever. llvm-svn: 262437
*	[InstCombine] convert 'isPositive' and 'isNegative' vector comparisons to ↵	Sanjay Patel	2016-03-01	1	-5/+17
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	shifts (PR26701) As noted in the code comment, I don't think we can do the same transform that we do for scalar integers comparisons to vector integers comparisons because it might pessimize the general case. Exhibit A for an incomplete integer comparison ISA remains x86 SSE/AVX: it only has EQ and GT for integer vectors. But we should now recognize all the variants of this construct and produce the optimal code for the cases shown in: https://llvm.org/bugs/show_bug.cgi?id=26701 llvm-svn: 262424
*	Perform InstructioinCombiningPass before SampleProfile pass.	Dehao Chen	2016-03-01	2	-21/+4
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: SampleProfile pass needs to be performed after InstructionCombiningPass, which helps eliminate un-inlinable function calls. Reviewers: davidxl, dnovillo Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D17742 llvm-svn: 262419
*	[libFuzzer] deprecate exit_on_first flag	Kostya Serebryany	2016-03-01	4	-12/+10
\| \| \| \|	llvm-svn: 262417
*	[libFuzzer] add generic signal handlers so that libFuzzer can report at ↵	Kostya Serebryany	2016-03-01	7	-21/+94
\| \| \| \| \| \|	least something if ASan is not handlig the signals for us. Remove abort_on_timeout flag. llvm-svn: 262415
*	[NFC] Convert tabs to spaces.	Colin LeMahieu	2016-03-01	1	-2/+2
\| \| \| \|	llvm-svn: 262411
*	AArch64: Reenable CompleteModel for A53, A57 and Kryo models	Matthias Braun	2016-03-01	3	-3/+3
\| \| \| \| \| \|	The fixes in r262393 completed them as well. llvm-svn: 262408
*	[Hexagon] Modifying r262258 to only be in effect in the hand assembler path, ↵	Colin LeMahieu	2016-03-01	2	-14/+18
\| \| \| \| \| \|	not the integrated assembler. llvm-svn: 262400
*	DAGCombiner: Turn truncate of a bitcasted vector to an extract	Matt Arsenault	2016-03-01	1	-0/+16
\| \| \| \| \| \| \| \| \| \| \|	On AMDGPU where operations i64 operations are often bitcasted to v2i32 and back, this pattern shows up regularly where it breaks some expected combines on i64, such as load width reducing. This fixes some test failures in a future commit when i64 loads are changed to promote. llvm-svn: 262397
*	Add LLVMBuild for ObjectYAML.	Rafael Espindola	2016-03-01	2	-0/+15
\| \| \| \| \| \|	Should fix the DBUILD_SHARED_LIBS bots. llvm-svn: 262396
*	[lanai] Add ELF enum value and relocations.	Jacques Pienaar	2016-03-01	2	-0/+12
\| \| \| \| \| \| \| \| \| \|	Add ELF enum value and relocations for Lanai backed. General Lanai backend discussion on llvm-dev thread "[RFC] Lanai backend" (http://lists.llvm.org/pipermail/llvm-dev/2016-February/095118.html). Differential Revision: http://reviews.llvm.org/D17008 llvm-svn: 262394
*	AArch64: Add missing schedinfo, check completeness for cyclone	Matthias Braun	2016-03-01	8	-13/+41
\| \| \| \| \| \| \| \| \|	This adds some missing generic schedule info definitions, enables completeness checking for cyclone and fixes a typo uncovered by that. Differential Revision: http://reviews.llvm.org/D17748 llvm-svn: 262393
*	[Power9] Implement new vector compare, extract, insert instructions	Kit Barton	2016-03-01	2	-0/+96
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This change implements the following vector operations: - Vector Compare Not Equal - vcmpneb(.) vcmpneh(.) vcmpnew(.) - vcmpnezb(.) vcmpnezh(.) vcmpnezw(.) - Vector Extract Unsigned - vextractub vextractuh vextractuw vextractd - vextublx vextubrx vextuhlx vextuhrx vextuwlx vextuwrx - Vector Insert - vinsertb vinserth vinsertw vinsertd 26 instructions. Phabricator: http://reviews.llvm.org/D15916 llvm-svn: 262392
*	[x86] use getBitcast()	Sanjay Patel	2016-03-01	1	-20/+20
\| \| \| \| \| \| \| \|	This isn't quite NFC because some of the SDLocs may change which could cause scheduling differences. But no regression tests are affected and there is no functional change intended. llvm-svn: 262391
*	Fix some warnings a bit harder/different	David Blaikie	2016-03-01	2	-3/+1
\| \| \| \| \| \| \|	This is an alternate fix to 262378 and a fix to a pessimizing-move warning. llvm-svn: 262390
*	Revert "[AArch64] Fix isLegalAddImmediate() to return true for valid ↵	Geoff Berry	2016-03-01	1	-2/+2
\| \| \| \| \| \| \| \| \| \|	negative values." Revert r262248 in an attempt to fix the clang-native-aarch64-full bot and to investigate a performance regression in SingleSource/Benchmarks/CoyoteBench/huffbench llvm-svn: 262388
*	Revert "[mips] Promote the result of SETCC nodes to GPR width."	Vasileios Kalintiris	2016-03-01	18	-566/+424
\| \| \| \| \| \| \| \| \|	This reverts commit r262316. It seems that my change breaks an out-of-tree chromium buildbot, so I'm reverting this in order to investigate the situation further. llvm-svn: 262387
*	New file to track implementation status of new POWER9 instructions	Kit Barton	2016-03-01	1	-0/+442
\| \| \| \|	llvm-svn: 262386
*	TableGen: Check scheduling models for completeness	Matthias Braun	2016-03-01	19	-3/+31
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	TableGen checks at compiletime that for scheduling models with "CompleteModel = 1" one of the following holds: - Is marked with the hasNoSchedulingInfo flag - The instruction is a subclass of Sched - There are InstRW definitions in the scheduling model Typical steps necessary to complete a model: - Ensure all pseudo instructions that are expanded before machine scheduling (usually everything handled with EmitYYY() functions in XXXTargetLowering). - If a CPU does not support some instructions mark the corresponding resource unsupported: "WriteRes<WriteXXX, []> { let Unsupported = 1; }". - Add missing scheduling information. Differential Revision: http://reviews.llvm.org/D17747 llvm-svn: 262384
*	[NVPTX] Annotate param loads/stores as mayLoad/mayStore.	Justin Lebar	2016-03-01	2	-56/+68
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Tablegen was unable to determine that param loads/stores were actually reading or writing from memory. I think this isn't a problem in practice for param stores, because those occur in a block right before we make our call. But param loads don't have to at the very beginning of a function, so should be annotated as mayLoad so we don't incorrectly optimize them. Reviewers: jholewinski Subscribers: jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D17471 llvm-svn: 262381
*	[NVPTX] Remove workaround for tablegen crash in NVPTXInstrInfo.td.	Justin Lebar	2016-03-01	1	-28/+7
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: Looks like this was caused by a typo. Reviewers: jholewinski Subscribers: jholewinski, llvm-commits, tra Differential Revision: http://reviews.llvm.org/D17357 llvm-svn: 262380
*	Fix -Wnon-virtual-dtor warnings	Reid Kleckner	2016-03-01	1	-0/+2
\| \| \| \|	llvm-svn: 262378
*	Fix an issue where fast math flags were dropped during scalarization.	Owen Anderson	2016-03-01	1	-2/+4
\| \| \| \| \| \| \|	Most portions of InstCombine properly propagate fast math flags, but apparently the vector scalarization section was overlooked. llvm-svn: 262376
*	[SCEV] Minor cleanup: rename method, C++11'ify; NFC	Sanjoy Das	2016-03-01	1	-4/+3
\| \| \| \|	llvm-svn: 262374
*	[NVPTX] Use different, convergent MIs for convergent calls.	Justin Lebar	2016-03-01	4	-52/+61
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Calls sometimes need to be convergent. This is already handled at the LLVM IR level, but it also needs to be handled at the MI level. Ideally we'd propagate convergence from instructions, down through the selection DAG, and into MIs. But this is Hard, and would affect optimizations in the SDNs -- right now only SDNs with two operands have any flags at all. Instead, here's a much simpler hack: Add new opcodes for NVPTX for convergent calls, and generate these when lowering convergent LLVM calls. Reviewers: jholewinski Subscribers: jholewinski, chandlerc, joker.eph, jhen, tra, llvm-commits Differential Revision: http://reviews.llvm.org/D17423 llvm-svn: 262373