bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[X86] Use instruction aliases to replace custom asm parser code for ↵	Craig Topper	2016-05-23	3	-51/+53
\| \| \| \| \| \|	optimizing moves to use 2 byte VEX prefix. llvm-svn: 270394
*	Revert "Modify emitTypeInformation to use MemoryTypeTableBuilder"	David Majnemer	2016-05-23	2	-19/+42
\| \| \| \| \| \| \|	This reverts commit r270106. It results in certain function types omitted in the output. llvm-svn: 270389
*	[AVX512] Add patterns to implement stores of extracts of least signficant ↵	Craig Topper	2016-05-22	1	-0/+123
\| \| \| \| \| \| \| \|	subvectors using XMM or YMM stores instead of the vector extract instructions. Similar is already done for AVX and we had lost it going to AVX512VL. llvm-svn: 270383
*	[x86, AVX] don't add a vzeroupper if that's what the code is already doing ↵	Sanjay Patel	2016-05-22	1	-0/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	(PR27823) This isn't the complete fix, but it handles the trivial examples of duplicate vzero* ops in PR27823: https://llvm.org/bugs/show_bug.cgi?id=27823 ...and amusingly, the bogus cases already exist as regression tests, so let's take this baby step. We'll need to do more in the general case where there's legitimate AVX usage in the function + there's already a vzero in the code. Differential Revision: http://reviews.llvm.org/D20477 llvm-svn: 270378
*	reduce indent; NFC	Sanjay Patel	2016-05-22	1	-19/+19
\| \| \| \|	llvm-svn: 270372
*	use 'auto' with 'dyn_cast'; fix formatting; NFC	Sanjay Patel	2016-05-22	1	-10/+8
\| \| \| \|	llvm-svn: 270370
*	[ValueTracking, InstCombine] extend isKnownToBeAPowerOfTwo() to handle ↵	Sanjay Patel	2016-05-22	1	-3/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	vector splat constants We could try harder to handle non-splat vector constants too, but that seems much rarer to me. Note that the div test isn't resolved because there's a check for isIntegerTy() guarding that transform. Differential Revision: http://reviews.llvm.org/D20497 llvm-svn: 270369
*	[AVX512] Implement missing patterns for any_extend load lowering.	Igor Breger	2016-05-22	2	-55/+88
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D20513 llvm-svn: 270357
*	[AVX512] The AVX512 file only need subtract_subvector index 0 patterns where ↵	Craig Topper	2016-05-22	1	-15/+35
\| \| \| \| \| \|	the source is 512-bits. The 256-bit source patterns were redundant with AVX. llvm-svn: 270356
*	[AVX512] Add an AddedComplexity line to the 512-bit insert_subvector undef ↵	Craig Topper	2016-05-22	1	-0/+2
\| \| \| \| \| \|	index 0 patterns. This gives them higher priority than the memory patterns. This matches AVX1/2. llvm-svn: 270355
*	[AVX512] Change the AddedComplexity on some patterns to match their AVX/SSE ↵	Craig Topper	2016-05-22	1	-9/+13
\| \| \| \| \| \|	equivalents. This helps group them close together in the isel tables and enable table compression. llvm-svn: 270354
*	[AVX512] Add a couple patterns to fix some cases where two vector mask ↵	Craig Topper	2016-05-22	1	-0/+11
\| \| \| \| \| \|	inversions could appear in a row. llvm-svn: 270344
*	[AVX512] Remove seemingly unnecessary AddedComplexity adjustment.	Craig Topper	2016-05-22	1	-2/+0
\| \| \| \|	llvm-svn: 270343
*	[profile] Static counter allocation for value profiling (part-1)	Xinliang David Li	2016-05-21	1	-12/+96
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D20459 llvm-svn: 270336
*	[X86] Remove unnecessary alignment check on patterns that use VEXTRACTF128 ↵	Craig Topper	2016-05-21	1	-8/+8
\| \| \| \| \| \|	for integer types when only AVX1 is supported. llvm-svn: 270335
*	[AVX512] Add patterns for extracting subvectors and storing to memory.	Craig Topper	2016-05-21	2	-10/+15
\| \| \| \|	llvm-svn: 270334
*	[AVX512] Capitalize the Z in VEXTRACTPSzmr. Lowercase z has been primarily ↵	Craig Topper	2016-05-21	1	-2/+2
\| \| \| \| \| \|	used to indicating the zero masking behavior which is not the case here. NFC llvm-svn: 270333
*	[AVX512] Rename vector extract instructions so 'mr' intead of 'rm' to ↵	Craig Topper	2016-05-21	1	-2/+2
\| \| \| \| \| \|	reflect the fact that memory is the destination. llvm-svn: 270332
*	[AVX512] Fix copy/paste mistake a I made in a comment.	Craig Topper	2016-05-21	1	-1/+1
\| \| \| \|	llvm-svn: 270331
*	Fix 80-column violation.	Chad Rosier	2016-05-21	1	-1/+2
\| \| \| \|	llvm-svn: 270329
*	[LiveIntervalAnalysis] Don't dereference an end iterator in ↵	Hal Finkel	2016-05-21	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	repairIntervalsInRange This fixes a bug introduced in: r262115 - CodeGen: Take MachineInstr& in SlotIndexes and LiveIntervals, NFC The iterator End here might == MBB->end(), and so we can't unconditionally dereference it. This often goes unnoticed (I don't have a test case that always crashes, and ASAN does not catch it either) because the function call arguments are turned right back into iterators. MachineInstrBundleIterator's constructor, however, does have an assert which might randomly fire. llvm-svn: 270323
*	[Clang][AVX512][intrinsics] Fix rcp and sqrt intrinsics.	Michael Zuckerman	2016-05-21	4	-7/+10
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D20438 llvm-svn: 270322
*	[Clang][AVX512][intrinsics] Fix vscalef intrinsics.	Michael Zuckerman	2016-05-21	5	-8/+11
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D20324 llvm-svn: 270321
*	[AVX512] Add patterns for VEXTRACT v16i16->v8i16 and v32i8->v16i8. Disable ↵	Craig Topper	2016-05-21	2	-1/+9
\| \| \| \| \| \|	AVX2 versions of vector extract when AVX512VL is enabled. llvm-svn: 270318
*	[AVX512] Disable AVX2 VPERMD, VPERMQ, VPERMPS, and VPERMPD patterns when ↵	Craig Topper	2016-05-21	2	-30/+38
\| \| \| \| \| \|	AVX512VL is enabled. Also add shuffle comment printing for AVX512VL VPERMPD/VPERMQ to keep some tests that now use these instructions instead of the AVX2 ones. llvm-svn: 270317
*	[AVX512] Disable AVX/AVX2 VBROADCASTSS/VBROADCASTSD patterns when AVX512VL ↵	Craig Topper	2016-05-21	1	-4/+4
\| \| \| \| \| \|	is enabled. llvm-svn: 270316
*	[SimplifyCFG] Remove cleanuppads which are empty except for calls to ↵	David Majnemer	2016-05-21	1	-5/+17
\| \| \| \| \| \| \| \| \| \| \| \| \|	lifetime.end A cleanuppad is not cheap, they turn into many instructions and result in additional spills and fills. It is not worth keeping a cleanuppad around if all it does is hold a lifetime.end instruction. N.B. We first try to merge the cleanuppad with another cleanuppad to avoid dropping the lifetime and debug info markers. llvm-svn: 270314
*	AMDGPU: Define priorities for register classes	Matt Arsenault	2016-05-21	1	-11/+31
\| \| \| \| \| \| \| \| \| \|	Allocating larger register classes first should give better allocation results (and more importantly for myself, make the lit tests more stable with respect to scheduler changes). Patch by Matthias Braun llvm-svn: 270312
*	[AVX512] Disable AVX/AVX2 patterns for VPSADBW and VPMULUDQ when the ↵	Craig Topper	2016-05-21	1	-4/+4
\| \| \| \| \| \|	AVX512VL/AVX512BWI equivalents are available. llvm-svn: 270311
*	[X86] Convert some SSE2/AVX2 intrinsics to ISD opcodes during lowering ↵	Craig Topper	2016-05-21	2	-12/+24
\| \| \| \| \| \|	instead of pattern matching the intrinsics. This unifies handling with AVX512 and allows these intrinsics to select EVEX encoded instructions to increase available registers. llvm-svn: 270310
*	[IRCE] Don't use an allocator for range checks; NFC	Sanjoy Das	2016-05-21	1	-37/+28
\| \| \| \| \| \| \| \|	The InductiveRangeCheck struct is only five words long; so passing these around value is fine. The allocator makes the code look more complex than it is. llvm-svn: 270309
*	[IRCE] Don't pass IRBuilder<> where unnecessary; NFC	Sanjoy Das	2016-05-21	1	-8/+6
\| \| \| \|	llvm-svn: 270308
*	AMDGPU: Cleanup lowering actions	Matt Arsenault	2016-05-21	3	-292/+248
\| \| \| \| \| \| \| \|	These are kind of a mess and hard to follow, particularly for loads and stores. Fix various redundant, unnecessary and dead settings. llvm-svn: 270307
*	[GuardWidening] Fix incorrect use of remove_if	Sanjoy Das	2016-05-21	1	-25/+29
\| \| \| \| \| \| \| \| \| \| \|	I had used `std::remove_if` under the assumption that it moves the predicate matching elements to the end, but actaully the elements remaining towards the end (after the iterator returned by `std::remove_if`) are indeterminate. Fix the bug (and make the code more straightforward) by using a temporary SmallVector, and add a test case demonstrating the issue. llvm-svn: 270306
*	AMDGPU: Fix high bits after division optimization	Matt Arsenault	2016-05-21	1	-17/+36
\| \| \| \| \| \| \|	This is essentially doing a 24-bit signed division with FP. We need to truncate to the N bit result. llvm-svn: 270305
*	[RegBankSelect] Compute the repairing cost for copies.	Quentin Colombet	2016-05-21	1	-15/+40
\| \| \| \| \| \| \|	Prior to this patch, we were using 1 for all the repairing costs. Now, we use the information from the target to get this information. llvm-svn: 270304
*	[AVR] Add AVRMCAsmInfo	Dylan McKay	2016-05-21	3	-0/+60
\| \| \| \|	llvm-svn: 270302
*	AMDGPU: Fix verifier error when spilling SGPRs	Matt Arsenault	2016-05-21	1	-0/+13
\| \| \| \| \| \| \| \| \| \| \|	The current SGPR spilling test does not stress this because it is using s_buffer_load instructions to increase SGPR pressure and spill, but their output operands have the same SReg_32_XM0 constraint. This fixes an error when the SReg_32 output from most instructions is spilled. llvm-svn: 270301
*	AMDGPU: Fix relationship between SReg_32 and SReg_32_XM0	Matt Arsenault	2016-05-21	1	-6/+5
\| \| \| \|	llvm-svn: 270300
*	Fix implicit type conversion. NFC.	Chris Bieneman	2016-05-21	1	-1/+1
\| \| \| \|	llvm-svn: 270299
*	[AVR] Fix header files in MCTargetDesc	Dylan McKay	2016-05-21	6	-4/+39
\| \| \| \| \| \| \|	Everything now compiles successfully, but there are still undefined references. llvm-svn: 270298
*	AMDGPU: Handle cbranch vccz/vccnz	Matt Arsenault	2016-05-21	2	-1/+21
\| \| \| \|	llvm-svn: 270297
*	AMDGPU: Implement ReverseBranchCondition	Matt Arsenault	2016-05-21	2	-0/+10
\| \| \| \|	llvm-svn: 270296
*	AMDGPU: Implement AnalyzeBranch	Matt Arsenault	2016-05-21	2	-1/+130
\| \| \| \| \| \|	Original patch by Tom Stellard llvm-svn: 270295
*	[WebAssembly] Optimize away return instructions using fallthroughs.	Dan Gohman	2016-05-21	6	-10/+111
\| \| \| \| \| \| \| \| \|	This saves a small amount of code size, and is a first small step toward passing values on the stack across block boundaries. Differential Review: http://reviews.llvm.org/D20450 llvm-svn: 270294
*	Fix constant folding of addrspacecast of null	Matt Arsenault	2016-05-21	1	-1/+2
\| \| \| \| \| \| \|	This should not be making assumptions on the value of the casted pointer. llvm-svn: 270293
*	[AVR] Fix signuature of AVRTargetMachine constructor	Dylan McKay	2016-05-20	2	-4/+7
\| \| \| \|	llvm-svn: 270292
*	LiveIntervalAnalysis: Rework constructMainRangeFromSubranges()	Matthias Braun	2016-05-20	4	-246/+40
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We now use LiveRangeCalc::extendToUses() instead of a specially designed algorithm in constructMainRangeFromSubranges(): - The original motivation for constructMainRangeFromSubranges() were differences between the main liverange and subranges because of hidden dead definitions. This case however cannot happen anymore with the DetectDeadLaneMasks pass in place. - It simplifies the code. - This fixes a longstanding bug where we did not properly create new SSA values on merging control flow (the MachineVerifier missed most of these cases). - Move constructMainRangeFromSubranges() to LiveIntervalAnalysis and LiveRangeCalc to better match the implementation/available helper functions. This re-applies r269016. The fixes from r270290 and r270259 should avoid the machine verifier problems this time. llvm-svn: 270291
*	MachineVerifier: subregs so not require defs/valnos on every path	Matthias Braun	2016-05-20	1	-2/+3
\| \| \| \| \| \| \| \| \| \| \|	It is fine for subregister ranges to be undefined on some CFG paths as we may have a "vregX:other_subreg<read-undef> =" def on that path. We do not (and should not) have live segments for the subregister ranges. The MachineVerifier should not complain about this. This is a slight variant of http://llvm.org/PR27705 llvm-svn: 270290
*	Fix struct member names and simplify. NFC.	Rui Ueyama	2016-05-20	1	-6/+4
\| \| \| \|	llvm-svn: 270289