bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[mips][microMIPS] Implement CACHE, PREF, SSNOP, EHB and PAUSE instructions	Jozef Kolek	2014-12-23	9	-7/+122
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D5204 llvm-svn: 224785
*	Reverting 224775 until mayLoad flag is addressed.	Colin LeMahieu	2014-12-23	8	-52/+48
\| \| \| \|	llvm-svn: 224783
*	Finish removing DestroySource.	Rafael Espindola	2014-12-23	7	-37/+10
\| \| \| \| \| \|	Fixes pr21901. llvm-svn: 224782
*	DIBuilder: Similar to createPointerType, make createMemberPointerType take	Adrian Prantl	2014-12-23	2	-5/+10
\| \| \| \| \| \| \| \| \|	a size and alignment. Several assertions in DwarfDebug rely on all variable types to report back a size, or to be derived from a type with a size. Tested in CFE. llvm-svn: 224780
*	Always assert in DAGCombine and not only when -debug is enabled	Mehdi Amini	2014-12-23	1	-5/+6
\| \| \| \| \| \| \| \| \|	Right now in DAG Combine check the validity of the returned type only when -debug is given on the command line. However usually the test cases in the validation does not use -debug. An Assert build should always check this. llvm-svn: 224779
*	Pass LSAN_OPTIONS down so that it is possible to add suppressions.	Rafael Espindola	2014-12-23	1	-1/+2
\| \| \| \|	llvm-svn: 224777
*	Fix a leak found by asan.	Rafael Espindola	2014-12-23	1	-5/+12
\| \| \| \|	llvm-svn: 224776
*	[Hexagon] Adding word loads.	Colin LeMahieu	2014-12-23	8	-48/+52
\| \| \| \|	llvm-svn: 224775
*	[Hexagon] Adding signed halfword loads.	Colin LeMahieu	2014-12-23	6	-30/+34
\| \| \| \|	llvm-svn: 224774
*	Fix a leak found by asan.	Rafael Espindola	2014-12-23	1	-2/+3
\| \| \| \|	llvm-svn: 224773
*	[Hexagon] Adding unsigned halfword load.	Colin LeMahieu	2014-12-23	5	-20/+18
\| \| \| \|	llvm-svn: 224772
*	[mips][microMIPS] Implement LWSP and SWSP instructions	Jozef Kolek	2014-12-23	10	-0/+126
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D6416 llvm-svn: 224771
*	[OCaml] PR22014: OCaml bindings didn't link to libLLVM-*.so with -Wl,--as-needed	Peter Zotov	2014-12-23	1	-2/+2
\| \| \| \| \| \|	Patch by Evangelos Foutras <evangelos@foutrelis.com>. llvm-svn: 224766
*	[ValueTracking] Move GlobalAlias handling to be after the max depth check in ↵	Michael Kuperstein	2014-12-23	2	-15/+38
\| \| \| \| \| \| \| \| \| \| \| \|	computeKnownBits() GlobalAlias handling used to be after GlobalValue handling, which meant it was, in practice, dead code. r220165 moved GlobalAlias handling to be before GlobalValue handling, but also moved it to be before the max depth check, causing an assert due to a recursion depth limit violation. This moves GlobalAlias handling forward to where it's safe, and changes the GlobalValue handling to only look at GlobalObjects. Differential Revision: http://reviews.llvm.org/D6758 llvm-svn: 224765
*	AVX-512: Added FMA instructions, intrinsics an tests for KNL and SKX targets	Elena Demikhovsky	2014-12-23	6	-81/+677
\| \| \| \| \| \| \| \|	by Asaf Badouh http://reviews.llvm.org/D6456 llvm-svn: 224764
*	[PowerPC] Don't mark the return-address slot as immutable	Hal Finkel	2014-12-23	2	-1/+26
\| \| \| \| \| \| \| \| \| \| \| \| \|	It is tempting to mark the fixed stack slot used to store the return address as immutable when lowering @llvm.returnaddress(i32 0). Unfortunately, within the function, it is not completely immutable: it is written during the function prologue. When using post-RA instruction scheduling, the prologue instructions are available for scheduling, and we're not free to interchange the order of a particular store in the prologue with loads from that stack location. Fixes PR21976. llvm-svn: 224761
*	AVX-512: BLENDM - fixed encoding of the broadcast version	Elena Demikhovsky	2014-12-23	5	-3/+348
\| \| \| \| \| \|	Added more intrinsics and encoding tests. llvm-svn: 224760
*	[DagCombine] Improve DAGCombiner BUILD_VECTOR when it has two sources of ↵	Michael Kuperstein	2014-12-23	2	-12/+42
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	elements This partially fixes PR21943. For AVX, we go from: vmovq (%rsi), %xmm0 vmovq (%rdi), %xmm1 vpermilps $-27, %xmm1, %xmm2 ## xmm2 = xmm1[1,1,2,3] vinsertps $16, %xmm2, %xmm1, %xmm1 ## xmm1 = xmm1[0],xmm2[0],xmm1[2,3] vinsertps $32, %xmm0, %xmm1, %xmm1 ## xmm1 = xmm1[0,1],xmm0[0],xmm1[3] vpermilps $-27, %xmm0, %xmm0 ## xmm0 = xmm0[1,1,2,3] vinsertps $48, %xmm0, %xmm1, %xmm0 ## xmm0 = xmm1[0,1,2],xmm0[0] To the expected: vmovq (%rdi), %xmm0 vmovhpd (%rsi), %xmm0, %xmm0 retq Fixing this for AVX2 is still open. Differential Revision: http://reviews.llvm.org/D6749 llvm-svn: 224759
*	[PowerPC] Don't attempt a 64-bit pow2 division on PPC32	Hal Finkel	2014-12-23	2	-0/+11
\| \| \| \| \| \| \| \| \| \|	In r224033, in moving the signed power-of-2 division expansion into BuildSDIVPow2, I accidentally made it possible to attempt the lowering for a 64-bit division on PPC32. This later asserts. Fixes PR21928. llvm-svn: 224758
*	[SimplifyCFG] Revise common code sinking	Michael Liao	2014-12-23	2	-32/+62
\| \| \| \| \| \| \| \| \| \|	- Fix the case where more than 1 common instructions derived from the same operand cannot be sunk. When a pair of value has more than 1 derived values in both branches, only 1 derived value could be sunk. - Replace BB1 -> (BB2, PN) map with joint value map, i.e. map of (BB1, BB2) -> PN, which is more accurate to track common ops. llvm-svn: 224757
*	Remove a bad cast in CloneModule()	Michael Kuperstein	2014-12-23	1	-1/+1
\| \| \| \| \| \|	A cast that was introduced in r209007 was accidentally left in after the changes made to GlobalAlias rules in r210062. This crashes if the aliasee is a now-leggal ConstantExpr. llvm-svn: 224756
*	[ARM] Don't break alignment when combining base updates into load/stores.	Ahmed Bougacha	2014-12-23	4	-26/+152
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	r223862/r224203 tried to also combine base-updating load/stores. There was a mistake there: the alignment was added as is as an operand to the ARMISD::VLD/VST node. However, the VLD/VST selection logic doesn't care about less-than-standard alignment attributes. For example, no matter the alignment of a v2i64 load (say 1), SelectVLD picks VLD1q64 (because of the memory type). But VLD1q64 ("vld1.64 {dXX, dYY}") is 8-aligned, per ARMARMv7a 3.2.1. For the 1-aligned load, what we really want is VLD1q8. This commit introduces bitcasts if necessary, and changes the vld/vst type to one whose standard alignment matches the original load/store alignment. Differential Revision: http://reviews.llvm.org/D6759 llvm-svn: 224754
*	Fix UBSan bootstrap: replace shift of negative value with multiplication.	Alexey Samsonov	2014-12-23	1	-1/+1
\| \| \| \|	llvm-svn: 224752
*	Fix UBSan bootstrap: don't bind reference to nullptr.	Alexey Samsonov	2014-12-23	1	-2/+4
\| \| \| \|	llvm-svn: 224751
*	Revert r224739: Debug info: Teach SROA how to update debug info for	Chandler Carruth	2014-12-23	4	-233/+5
\| \| \| \| \| \| \| \| \| \| \|	fragmented variables. This caused codegen to start crashing when we built somewhat large programs with debug info and optimizations. 'check-msan' hit in, and I suspect a bootstrap would as well. I mailed a test case to the review thread. llvm-svn: 224750
*	X86: Don't over-align combined loads.	Jim Grosbach	2014-12-23	2	-8/+38
\| \| \| \| \| \| \| \| \| \| \|	When combining consecutive loads+inserts into a single vector load, we should keep the alignment of the base load. Doing otherwise can, and does, lead to using overly aligned instructions. In the included test case, for example, using a 32-byte vmovaps on a 16-byte aligned value. Oops. rdar://19190968 llvm-svn: 224746
*	Make musttail more robust for vector types on x86	Reid Kleckner	2014-12-22	6	-100/+302
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Previously I tried to plug musttail into the existing vararg lowering code. That turned out to be a mistake, because non-vararg calls use significantly different register lowering, even on x86. For example, AVX vectors are usually passed in registers to normal functions and memory to vararg functions. Now musttail uses a completely separate lowering. Hopefully this can be used as the basis for non-x86 perfect forwarding. Reviewers: majnemer Differential Revision: http://reviews.llvm.org/D6156 llvm-svn: 224745
*	Remove dynamic allocation/indirection from GCOVBlocks owned by GCOVFunction	David Blaikie	2014-12-22	1	-22/+25
\| \| \| \| \| \| \| \| \|	Since these are all created in the DenseMap before they are referenced, there's no problem with pointer validity by the time it's required. This removes another use of DeleteContainerSeconds/manual memory management which I'm cleaning up from time to time. llvm-svn: 224744
*	Thumb1 frame lowering: Mark CFI instructions with the FrameSetup flag.	Adrian Prantl	2014-12-22	2	-8/+16
\| \| \| \| \| \| \| \| \| \| \| \| \|	Followup to r224294: ARM/AArch64: Attach the FrameSetup MIFlag to CFI instructions. Debug info marks the first instruction without the FrameSetup flag as being the end of the function prologue. Any CFI instructions in the middle of the function prologue would cause debug info to end the prologue too early and worse, attach the line number of the CFI instruction, which incidentally is often 0. llvm-svn: 224743
*	[SROA] Lift the logic for traversing the alloca slices one partition at	Chandler Carruth	2014-12-22	1	-157/+303
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	a time into a partition iterator and a Partition class. There is a lot of knock-on simplification that this enables, largely stemming from having a Partition object to refer to in lots of helpers. I've only done a minimal amount of that because enoguh stuff is changing as-is in this commit. This shouldn't change any observable behavior. I've worked hard to preserve the exact traversal semantics which were originally present even though some of them make no sense. I'll be changing some of this in subsequent commits now that the logic is carefully factored into a reusable place. The primary motivation for this change is to break the rewriting into phases in order to support more intelligent rewriting. For example, I'm planning to change how split loads and stores are rewritten to remove the significant overuse of integer bit packing in the resulting code and allow more effective secondary splitting of aggregates. For any of this to work, they have to share the exact traversal logic. llvm-svn: 224742
*	[LCSSA] Handle PHI insertion in disjoint loops	Bruno Cardoso Lopes	2014-12-22	5	-16/+83
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Take two disjoint Loops L1 and L2. LoopSimplify fails to simplify some loops (e.g. when indirect branches are involved). In such situations, it can happen that an exit for L1 is the header of L2. Thus, when we create PHIs in one of such exits we are also inserting PHIs in L2 header. This could break LCSSA form for L2 because these inserted PHIs can also have uses in L2 exits, which are never handled in the current implementation. Provide a fix for this corner case and test that we don't assert/crash on that. Differential Revision: http://reviews.llvm.org/D6624 rdar://problem/19166231 llvm-svn: 224740
*	Debug info: Teach SROA how to update debug info for fragmented variables.	Adrian Prantl	2014-12-22	4	-5/+233
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This allows us to generate debug info for extremely advanced code such as typedef struct { long int a; int b;} S; int foo(S s) { return s.b; } which at -O1 on x86_64 is codegen'd into define i32 @foo(i64 %s.coerce0, i32 %s.coerce1) #0 { ret i32 %s.coerce1, !dbg !24 } with this patch we emit the following debug info for this TAG_formal_parameter [3] AT_location( 0x00000000 0x0000000000000000 - 0x0000000000000006: rdi, piece 0x00000008, rsi, piece 0x00000004 0x0000000000000006 - 0x0000000000000008: rdi, piece 0x00000008, rax, piece 0x00000004 ) AT_name( "s" ) AT_decl_file( "/Volumes/Data/llvm/_build.ninja.release/test.c" ) Thanks to chandlerc, dblaikie, and echristo for their feedback on all previous iterations of this patch! llvm-svn: 224739
*	Fix Windows unwind info for functions in sections other than .text	Reid Kleckner	2014-12-22	2	-35/+98
\| \| \| \| \| \| \| \| \| \| \|	Previously we assumed the section name had the form .text$foo, which is what we used to do for inline functions. If the dollar wasn't present, we'd put unwind data in the .pdata and .xdata sections for the main .text section, which is incorrect. Fixes PR22001. llvm-svn: 224738
*	[Hexagon] Adding memb instruction. Fixing whitespace in test from 224730.	Colin LeMahieu	2014-12-22	6	-30/+32
\| \| \| \|	llvm-svn: 224735
*	Use iterators rather than indices to make this forwards-compatible with a ↵	David Blaikie	2014-12-22	1	-4/+5
\| \| \| \| \| \|	change to the underlying container (to std::list) llvm-svn: 224734
*	unique_ptrify MatchableInfo(const CodeGenInstAlias *Alias)'s parameter	David Blaikie	2014-12-22	1	-14/+11
\| \| \| \|	llvm-svn: 224733
*	[Hexagon] Adding classes and load unsigned byte instruction, updating usages.	Colin LeMahieu	2014-12-22	7	-28/+137
\| \| \| \|	llvm-svn: 224730
*	[x86] Add vector @llvm.ctpop intrinsic custom lowering	Bruno Cardoso Lopes	2014-12-22	2	-0/+311
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Currently, when ctpop is supported for scalar types, the expansion of @llvm.ctpop.vXiY uses vector element extractions, insertions and individual calls to @llvm.ctpop.iY. When not, expansion with bit-math operations is used for the scalar calls. Local haswell measurements show that we can improve vector @llvm.ctpop.vXiY expansion in some cases by using a using a vector parallel bit twiddling approach, based on: v = v - ((v >> 1) & 0x55555555); v = (v & 0x33333333) + ((v >> 2) & 0x33333333); v = ((v + (v >> 4) & 0xF0F0F0F) v = v + (v >> 8) v = v + (v >> 16) v = v & 0x0000003F (from http://graphics.stanford.edu/~seander/bithacks.html#CountBitsSetParallel) When scalar ctpop isn't supported, the approach above performs better for v2i64, v4i32, v4i64 and v8i32 (see numbers below). And even when scalar ctpop is supported, this approach performs ~2x better for v8i32. Here, x86_64 implies -march=corei7-avx without ctpop and x86_64h includes ctpop support with -march=core-avx2. == [x86_64h - new] v8i32: 0.661685 v4i32: 0.514678 v4i64: 0.652009 v2i64: 0.324289 == [x86_64h - old] v8i32: 1.29578 v4i32: 0.528807 v4i64: 0.65981 v2i64: 0.330707 == [x86_64 - new] v8i32: 1.003 v4i32: 0.656273 v4i64: 1.11711 v2i64: 0.754064 == [x86_64 - old] v8i32: 2.34886 v4i32: 1.72053 v4i64: 1.41086 v2i64: 1.0244 More work for other vector types will come next. llvm-svn: 224725
*	Remove unused header. NFC.	Juergen Ributzka	2014-12-22	1	-1/+0
\| \| \| \|	llvm-svn: 224722
*	Add a C++ marker to this header file.	Adrian Prantl	2014-12-22	1	-1/+1
\| \| \| \|	llvm-svn: 224721
*	[C API] Expose LLVMGetGlobalValueAddress and LLVMGetFunctionAddress.	Peter Zotov	2014-12-22	3	-0/+50
\| \| \| \| \| \|	Patch by Ramkumar Ramachandra <artagnon@gmail.com> llvm-svn: 224720
*	[CodeGenPrepare] Handle properly the promotion of operands when this does not	Quentin Colombet	2014-12-22	2	-3/+32
\| \| \| \| \| \| \| \| \|	generate instructions. Fixes PR21978. Related to <rdar://problem/18310086> llvm-svn: 224717
*	AVX-512: Added all forms of BLENDM instructions,	Elena Demikhovsky	2014-12-22	7	-57/+479
\| \| \| \| \| \|	intrinsics, encoding tests for AVX-512F and skx instructions. llvm-svn: 224707
*	Lower multiply-negate operation to mneg on AArch64	Karthik Bhat	2014-12-22	2	-0/+19
\| \| \| \| \| \| \| \| \| \| \|	This patch pattern matches code such as- neg w8, w8 mul w8, w9, w8 to mneg w8, w8, w9 Review: http://reviews.llvm.org/D6754 llvm-svn: 224706
*	Convert a few tests to FileCheck. NFC.	Rafael Espindola	2014-12-22	4	-14/+38
\| \| \| \|	llvm-svn: 224705
*	The leak detector is dead, long live asan and valgrind.	Rafael Espindola	2014-12-22	13	-273/+0
\| \| \| \| \| \| \|	In resent times asan and valgrind have found way more memory management bugs in llvm than the special purpose leak detector. llvm-svn: 224703
*	CodeGen: minor style tweaks to SSP	Saleem Abdulrasool	2014-12-21	1	-13/+15
\| \| \| \| \| \|	Clean up some style related things in the StackProtector CodeGen. NFC. llvm-svn: 224693
*	[X86] Add hasSideEffects = 0 to CALLpcrel16. This matches what is inferred ↵	Craig Topper	2014-12-21	1	-4/+5
\| \| \| \| \| \|	from patterns for the 32-bit version. llvm-svn: 224692
*	Enable (sext x) == C --> x == (trunc C) combine	Matt Arsenault	2014-12-21	5	-37/+570
\| \| \| \| \| \| \| \| \|	Extend the existing code which handles this for zext. This makes this more useful for targets with ZeroOrNegativeOne BooleanContent and obsoletes a custom combine SI uses for i1 setcc (sext(i1), 0, setne) since the constant will now be shrunk to i1. llvm-svn: 224691
*	[X86] Swap operand order in Intel syntax on a bunch of aliases.	Craig Topper	2014-12-20	1	-18/+18
\| \| \| \|	llvm-svn: 224687