bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	CodeGen: strengthen WoA AEABI avoidance tests	Saleem Abdulrasool	2014-05-04	1	-0/+22
\| \| \| \| \| \|	Add additional test cases for WoA AEABI avoidance checking. llvm-svn: 207942
*	MC: support FK_SecRel_4 for Windows on ARM	Saleem Abdulrasool	2014-05-04	1	-0/+41
\| \| \| \| \| \| \| \| \|	Add handling for FK_SecRel_4 (4-byte section relative relocations). These are used by the generation of DWARF debug information (the abbrevations use section relative relocations). This will also be used in generation of CodeView line tables. llvm-svn: 207941
*	LoopUnroll: If we're doing partial unrolling, use the PartialThreshold to ↵	Benjamin Kramer	2014-05-04	1	-0/+47
\| \| \| \| \| \| \| \| \| \| \|	limit unrolling. Otherwise we use the same threshold as for complete unrolling, which is way too high. This made us unroll any loop smaller than 150 instructions by 8 times, but only if someone specified -march=core2 or better, which happens to be the default on darwin. llvm-svn: 207940
*	SLPVectorizer: Bring back the insertelement patch (r205965) with fixes	Arnold Schwaighofer	2014-05-04	2	-0/+140
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When can't assume a vectorized tree is rooted in an instruction. The IRBuilder could have constant folded it. When we rebuild the build_vector (the series of InsertElement instructions) use the last original InsertElement instruction. The vectorized tree root is guaranteed to be before it. Also, we can't assume that the n-th InsertElement inserts the n-th element into a vector. This reverts r207746 which reverted the revert of the revert of r205018 or so. Fixes the test case in PR19621. llvm-svn: 207939
*	AVX-512: minor change in rndscale intrinsic	Elena Demikhovsky	2014-05-04	1	-2/+2
\| \| \| \|	llvm-svn: 207937
*	X86: repair export compatibility with MinGW/cygwin	Saleem Abdulrasool	2014-05-04	1	-36/+41
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Both MinGW and cygwin (i686) construct export directives without the global leader prefix. This is mostly due to the fact that they use GNU ld which does not correctly handle the export directive. This apparently has been been broken for a while. However, this was recently reported as being broken by mingwandroid and diorcety of the msys2 project. Remove the global leader prefix if targeting MinGW or cygwin, otherwise, retain the global leader prefix. Add an explicit test for cygwin's behaviour of export directives. llvm-svn: 207926
*	Fix pr19645.	Rafael Espindola	2014-05-03	2	-1/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The fix itself is fairly simple: move getAccessVariant to MCValue so that we replace the old weak expression evaluation with the far more general EvaluateAsRelocatable. This then requires that EvaluateAsRelocatable stop when it finds a non trivial reference kind. And that in turn requires the ELF writer to look harder for weak references. Last but not least, this found a case where we were being bug by bug compatible with gas and accepting an invalid input. I reported pr19647 to track it. llvm-svn: 207920
*	[ARM64] Correctly select ANDWri in FastISel.	Joey Gouly	2014-05-03	1	-1/+1
\| \| \| \| \| \|	http://reviews.llvm.org/D3598 llvm-svn: 207917
*	Vectorize intrinsic math function calls in SLPVectorizer.	Karthik Bhat	2014-05-03	1	-0/+128
\| \| \| \| \| \| \|	This patch adds support to recognize and vectorize intrinsic math functions in SLPVectorizer. Review: http://reviews.llvm.org/D3560 and http://reviews.llvm.org/D3559 llvm-svn: 207901
*	[LSR] Add llc testcase for r207271/r207569.	Adam Nemet	2014-05-02	1	-0/+70
\| \| \| \| \| \| \|	See PR19608 for the details but to summarize it was easy to modify the .ll file to get the desired def-use ordering. llvm-svn: 207887
*	[sanitizers] Propagate the sanitizer options through to the lit context.	Chandler Carruth	2014-05-02	1	-0/+5
\| \| \| \| \| \| \| \|	This makes it really easy to debug leaks FYI: ASAN_OPTIONS=detect_leaks=1 ./bin/llvm-lit -v <path to test> llvm-svn: 207874
*	llvm-cov: Fix handling of line zero appearing in a line table	Justin Bogner	2014-05-02	4	-1/+30
\| \| \| \| \| \| \| \| \| \| \| \| \|	Reading line tables in llvm-cov was pretty broken, but would happen to work as long as no line in the table was 0. It's not clear to me whether a line of zero should show up in these tables, but deciding to read a string in the middle of the line table is certainly the wrong thing to do if it does. I've also added some comments, as trying to figure out what this block of code was doing was fairly unpleasant. llvm-svn: 207866
*	[tablegen] !strconcat accepts more than two arguments but this wasn't ↵	Daniel Sanders	2014-05-02	1	-1/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	documented or tested. Summary: * Updated the documentation * Added a test for >2 arguments * Added a check for the lexical concatenation * Made the existing test a bit stricter. Reviewers: t.p.northover Reviewed By: t.p.northover Subscribers: t.p.northover, llvm-commits Differential Revision: http://reviews.llvm.org/D3485 llvm-svn: 207865
*	Teach GlobalDCE how to remove empty global_ctor entries.	Nico Weber	2014-05-02	2	-0/+59
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This moves most of GlobalOpt's constructor optimization code out of GlobalOpt into Transforms/Utils/CDtorUtils.{h,cpp}. The public interface is a single function OptimizeGlobalCtorsList() that takes a predicate returning which constructors to remove. GlobalOpt calls this with a function that statically evaluates all constructors, just like it did before. This part of the change is behavior-preserving. Also add a call to this from GlobalDCE with a filter that removes global constructors that contain a "ret" instruction and nothing else – this fixes PR19590. llvm-svn: 207856
*	[GVN] Pass the phi-translated address of a load instead of the untranslated	Akira Hatanaka	2014-05-02	1	-0/+87
\| \| \| \| \| \| \| \| \|	address to AnalyzeLoadFromClobberingLoad. This fixes a bug in load-PRE where PRE is applied to a load that is not partially redundant. <rdar://problem/16638765>. llvm-svn: 207853
*	MC: place .file records into the correct section	Saleem Abdulrasool	2014-05-02	2	-0/+60
\| \| \| \| \| \| \| \| \|	.file records are supposed to have a section identifier of 65534 (IMAGE_SCN_DEBUG) rather than 0. This is spelt out clearly within the PE/COFF specification. Fix this minor oversight with the implementation for support for .file records. llvm-svn: 207851
*	DAGCombine: prevent formation of illegal ConstantFP nodes.	Tim Northover	2014-05-02	1	-0/+14
\| \| \| \|	llvm-svn: 207850
*	R600: Expand vector sin and cos.	Tom Stellard	2014-05-02	2	-22/+65
\| \| \| \| \| \| \| \|	v2: move code to AMDGPUISelLowering.cpp squash with tests (both EG and SI) Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 207845
*	R600: Expand TruncStore i64 -> {i16,i8}	Tom Stellard	2014-05-02	1	-0/+40
\| \| \| \|	llvm-svn: 207844
*	AArch64/ARM64: add patterns for post-indexed ST1 ops.	Tim Northover	2014-05-02	1	-0/+211
\| \| \| \|	llvm-svn: 207840
*	AArch64/ARM64: support indexed loads/stores on vector types.	Tim Northover	2014-05-02	1	-0/+402
\| \| \| \| \| \| \| \|	While post-indexed LD1/ST1 instructions do exist for vector loads, this patch makes use of the more flexible addressing-modes in LDR/STR instructions. llvm-svn: 207838
*	Allow SelectionDAG::FoldConstantArithmetic to work when it's called with a ↵	Benjamin Kramer	2014-05-02	1	-0/+10
\| \| \| \| \| \|	vector VT but scalar values. llvm-svn: 207835
*	Fold strlen(expr ? "str1" : "str2") to x ? len1 : len2. This fires about 330 ↵	Nick Lewycky	2014-05-02	1	-0/+12
\| \| \| \| \| \|	times in a bootstrap of clang. llvm-svn: 207828
*	[IR] Make {extract,insert}element accept an index of any integer type.	Michael J. Spencer	2014-05-01	2	-0/+18
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Given the following C code llvm currently generates suboptimal code for x86-64: __m128 bss4( const __m128 ptr, size_t i, size_t j ) { float f = ptr[i][j]; return (__m128) { f, f, f, f }; } ================================================= define <4 x float> @_Z4bss4PKDv4_fmm(<4 x float> nocapture readonly %ptr, i64 %i, i64 %j) #0 { %a1 = getelementptr inbounds <4 x float>* %ptr, i64 %i %a2 = load <4 x float>* %a1, align 16, !tbaa !1 %a3 = trunc i64 %j to i32 %a4 = extractelement <4 x float> %a2, i32 %a3 %a5 = insertelement <4 x float> undef, float %a4, i32 0 %a6 = insertelement <4 x float> %a5, float %a4, i32 1 %a7 = insertelement <4 x float> %a6, float %a4, i32 2 %a8 = insertelement <4 x float> %a7, float %a4, i32 3 ret <4 x float> %a8 } ================================================= shlq $4, %rsi addq %rdi, %rsi movslq %edx, %rax vbroadcastss (%rsi,%rax,4), %xmm0 retq ================================================= The movslq is uneeded, but is present because of the trunc to i32 and then sext back to i64 that the backend adds for vbroadcastss. We can't remove it because it changes the meaning. The IR that clang generates is already suboptimal. What clang really should emit is: %a4 = extractelement <4 x float> %a2, i64 %j This patch makes that legal. A separate patch will teach clang to do it. Differential Revision: http://reviews.llvm.org/D3519 llvm-svn: 207801
*	Add basic functionality for assignment of ints.	Reed Kotler	2014-05-01	1	-0/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This creates a lot of core infrastructure in which to add, with little effort, quite a bit more to mips fast-isel Test Plan: simplestore.ll Reviewers: dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D3527 llvm-svn: 207790
*	Don't propagate StorageClass and ComplexType to aliases.	Rafael Espindola	2014-05-01	2	-2/+2
\| \| \| \| \| \| \| \| \|	This matches gas' behaviour on COFF. I think that this yak is now sufficiently shaved for aliases with offset to work. llvm-svn: 207786
*	Add an optimization that does CSE in a group of similar GEPs.	Eli Bendersky	2014-05-01	3	-0/+165
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This optimization merges the common part of a group of GEPs, so we can compute each pointer address by adding a simple offset to the common part. The optimization is currently only enabled for the NVPTX backend, where it has a large payoff on some benchmarks. Review: http://reviews.llvm.org/D3462 Patch by Jingyue Wu. llvm-svn: 207783
*	DebugInfo: Correct the attribute type kind.	David Blaikie	2014-05-01	1	-2/+2
\| \| \| \| \| \|	Post commit review feedback from Paul Robinson regarding r207777. llvm-svn: 207782
*	PR19623: Implement typedefs of void.	David Blaikie	2014-05-01	1	-0/+32
\| \| \| \| \| \| \| \|	This the LLVM portion that will allow Clang and other frontends to emit typedefs of void by providing a null type for the typedef's underlying type. llvm-svn: 207777
*	R600/SI: Fix verifier error with pseudo store instructions.	Matt Arsenault	2014-05-01	3	-4/+4
\| \| \| \| \| \| \| \|	Use i32 instead of specifying SReg_32. When this is the pseudo INDIRECT_BASE_ADDR, this would give a bogus verifier error. llvm-svn: 207770
*	Compute the correct section for zed = foo + 1 in COFF.	Rafael Espindola	2014-05-01	1	-1/+1
\| \| \| \| \| \| \| \| \|	This fixes pr19147. There are a few more related issues to fix, but the testcase in the bug now passes. llvm-svn: 207763
*	[ARM64] Prefer generation of bzero on Darwin only	Bradley Smith	2014-05-01	1	-5/+12
\| \| \| \|	llvm-svn: 207760
*	Don't force symbols to be globals in .thumb_set.	Rafael Espindola	2014-05-01	1	-15/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We currently force symbols to be globals in .thumb_set. The intent seems to be that given .thumb_set foo, bar we emit an undefined symbol to bar if it is never defined. The side effect is that we mark bar as global, even if it is defined, which gas does not. Producing an undefined reference to bar is a general difference from MC and gas. For example, given a = b gas will produce an undefined reference to b, MC will not. I would be surprised if any code depends on this, but it it does, we should fix the general difference, not special case .thumb_set. llvm-svn: 207757
*	AArch64/ARM64: rewrite test to use FileCheck & add ARM64 lines	Tim Northover	2014-05-01	1	-22/+45
\| \| \| \|	llvm-svn: 207754
*	AArch64/ARM64: port basic disassembly tests to ARM64.	Tim Northover	2014-05-01	8	-1145/+1153
\| \| \| \|	llvm-svn: 207753
*	AArch64/ARM64: print BFM instructions as BFI or BFXIL	Tim Northover	2014-05-01	7	-82/+67
\| \| \| \| \| \| \|	The canonical form of the BFM instruction is always one of the more explicit extract or insert operations, which makes reading output much easier. llvm-svn: 207752
*	Correction to assert statemtent to allow 32-bit unsigned numbers with the ↵	Richard Barton	2014-05-01	1	-0/+7
\| \| \| \| \| \| \| \|	top bit set. This fixes an ARM assembler crash - regression test added. llvm-svn: 207747
*	Revert r205965, which essentially reverts r205018 for the second time.	Chandler Carruth	2014-05-01	1	-24/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	=[ Turns out that this was the root cause of PR19621. We found a crasher only recently (likely due to improvements elsewhere in the SLP vectorizer) but the reduced test case failed all the way back to here. I've confirmed that reverting this patch both fixes the reduced test case in PR19621 and the actual source file that led to it, so it seems to really be rooted here. I've replied to the commit thread with discussion of my (feeble) attempts to debug this. Didn't make it very far, so reverting now that we have a good test case so that things can get back to healthy while the debugging carries on. llvm-svn: 207746
*	[llvm-readobj] Add support for Mips specific ELF header e_flags.	Simon Atanasyan	2014-05-01	1	-4/+5
\| \| \| \|	llvm-svn: 207744
*	[ARM64] Conditionalize CPU specific system registers on subtarget features	Bradley Smith	2014-05-01	2	-2/+10
\| \| \| \|	llvm-svn: 207742
*	[mips] Move expansion of .cpsetup to target streamer.	Matheus Almeida	2014-05-01	1	-11/+53
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: There are two functional changes: 1) The directive is not expanded for the ASM->ASM code path. 2) If PIC is not set, there's no expansion for the ASM->OBJ code path (same behaviour as GAS). Reviewers: dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D3482 llvm-svn: 207741
*	[mips] Removed two-operand alias for sllv, sr[al]v, rotrv, dsllv, dsr[al]v, ↵	Daniel Sanders	2014-05-01	11	-47/+0
\| \| \| \| \| \| \| \| \| \|	and drotrv GAS doesn't actually accept these particular cases. The mnemonic without the trailing 'v' still supports two-operand aliases. llvm-svn: 207740
*	Record the DWARF version in MCContext	Oliver Stannard	2014-05-01	1	-3/+11
\| \| \| \| \| \| \|	Record the DWARF version in MCContext, and use it when emitting the dwarf version into the debug info. llvm-svn: 207739
*	Start fixing pr19147.	Rafael Espindola	2014-05-01	1	-0/+19
\| \| \| \| \| \| \|	This makes the coff writer compute the correct symbol value for the test in pr19147. The section is still incorrect, that will be fixed in a followup patch. llvm-svn: 207728
*	Revert "Emit DW_AT_object_pointer once, on the declaration, for each function."	David Blaikie	2014-04-30	2	-30/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Breaks GDB buildbot (http://lab.llvm.org:8011/builders/clang-x86_64-ubuntu-gdb-75/builds/14517) GCC emits DW_AT_object_pointer /everywhere/ (declaration, abstract definition, inlined subroutine), but it looks like GCC relies on it being somewhere other than the declaration, at least. I'll experiment further & can hopefully still remove it from the inlined_subroutine. This reverts commit r207705. llvm-svn: 207719
*	DebugInfo: Omit DW_AT_artificial on DW_TAG_formal_parameters in ↵	David Blaikie	2014-04-30	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \|	DW_TAG_inlined_subroutines. They just don't need to be there - they're inherited from the abstract definition. In theory I would like them to be inherited from the declaration, but the DWARF standard doesn't quite say that... we can probably do it anyway but I'm less confident about that so I'll leave it for a separate commit. llvm-svn: 207717
*	Move a testcase from ELF to ARM64, incorrectly placed in r207627.	James Molloy	2014-04-30	1	-0/+0
\| \| \| \|	llvm-svn: 207706
*	Emit DW_AT_object_pointer once, on the declaration, for each function.	David Blaikie	2014-04-30	2	-6/+30
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This effectively reverts r164326, but adds some comments and justification and ensures we /don't/ emit the DW_AT_object_pointer on the (abstract and concrete) definitions. (while still preserving it on standalone definitions involving ObjC Blocks) This does increase the size of member function declarations from 7 to 11 bytes, unfortunately, but still seems like the Right Thing to do so that callers that see only the declaration still have the information about the object pointer. That said, I don't know what, if any, DWARF consumers don't have a heuristic to guess this in the case of normal C++ member functions - perhaps we can remove it entirely. llvm-svn: 207705
*	Don't expect to find fpcmp and PerfectShuffle when running lit tests	Alexey Samsonov	2014-04-30	1	-2/+0
\| \| \| \|	llvm-svn: 207704
*	[ARM64] Prevent bit extraction to be adjusted by following shift	Weiming Zhao	2014-04-30	1	-0/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	For pattern like ((x >> C1) & Mask) << C2, DAG combiner may convert it into (x >> (C1-C2)) & (Mask << C2), which makes pattern matching of ubfx more difficult. For example: Given %shr = lshr i64 %x, 4 %and = and i64 %shr, 15 %arrayidx = getelementptr inbounds [8 x [64 x i64]]* @arr, i64 0, %i64 2, i64 %and %0 = load i64* %arrayidx With current shift folding, it takes 3 instrs to compute base address: lsr x8, x0, #1 and x8, x8, #0x78 add x8, x9, x8 If using ubfx, it only needs 2 instrs: ubfx x8, x0, #4, #4 add x8, x9, x8, lsl #3 This fixes bug 19589 llvm-svn: 207702