bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[x86] Switch a constant selection test to use positive assertions and to	Chandler Carruth	2014-12-07	1	-5/+17
\| \| \| \| \| \| \|	store to real pointers so that its clear that the right code is in fact being generated. llvm-svn: 223612
*	[x86] Cleanup the combining vector shuffle tests a bit by merging	Chandler Carruth	2014-12-07	1	-62/+18
\| \| \| \| \| \|	identical checks for different SSE variants into a single block. llvm-svn: 223611
*	[x86] Clean up the shift lowering vector shuffle tests a bit using my	Chandler Carruth	2014-12-07	4	-93/+26
\| \| \| \| \| \| \|	script. Notably this folds all the SSE cases together into a single FileCheck block. It also adds a vex prefix. llvm-svn: 223610
*	ConstantFold: Don't optimize comparisons with weak linkage objects	David Majnemer	2014-12-06	1	-0/+5
\| \| \| \| \| \| \| \| \| \| \| \|	Consider: void f() {} void __attribute__((weak)) g() {} bool b = &f != &g; It's possble for g to resolve to f if --defsym=g=f is passed on to the linker. llvm-svn: 223585
*	InstSimplify: Optimize away useless unsigned comparisons	David Majnemer	2014-12-06	1	-0/+48
\| \| \| \| \| \|	Code like X < Y && Y == 0 should always be folded away to false. llvm-svn: 223583
*	R600/SI: Restore PrivateGlobalPrefix to the default ELF value of ".L"	Tom Stellard	2014-12-06	1	-1/+5
\| \| \| \| \| \|	This was changed in r223323. llvm-svn: 223579
*	IR: Disallow function-local metadata attachments	Duncan P. N. Exon Smith	2014-12-06	3	-5/+5
\| \| \| \| \| \| \| \|	Metadata attachments to instructions cannot be function-local. This is part of PR21532. llvm-svn: 223574
*	Add a proper triple to switch-jump-table.ll	Hans Wennborg	2014-12-06	1	-1/+1
\| \| \| \|	llvm-svn: 223571
*	llvm/test/CodeGen/X86/switch-jump-table.ll: Add explicit triple. Local ↵	NAKAMURA Takumi	2014-12-06	1	-1/+1
\| \| \| \| \| \|	labels have a prefix "." for targeting i686-cygming. llvm-svn: 223570
*	[X86] Refactor PMOV[SZ]Xrm to add missing AVX2 patterns.	Ahmed Bougacha	2014-12-06	4	-2/+388
\| \| \| \| \| \| \| \|	Most patterns will go away once the extload legalization changes land. Differential Revision: http://reviews.llvm.org/D6125 llvm-svn: 223567
*	SelectionDAG switch lowering: Replace unreachable default with most popular ↵	Hans Wennborg	2014-12-06	3	-6/+58
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	case. This can significantly reduce the size of the switch, allowing for more efficient lowering. I also worked with the idea of exploiting unreachable defaults by omitting the range check for jump tables, but always ended up with a non-neglible binary size increase. It might be worth looking into some more. SimplifyCFG currently does this transformation, but I'm working towards changing that so we can optimize harder based on unreachable defaults. Differential Revision: http://reviews.llvm.org/D6510 llvm-svn: 223566
*	IR: Disallow complicated function-local metadata	Duncan P. N. Exon Smith	2014-12-06	3	-12/+11
\| \| \| \| \| \| \| \| \| \|	Disallow complex types of function-local metadata. The only valid function-local metadata is an `MDNode` whose sole argument is a non-metadata function-local value. Part of PR21532. llvm-svn: 223564
*	Add target triples to all dfsan tests.	Peter Collingbourne	2014-12-05	11	-0/+11
\| \| \| \|	llvm-svn: 223536
*	Recommit of r223513 and r223514.	Kuba Brecka	2014-12-05	1	-0/+1
\| \| \| \| \| \|	Reviewed at http://reviews.llvm.org/D6488 llvm-svn: 223532
*	[Hexagon] Adding sub/and/or reg, imm forms	Colin LeMahieu	2014-12-05	1	-0/+6
\| \| \| \|	llvm-svn: 223522
*	Optimize merging of scalar loads for 32-byte vectors [X86, AVX]	Sanjay Patel	2014-12-05	1	-10/+97
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Fix the poor codegen seen in PR21710 ( http://llvm.org/bugs/show_bug.cgi?id=21710 ). Before we crack 32-byte build vectors into smaller chunks (and then subsequently glue them back together), we should look for the easy case where we can just load all elements in a single op. An example of the codegen change is: From: vmovss 16(%rdi), %xmm1 vmovups (%rdi), %xmm0 vinsertps $16, 20(%rdi), %xmm1, %xmm1 vinsertps $32, 24(%rdi), %xmm1, %xmm1 vinsertps $48, 28(%rdi), %xmm1, %xmm1 vinsertf128 $1, %xmm1, %ymm0, %ymm0 retq To: vmovups (%rdi), %ymm0 retq Differential Revision: http://reviews.llvm.org/D6536 llvm-svn: 223518
*	[Hexagon] Updating mux_ir/ri/ii/rr with encoding bits	Colin LeMahieu	2014-12-05	1	-0/+8
\| \| \| \|	llvm-svn: 223515
*	Use 32-bit ebp for NaCl64 in a limited case: llvm.frameaddress.	Jan Wen Voung	2014-12-05	1	-0/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Follow up to [x32] "Use ebp/esp as frame and stack pointer": http://reviews.llvm.org/D4617 In that earlier patch, NaCl64 was made to always use rbp. That's needed for most cases because rbp should hold a full 64-bit address within the NaCl sandbox so that load/stores off of rbp don't require sandbox adjustment (zeroing the top 32-bits, then filling those by adding r15). However, llvm.frameaddress returns a pointer and pointers are 32-bit for NaCl64. In this case, use ebp instead, which will make the register copy type check. A similar mechanism may be needed for llvm.eh.return, but is not added in this change. Test Plan: test/CodeGen/X86/frameaddr.ll Reviewers: dschuff, nadav Subscribers: jfb, llvm-commits Differential Revision: http://reviews.llvm.org/D6514 llvm-svn: 223510
*	[PowerPC]Update Power VSX test cases to also test fast-isel	Bill Seurer	2014-12-05	7	-131/+541
\| \| \| \| \| \| \| \|	Update of some of the VSX test cases for Power to check fast-isel codegen as well as the regular codegen. http://reviews.llvm.org/D6357 llvm-svn: 223509
*	[Hexagon] Adding tfrih/l instructions.	Colin LeMahieu	2014-12-05	1	-0/+4
\| \| \| \|	llvm-svn: 223506
*	[X86] Improved lowering of packed vector shifts to vpsllq/vpsrlq.	Andrea Di Biagio	2014-12-05	1	-8/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	SSE2/AVX non-constant packed shift instructions only use the lower 64-bit of the shift count. This patch teaches function 'getTargetVShiftNode' how to deal with shifts where the shift count node is of type MVT::i64. Before this patch, function 'getTargetVShiftNode' only knew how to deal with shift count nodes of type MVT::i32. This forced the backend to wrongly truncate the shift count to MVT::i32, and then zero-extend it back to MVT::i64. llvm-svn: 223505
*	[Hexagon] Adding add reg, imm form with encoding bits and test.	Colin LeMahieu	2014-12-05	1	-0/+2
\| \| \| \|	llvm-svn: 223504
*	BFI: Saturate when combining edges to a successor	Duncan P. N. Exon Smith	2014-12-05	1	-0/+40
\| \| \| \| \| \| \| \| \| \| \| \|	When a loop gets bundled up, its outgoing edges are quite large, and can just barely overflow 64-bits. If one successor has multiple incoming edges -- and that successor is getting all the incoming mass -- combining just its edges can overflow. Handle that by saturating rather than asserting. This fixes PR21622. llvm-svn: 223500
*	[Hexagon] Adding DoubleRegs decoder. Moving C2_mux and A2_nop. Adding ↵	Colin LeMahieu	2014-12-05	1	-0/+2
\| \| \| \| \| \|	combine imm-imm form. llvm-svn: 223494
*	Fix a bug when pretty-printing DW_OP_deref.	Adrian Prantl	2014-12-05	2	-1/+4
\| \| \| \|	llvm-svn: 223493
*	Regenerate this stale testcase from source.	Adrian Prantl	2014-12-05	1	-114/+112
\| \| \| \|	llvm-svn: 223492
*	[Hexagon] Adding combine reg-reg forms.	Colin LeMahieu	2014-12-05	1	-0/+8
\| \| \| \|	llvm-svn: 223485
*	[Hexagon] Marking several instructions as isCodeGenOnly=0 and adding direct ↵	Colin LeMahieu	2014-12-05	3	-2/+39
\| \| \| \| \| \|	disassembly tests for many instructions. llvm-svn: 223482
*	Improvements to ARM assembler tests	Asiri Rathnayake	2014-12-05	1	-70/+445
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	No functional changes. Got myself bitten in r223113 when adding support for modified immediate syntax (regressions reported by joerg@britannica.bec.de, fixes in r223366 and r223381). Our assembler tests did not cover serveral different syntax variants. This patch expands the test coverage to check for the following cases: 1. Modified immediate operands may be expressed with expressions, as in #(4 * 2) instead of #8. 2. Modified immediate operands may be _optionally_ prefixed by a '#' symbol or a '$' symbol. 3. Certain instructions (e.g. ADD) support single input register variants; [ADD r0, #mod_imm] is same as [ADD r0, r0, #mod_imm]. 4. Certain instructions have aliases which convert plain immediates to modified immediates. For an example, [ADD r0, -10] is not valid because -10 (in two's complement) cannot be encoded as a modified immediate, but ARMInstrInfo.td defines an alias which can transform this into a [SUB r0, 10]. llvm-svn: 223475
*	[msan] Avoid extra origin address realignment.	Evgeniy Stepanov	2014-12-05	1	-0/+73
\| \| \| \| \| \| \| \| \|	Do not realign origin address if the corresponding application address is at least 4-byte-aligned. Saves 2.5% code size in track-origins mode. llvm-svn: 223464
*	[X86] Avoid introducing extra shuffles when lowering packed vector shifts.	Andrea Di Biagio	2014-12-05	2	-1/+161
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When lowering a vector shift node, the backend checks if the shift count is a shuffle with a splat mask. If so, then it introduces an extra dag node to extract the splat value from the shuffle. The splat value is then used to generate a shift count of a target specific shift. However, if we know that the shift count is a splat shuffle, we can use the splat index 'I' to extract the I-th element from the first shuffle operand. The advantage is that the splat shuffle may become dead since we no longer use it. Example: ;; define <4 x i32> @example(<4 x i32> %a, <4 x i32> %b) { %c = shufflevector <4 x i32> %b, <4 x i32> undef, <4 x i32> zeroinitializer %shl = shl <4 x i32> %a, %c ret <4 x i32> %shl } ;; Before this patch, llc generated the following code (-mattr=+avx): vpshufd $0, %xmm1, %xmm1 # xmm1 = xmm1[0,0,0,0] vpxor %xmm2, %xmm2 vpblendw $3, %xmm1, %xmm2, %xmm1 # xmm1 = xmm1[0,1],xmm2[2,3,4,5,6,7] vpslld %xmm1, %xmm0, %xmm0 retq With this patch, the redundant splat operation is removed from the code. vpxor %xmm2, %xmm2 vpblendw $3, %xmm1, %xmm2, %xmm1 # xmm1 = xmm1[0,1],xmm2[2,3,4,5,6,7] vpslld %xmm1, %xmm0, %xmm0 retq llvm-svn: 223461
*	Add missing FP build attribute tests.	Charlie Turner	2014-12-05	1	-28/+148
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The test file test/CodeGen/ARM/build-attributes.ll was missing several floating-point build attribute tests. The intention of this commit is that for each CPU / architecture currently tested, there are now tests that make sure the following attributes are sufficiently checked, * Tag_ABI_FP_rounding * Tag_ABI_FP_denormal * Tag_ABI_FP_exceptions * Tag_ABI_FP_user_exceptions * Tag_ABI_FP_number_model Also in this commit, the -unsafe-fp-math flag has been augmented with the full suite of flags Clang sends to LLVM when you pass -ffast-math to Clang. That is, `-unsafe-fp-math' has been changed to `-enable-unsafe-fp-math -disable-fp-elim -enable-no-infs-fp-math -enable-no-nans-fp-math -fp-contract=fast' Change-Id: I35d766076bcbbf09021021c0a534bf8bf9a32dfc llvm-svn: 223454
*	Revert "r223440 - Consider subregs when calling MI::registerDefIsDead for ↵	Hal Finkel	2014-12-05	1	-168/+0
\| \| \| \| \| \| \| \| \|	phys deps" Reverting this because, while it fixes the problem in the reduced test case, it does not fix the problem in the full test case from the bug report. llvm-svn: 223442
*	Consider subregs when calling MI::registerDefIsDead for phys deps	Hal Finkel	2014-12-05	1	-0/+168
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The scheduling dependency graph is built bottom-up within each scheduling region, and ScheduleDAGInstrs::addPhysRegDeps is called to add output/anti dependencies, based on physical registers, to the SUs for instructions based on those that come before them. In the test case, we start before post-RA scheduling with a block that looks like this: ... INLINEASM <... andc $0,$0,$2 stdcx. $0,0,$3 bne- 1b > [sideeffect] [mayload] [maystore] [attdialect], $0:[regdef-ec:G8RC], %X6<earlyclobber,def,dead>, $1:[mem], %X3<kill>, $2:[reguse:G8RC], %X5<kill>, $3:[reguse:G8RC], %X3, $4:[mem], %X3, $5:[clobber], %CC<earlyclobber,imp-def,dead>, <<badref>> ... %X4<def,dead> = ANDIo8 %X4<kill>, 1, %CR0<imp-def,dead>, %CR0GT<imp-def> ... %R29<def> = ISEL %R3<undef>, %R4<kill>, %CR0GT<kill> where it is relevant that %CC is an alias to %CR0, and that %CR0GT is a subregister of %CR0. However, for post-RA scheduling, no dependency was added to prevent the INLINEASM from being scheduled in between the ANDIo8 and the ISEL (which communicate via the %CR0GT register). In ScheduleDAGInstrs::addPhysRegDeps, when called for the %CC operand, we'd iterate over all of its aliases (which include %CC itself and also %CR0), and look for previously-encountered defs of those registers. We'd find the ANDIo8, but decide not to add a dependency between the INLINEASM and the ANDIo8 because both the INLINEASM's def of %CC is dead, and also the ANDIo8 def of %CR0 is dead. This ignores, however, that ANDIo8 has a non-dead def of %CR0GT, a subregister of %CR0, and thus a dependency still must exist. To fix this problem, when calling registerDefIsDead on the SU with the def, we also check all subregisters for possible non-dead defs, and add the dependency if any are found. Fixes PR21742. llvm-svn: 223440
*	Add a comment.	Adrian Prantl	2014-12-05	1	-1/+1
\| \| \| \|	llvm-svn: 223427
*	Add a few extra cases to the test. NFC.	Rafael Espindola	2014-12-05	2	-5/+30
\| \| \| \|	llvm-svn: 223417
*	Re-add support to llvm-objdump for Mach-O universal files and archives with ↵	Kevin Enderby	2014-12-04	4	-1/+26
\| \| \| \| \| \| \| \| \| \| \|	-macho with fixes. Includes the move of tests for llvm-objdump for universal files to an X86 directory. And the fix where it was failing on linux Rafael tracked down with asan. I had both Jim Grosbach and Adam Hemet look over the second fix since I could not set up asan to reproduce with the old version but not with the fix. llvm-svn: 223416
*	Convert test to use an extra Input file. NFC.	Rafael Espindola	2014-12-04	3	-8/+9
\| \| \| \|	llvm-svn: 223414
*	Simplify implementation and testcase of r223401 based on feedback from dblaikie.	Adrian Prantl	2014-12-04	1	-23/+18
\| \| \| \|	llvm-svn: 223405
*	Debug info: If the RegisterCoalescer::reMaterializeTrivialDef() is	Adrian Prantl	2014-12-04	1	-0/+70
\| \| \| \| \| \| \|	eliminating all uses of a vreg, update any DBG_VALUE describing that vreg to point to the rematerialized register instead. llvm-svn: 223401
*	Add some tests for SimplifyCFG's TurnSwitchRangeIntoICmp(). NFC.	Hans Wennborg	2014-12-04	1	-0/+50
\| \| \| \|	llvm-svn: 223396
*	Add some tests for SimplifyCFG's ConstantFoldTerminator(). NFC.	Hans Wennborg	2014-12-04	1	-0/+64
\| \| \| \|	llvm-svn: 223395
*	[AArch64] Combining Load and IntToFp should check for neon availability	Weiming Zhao	2014-12-04	1	-0/+16
\| \| \| \|	llvm-svn: 223382
*	Fix yet another unseen regression caused by r223113	Asiri Rathnayake	2014-12-04	2	-1/+7
\| \| \| \| \| \| \| \| \| \|	r223113 added support for ARM modified immediate assembly syntax. Which assumes all immediate operands are prefixed with a '#'. This assumption is wrong as per the ARMARM - which recommends that all '#' characters be treated optional. The current patch fixes this regression and adds a test case. A follow-up patch will expand the test coverage to other instructions. llvm-svn: 223381
*	Fix thumbv4t indirect calls	Jonathan Roelofs	2014-12-04	2	-2/+46
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	So there are a couple of issues with indirect calls on thumbv4t. First, the most 'obvious' instruction, 'blx' isn't available until v5t. And secondly, the next-most-obvious sequence: 'mov lr, pc; bx rN' doesn't DTRT in thumb code because the saved off pc has its thumb bit cleared, so when the callee returns we end up in ARM mode.... yuck. The solution is to 'bl' to a nearby landing pad with a 'bx rN' in it. We could cut down on code size by sharing the landing pads between call sites that are close enough, but for the moment let's do correctness first and look at performance later. Patch by: Iain Sandoe http://reviews.llvm.org/D6519 llvm-svn: 223380
*	Add a test case for argument type coercion in an invoke of a vararg function	Philip Reames	2014-12-04	1	-0/+20
\| \| \| \| \| \|	This would have caught the bug I fixed in 223370. llvm-svn: 223378
*	Revert "r223364 - Revert r223347 which has caused crashes on bootstrap bots."	Hal Finkel	2014-12-04	1	-1/+152
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Reapply r223347, with a fix to not crash on uninserted instructions (or more precisely, instructions in uninserted blocks). bugpoint was able to reduce the test case somewhat, but it is still somewhat large (and relies on setting things up to be simplified during inlining), so I've not included it here. Nevertheless, it is clear what is going on and why. Original commit message: Restrict somewhat the memory-allocation pointer cmp opt from r223093 Based on review comments from Richard Smith, restrict this optimization from applying to globals that might resolve lazily to other dynamically-loaded modules, and also from dynamic allocas (which might be transformed into malloc calls). In short, take extra care that the compared-to pointer is really simultaneously live with the memory allocation. llvm-svn: 223371
*	Fix a minor regression introduced in r223113	Asiri Rathnayake	2014-12-04	1	-0/+42
\| \| \| \| \| \| \| \| \| \|	r223113 added support for ARM modified immediate assembly syntax. That patch has broken support for immediate expressions, as in: add r0, #(4 * 4) It wasn't caught because we don't have any tests for this feature. This patch fixes this regression and adds test cases. llvm-svn: 223366
*	Revert r223347 which has caused crashes on bootstrap bots.	Alexander Potapenko	2014-12-04	1	-152/+1
\| \| \| \|	llvm-svn: 223364
*	Revert "[Thumb/Thumb2] Added restrictions on PC, LR, SP in the register list ↵	Rafael Espindola	2014-12-04	3	-38/+7
\| \| \| \| \| \| \| \| \| \|	for PUSH/POP/LDM/STM. <Differential Revision: http://reviews.llvm.org/D6090>" This reverts commit r223356. It was failing check-all (MC/ARM/thumb.s in particular). llvm-svn: 223363