bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[ASan] Allow disabling init-order checks for globals by source file name.	Alexey Samsonov	2013-04-11	1	-1/+2
	llvm-svn: 179280
*	Add braces around \|\| in && to pacify GCC.	Benjamin Kramer	2013-04-11	1	-4/+4
	llvm-svn: 179275
*	Rename the C function to create a SLPVectorizerPass to something sane and ↵	Benjamin Kramer	2013-04-11	2	-4/+5
	expose it in the header file. llvm-svn: 179272
*	Optimize vector select from all 0s or all 1s	Michael Liao	2013-04-11	1	-0/+45
	As packed comparisons in AVX/SSE produce all 0s or all 1s in each SIMD lane, vector select could be simplified to AND/OR or removed if one or both values being selected is all 0s or all 1s. llvm-svn: 179267
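The simplification described above can be sketched on plain integers. This is a minimal illustration, not the X86 backend code: the `Lanes` type and the function names `bitSelect`, `selectWithZeroFalse`, and `selectWithOnesTrue` are hypothetical, and each lane of the mask is assumed to be either all 0s or all 1s, as a packed comparison would produce.

```cpp
#include <array>
#include <cstddef>
#include <cstdint>

// Four 32-bit lanes standing in for one SIMD register (an assumption for
// illustration; the real patch works on X86 DAG nodes).
using Lanes = std::array<std::uint32_t, 4>;

// General select: per lane, take `a` where the mask is all 1s, else `b`.
Lanes bitSelect(const Lanes &mask, const Lanes &a, const Lanes &b) {
  Lanes r{};
  for (std::size_t i = 0; i < r.size(); ++i)
    r[i] = (mask[i] & a[i]) | (~mask[i] & b[i]);
  return r;
}

// select(mask, a, all-0s) reduces to a single AND.
Lanes selectWithZeroFalse(const Lanes &mask, const Lanes &a) {
  Lanes r{};
  for (std::size_t i = 0; i < r.size(); ++i)
    r[i] = mask[i] & a[i];
  return r;
}

// select(mask, all-1s, b) reduces to a single OR.
Lanes selectWithOnesTrue(const Lanes &mask, const Lanes &b) {
  Lanes r{};
  for (std::size_t i = 0; i < r.size(); ++i)
    r[i] = mask[i] | b[i];
  return r;
}
```

The reduced forms agree with the general select only because every mask lane is all 0s or all 1s; with arbitrary masks the OR form would differ.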
*	Add CLAC/STAC instruction encoding/decoding support	Michael Liao	2013-04-11	5	-34/+56
	As these two instructions in AVX extension are privileged instructions for special purpose, they are only expected to be used in inline assembly. llvm-svn: 179266
*	Enhance bool simplification in X86 to handle more cases	Michael Liao	2013-04-11	1	-6/+35
	This patch is revised based on a patch from Victor Umansky <victor.umansky@intel.com>. More cases are handled in X86's bool simplification, i.e.
	- SETCC_CARRY
	- value is truncated to i1 with AND
	As a by-product, PR5443 is also fixed. llvm-svn: 179265
*	R600ControlFlowFinalizer.cpp: Fix a warning. [-Wunused-variable]	NAKAMURA Takumi	2013-04-11	1	-0/+1
	llvm-svn: 179263
*	Whitespace.	NAKAMURA Takumi	2013-04-11	1	-2/+1
	llvm-svn: 179262
*	Fix MachO's getRelocationAdditionalInfo.	Rafael Espindola	2013-04-11	1	-0/+6
	It was returning the loaded address of the section containing the relocation, which really doesn't seem to be the intent of this function. llvm-svn: 179255
*	Make PPCInstrInfo::isPredicated always return false	Hal Finkel	2013-04-11	1	-16/+8
	Because of how predication is implemented on PPC (only for branches), I think that this is the right thing to do. No functionality change intended. llvm-svn: 179252
*	MC: Support COFF image-relative MCSymbolRefs	Nico Rieck	2013-04-10	3	-9/+21
	Add support for the COFF relocation types IMAGE_REL_I386_DIR32NB and IMAGE_REL_AMD64_ADDR32NB for 32- and 64-bit respectively. These are similar to normal 4-byte relocations except that they do not include the base address of the image.
	Image-relative relocations are used for debug information (32-bit) and SEH unwind tables (64-bit).
	A new MCSymbolRef variant called 'VK_COFF_IMGREL32' is introduced to specify such relocations. For AT&T assembly, this variant can be accessed using the symbol suffix '@imgrel'.
	llvm-svn: 179240
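As a rough illustration of what "do not include the base address of the image" means, the value such a relocation resolves to can be modeled as the symbol's address minus the image base. The helper name `imageRelative32` and the sample addresses are hypothetical and are not part of the MC layer.

```cpp
#include <cstdint>

// Hypothetical helper: the 32-bit value an image-relative relocation would
// hold, i.e. the symbol's address with the image base subtracted.
std::uint32_t imageRelative32(std::uint64_t symbolAddress,
                              std::uint64_t imageBase) {
  return static_cast<std::uint32_t>(symbolAddress - imageBase);
}
```

For example, a symbol at 0x140001234 in an image based at 0x140000000 would yield 0x1234, whereas a normal 4-byte relocation would carry the full address.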
*	Manually remove successors in if conversion when CopyAndPredicateBlock is used	Hal Finkel	2013-04-10	1	-0/+8
	In the simple and triangle if-conversion cases, when CopyAndPredicateBlock is used because the to-be-predicated block has other predecessors, we need to explicitly remove the old copied block from the successors list. Normally if conversion relies on TII->AnalyzeBranch combined with BB->CorrectExtraCFGEdges to clean up the successors list, but if the predicated block contained an un-analyzable branch (such as a now-predicated return), then this will fail. These extra successors were causing a problem on PPC because they were causing later passes (such as PPCEarlyReturn) to leave dead return-only basic blocks in the code. llvm-svn: 179227
*	No need to have this return a bool.	Bill Wendling	2013-04-10	1	-4/+4
	llvm-svn: 179226
*	fixed xsave, xsaveopt, xrstor mnemonics with intel syntax; added test cases	Kay Tiong Khoo	2013-04-10	1	-3/+3
	llvm-svn: 179223
*	Track the compact unwind encoding for when we are unable to generate compact ↵	Bill Wendling	2013-04-10	2	-6/+11
	unwind information. Compact unwind has an encoding for when we're not able to generate compact unwind and must generate an EH frame instead. Track that, but still emit that CU encoding. llvm-svn: 179220
*	fixed to disassemble with tab after mnemonic rather than space	Kay Tiong Khoo	2013-04-10	1	-2/+2
	llvm-svn: 179215
*	In the X86 back end, getMemoryOperandNo() returns the offset	Preston Gurd	2013-04-10	2	-12/+22
	into the operand array of the start of the memory reference descriptor. Additional code in EncodeInstruction provides an additional adjustment. This patch places that additional code in a separate function, called getOperandBias, so that any caller of getMemoryOperandNo can also call getOperandBias. llvm-svn: 179211
*	Tidy up, fix and simplify a few of the SMLocs. Prior to r179109 the Start SMLoc	Chad Rosier	2013-04-10	1	-11/+13
	wasn't always the start of the operand. If there was a symbol reference, then Start pointed to that token. It's very likely there are other places that need to be updated. llvm-svn: 179210
*	Make the SLP store-merger less paranoid about function calls. We check for ↵	Nadav Rotem	2013-04-10	1	-4/+0
	function calls when we check if it is safe to sink instructions. llvm-svn: 179207
*	We require DataLayout for analyzing the size of stores.	Nadav Rotem	2013-04-10	2	-1/+6
	llvm-svn: 179206
*	Remove unused variable.	Chad Rosier	2013-04-10	1	-1/+0
	llvm-svn: 179205
*	PPC: Don't predicate a diamond with two counter decrements	Hal Finkel	2013-04-10	2	-3/+24
	I've not seen this happen in practice, and probably can't until we start allowing decrement-counter-based conditional branches to be double predicated, but just in case, don't allow predication of a diamond in which both sides have ctr-defining branches. Even though the branching behavior of these can be predicated, the counter-decrementing behavior cannot be. llvm-svn: 179199
*	Reapply r179115, but use parsePrimaryExpression a little more judiciously.	Chad Rosier	2013-04-10	2	-3/+8
	Test cases that regressed due to r179115, plus a few more, were added in r179182. Original commit message below:
	[ms-inline asm] Use parsePrimaryExpr in lieu of parseExpression if we need to parse an identifier. Otherwise, parseExpression may parse multiple tokens, which makes it impossible to properly compute an immediate displacement. An example of such a case is the source operand (i.e., [Symbol + ImmDisp]) in the below example:
	    __asm mov eax, [Symbol + ImmDisp]
	Part of rdar://13611297
	llvm-svn: 179187
*	R600/SI: Add pattern for AMDGPUurecip	Michel Danzer	2013-04-10	3	-3/+13
	21 more little piglits with radeonsi. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 179186
*	This is for an experimental option -mips-os16. The idea is to compile all	Reed Kotler	2013-04-10	6	-1/+185
	Mips32 code as Mips16 unless it can't be compiled as Mips 16. For now this would happen as long as floating point instructions are not needed. Probably it would also make sense to compile as mips32 if atomic operations are needed too. There may be other cases too.
	A module pass prescans the IR and adds the mips16 or nomips16 attribute to functions depending on the function's needs.
	Mips 16 mode can result in a 40% code compression by utilizing 16 bit encoding of many instructions.
	The hope is for this to replace the traditional gcc way of dealing with Mips16 code using floating point which involves essentially using soft float but with a library implemented using mips32 floating point. This gcc method also requires creating stubs so that Mips32 code can interact with these Mips 16 functions that have floating point needs. My conjecture is that in reality this traditional gcc method would never win over this new method.
	I will be implementing the traditional gcc method also. Some of it is already done but I needed to do the stubs to finish the work and those required this mips16/32 mixed mode capability.
	I have more ideas to make this new method much better and I think the old method will just live in llvm for anyone that needs the backward compatibility but I don't know for what reason that would be needed.
	llvm-svn: 179185
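The per-function decision that the prescan makes can be sketched as follows. This is only a model of the rule stated in the message (use Mips16 unless floating point is needed); the `FunctionSummary` struct and `chooseMipsModeAttribute` are hypothetical names and do not reflect the actual pass, which inspects LLVM IR.

```cpp
#include <string>

// Hypothetical summary of one function, standing in for what a module pass
// would learn by scanning the function's IR.
struct FunctionSummary {
  std::string name;
  bool needsFloatingPoint;
};

// Pick the attribute under the rule described in the commit message:
// compile as Mips16 unless floating point instructions are needed.
std::string chooseMipsModeAttribute(const FunctionSummary &f) {
  return f.needsFloatingPoint ? "nomips16" : "mips16";
}
```

Other conditions mentioned as possible future cases, such as atomic operations, would presumably become additional fields in the summary.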
*	Use a scheme closer to that of GNU as when deciding the type of a	Peter Collingbourne	2013-04-10	1	-6/+34
	symbol with multiple .type declarations. Differential Revision: http://llvm-reviews.chandlerc.com/D607 llvm-svn: 179184
*	Template MachOObjectFile over endianness too.	Rafael Espindola	2013-04-10	1	-9/+9
	llvm-svn: 179179
*	R600: Add VTX_READ_* and RAT_WRITE_CACHELESS_* when computing cf addr	Vincent Lejeune	2013-04-10	1	-1/+10
	llvm-svn: 179174
*	ARM: Make "SMC" instructions conditional on new TrustZone architecture feature.	Tim Northover	2013-04-10	5	-7/+22
	These instructions aren't universally available, but depend on a specific extension to the normal ARM architecture (rather than, say, v6/v7/...) so a new feature is appropriate. This also enables the feature by default on A-class cores which usually have these extensions, to avoid breaking existing code and act as a sensible default. llvm-svn: 179171
*	Change CloneFunctionInto to always clone Argument attributes individually,	Joey Gouly	2013-04-10	1	-22/+19
	rather than checking if the source and destination have the same number of arguments and copying the attributes over directly. llvm-svn: 179169
*	R600/SI: dynamically figure out the reg class of MIMG	Christian Konig	2013-04-10	6	-2/+63
	Depending on the number of bits set in the writemask. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 179166
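The idea of sizing the result by the number of bits set in the writemask can be sketched like this. It assumes a 4-bit writemask with one 32-bit component per set bit; both assumptions and the function names are illustrative and are not taken from the patch, which selects among the backend's register classes.

```cpp
#include <cstdint>

// Count the set bits in the low four bits of the writemask (assumed to be
// one bit per returned component).
unsigned countWritemaskComponents(std::uint8_t writemask) {
  unsigned count = 0;
  for (unsigned m = writemask & 0xFu; m != 0; m &= m - 1)
    ++count;  // each iteration clears the lowest set bit
  return count;
}

// Hypothetical mapping from writemask to result width, assuming 32 bits
// per enabled component.
unsigned resultWidthInBits(std::uint8_t writemask) {
  return 32u * countWritemaskComponents(writemask);
}
```

Under these assumptions a full mask of 0xF needs a 128-bit result, while a mask such as 0x5 needs only 64 bits.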
*	R600/SI: adjust writemask to only the used components	Christian Konig	2013-04-10	4	-2/+91
	Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 179165
*	R600/SI: remove image sample writemask	Christian Konig	2013-04-10	2	-14/+13
	Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 179164
*	Cleanup PPCInstrInfo::DefinesPredicate	Hal Finkel	2013-04-10	1	-5/+10
	Implement suggestions made by Bill Schmidt in post-commit review. Thanks! llvm-svn: 179162
*	RegionInfo: Add helpers to replace entry/exit recursively	Tobias Grosser	2013-04-10	1	-0/+32
	Contributed by: Star Tan <tanmx_star@yeah.net> llvm-svn: 179157
*	PPC: Prep for if conversion of bctr[l]	Hal Finkel	2013-04-10	3	-2/+37
	This adds in-principle support for if-converting the bctr[l] instructions. These instructions are used for indirect branching. It seems, however, that the current if converter will never actually predicate these. To do so, it would need the ability to hoist a few setup insts. out of the conditionally-executed block. For example, code like this:
	    void foo(int a, int (*bar)()) { if (a != 0) bar(); }
	becomes:
	    ...
	    beq 0, .LBB0_2
	    std 2, 40(1)
	    mr 12, 4
	    ld 3, 0(4)
	    ld 11, 16(4)
	    ld 2, 8(4)
	    mtctr 3
	    bctrl
	    ld 2, 40(1)
	    .LBB0_2:
	    ...
	and it would be safe to do all of this unconditionally with a predicated beqctrl instruction.
	llvm-svn: 179156
*	Template the MachO types over endianness.	Rafael Espindola	2013-04-10	1	-6/+6
	For now they are still only used as little endian. llvm-svn: 179147
*	__sincosf_stret returns sinf / cosf in bits 0:31 and 32:63 of xmm0, not in	Evan Cheng	2013-04-10	1	-5/+19
	xmm0 / xmm1. rdar://13599493 llvm-svn: 179141
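The layout described above, with sinf in bits 0:31 and cosf in bits 32:63 of a single 64-bit quantity, can be modeled in portable code. This sketch only illustrates the bit layout; `packSinCos`, `lowFloat`, and `highFloat` are hypothetical helpers and do not call or reproduce `__sincosf_stret` itself.

```cpp
#include <cmath>
#include <cstdint>
#include <cstring>

// Pack sin(x) into bits 0:31 and cos(x) into bits 32:63 of one 64-bit value,
// mirroring the layout the commit message describes for the low half of xmm0.
std::uint64_t packSinCos(float x) {
  float s = std::sin(x);
  float c = std::cos(x);
  std::uint32_t sBits = 0, cBits = 0;
  std::memcpy(&sBits, &s, sizeof sBits);  // bit-copy avoids aliasing issues
  std::memcpy(&cBits, &c, sizeof cBits);
  return static_cast<std::uint64_t>(sBits) |
         (static_cast<std::uint64_t>(cBits) << 32);
}

// Recover the float stored in bits 0:31.
float lowFloat(std::uint64_t packed) {
  std::uint32_t bits = static_cast<std::uint32_t>(packed);
  float f = 0.0f;
  std::memcpy(&f, &bits, sizeof f);
  return f;
}

// Recover the float stored in bits 32:63.
float highFloat(std::uint64_t packed) {
  std::uint32_t bits = static_cast<std::uint32_t>(packed >> 32);
  float f = 0.0f;
  std::memcpy(&f, &bits, sizeof f);
  return f;
}
```

A caller that instead read the cosine from a second register would get an unrelated value, which is the kind of mismatch this commit addresses.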
*	Generalize the PassConfig API and remove addFinalizeRegAlloc().	Andrew Trick	2013-04-10	1	-36/+50
	The target hooks are getting out of hand. What does it mean to run before or after regalloc anyway? Allowing either Pass* or AnalysisID pass identification should make it much easier for targets to use the substitutePass and insertPass APIs, and create less need for badly named target hooks. llvm-svn: 179140
*	Mips specific inline asm operand modifier 'D'	Jack Carter	2013-04-09	1	-3/+9
	Modifier 'D' is to use the second word of a double integer. We had previously implemented the pure register variant of the modifier and this patch implements the memory reference.
	    #include "stdio.h"
	    int b[8] = {0,1,2,3,4,5,6,7};
	    void main() {
	      int i;
	      // The first word. Notice, no 'D'
	      {asm ( "lw %0,%1;" : "=r" (i) : "m" (*(b+4)) );}
	      printf("%d\n",i);
	      // The second word
	      {asm ( "lw %0,%D1;" : "=r" (i) : "m" (*(b+4)) );}
	      printf("%d\n",i);
	    }
	llvm-svn: 179135
*	Allow PPC B and BLR to be if-converted into some predicated forms	Hal Finkel	2013-04-09	5	-0/+208
	This enables us to form predicated branches (which are the same conditional branches we had before) and also a larger set of predicated returns (including instructions like bdnzlr which is a conditional return and loop-counter decrement all in one). At the moment, if conversion does not capture all possible opportunities. A simple example is provided in early-ret2.ll, where if conversion forms one predicated return, and then the PPCEarlyReturn pass picks up the other one. So, at least for now, we'll keep both mechanisms. llvm-svn: 179134
*	Fix some comment typos.	Bob Wilson	2013-04-09	1	-2/+2
	llvm-svn: 179132
*	Cleanup. No functional change intended.	Chad Rosier	2013-04-09	1	-5/+5
	llvm-svn: 179129
*	Cleanup. No functional change intended.	Chad Rosier	2013-04-09	1	-16/+16
	llvm-svn: 179125
*	Remove unused method and default values.	Rafael Espindola	2013-04-09	1	-2/+1
	llvm-svn: 179124
*	Revert r179115 as it looks to have killed the ASan tests.	Chad Rosier	2013-04-09	2	-10/+5
	llvm-svn: 179120
*	Rationalize the formatting of these case labels. Having two sorted	Chandler Carruth	2013-04-09	1	-15/+29
	columns is essentially impossible to edit. llvm-svn: 179119
*	This patch enables llvm to switch between compiling for mips32/mips64	Reed Kotler	2013-04-09	15	-11/+333
	and mips16 on a per function basis. Because this patch is somewhat involved I have provided an overview of the key pieces of it.
	The patch is written so as to not change the behavior of the non mixed mode. We have tested this a lot but it is something new to switch subtargets so we don't want any chance of regression in the mainline compiler until we have more confidence in this.
	Mips32/64 are very different from Mips16 as is the case of ARM vs Thumb1. For that reason there are derived versions of the register info, frame info, instruction info and instruction selection classes.
	Now we register three separate passes for instruction selection. One which is used to switch subtargets (MipsModuleISelDAGToDAG.cpp) and then one for each of the current subtargets (Mips16ISelDAGToDAG.cpp and MipsSEISelDAGToDAG.cpp).
	When the ModuleISel pass runs, it determines if there is a need to switch subtargets and if so, the owning pointers in MipsTargetMachine are appropriately changed.
	When 16Isel or SEIsel is run, they will return immediately without doing any work if the current subtarget mode does not apply to them.
	In addition, MipsAsmPrinter needs to be reset on a function basis.
	The pass BasicTargetTransformInfo is substituted with a null pass since the pass is immutable and really needs to be a function pass for it to be used with changing subtargets. This will be fixed in a follow on patch.
	llvm-svn: 179118
*	Add support for bottom-up SLP vectorization infrastructure.	Nadav Rotem	2013-04-09	5	-0/+707
	This commit adds the infrastructure for performing bottom-up SLP vectorization (and other optimizations) on parallel computations. The infrastructure has three potential users:
	1. The loop vectorizer needs to be able to vectorize AOS data structures such as (sum += A[i] + A[i+1]).
	2. The BB-vectorizer needs this infrastructure for bottom-up SLP vectorization, because bottom-up vectorization is faster to compute.
	3. A loop-roller needs to be able to analyze consecutive chains and roll them into a loop, in order to reduce code size. A loop roller does not need to create vector instructions, and this infrastructure separates the chain analysis from the vectorization.
	This patch also includes a simple (100 LOC) bottom up SLP vectorizer that uses the infrastructure, and can vectorize this code:
	    void SAXPY(int *x, int *y, int a, int i) {
	      x[i] = a * x[i] + y[i];
	      x[i+1] = a * x[i+1] + y[i+1];
	      x[i+2] = a * x[i+2] + y[i+2];
	      x[i+3] = a * x[i+3] + y[i+3];
	    }
	llvm-svn: 179117
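To make the transformation concrete, the sketch below shows the scalar SAXPY body next to a hand-grouped version that loads four consecutive elements, combines them lane by lane, and stores them back. It treats `x` and `y` as int pointers and uses `std::array` as a stand-in for a vector register; `saxpyScalar` and `saxpyGrouped` are hypothetical names, and this is not the output of the SLP vectorizer.

```cpp
#include <array>
#include <cstddef>

// Scalar form: four independent statements on consecutive elements.
void saxpyScalar(int *x, const int *y, int a, int i) {
  x[i]     = a * x[i]     + y[i];
  x[i + 1] = a * x[i + 1] + y[i + 1];
  x[i + 2] = a * x[i + 2] + y[i + 2];
  x[i + 3] = a * x[i + 3] + y[i + 3];
}

// Grouped form: the same work expressed as one wide load per input, one
// lane-wise multiply-add, and one wide store.
void saxpyGrouped(int *x, const int *y, int a, int i) {
  std::array<int, 4> vx{}, vy{};
  for (std::size_t k = 0; k < 4; ++k) {  // "vector load" of x and y
    vx[k] = x[i + k];
    vy[k] = y[i + k];
  }
  for (std::size_t k = 0; k < 4; ++k)    // lane-wise a * x + y
    vx[k] = a * vx[k] + vy[k];
  for (std::size_t k = 0; k < 4; ++k)    // "vector store" back to x
    x[i + k] = vx[k];
}
```

Both functions produce identical results because the four statements touch consecutive, non-overlapping elements, which is the property a bottom-up SLP pass looks for when it groups scalar operations.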
*	[ms-inline asm] Use parsePrimaryExpr in lieu of parseExpression if we need to	Chad Rosier	2013-04-09	2	-5/+10
	parse an identifier. Otherwise, parseExpression may parse multiple tokens, which makes it impossible to properly compute an immediate displacement. An example of such a case is the source operand (i.e., [Symbol + ImmDisp]) in the below example:
	    __asm mov eax, [Symbol + ImmDisp]
	The existing test cases exercise this patch.
	rdar://13611297
	llvm-svn: 179115