bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message	Author	Age	Files	Lines
*	[PM] Add a collection of no-op analysis passes and switch the new pass	Chandler Carruth	2015-01-06	4	-7/+57
	manager tests to use them and be significantly more comprehensive. This, naturally, uncovered a bug where the CGSCC pass manager wasn't printing analyses when they were run. The only remaining core manipulator is, I think, an invalidate pass similar to the require pass. That'll be next. =] llvm-svn: 225240
*	[PM] Sink the no-op pass parsing logic into the .def-based registry to	Chandler Carruth	2015-01-06	2	-21/+3
	simplify things. This will become more important as I add no-op analyses that want to re-use the logic we already have for analyses in the registry. For now, no functionality changed. llvm-svn: 225238
*	[PM] Move the analysis registry into the Passes.cpp file and provide	Chandler Carruth	2015-01-06	3	-12/+46
	a normal interface for it in Passes.h. This gives us essentially a single interface for running pass managers which are provided from the bottom of the LLVM stack through interfaces at the top of the LLVM stack that populate them with all of the different analyses available throughout. It also means there is a single blob of code that needs to include all of the pass headers and needs to deal with the registry of passes and parsing names. No functionality change intended, should just be cleanup. llvm-svn: 225237
*	[PM] Add a utility to the new pass manager for generating a pass which	Chandler Carruth	2015-01-06	3	-4/+66
	is a no-op other than requiring some analysis results be available. This can be used in real pass pipelines to force the usually lazy analysis running to eagerly compute something at a specific point, and it can be used to test the pass manager infrastructure (my primary use at the moment). I've also added a bit of pipeline parsing magic to support generating these directly from the opt command so that you can directly use these when debugging your analysis. The syntax is:
	    require<analysis-name>
	This can be used at any level of the pass manager. For example:
	    cgscc(function(require<my-analysis>,no-op-function))
	This would produce a no-op function pass requiring my-analysis, followed by a fully no-op function pass, both of these in a function pass manager which is nested inside of a bottom-up CGSCC pass manager which is in the top-level (implicit) module pass manager. I have zero attachment to the particular syntax I'm using here. Consider it a straw man for use while I'm testing and fleshing things out. Suggestions for better syntax welcome, and I'll update everything based on any consensus that develops. I've used this new functionality to more directly test the analysis printing rather than relying on the cgscc pass manager running an analysis for me. This is still minimally tested because I need to have analyses to run first! ;] That patch is next, but wanted to keep this one separate for easier review and discussion. llvm-svn: 225236
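The nesting described in the message above can be illustrated with a small stand-alone sketch. This is not LLVM's pipeline parser; `parse_pipeline` is a hypothetical helper that only assumes pass names never contain `(`, `)` or `,`, and it returns each pass as a `(name, children)` tuple.

```python
def parse_pipeline(text):
    """Parse a nested pipeline string into a list of (name, children) tuples.

    Hypothetical illustration only; it does not mirror LLVM's real parser.
    """
    pos = 0

    def parse_list():
        nonlocal pos
        items = []
        while True:
            start = pos
            # A name runs until the next structural character.
            while pos < len(text) and text[pos] not in "(),":
                pos += 1
            name = text[start:pos]
            if not name:
                raise ValueError(f"expected a pass name at offset {start}")
            children = []
            if pos < len(text) and text[pos] == "(":
                pos += 1  # consume "(" and parse the nested pass manager
                children = parse_list()
                if pos >= len(text) or text[pos] != ")":
                    raise ValueError(f"expected ')' at offset {pos}")
                pos += 1
            items.append((name, children))
            if pos < len(text) and text[pos] == ",":
                pos += 1  # another pass follows at the same level
                continue
            return items

    result = parse_list()
    if pos != len(text):
        raise ValueError(f"unexpected character at offset {pos}")
    return result


tree = parse_pipeline("cgscc(function(require<my-analysis>,no-op-function))")
```

Here `tree` holds one `cgscc` entry whose only child is `function`, which in turn holds the `require<my-analysis>` pass followed by `no-op-function`, matching the nesting the message describes.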
*	Add a testcase that would have found the problem in r225048.	Rafael Espindola	2015-01-06	1	-0/+27
	llvm-svn: 225235
*	Remove dead variable.	Eric Christopher	2015-01-06	2	-2/+1
	llvm-svn: 225233
*	Use the same call off of the TargetMachine rather than the subtarget.	Eric Christopher	2015-01-06	1	-1/+1
	llvm-svn: 225232
*	Rewrite the Mips16HardFloat pass to avoid using the Subtarget.	Eric Christopher	2015-01-06	4	-26/+18
	llvm-svn: 225231
*	Revert r225048: It broke ObjC on AArch64.	Lang Hames	2015-01-06	19	-303/+247
	I've filed http://llvm.org/PR22100 to track this issue. llvm-svn: 225228
*	Remove X86 .quad workaround for buggy GNU assembler on OpenBSD / Bitrig.	Brad Smith	2015-01-06	1	-5/+0
	llvm-svn: 225227
*	IR: Don't drop MDNode uniquing on null operands	Duncan P. N. Exon Smith	2015-01-05	2	-7/+22
	Now that `LLVMContextImpl` can call `MDNode::dropAllReferences()` to prevent teardown madness, stop dropping uniquing just because an operand drops to null. Part of PR21532. llvm-svn: 225223
*	Revert "Use the integrated assembler by default on 32-bit PowerPC and SPARC"	Duncan P. N. Exon Smith	2015-01-05	2	-2/+4
	This reverts commit r225213. It's failing on multiple buildbots [1][2]. [1]: http://lab.llvm.org:8011/builders/clang-x86_64-debian-fast/builds/22032 [2]: http://lab.llvm.org:8080/green/view/Clang/job/clang-stage1-cmake-RA-incremental_check/2357/ llvm-svn: 225222
*	[PowerPC] Fix test to pass on Darwin hosts	Hal Finkel	2015-01-05	1	-1/+3
	llvm-svn: 225220
*	[PowerPC] Remove old README.txt entry	Hal Finkel	2015-01-05	1	-10/+0
	We no longer generate horrible code for the stated function:
	    void f(signed char *a, _Bool b, _Bool c) {
	      signed char t = 0;
	      if (b) t = *a;
	      if (c) *a = t;
	    }
	for which we now generate:
	    .L.f:
	      andi. 5, 5, 1
	      cmpldi 1, 4, 0
	      li 5, 0
	      beq 1, .LBB0_2
	      lbz 5, 0(3)
	    .LBB0_2: # %if.end
	      bclr 4, 1, 0
	      stb 5, 0(3)
	      blr
	so we don't need the README.txt entry. llvm-svn: 225217
*	[X86][SSE] lowerVectorShuffleAsByteShift tidyup	Simon Pilgrim	2015-01-05	1	-21/+14
	Removed local isSequential predicate and use standard helper isSequentialOrUndefInRange instead. llvm-svn: 225216
*	[PowerPC] Convert a README.txt entry into a better test	Hal Finkel	2015-01-05	2	-14/+7
	We now produce the desired code as noted in the README.txt file (no spurious or). Remove the README entry and improve the regression test. llvm-svn: 225214
*	Use the integrated assembler by default on 32-bit PowerPC and SPARC	Brad Smith	2015-01-05	2	-4/+2
	llvm-svn: 225213
*	[PowerPC] Remove README.txt entry	Hal Finkel	2015-01-05	1	-34/+0
	This entry has been rendered irrelevant now that we have proper CR bit tracking. llvm-svn: 225211
*	[Hexagon] Adding add/sub with carry, logical shift left by immediate and ↵	Colin LeMahieu	2015-01-05	5	-226/+180
	memop instructions. Removing old defs without bits and updating references. llvm-svn: 225210
*	[PowerPC] Add a test for truncating a shifted load	Hal Finkel	2015-01-05	2	-18/+18
	We now produce the desired code as noted in the README.txt file. Remove the README entry and add a regression test. llvm-svn: 225209
*	Make DIE.h a public CodeGen header.	Frederic Riss	2015-01-05	10	-9/+9
	dsymutil would like to use all the AsmPrinter/MCStreamer infrastructure to stream out the DWARF. In order to do so, it will reuse the DIE object and so this header needs to be public. The interface exposed here has some corners that cannot be used without a DwarfDebug object, but clients that want to stream Dwarf can just avoid these. Differential Revision: http://reviews.llvm.org/D6695 llvm-svn: 225208
*	[dsymutil] Implement the BinaryHolder object and gain archive support.	Frederic Riss	2015-01-05	8	-34/+265
	This object is meant to own the ObjectFiles and their underlying MemoryBuffer. It is basically the equivalent of an OwningBinary except that it efficiently handles Archives. It is optimized for efficiently providing mappings of members of the same archive when they are opened successively (which is standard in Darwin debug maps, objects from the same archive will be contiguous). Of course, the BinaryHolder will also be used by the DWARF linker once it is committed, but for now only the debug map parser uses it. With this change, you can run llvm-dsymutil on your Darwin debug build of clang and get a complete debug map for it. Differential Revision: http://reviews.llvm.org/D6690 llvm-svn: 225207
*	[autoconf] llvm/cmake/modules/Makefile: Make sure to regenerate ↵	NAKAMURA Takumi	2015-01-05	1	-1/+1
	LLVMConfig.cmake whenever Makefile is updated. llvm-svn: 225206
*	[PowerPC] Add another test for load/store with update	Hal Finkel	2015-01-05	2	-34/+19
	We now produce the desired code as noted in the README.txt file. Remove the README entry and add a regression test. llvm-svn: 225205
*	[autoconf] Export LLVM_LIBDIR_SUFFIX with empty string in LLVMConfig.cmake. ↵	NAKAMURA Takumi	2015-01-05	1	-0/+1
	tools/llvm-config is also doing so. llvm-svn: 225204
*	[PowerPC] Fold i1 extensions with other ops	Hal Finkel	2015-01-05	3	-17/+125
	Consider this function from our README.txt file:
	    int foo(int a, int b) { return (a < b) << 4; }
	We now explicitly track CR bits by default, so the comment in the README.txt about not really having a SETCC is no longer accurate, but we did generate this somewhat silly code:
	    cmpw 0, 3, 4
	    li 3, 0
	    li 12, 1
	    isel 3, 12, 3, 0
	    sldi 3, 3, 4
	    blr
	which generates the zext as a select between 0 and 1, and then shifts the result by a constant amount. Here we preprocess the DAG in order to fold the results of operations on an extension of an i1 value into the SELECT_I[48] pseudo instruction when the resulting constant can be materialized using one instruction (just like the 0 and 1). This was not implemented as a DAGCombine because the resulting code would have been anti-canonical and depends on replacing chained user nodes, which does not fit well into the lowering paradigm. Now we generate:
	    cmpw 0, 3, 4
	    li 3, 0
	    li 12, 16
	    isel 3, 12, 3, 0
	    blr
	which is less silly. llvm-svn: 225203
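The fold described in this message rests on a simple identity: shifting a zero-extended i1 by a constant equals selecting between 0 and the pre-shifted constant. The sketch below checks that identity in plain Python. The function names are hypothetical, and `fits_single_li` assumes that "one instruction" means a single `li` with a signed 16-bit immediate, which is a simplification of whatever the real check in the backend does.

```python
def unfolded(a: int, b: int, shift: int) -> int:
    """zext(a < b) materialized as a 0/1 select, then shifted (the old code shape)."""
    bit = 1 if a < b else 0
    return bit << shift


def folded(a: int, b: int, shift: int) -> int:
    """Select directly between 0 and the pre-shifted constant (the new code shape)."""
    true_const = 1 << shift  # computed once, at compile time in the real backend
    return true_const if a < b else 0


def fits_single_li(value: int) -> bool:
    """Assumption: the constant is cheap if it fits a signed 16-bit immediate."""
    return -0x8000 <= value <= 0x7FFF


# The two shapes agree for every input pair tried here.
pairs = [(-3, 2), (2, -3), (5, 5), (0, 1), (7, 100)]
results_match = all(unfolded(a, b, 4) == folded(a, b, 4) for a, b in pairs)
```

For the README example the shift is 4, so the selected constant is 16, which is the `li 12, 16` in the new output.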
*	[X86][SSE] Fixed description for isSequentialOrUndefInRange. NFC.	Simon Pilgrim	2015-01-05	1	-1/+1
	llvm-svn: 225202
*	[Hexagon] Adding rounding reg/reg variants, accumulating multiplies, and ↵	Colin LeMahieu	2015-01-05	4	-57/+202
	accumulating shifts. llvm-svn: 225201
*	IR: Prune arguments to ValueAsMetadata::ValueAsMetadata()	Duncan P. N. Exon Smith	2015-01-05	2	-7/+7
	`LLVMContext` isn't actually used. llvm-svn: 225200
*	[Hexagon] Adding V4 bit manipulating instructions, removing ALU defs without ↵	Colin LeMahieu	2015-01-05	3	-251/+126
	encoding bits. llvm-svn: 225199
*	[Hexagon] Adding V4 logic-logic instructions and tests.	Colin LeMahieu	2015-01-05	2	-0/+81
	llvm-svn: 225198
*	[Hexagon] Adding orand, bitsplit reg/reg, and modwrap instructions.	Colin LeMahieu	2015-01-05	3	-0/+63
	llvm-svn: 225197
*	[PowerPC] Remove zexts after i32 ctlz	Hal Finkel	2015-01-05	3	-5/+31
	The 64-bit semantics of cntlzw are not special: the 32-bit leading-zero count is stored as a 64-bit value in the range [0,32]. As a result, it is always zero extended, and it can be added to the PPCISelDAGToDAG peephole optimization as a frontier instruction for the removal of unnecessary zero extensions. llvm-svn: 225192
*	[PowerPC] Remove zexts after byte-swapping loads	Hal Finkel	2015-01-05	3	-0/+46
	lhbrx and lwbrx not only load their data with byte swapping, but also clear the upper 32 bits (at least). As a result, they can be added to the PPCISelDAGToDAG peephole optimization as frontier instructions for the removal of unnecessary zero extensions. llvm-svn: 225189
*	[Hexagon] Adding round reg/imm and bitsplit instructions.	Colin LeMahieu	2015-01-05	4	-0/+29
	llvm-svn: 225188
*	SymbolRewriter: use iplist::splice	Saleem Abdulrasool	2015-01-05	1	-1/+1
	The swap implementation for iplist is currently unsupported. Simply splice the old list into place, which achieves the same purpose. This is needed in order to thread the -frewrite-map-file frontend option correctly. NFC. llvm-svn: 225186
*	SymbolRewriter: 80-column	Saleem Abdulrasool	2015-01-05	1	-2/+4
	Wrap a couple of lines. NFC. llvm-svn: 225185
*	[AArch64] Improve codegen of store lane instructions by avoiding GPR usage.	Ahmed Bougacha	2015-01-05	2	-6/+106
	We used to generate code similar to:
	    umov.b w8, v0[2]
	    strb w8, [x0, x1]
	because the STRro patterns were preferred to ST1. Instead, we can avoid going through GPRs, and generate:
	    add x8, x0, x1
	    st1.b { v0 }[2], [x8]
	This patch increases the ST1 AddedComplexity to achieve that. rdar://16372710 Differential Revision: http://reviews.llvm.org/D6202 llvm-svn: 225183
*	[AArch64] Improve codegen of store lane 0 instructions by directly storing ↵	Ahmed Bougacha	2015-01-05	2	-0/+119
	the subregister. For 0-lane stores, we used to generate code similar to:
	    fmov w8, s0
	    str w8, [x0, x1, lsl #2]
	instead of:
	    str s0, [x0, x1, lsl #2]
	To correct that: for store lane 0 patterns, directly match to STR <subreg>0. Byte-sized instructions don't have the special case for a 0 index, because FPR8s are defined to have untyped content. rdar://16372710 Differential Revision: http://reviews.llvm.org/D6772 llvm-svn: 225181
*	llvm/test/lit.cfg: have_ld_plugin_support(): Use decode() for stdout.	NAKAMURA Takumi	2015-01-05	1	-1/+1
	llvm-svn: 225171
*	Select lower fsub,fabs pattern to fabd on AArch64	Karthik Bhat	2015-01-05	2	-0/+81
	This patch lowers patterns such as:
	    fsub v0.4s, v0.4s, v1.4s
	    fabs v0.4s, v0.4s
	to:
	    fabd v0.4s, v0.4s, v1.4s
	on AArch64. Review: http://reviews.llvm.org/D6791 llvm-svn: 225169
*	Parse Tag_compatibility correctly.	Charlie Turner	2015-01-05	4	-6/+12
	Tag_compatibility takes two arguments, but before this patch it would erroneously accept just one; it now produces an error in that case. Change-Id: I530f918587620d0d5dfebf639944d6083871ef7d llvm-svn: 225167
*	Emit the build attribute Tag_conformance.	Charlie Turner	2015-01-05	6	-6/+40
	Claim conformance to version 2.09 of the ARM ABI. This build attribute must be emitted first amongst the build attributes when written to an object file. This is to simplify conformance detection by consumers. Change-Id: If9eddcfc416bc9ad6e5cc8cdcb05d0031af7657e llvm-svn: 225166
*	Select lower sub,abs pattern to sabd on AArch64	Karthik Bhat	2015-01-05	2	-0/+128
	This patch lowers patterns such as:
	    sub v0.4s, v0.4s, v1.4s
	    abs v0.4s, v0.4s
	to:
	    sabd v0.4s, v0.4s, v1.4s
	on AArch64. Review: http://reviews.llvm.org/D6781 llvm-svn: 225165
*	Fix broken test from r225159.	Michael Kuperstein	2015-01-05	1	-1/+1
	llvm-svn: 225164
*	[PM] Don't run the machinery of invalidating all the analysis passes	Chandler Carruth	2015-01-05	4	-4/+27
	when all are being preserved. We want to short-circuit this for a couple of reasons. One, I don't really want passes to grow a dependency on actually receiving their invalidate call when they've been preserved. I'm thinking about removing this entirely. But more importantly, preserving everything is likely to be the common case in a lot of scenarios, and it would be really good to bypass all of the invalidation and preservation machinery there. Avoiding calling N opaque functions to try to invalidate things that are by definition still valid seems important. =] This wasn't really inspired by much other than seeing the spam in the logging for analyses, but it seems better to get it checked in rather than forgetting about it. llvm-svn: 225163
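The short-circuit described in this message can be sketched with a toy model. The classes below are hypothetical and only borrow the vocabulary of the message; they are not the LLVM classes and make no claim about how the real pass manager stores its results.

```python
class ToyPreservedAnalyses:
    """Hypothetical stand-in for a set of analyses a pass left valid."""

    def __init__(self, names=(), all_preserved=False):
        self._names = set(names)
        self._all = all_preserved

    @classmethod
    def everything(cls):
        return cls(all_preserved=True)

    def are_all_preserved(self):
        return self._all

    def is_preserved(self, name):
        return self._all or name in self._names


class ToyAnalysisManager:
    """Hypothetical cache of analysis results keyed by analysis name."""

    def __init__(self):
        self.results = {}
        self.invalidate_calls = 0  # counts per-analysis invalidation checks

    def invalidate(self, preserved):
        # Short-circuit: nothing can be stale, so skip the whole walk.
        if preserved.are_all_preserved():
            return
        for name in list(self.results):
            self.invalidate_calls += 1
            if not preserved.is_preserved(name):
                del self.results[name]


manager = ToyAnalysisManager()
manager.results = {"dom-tree": object(), "loop-info": object()}
manager.invalidate(ToyPreservedAnalyses.everything())
calls_after_preserve_all = manager.invalidate_calls
manager.invalidate(ToyPreservedAnalyses({"dom-tree"}))
```

When everything is preserved, the manager performs no per-analysis work at all; only the second call walks the cached results and drops the one that was not preserved.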
*	[PM] Add names and debug logging for analysis passes to the new pass	Chandler Carruth	2015-01-05	8	-4/+93
	manager. This starts to allow us to test analyses more easily, but it's really only the beginning. Some of the code here is still untestable without manual changes to create analysis passes, but I wanted to factor it into as small chunks as possible. Next up in order to be able to test things are, in no particular order:
	- No-op analysis passes so we don't have to use real ones to exercise the pass manager itself.
	- Automatic way of generating dummy passes that require an analysis be run, including a variant that calls a 'print' method on a pass to make it even easier to print out the results of an analysis.
	- Dummy passes that invalidate all analyses for their IR unit so we can test invalidation and re-runs.
	- Automatic way to print each analysis pass as it is re-run.
	- Automatic but optional verification of analysis passes everywhere possible.
	I'm not claiming I'll get to all of these immediately, but that's what is in the pipeline at some stage. I'm fleshing out exactly what I need and what to prioritize by working on converting analyses and then trying to test the conversion. =] llvm-svn: 225162
*	Replace several 'assert(false' with 'llvm_unreachable' or fold a condition ↵	Craig Topper	2015-01-05	16	-56/+38
	into the assert. llvm-svn: 225160
*	Fixed a bug in memory dependence checking module of loop vectorization. The ↵	Jiangning Liu	2015-01-05	2	-48/+83
	following loop should not be vectorized with current algorithm.
	    {code}
	    // loop body
	    ... = a[i] (1)
	    ... = a[i+1] (2)
	    .......
	    a[i+1] = .... (3)
	    a[i] = ... (4)
	    {code}
	The algorithm tries to collect memory access candidates from AliasSetTracker, and then checks memory dependences against one another. The memory accesses are unique in AliasSetTracker, and a single memory access in AliasSetTracker may map to multiple entries in AccessAnalysis, which could cover both 'read' and 'write'. Originally the algorithm only checked the 'write' entry in Accesses if only 'write' exists. This is incorrect, and the consequence is that it ignored all read accesses, so some RAW and WAR dependences were missed. For the case given above, if we ignore the two reads, the dependence between (1) and (3) cannot be captured, and this loop will be incorrectly vectorized. The fix simply inserts a new loop to find all entries in Accesses. Since it will skip most of the other memory accesses by checking the Value pointer at the very beginning of the loop, it should not increase compile time visibly. llvm-svn: 225159
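To see why the dependence between (1) and (3) matters, the sketch below fills in the elided loop body with an arbitrary choice (the sum for statement (3), the second load for statement (4)) and compares scalar execution with a naive width-2 vectorization that performs all loads of a chunk before any store. Both functions and the concrete body are assumptions made for illustration; they are not taken from the patch or its test.

```python
def run_scalar(values):
    """Execute the loop one iteration at a time, in program order."""
    a = list(values)
    for i in range(len(a) - 1):
        x = a[i]          # (1) read a[i]
        y = a[i + 1]      # (2) read a[i+1]
        a[i + 1] = x + y  # (3) write a[i+1]
        a[i] = y          # (4) write a[i]
    return a


def run_vectorized(values, width=2):
    """Naive vectorization: hoist all loads of a chunk above all of its stores."""
    a = list(values)
    n = len(a) - 1
    for start in range(0, n, width):
        lanes = range(start, min(start + width, n))
        xs = [a[i] for i in lanes]      # vector load for (1)
        ys = [a[i + 1] for i in lanes]  # vector load for (2)
        for i, x, y in zip(lanes, xs, ys):
            a[i + 1] = x + y            # vector store for (3)
        for i, y in zip(lanes, ys):
            a[i] = y                    # vector store for (4)
    return a


data = [1, 2, 3, 4, 5]
scalar_result = run_scalar(data)
vector_result = run_vectorized(data, width=2)
```

Iteration i+1 reads `a[i+1]` at (1) after iteration i has written it at (3). Hoisting the loads breaks that read-after-write chain, so `scalar_result` and `vector_result` differ for this input, while `width=1` reproduces the scalar behaviour.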
*	Convert SmallMapVector from a class to a struct.	Michael Gottesman	2015-01-05	1	-5/+3
	llvm-svn: 225158