bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	This patch adds support for the vector quadword add/sub instructions introduced	Kit Barton	2015-05-25	3	-10/+24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	in POWER8: vadduqm vaddeuqm vaddcuq vaddecuq vsubuqm vsubeuqm vsubcuq vsubecuq In addition to adding the instructions themselves, it also adds support for the v1i128 type for intrinsics (Intrinsics.td, Function.cpp, and IntrinsicEmitter.cpp). http://reviews.llvm.org/D9081 llvm-svn: 238144
*	Move bundle info from MCSectionData to MCSection.	Rafael Espindola	2015-05-25	4	-34/+34
\| \| \| \|	llvm-svn: 238143
*	Add a isBundleLocked helper to MCELFStreamer.	Rafael Espindola	2015-05-25	1	-14/+17
\| \| \| \|	llvm-svn: 238142
*	Move LayoutOrder to MCSection.	Rafael Espindola	2015-05-25	2	-2/+2
\| \| \| \|	llvm-svn: 238141
*	Stop forwarding getOrdinal and setOrdinal.	Rafael Espindola	2015-05-25	5	-15/+16
\| \| \| \|	llvm-svn: 238139
*	Move Ordinal from MCSectionData to MCSection. NFC.	Rafael Espindola	2015-05-25	1	-3/+6
\| \| \| \| \| \|	Part of the work to merge MCSectionData and MCSection. llvm-svn: 238137
*	Simplify boolean conditional return statements.	Rafael Espindola	2015-05-25	1	-4/+1
\| \| \| \| \| \|	Patch by Richard <legalize@xmission.com> llvm-svn: 238134
*	Refactor: Simplify boolean conditional return statements in ↵	Benjamin Kramer	2015-05-25	2	-8/+3
\| \| \| \| \| \| \| \| \| \| \|	llvm/lib/DebugInfo/DWARF Use clang-tidy to simplify boolean conditional return statements. Patch by Richard Thomson <legalize@xmission.com>! Differential Revision: http://reviews.llvm.org/D9972 llvm-svn: 238132
*	[X86] When pattern-matching scalar FMA3 intrinsics, don't re-arrange the ↵	Michael Kuperstein	2015-05-25	1	-2/+7
\| \| \| \| \| \| \| \| \| \| \| \| \|	first and second operands. The semantics of the scalar FMA intrinsics are that the high vector elements are copied from the first source. The existing pattern switches src1 and src2 around, to match the "213" order, which ends up tying the original src2 to the dest. Since the actual scalar fma3 instructions copy the high elements from the dest register, the wrong values are copied. This modifies the pattern to leave src1 and src2 in their original order. Differential Revision: http://reviews.llvm.org/D9908 llvm-svn: 238131
*	Added promotion to EXTRACT_SUBVECTOR operand.	Elena Demikhovsky	2015-05-25	3	-0/+16
\| \| \| \| \| \| \|	I encountered with this case in one of KNL tests for i1 vectors. v16i1 = EXTRACT_SUBVECTOR v32i1, x llvm-svn: 238130
*	Reformat.	NAKAMURA Takumi	2015-05-25	6	-92/+88
\| \| \| \|	llvm-svn: 238126
*	Prune CRLFs.	NAKAMURA Takumi	2015-05-25	9	-1628/+1628
\| \| \| \|	llvm-svn: 238125
*	[Unroll] Switch from an eagerly populated SCEV cache to one that is	Chandler Carruth	2015-05-25	1	-89/+116
\| \| \| \| \| \| \| \| \| \|	lazily built. Also, make it a much more generic SCEV cache, which today exposes only a reduced GEP model description but could be extended in the future to do other profitable caching of SCEV information. llvm-svn: 238124
*	AsmPrinter: Avoid creating symbols in DwarfStringPool	Duncan P. N. Exon Smith	2015-05-24	2	-4/+14
\| \| \| \| \| \| \| \| \| \| \| \|	Stop creating symbols we don't need in `DwarfStringPool`. The consumers only call `DwarfStringPoolEntryRef::getSymbol()` when DWARF is relocatable, so this just stops creating the unused symbols when it's not. This drops memory usage from 851 MB to 845 MB, around 0.7%. (I'm looking at `llc` memory usage on `verify-uselistorder.lto.opt.bc`; see r236629 for details.) llvm-svn: 238122
*	AsmPrinter: Prune an include, NFC	Duncan P. N. Exon Smith	2015-05-24	3	-1/+3
\| \| \| \|	llvm-svn: 238121
*	AsmPrinter: Remove dead code, NFC	Duncan P. N. Exon Smith	2015-05-24	1	-17/+0
\| \| \| \|	llvm-svn: 238120
*	AsmPrinter: Avoid EmitLabelDifference() in DwarfAccelTable	Duncan P. N. Exon Smith	2015-05-24	2	-1/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Mint a new function, `AsmPrinter::emitDwarfStringOffset()`, which takes a `DwarfStringPoolEntryRef`. When DWARF is relocatable across sections, this defers to `emitSectionOffset()` and emits the `MCSymbol`; otherwise, just emit the offset directly, without using any intermediate symbols. `EmitLabelDifference()` is already optimized to emit absolute label differences cheaply when possible, so there aren't any major memory savings here (853 MB down to 851 MB, or 0.2%). However, it prepares for making the `MCSymbol`s in the `DwarfStringPool` optional. (I'm looking at `llc` memory usage on `verify-uselistorder.lto.opt.bc`; see r236629 for details.) llvm-svn: 238119
*	AsmPrinter: Use DwarfStringPoolEntry in DwarfAccelTable, NFC	Duncan P. N. Exon Smith	2015-05-24	3	-17/+11
\| \| \| \| \| \| \|	This is just an API change, but it prepares to stop using `EmitLabelDifference()` when possible. llvm-svn: 238118
*	AsmPrinter: Make DIEString small	Duncan P. N. Exon Smith	2015-05-24	2	-29/+38
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Expose the `DwarfStringPool` entry in a header, and store a pointer to it directly in `DIEString`. Instead of choosing at creation time how to emit it, use the `dwarf::Form` to determine that at emission time. Besides avoiding the other `DIEValue`, this shaves two pointers off of `DIEString`; the data is now a single pointer. This is a nice cleanup on its own -- and drops memory usage from 861 MB down to 853 MB, around 0.9% -- but it's also preparation for passing `DIEValue`s by value. (I'm looking at `llc` memory usage on `verify-uselistorder.lto.opt.bc`; see r236629 for details.) llvm-svn: 238117
*	AsmPrinter: Extract DwarfStringPoolEntry from DwarfStringPool, NFC	Duncan P. N. Exon Smith	2015-05-24	2	-14/+14
\| \| \| \| \| \| \| \| \|	Extract out `DwarfStringPoolEntry` and `DwarfStringPoolRef` from `DwarfStringPool` so that downstream users can start using `DwarfStringPool::getEntry()` directly. This will allow users to delay the decision between emitting a symbol or an offset until later. llvm-svn: 238116
*	AsmPrinter: Emit the DwarfStringPool offset directly when possible	Duncan P. N. Exon Smith	2015-05-24	3	-23/+35
\| \| \| \| \| \| \| \| \| \| \| \| \|	Change `DwarfStringPool` to calculate byte offsets on-the-fly, and update `DwarfUnit::getLocalString()` to use a `DIEInteger` instead of a `DIEDelta` when Dwarf doesn't use relocations (i.e., Mach-O). This eliminates another call to `EmitLabelDifference()`, and drops memory usage from 865 MB down to 861 MB, around 0.5%. (I'm looking at `llc` memory usage on `verify-uselistorder.lto.opt.bc`; see r236629 for details.) llvm-svn: 238114
*	AsmPrinter: Refactor DwarfStringPool::getEntry(), NFC	Duncan P. N. Exon Smith	2015-05-24	2	-14/+11
\| \| \| \| \| \| \| \|	Move `DwarfStringPool`'s `getEntry()` to the header (and make it a member function) in preparation for calculating symbol offsets on-the-fly. llvm-svn: 238112
*	Move parseSubArch to ARMTargetParser. NFC	Renato Golin	2015-05-24	1	-30/+58
\| \| \| \| \| \| \| \| \| \| \| \|	Using getCanonicalArchName() is the right way to parse ARM arch names. Mapping ARMTargetParser IDs to Triple Arch IDs is temporary, until they are merged into a TargetDescription class. This was the last LLVM FIXME to move things to ARMTargetParser. Now on to Clang and beyond. llvm-svn: 238110
*	Add target hook to allow merging stores of nonzero constants	Matt Arsenault	2015-05-24	3	-3/+20
\| \| \| \| \| \| \| \| \| \|	On GPU targets, materializing constants is cheap and stores are expensive, so only doing this for zero vectors was silly. Most of the new testcases aren't optimally merged, and are for later improvements. llvm-svn: 238108
*	Bump SmallString to the minimum required amount for raw_ostream to avoid ↵	Benjamin Kramer	2015-05-23	1	-1/+1
\| \| \| \| \| \| \| \|	allocation. NFC. llvm-svn: 238104
*	[Mips] Prefer Twine::utohexstr over utohexstr, saves a string copy.	Benjamin Kramer	2015-05-23	1	-3/+2
\| \| \| \| \| \|	NFC. llvm-svn: 238103
*	[AArch64] Clean up the ELF streamer a bit.	Benjamin Kramer	2015-05-23	2	-15/+4
\| \| \| \|	llvm-svn: 238102
*	[AArch64] Move AArch64TargetStreamer out of MCStreamer.h	Benjamin Kramer	2015-05-23	4	-6/+47
\| \| \| \| \| \|	It doesn't belong in the shared MC layer. NFC. llvm-svn: 238101
*	Silencing a spurious -Wreturn-type warning; NFC.	Aaron Ballman	2015-05-23	1	-0/+1
\| \| \| \|	llvm-svn: 238099
*	[PowerPC] Fix fast-isel when compare is split from branch	Hal Finkel	2015-05-23	1	-19/+32
\| \| \| \| \| \| \| \| \| \| \|	When the compare feeding a branch was in a different BB from the branch, we'd try to "regenerate" the compare in the block with the branch, possibly trying to make use of values not available there. Copy a page from AArch64's play book here to fix the problem (at least in terms of correctness). Fixes PR23640. llvm-svn: 238097
*	Update ExecutionEngine/LLVMBuild.txt, to add LLVMCodeGen.	NAKAMURA Takumi	2015-05-23	1	-1/+1
\| \| \| \|	llvm-svn: 238096
*	Give more meaningful names than I and J to some for loop variables after ↵	Craig Topper	2015-05-23	1	-10/+10
\| \| \| \| \| \|	converting to range-based loops. llvm-svn: 238095
*	Fix an unused variable warning in release builds.	Craig Topper	2015-05-23	1	-0/+1
\| \| \| \|	llvm-svn: 238094
*	Use range-based for loops. NFC.	Craig Topper	2015-05-23	1	-76/+36
\| \| \| \|	llvm-svn: 238093
*	[lib/Fuzzer] doxygen-ify the comments for the user interface	Kostya Serebryany	2015-05-23	1	-13/+22
\| \| \| \|	llvm-svn: 238086
*	AsmPrinter: Remove the vtable-entry from DIEValue	Duncan P. N. Exon Smith	2015-05-23	1	-29/+86
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Remove all virtual functions from `DIEValue`, dropping the vtable pointer from its layout. Instead, create "impl" functions on the subclasses, and use the `DIEValue::Type` to implement the dynamic dispatch. This is necessary -- obviously not sufficient -- for passing `DIEValue`s around by value. However, this change stands on its own: we make tons of these. I measured a drop in memory usage from 888 MB down to 860 MB, or around 3.2%. (I'm looking at `llc` memory usage on `verify-uselistorder.lto.opt.bc`; see r236629 for details.) llvm-svn: 238084
*	CodeGen: Remove redundant DIETypeSignature::dump(), NFC	Duncan P. N. Exon Smith	2015-05-23	1	-2/+0
\| \| \| \| \| \|	We already have this in `DIEValue`; no reason to shadow it. llvm-svn: 238082
*	[lib/Fuzzer] fully get rid of std::cerr in libFuzzer	Kostya Serebryany	2015-05-23	3	-38/+23
\| \| \| \|	llvm-svn: 238081
*	Stop resetting NoFramePointerElim in TargetMachine::resetTargetOptions.	Akira Hatanaka	2015-05-23	9	-27/+55
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is part of the work to remove TargetMachine::resetTargetOptions. In this patch, instead of updating global variable NoFramePointerElim in resetTargetOptions, its use in DisableFramePointerElim is replaced with a call to TargetFrameLowering::noFramePointerElim. This function determines on a per-function basis if frame pointer elimination should be disabled. There is no change in functionality except that cl:opt option "disable-fp-elim" can now override function attribute "no-frame-pointer-elim". llvm-svn: 238080
*	Simplify and rename function overrideFunctionAttributes. NFC.	Akira Hatanaka	2015-05-23	2	-13/+20
\| \| \| \| \| \| \|	This is in preparation to making changes needed to stop resetting NoFramePointerElim in resetTargetOptions. llvm-svn: 238079
*	[lib/Fuzzer] start getting rid of std::cerr. Sadly, these parts of C++ ↵	Kostya Serebryany	2015-05-23	4	-56/+47
\| \| \| \| \| \|	library used in libFuzzer badly interract with the same code used in the target function and also with dfsan. It's easier to just not use std::cerr than to defeat these issues. llvm-svn: 238078
*	Revert "make reciprocal estimate code generation more flexible by adding ↵	Rafael Espindola	2015-05-23	7	-267/+54
\| \| \| \| \| \| \| \| \| \| \| \|	command-line options" This reverts commit r238051. It broke some bots: http://lab.llvm.org:8011/builders/llvm-ppc64-linux1/builds/18190 llvm-svn: 238075
*	Produce a single string table in a ELF .o	Rafael Espindola	2015-05-22	1	-43/+39
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Normally an ELF .o has two string tables, one for symbols, one for section names. With the scheme of naming sections like ".text.foo" where foo is a symbol, there is a big potential saving in using a single one. Building llvm+clang+lld with master and with this patch the results were: master: 193,267,008 bytes patch: 186,107,952 bytes master non unique section names: 183,260,192 bytes patch non unique section names: 183,118,632 bytes So using non usique saves 10,006,816 bytes, and the patch saves 7,159,056 while still using distinct names for the sections. llvm-svn: 238073
*	Extend EarlyCSE to handle basic cases from JumpThreading and CVP	Philip Reames	2015-05-22	3	-21/+46
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch extends EarlyCSE to take advantage of the information that a controlling branch gives us about the value of a Value within this and dominated basic blocks. If the current block has a single predecessor with a controlling branch, we can infer what the branch condition must have been to execute this block. The actual change to support this is downright simple because EarlyCSE's existing scoped hash table logic deals with most of the complexity around merging. The patch actually implements two optimizations. 1) The first is analogous to JumpThreading in that it enables EarlyCSE's CSE handling to fold branches which are exactly redundant due to a previous branch to branches on constants. (It doesn't actually replace the branch or change the CFG.) This is pretty clearly a win since it enables substantial CFG simplification before we start trying to inline. 2) The second is analogous to CVP in that it exploits the knowledge gained to replace dominated uses of the original value. EarlyCSE does not otherwise reason about specific uses, so this is the more arguable one. It does enable further simplication and constant folding within the rest of the visit by EarlyCSE. In both cases, the added code only handles the easy dominance based case of each optimization. The general case is deferred to the existing passes. Differential Revision: http://reviews.llvm.org/D9763 llvm-svn: 238071
*	[InstCombine] Don't eagerly propagate nsw for AB+AC => A*(B+C)	David Majnemer	2015-05-22	1	-3/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	InstCombine transforms A nsw B +nsw A nsw C to A *nsw (B + C). This is incorrect -- e.g. if A = -1, B = 1, C = INT_SMAX. Then nothing in the LHS overflows, but the multiplication in RHS overflows. We need to first make sure that we won't multiple by INT_SMAX + 1. Test case `add_of_mul` contributed by Sanjoy Das. This fixes PR23635. Differential Revision: http://reviews.llvm.org/D9629 llvm-svn: 238066
*	[lib/Fuzzer] remove -use_coverage_pairs=1, an experimental feature that is ↵	Kostya Serebryany	2015-05-22	5	-30/+1
\| \| \| \| \| \|	unlikely to ever scale llvm-svn: 238063
*	[lib/Fuzzer] extend the fuzzer interface to allow user-supplied mutators	Kostya Serebryany	2015-05-22	12	-67/+258
\| \| \| \|	llvm-svn: 238059
*	[AArch64][CGP] Sink zext feeding stxr/stlxr into the same block.	Ahmed Bougacha	2015-05-22	1	-0/+10
\| \| \| \| \| \| \| \| \| \| \|	The usual CodeGenPrepare trickery, on a target-specific intrinsic. Without this, the expansion of atomics will usually have the zext be hoisted out of the loop, defeating the various patterns we have to catch this precise case. Differential Revision: http://reviews.llvm.org/D9930 llvm-svn: 238054
*	make reciprocal estimate code generation more flexible by adding ↵	Sanjay Patel	2015-05-22	7	-54/+267
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	command-line options This patch adds a class for processing many recip codegen possibilities. The TargetRecip class is intended to handle both command-line options to llc as well as options passed in from a front-end such as clang with the -mrecip option. The x86 backend is updated to use the new functionality. Only -mcpu=btver2 with -ffast-math should see a functional change from this patch. All other CPUs continue to not use reciprocal estimates by default with -ffast-math. Differential Revision: http://reviews.llvm.org/D8982 llvm-svn: 238051
*	Reinforce ARMTargetParser::getCanonicalArchName validation	Renato Golin	2015-05-22	1	-14/+20
\| \| \| \| \| \| \| \| \| \| \| \| \|	Before, getCanonicalArchName was relying on parseArch() to validate the arch name, which was a problem when other methods, that also needed to call it, were duplicating the steps. But to dissociate getCanonicalArchName from parseArch, we needed to make getCanonicalArchName more robust in detecting valid arch names. It's still not perfect, but will do for the time being, until we merge Triple with TargetParser into a TargetDescription mega class. llvm-svn: 238047