bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	Fix alloca_instruments_all_paddings.cc test to work under higher -O levels ↵	Kuba Brecka	2015-02-24	1	-7/+5
\| \| \| \| \| \| \| \| \| \|	(llvm part) When AddressSanitizer only a single dynamic alloca and no static allocas, due to an early exit from FunctionStackPoisoner::poisonStack we forget to unpoison the dynamic alloca. This patch fixes that. Reviewed at http://reviews.llvm.org/D7810 llvm-svn: 230316
*	[X86] Remove the AbsMem32 type from the assembly parser. Only really need ↵	Craig Topper	2015-02-24	2	-12/+2
\| \| \| \| \| \|	the 16-bit version which will automatically get prioritized over AbsMem. llvm-svn: 230313
*	Beginning of alloca implementation for Mips fast-isel	Reed Kotler	2015-02-24	1	-20/+139
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Begin to add various address modes; including alloca. Test Plan: Make sure there are no regressions in test-suite at O0/02 in mips32r1/r2 Reviewers: dsanders Reviewed By: dsanders Subscribers: echristo, rfuhler, llvm-commits Differential Revision: http://reviews.llvm.org/D6426 llvm-svn: 230300
*	Fix handling of negative offsets for AddrModeT2_i8s4 in rewriteT2FrameIndex.	Bob Wilson	2015-02-24	1	-5/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is a follow up to r230233 to fix something that I noticed by inspection. The AddrModeT2_i8s4 addressing mode does not support negative offsets. I spent a good chunk of the day trying to come up with a testcase for this but was not successful. This addressing mode is used to spill and restore GPRPair registers in Thumb2 code and that does not happen often. We also make very limited used of negative offsets when lowering frame indexes. I am going ahead with the change anyway, because I am pretty confident that it is correct. I also added a missing assertion to check that the low bits of the scaled offset are zero. llvm-svn: 230297
*	Fix bug 22641	Sanjoy Das	2015-02-24	1	-17/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The bug was a result of getPreStartForExtend interpreting nsw/nuw flags on an add recurrence more strongly than is legal. {S,+,X}<nsw> implies S+X is nsw only if the backedge of the loop is taken at least once. NOTE: I had accidentally committed an unrelated change with the commit message of this change in r230275 (r230275 was reverted in r230279). This is the correct change for this commit message. Differential Revision: http://reviews.llvm.org/D7808 llvm-svn: 230291
*	[LTO API] add lto_codegen_set_module to set the destination module.	Manman Ren	2015-02-24	2	-0/+20
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When debugging LTO issues with ld64, we use -save-temps to save the merged optimized bitcode file, then invoke ld64 again on the single bitcode file to speed up debugging code generation passes and ld64 stuff after code generation. llvm linking a single bitcode file via lto_codegen_add_module will generate a different bitcode file from the single input. With the newly-added lto_codegen_set_module, we can make sure the destination module is the same as the input. lto_codegen_set_module will transfer the ownship of the module to code generator. rdar://19024554 llvm-svn: 230290
*	[LoopAccesses] LAA::getInfo to use const reference for stride parameter	Adam Nemet	2015-02-24	1	-25/+24
\| \| \| \| \| \|	And other required const-correctness fixes to make this work. llvm-svn: 230289
*	X86: Only use 'lea' in Win64 epilogues if a frame pointer exists	David Majnemer	2015-02-24	1	-7/+21
\| \| \| \| \| \| \|	We can only use 'add' in epilogues, 'lea' is not permitted unless we've established a frame pointer in the prologue. llvm-svn: 230286
*	New instcombine rule: max(~a,~b) -> ~min(a, b)	Sanjoy Das	2015-02-24	3	-23/+66
\| \| \| \| \| \| \| \| \| \|	This case is interesting because ScalarEvolutionExpander lowers min(a, b) as ~max(~a,~b). I think the profitability heuristics can be made more clever/aggressive, but this is a start. Differential Revision: http://reviews.llvm.org/D7821 llvm-svn: 230285
*	Bugfix: SCEVExpander incorrectly marks increment operations as no-wrap	Sanjoy Das	2015-02-23	1	-2/+31
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When emitting the increment operation, SCEVExpander marks the operation as nuw or nsw based on the flags on the preincrement SCEV. This is incorrect because, for instance, it is possible that {-6,+,1} is <nuw> while {-6,+,1}+1 = {-5,+,1} is not. This change teaches SCEV to mark the increment as nuw/nsw only if it can explicitly prove that the increment operation won't overflow. Apart from the attached test case, another (more realistic) manifestation of the bug can be seen in Transforms/IndVarSimplify/pr20680.ll. NOTE: this change was landed with an incorrect commit message in rL230275 and was reverted for that reason in rL230279. This commit message is the correct one. Differential Revision: http://reviews.llvm.org/D7778 llvm-svn: 230280
*	Revert 230275.	Sanjoy Das	2015-02-23	1	-31/+2
\| \| \| \| \| \| \| \|	230275 got committed with an incorrect commit message due to a mixup on my side. Will re-land in a few moments with the correct commit message. llvm-svn: 230279
*	Fix based on post-commit comment on D7816 & rL230177 - BUILD_VECTOR operand ↵	Simon Pilgrim	2015-02-23	1	-4/+5
\| \| \| \| \| \|	truncation was using the the BV's output scalar type instead of the input type. llvm-svn: 230278
*	[X86] Teach how to custom lower double-to-half conversions under fast-math.	Andrea Di Biagio	2015-02-23	1	-0/+15
\| \| \| \| \| \| \| \| \| \| \| \| \|	This patch teaches the backend how to expand a double-half conversion into a double-float conversion immediately followed by a float-half conversion. We do this only under fast-math, and if float-half conversions are legal for the target. Added test CodeGen/X86/fastmath-float-half-conversion.ll Differential Revision: http://reviews.llvm.org/D7832 llvm-svn: 230276
*	Fix bug 22641	Sanjoy Das	2015-02-23	1	-2/+31
\| \| \| \| \| \| \| \| \| \| \|	The bug was a result of getPreStartForExtend interpreting nsw/nuw flags on an add recurrence more strongly than is legal. {S,+,X}<nsw> implies S+X is nsw only if the backedge of the loop is taken at least once. Differential Revision: http://reviews.llvm.org/D7808 llvm-svn: 230275
*	Fix invalid cast.	Rafael Espindola	2015-02-23	1	-1/+1
\| \| \| \| \| \| \| \|	Fixes PR22525. Patch by Ben Longbons with testcase by me. llvm-svn: 230271
*	X86: Use a smaller 'mov' instruction for stack probe calls	David Majnemer	2015-02-23	1	-3/+13
\| \| \| \| \| \| \| \| \| \| \| \| \|	Prologue emission, in some cases, requires calls to a stack probe helper function. The amount of stack to probe is passed as a register argument in the Win64 ABI but the instruction sequence used is pessimistic: it assumes that the number of bytes to probe is greater than 4 GB. Instead, select a more appropriate opcode depending on the number of bytes we are going to probe. llvm-svn: 230270
*	X86: Use 'mov' instead of 'lea' in Win64 SEH prologues when possible	David Majnemer	2015-02-23	1	-2/+5
\| \| \| \| \| \| \|	'mov' and 'lea' are equivalent when the displacement applied with 'lea' is zero. However, 'mov' should encode smaller. llvm-svn: 230269
*	X86: Explain why we cannot use a 'mov' in a Win64 epilogue	David Majnemer	2015-02-23	1	-0/+6
\| \| \| \|	llvm-svn: 230268
*	X86: Consistently use 'epilogue' instead of 'epilog'	David Majnemer	2015-02-23	1	-1/+1
\| \| \| \|	llvm-svn: 230267
*	add newline for easier reading; NFC	Sanjay Patel	2015-02-23	1	-1/+1
\| \| \| \|	llvm-svn: 230265
*	[AsmPrinter] Access pointers to globals via pcrel GOT entries	Bruno Cardoso Lopes	2015-02-23	3	-15/+231
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Front-ends could use global unnamed_addr to hold pointers to other symbols, like @gotequivalent below: @foo = global i32 42 @gotequivalent = private unnamed_addr constant i32* @foo @delta = global i32 trunc (i64 sub (i64 ptrtoint (i32** @gotequivalent to i64), i64 ptrtoint (i32* @delta to i64)) to i32) The global @delta holds a data "PC"-relative offset to @gotequivalent, an unnamed pointer to @foo. The darwin/x86-64 assembly output for this follows: .globl _foo _foo: .long 42 .globl _gotequivalent _gotequivalent: .quad _foo .globl _delta _delta: .long _gotequivalent-_delta Since unnamed_addr indicates that the address is not significant, only the content, we can optimize the case above by replacing pc-relative accesses to "GOT equivalent" globals, by a PC relative access to the GOT entry of the final symbol instead. Therefore, "delta" can contain a pc relative relocation to foo's GOT entry and we avoid the emission of "gotequivalent", yielding the assembly code below: .globl _foo _foo: .long 42 .globl _delta _delta: .long _foo@GOTPCREL+4 There are a couple of advantages of doing this: (1) Front-ends that need to emit a great deal of data to store pointers to external symbols could save space by not emitting such "got equivalent" globals and (2) IR constructs combined with this opt opens a way to represent GOT pcrel relocations by using the LLVM IR, which is something we previously had no way to express. Differential Revision: http://reviews.llvm.org/D6922 rdar://problem/18534217 llvm-svn: 230264
*	Removing unused private field.	Andrew Kaylor	2015-02-23	1	-3/+1
\| \| \| \|	llvm-svn: 230259
*	Second attempt to fix WinEHCatchDirector build failures.	Andrew Kaylor	2015-02-23	1	-1/+0
\| \| \| \|	llvm-svn: 230257
*	Attempting to fix WinEHCatchDirector destructor related build failures.	Andrew Kaylor	2015-02-23	1	-0/+1
\| \| \| \|	llvm-svn: 230252
*	Remap frame variables for native Windows exception handling.	Andrew Kaylor	2015-02-23	2	-392/+649
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D7770 llvm-svn: 230249
*	Revert "[X86][MMX] Add MMX instructions to foldable tables"	Bruno Cardoso Lopes	2015-02-23	1	-82/+0
\| \| \| \| \| \|	This reverts commit r230226 since it breaks win buildbots. llvm-svn: 230248
*	Revert "Revert "Raising minimum required CMake version to 2.8.12.2.""	Chad Rosier	2015-02-23	2	-2/+2
\| \| \| \| \| \|	This reverts commit r230240, which was an accidental commit. llvm-svn: 230246
*	Rewrite the global merge pass to be subprogram agnostic for now.	Eric Christopher	2015-02-23	7	-42/+28
\| \| \| \| \| \| \| \| \| \| \| \| \|	It was previously using the subtarget to get values for the global offset without actually checking each function as it was generating code. Go ahead and solidify the current behavior and make the existing FIXMEs more prominent. As a note the ARM backend previously had a thumb1 and non-thumb1 set of defaults. Only the former was tested so I've changed the behavior to only use that for now. llvm-svn: 230245
*	Prevent hoisting fmul from THEN/ELSE to IF if there is fmsub/fmadd opportunity.	Chad Rosier	2015-02-23	4	-2/+41
\| \| \| \| \| \| \| \| \| \| \|	This patch adds the isProfitableToHoist API. For AArch64, we want to prevent a fmul from being hoisted in cases where it is more profitable to form a fmsub/fmadd. Phabricator Review: http://reviews.llvm.org/D7299 Patch by Lawrence Hu <lawrence@codeaurora.org> llvm-svn: 230241
*	Revert "Raising minimum required CMake version to 2.8.12.2."	Chad Rosier	2015-02-23	2	-2/+2
\| \| \| \| \| \|	This reverts commit 247aed4710e8befde76da42b27313661dea7cf66. llvm-svn: 230240
*	InstSimplify: simplify 0 / X if nnan and nsz	Mehdi Amini	2015-02-23	2	-16/+34
\| \| \| \| \|	From: Fiona Glaser <fglaser@apple.com> llvm-svn: 230238
*	[mips] Honour -mno-odd-spreg for vector insert/extract when MSA is enabled.	Daniel Sanders	2015-02-23	2	-5/+20
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: -mno-odd-spreg prohibits the use of odd-numbered single-precision floating point registers. However, vector insert/extract was still using them when manipulating the subregisters of an MSA register. Fixed this by ensuring that insertion/extraction is only performed on even-numbered vector registers when -mno-odd-spreg is given. Reviewers: vmedic, sstankovic Reviewed By: sstankovic Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7672 llvm-svn: 230235
*	Fix incorrect immediate size for AddrModeT2_i8s4 in rewriteT2FrameIndex.	Bob Wilson	2015-02-23	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The natural way to handle this addressing mode would be to say that it has 8 bits and gets scaled by 4, but since the MC layer is expecting the scaling to be already reflected in the immediate value, we have been setting the Scale to 1. That's fine, but then NumBits needs to be adjusted to reflect the effective increase in the range of the immediate. That adjustment was missing. The consequence is that the register scavenger can fail. The estimateRSStackSizeLimit() function in ARMFrameLowering.cpp correctly assumes that the AddrModeT2_i8s4 address mode can handle scaled offsets up to 1020. Under just the right circumstances, we fail to reserve space for the scavenger because it thinks that nothing will be needed. However, the overly pessimistic behavior in rewriteT2FrameIndex causes some frame indexes to be out of range and require scavenged registers, and so the scavenger asserts. Unfortunately I have not been able to come up with a testcase for this. I can only reproduce it on an internal branch where the frame layout and register allocation is slightly different than trunk. We really need a way to serialize MachineInstr-level IR to write reasonable tests for things like this. rdar://problem/19909005 llvm-svn: 230233
*	Sync the __builtin_expects for our 3 quadratically probed hash table ↵	Benjamin Kramer	2015-02-23	2	-11/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	implementations. This assumes that a) finding the bucket containing the value is LIKELY b) finding an empty bucket is LIKELY c) growing the table is UNLIKELY I also switched the a) and b) cases for SmallPtrSet as we seem to use the set mostly more for insertion than for checking existence. In a simple benchmark consisting of 2^21 insertions of 2^20 unique pointers into a DenseMap or SmallPtrSet a few percent speedup on average, but nothing statistically significant. llvm-svn: 230232
*	[X86][MMX] Add MMX instructions to foldable tables	Bruno Cardoso Lopes	2015-02-23	1	-0/+82
\| \| \| \| \| \| \| \|	Teach the peephole optimizer to work with MMX instructions by adding entries into the foldable tables. This covers folding opportunities not handled during isel. llvm-svn: 230226
*	[X86][MMX] Support folding loads in psll, psrl and psra intrinsics	Bruno Cardoso Lopes	2015-02-23	2	-0/+21
\| \| \| \|	llvm-svn: 230225
*	AVX-512: recommitted 229837 + bugfix + test	Elena Demikhovsky	2015-02-23	5	-47/+87
\| \| \| \|	llvm-svn: 230223
*	restructured X86 scalar unary operation templates	Elena Demikhovsky	2015-02-23	1	-164/+118
\| \| \| \| \| \| \|	I made the templates general, no need to define pattern separately for each instruction/intrinsic. Now only need to add r_Int pattern for AVX. llvm-svn: 230221
*	AsmParser: Check ConstantExpr insertvalue operands for type correctness	David Majnemer	2015-02-23	1	-1/+8
\| \| \| \|	llvm-svn: 230206
*	[llvm-pdbdump] Very minor code cleanup.	Zachary Turner	2015-02-23	3	-20/+15
\| \| \| \| \| \| \|	This just removes some dead enums as well as some debug flushes of stdout. llvm-svn: 230204
*	[llvm-pdbdump] Add an option to dump full class definitions.	Zachary Turner	2015-02-23	5	-4/+95
\| \| \| \| \| \| \| \| \| \| \|	This adds the --class-definitions flag. If specified, when dumping types, instead of "class Foo" you will see the full class definition, with member functions, constructors, access specifiers. NOTE: Using this option can be very slow, as generating a full class definition requires accessing many different parts of the PDB. llvm-svn: 230203
*	AsmParser: Call instructions can't have an alignment	David Majnemer	2015-02-23	1	-2/+10
\| \| \| \|	llvm-svn: 230193
*	AsmParser: Check ConstantExpr GEP operands for validity	David Majnemer	2015-02-22	1	-2/+24
\| \| \| \|	llvm-svn: 230188
*	[llvm-pdbdump] Rewrite dumper using visitor pattern.	Zachary Turner	2015-02-22	35	-498/+293
\| \| \| \| \| \| \| \| \| \|	This increases the flexibility of how to dump different symbol types -- necessary for context-sensitive formatting of symbol types -- and also improves the modularity by allowing the dumping to be implemented in the actual dumper, as opposed to in the PDB library. llvm-svn: 230184
*	[llvm-pdbdump] Simplify options and output.	Zachary Turner	2015-02-22	1	-0/+1
\| \| \| \| \| \| \| \| \|	This removes a wealth of options, and instead now only provides three options. -symbols, -types, and -compilands. This greatly simplifies use of the tool, and makes it easier to understand what you're going to see when you run the tool. llvm-svn: 230182
*	Roll condition into an assert then wrap it 'ifndef NDEBUG' to protect from ↵	David Blaikie	2015-02-22	1	-8/+7
\| \| \| \| \| \|	the inevitable "unused variable" warning in a non-asserts build. llvm-svn: 230181
*	Use common parse routine to read alignment values from bitcode	JF Bastien	2015-02-22	2	-14/+46
\| \| \| \| \| \| \| \| \| \|	While fuzzing LLVM bitcode files, I discovered that (1) the bitcode reader doesn't check that alignments are no larger than 2**29; (2) downstream code doesn't check the range; and (3) for values out of range, corresponding large memory requests (based on alignment size) will fail. This code fixes the bitcode reader to check for valid alignments, fixing this problem. This CL fixes alignment value on global variables, functions, and instructions: alloca, load, load atomic, store, store atomic. Patch by Karl Schimpf (kschimpf@google.com). llvm-svn: 230180
*	[LICM] Refactor to expose functionality as utility functions	Hal Finkel	2015-02-22	1	-173/+199
\| \| \| \| \| \| \| \| \| \| \|	This refactors the core functionality of LICM: HoistRegion, SinkRegion and PromoteAliasSet (renamed to promoteLoopAccessesToScalars) as utility functions in LoopUtils. This will enable other transformations to make use of them directly. Patch by Ashutosh Nema. llvm-svn: 230178
*	[DagCombiner] Generalized BuildVector Vector Concatenation	Simon Pilgrim	2015-02-22	1	-23/+38
\| \| \| \| \| \| \| \| \| \| \| \|	The CONCAT_VECTORS combiner pass can transform the concat of two BUILD_VECTOR nodes into a single BUILD_VECTOR node. This patch generalises this to support any number of BUILD_VECTOR nodes, and also permits UNDEF nodes to be included as well. This was noticed as AVX vec128 -> vec256 canonicalization sometimes creates a CONCAT_VECTOR with a real vec128 lower and an vec128 UNDEF upper. Differential Revision: http://reviews.llvm.org/D7816 llvm-svn: 230177
*	[DAGCombine] Don't assume integer-type legailty in ↵	Hal Finkel	2015-02-22	1	-0/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	reduceBuildVecConvertToConvertBuildVec DAGCombine will rewrite an BUILD_VECTOR where all non-undef inputs some from [US]INT_TO_FP, as a BUILD_VECTOR of integers with the conversion applied as a vector operation. We check operation legality of the conversion, but fail to check legality of the integer vector type itself. Because targets don't normally override operation legality defaults for illegal types, we need to check this also. This came up in the context of the QPX vector entensions for PowerPC (which can have legal floating-point vector types without corresponding legal integer vector types). No in-tree test case for this yes, but one can be added once the QPX support has been committed. llvm-svn: 230176