bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[LazyValueInfo] Look through Phi nodes when trying to prove a predicate	Philip Reames	2015-08-31	1	-5/+35
	If asked to prove a predicate about a value produced by a PHI node, LazyValueInfo was unable to do so even if the predicate was known to be true for each input to the PHI. This prevented JumpThreading from eliminating a provably redundant branch. The problematic test case looks something like this:
	  ListNode *p = ...;
	  while (p != null) {
	    if (!p) return;
	    x = g->x; // unrelated
	    p = p->next
	  }
	The null check at the top of the loop is redundant since the value of 'p' is null checked on entry to the loop and before executing the backedge. This resulted in a) executing an extra null check per iteration and b) not being able to LICM unrelated loads after the check, since we couldn't prove they would execute or that their dereferenceability wasn't affected by the null check on the first iteration. Differential Revision: http://reviews.llvm.org/D12383 llvm-svn: 246465
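	The rule this commit describes (a predicate holds for a PHI when it holds for every incoming value) can be sketched as a small standalone model. This is not LazyValueInfo's code: the `Tristate` enum and `mergePhiInputs` are hypothetical names used only for illustration.

```cpp
#include <cassert>
#include <vector>

// Hypothetical three-valued answer: the predicate is known false, known true,
// or could not be decided for a given incoming value.
enum class Tristate { False, True, Unknown };

// Merge the per-input answers for a PHI node. The predicate is proven for the
// PHI only when every incoming value yields the same definite answer.
inline Tristate mergePhiInputs(const std::vector<Tristate> &inputs) {
  if (inputs.empty())
    return Tristate::Unknown;
  const Tristate first = inputs.front();
  for (Tristate t : inputs) {
    if (t == Tristate::Unknown || t != first)
      return Tristate::Unknown;
  }
  return first;
}
```

	In the loop from the commit message, `p` at the loop header is a PHI of the entry value and the backedge value, and both are known non-null, so the merged answer for "p != null" is definite.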
*	Rework of the new interface for shrink wrapping	Kit Barton	2015-08-31	2	-21/+29
	Based on comments from Hal (http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20150810/292978.html), I've changed the interface to add a callback mechanism to the TargetFrameLowering class to query whether the specific target supports shrink wrapping. By default, shrink wrapping is disabled. Each target can override the default behaviour using the TargetFrameLowering::targetSupportsShrinkWrapping() method. Shrink wrapping can still be explicitly enabled or disabled from the command line, using the existing -enable-shrink-wrap=<true|false> option. Phabricator: http://reviews.llvm.org/D12293 llvm-svn: 246463
*	AArch64: Fix loads to lower NEON vector lanes using GPR registers	Matthias Braun	2015-08-31	1	-1/+4
	The ISelLowering code turned the insertion of the element for the lowest lane of a BUILD_VECTOR into an INSERT_SUBREG; this prevented the patterns for SCALAR_TO_VECTOR(Load) from matching later. Restrict this to cases without a load argument. Reported in rdar://22223823 Differential Revision: http://reviews.llvm.org/D12467 llvm-svn: 246462
*	X86: Fix FastISel SSESelect register class	Matthias Braun	2015-08-31	1	-3/+9
	X86FastISel has been using the wrong register class for VBLENDVPS which produces a VR128 and needs an extra copy to the target register. The problem was already hit by the existing test cases when using:
	  llvm-lit -Dllc="llc -verify-machineinstr"
	llvm-svn: 246461
*	[BitcodeReader] Ensure we can read constant vector selects with an i1 condition	Filipe Cabecinhas	2015-08-31	1	-4/+5
	Summary: Constant vectors weren't allowed to have an i1 condition in the BitcodeReader. Make sure we have the same restrictions that are documented, not more. Reviewers: nlewycky, rafael, kschimpf Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D12440 llvm-svn: 246459
*	[MC/AsmParser] Avoid setting MCSymbol.IsUsed in some cases	Vedant Kumar	2015-08-31	1	-9/+8
	Avoid marking some MCSymbols as used in MC/AsmParser.cpp when no uses exist. This fixes a bug in parseAssignmentExpression() which inadvertently sets IsUsed, thereby triggering: "invalid re-assignment of non-absolute variable" on otherwise valid code. No other functionality change intended. The original version of this patch touched many calls to MCSymbol accessors. On rafael's advice, I have stripped this patch down a bit. As a follow-up, I intend to find the call sites which intentionally set IsUsed and force them to do so explicitly. Differential Revision: http://reviews.llvm.org/D12347 llvm-svn: 246457
*	Change comment to verify commit access.	Karl Schimpf	2015-08-31	1	-1/+1
	llvm-svn: 246451
*	Revert "Repress sanitization on User dtor. Modify msan macros for applying ↵	Naomi Musgrave	2015-08-31	2	-6/+2
	attribute" This reverts commit 5e3bfbb38eb3fb6f568b107f6b239e0aa4c5f334. llvm-svn: 246450
*	Repress sanitization on User dtor. Modify msan macros for applying attribute	Naomi Musgrave	2015-08-31	2	-2/+6
	to repress sanitization. Move attribute for repressing sanitization to operator delete for User, MDNode. Summary: In response to bug 24578, reported against failing LLVM test. Reviewers: chandlerc, rsmith, eugenis Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D12335 llvm-svn: 246449
*	[SectionMemoryManager] Use range-based for loops. No functional change intended.	Benjamin Kramer	2015-08-31	1	-21/+10
	llvm-svn: 246440
*	AVX512: ktest implementation	Igor Breger	2015-08-31	4	-14/+16
	Added tests for encoding. Differential Revision: http://reviews.llvm.org/D11979 llvm-svn: 246439
*	AVX512: Implemented encoding and intrinsics for vdbpsadbw	Igor Breger	2015-08-31	5	-1/+15
	Added tests for intrinsics and encoding. Differential Revision: http://reviews.llvm.org/D12491 llvm-svn: 246436
*	AVX512: kadd implementation	Igor Breger	2015-08-31	1	-2/+4
	Added tests for encoding. Differential Revision: http://reviews.llvm.org/D11973 llvm-svn: 246432
*	AVX512: Implemented encoding and intrinsics for vpalignr	Igor Breger	2015-08-31	4	-34/+92
	Added tests for intrinsics and encoding. Differential Revision: http://reviews.llvm.org/D12270 llvm-svn: 246428
*	[AggressiveAntiDepBreaker] Check for EarlyClobber on defining instruction	Hal Finkel	2015-08-31	1	-0/+14
	AggressiveAntiDepBreaker was doing some EarlyClobber checking, but was not checking that the register being potentially renamed was defined by an early-clobber def where there was also a use, in that instruction, of the register being considered as the target of the rename. Fixes PR24014. llvm-svn: 246423
*	[JumpThreading] make jump threading respect convergent annotation.	Jingyue Wu	2015-08-31	1	-1/+1
	Summary: JumpThreading shouldn't duplicate a convergent call, because that would move a convergent call into a control-inequivalent location. For example,
	  if (cond) { ... } else { ... }
	  convergent_call();
	  if (cond) { ... } else { ... }
	should not be optimized to
	  if (cond) { ... convergent_call(); ... } else { ... convergent_call(); ... }
	Test Plan: test/Transforms/JumpThreading/basic.ll Patch by Xuetian Weng. Reviewers: resistor, arsenm, jingyue Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D12484 llvm-svn: 246415
*	Support: Support LLVM_ENABLE_THREADS=0 in llvm/Support/thread.h.	Peter Collingbourne	2015-08-31	1	-2/+2
	Specifically, the header now provides llvm::thread, which is either a typedef of std::thread or a replacement that calls the function synchronously depending on the value of LLVM_ENABLE_THREADS. llvm-svn: 246402
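	The idea of a thread type that degrades to a synchronous call can be sketched roughly as follows. This is not the contents of llvm/Support/thread.h: the macro `SKETCH_ENABLE_THREADS` and the namespace `sketch` are hypothetical stand-ins, and the fallback class implements only a constructor and `join()`.

```cpp
#include <cassert>
#include <thread>
#include <utility>

// Hypothetical configuration switch standing in for a build-time setting.
#ifndef SKETCH_ENABLE_THREADS
#define SKETCH_ENABLE_THREADS 0
#endif

namespace sketch {
#if SKETCH_ENABLE_THREADS
// Threads available: use the standard type directly.
using thread = std::thread;
#else
// Threads disabled: run the callable immediately on the calling thread.
class thread {
public:
  template <class Function, class... Args>
  explicit thread(Function &&f, Args &&...args) {
    std::forward<Function>(f)(std::forward<Args>(args)...);
  }
  thread(const thread &) = delete;
  thread &operator=(const thread &) = delete;

  // The work has already completed in the constructor, so join is a no-op.
  void join() {}
};
#endif
} // namespace sketch
```

	Callers write `sketch::thread t(fn); t.join();` in either configuration; only the point at which `fn` runs differs.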
*	[PowerPC] Fixup SELECT_CC (and SETCC) patterns with i1 comparison operands	Hal Finkel	2015-08-30	4	-5/+168
	There were really two problems here. The first was that we had the truth tables for signed i1 comparisons backward. I imagine these are not very common, but if you have: setcc i1 x, y, LT this has the '0 1' and the '1 0' results flipped compared to: setcc i1 x, y, ULT because, in the signed case, '1 0' is really '-1 0', and the answer is not the same as in the unsigned case. The second problem was that we did not have patterns (at all) for the unsigned comparisons select_cc nodes for i1 comparison operands. This was the specific cause of PR24552. These had to be added (and a missing Altivec promotion added as well) to make sure these function for all types. I've added a bunch more test cases for these patterns, and there are a few FIXMEs in the test case regarding code-quality. Fixes PR24552. llvm-svn: 246400
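	The flipped truth table for signed versus unsigned i1 comparisons can be checked with a few lines of arithmetic. The helper names below are hypothetical; the sketch only models a one-bit value read as two's complement (0 or -1) versus unsigned (0 or 1), and says nothing about the PowerPC patterns themselves.

```cpp
#include <cassert>

// A one-bit value read as unsigned is 0 or 1; read as signed it is 0 or -1.
constexpr int asUnsignedI1(bool b) { return b ? 1 : 0; }
constexpr int asSignedI1(bool b) { return b ? -1 : 0; }

// Unsigned and signed "less than" on one-bit operands.
constexpr bool i1ULT(bool x, bool y) { return asUnsignedI1(x) < asUnsignedI1(y); }
constexpr bool i1SLT(bool x, bool y) { return asSignedI1(x) < asSignedI1(y); }

// Unsigned: only (0, 1) is "less than".
static_assert(i1ULT(false, true) && !i1ULT(true, false), "unsigned i1 ordering");
// Signed: only (1, 0) is "less than", because 1 really means -1.
static_assert(!i1SLT(false, true) && i1SLT(true, false), "signed i1 ordering");
```

	The '0 0' and '1 1' rows are false in both cases, so only the two mixed rows differ.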
*	NFC: Code style in VectorUtils.cpp	Elena Demikhovsky	2015-08-30	1	-10/+12
	Differential Revision: http://reviews.llvm.org/D12478 llvm-svn: 246381
*	Revert "Revert "New interface function is added to VectorUtils Value ↵	Renato Golin	2015-08-30	2	-17/+39
	getSplatValue(Value Val);"" This reverts commit r246379. It seems that the commit was not the culprit, and the bot will be investigated for instability. llvm-svn: 246380
*	Revert "New interface function is added to VectorUtils Value ↵	Renato Golin	2015-08-30	2	-39/+17
	getSplatValue(Value Val);" This reverts commit r246371, as it caused a rather obscure bug in AArch64 test-suite paq8p (timeouts, seg-faults). I'll investigate it before reapplying. llvm-svn: 246379
*	Stop calling the flat out insane ARM target parsing code unless the	Chandler Carruth	2015-08-30	1	-8/+20
	architecture string is something quite weird. Similarly delay calling the BPF parsing code, although that is more reasonable. To understand why I was motivated to make this change, it cuts the time for running the ADT TripleTest unittests by a factor of two in non-optimized builds (the developer default) and reduces my 'check-llvm' time by a full 15 seconds. The implementation of parseARMArch is that slow. I tried to fix it in the prior series of commits, but frankly, I have no idea how to finish fixing it. The entire premise of the function (to allow 'v7a-unknown-linux' or some such to parse as an 'arm-unknown-linux' triple) seems completely insane to me, but I'll let the ARM folks sort that out. At least it is now out of the critical path of every developer working on LLVM. It also will likely make some other folks' code significantly faster as I've heard reports of 2% of time spent in triple parsing even in optimized builds! I'm not done making this code faster, but I am done trying to improve the ARM target parsing code. llvm-svn: 246378
*	Remove a linear walk to find the default FPU for a given CPU by directly	Chandler Carruth	2015-08-30	1	-7/+6
	expanding the .def file within a StringSwitch. llvm-svn: 246377
*	[MIR Serialization] static -> static const in ↵	Hal Finkel	2015-08-30	3	-5/+5
	getSerializable*MachineOperandTargetFlags Make the arrays 'static const' instead of just 'static'. Post-commit review comment from Roman Divacky on IRC. NFC. llvm-svn: 246376
*	Teach the target parsing framework to directly compute the length of all	Chandler Carruth	2015-08-30	2	-45/+72
	of its strings when expanding the string literals from the macros, and push all of the APIs to be StringRef instead of C-string APIs. This (remarkably) removes a very non-trivial number of strlen calls. It even deletes code and complexity from one of the primary users -- Clang. llvm-svn: 246374
*	[PowerPC/MIR Serialization] Target flags serialization support	Hal Finkel	2015-08-30	2	-0/+41
	Add support for MIR serialization of PowerPC-specific operand target flags (based on the generic infrastructure added in r244185 and r245383). I won't even pretend that this is good test coverage, but this includes the regression test associated with r246372. Adding an MIR test for that fix is far superior to adding an IR-level test because particular instruction-scheduling decisions are necessary in order to expose the bug, and using an MIR test we can start the pipeline post-scheduling. llvm-svn: 246373
*	[PowerPC] Don't assume ADDISdtprelHA's source is r3	Hal Finkel	2015-08-30	1	-5/+5
	Even though ADDISdtprelHA generally has r3 as its source register, it is possible for the instruction scheduler to move things around such that some other register is the source. We need to print the actual source register, not always r3. Fixes PR24394. The test case will come in a follow-up commit because it depends on MIR target-flags parsing. llvm-svn: 246372
*	New interface function is added to VectorUtils	Elena Demikhovsky	2015-08-30	2	-17/+39
	Value getSplatValue(Value Val); It complements the CreateVectorSplat(), which creates 2 instructions - insertelement and shuffle with all-zero mask. The new function recognizes the pattern - insertelement+shuffle and returns the splat value (or nullptr). It also returns a splat value from ConstantDataVector, for completeness. Differential Revision: http://reviews.llvm.org/D11124 llvm-svn: 246371
*	Refactor the ARM target parsing to use a def file with macros to expand	Chandler Carruth	2015-08-30	1	-164/+14
	the necessary tables. This will allow me to restructure the code and structures using this to be significantly more efficient. It also removes the duplication of the list of several enumerators. It also enshrines that the order of enumerators match the order of the entries in the tables, something the implementation code actually uses. No functionality changed (yet). llvm-svn: 246370
*	[Triple] Use clang-format to normalize the formatting of the ARM target	Chandler Carruth	2015-08-30	1	-36/+35
	parsing logic prior to making substantial changes to it. This parsing logic is incredibly wasteful, so I'm planning to rewrite it. Just unittesting the triple parsing logic spends well over 80% of its time in the ARM parsing logic, and others have measured significant time spent here in real production compiles. Stay tuned... llvm-svn: 246369
*	[Triple] Stop abusing a class to have only static methods and just use	Chandler Carruth	2015-08-30	5	-49/+49
	the namespace that we are already using for the enums that are produced by the parsing. llvm-svn: 246367
*	SelectionDAG: add missing ComputeSignBits case for SELECT_CC	Fiona Glaser	2015-08-29	1	-0/+5
	Identical to SELECT, just with different operand numbers. llvm-svn: 246366
*	Fix shared library build.	Peter Collingbourne	2015-08-29	1	-0/+7
	llvm-svn: 246365
*	[ARM] Hoist fabs/fneg above a conversion to float.	James Molloy	2015-08-29	1	-1/+16
	This is especially visible in softfp mode, for example in the implementation of libm fabs/fneg functions. If we have:
	  %1 = vmovdrr r0, r1
	  %2 = fabs %1
	then move the fabs before the vmovdrr:
	  %1 = and r1, #0x7FFFFFFF
	  %2 = vmovdrr r0, r1
	This is never a loss, and could be a serious win because the vmovdrr may be followed by a vmovrrd, which would enable us to remove the conversion into FPRs completely. We already do this for f32, but not for f64. Tests are added for both. llvm-svn: 246360
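	The integer form of fabs used above (clear the top bit of the high 32-bit word) can be illustrated in portable code. This is only a sketch of the bit manipulation, assuming an IEEE-754 binary64 `double`; `fabsViaHighWord` is a hypothetical name and the function is unrelated to the ARM backend code.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Compute |d| by masking the sign bit out of the high 32-bit half of the
// 64-bit representation, mirroring "and r1, #0x7FFFFFFF" on the high word.
// Assumes double is IEEE-754 binary64.
inline double fabsViaHighWord(double d) {
  static_assert(sizeof(double) == sizeof(std::uint64_t), "expects 64-bit double");
  std::uint64_t bits;
  std::memcpy(&bits, &d, sizeof bits);

  std::uint32_t lo = static_cast<std::uint32_t>(bits);       // low word, untouched
  std::uint32_t hi = static_cast<std::uint32_t>(bits >> 32); // high word holds the sign
  hi &= 0x7FFFFFFFu;

  bits = (static_cast<std::uint64_t>(hi) << 32) | lo;
  std::memcpy(&d, &bits, sizeof d);
  return d;
}
```

	Because only integer operations touch the value, no move into a floating-point register is needed to take the absolute value.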
*	AMDGPU: Add sdst operand to VOP2b instructions	Matt Arsenault	2015-08-29	2	-20/+30
	The VOP3 encoding of these allows any SGPR pair for the i1 output, but this was forced before to always use vcc. This doesn't yet try to use this, but does add the operand to the definitions so the main change is adding vcc to the output of the VOP2 encoding. llvm-svn: 246358
*	AMDGPU: Set mem operands for spill instructions	Matt Arsenault	2015-08-29	3	-25/+55
	llvm-svn: 246357
*	AMDGPU: Fix dropping mem operands when moving to VALU	Matt Arsenault	2015-08-29	1	-11/+12
	Without a memory operand, mayLoad or mayStore instructions are treated as hasUnorderedMemRef, which results in much worse scheduling. We really should have a verifier check that any non-side effecting mayLoad or mayStore has a memory operand. There are a few instructions (interp and images) for which I'm not sure what / where to add these. llvm-svn: 246356
*	AMDGPU/SI: Fix some invalid assumptions when folding 64-bit immediates	Tom Stellard	2015-08-29	1	-1/+5
	Summary: We were assuming that if the use operand had a sub-register that the immediate was 64-bits, but this was breaking the case of folding a 64-bit immediate into another 64-bit instruction. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D12255 llvm-svn: 246354
*	AMDGPU/SI: Factor operand folding code into its own function	Tom Stellard	2015-08-28	1	-67/+79
	Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D12254 llvm-svn: 246353
*	DI: Set DILexicalBlock columns >= 65536 to 0/unknown	Duncan P. N. Exon Smith	2015-08-28	1	-0/+3
	This fixes PR24621 and matches what we do for `DILocation`. Although the limit seems somewhat artificial, there are places in the backend that also assume 16-bit columns, so we may as well just be consistent about the limits. llvm-svn: 246349
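	The rule in the commit subject (columns at or above 65536 become 0, meaning unknown) amounts to a one-line clamp. The function name `normalizeColumn` is hypothetical and this is a sketch of the rule only, not the code added to the debug-info classes.

```cpp
#include <cassert>

// Columns must fit in 16 bits; anything at or above 65536 is recorded as 0,
// which stands for "unknown column".
constexpr unsigned normalizeColumn(unsigned column) {
  return column >= (1u << 16) ? 0u : column;
}

static_assert(normalizeColumn(65535u) == 65535u, "largest representable column is kept");
static_assert(normalizeColumn(65536u) == 0u, "first out-of-range column becomes unknown");
```

	Mapping to 0 rather than truncating avoids reporting a wrong but plausible-looking column.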
*	[X86] NFC: Clean up and clang-format a few lines	Vedant Kumar	2015-08-28	1	-5/+5
	llvm-svn: 246340
*	DI: Add Function::getSubprogram()	Duncan P. N. Exon Smith	2015-08-28	2	-1/+21
	Add `Function::setSubprogram()` and `Function::getSubprogram()`, convenience methods to forward to `setMetadata()` and `getMetadata()`, respectively, and deal in `DISubprogram` instead of `MDNode`. Also add a verifier check to enforce that `!dbg` attachments are always subprograms. Originally (when I had the llvm-dev discussion back in April) I thought I'd store a pointer directly on `llvm::Function` for these attachments -- we frequently have debug info, and that's much cheaper than using map in the context if there are no other function-level attachments -- but for now I'm just using the generic infrastructure. Let's add the extra complexity only if this shows up in a profile. llvm-svn: 246339
*	AsmPrinter: Allow null subroutine type	Duncan P. N. Exon Smith	2015-08-28	2	-8/+3
	Currently the DWARF backend requires that subprograms have a type, and the type is ignored if it has an empty type array. The long term direction here -- see PR23079 -- is instead to skip the type entirely if there's no valid type. It turns out we have cases in tree of missing types on subprograms, but since they're not referenced by compile units, the backend never crashes on them. One option would be to add a Verifier check that subprograms have types, and fix the bitrot. However, this is a fair bit of churn (20-30 testcases) that would be reversed anyway by PR23079. I found this inconsistency because of a WIP patch and upgrade script for PR23367 that started crashing on test/DebugInfo/2010-10-01-crash.ll. This commit updates the testcase to reference the subprogram from the compile unit, and fixes the resulting crash (in line with the direction of PR23079). This also updates `DIBuilder` to stop assuming a non-null pointer for the subroutine types. llvm-svn: 246333
*	Revert r246232 and r246304.	David Majnemer	2015-08-28	2	-14/+51
	This reverts isSafeToSpeculativelyExecute's use of ReadNone until we split ReadNone into two pieces: one attribute which reasons about how the function reasons about memory and another attribute which determines how it may be speculated, CSE'd, trap, etc. llvm-svn: 246331
*	DI: Require subprogram definitions to be distinct	Duncan P. N. Exon Smith	2015-08-28	3	-1/+11
	As a follow-up to r246098, require `DISubprogram` definitions (`isDefinition: true`) to be 'distinct'. Specifically, add an assembler check, a verifier check, and bitcode upgrading logic to combat testcase bitrot after the `DIBuilder` change. While working on the testcases, I realized that test/Linker/subprogram-linkonce-weak-odr.ll isn't relevant anymore. Its purpose was to check for a corner case in PR22792 where two subprogram definitions match exactly and share the same metadata node. The new verifier check, requiring that subprogram definitions are 'distinct', precludes that possibility. I updated almost all the IR with the following script:
	  git grep -l -E -e '= !DISubprogram\(.* isDefinition: true' | grep -v test/Bitcode | xargs sed -i '' -e 's/= \(!DISubprogram(.*, isDefinition: true\)/= distinct \1/'
	Likely some variant of this would work for out-of-tree testcases. llvm-svn: 246327
*	[InstCombine] Fix PR24605.	Sanjoy Das	2015-08-28	2	-0/+11
	PR24605 is caused by an incorrect insert point in instcombine's IR builder. When simplifying
	  %t = add X Y
	  ...
	  %m = icmp ... %t
	the replacement for %t should be placed before %t, not before %m, as there could be a use of %t between %t and %m. llvm-svn: 246315
*	Optimize memcmp(x,y,n)==0 for small n and suitably aligned x/y.	Chad Rosier	2015-08-28	1	-0/+22
	http://reviews.llvm.org/D6952 PR20673 llvm-svn: 246313
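	For an equality-only test of a small fixed size, a byte-wise memcmp can in principle be replaced by a pair of integer loads and one compare. The sketch below shows that idea for n == 4 in source form; `equal4` is a hypothetical helper, and the actual transformation in this commit operates on IR under its own alignment and size conditions, which are not reproduced here.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Equivalent to memcmp(x, y, 4) == 0: load both 4-byte blocks as integers
// and compare once. std::memcpy keeps the loads well defined regardless of
// the pointers' alignment; a compiler typically turns each into a plain load.
inline bool equal4(const void *x, const void *y) {
  std::uint32_t a;
  std::uint32_t b;
  std::memcpy(&a, x, sizeof a);
  std::memcpy(&b, y, sizeof b);
  return a == b;
}
```

	This works only for the `== 0` (or `!= 0`) form: the sign of a general memcmp result depends on byte order, which a single integer compare does not preserve on little-endian targets.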
*	[mips64][mcjit] Add N64R6 relocations tests and fix N64R2 tests	Petar Jovanovic	2015-08-28	1	-2/+2
	This patch adds a test for MIPS64R6 relocations, it corrects check expressions for R_MIPS_26 and R_MIPS_PC16 relocations in MIPS64R2 test, and it adds run for big endian in MIPS64R2 test. Patch by Vladimir Radosavljevic. Differential Revision: http://reviews.llvm.org/D11217 llvm-svn: 246311
*	[mips] Remove incorrect DebugLoc entries from prologue	Petar Jovanovic	2015-08-28	3	-4/+3
	This has been causing the prologue_end to be incorrectly positioned. Patch by Vladimir Radosavljevic. Differential Revision: http://reviews.llvm.org/D11293 llvm-svn: 246309
*	Make MergeConsecutiveStores look at other stores on same chain	Matt Arsenault	2015-08-28	1	-24/+149
	When combiner AA is enabled, look at stores on the same chain. Non-aliasing stores are moved to the same chain so the existing code fails because it expects to find an adjacent store on a consecutive chain. Because of how DAGCombiner tries these store combines, MergeConsecutiveStores doesn't see the correct set of stores on the chain when it visits the other stores. Each store individually has its chain fixed before trying to merge consecutive stores, and then tries to merge stores from that point before the other stores have been processed to have their chains fixed. To fix this, attempt to use FindBetterChain on any possibly neighboring stores in visitSTORE. Suppose you have 4 32-bit stores that should be merged into 1 vector store. One store would be visited first, fixing the chain. What happens is because not all of the store chains have yet been fixed, 2 of the stores are merged. The other 2 stores later have their chains fixed, but because the other stores were already merged, they have different memory types and merging the two different sized stores is not supported and would be more difficult to handle. llvm-svn: 246307