Summary:
The original intent of test/Transforms/InstCombine/memset.ll was to test the lowering of llvm.memset into stores when the size of the memset is 1, 2, 4, or 8. Sometime between then and now the test stopped checking for that, but kept passing because it tests for the absence of llvm.memset calls rather than for the presence of store instructions. Right now this test ends up with an empty function body: the alloca is eliminated as safe-to-remove, which in turn eliminates the llvm.memset calls because their pointer arguments become undef, so the test does not exercise the conversion of llvm.memset into store instructions at all.
This change alters the test to verify that store instructions are created, and moves the target of the memset to a function argument so it is not eliminated as unused.
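For illustration, a minimal sketch of the lowering the test now checks for (the function name is hypothetical; this uses the memset signature of that era, with an explicit alignment argument):

declare void @llvm.memset.p0i8.i32(i8* nocapture, i8, i32, i32, i1)

define void @set4(i8* %p) {
  call void @llvm.memset.p0i8.i32(i8* %p, i8 1, i32 4, i32 1, i1 false)
  ret void
}
; expected to become, roughly:
;   %c = bitcast i8* %p to i32*
;   store i32 16843009, i32* %c, align 1   ; 0x01010101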
Reviewers: anna, efriedma
Reviewed By: efriedma
Subscribers: efriedma, llvm-commits
Differential Revision: https://reviews.llvm.org/D34642
llvm-svn: 306681
Summary:
Rather than testing for expected results, test/Transforms/InstCombine/memmove.ll tests for the absence of calls to llvm.memmove.
In the case of test3, the test stopped checking for materialization of loads/stores, but kept passing because it tests for the absence of llvm.memmove calls rather than for the presence of load/store instructions. Right now this test ends up with an empty function body: the alloca is eliminated as safe-to-remove, which in turn eliminates the llvm.memmove calls because a pointer argument becomes undef, so the test does not exercise the conversion of llvm.memmove into load/store instructions at all.
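For reference, a sketch of the conversion test3 should cover (hypothetical function name; memmove signature of that era):

declare void @llvm.memmove.p0i8.p0i8.i32(i8* nocapture, i8* nocapture, i32, i32, i1)

define void @move8(i8* %dst, i8* %src) {
  call void @llvm.memmove.p0i8.p0i8.i32(i8* %dst, i8* %src, i32 8, i32 1, i1 false)
  ret void
}
; expected to become, roughly:
;   %s = bitcast i8* %src to i64*
;   %d = bitcast i8* %dst to i64*
;   %v = load i64, i64* %s, align 1
;   store i64 %v, i64* %d, align 1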
Reviewers: eli.friedman, anna, efriedma
Reviewed By: efriedma
Subscribers: efriedma, llvm-commits
Differential Revision: https://reviews.llvm.org/D34645
llvm-svn: 306679
Summary:
As discussed on the mailing list, it is legal to propagate TBAA from a larger load to loads/stores of smaller regions within it. Do so for the (load->extractvalue) => (gep->load) folding and similar ones.
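A sketch of the folding in question (the types and the !1 tag are hypothetical):

%v = load { i32, i32 }, { i32, i32 }* %p, !tbaa !1
%x = extractvalue { i32, i32 } %v, 0
; folds to a narrower load that keeps the tag:
;   %gep = getelementptr { i32, i32 }, { i32, i32 }* %p, i32 0, i32 0
;   %x = load i32, i32* %gep, !tbaa !1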
Reviewed By: sanjoy
Differential Revision: https://reviews.llvm.org/D31954
llvm-svn: 306615
This is similar enough to bswap that we might as well handle them together in one patch.
llvm-svn: 306591
unrolling.
Reviewers: t.p.northover, mcrosier
Subscribers: aemerson, rengolin, javed.absar, kristof.beyls, llvm-commits
Differential Revision: https://reviews.llvm.org/D34533
llvm-svn: 306584
llvm-svn: 306578
I think we only need to make sure the value fits in 64 bits, not that the bit width is exactly 64 bits.
This helps places that use this for shift amounts, since the shift amount has the same bit width as the LHS but its value can't be larger than that bit width.
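For example (hypothetical values), a shift amount can live in a wide type while its value still fits in 64 bits:

%r = lshr i128 %x, 5
; the shift-amount operand has type i128, but its value (5) fits in a
; uint64_t, so the old bit-width check was stricter than necessary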
Differential Revision: https://reviews.llvm.org/D34737
llvm-svn: 306577
r306381 caused PR33613 by reversing the order in which insertelements were generated per unroll part. This patch fixes PR33613 by retaining this order, placing each set of insertelements per part immediately after the last scalar being packed for that part. Includes a test case derived from PR33613.
Reference: https://bugs.llvm.org/show_bug.cgi?id=33613
Differential Revision: https://reviews.llvm.org/D34760
llvm-svn: 306575
llvm-svn: 306571
Reviewers: sanjoy, anna, reames, apilipenko, igor-laevsky, mkuper
Subscribers: mcrosier, llvm-commits, mzolotukhin
Differential Revision: https://reviews.llvm.org/D34532
llvm-svn: 306564
llvm-svn: 306563
llvm-svn: 306536
The original patch was an improvement to IR ValueTracking on non-negative integers. It was checked in to trunk (D18777, r284022) but disabled by default due to performance regressions. The performance impact has since improved, so this patch enables it by default.
Reviewers: reames
Differential Revision: https://reviews.llvm.org/D34101
Patch by: Olga Chupina <olga.chupina@intel.com>
llvm-svn: 306528
Summary:
This commit allows matchSelectPattern to recognize a clamp of float arguments in the presence of FMF, the same way it already does for integers.
This case is a little different, though. For integers, once the min/max pattern is recognized, DAGBuilder starts selecting MIN/MAX "automatically". That is not the case for floats, because only the full FMINNAN/FMINNUM/FMAXNAN/FMAXNUM ISD nodes exist for them, and those do care about NaNs. On the other hand, some backends (e.g. X86) have only FMIN/FMAX nodes that do not care about NaNs, and the NAN/NUM nodes are illegal there, so selection does not happen. So I decided to do this kind of transformation in IR (InstCombiner) instead of complicating the logic in the backend.
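A sketch of the float clamp pattern being recognized (the bounds are hypothetical; the fast-math flags on the compares are what allow ignoring NaNs):

%cmp1 = fcmp fast olt float %x, 1.0
%min  = select i1 %cmp1, float %x, float 1.0
%cmp2 = fcmp fast ogt float %min, 0.0
%res  = select i1 %cmp2, float %min, float 0.0   ; clamp %x to [0.0, 1.0]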
Reviewers: spatel, jmolloy, majnemer, efriedma, craig.topper
Reviewed By: efriedma
Subscribers: hiraditya, javed.absar, n.bozhenov, llvm-commits
Patch by Andrei Elovikov <andrei.elovikov@intel.com>
Differential Revision: https://reviews.llvm.org/D33186
llvm-svn: 306525
Summary:
This commit adds tests for the clamp pattern as a prerequisite of D33186, to make the impact of that fix clearer and also to document current behavior.
Reviewers: spatel, jmolloy
Reviewed By: spatel
Subscribers: n.bozhenov, llvm-commits
Patch by Andrei Elovikov <andrei.elovikov@intel.com>
Differential Revision: https://reviews.llvm.org/D34350
llvm-svn: 306524
(trunc (lshr(X, cst1)), cst->icmp (and X, mask), cst when the shift type is larger than 64-bits. NFC
llvm-svn: 306510
handle icmp eq (trunc (lshr(X, cst1)), cst->icmp (and X, mask), cst when the shift type is larger than 64-bits. NFC"
I accidentally had an extra change in there.
llvm-svn: 306509
(trunc (lshr(X, cst1)), cst->icmp (and X, mask), cst when the shift type is larger than 64-bits. NFC
llvm-svn: 306508
llvm-svn: 306485
Differential Revision: https://reviews.llvm.org/D34312
llvm-svn: 306484
Summary:
Similar to X86, it should be safe to inline a callee if its target-features are a subset of the caller's. This change matches GCC's inlining behavior with respect to attributes [1].
[1] https://gcc.gnu.org/onlinedocs/gcc/AArch64-Function-Attributes.html#AArch64-Function-Attributes
Reviewers: kristof.beyls, javed.absar, rengolin, t.p.northover
Reviewed By: t.p.northover
Subscribers: aemerson, eraman, llvm-commits
Differential Revision: https://reviews.llvm.org/D34698
llvm-svn: 306478
Differential Revision: https://reviews.llvm.org/D33341
llvm-svn: 306473
As noted in D34071, there are some IR optimization opportunities that could be
handled by normal IR passes if this expansion wasn't happening so late in CGP.
Regardless of that, it seems wasteful to knowingly produce suboptimal IR here,
so I'm proposing this change:
%s = sub i32 %x, %y
%r = icmp ne %s, 0
=>
%r = icmp ne %x, %y
Changing the predicate to 'eq' mimics what InstCombine would do, so that's just
an efficiency improvement if we decide this expansion should happen sooner.
The fact that the PowerPC backend doesn't eliminate the 'subf.' might be
something for PPC folks to investigate separately.
Differential Revision: https://reviews.llvm.org/D34416
llvm-svn: 306471
llvm-svn: 306468
the constant is a vector splat or the scalar bit width is larger than 64-bits
The check to see if we can propagate the nsw flag used m_ConstantInt(uint64_t*&), which doesn't work with splat vectors and requires the bit width of the ConstantInt to be 64 bits or less.
This patch changes it to use m_APInt, removing both of these issues.
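A sketch of a case the old matcher missed (types are hypothetical): a splat-vector multiply by a power of two that should become a shift while keeping nsw:

%m = mul nsw <2 x i32> %x, <i32 4, i32 4>
; with m_APInt the splat constant is matched, so this can become:
;   %m = shl nsw <2 x i32> %x, <i32 2, i32 2>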
Differential Revision: https://reviews.llvm.org/D34699
llvm-svn: 306457
BlockAddresses are only valid within their function's context, which does not interact well with CodeExtractor. Detect this case and prevent it.
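A sketch of the problematic shape (function and labels are hypothetical): extracting %target into a new function would leave blockaddress(@f, %target) pointing at a block that no longer belongs to @f:

define void @f() {
entry:
  indirectbr i8* blockaddress(@f, %target), [label %target]
target:
  ret void
}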
Differential Revision: https://reviews.llvm.org/D33839
llvm-svn: 306448
SROA assumes the alloca address space is 0, which causes an assertion failure. This patch fixes that.
Differential Revision: https://reviews.llvm.org/D34104
llvm-svn: 306440
This canonicalization was suggested in D33172 as a way to make InstCombine behavior more uniform.
We have this transform for icmp+br, so unless there's some reason that icmp+select should be
treated differently, we should do the same thing here.
The benefit comes from increasing the chances of creating identical instructions. This is shown in
the tests in logical-select.ll (PR32791). InstCombine doesn't fold those directly, but EarlyCSE
can simplify the identical cmps, and then InstCombine can fold the selects together.
The possible regression for the tests in select.ll raises questions about poison/undef:
http://lists.llvm.org/pipermail/llvm-dev/2017-May/113261.html
...but that transform is just as likely to be triggered by this canonicalization as it is to be
missed, so we're just pointing out a commutation deficiency in the pattern matching:
https://reviews.llvm.org/rL228409
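A sketch of the canonicalization, assuming the same predicate inversion already used for icmp+br (types are hypothetical): a non-canonical predicate feeding a select is inverted and the select arms are swapped:

%cmp = icmp sle i32 %a, %b
%sel = select i1 %cmp, i32 %x, i32 %y
; canonicalizes to:
;   %cmp = icmp sgt i32 %a, %b
;   %sel = select i1 %cmp, i32 %y, i32 %x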
Differential Revision: https://reviews.llvm.org/D34242
llvm-svn: 306435
when converting mul by pow2 to shl when the type is larger than 64-bits. NFC
llvm-svn: 306427
when converting mul by pow2 constant to shl for splat vectors. NFC
llvm-svn: 306426
Not sure why this restriction existed, but it seems like we should support any size Constant here.
The particular pattern in the tests is not the only use of this matcher in the tree. There's one in CodeGenPrepare and one in InstSimplify as well.
Differential Revision: https://reviews.llvm.org/D34666
llvm-svn: 306417
Looks like I forgot to 'git add' when I submitted the commit. Thanks to Chandler for noticing.
llvm-svn: 306416
This is based heavily on the work done in D34285. I mostly wanted to do test cleanup for the author to save them some time, but I had a really hard time understanding why it was so hard to write better test cases for these issues.
The problem is that because SROA does a second rewrite of the loads and
because we *don't* propagate !nonnull for non-pointer loads, we first
introduced invalid !nonnull metadata and then stripped it back off just
in time to avoid most ways of this PR manifesting. Moving to the more
careful utility only fixes this by changing the predicate to look at the
new load's type rather than the target type. However, that *does* fix
the bug, and the utility is much nicer including adding range metadata
to model the nonnull property after a conversion to an integer.
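A sketch of that modeling (types and metadata numbering are hypothetical): a !nonnull pointer load rewritten as an integer load can carry the same fact as a wrapped !range that excludes zero:

%p = load i8*, i8** %slot, !nonnull !0
; rewritten as an integer load, the same fact becomes:
;   %i = load i64, i64* %slot.cast, !range !1
;   !1 = !{i64 1, i64 0}   ; the wrapped range [1, 0), i.e. anything but 0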
However, we have bigger problems because we don't actually propagate
*range* metadata, and the utility to do this extracted from instcombine
isn't really in good shape to do this currently. It *only* handles the
case of copying range metadata from an integer load to a pointer load.
It doesn't even handle the trivial cases of propagating from one integer
load to another when they are the same width! This utility will need to
be beefed up prior to using in this location to get the metadata to
fully survive.
And even then, we need to go and teach things to turn the range metadata
into an assume the way we do with nonnull so that when we *promote* an
integer we don't lose the information.
All of this will require a new test case that looks kind-of like
`preserve-nonnull.ll` does here but focuses on range metadata. It will
also likely require more testing because it needs to correctly handle
changes to the integer width, especially as SROA actively tries to
change the integer width!
Last but not least, I'm a little worried about hooking the range
metadata up here because the instcombine logic for converting from
a range metadata *to* a nonnull metadata node seems broken in the face
of non-zero address spaces where null is not mapped to the integer `0`.
So that probably needs to get fixed with test cases both in SROA and in
instcombine to cover it.
But this *does* extract the core PR fix from D34285 of preventing the
!nonnull metadata from being propagated in a broken state just long
enough to feed into promotion and crash value tracking.
On D34285 there is some discussion of zero-extend handling and why it isn't necessary. First, the new load size covers all of the non-undef (i.e., possibly initialized) bits. This may even extend past the original alloca if loading those bits could produce valid data. The only way it's valid for us to zero-extend an integer load in SROA is if the original code had a zero extend or those bits were undef. And we get to assume things like undef *never* satisfies nonnull, so non-undef bits can participate here. No need to special-case the zero-extend handling; it just falls out correctly.
The original credit goes to Ariel Ben-Yehuda! I'm mostly landing this to
save a few rounds of trivial edits fixing style issues and test case
formulation.
Differential Revision: D34285
llvm-svn: 306379
Summary:
EraseInst didn't report through MadeChange that it changed the IR.
It is essential that changes to the IR are reported correctly, since, for example, ReassociatePass::run() will otherwise indicate that all analyses are preserved. Likewise, the CGPassManager determines whether the CallGraph is up-to-date based on the status from InstructionCombiningPass::runOnFunction().
Reviewers: craig.topper, rnk, davide
Reviewed By: rnk, davide
Subscribers: llvm-commits
Differential Revision: https://reviews.llvm.org/D34616
llvm-svn: 306368
select+cmp+cttz/ctlz when the bitwidth is larger than 64 bits.
llvm-svn: 306365
the nonnull attribute distinct from rewriting it into an assume.
llvm-svn: 306358
nonnull as part of fixing PR32902.
llvm-svn: 306353
llvm-svn: 306344
Summary:
vectorizer-maximize-bandwidth is generally useful in terms of performance. I've tested the impact of enabling it by default on SPEC CPU benchmarks on Sandy Bridge machines. The results show a non-negative impact:
spec/2006/fp/C++/444.namd 26.84 -0.31%
spec/2006/fp/C++/447.dealII 46.19 +0.89%
spec/2006/fp/C++/450.soplex 42.92 -0.44%
spec/2006/fp/C++/453.povray 38.57 -2.25%
spec/2006/fp/C/433.milc 24.54 -0.76%
spec/2006/fp/C/470.lbm 41.08 +0.26%
spec/2006/fp/C/482.sphinx3 47.58 -0.99%
spec/2006/int/C++/471.omnetpp 22.06 +1.87%
spec/2006/int/C++/473.astar 22.65 -0.12%
spec/2006/int/C++/483.xalancbmk 33.69 +4.97%
spec/2006/int/C/400.perlbench 33.43 +1.70%
spec/2006/int/C/401.bzip2 23.02 -0.19%
spec/2006/int/C/403.gcc 32.57 -0.43%
spec/2006/int/C/429.mcf 40.35 +0.27%
spec/2006/int/C/445.gobmk 26.96 +0.06%
spec/2006/int/C/456.hmmer 24.4 +0.19%
spec/2006/int/C/458.sjeng 27.91 -0.08%
spec/2006/int/C/462.libquantum 57.47 -0.20%
spec/2006/int/C/464.h264ref 46.52 +1.35%
geometric mean +0.29%
The regression on 453.povray seems real, but is due to secondary effects as all hot functions are bit-identical with and without the flag.
I started this patch to consult upstream opinions on this. It will be greatly appreciated if the community can help test the performance impact of this change on other architectures so that we can decide if this should be target-dependent.
Reviewers: hfinkel, mkuper, davidxl, chandlerc
Reviewed By: chandlerc
Subscribers: rengolin, sanjoy, javed.absar, bjope, dorit, magabari, RKSimon, llvm-commits, mzolotukhin
Differential Revision: https://reviews.llvm.org/D33341
llvm-svn: 306336
Summary: This fixes https://bugs.llvm.org/show_bug.cgi?id=33600
Reviewers: mssimpso, davidxl, Carrot
Reviewed By: mssimpso
Subscribers: aemerson, rengolin, sanjoy, javed.absar, llvm-commits, kristof.beyls
Differential Revision: https://reviews.llvm.org/D34641
llvm-svn: 306334
The recommit fixes three bugs. The first is to use CurrentBlock instead of PREInstr's Parent as the parameter to performScalarPREInsertion, because the Parent of a cloned instruction may be uninitialized. The second is to stop PRE when the edge between CurrentBlock and its predecessor is a backedge and an operand of CurInst is defined inside CurrentBlock: the same value defined inside the loop in the last iteration cannot be regarded as available. The third is an out-of-bounds array access in a flipped if guard.
Right now scalarpre doesn't have phi-translate support, so it misses some simple PRE opportunities. In the following test case, for example, current scalarpre cannot recognize that the last "a * b" is fully redundant, because the a and b used by the last "a * b" expression are both defined by phis:
long a[100], b[100], g1, g2, g3;
__attribute__((pure)) long goo();

void foo(long a, long b, long c, long d) {
  g1 = a * b;
  if (__builtin_expect(g2 > 3, 0)) {
    a = c;
    b = d;
    g2 = a * b;
  }
  g3 = a * b; // fully redundant
}
The patch adds phi-translate support in scalarpre. This is only a temporary
solution before the newpre based on newgvn is available.
llvm-svn: 306313
X86InterleavedAccess test.
Adding base test (to trunk) for store stride=4 VF=32.
llvm-svn: 306286
This was reverted in r306252, but I already had the bug fixed and was
just trying to form a test case.
The original commit factored the logic for forming dedicated exits
inside of LoopSimplify into a helper that could be used elsewhere and
with an approach that required fewer intermediate data structures. See
that commit for full details including the change to the statistic, etc.
The code looked fine to me and my reviewers, but in fact didn't handle
indirectbr correctly -- it left the 'InLoopPredecessors' vector dirty.
If you have code that looks *just* right, you can end up leaking these
predecessors into a subsequent rewrite, and crash deep down when trying
to update PHI nodes for predecessors that don't exist.
I've added an assert that makes the bug much more obvious, and then
changed the code to reliably clear the vector so we don't get this bug
again in some other form as the code changes.
I've also added a test case that *does* manage to catch this while also
giving some nice positive coverage in the face of indirectbr.
The real code that found this came out of what I think is CPython's
interpreter loop, but any code with really "creative" interpreter loops
mixing indirectbr and other exit paths could manage to tickle the bug.
It was hard to reduce the original test case because, in addition to having a particular pattern of IR, the whole thing depends on the order of the predecessors, which in turn depends on use-list order. The test
case added here was designed so that in multiple different predecessor
orderings it should always end up going down the same path and tripping
the same bug. I hope. At least, it tripped it for me without
manipulating the use list order which is better than anything bugpoint
could do...
llvm-svn: 306257
I did some basic testing while looking for a bug in my recent change to loop simplify, and even though it didn't find the bug, it seems like a useful improvement anyway.
llvm-svn: 306256
This leads to a segfault. Chandler already has a test case and should be
able to recommit with a fix soon.
llvm-svn: 306252
http://rise4fun.com/Alive/i8Q
A narrow bitwise logic op is obviously better than math for value tracking,
and zext is better than sext. Typically, the 'not' will be folded into an
icmp predicate.
The IR difference would even survive through codegen for x86, so we would see
worse code:
https://godbolt.org/g/C14HMF
one_or_zero(int, int):              # @one_or_zero(int, int)
        xorl    %eax, %eax
        cmpl    %esi, %edi
        setle   %al
        retq

one_or_zero_alt(int, int):          # @one_or_zero_alt(int, int)
        xorl    %ecx, %ecx
        cmpl    %esi, %edi
        setg    %cl
        movl    $1, %eax
        subl    %ecx, %eax
        retq
llvm-svn: 306243
Differential Revision: http://reviews.llvm.org/D34085
llvm-svn: 306231
dereferencable -> dereferenceable
llvm-svn: 306210
Summary:
InstCombine replaces large allocas with small global constants, causing buffer overflows on valid code; see PR33372.
This fix permits the optimization only if the global is dereferenceable for the alloca's size.
Fixes PR33372
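A sketch of the problematic shape (the global and sizes are hypothetical): the alloca is initialized by a memcpy from a constant global smaller than the alloca, so substituting the global for the alloca would let later reads run past its end:

@g = constant [8 x i8] c"12345678"

declare void @llvm.memcpy.p0i8.p0i8.i32(i8* nocapture, i8* nocapture, i32, i32, i1)

define void @f() {
  %buf = alloca [32 x i8]
  %p = bitcast [32 x i8]* %buf to i8*
  %src = getelementptr inbounds [8 x i8], [8 x i8]* @g, i32 0, i32 0
  call void @llvm.memcpy.p0i8.p0i8.i32(i8* %p, i8* %src, i32 8, i32 1, i1 false)
  ; replacing %buf with @g is only safe if @g is dereferenceable for the
  ; full 32 bytes the alloca provides
  ret void
}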
Reviewers: eugenis, majnemer, chandlerc
Subscribers: llvm-commits
Differential Revision: https://reviews.llvm.org/D34311
llvm-svn: 306194
Summary: visitSwitchInst should not use INT_MAX when Cost becomes negative. Instead of INT_MAX, use a valid upper-bound cost when Cost overflows.
Reviewers: hans, echristo, dmgreen
Reviewed By: dmgreen
Subscribers: mcrosier, javed.absar, llvm-commits, eraman
Differential Revision: https://reviews.llvm.org/D34436
llvm-svn: 306118