bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message	Author	Age	Files	Lines
*	Add a counter-function insertion pass	Hal Finkel	2016-09-01	2	-0/+43
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	As discussed in https://reviews.llvm.org/D22666, our current mechanism to support -pg profiling, where we insert calls to mcount(), or some similar function, is fundamentally broken. We insert these calls in the frontend, which means they get duplicated when inlining, and so the accumulated execution counts for the inlined-into functions are wrong. Because we don't want the presence of these functions to affect optimization, they should be inserted in the backend. Here's a pass which would do just that. The knowledge of the name of the counting function lives in the frontend, so we're passing it here as a function attribute. Clang will be updated to use this mechanism. Differential Revision: https://reviews.llvm.org/D22825 llvm-svn: 280347
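The double counting described in this commit can be illustrated with a toy model. This is only a sketch, not LLVM code: function bodies are plain lists of strings, `insert_counter` and `inline` are hypothetical helpers, and `mcount` stands in for whichever counting function the frontend names.

```python
# Toy model: a function body is a list of operations; "call:f" calls f.

def insert_counter(body, counter="mcount"):
    """Prepend one call to the counting function (hypothetical helper)."""
    return [f"call:{counter}"] + body


def inline(caller_body, callee_name, callee_body):
    """Replace each call to callee_name with a copy of the callee's body."""
    out = []
    for op in caller_body:
        if op == f"call:{callee_name}":
            out.extend(callee_body)
        else:
            out.append(op)
    return out


callee = ["work"]
caller = ["call:callee", "ret"]

# Frontend-style insertion: counters are added before inlining, so the
# callee's counter call is copied into the caller.
early = inline(insert_counter(caller), "callee", insert_counter(callee))

# Backend-style insertion: inlining happens first, then one counter is added.
late = insert_counter(inline(caller, "callee", callee))

early_count = early.count("call:mcount")
late_count = late.count("call:mcount")
print(early_count, late_count)
```

In this model the early variant ends up with two counter calls in the caller and the late variant with one, which is the discrepancy the commit message attributes to frontend insertion.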
*	[XRay] Detect and emit sleds for sibling/tail calls	Dean Michael Berris	2016-09-01	1	-0/+41
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This change promotes the 'isTailCall(...)' member function to TargetInstrInfo as a query interface for determining on a per-target basis whether a given MachineInstr is a tail call instruction. We build upon this in the XRay instrumentation pass to emit special sleds for tail call optimisations, where we emit the correct kind of sled. The tail call sleds look like a mix between the function entry and function exit sleds. Form-wise, the sled comes before the "jmp" instruction that implements the tail call similar to how we do it for the function entry sled. Functionally, because we know this is a tail call, it behaves much like an exit sled -- i.e. at runtime we may use the exit trampolines instead of a different kind of trampoline. A follow-up change to recognise these sleds will be done in compiler-rt, so that we can start intercepting these initially as exits, but also have the option to have different log entries to more accurately reflect that this is actually a tail call. Reviewers: echristo, rSerge, majnemer Subscribers: mehdi_amini, dberris, llvm-commits Differential Revision: https://reviews.llvm.org/D23986 llvm-svn: 280334
*	Revert "Add asm.js-style setjmp/longjmp handling for wasm"	Heejin Ahn	2016-09-01	4	-303/+21
\| \| \| \| \| \|	This reverts commit r280302, it broke the integration tests. llvm-svn: 280329
*	Add -fprofile-dir= to clang.	Nick Lewycky	2016-08-31	1	-0/+27
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	-fprofile-dir=path allows the user to specify where .gcda files should be emitted when the program is run. In particular, this is the first flag that causes the .gcno and .o files to have different paths, LLVM is extended to support this. -fprofile-dir= does not change the file name in the .gcno (and thus where lcov looks for the source) but it does change the name in the .gcda (and thus where the runtime library writes the .gcda file). It's different from a GCOV_PREFIX because a user can observe that the GCOV_PREFIX_STRIP will strip paths off of -fprofile-dir= but not off of a supplied GCOV_PREFIX. To implement this we split -coverage-file into -coverage-data-file and -coverage-notes-file to specify the two different names. The !llvm.gcov metadata node grows from a 2-element form {string coverage-file, node dbg.cu} to 3-elements, {string coverage-notes-file, string coverage-data-file, node dbg.cu}. In the 3-element form, the file name is already "mangled" with .gcno/.gcda suffixes, while the 2-element form left that to the middle end pass. llvm-svn: 280306
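The two shapes of the !llvm.gcov node can be sketched as follows. This is a toy model using Python tuples rather than LLVM metadata; `gcov_file_names` is a hypothetical helper, and the handling of the 2-element form assumes that the middle-end "mangling" amounts to replacing the file extension with .gcno/.gcda.

```python
import os


def gcov_file_names(node):
    """Return (notes_file, data_file) for a toy !llvm.gcov tuple."""
    if len(node) == 3:
        # 3-element form: both names arrive already suffixed.
        notes_file, data_file, _cu = node
        return notes_file, data_file
    if len(node) == 2:
        # 2-element form: derive both names from one coverage file
        # (assumption: the extension is swapped for .gcno/.gcda).
        coverage_file, _cu = node
        stem, _ext = os.path.splitext(coverage_file)
        return stem + ".gcno", stem + ".gcda"
    raise ValueError("expected a 2- or 3-element node")


old_form = gcov_file_names(("/build/foo.o", "dbg.cu"))
new_form = gcov_file_names(("/build/foo.gcno", "/prof/foo.gcda", "dbg.cu"))
print(old_form)
print(new_form)
```

Only the 3-element form can place the two files in different directories, which is what -fprofile-dir= needs.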
*	Add asm.js-style setjmp/longjmp handling for wasm	Heejin Ahn	2016-08-31	4	-21/+303
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch adds asm.js-style setjmp/longjmp handling support for WebAssembly. It also uses JavaScript's try and catch mechanism. Reviewers: jpp, dschuff Subscribers: jfb, dschuff Differential Revision: https://reviews.llvm.org/D23928 llvm-svn: 280302
*	Revert "Add an optional parameter with a list of undefs to extendToIndices"	Reid Kleckner	2016-08-31	1	-112/+0
\| \| \| \| \| \| \| \| \| \| \| \|	This reverts commit r280268, it causes all MSVC 2013 to ICE. This appears to have been fixed in a later MSVC 2013 update, because I cannot reproduce it locally. That said, all upstream LLVM bots are broken right now, so I am reverting. Also reverts dependent change r280275, "[Hexagon] Deal with undefs when extending live intervals". llvm-svn: 280301
*	[InstCombine] allow icmp (shr exact X, C2), C fold for splat constant vectors	Sanjay Patel	2016-08-31	1	-3/+1
\| \| \| \| \| \| \|	The enhancement to foldICmpDivConstant ( http://llvm.org/viewvc/llvm-project?view=revision&revision=280299 ) allows us to remove the ConstantInt check; no other changes needed. llvm-svn: 280300
*	[InstCombine] allow icmp (div X, Y), C folds for splat constant vectors	Sanjay Patel	2016-08-31	4	-44/+27
\| \| \| \| \| \|	Converting all of the overflow ops to APInt looked risky, so I've left that as a TODO. llvm-svn: 280299
*	AMDGPU: Fix introducing stack access on unaligned v16i8	Matt Arsenault	2016-08-31	2	-6/+53
\| \| \| \|	llvm-svn: 280298
*	GlobalISel: use G_TYPE to annotate physregs with a type.	Tim Northover	2016-08-31	23	-269/+269
\| \| \| \| \| \| \| \| \| \|	More preparation for dropping source types from MachineInstrs: registers coming out of already-selected code (i.e. non-generic instructions) don't have a type, but that information is needed so we must add it manually. This is done via a new G_TYPE instruction. llvm-svn: 280292
*	[WebAssembly] Disable folding of GA+reg into load/store constant offsets	Derek Schuff	2016-08-31	2	-40/+87
\| \| \| \| \| \| \| \| \| \| \|	Summary: If the register has a negative value then unsigned overflow will occur; this case is sometimes even created intentionally by LSR. For now disable GA+reg folding. Fixes PR29127 Differential Revision: https://reviews.llvm.org/D24053 llvm-svn: 280285
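The overflow this commit refers to can be shown with plain integer arithmetic. The sketch below assumes WebAssembly's addressing rule that a load/store address is the unsigned 32-bit base plus the constant offset, computed without wraparound; the function names and the values of `ga` and `reg` are illustrative only.

```python
U32 = 1 << 32


def address_with_add(ga, reg):
    """Explicit i32.add: the sum wraps modulo 2**32 before the access."""
    return (ga + reg) % U32


def address_with_folded_offset(ga, reg):
    """GA folded into the constant offset: the register is read as an
    unsigned base and the offset is added without wrapping."""
    return (reg % U32) + ga


ga = 1024   # hypothetical global address
reg = -4    # negative index, e.g. as produced by LSR

wrapped = address_with_add(ga, reg)
folded = address_with_folded_offset(ga, reg)
print(wrapped, folded, folded >= U32)
```

With the explicit add the access lands at 1020; with the folded offset the effective address exceeds the 32-bit range, so the access would be out of bounds.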
*	[EarlyCSE] Optionally use MemorySSA. NFC.	Geoff Berry	2016-08-31	15	-0/+49
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Use MemorySSA, if requested, to do less conservative memory dependency checking. This change doesn't enable the MemorySSA enhanced EarlyCSE in the default pipelines, so should be NFC. Reviewers: dberlin, sanjoy, reames, majnemer Subscribers: mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D19821 llvm-svn: 280279
*	Actually check for the diagnostic to be emitted!	Quentin Colombet	2016-08-31	1	-1/+3
\| \| \| \| \| \|	This makes the test case in r280273 actually useful! llvm-svn: 280276
*	[Hexagon] Deal with undefs when extending live intervals	Krzysztof Parzyszek	2016-08-31	1	-0/+112
\| \| \| \|	llvm-svn: 280275
*	AMDGPU/SI: Make sure llvm.amdgcn.implicitarg.ptr() is at least 4-byte aligned	Tom Stellard	2016-08-31	1	-0/+14
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: This fixes some OpenCV tests that were broken by libclc commit r276443. Reviewers: arsenm, jvesely Subscribers: arsenm, wdng, llvm-commits Differential Revision: https://reviews.llvm.org/D24051 llvm-svn: 280274
*	[TargetPassConfig] Add a hook to tell whether GlobalISel should warn on ↵	Quentin Colombet	2016-08-31	1	-1/+4
\| \| \| \| \| \| \| \| \|	fallback. Thanks to this patch, we now have a way to easily see if GlobalISel failed. llvm-svn: 280273
*	Next set of additional error checks for invalid Mach-O files for bad load ↵	Kevin Enderby	2016-08-31	6	-0/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	commands that use the MachO::linkedit_data_command type for the load commands that are currently used in the MachOObjectFile constructor. This contains the missing checks for LC_DATA_IN_CODE and LC_LINKER_OPTIMIZATION_HINT load commands and the fields for the MachO::linkedit_data_command type. Checking for other load commands that use this type will be added later. Also fixed a couple of places that were using sizeof(MachOObjectFile::LoadCommandInfo) that should have been using sizeof(MachO::load_command). llvm-svn: 280267
*	[EarlyCSE] Allow forwarding a non-invariant load into an invariant load.	Geoff Berry	2016-08-31	1	-7/+3
\| \| \| \| \| \| \| \| \| \|	Reviewers: sanjoy Subscribers: mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D23935 llvm-svn: 280265
*	[LTO] Fix common test to reflect r279911 and move to X86 subdirectory	Teresa Johnson	2016-08-31	3	-9/+15
\| \| \| \| \| \| \| \| \| \| \| \|	Adjust the test to reflect the changes to common handling in r279911. This test wasn't running due to an incorrect REQUIRES and thus missed being modified for r279911 before. It was changed to XFAIL when the bad REQUIRES was discovered. Remove the XFAIL and move to a new X86 subdirectory that will properly disable on non-X86. llvm-svn: 280256
*	[codeview] Emit vtable shape information	Reid Kleckner	2016-08-31	2	-42/+548
\| \| \| \| \| \| \| \| \| \| \| \| \|	The shape of the vtable is passed down as the size of the __vtbl_ptr_type. This special pointer type appears both as the pointee type of the vptr type, and by itself in every dynamic class. For classes with multiple vtables, only the shape of the primary vftable is included, as the shape of all secondary vftables will be the same as in the base class. Fixes PR28150 llvm-svn: 280254
*	[statepoints][experimental] Add support for live-in semantics of values in ↵	Philip Reames	2016-08-31	2	-0/+154
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	deopt bundles This is a first step towards supporting deopt value lowering and reporting entirely with the register allocator. I hope to build on this in the near future to support live-on-return semantics, but I have a use case which allows me to test and investigate code quality with just the live-in semantics so I've chosen to start there. For those curious, my use case is our implementation of the "__llvm_deoptimize" function we bind to @llvm.deoptimize. I'm choosing not to hard code that fact in the patch and instead make it configurable via function attributes. The basic approach here is modelled on what is done for the "Live In" values on stackmaps and patchpoints. (A secondary goal here is to remove one of the last barriers to merging the pseudo instructions.) We start by adding the operands directly to the STATEPOINT SDNode. Once we've lowered to MI, we extend the remat logic used by the register allocator to fold virtual register uses into StackMap::Indirect entries as needed. This does rely on the fact that the register allocator rematerializes. If it didn't along some code path, we could end up with more vregs than physical registers and fail to allocate. Today, we only fold in the register allocator. This can create some weird effects when combined with arguments passed on the stack because we don't fold them appropriately. I have an idea how to fix that, but it needs this patch in place to work on that effectively. (There's some weird interaction with the scheduler as well, more investigation needed.) My near term plan is to land this patch off-by-default, experiment in my local tree to identify any correctness issues and then start fixing codegen problems one by one as I find them. Once I have the live-in lowering fully working (both correctness and code quality), I'm hoping to move on to the live-on-return semantics.
Note: I don't have any known miscompiles with this patch enabled, but I'm pretty sure I'll find at least a couple. Thus, the "experimental" tag and the fact it's off by default. Differential Revision: https://reviews.llvm.org/D24000 llvm-svn: 280250
*	[X86][SSE] Improve awareness of (v)cvtpd2ps implicit zeroing of upper ↵	Simon Pilgrim	2016-08-31	1	-13/+11
\| \| \| \| \| \| \| \| \| \|	64-bits of xmm result Associate x86_sse2_cvtpd2ps with X86ISD::VFPROUND to avoid inserting unnecessary zeroing shuffles. Differential Revision: https://reviews.llvm.org/D23797 llvm-svn: 280249
*	Clang patch r280064 introduced ways to set the FP exceptions and denormal	Sjoerd Meijer	2016-08-31	1	-0/+11
\| \| \| \| \| \| \| \| \| \|	types. This is the LLVM counterpart and it adds options that map onto FP exceptions and denormal build attributes allowing better fp math library selections. Differential Revision: https://reviews.llvm.org/D24070 llvm-svn: 280246
*	Fixed spill stack objects are mutable	Krzysztof Parzyszek	2016-08-31	1	-0/+69
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D24039 llvm-svn: 280244
*	Revert "[SimplifyCFG] Change the algorithm in SinkThenElseCodeToEnd"	James Molloy	2016-08-31	1	-60/+0
\| \| \| \| \| \|	This reverts commit r280216 - it caused buildbot failures. llvm-svn: 280234
*	Revert "[SimplifyCFG] Handle tail-sinking of more than 2 incoming branches"	James Molloy	2016-08-31	6	-133/+22
\| \| \| \| \| \|	This reverts commit r280217. r280216 caused buildbot failures - backing out the entire chain. llvm-svn: 280233
*	Revert "[SimplifyCFG] Add a workaround to fix PR30188"	James Molloy	2016-08-31	1	-23/+0
\| \| \| \| \| \|	This reverts commit r280219. r280216 caused buildbot failures - backing out the entire chain. llvm-svn: 280232
*	Revert "[SimplifyCFG] Fix bootstrap failure after r280220"	James Molloy	2016-08-31	1	-23/+0
\| \| \| \| \| \|	This reverts commit r280228. r280216 caused buildbot failures - backing out the entire sequence. llvm-svn: 280231
*	[SimplifyCFG] Fix bootstrap failure after r280220	James Molloy	2016-08-31	1	-0/+23
\| \| \| \| \| \|	We check that a sinking candidate is used by only one PHI node during our legality checks. However for instructions that are used by other sinking candidates our heuristic is less conservative. This can result in a candidate actually being illegal when we come to sink it because of how we sunk a predecessor. Do the used-by-only-one-PHI checks again during sinking to ensure we don't crash. llvm-svn: 280228
*	AMDGPU/SI: Handle aliases in AMDGPUAlwaysInlinePass	Nikolay Haustov	2016-08-31	1	-0/+25
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Simply replace usage of aliases to functions with aliasee. This came up when bitcode linking to builtin library and calls to aliases not being resolved. Also made minor improvements to existing test. Reviewers: tstellarAMD, alex-t, vpykhtin Subscribers: arsenm, wdng, rampitec Differential Revision: https://reviews.llvm.org/D24023 llvm-svn: 280221
*	[SimplifyCFG] Add a workaround to fix PR30188	James Molloy	2016-08-31	1	-0/+23
\| \| \| \| \| \| \| \|	We're sinking stores, which is a good thing, but in the process creating selects for the store address operand, which SROA/Mem2Reg can't look through, which caused serious regressions. The real fix is in SROA, which I'll be looking into. llvm-svn: 280219
*	[SimplifyCFG] Handle tail-sinking of more than 2 incoming branches	James Molloy	2016-08-31	6	-22/+133
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This was a real restriction in the original version of SinkIfThenCodeToEnd. Now it's been rewritten, the restriction can be lifted. As part of this, we handle a very common and useful case where one of the incoming branches is actually conditional. Consider:

	    if (a)
	      x(1);
	    else if (b)
	      x(2);

	This produces the following CFG:

	         [if]
	        /    \
	    [x(1)] [if]
	      |     | \
	      |     |  \
	      |  [x(2)] |
	       \    |  /
	        [ end ]

	[end] has two unconditional predecessor arcs and one conditional. The conditional refers to the implicit empty 'else' arc. This same pattern can also be caused by an empty default block in a switch. We can't sink the call to x() down to end because no call to x() happens on the third incoming arc (assume that x() has sideeffects for the sake of argument; if something is safe to speculate we could indeed sink nevertheless but this cannot happen in the general case and causes many extra selects). We are now able to detect this case and split off the unconditional arcs to a common successor:

	         [if]
	        /    \
	    [x(1)] [if]
	      |     | \
	      |     |  \
	      |  [x(2)] |
	       \   /    |
	   [sink.split] |
	         \     /
	         [ end ]

	Now we can sink the call to x() into %sink.split. This can cause significant code simplification in many testcases. llvm-svn: 280217
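The split described in this commit can be sketched on a toy CFG. This is not the SimplifyCFG implementation: the CFG is a dict from block name to successor list, a predecessor counts as unconditional when the target is its only successor, and `split_unconditional_preds` and the block names are hypothetical.

```python
def split_unconditional_preds(cfg, block, new_name="sink.split"):
    """Route all unconditional predecessors of `block` through a new block.

    Returns a new CFG; the input is left unchanged. Nothing is split when
    there are fewer than two unconditional predecessors, or when every
    predecessor is unconditional (no split is needed in that case).
    """
    preds = [b for b, succs in cfg.items() if block in succs]
    uncond = [b for b in preds if cfg[b] == [block]]
    if len(uncond) < 2 or len(uncond) == len(preds):
        return {b: list(s) for b, s in cfg.items()}
    new_cfg = {b: list(s) for b, s in cfg.items()}
    for b in uncond:
        new_cfg[b] = [new_name]
    new_cfg[new_name] = [block]
    return new_cfg


# if (a) x(1); else if (b) x(2);
cfg = {
    "if.a": ["x1", "if.b"],
    "x1": ["end"],
    "if.b": ["x2", "end"],   # conditional arc into end (the empty else)
    "x2": ["end"],
    "end": [],
}

split = split_unconditional_preds(cfg, "end")
for name, succs in split.items():
    print(name, "->", succs)
```

After the split, both x1 and x2 branch only to sink.split, so a call common to them could be sunk there without affecting the conditional arc from if.b.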
*	[SimplifyCFG] Change the algorithm in SinkThenElseCodeToEnd	James Molloy	2016-08-31	1	-0/+60
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	r279460 rewrote this function to be able to handle more than two incoming edges and took pains to ensure this didn't regress anything. This time we change the logic for determining if an instruction should be sunk. Previously we used a single pass greedy algorithm - sink instructions until one requires more than one PHI node or we run out of instructions to sink. This had the problem that sinking instructions that had non-identical but trivially the same operands needed extra logic so we sunk them aggressively. For example:

	    %a = load i32* %b          %d = load i32* %b
	    %c = gep i32* %a, i32 0    %e = gep i32* %d, i32 1

	Sinking %c and %e would naively require two PHI merges as %a != %d. But the loads are obviously equivalent (and maybe can't be hoisted because there is no common predecessor). This is why we implemented the fairly complex function areValuesTriviallySame(), to look through trivial differences like this. However it's just not clever enough. Instead, throw areValuesTriviallySame away, use pointer equality to check equivalence of operands and switch to a two-stage algorithm. In the "scan" stage, we look at every sinkable instruction in isolation from end of block to front. If it's sinkable, we keep track of all operands that required PHI merging. In the "sink" stage, we iteratively sink the last non-terminator in the source blocks. But when calculating how many PHIs are actually required to be inserted (to work out if we should stop or not) we remove any values that have already been sunk from the set of PHI-merges required, which allows us to be more aggressive. This turns an algorithm with potentially recursive lookahead (looking through GEPs, casts, loads and any other instruction potentially not CSE'd) to two linear scans. llvm-svn: 280216
*	[SimplifyCFG] Tail-merge calls with sideeffects	James Molloy	2016-08-31	1	-1/+24
\| \| \| \| \| \| \| \| \| \| \| \| \|	This was deliberately disabled during my rewrite of SinkIfThenToEnd to keep behaviour at least vaguely consistent with the previous version and keep it as close to NFC as I could. There's no real reason not to merge sideeffect calls though, so let's do it! Small fixup along the way to ensure we don't create indirect calls. Should fix PR28964. llvm-svn: 280215
*	[X86][SSE] Improve awareness of fptrunc implicit zeroing of upper 64-bits of ↵	Simon Pilgrim	2016-08-31	1	-51/+43
\| \| \| \| \| \| \| \| \| \|	xmm result Add patterns to avoid inserting unnecessary zeroing shuffles when lowering fptrunc to (v)cvtpd2ps Differential Revision: https://reviews.llvm.org/D23797 llvm-svn: 280214
*	[AVX-512] Add patterns to select masked logical operations if the select has ↵	Craig Topper	2016-08-31	2	-192/+96
\| \| \| \| \| \| \| \|	a floating point type. This is needed in order to replace the masked floating point logical op intrinsics with native IR. llvm-svn: 280195
*	[AVX-512] Add test cases for masked floating point logic operations with ↵	Craig Topper	2016-08-31	2	-1/+1192
\| \| \| \| \| \| \| \|	bitcasts between the logic ops and the select. We don't currently select masked operations for these cases. Test cases taken from optimized clang output after trying to convert the masked floating point logical op intrinsics to native IR. llvm-svn: 280194
*	[X86] Regenerate a test using update_llc_test_checks.py.	Craig Topper	2016-08-31	1	-48/+81
\| \| \| \|	llvm-svn: 280193
*	[XRay] Support multiple return instructions in a single basic block	Dean Michael Berris	2016-08-31	2	-0/+60
\| \| \| \| \| \| \|	Add a .mir test to catch this case, and fix the xray-instrumentation pass to handle it appropriately. llvm-svn: 280192
*	[PowerPC] Don't spill the frame pointer twice	Hal Finkel	2016-08-31	1	-0/+26
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	When a function contains something, such as inline asm, which explicitly clobbers the register used as the frame pointer, don't spill it twice. If we need a frame pointer, it will be saved/restored in the prologue/epilogue code. Explicitly spilling it again will reuse the same spill slot used by the prologue/epilogue code, thus clobbering the saved value. The same applies to the base-pointer or PIC-base register. Partially fixes PR26856. Thanks to Ulrich for his analysis and the small inline-asm reproducer. llvm-svn: 280188
*	[Coroutines] Part 10: Add coroutine promise support.	Gor Nishanov	2016-08-31	1	-0/+71
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: 1) CoroEarly now lowers llvm.coro.promise intrinsic that allows to obtain a coroutine promise pointer from a coroutine frame and vice versa. 2) CoroFrame now interprets Promise argument of llvm.coro.begin to place CoroutinPromise alloca at a deterministic offset from the coroutine frame. Now, the coroutine promise example from docs\Coroutines.rst compiles and produces expected result (see test/Transform/Coroutines/ex4.ll). Reviewers: majnemer Subscribers: llvm-commits, mehdi_amini Differential Revision: https://reviews.llvm.org/D23993 llvm-svn: 280184
*	[llvm-cov] Drop redundant "No." suffix in a column title	Vedant Kumar	2016-08-31	1	-1/+1
\| \| \| \|	llvm-svn: 280181
*	[LoadStoreVectorizer] Change VectorSet to Vector to match head and tail ↵	Alina Sbirlea	2016-08-30	2	-0/+94
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	positions. Resolves PR29148. Summary: LSV was using two vector sets (heads and tails) to track pairs of adjacent positions to vectorize. A recent optimization is trying to obtain the longest chain to vectorize and assumes the positions in heads(H) and tails(T) match, which is not the case if there are multiple tails for the same head. e.g.: i1: store a[0] i2: store a[1] i3: store a[1] Leads to: H: i1 T: i2 i3 Instead of: H: i1 i1 T: i2 i3 So the positions for instructions that follow i3 will have different indexes in H/T. This patch resolves PR29148. This issue also surfaced the fact that if the chain is too long, and TLI returns a "not-fast" answer, the whole chain will be abandoned for vectorization, even though a smaller one would be beneficial. Added a testcase and FIXME for this. Reviewers: tstellarAMD, arsenm, jlebar Subscribers: mzolotukhin, wdng, llvm-commits Differential Revision: https://reviews.llvm.org/D24057 llvm-svn: 280179
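The index mismatch from this commit message can be reproduced with a toy model. This is a sketch only: an insertion-ordered, duplicate-free list stands in for the vector set, `collect_as_sets` and `collect_as_vectors` are hypothetical helpers, and the extra pair (i4, i5) is added here to show what happens to instructions that follow i3.

```python
def collect_as_sets(pairs):
    """Insertion-ordered containers that drop duplicates (set-like)."""
    heads, tails = [], []
    for head, tail in pairs:
        if head not in heads:
            heads.append(head)
        if tail not in tails:
            tails.append(tail)
    return heads, tails


def collect_as_vectors(pairs):
    """Plain vectors: one entry per pair, duplicates kept."""
    heads = [head for head, _ in pairs]
    tails = [tail for _, tail in pairs]
    return heads, tails


# i1 has two candidate tails (i2 and i3); (i4, i5) is a later pair.
pairs = [("i1", "i2"), ("i1", "i3"), ("i4", "i5")]

set_heads, set_tails = collect_as_sets(pairs)
vec_heads, vec_tails = collect_as_vectors(pairs)

print(set_heads, set_tails)
print(vec_heads, vec_tails)
print(set_heads.index("i4"), set_tails.index("i5"))
print(vec_heads.index("i4"), vec_tails.index("i5"))
```

With the set-like containers, i4 sits at position 1 in the heads but its tail i5 sits at position 2; with plain vectors both are at position 2, so positions in H and T line up again.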
*	[InstCombine] add tests to show type limitations of InsertRangeTest and callers	Sanjay Patel	2016-08-30	3	-3/+56
\| \| \| \|	llvm-svn: 280175
*	Add a test file, macho-invalid-dysymtab-extreloff-nextrel,	Kevin Enderby	2016-08-30	1	-0/+0
\| \| \| \| \| \|	I forgot to do an svn add on. llvm-svn: 280167
*	Next set of additional error checks for invalid Mach-O files for bad ↵	Kevin Enderby	2016-08-30	15	-0/+45
\| \| \| \| \| \| \| \|	LC_DYSYMTAB’s. This contains the missing checks for LC_DYSYMTAB load command fields. llvm-svn: 280161
*	GlobalISel: combine extracts & sequences created for legalization	Tim Northover	2016-08-30	2	-10/+108
\| \| \| \| \| \| \| \|	Legalization ends up creating many G_SEQUENCE/G_EXTRACT pairs which leads to inefficient codegen (even for -O0), so add a quick pass over the function to remove them again. llvm-svn: 280155
*	AMDGPU: Relax SGPR asm constraint register class	Matt Arsenault	2016-08-30	1	-0/+10
\| \| \| \| \| \| \|	s should be SReg_32 to be as general as possible. This can avoid a copy from m0. llvm-svn: 280154
*	Revert "ELFDumper: Unversioned symbols must not have trailing @"	Hemant Kulkarni	2016-08-30	2	-13/+13
\| \| \| \| \| \| \| \|	This reverts commit 8df7a877949e8782a3a28e3ecdb0770c1e444056. Fixing other repositories and adding changes together. llvm-svn: 280152
*	[LoopVectorizer] Predicate instructions in blocks with several incoming edges	Michael Kuperstein	2016-08-30	2	-4/+62
\| \| \| \| \| \| \| \| \| \|	We don't need to limit predication to blocks that have a single incoming edge, we just need to use the right mask. This fixes PR30172. Differential Revision: https://reviews.llvm.org/D24009 llvm-svn: 280148