bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[InstCombine] Annotate strdup with deref_or_null	David Bolvansky	2019-09-17	1	-0/+7
\| \| \| \|	llvm-svn: 372098
*	[NFC] remove unused functions	Guillaume Chatelet	2019-09-16	1	-8/+0
\| \| \| \| \| \| \| \| \| \| \| \|	Reviewers: courbet Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67616 llvm-svn: 371994
*	[InstSimplify] simplifyUnsignedRangeCheck(): handle few tautological cases ↵	Roman Lebedev	2019-09-14	1	-16/+37
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	(PR43251) Summary: This is split off from D67356, since these cases produce a constant, no real need to keep them in instcombine. Alive proofs: https://rise4fun.com/Alive/u7Fk https://rise4fun.com/Alive/4lV https://bugs.llvm.org/show_bug.cgi?id=43251 Reviewers: spatel, nikic, xbolva00 Reviewed By: spatel Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67498 llvm-svn: 371921
*	[SCEV] Add smin support to getRangeRef	Philip Reames	2019-09-12	1	-0/+8
\| \| \| \| \| \| \| \|	We were failing to compute trip counts (both exact and maximum) for any loop which involved a comparison against either an umin or smin. It looks like this simply got missed when we added smin/umin to SCEV. (Note: umin was submitted separately earlier today. Turned out two folks hit this at the same time.) Differential Revision: https://reviews.llvm.org/D67514 llvm-svn: 371776
*	[ConstantFolding] Expand folding of some library functions	Evandro Menezes	2019-09-12	1	-3/+22
\| \| \| \| \| \| \| \| \|	Expanding the folding of `nearbyint()`, `rint()` and `trunc()` to library functions, in addition to the current support for intrinsics. Differential revision: https://reviews.llvm.org/D67468 llvm-svn: 371774
*	[SCEV] Support SCEVUMinExpr in getRangeRef.	Florian Hahn	2019-09-12	1	-0/+8
\| \| \| \| \| \| \| \| \| \| \| \| \|	This patch adds support for SCEVUMinExpr to getRangeRef, similar to the support for SCEVUMaxExpr. Reviewers: sanjoy.google, efriedma, reames, nikic Reviewed By: sanjoy.google Differential Revision: https://reviews.llvm.org/D67177 llvm-svn: 371768
*	[LICM/AST] Check if the AliasAny set is removed from the tracker.	Alina Sbirlea	2019-09-12	1	-2/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Resolves PR38513. Credit to @bjope for debugging this. Reviewers: hfinkel, uabelho, bjope Subscribers: sanjoy.google, bjope, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67417 llvm-svn: 371752
*	[LV] Support invariant addresses in speculation logic	Philip Reames	2019-09-12	1	-10/+18
\| \| \| \| \| \| \| \| \| \|	Implement a TODO from rL371452, and handle loop invariant addresses in predicated blocks. If we can prove that the load is safe to speculate into the header, then we can avoid using a masked.load in favour of a normal load. This is mostly about vectorization robustness. In the common case, it's generally expected that LICM/LoadStorePromotion would have eliminated such loads entirely. Differential Revision: https://reviews.llvm.org/D67372 llvm-svn: 371745
*	[ConstProp] allow folding for fma that produces NaN	Sanjay Patel	2019-09-12	1	-7/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Folding for fma/fmuladd was added here: rL202914 ...and as seen in existing/unchanged tests, that works to propagate NaN if it's already an input, but we should fold an fma() that creates NaN too. From IEEE-754-2008 7.2 "Invalid Operation", there are 2 clauses that apply to fma, so I added tests for those patterns: c) fusedMultiplyAdd: fusedMultiplyAdd(0, ∞, c) or fusedMultiplyAdd(∞, 0, c) unless c is a quiet NaN; if c is a quiet NaN then it is implementation defined whether the invalid operation exception is signaled d) addition or subtraction or fusedMultiplyAdd: magnitude subtraction of infinities, such as: addition(+∞, −∞) Differential Revision: https://reviews.llvm.org/D67446 llvm-svn: 371735
*	[InstSimplify] simplifyUnsignedRangeCheck(): handle more cases (PR43251)	Roman Lebedev	2019-09-12	1	-0/+20
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: I don't have a direct motivational case for this, but it would be good to have this for completeness/symmetry. This pattern is basically the motivational pattern from https://bugs.llvm.org/show_bug.cgi?id=43251 but with different predicate that requires that the offset is non-zero. The completeness bit comes from the fact that a similar pattern (offset != zero) will be needed for https://bugs.llvm.org/show_bug.cgi?id=43259, so it'd seem to be good to not overlook very similar patterns.. Proofs: https://rise4fun.com/Alive/21b Also, there is something odd with `isKnownNonZero()`, if the non-zero knowledge was specified as an assumption, it didn't pick it up (PR43267) Reviewers: spatel, nikic, xbolva00 Reviewed By: spatel Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67411 llvm-svn: 371718
*	[ConstantFolding] Refactor math functions to use LLVM ones (NFC)	Evandro Menezes	2019-09-11	1	-37/+42
\| \| \| \| \| \| \| \| \|	When possible, replace calls to library routines on the host with equivalent ones in LLVM. Differential revision: https://reviews.llvm.org/D67459 llvm-svn: 371677
*	[InstSimplify] Pass SimplifyQuery into simplifyUnsignedRangeCheck() and use ↵	Roman Lebedev	2019-09-11	1	-17/+15
\| \| \| \| \| \| \| \| \| \|	it for isKnownNonZero() This was actually the original intention in D67332, but i messed up and forgot about it. This patch was originally part of D67411, but precommitting this. llvm-svn: 371630
*	[TLI][AMDGPU] AMDPAL does not have library functions	Tim Renouf	2019-09-11	1	-12/+2
\| \| \| \| \| \| \| \| \| \| \|	Configure TLI to say that r600/amdgpu does not have any library functions, such that InstCombine does not do anything like turn sin/cos into the library function @tan with sufficient fast math flags. Differential Revision: https://reviews.llvm.org/D67406 Change-Id: I02f907d3e64832117ea9800e9f9285282856e5df llvm-svn: 371592
*	[MemorySSA] Do not create memoryaccesses for debug info intrinsics.	Alina Sbirlea	2019-09-10	1	-2/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Do not model debuginfo intrinsics in MemorySSA. Regularly these are non-memory modifying instructions. With -disable-basicaa, they were being modelled as Defs. Reviewers: george.burgess.iv Subscribers: aprantl, Prazek, sanjoy.google, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67307 llvm-svn: 371565
*	[Loads] Move generic code out of vectorizer into a location it might be ↵	Philip Reames	2019-09-10	1	-0/+50
\| \| \| \| \| \|	reused [NFC] llvm-svn: 371558
*	[ValueTracking] Factor our common speculation suppression logic [NFC]	Philip Reames	2019-09-10	1	-6/+13
\| \| \| \| \| \|	Expose a utility function so that all places which want to suppress speculation (when otherwise legal) due to ordering and/or sanitizer interaction can do so. llvm-svn: 371556
*	[BPI] Adjust the probability for floating point unordered comparison	Guozhi Wei	2019-09-10	1	-2/+14
\| \| \| \| \| \| \| \|	Since NaN is very rare in normal programs, so the probability for floating point unordered comparison should be extremely small. Current probability is 3/8, it is too large, this patch changes it to a tiny number. Differential Revision: https://reviews.llvm.org/D65303 llvm-svn: 371541
*	[InstSimplify] simplifyUnsignedRangeCheck(): if we know that X != 0, handle ↵	Roman Lebedev	2019-09-08	1	-9/+24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	more cases (PR43246) Summary: This is motivated by D67122 sanitizer check enhancement. That patch seemingly worsens `-fsanitize=pointer-overflow` overhead from 25% to 50%, which strongly implies missing folds. In this particular case, given ``` char* test(char& base, unsigned long offset) { return &base + offset; } ``` it will end up producing something like https://godbolt.org/z/LK5-iH which after optimizations reduces down to roughly ``` define i1 @t0(i8* nonnull %base, i64 %offset) { %base_int = ptrtoint i8* %base to i64 %adjusted = add i64 %base_int, %offset %non_null_after_adjustment = icmp ne i64 %adjusted, 0 %no_overflow_during_adjustment = icmp uge i64 %adjusted, %base_int %res = and i1 %non_null_after_adjustment, %no_overflow_during_adjustment ret i1 %res } ``` Without D67122 there was no `%non_null_after_adjustment`, and in this particular case we can get rid of the overhead: Here we add some offset to a non-null pointer, and check that the result does not overflow and is not a null pointer. But since the base pointer is already non-null, and we check for overflow, that overflow check will already catch the null pointer, so the separate null check is redundant and can be dropped. Alive proofs: https://rise4fun.com/Alive/WRzq There are more patterns of "unsigned-add-with-overflow", they are not handled here, but this is the main pattern, that we currently consider canonical, so it makes sense to handle it. https://bugs.llvm.org/show_bug.cgi?id=43246 Reviewers: spatel, nikic, vsk Reviewed By: spatel Subscribers: hiraditya, llvm-commits, reames Tags: #llvm Differential Revision: https://reviews.llvm.org/D67332 llvm-svn: 371349
*	[Intrinsic] Add the llvm.umul.fix.sat intrinsic	Bjorn Pettersson	2019-09-07	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Add an intrinsic that takes 2 unsigned integers with the scale of them provided as the third argument and performs fixed point multiplication on them. The result is saturated and clamped between the largest and smallest representable values of the first 2 operands. This is a part of implementing fixed point arithmetic in clang where some of the more complex operations will be implemented as intrinsics. Patch by: leonardchan, bjope Reviewers: RKSimon, craig.topper, bevinh, leonardchan, lebedev.ri, spatel Reviewed By: leonardchan Subscribers: ychen, wuzish, nemanjai, MaskRay, jsji, jdoerfert, Ka-Ka, hiraditya, rjmccall, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57836 llvm-svn: 371308
*	[LVI] Look through extractvalue of insertvalue	Nikita Popov	2019-09-07	1	-0/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This addresses the issue mentioned on D19867. When we simplify with.overflow instructions in CVP, we leave behind extractvalue of insertvalue sequences that LVI no longer understands. This means that we can not simplify any instructions based on the with.overflow anymore (until some over pass like InstCombine cleans them up). This patch extends LVI extractvalue handling by calling SimplifyExtractValueInst (which doesn't do anything more than constant folding + looking through insertvalue) and using the block value of the simplification. A possible alternative would be to do something similar to SimplifyIndVars, where we instead directly try to replace extractvalue users of the with.overflow. This would need some additional structural changes to CVP, as it's currently not legal to remove anything but the current instruction -- we'd have to introduce a worklist with instructions scheduled for deletion or similar. Differential Revision: https://reviews.llvm.org/D67035 llvm-svn: 371306
*	Change TargetLibraryInfo analysis passes to always require Function	Teresa Johnson	2019-09-07	16	-55/+97
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This is the first change to enable the TLI to be built per-function so that -fno-builtin* handling can be migrated to use function attributes. See discussion on D61634 for background. This is an enabler for fixing handling of these options for LTO, for example. This change should not affect behavior, as the provided function is not yet used to build a specifically per-function TLI, but rather enables that migration. Most of the changes were very mechanical, e.g. passing a Function to the legacy analysis pass's getTLI interface, or in Module level cases, adding a callback. This is similar to the way the per-function TTI analysis works. There was one place where we were looking for builtins but not in the context of a specific function. See FindCXAAtExit in lib/Transforms/IPO/GlobalOpt.cpp. I'm somewhat concerned my workaround could provide the wrong behavior in some corner cases. Suggestions welcome. Reviewers: chandlerc, hfinkel Subscribers: arsenm, dschuff, jvesely, nhaehnle, mehdi_amini, javed.absar, sbc100, jgravelle-google, eraman, aheejin, steven_wu, george.burgess.iv, dexonsmith, jfb, asbirlea, gchatelet, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D66428 llvm-svn: 371284
*	[ConstantFolding] Refactor functions not available before C99 (NFC)	Evandro Menezes	2019-09-06	1	-1/+6
\| \| \| \| \| \| \|	Note the cases when calling a function at compile time may fail if the host does not support the C99 run time library. llvm-svn: 371236
*	[ConstantFolding] Refactor function match for better speed (NFC)	Evandro Menezes	2019-09-06	1	-102/+134
\| \| \| \| \| \|	Use an `enum` instead of string comparison to match the candidate function. llvm-svn: 371228
*	[LLVM][Alignment] Convert isLegalNTStore/isLegalNTLoad to llvm::Align	Guillaume Chatelet	2019-09-05	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This is patch is part of a serie to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Reviewers: courbet Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67223 llvm-svn: 371063
*	[MemorySSA] Re-enable MemorySSA use.	Alina Sbirlea	2019-09-04	1	-1/+1
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D58311 llvm-svn: 370957
*	[NFC] Switch last couple of invariant_load checks to use hasMetadata	Philip Reames	2019-09-04	1	-1/+1
\| \| \| \|	llvm-svn: 370948
*	[TargetLibraryInfo] Define enumerator for no library function (NFC)	Evandro Menezes	2019-09-04	1	-0/+1
\| \| \| \| \| \|	Add a null enumerator do designate no library function. llvm-svn: 370947
*	[Instruction] Add hasMetadata(Kind) helper [NFC]	Philip Reames	2019-09-04	1	-3/+3
\| \| \| \| \| \|	It's a common idiom, so let's add the obvious wrapper for metadata kinds which are basically booleans. llvm-svn: 370933
*	[MemorySSA] Move two verify calls under expensive checks.	Alina Sbirlea	2019-09-04	1	-2/+2
\| \| \| \|	llvm-svn: 370831
*	[MemorySSA] Disable MemorySSA use.	Alina Sbirlea	2019-09-03	1	-1/+1
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D58311 llvm-svn: 370821
*	[MemorySSA] Re-enable MemorySSA use.	Alina Sbirlea	2019-09-03	1	-1/+1
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D58311 llvm-svn: 370811
*	[ConstantFolding] Fix 'undef' folding for @llvm.[us]{add,sub}.with.overflow ↵	Roman Lebedev	2019-09-01	1	-11/+18
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	ops (PR43188) As we have already established/fixed in https://bugs.llvm.org/show_bug.cgi?id=42209 https://reviews.llvm.org/D63065 https://reviews.llvm.org/rL363522 the InstSimplify handling for @llvm.with.overflow ops with undefs is correct. Therefore if ConstantFolding produces different results, then it is wrong. This duplication of code hints at the need for some refactoring, but for now address the brokenness of ConstantFolding by copying the known-good handling from rL363522. Fixes https://bugs.llvm.org/show_bug.cgi?id=43188 llvm-svn: 370608
*	[LVI] Extract solveBlockValueExtractValue(); NFC	Nikita Popov	2019-08-31	1	-3/+15
\| \| \| \| \| \| \|	Extract this method in preparation for additional extractvalue support. llvm-svn: 370575
*	[MemorySSA] Rename all phi entries.	Alina Sbirlea	2019-08-30	1	-3/+8
\| \| \| \| \| \| \|	When renaming Phis incoming values, there may be multiple edges incoming from the same block (switch). Rename all. llvm-svn: 370548
*	Revert enabling MemorySSA.	Alina Sbirlea	2019-08-29	1	-1/+1
\| \| \| \| \| \| \| \|	Breaks sanitizers bots. Differential Revision: https://reviews.llvm.org/D58311 llvm-svn: 370397
*	[MemorySSA & LoopPassManager] Enable MemorySSA as loop dependency. Update tests.	Alina Sbirlea	2019-08-29	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: I'm not planning to check this in at the moment, but feedback is very welcome, in particular how this affects performance. The feedback obtains here will guide the next steps towards enabling this. This patch enables the use of MemorySSA in the loop pass manager. Passes that currently use MemorySSA: - EarlyCSE Passes that use MemorySSA after this patch: - EarlyCSE - LICM - SimpleLoopUnswitch Loop passes that update MemorySSA (and do not use it yet, but could use it after this patch): - LoopInstSimplify - LoopSimplifyCFG - LoopUnswitch - LoopRotate - LoopSimplify - LCSSA Loop passes that do not update MemorySSA: - IndVarSimplify - LoopDelete - LoopIdiom - LoopSink - LoopUnroll - LoopInterchange - LoopUnrollAndJam - LoopVectorize - LoopReroll - IRCE Reviewers: chandlerc, george.burgess.iv, davide, sanjoy, gberry Subscribers: jlebar, Prazek, dmgreen, jdoerfert, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D58311 llvm-svn: 370384
*	Allow replaceAndRecursivelySimplify to list unsimplified visitees.	Joerg Sonnenberger	2019-08-29	1	-12/+18
\| \| \| \| \| \| \|	This is part of D65280 and split it to avoid ABI changes on the 9.0 release branch. llvm-svn: 370355
*	[InstSimplify] Drop leftover "division-by-zero guard" around ↵	Roman Lebedev	2019-08-29	1	-12/+59
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	`@llvm.umul.with.overflow` inverted overflow bit Summary: Now that with D65143/D65144 we've produce `@llvm.umul.with.overflow`, and with D65147 we've flattened the CFG, we now can see that the guard may have been there to prevent division by zero is redundant. We can simply drop it: ``` ---------------------------------------- Name: no overflow or zero %iszero = icmp eq i4 %y, 0 %umul = smul_overflow i4 %x, %y %umul.ov = extractvalue {i4, i1} %umul, 1 %umul.ov.not = xor %umul.ov, -1 %retval.0 = or i1 %iszero, %umul.ov.not ret i1 %retval.0 => %iszero = icmp eq i4 %y, 0 %umul = smul_overflow i4 %x, %y %umul.ov = extractvalue {i4, i1} %umul, 1 %umul.ov.not = xor %umul.ov, -1 %retval.0 = or i1 %iszero, %umul.ov.not ret i1 %umul.ov.not Done: 1 Optimization is correct! ``` Note that this is inverted from what we have in a previous patch, here we are looking for the inverted overflow bit. And that inversion is kinda problematic - given this particular pattern we neither hoist that `not` closer to `ret` (then the pattern would have been identical to the one without inversion, and would have been handled by the previous patch), neither do the opposite transform. But regardless, we should handle this too. I've filled [[ https://bugs.llvm.org/show_bug.cgi?id=42720 \| PR42720 ]]. Reviewers: nikic, spatel, xbolva00, RKSimon Reviewed By: spatel Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D65151 llvm-svn: 370351
*	[InstSimplify] Drop leftover "division-by-zero guard" around ↵	Roman Lebedev	2019-08-29	1	-0/+40
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	`@llvm.umul.with.overflow` overflow bit Summary: Now that with D65143/D65144 we've produce `@llvm.umul.with.overflow`, and with D65147 we've flattened the CFG, we now can see that the guard may have been there to prevent division by zero is redundant. We can simply drop it: ``` ---------------------------------------- Name: no overflow and not zero %iszero = icmp ne i4 %y, 0 %umul = umul_overflow i4 %x, %y %umul.ov = extractvalue {i4, i1} %umul, 1 %retval.0 = and i1 %iszero, %umul.ov ret i1 %retval.0 => %iszero = icmp ne i4 %y, 0 %umul = umul_overflow i4 %x, %y %umul.ov = extractvalue {i4, i1} %umul, 1 %retval.0 = and i1 %iszero, %umul.ov ret %umul.ov Done: 1 Optimization is correct! ``` Reviewers: nikic, spatel, xbolva00 Reviewed By: spatel Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D65150 llvm-svn: 370350
*	[CostModel] Model all `extractvalue`s as free.	Roman Lebedev	2019-08-29	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: As disscussed in https://reviews.llvm.org/D65148#1606412, `extractvalue` don't actually generate any code, so we should treat them as free. Reviewers: craig.topper, RKSimon, jnspaulsson, greened, asb, t.p.northover, jmolloy, dmgreen Reviewed By: jmolloy Subscribers: javed.absar, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D66098 llvm-svn: 370339
*	Annotate return values of allocation functions with dereferenceable_or_null	David Bolvansky	2019-08-28	1	-0/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Example define dso_local noalias i8* @_Z6maixxnv() local_unnamed_addr #0 { entry: %call = tail call noalias dereferenceable_or_null(64) i8* @malloc(i64 64) #6 ret i8* %call } Reviewers: jdoerfert Reviewed By: jdoerfert Subscribers: aaron.ballman, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D66651 llvm-svn: 370168
*	[NFC] Assert preconditions and merge all users into one codepath in Loads.cpp	Philip Reames	2019-08-27	1	-6/+11
\| \| \| \|	llvm-svn: 370128
*	[Loads/SROA] Remove blatantly incorrect code and fix a bug revealed in the ↵	Philip Reames	2019-08-27	1	-42/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	process The code we had isSafeToLoadUnconditionally was blatantly wrong. This function takes a "Size" argument which is supposed to describe the span loaded from. Instead, the code use the size of the pointer passed (which may be unrelated!) and only checks that span. For any Size > LoadSize, this can and does lead to miscompiles. Worse, the generic code just a few lines above correctly handles the cases which are valid. So, let's delete said code. Removing this code revealed two issues: 1) As noted by jdoerfert the removed code incorrectly handled external globals. The test update in SROA is to stop testing incorrect behavior. 2) SROA was confusing bytes and bits, but this wasn't obvious as the Size parameter was being essentially ignored anyway. Fixed. Differential Revision: https://reviews.llvm.org/D66778 llvm-svn: 370102
*	[NFC] Replace the FIXME I added in rL369989 with a comment clarifying the ↵	Philip Reames	2019-08-27	1	-3/+3
\| \| \| \| \| \| \| \|	current code The current approach is restrictive (as all of geps must be multiples of the alignment), but correct. llvm-svn: 370013
*	[MemorySSA] Fix insertUse.	Alina Sbirlea	2019-08-27	1	-5/+5
\| \| \| \| \| \| \| \|	Actually call the renamePass on inserted Phis. Fixes PR42940. Subscribers: llvm-commits llvm-svn: 369997
*	Reorganize code and add a fixme to point out a bug in existing code [NFC]	Philip Reames	2019-08-26	1	-12/+10
\| \| \| \|	llvm-svn: 369989
*	[TLI] Simplify code. NFCI.	Benjamin Kramer	2019-08-24	1	-9/+4
\| \| \| \|	llvm-svn: 369854
*	Fix some accidental global initializers by using StringLiteral instead of ↵	Benjamin Kramer	2019-08-24	1	-2/+3
\| \| \| \| \| \|	StringRef llvm-svn: 369850
*	[BasicAA] Use dereferenceability to reason about aliasing	Johannes Doerfert	2019-08-23	1	-4/+26
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: We already use the fact that an object with known size X does not alias another objection of size Y > X before. With this commit, we use dereferenceability information to determine a lower bound for Y and not only rely on the user provided query size. The result for @global_and_deref_arg_2() and @local_and_deref_ret_2() in test/Analysis/BasicAA/dereferenceable.ll improved with this patch. Reviewers: asbirlea, chandlerc, hfinkel, sanjoy Subscribers: hiraditya, bollu, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D66157 llvm-svn: 369786
*	[MustExec] Add a generic "must-be-executed-context" explorer	Johannes Doerfert	2019-08-23	2	-0/+119
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Given an instruction I, the MustBeExecutedContextExplorer allows to easily traverse instructions that are guaranteed to be executed whenever I is. For now, these instruction have to be statically "after" I, in the same or different basic blocks. This patch also adds a pass which prints the must-be-executed-context for each instruction in a module. It is used to test the MustBeExecutedContextExplorer, for now on the examples given in the class comment of the MustBeExecutedIterator. Differential Revision: https://reviews.llvm.org/D65186 llvm-svn: 369765