bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	Don't run a full verifier pass in coro-splitting's private pipeline.	John McCall	2019-08-14	1	-1/+6
\| \| \| \| \| \|	Potentially addresses rdar://49022293. llvm-svn: 368797
*	Remove unreachable blocks before splitting a coroutine.	John McCall	2019-08-14	1	-1/+19
\| \| \| \| \| \| \|	The suspend-crossing algorithm is not correct in the presence of uses that cannot be reached on some successor path from their defs. llvm-svn: 368796
*	Support swifterror in coroutine lowering.	John McCall	2019-08-14	3	-0/+238
\| \| \| \| \| \| \| \| \| \|	The support for swifterror allocas should work in all lowerings. The support for swifterror arguments only really works in a lowering with prototypes where you can ensure that the prototype also has a swifterror argument; I'm not really sure how it could possibly be made to work in the switch lowering. llvm-svn: 368795
*	In coro.retcon lowering, don't explode if the optimizer messes around with ↵	John McCall	2019-08-14	2	-1/+28
\| \| \| \| \| \|	the linkage of the prototype or the exact types of the yielded values. llvm-svn: 368793
*	Fix a use-after-free in the coro.alloca treatment.	John McCall	2019-08-14	1	-4/+10
\| \| \| \|	llvm-svn: 368792
*	Add intrinsics for doing frame-bound dynamic allocations within a coroutine.	John McCall	2019-08-14	3	-4/+241
\| \| \| \| \| \| \|	These rely on having an allocator provided to the coroutine and thus, for now, only work in retcon lowerings. llvm-svn: 368791
*	Guard dumps in the coro intrinsic validation logic behind NDEBUG checks. ↵	John McCall	2019-08-14	1	-0/+14
\| \| \| \| \| \|	dump() is not guaranteed to be defined in all builds. llvm-svn: 368790
*	Generalize llvm.coro.suspend.retcon to allow an arbitrary number of ↵	John McCall	2019-08-14	3	-24/+93
\| \| \| \| \| \|	arguments to be passed back to the continuation function. llvm-svn: 368789
*	Extend coroutines to support a "returned continuation" lowering.	John McCall	2019-08-14	7	-271/+1432
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	A quick contrast of this ABI with the currently-implemented ABI: - Allocation is implicitly managed by the lowering passes, which is fine for frontends that are fine with assuming that allocation cannot fail. This assumption is necessary to implement dynamic allocas anyway. - The lowering attempts to fit the coroutine frame into an opaque, statically-sized buffer before falling back on allocation; the same buffer must be provided to every resume point. A buffer must be at least pointer-sized. - The resume and destroy functions have been combined; the continuation function takes a parameter indicating whether it has succeeded. - Conversely, every suspend point begins its own continuation function. - The continuation function pointer is directly returned to the caller instead of being stored in the frame. The continuation can therefore directly destroy the frame when exiting the coroutine instead of having to leave it in a defunct state. - Other values can be returned directly to the caller instead of going through a promise allocation. The frontend provides a "prototype" function declaration from which the type, calling convention, and attributes of the continuation functions are taken. - On the caller side, the frontend can generate natural IR that directly uses the continuation functions as long as it prevents IPO with the coroutine until lowering has happened. In combination with the point above, the frontend is almost totally in charge of the ABI of the coroutine. - Unique-yield coroutines are given some special treatment. llvm-svn: 368788
*	[GlobalISel]: Fix lowering of G_Shuffle_vector where we pick up the wrong ↵	Aditya Nandakumar	2019-08-14	1	-1/+1
\| \| \| \| \| \| \| \|	source index https://reviews.llvm.org/D66182 llvm-svn: 368781
*	[AArch64][GlobalISel] RBS: Treat s128s like vectors when unmerging.	Amara Emerson	2019-08-13	1	-1/+1
\| \| \| \| \| \| \| \|	The destinations should be FPRs (for now). Differential Revision: https://reviews.llvm.org/D66184 llvm-svn: 368775
*	[AArch64] Remove incorrect usage of MONonTemporal.	Eli Friedman	2019-08-13	1	-2/+1
\| \| \| \| \| \| \|	This has no effect at the moment, but might matter if we try to implement non-temporal loads in the future. llvm-svn: 368770
*	[GlobalISel]: Fix lowering of G_SHUFFLE_VECTOR with scalar sources	Aditya Nandakumar	2019-08-13	1	-5/+10
\| \| \| \| \| \|	https://reviews.llvm.org/D66171 llvm-svn: 368753
*	[AIX]Lowering global address for 32/64bit small/large code models	Xiangling Liao	2019-08-13	6	-46/+119
\| \| \| \| \| \| \| \| \| \| \| \|	This patch implements global address lowering for 32/64 bit with small/large code models. 1.For 32bit large code model on AIX, there are newly added pseudo opcode LWZtocL & ADDIStocHA32, the support of which on MC layer will be provided by future patches. 2.The default code model on AIX should be small code model. 3.Since AIX does not have medium code model, "report_fatal_error" when users specify it. Differential Revision: https://reviews.llvm.org/D63547 llvm-svn: 368744
*	[AMDGPU] Fix to 'Fold readlane from copy of SGPR or imm'	Tim Renouf	2019-08-13	1	-0/+3
\| \| \| \| \| \| \| \| \|	That change (r363670) could leave a copy from vgpr to sgpr. Fixed. Differential Revision: https://reviews.llvm.org/D66133 Change-Id: I00c3fe6fda2e8e1e36f53195b881b1449c777ea4 llvm-svn: 368736
*	[ARM] Add MVE beats vector cost model	David Green	2019-08-13	4	-21/+87
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The MVE architecture has the idea of "beats", where a vector instruction can be executed over several ticks of the architecture. This adds a similar system into the Arm backend cost model, multiplying the cost of all vector instructions by a factor. This factor essentially becomes the expected difference between scalar code and vector code, on average. MVE Vector instructions can also overlap so the a true cost of them is often lower. But equally scalar instructions can in some situations be dual issued, or have other optimisations such as unrolling or make use of dsp instructions. The default is chosen as 2. This should not prevent vectorisation is a most cases (as the vector instructions will still be doing at least 4 times the work), but it will help prevent over vectorising in cases where the benefits are less likely. This adds things so far to the obvious places in ARMTargetTransformInfo, and updates a few related costs like not treating float instructions as cost 2 just because they are floats. Differential Revision: https://reviews.llvm.org/D66005 llvm-svn: 368733
*	[llvm-profdata] Profile dump for compact binary format	Wenlei He	2019-08-13	1	-6/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Fix "llvm-profdata show" so it can work with compact binary format profile. The change is to mark all functions "used" so SampleProfileReaderCompactBinary::read will read in all profiles available for dumping. The function names will be MD5 hash for compact binary format. Reviewers: wmi, davidxl, danielcdh Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D65162 llvm-svn: 368731
*	[AutoUpgrader] Make ArcRuntime Autoupgrader more conservative	Steven Wu	2019-08-13	2	-7/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This is a tweak to r368311 and r368646 which auto upgrades the calls to objc runtime functions to objc runtime intrinsics, in order to make sure that the auto upgrader does not trigger with up-to-date bitcode. It is possible for bitcode that is up-to-date to contain direct calls to objc runtime function and those are not inserted by compiler as part of ARC and they should not be upgraded. Now auto upgrader only triggers as when the old style of ARC marker is used so it is guaranteed that it won't trigger on update-to-date bitcode. This also means it won't do this upgrade for bitcode from llvm-8 and llvm-9, which preserves the behavior of those releases. Ideally they should be upgraded as well but it is more important to make sure AutoUpgrader will not trigger on up-to-date bitcode. Reviewers: ahatanak, rjmccall, dexonsmith, pete Reviewed By: dexonsmith Subscribers: hiraditya, jkorous, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D66153 llvm-svn: 368730
*	Use Register over unsigned in LateEHPrepare (NFC)	Heejin Ahn	2019-08-13	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: While D65962 is pending for review, I landed D65475 that added one more use of `unsigned`. Changed it to `Register`. Reviewers: dsanders Subscribers: dschuff, sbc100, jgravelle-google, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D66064 llvm-svn: 368727
*	[SimplifyLibCalls] Add noalias from known callsites	David Bolvansky	2019-08-13	1	-0/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Should be fine for memcpy, strcpy, strncpy. Reviewers: jdoerfert, efriedma Reviewed By: jdoerfert Subscribers: uenoku, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D66135 llvm-svn: 368724
*	[ValueTracking] Improve reverse assumption inference	Nikita Popov	2019-08-13	1	-1/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Use isGuaranteedToTransferExecutionToSuccessor() instead of isSafeToSpeculativelyExecute() when seeing whether we can propagate the information in an assume backwards in isValidAssumeForContext(). The latter is more general - it also allows arbitrary loads/stores - and is also the condition we want: if our assume is guaranteed to execute, its condition not holding would be UB. Original patch by arielb1. Differential Revision: https://reviews.llvm.org/D37215 llvm-svn: 368723
*	Reland r368691: "[AIX] Implement LR prolog/epilog save/restore"	Hubert Tong	2019-08-13	2	-6/+34
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Trying again with the code changes (and not just the new test). Summary: This patch fixes the offsets of fields in the stack frame linkage save area for AIX. Reviewers: sfertile, hubert.reinterpretcast, jasonliu, Xiangling_L, xingxue, ZarkoCA, daltenty Reviewed By: hubert.reinterpretcast Subscribers: wuzish, nemanjai, hiraditya, kbarton, MaskRay, jsji, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D64424 Patch by Chris Bowler! llvm-svn: 368721
*	[NFC][AIX] Use assert instead of llvm_unreachable	David Tenty	2019-08-13	3	-11/+11
\| \| \| \| \| \| \| \| \| \|	Addresses post-commit comments on https://reviews.llvm.org/D64825. Use assert instead of llvm_unreachable to check if invalid csect types are being generated. Use report_fatal_error on unimplemented XCOFF features. Differential Revision: https://reviews.llvm.org/D64825 llvm-svn: 368720
*	[Dwarf] Complete the list of type tags.	Jonas Devlieghere	2019-08-13	1	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \|	An incorrect verification error revealed that the list of type tags was incomplete. This patch adds the missing types by adding a tag kind to the Dwarf.def file, which is used by the `isType` function. A test was added for the original verification error. Differential revision: https://reviews.llvm.org/D65914 llvm-svn: 368718
*	[SLC] Improve dereferenceable bytes annotation	David Bolvansky	2019-08-13	1	-1/+5
\| \| \| \|	llvm-svn: 368715
*	GlobalISel: Partially implement fewerElementsVector G_UNMERGE_VALUES	Matt Arsenault	2019-08-13	2	-1/+72
\| \| \| \| \| \|	Odd sized vectors aren't handled yet. llvm-svn: 368713
*	[ARM] Fix detection of duplicates when parsing reg list operands	Momchil Velikov	2019-08-13	1	-19/+43
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D65957 llvm-svn: 368712
*	[ARM] Fix encoding of APSR in CLRM instruction	Momchil Velikov	2019-08-13	2	-16/+7
\| \| \| \| \| \| \| \| \|	The APSR is encoded by setting bit 15 in the register list of the CLRM instruction (cf. https://static.docs.arm.com/ddi0553/bh/DDI0553B_h_armv8m_arm.pdf). Differential Revision: https://reviews.llvm.org/D65873 llvm-svn: 368711
*	GlobalISel: Implement lower for G_SHUFFLE_VECTOR	Matt Arsenault	2019-08-13	2	-0/+43
\| \| \| \|	llvm-svn: 368709
*	[ORC] Refactor definition-generation, add a generator for static libraries.	Lang Hames	2019-08-13	2	-13/+134
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch replaces the JITDylib::DefinitionGenerator typedef with a class of the same name, and adds support for attaching a sequence of DefinitionGeneration objects to a JITDylib. This patch also adds a new definition generator, StaticLibraryDefinitionGenerator, that can be used to add symbols fom a static library to a JITDylib. An object from the static library will be added (via a supplied ObjectLayer reference) whenever a symbol from that object is referenced. To enable testing, lli is updated to add support for the --extra-archive option when running in -jit-kind=orc-lazy mode. llvm-svn: 368707
*	GlobalISel: Add more verifier checks for G_SHUFFLE_VECTOR	Matt Arsenault	2019-08-13	1	-1/+35
\| \| \| \|	llvm-svn: 368705
*	GlobalISel: Change representation of shuffle masks	Matt Arsenault	2019-08-13	9	-46/+94
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Currently shufflemasks get emitted as any other constant, and you end up with a bunch of virtual registers of G_CONSTANT with a G_BUILD_VECTOR. The AArch64 selector then asserts on anything that doesn't fit this pattern. This isn't an ideal representation, and should avoid legalization and have fewer opportunities for a representational error. Rather than invent a new shuffle mask operand type, similar to what ShuffleVectorSDNode does, just track the original IR Constant mask operand. I don't completely like the idea of adding another link to the IR, but MIR is already quite dependent on IR constants already, and this will allow sharing the shuffle mask utility functions with the IR. llvm-svn: 368704
*	[CodeGen][SelectionDAG] More efficient code for X % C == 0 (SREM case)	Roman Lebedev	2019-08-13	1	-5/+221
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This implements an optimization described in Hacker's Delight 10-17: when `C` is constant, the result of `X % C == 0` can be computed more cheaply without actually calculating the remainder. The motivation is discussed here: https://bugs.llvm.org/show_bug.cgi?id=35479. One huge caveat: this signed case is only valid for positive divisors. While we can freely negate negative divisors, we can't negate `INT_MIN`, so for now if `INT_MIN` is encountered, we bailout. As a follow-up, it should be possible to handle that more gracefully via extra `and`+`setcc`+`select`. This passes llvm's test-suite, and from cursory(!) cross-examination the folds (the assembly) match those of GCC, and manual checking via alive did not reveal any issues (other than the `INT_MIN` case) Reviewers: RKSimon, spatel, hermord, craig.topper, xbolva00 Reviewed By: RKSimon, xbolva00 Subscribers: xbolva00, thakis, javed.absar, hiraditya, dexonsmith, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D65366 llvm-svn: 368702
*	[TargetLowering][NFC] prepareUREMEqFold(): fixup comment	Roman Lebedev	2019-08-13	1	-1/+1
\| \| \| \| \| \| \| \|	The comment initially matched the code, but the code was incorrect and was fixed after the initial revert back back when it was introduced, but the comment was never updated. llvm-svn: 368701
*	[InstCombine] Non-canonical clamp-like pattern handling	Roman Lebedev	2019-08-13	1	-0/+146
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Given a pattern like: ``` %old_cmp1 = icmp slt i32 %x, C2 %old_replacement = select i1 %old_cmp1, i32 %target_low, i32 %target_high %old_x_offseted = add i32 %x, C1 %old_cmp0 = icmp ult i32 %old_x_offseted, C0 %r = select i1 %old_cmp0, i32 %x, i32 %old_replacement ``` it can be rewritten as more canonical pattern: ``` %new_cmp1 = icmp slt i32 %x, -C1 %new_cmp2 = icmp sge i32 %x, C0-C1 %new_clamped_low = select i1 %new_cmp1, i32 %target_low, i32 %x %r = select i1 %new_cmp2, i32 %target_high, i32 %new_clamped_low ``` Iff `-C1 s<= C2 s<= C0-C1` Also, `ULT` predicate can also be `UGE`; or `UGT` iff `C0 != -1` (+invert result) Also, `SLT` predicate can also be `SGE`; or `SGT` iff `C2 != INT_MAX` (+invert result) If `C1 == 0`, then all 3 instructions must be one-use; else at most either `%old_cmp1` or `%old_x_offseted` can have extra uses. NOTE: if we could reuse `%old_cmp1` as one of the comparisons we'll have to build, this could be less limiting. So there are two icmp's, each one with 3 predicate variants, so there are 9 fold variants: \| \| ULT \| UGE \| UGT \| \| SLT \| https://rise4fun.com/Alive/yIJ \| https://rise4fun.com/Alive/5BfN \| https://rise4fun.com/Alive/INH \| \| SGE \| https://rise4fun.com/Alive/hd8 \| https://rise4fun.com/Alive/Abk \| https://rise4fun.com/Alive/PlzS \| \| SGT \| https://rise4fun.com/Alive/VYG \| https://rise4fun.com/Alive/oMY \| https://rise4fun.com/Alive/KrzC \| {F9730206} This fold was brought up in https://reviews.llvm.org/D65148#1603922 by @dmgreen, and is needed to unblock that patch. This patch requires D65530. Reviewers: spatel, nikic, xbolva00, dmgreen Reviewed By: spatel Subscribers: hiraditya, llvm-commits, dmgreen Tags: #llvm Differential Revision: https://reviews.llvm.org/D65765 llvm-svn: 368687
*	[InstCombine][NFC] Rename IsFreeToInvert() -> isFreeToInvert() for consistency	Roman Lebedev	2019-08-13	4	-18/+18
\| \| \| \| \| \|	As per https://reviews.llvm.org/D65530#inline-592325 llvm-svn: 368686
*	[InstCombine] foldXorOfICmps(): don't give up on non-single-use ICmp's if ↵	Roman Lebedev	2019-08-13	2	-10/+63
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	all users are freely invertible Summary: This is rather unconventional.. As the comment there says, we don't have much folds for xor-of-icmps, we try to turn them into an and-of-icmps, for which we have plenty of folds. But if the ICmp we need to invert is not single-use - we give up. As discussed in https://reviews.llvm.org/D65148#1603922, we may have a non-canonical CLAMP pattern, with bit match and select-of-threshold that we'll potentially clamp. As it can be seen in `canonicalize-clamp-with-select-of-constant-threshold-pattern.ll`, out of all 8 variations of the pattern, only two are not canonicalized into the variant with and+icmp instead of bit math. The reason is because the ICmp we need to invert is not single-use - we give up. We indeed can't perform this fold at will, the general rule is that we should not increase instruction count in InstCombine, But we wouldn't end up increasing instruction count if we can adapt every other user to the inverted value. This way the `not` we create will get folded, and in the end the instruction count did not increase. For that, of course, we need to look at the users of a Value, which is again rather unconventional for InstCombine :S Thus i'm proposing to be a little bit more insistive in `foldXorOfICmps()`. The alternatives would be to not create that `not`, but add duplicate code to manually invert all users; or to add some even less general combine to handle some more specific pattern[s]. Reviewers: spatel, nikic, RKSimon, craig.topper Reviewed By: spatel Subscribers: hiraditya, jdoerfert, dmgreen, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D65530 llvm-svn: 368685
*	[X86] XFormVExtractWithShuffleIntoLoad - handle shuffle mask scaling	Simon Pilgrim	2019-08-13	1	-13/+27
\| \| \| \| \| \| \| \| \| \|	If the target shuffle mask is from a wider type, attempt to scale the mask so that the extraction can attempt to peek through. Fixes the regression mentioned in rL368662 Reapplying this as rL368308 had to be reverted as part of rL368660 to revert rL368276 llvm-svn: 368663
*	[X86] SimplifyDemandedVectorElts - attempt to recombine target shuffle using ↵	Simon Pilgrim	2019-08-13	1	-0/+17
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	DemandedElts mask (reapplied) If we don't demand all elements, then attempt to combine to a simpler shuffle. At the moment we can only do this if Depth == 0 as combineX86ShufflesRecursively uses Depth to track whether the shuffle has really changed or not - we'll need to change this before we can properly start merging combineX86ShufflesRecursively into SimplifyDemandedVectorElts. The insertps-combine.ll regression is because XFormVExtractWithShuffleIntoLoad can't see through shuffles of different widths - this will be fixed in a follow-up commit. Reapplying this as rL368307 had to be reverted as part of rL368660 to revert rL368276 llvm-svn: 368662
*	Revert r368276 "[TargetLowering] SimplifyDemandedBits - call ↵	Hans Wennborg	2019-08-13	2	-55/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	SimplifyMultipleUseDemandedBits for ISD::EXTRACT_VECTOR_ELT" This introduced a false positive MemorySanitizer warning about use of uninitialized memory in a vectorized crc function in Chromium. That suggests maybe something is not right with this transformation. See https://crbug.com/992853#c7 for a reproducer. This also reverts the follow-up commits r368307 and r368308 which depended on this. > This patch attempts to peek through vectors based on the demanded bits/elt of a particular ISD::EXTRACT_VECTOR_ELT node, allowing us to avoid dependencies on ops that have no impact on the extract. > > In particular this helps remove some unnecessary scalar->vector->scalar patterns. > > The wasm shift patterns are annoying - @tlively has indicated that the wasm vector shift codegen are to be refactored in the near-term and isn't considered a major issue. > > Differential Revision: https://reviews.llvm.org/D65887 llvm-svn: 368660
*	[SimplifyLibCalls] Add dereferenceable bytes from known callsites	David Bolvansky	2019-08-13	1	-13/+58
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: int mm(char a, char b) { return memcmp(a,b,16); } Currently: define dso_local i32 @mm(i8* nocapture readonly %a, i8* nocapture readonly %b) local_unnamed_addr #1 { entry: %call = tail call i32 @memcmp(i8* %a, i8* %b, i64 16) ret i32 %call } After patch: define dso_local i32 @mm(i8* nocapture readonly %a, i8* nocapture readonly %b) local_unnamed_addr #1 { entry: %call = tail call i32 @memcmp(i8* dereferenceable(16) %a, i8* dereferenceable(16) %b, i64 16) ret i32 %call } Reviewers: jdoerfert, efriedma Reviewed By: jdoerfert Subscribers: javed.absar, spatel, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D66079 llvm-svn: 368657
*	[PowerPC] Fix ICE when truncating some vectors	Qiu Chaofan	2019-08-13	1	-1/+3
\| \| \| \| \| \| \| \| \| \| \| \|	The legalizer would hit an assertion on PowerPC platform when truncating a vector whose size is not power of 2. This patch is to add a check to prevent vectors with such odd-size elements from being custom lowered. Reviewed By: Hal Finkel Differential Revision: https://reviews.llvm.org/D65261 llvm-svn: 368654
*	[AArch64][GlobalISel] Replace explicit vreg creation with implicit using ↵	Amara Emerson	2019-08-13	1	-3/+4
\| \| \| \| \| \|	SrcOp. NFC. llvm-svn: 368653
*	[GlobalISel] Make the InstructionSelector instance non-const, allowing state ↵	Amara Emerson	2019-08-13	16	-66/+65
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	to be maintained. Currently we can't keep any state in the selector object that we get from subtarget. As a result we have to plumb through all our variables through multiple functions. This change makes it non-const and adds a virtual init() method to allow further state to be captured for each target. AArch64 makes use of this in this patch to cache a call to hasFnAttribute() which is expensive to call, and is used on each selection of G_BRCOND. Differential Revision: https://reviews.llvm.org/D65984 llvm-svn: 368652
*	Added unit tests to check supported rounding modes	Serge Pavlov	2019-08-13	1	-1/+1
\| \| \| \| \| \| \| \|	Also added fixed misspelled metadata name. Differential Revision: https://reviews.llvm.org/D66073 llvm-svn: 368650
*	[GlobalISel]: Add KnownBits for G_XOR	Aditya Nandakumar	2019-08-13	1	-0/+13
\| \| \| \| \| \|	https://reviews.llvm.org/D66119 llvm-svn: 368648
*	Verifier: check prof branch_weights	Yevgeny Rouban	2019-08-13	1	-0/+43
\| \| \| \| \| \| \| \| \| \| \| \|	This patch is to check some of constraints on !pro branch_weights metadata: https://llvm.org/docs/BranchWeightMetadata.html Reviewers: asbirlea, reames, chandlerc Reviewed By: reames Differential Revision: https://reviews.llvm.org/D61179 llvm-svn: 368647
*	Do not call replaceAllUsesWith to upgrade calls to ARC runtime functions	Akira Hatanaka	2019-08-13	1	-3/+38
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	to intrinsic calls This fixes a bug in r368311. It turns out that the ARC runtime functions in the IR can have pointer parameter types that are not i8* or i8**. Instead of RAUWing normal functions with intrinsics, manually bitcast the arguments before passing them to the intrinsic functions and bitcast the return value back to the type of the original call instruction. This recommits r368634, which was reverted in r368637. The loop in the patch was iterating over uses of a function and deleting function calls inside it, which caused bots to crash. rdar://problem/54125406 Differential Revision: https://reviews.llvm.org/D66047 llvm-svn: 368646
*	[AMDGPU] Fix msan failure in printf lowering	Stanislav Mekhanoshin	2019-08-13	1	-5/+3
\| \| \| \|	llvm-svn: 368645
*	Eliminate implicit Register->unsigned conversions in VirtRegMap. NFC	Daniel Sanders	2019-08-13	3	-35/+34
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This was mostly an experiment to assess the feasibility of completely eliminating a problematic implicit conversion case in D61321 in advance of landing that* but it also happens to align with the goal of propagating the use of Register/MCRegister instead of unsigned so I believe it makes sense to commit it. The overall process for eliminating the implicit conversions from Register/MCRegister -> unsigned was to: 1. Add an explicit conversion to support genuinely required conversions to unsigned. For example, using them as an index for IndexedMap. Sadly it's not possible to have an explicit and implicit conversion to the same type and only deprecate the implicit one so I called the explicit conversion get(). 2. Temporarily annotate the implicit conversion to unsigned with LLVM_ATTRIBUTE_DEPRECATED to make them visible 3. Eliminate implicit conversions by propagating Register/MCRegister/ explicit-conversions appropriately 4. Remove the deprecation added in 2. * My conclusion is that it isn't feasible as there's too much code to update in one go. Depends on D65678 Reviewers: arsenm Subscribers: MatzeB, wdng, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D65685 llvm-svn: 368643