bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	Implement -Watomic-implicit-seq-cst	JF Bastien	2018-09-10	1	-17/+43
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: _Atomic and __sync_* operations are implicitly sequentially-consistent. Some codebases want to force explicit usage of memory order instead. This warning allows them to know where implicit sequentially-consistent memory order is used. The warning isn't on by default because _Atomic was purposefully designed to have seq_cst as the default: the idea was that it's the right thing to use most of the time. This warning allows developers who disagree to enforce explicit usage instead. A follow-up patch will take care of C++'s std::atomic. It'll be different enough from this patch that I think it should be separate: for C++ the atomic operations all have a memory order parameter (or two), but it's defaulted. I believe this warning should trigger when the default is used, but not when seq_cst is used explicitly (or implicitly as the failure order for cmpxchg). <rdar://problem/28172966> Reviewers: rjmccall Subscribers: dexonsmith, cfe-commits Differential Revision: https://reviews.llvm.org/D51084 llvm-svn: 341860
*	Remove deprecated API	Stephen Kelly	2018-09-06	1	-8/+0
\| \| \| \| \| \| \| \| \| \|	Reviewers: teemperor! Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D50353 llvm-svn: 341573
*	[Sema] Clean up some __builtin_*_chk diagnostics	Erik Pilkington	2018-09-06	1	-8/+34
\| \| \| \| \| \| \| \| \| \| \|	Namely, print the likely macro name when it's used, and include the actual computed sizes in the diagnostic message, which are sometimes not obvious. rdar://43909200 Differential revision: https://reviews.llvm.org/D51697 llvm-svn: 341566
*	[X86] Add kshift intrinsics to match gcc and icc.	Craig Topper	2018-08-31	1	-0/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This adds the following intrinsics: _kshiftli_mask8 _kshiftli_mask16 _kshiftli_mask32 _kshiftli_mask64 _kshiftri_mask8 _kshiftri_mask16 _kshiftri_mask32 _kshiftri_mask64 llvm-svn: 341234
*	[SEMA] add more -Wfloat-conversion to compound assigment analysis	Nick Desaulniers	2018-08-13	1	-27/+33
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Fixes Bug: https://bugs.llvm.org/show_bug.cgi?id=27061 Reviewers: aaron.ballman, acoomans Reviewed By: aaron.ballman, acoomans Subscribers: acoomans, cfe-commits, srhines, pirama Differential Revision: https://reviews.llvm.org/D50467 llvm-svn: 339581
*	Mark up deprecated methods as such	Stephen Kelly	2018-08-09	1	-2/+8
\| \| \| \| \| \| \| \| \| \|	Reviewers: teemperor! Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D50352 llvm-svn: 339403
*	Port getLocEnd -> getEndLoc	Stephen Kelly	2018-08-09	1	-58/+57
\| \| \| \| \| \| \| \| \| \|	Reviewers: teemperor! Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D50351 llvm-svn: 339386
*	Port getLocStart -> getBeginLoc	Stephen Kelly	2018-08-09	1	-320/+300
\| \| \| \| \| \| \| \| \| \|	Reviewers: teemperor! Subscribers: jholewinski, whisperity, jfb, cfe-commits Differential Revision: https://reviews.llvm.org/D50350 llvm-svn: 339385
*	Add getEndLoc API to replace getLocEnd	Stephen Kelly	2018-08-09	1	-1/+2
\| \| \| \| \| \| \| \|	Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D50348 llvm-svn: 339374
*	Add getBeginLoc API to replace getLocStart	Stephen Kelly	2018-08-09	1	-1/+2
\| \| \| \| \| \| \| \|	Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D50346 llvm-svn: 339372
*	__c11_atomic_load's _Atomic can be const	JF Bastien	2018-08-02	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: C++11 onwards specs the non-member functions atomic_load and atomic_load_explicit as taking the atomic<T> by const (potentially volatile) pointer. C11, in its infinite wisdom, decided to drop the const, and C17 will fix this with DR459 (the current draft forgot to fix B.16, but that’s not the normative part). clang’s lib/Headers/stdatomic.h implements these as #define to the __c11_* equivalent, which are builtins with custom typecheck. Fix the typecheck. D47613 takes care of the libc++ side. Discussion: http://lists.llvm.org/pipermail/cfe-dev/2018-May/058129.html <rdar://problem/27426936> Reviewers: rsmith Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D47618 llvm-svn: 338743
*	Remove trailing space	Fangrui Song	2018-07-30	1	-87/+87
\| \| \| \| \| \|	sed -Ei 's/[[:space:]]+$//' include/*/.{def,h,td} lib/*/.{cpp,h} llvm-svn: 338291
*	Fold -Wreturn-stack-address into general initialization lifetime	Richard Smith	2018-07-23	1	-409/+0
\| \| \| \| \| \|	checking. llvm-svn: 337743
*	[Sema] Don't emit -Wmemset-transposed-args for memset(p,0,0)	Erik Pilkington	2018-07-23	1	-3/+6
\| \| \| \| \| \|	Thanks to Arthur O'Dwyer for the suggestion! llvm-svn: 337706
*	Change \t to spaces	Fangrui Song	2018-07-20	1	-1/+1
\| \| \| \|	llvm-svn: 337530
*	[Sema] Add a new warning, -Wmemset-transposed-args	Erik Pilkington	2018-07-19	1	-9/+94
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This diagnoses calls to memset that have the second and third arguments transposed, for example: memset(buf, sizeof(buf), 0); This is done by checking if the third argument is a literal 0, or if the second is a sizeof expression (and the third isn't). The first check is also done for calls to bzero. Differential revision: https://reviews.llvm.org/D49112 llvm-svn: 337470
*	[Hexagon] Diagnose intrinsics not supported by selected CPU/HVX	Krzysztof Parzyszek	2018-07-12	1	-2/+792
\| \| \| \|	llvm-svn: 336933
*	[Sema] -Wformat-pedantic only for NSInteger/NSUInteger %tu/%td on Darwin	Alex Lorenz	2018-07-05	1	-2/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The '%tu'/'%td' as formatting specifiers have been used to print out the NSInteger/NSUInteger values for a long time. Typically their ABI matches, but that's not the case on watchOS. The ABI difference boils down to the following: - Regular 32-bit darwin targets (like armv7) use 'ptrdiff_t' of type 'int', which matches 'NSInteger'. - WatchOS arm target (armv7k) uses 'ptrdiff_t' of type 'long', which doesn't match 'NSInteger' of type 'int'. Because of this ABI difference these specifiers trigger -Wformat warnings only for watchOS builds, which is really inconvenient for cross-platform code. This patch avoids this -Wformat warning for '%tu'/'%td' and NS[U]Integer only, and instead uses the new -Wformat-pedantic warning that JF introduced in https://reviews.llvm.org/D47290. This is acceptable because Darwin guarantees that, despite the watchOS ABI differences, sizeof(ptrdiff_t) == sizeof(NS[U]Integer), and alignof(ptrdiff_t) == alignof(NS[U]Integer) so the warning is therefore noisy for pedantic reasons. I'll update public documentation to ensure that this behaviour is properly communicated. rdar://41739204 Differential Revision: https://reviews.llvm.org/D48852 llvm-svn: 336396
*	[Sema] Consider all format_arg attributes.	Michael Kruse	2018-07-04	1	-6/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If a function has multiple format_arg attributes, clang only considers the first it finds (because AttributeLists are in reverse order, not necessarily the textually first) and ignores all others. Loop over all FormatArgAttr to print warnings for all declared format_arg attributes. For instance, libintl's ngettext (select plural or singular version of format string) has two __format_arg__ attributes. Differential Revision: https://reviews.llvm.org/D48734 llvm-svn: 336239
*	[X86] Remove masking from the avx512 rotate builtins. Use a select builtin ↵	Craig Topper	2018-06-30	1	-12/+12
\| \| \| \| \| \|	instead. llvm-svn: 336036
*	[X86] Remove masking from the avx512 packed sqrt builtins. Use select ↵	Craig Topper	2018-06-29	1	-2/+2
\| \| \| \| \| \|	builtins instead. llvm-svn: 335945
*	[Sema] -Wformat-pedantic only for NSInteger/NSUInteger %zu/%zi on Darwin	JF Bastien	2018-06-22	1	-30/+36
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Pick D42933 back up, and make NSInteger/NSUInteger with %zu/%zi specifiers on Darwin warn only in pedantic mode. The default -Wformat recently started warning for the following code because of the added support for analysis for the '%zi' specifier. NSInteger i = NSIntegerMax; NSLog(@"max NSInteger = %zi", i); The problem is that on armv7 %zi is 'long', and NSInteger is typedefed to 'int' in Foundation. We should avoid this warning as it's inconvenient to our users: it's target specific (happens only on armv7 and not arm64), and breaks their existing code. We should also silence the warning for the '%zu' specifier to ensure consistency. This is acceptable because Darwin guarantees that, despite the unfortunate choice of typedef, sizeof(size_t) == sizeof(NS[U]Integer), the warning is therefore noisy for pedantic reasons. Once this is in I'll update public documentation. Related discussion on cfe-dev: http://lists.llvm.org/pipermail/cfe-dev/2018-May/058050.html <rdar://36874921&40501559> Reviewers: ahatanak, vsapsai, alexshap, aaron.ballman, javed.absar, jfb, rjmccall Subscribers: kristof.beyls, aheejin, cfe-commits Differential Revision: https://reviews.llvm.org/D47290 llvm-svn: 335393
*	[x86] Teach the builtin argument range check to allow invalid ranges in	Chandler Carruth	2018-06-21	1	-5/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	dead code. This is important for C++ templates that essentially compute the valid input in a way that is constant and will cause all the invalid cases to be dead code that is deleted. Code in the wild actually does this and GCC also accepts these kinds of patterns so it is important to support it. To make this work, we provide a non-error path to diagnose these issues, and use a default-error warning instead. This keeps the relatively strict handling but prevents nastiness like SFINAE on these errors. It also allows us to safely use the system to diagnose this only when it occurs at runtime (in emitted code). Entertainingly, this required fixing the syntax in various other ways for the x86 test because we never bothered to diagnose that the returns were invalid. Since debugging these compile failures was super confusing, I've also improved the diagnostic to actually say what the value was. Most of the checks I've made ignore this to simplify maintenance, but I've checked it in a few places to make sure the diagnsotic is working. Depends on D48462. Without that, we might actually crash some part of the compiler after bypassing the error here. Thanks to Richard, Ben Kramer, and especially Craig Topper for all the help here. Differential Revision: https://reviews.llvm.org/D48464 llvm-svn: 335309
*	[X86] Remove masking from the 512-bit floating point max/min builtins. Use ↵	Craig Topper	2018-06-21	1	-4/+6
\| \| \| \| \| \|	select in IR instead. llvm-svn: 335200
*	Implement semantic checking for __builtin_signbit.	Aaron Ballman	2018-06-19	1	-4/+11
\| \| \| \| \| \|	r242675 changed the signature for the signbit builtin but did not introduce proper semantic checking to ensure the arguments are as-expected. This patch groups the signbit builtin along with the other fp classification builtins. Fixes PR28172. llvm-svn: 335050
*	Reverting due to line ending changes; will reapply after addressing that.	Aaron Ballman	2018-06-19	1	-816/+809
\| \| \| \|	llvm-svn: 335049
*	Implement semantic checking for __builtin_signbit.	Aaron Ballman	2018-06-19	1	-809/+816
\| \| \| \| \| \|	r242675 changed the signature for the signbit builtin but did not introduce proper semantic checking to ensure the arguments are as-expected. This patch groups the signbit builtin along with the other fp classification builtins. Fixes PR28172. llvm-svn: 335048
*	[X86] __builtin_ia32_prord512_mask, __builtin_ia32_prorq512_mask, ↵	Craig Topper	2018-06-15	1	-0/+2
\| \| \| \| \| \| \| \|	__builtin_ia32_shufpd should only accept an ICE constant. The rotates also need to check for the immediate to fit in 8-bits. Shufpd already checks its immediate range. llvm-svn: 334847
*	[X86] The immediate argument to getmantpd*_mask should be an ICE and it ↵	Craig Topper	2018-06-15	1	-0/+6
\| \| \| \| \| \| \| \|	should only be 4 bits wide. We already checked this for the scalar version, but missed the vector version somehow. llvm-svn: 334846
*	[X86] Rename __builtin_ia32_pslldqi128 to ↵	Craig Topper	2018-06-14	1	-8/+6
\| \| \| \| \| \| \| \| \| \| \| \|	__builtin_ia32_pslldqi128_byteshift and similar for other sizes. Remove the multiply by 8 from the header files. The previous names took the shift amount in bits to match gcc and required a multiply by 8 in the header. This creates a misleading error message when we check the range of the immediate to the builtin since the allowed range also got multiplied by 8. This commit changes the builtins to use a byte shift amount to match the underlying instruction and the Intel intrinsic. Fixes the remaining issue from PR37795. llvm-svn: 334773
*	Correct behavior of __builtin_*_overflow and constexpr.	Erich Keane	2018-06-13	1	-9/+21
\| \| \| \| \| \| \| \| \| \|	Enable these builtins to be called across a lambda boundary with captureless const/constexpr, as brought up by Eli here: https://reviews.llvm.org/D48040 Differential Revision: https://reviews.llvm.org/D48053 llvm-svn: 334597
*	[X86] Remove masking from avx512vbmi2 concat and shift by immediate ↵	Craig Topper	2018-06-13	1	-18/+18
\| \| \| \| \| \|	builtins. Use select builtins instead. llvm-svn: 334577
*	[AArch64] Corrected FP16 Intrinsic range checks in Clang + added Sema tests	Luke Geeson	2018-06-12	1	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This fixes the ranges for the vcvth family of FP16 intrinsics in the clang front end. Previously it was accepting incorrect ranges -Changed builtin range checking in SemaChecking -added tests SemaCheck changes - included in their own file since no similar one exists -modified existing tests to reflect new ranges Reviewers: SjoerdMeijer, javed.absar Reviewed By: SjoerdMeijer Subscribers: kristof.beyls, cfe-commits Differential Revision: https://reviews.llvm.org/D47592 llvm-svn: 334489
*	[X86] Properly account for the immediate being multiplied by 8 in the ↵	Craig Topper	2018-06-11	1	-1/+1
\| \| \| \| \| \| \| \|	immediate range checking for BI__builtin_ia32_psrldqi128 and friends. The limit was set to 1023 which only up to 1278. It needs to be 2047 to allow 2558. llvm-svn: 334416
*	[X86] Remove masking from dbpsadbw builtins, use select builtin instead.	Craig Topper	2018-06-11	1	-3/+3
\| \| \| \|	llvm-svn: 334385
*	[X86] Remove masking from the 512-bit packed floating point add/sub/mul/div ↵	Craig Topper	2018-06-10	1	-8/+8
\| \| \| \| \| \|	builtins. Use select in IR instead. llvm-svn: 334359
*	[X86] Fold masking into subvector extract builtins.	Craig Topper	2018-06-08	1	-12/+12
\| \| \| \| \| \| \| \|	I'm looking into making the select builtins require avx512f, avx512bw, or avx512vl since masking operations generally require those features. The extract builtins are funny because the 512-bit versions return a 128 or 256 bit vector with masking even when avx512vl is not supported. llvm-svn: 334330
*	[X86] Add builtins for vpermq/vpermpd instructions to enable target feature ↵	Craig Topper	2018-06-08	1	-0/+4
\| \| \| \| \| \|	checking. llvm-svn: 334311
*	[X86] Change immediate type for some builtins from char to int.	Craig Topper	2018-06-08	1	-0/+4
\| \| \| \| \| \|	These builtins are all handled by CGBuiltin.cpp so it doesn't much matter what the immediate type is, but int matches the intrinsic spec. llvm-svn: 334310
*	[X86] Add builtins for shufps and shufpd to enable target feature and ↵	Craig Topper	2018-06-08	1	-0/+6
\| \| \| \| \| \|	immediate range checking. llvm-svn: 334266
*	[X86] Add builtins for pshufd, pshuflw, and pshufhw to enable target feature ↵	Craig Topper	2018-06-08	1	-0/+9
\| \| \| \| \| \|	and immediate range checking. llvm-svn: 334265
*	[X86] Add subvector insert and extract builtins to enable target feature ↵	Craig Topper	2018-06-08	1	-0/+32
\| \| \| \| \| \| \| \|	checking and immediate range checking. Test changes are due to differences in how we generate undef elements now. We also changed the types used for extractf128_si256/insertf128_si256 to match the signature of the builtin that previously existed which this patch resurrects. This also matches gcc. llvm-svn: 334261
*	[X86] Add builtins for vpermilps/pd instructions to enable target feature ↵	Craig Topper	2018-06-08	1	-0/+6
\| \| \| \| \| \|	checking. llvm-svn: 334256
*	[X86] Add builtins for blend with immediate control to enforce target ↵	Craig Topper	2018-06-08	1	-0/+8
\| \| \| \| \| \|	feature requirements and check immediate range. llvm-svn: 334249
*	[X86] Add builtins for shuff32x4/shuff64x2/shufi32x4/shuff64x2 to enable ↵	Craig Topper	2018-06-07	1	-0/+8
\| \| \| \| \| \|	target feature checking and immediate range checking. llvm-svn: 334244
*	[MS] Re-add support for the ARM interlocked bittest intrinscs	Reid Kleckner	2018-06-07	1	-0/+41
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Adds support for these intrinsics, which are ARM and ARM64 only: _interlockedbittestandreset_acq _interlockedbittestandreset_rel _interlockedbittestandreset_nf _interlockedbittestandset_acq _interlockedbittestandset_rel _interlockedbittestandset_nf Refactor the bittest intrinsic handling to decompose each intrinsic into its action, its width, and its atomicity. llvm-svn: 334239
*	[X86] Add builtins for VALIGNQ/VALIGND to enable proper target feature checking.	Craig Topper	2018-06-07	1	-0/+6
\| \| \| \| \| \| \| \|	We still emit shufflevector instructions we just do it from CGBuiltin.cpp now. This ensures the intrinsics that use this are only available on CPUs that support the feature. I also added range checking to the immediate, but only checked it is 8 bits or smaller. We should maybe be stricter since we never use all 8 bits, but gcc doesn't seem to do that. llvm-svn: 334237
*	[X86] Add back builtins for _mm_slli_si128/_mm_srli_si128 and similar ↵	Craig Topper	2018-06-07	1	-0/+8
\| \| \| \| \| \| \| \| \| \|	intrinsics. We still lower them to native shuffle IR, but we do it in CGBuiltin.cpp now. This allows us to check the target feature and ensure the immediate fits in 8 bits. This also improves our -O0 codegen slightly because we're able to see the zeroinitializer in the shuffle. It looks like it got lost behind a store+load previously. llvm-svn: 334208
*	[X86] Add back _mask, _maskz, and _mask3 builtins for some 512-bit ↵	Craig Topper	2018-06-07	1	-4/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	fmadd/fmsub/fmaddsub/fmsubadd builtins. Summary: We recently switch to using a selects in the intrinsics header files for FMA instructions. But the 512-bit versions support flavors with rounding mode which must be an Integer Constant Expression. This has forced those intrinsics to be implemented as macros. As it stands now the mask and mask3 intrinsics evaluate one of their macro arguments twice. If that argument itself is another intrinsic macro, we can end up over expanding macros. Or if its something we can CSE later it would show up multiple times when it shouldn't. I tried adding __extension__ around the macro and making it an expression statement and declaring a local variable. But whatever name you choose for the local variable can never be used as the name of an input to the macro in user code. If that happens you would end up with the same name on the LHS and RHS of an assignment after expansion. We might be safe if we use __ in front of the variable names because those names are reserved and user code shouldn't use that, but I wasn't sure I wanted to make that claim. The other option which I've chosen here, is to add back _mask, _maskz, and _mask3 flavors of the builtin which we will expand in CGBuiltin.cpp to replicate the argument as needed and insert any fneg needed on the third operand to make a subtract. The _maskz isn't truly necessary if we have an unmasked version or if we use the masked version with a -1 mask and wrap a select around it. But I've chosen to make things more uniform. I separated out the scalar builtin handling to avoid too many things going on in EmitX86FMAExpr. It was different enough due to the extract and insert that the minor duplication of the CreateCall was probably worth it. Reviewers: tkrupa, RKSimon, spatel, GBuella Reviewed By: tkrupa Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D47724 llvm-svn: 334159
*	[X86] Add builtins for vector element insert and extract for different 128 ↵	Craig Topper	2018-06-06	1	-3/+24
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	and 256 bit vector types. Use them to implement the extract and insert intrinsics. Previously we were just using extended vector operations in the header file. This unfortunately allowed non-constant indices to be used with the intrinsics. This is incompatible with gcc, icc, and MSVC. It also introduces a different performance characteristic because non-constant index gets lowered to a vector store and an element sized load. By adding the builtins we can check for the index to be a constant and ensure its in range of the vector element count. User code still has the option to use extended vector operations themselves if they need non-constant indexing. llvm-svn: 334057