bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[OPENMP] Do not consider address constant vars as possibly	Alexey Bataev	2018-06-25	1	-1/+2
\| \| \| \| \| \| \| \| \|	threadprivate. Do not delay emission of the address constant variables in OpenMP mode as they cannot be defined as threadprivate. llvm-svn: 335483
*	[CodeGen] Provide source locations for UBSan type checks when emitting ↵	Igor Kudrin	2018-06-25	2	-8/+10
\| \| \| \| \| \| \| \|	constructor calls. Differential Revision: https://reviews.llvm.org/D48531 llvm-svn: 335445
*	[Coroutines] Less IR for noexcept await_resume	Brian Gesiak	2018-06-23	1	-8/+31
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: In his review of https://reviews.llvm.org/D45860, @GorNishanov suggested avoiding generating additional exception-handling IR in the case that the resume function was marked as 'noexcept', and exceptions could not occur. This implements that suggestion. Test Plan: `check-clang` Reviewers: GorNishanov, EricWF Reviewed By: GorNishanov Subscribers: cfe-commits, GorNishanov Differential Revision: https://reviews.llvm.org/D47673 llvm-svn: 335422
*	Re-land "[LTO] Enable module summary emission by default for regular LTO"	Tobias Edler von Koch	2018-06-22	2	-13/+32
\| \| \| \| \| \| \| \| \| \| \| \|	Since we are now producing a summary also for regular LTO builds, we need to run the NameAnonGlobals pass in those cases as well (the summary cannot handle anonymous globals). See https://reviews.llvm.org/D34156 for details on the original change. This reverts commit 6c9ee4a4a438a8059aacc809b2dd57128fccd6b3. llvm-svn: 335385
*	[OPENMP, NVPTX] Fix reduction of the big data types/structures.	Alexey Bataev	2018-06-22	1	-21/+115
\| \| \| \| \| \| \| \|	If the shuffle is required for the reduced structures/big data type, current code may cause compiler crash because of the loading of the aggregate values. Patch fixes this problem. llvm-svn: 335377
*	[X86] Lower _mm[256\|512]_cmp[.]_mask intrinsics to native llvm IR	Gabor Buella	2018-06-22	1	-91/+74
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Lowering some vector comparision builtins to fcmp IR instructions. This ignores the signaling behaviour specified in the predicate argument of said builtins. Affected AVX512 builtins: __builtin_ia32_cmpps128_mask __builtin_ia32_cmpps256_mask __builtin_ia32_cmpps512_mask __builtin_ia32_cmppd128_mask __builtin_ia32_cmppd256_mask __builtin_ia32_cmppd512_mask Reviewers: craig.topper, uriel.k, RKSimon, andrew.w.kaylor, spatel, scanon, efriedma Reviewed By: craig.topper, spatel, efriedma Differential Revision: https://reviews.llvm.org/D45616 llvm-svn: 335339
*	[X86] Update handling in CGBuiltin to be tolerant of out of range immediates.	Craig Topper	2018-06-21	1	-13/+29
\| \| \| \| \| \| \| \|	D48464 contains changes that will loosen some of the range checks in SemaChecking to a DefaultError warning that can be disabled. This patch adds explicit masking to avoid using the upper bits of immediates to gracefully handle the warning being disabled. llvm-svn: 335308
*	Ignore blacklist when generating __cfi_check_fail.	Evgeniy Stepanov	2018-06-21	1	-0/+5
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: Fixes PR37898. Reviewers: pcc, vlad.tsyrklevich Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D48454 llvm-svn: 335305
*	Revert "[LTO] Enable module summary emission by default for regular LTO"	Tobias Edler von Koch	2018-06-21	2	-29/+11
\| \| \| \| \| \| \| \| \| \| \|	This is breaking a couple of buildbots. We need to run the NameAnonGlobal pass for regular LTO now as well (since we're producing a summary). I'll post a separate patch for review to make this happen and then re-commit. This reverts commit c0759b7b1f4a81ff9021b952aa38a222d5fa4dfd. llvm-svn: 335291
*	[OPENMP, NVPTX] Fix globalization of the variables passed to orphaned	Alexey Bataev	2018-06-21	2	-49/+61
\| \| \| \| \| \| \| \| \| \|	parallel region. If the current construct requires sharing of the local variable in the inner parallel region, this variable must be globalized to avoid runtime crash. llvm-svn: 335285
*	[LTO] Enable module summary emission by default for regular LTO	Tobias Edler von Koch	2018-06-21	2	-11/+29
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: With D33921, we gained the ability to have module summaries in regular LTO modules without triggering ThinLTO compilation. Module summaries in regular LTO allow garbage collection (dead stripping) before LTO compilation and thus open up additional optimization opportunities. This patch enables summary emission in regular LTO for all targets except ld64-based ones (which use the legacy LTO API). Reviewers: pcc, tejohnson, mehdi_amini Subscribers: inglorion, eraman, cfe-commits Differential Revision: https://reviews.llvm.org/D34156 llvm-svn: 335284
*	[DebugInfo] Inline for without DebugLocation	Anastasis Grammenos	2018-06-21	1	-0/+6
\| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This test is a strip down version of a function inside the amalgamated sqlite source. When converted to IR clang produces a phi instruction without debug location. This patch fixes the above issue. Differential Revision: https://reviews.llvm.org/D47720 llvm-svn: 335255
*	[Fixed Point Arithmetic] Fixed Point Precision Bits and Fixed Point Literals	Leonard Chan	2018-06-20	2	-2/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This diff includes the logic for setting the precision bits for each primary fixed point type in the target info and logic for initializing a fixed point literal. Fixed point literals are declared using the suffixes ``` hr: short _Fract uhr: unsigned short _Fract r: _Fract ur: unsigned _Fract lr: long _Fract ulr: unsigned long _Fract hk: short _Accum uhk: unsigned short _Accum k: _Accum uk: unsigned _Accum ``` Errors are also thrown for illegal literal values ``` unsigned short _Accum u_short_accum = 256.0uhk; // expected-error{{the integral part of this literal is too large for this unsigned _Accum type}} ``` Differential Revision: https://reviews.llvm.org/D46915 llvm-svn: 335148
*	IRgen: Mark aliases of ctors and dtors as unnamed_addr.	Peter Collingbourne	2018-06-18	3	-12/+8
\| \| \| \| \| \| \| \| \|	This is not only semantically correct but ensures that they will not be marked as address-significant once D48155 lands. Differential Revision: https://reviews.llvm.org/D48206 llvm-svn: 334982
*	Fix a bug introduced by rL334850	Tomasz Krupa	2018-06-18	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: All *_sqrt_round_s[s\|d] intrinsics should execute a square root on zeroth element from B (Ops[1]) and insert in to A (Ops[0]), not the other way around. Reviewers: itaraban, craig.topper Reviewed By: craig.topper Subscribers: craig.topper, cfe-commits Differential Revision: https://reviews.llvm.org/D48288 llvm-svn: 334964
*	[OPENMP, NVPTX] Emit simple reduction if requested.	Alexey Bataev	2018-06-18	1	-0/+6
\| \| \| \| \| \| \|	If simple reduction is requested, use the simple reduction instead of the runtime functions calls. llvm-svn: 334962
*	Call CreateTempAllocaWithoutCast for ActiveFlag	Yaxun Liu	2018-06-16	1	-2/+2
\| \| \| \| \| \|	This is partial re-commit of r332982. llvm-svn: 334879
*	[X86] Lowering sqrt intrinsics to native IR	Tomasz Krupa	2018-06-15	1	-1/+50
\| \| \| \| \| \| \| \| \| \| \| \|	Reviewers: craig.topper, spatel, RKSimon, igorb, uriel.k Reviewed By: craig.topper Subscribers: tkrupa, cfe-commits Differential Revision: https://reviews.llvm.org/D41168 llvm-svn: 334850
*	[NFC] Add CreateMemTempWithoutCast and CreateTempAllocaWithoutCast	Yaxun Liu	2018-06-15	3	-34/+53
\| \| \| \| \| \|	This is partial re-commit of r332982 llvm-svn: 334837
*	[AArch64] Reverted rC334696 with Clang VCVTA test fix	Luke Geeson	2018-06-15	1	-0/+3
\| \| \| \|	llvm-svn: 334820
*	[X86] Rename __builtin_ia32_pslldqi128 to ↵	Craig Topper	2018-06-14	1	-10/+8
\| \| \| \| \| \| \| \| \| \| \| \|	__builtin_ia32_pslldqi128_byteshift and similar for other sizes. Remove the multiply by 8 from the header files. The previous names took the shift amount in bits to match gcc and required a multiply by 8 in the header. This creates a misleading error message when we check the range of the immediate to the builtin since the allowed range also got multiplied by 8. This commit changes the builtins to use a byte shift amount to match the underlying instruction and the Intel intrinsic. Fixes the remaining issue from PR37795. llvm-svn: 334773
*	[X86] Lowering Mask Scalar intrinsics to native IR (Clang part)	Tomasz Krupa	2018-06-14	1	-0/+29
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Lowering add, sub, mul, and div mask scalar intrinsic calls to native IR. Reviewers: craig.topper, RKSimon, spatel, sroland Reviewed By: craig.topper Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D47979 llvm-svn: 334741
*	[Fixed Point Arithmetic] Addition of the remaining fixed point types and ↵	Leonard Chan	2018-06-14	3	-0/+54
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	their saturated equivalents This diff includes changes for the remaining _Fract and _Sat fixed point types. ``` signed short _Fract s_short_fract; signed _Fract s_fract; signed long _Fract s_long_fract; unsigned short _Fract u_short_fract; unsigned _Fract u_fract; unsigned long _Fract u_long_fract; // Aliased fixed point types short _Accum short_accum; _Accum accum; long _Accum long_accum; short _Fract short_fract; _Fract fract; long _Fract long_fract; // Saturated fixed point types _Sat signed short _Accum sat_s_short_accum; _Sat signed _Accum sat_s_accum; _Sat signed long _Accum sat_s_long_accum; _Sat unsigned short _Accum sat_u_short_accum; _Sat unsigned _Accum sat_u_accum; _Sat unsigned long _Accum sat_u_long_accum; _Sat signed short _Fract sat_s_short_fract; _Sat signed _Fract sat_s_fract; _Sat signed long _Fract sat_s_long_fract; _Sat unsigned short _Fract sat_u_short_fract; _Sat unsigned _Fract sat_u_fract; _Sat unsigned long _Fract sat_u_long_fract; // Aliased saturated fixed point types _Sat short _Accum sat_short_accum; _Sat _Accum sat_accum; _Sat long _Accum sat_long_accum; _Sat short _Fract sat_short_fract; _Sat _Fract sat_fract; _Sat long _Fract sat_long_fract; ``` This diff only allows for declaration of these fixed point types. Assignment and other operations done on fixed point types according to http://www.open-std.org/jtc1/sc22/wg14/www/docs/n1169.pdf will be added in future patches. Differential Revision: https://reviews.llvm.org/D46911 llvm-svn: 334718
*	[AArch64] reverting rC334693 due to build failures	Luke Geeson	2018-06-14	1	-3/+0
\| \| \| \|	llvm-svn: 334696
*	[AArch64] Added support for the vcvta_u16_f16 instrinsic for FP16 Armv8.2-A	Luke Geeson	2018-06-14	1	-0/+3
\| \| \| \|	llvm-svn: 334693
*	[COFF] Add ARM64 intrinsics: __yield, __wfe, __wfi, __sev, __sevl	Mandeep Singh Grang	2018-06-13	1	-0/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: These intrinsics result in hint instructions. They are provided here for MSVC ARM64 compatibility. Reviewers: mstorsjo, compnerd, javed.absar Reviewed By: mstorsjo Subscribers: kristof.beyls, chrib, cfe-commits Differential Revision: https://reviews.llvm.org/D48132 llvm-svn: 334639
*	Add -fforce-emit-vtables	Piotr Padlewski	2018-06-13	1	-3/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: In many cases we can't devirtualize because definition of vtable is not present. Most of the time it is caused by inline virtual function not beeing emitted. Forcing emitting of vtable adds a reference of these inline virtual functions. Note that GCC was always doing it. Reviewers: rjmccall, rsmith, amharc, kuhar Subscribers: llvm-commits, cfe-commits Differential Revision: https://reviews.llvm.org/D47108 Co-authored-by: Krzysztof Pszeniczny <krzysztof.pszeniczny@gmail.com> llvm-svn: 334600
*	Fix crash emitting transparent list initializer for a large aggregate.	Richard Smith	2018-06-13	1	-0/+2
\| \| \| \|	llvm-svn: 334565
*	[CUDA][HIP] Set kernel calling convention before arrange function	Yaxun Liu	2018-06-12	4	-7/+19
\| \| \| \| \| \| \| \| \| \| \| \| \|	Currently clang set kernel calling convention for CUDA/HIP after arranging function, which causes incorrect kernel function type since it depends on calling convention. This patch moves setting kernel convention before arranging function. Differential Revision: https://reviews.llvm.org/D47733 llvm-svn: 334457
*	[X86] Fix operand order in the shuffle created for blend builtins.	Craig Topper	2018-06-11	1	-1/+1
\| \| \| \| \| \|	This was broken when the builtin was added in r334249. llvm-svn: 334422
*	[MS] Use mangled names and comdats for string merging with ASan	Reid Kleckner	2018-06-11	1	-7/+5
\| \| \| \| \| \| \| \|	This should reduce the binary size penalty of ASan on Windows. After r334313, ASan will add red zones to globals in comdats, so we will still find OOB accesses to string literals. llvm-svn: 334417
*	[X86] Use target independent masked expandload and compressstore intrinsics ↵	Craig Topper	2018-06-10	1	-0/+74
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	to implement expandload/compressstore builtins. Summary: We've had these target independent intrinsics for at least a year and a half. Looks like they do exactly what we need here and the backend already supports them. Reviewers: RKSimon, delena, spatel, GBuella Reviewed By: RKSimon Subscribers: cfe-commits, llvm-commits Differential Revision: https://reviews.llvm.org/D47693 llvm-svn: 334366
*	[NEON] Support VST1xN intrinsics in AArch32 mode (Clang part)	Ivan A. Kosarev	2018-06-10	1	-28/+29
\| \| \| \| \| \| \| \| \|	We currently support them only in AArch64. The NEON Reference, however, says they are 'ARMv7, ARMv8' intrinsics. Differential Revision: https://reviews.llvm.org/D47446 llvm-svn: 334362
*	Use SmallPtrSet instead of SmallSet in places where we iterate over the set.	Craig Topper	2018-06-09	1	-1/+1
\| \| \| \| \| \| \| \|	SmallSet forwards to SmallPtrSet for pointer types. SmallPtrSet supports iteration, but a normal SmallSet doesn't. So if it wasn't for the forwarding, this wouldn't work. These places were found by hiding the begin/end methods in the SmallSet forwarding. llvm-svn: 334339
*	[X86] Add back some masked vector truncate builtins. Custom IRgen a a few ↵	Craig Topper	2018-06-08	1	-0/+29
\| \| \| \| \| \| \| \| \| \|	others. I'd like to make the select builtins require an avx512f, avx512bw, or avx512vl fature to match what is normally required to get masking. Truncate is special in that there are instructions with a 128/256-bit masked result even without avx512vl. By using special buitlins we can emit a select without using the 128/256-bit select builtins. llvm-svn: 334331
*	[X86] Fold masking into subvector extract builtins.	Craig Topper	2018-06-08	1	-16/+21
\| \| \| \| \| \| \| \|	I'm looking into making the select builtins require avx512f, avx512bw, or avx512vl since masking operations generally require those features. The extract builtins are funny because the 512-bit versions return a 128 or 256 bit vector with masking even when avx512vl is not supported. llvm-svn: 334330
*	[X86] Add builtins for vpermq/vpermpd instructions to enable target feature ↵	Craig Topper	2018-06-08	1	-0/+18
\| \| \| \| \| \|	checking. llvm-svn: 334311
*	[CUDA] Fix emission of constant strings in sections	Jonas Hahnfeld	2018-06-08	1	-1/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	CGM.GetAddrOfConstantCString() sets the adress of the created GlobalValue to unnamed. When emitting the object file LLVM will mark the surrounding section as SHF_MERGE iff the string is nul-terminated and contains no other nuls (see IsNullTerminatedString). This results in problems when saving temporaries because LLVM doesn't set an EntrySize, so reading in the serialized assembly file fails. This never happened for the GPU binaries because they usually contain a nul-character somewhere. Instead this only affected the module ID when compiling relocatable device code. However, this points to a potentially larger problem: If we put a constant string into a named section, we really want the data to end up in that section in the object file. To avoid LLVM merging sections this patch unmarks the GlobalVariable's address as unnamed which also fixes the problem of invalid serialized assembly files when saving temporaries. Differential Revision: https://reviews.llvm.org/D47902 llvm-svn: 334281
*	[X86] Add builtins for shufps and shufpd to enable target feature and ↵	Craig Topper	2018-06-08	1	-0/+30
\| \| \| \| \| \|	immediate range checking. llvm-svn: 334266
*	[X86] Add builtins for pshufd, pshuflw, and pshufhw to enable target feature ↵	Craig Topper	2018-06-08	1	-0/+51
\| \| \| \| \| \|	and immediate range checking. llvm-svn: 334265
*	[X86] Add subvector insert and extract builtins to enable target feature ↵	Craig Topper	2018-06-08	1	-0/+69
\| \| \| \| \| \| \| \|	checking and immediate range checking. Test changes are due to differences in how we generate undef elements now. We also changed the types used for extractf128_si256/insertf128_si256 to match the signature of the builtin that previously existed which this patch resurrects. This also matches gcc. llvm-svn: 334261
*	[X86] Add builtins for vpermilps/pd instructions to enable target feature ↵	Craig Topper	2018-06-08	1	-0/+27
\| \| \| \| \| \|	checking. llvm-svn: 334256
*	[CodeGen] Always use MSVC personality for windows-msvc targets	Shoaib Meenai	2018-06-08	1	-6/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The windows-msvc target is meant to be ABI compatible with MSVC, including the exception handling. Ensure that a windows-msvc triple always equates to the MSVC personality being used. This mostly affects the GNUStep and ObjFW Obj-C runtimes. To the best of my knowledge, those are normally not used with windows-msvc triples. I believe WinObjC is based on GNUStep (or it at least uses libobjc2), but that also takes the approach of wrapping Obj-C exceptions in C++ exceptions, so the MSVC personality function is the right one to use there as well. Differential Revision: https://reviews.llvm.org/D47862 llvm-svn: 334253
*	[X86] Add builtins for blend with immediate control to enforce target ↵	Craig Topper	2018-06-08	1	-0/+21
\| \| \| \| \| \|	feature requirements and check immediate range. llvm-svn: 334249
*	[X86] Add builtins for shuff32x4/shuff64x2/shufi32x4/shuff64x2 to enable ↵	Craig Topper	2018-06-07	1	-0/+29
\| \| \| \| \| \|	target feature checking and immediate range checking. llvm-svn: 334244
*	[MS] Re-add support for the ARM interlocked bittest intrinscs	Reid Kleckner	2018-06-07	1	-68/+117
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Adds support for these intrinsics, which are ARM and ARM64 only: _interlockedbittestandreset_acq _interlockedbittestandreset_rel _interlockedbittestandreset_nf _interlockedbittestandset_acq _interlockedbittestandset_rel _interlockedbittestandset_nf Refactor the bittest intrinsic handling to decompose each intrinsic into its action, its width, and its atomicity. llvm-svn: 334239
*	[X86] Add builtins for VALIGNQ/VALIGND to enable proper target feature checking.	Craig Topper	2018-06-07	1	-0/+20
\| \| \| \| \| \| \| \|	We still emit shufflevector instructions we just do it from CGBuiltin.cpp now. This ensures the intrinsics that use this are only available on CPUs that support the feature. I also added range checking to the immediate, but only checked it is 8 bits or smaller. We should maybe be stricter since we never use all 8 bits, but gcc doesn't seem to do that. llvm-svn: 334237
*	[X86] Add back builtins for _mm_slli_si128/_mm_srli_si128 and similar ↵	Craig Topper	2018-06-07	1	-0/+62
\| \| \| \| \| \| \| \| \| \|	intrinsics. We still lower them to native shuffle IR, but we do it in CGBuiltin.cpp now. This allows us to check the target feature and ensure the immediate fits in 8 bits. This also improves our -O0 codegen slightly because we're able to see the zeroinitializer in the shuffle. It looks like it got lost behind a store+load previously. llvm-svn: 334208
*	[CodeGen] Improve diagnostics related to target attributes	Gabor Buella	2018-06-07	3	-10/+30
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: When requirement imposed by __target__ attributes on functions are not satisfied, prefer printing those requirements, which are explicitly mentioned in the attributes. This makes such messages more useful, e.g. printing avx512f instead of avx2 in the following scenario: ``` $ cat foo.c static inline void __attribute__((__always_inline__, __target__("avx512f"))) x(void) { } int main(void) { x(); } $ clang foo.c foo.c:7:2: error: always_inline function 'x' requires target feature 'avx2', but would be inlined into function 'main' that is compiled without support for 'avx2' x(); ^ 1 error generated. ``` bugzilla: https://bugs.llvm.org/show_bug.cgi?id=37338 Reviewers: craig.topper, echristo, dblaikie Reviewed By: craig.topper, echristo Differential Revision: https://reviews.llvm.org/D46541 llvm-svn: 334174
*	[X86] Add back _mask, _maskz, and _mask3 builtins for some 512-bit ↵	Craig Topper	2018-06-07	1	-61/+112
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	fmadd/fmsub/fmaddsub/fmsubadd builtins. Summary: We recently switch to using a selects in the intrinsics header files for FMA instructions. But the 512-bit versions support flavors with rounding mode which must be an Integer Constant Expression. This has forced those intrinsics to be implemented as macros. As it stands now the mask and mask3 intrinsics evaluate one of their macro arguments twice. If that argument itself is another intrinsic macro, we can end up over expanding macros. Or if its something we can CSE later it would show up multiple times when it shouldn't. I tried adding __extension__ around the macro and making it an expression statement and declaring a local variable. But whatever name you choose for the local variable can never be used as the name of an input to the macro in user code. If that happens you would end up with the same name on the LHS and RHS of an assignment after expansion. We might be safe if we use __ in front of the variable names because those names are reserved and user code shouldn't use that, but I wasn't sure I wanted to make that claim. The other option which I've chosen here, is to add back _mask, _maskz, and _mask3 flavors of the builtin which we will expand in CGBuiltin.cpp to replicate the argument as needed and insert any fneg needed on the third operand to make a subtract. The _maskz isn't truly necessary if we have an unmasked version or if we use the masked version with a -1 mask and wrap a select around it. But I've chosen to make things more uniform. I separated out the scalar builtin handling to avoid too many things going on in EmitX86FMAExpr. It was different enough due to the extract and insert that the minor duplication of the CreateCall was probably worth it. Reviewers: tkrupa, RKSimon, spatel, GBuella Reviewed By: tkrupa Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D47724 llvm-svn: 334159