bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[APInt] Remove calls to clearUnusedBits from XorSlowCase and operator^=	Craig Topper	2017-01-24	2	-6/+32
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: There's a comment in XorSlowCase that says "0^0==1" which isn't true. 0 xored with 0 is still 0. So I don't think we need to clear any unused bits here. Now there is no difference between XorSlowCase and AndSlowCase/OrSlowCase other than the operation being performed Reviewers: majnemer, MatzeB, chandlerc, bkramer Reviewed By: MatzeB Subscribers: chfast, llvm-commits Differential Revision: https://reviews.llvm.org/D28986 llvm-svn: 292873
*	[PM] Try to make all three compilers happy when it comes to pretty printing.	Davide Italiano	2017-01-24	1	-14/+11
\| \| \| \| \| \| \|	Modeled after a similar change from Michael Kuperstein. Let's hope this sticks together. llvm-svn: 292872
*	LiveIntervalAnalysis: Calculate liveness even if a superreg is reserved.	Matthias Braun	2017-01-24	2	-6/+35
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	A register unit may be allocatable and non-reserved but some of the register(tuples) built with it are reserved. We still need to calculate liveness in this case. Note to out of tree targets: If you start seeing machine verifier errors with this commit, it probably means that you do not properly mark super registers of reserved register as reserved. See for example r292836 or r292870 for example on how to fix that. rdar://29996737 Differential Revision: https://reviews.llvm.org/D28881 llvm-svn: 292871
*	PowerPC: Mark super regs of reserved regs reserved.	Matthias Braun	2017-01-24	1	-35/+25
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	When a register like R1 is reserved, X1 should be reserved as well. This was already done "manually" when 64bit code was enabled, however using the markSuperRegs() function on the base register is more convenient and allows to use the checksAllSuperRegsMarked() function even in 32bit mode to avoid accidental breakage in the future. This is also necessary to allow https://reviews.llvm.org/D28881 Differential Revision: https://reviews.llvm.org/D29056 llvm-svn: 292870
*	[LTO] Add test to show up we don't support ThinLTO yet.	Davide Italiano	2017-01-24	1	-0/+13
\| \| \| \|	llvm-svn: 292865
*	[LTO] Teach lib/LTO about the new pass manager.	Davide Italiano	2017-01-24	2	-4/+70
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D28997 llvm-svn: 292864
*	[PM] Flesh out the new pass manager LTO pipeline.	Davide Italiano	2017-01-24	3	-12/+263
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D28996 llvm-svn: 292863
*	[sanitizer-coverage] emit __sanitizer_cov_trace_pc_guard w/o a preceding ↵	Kostya Serebryany	2017-01-24	3	-4/+4
\| \| \| \| \| \|	'if' by default. Update the docs, also add deprecation notes around other parts of sanitizer coverage llvm-svn: 292862
*	[APFloat] Add PPCDoubleDouble multiplication	Tim Shen	2017-01-24	2	-40/+256
\| \| \| \| \| \| \| \| \| \|	Reviewers: echristo, hfinkel, kbarton, iteratee Subscribers: mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D28382 llvm-svn: 292860
*	[WebAssembly] Update LibFunc::Func -> LibFunc	Derek Schuff	2017-01-24	2	-2/+2
\| \| \| \| \| \|	Fixes compile failures after r292848 llvm-svn: 292857
*	SimplifyLibCalls: Replace more unary libcalls with intrinsics	Matt Arsenault	2017-01-23	7	-154/+313
\| \| \| \|	llvm-svn: 292855
*	[LoopUnroll] First form LCSSA, then loop-simplify	Michael Kuperstein	2017-01-23	2	-18/+72
\| \| \| \| \| \| \| \| \| \| \| \| \|	Running non-LCSSA-preserving LoopSimplify followed by LCSSA on (roughly) the same loop is incorrect, since LoopSimplify may break LCSSA arbitrarily higher in the loop nest. Instead, run LCSSA first, and then run LCSSA-preserving LoopSimplify on the result. This fixes PR31718. Differential Revision: https://reviews.llvm.org/D29055 llvm-svn: 292854
*	[AMDGPU] Fix obsolete comments, spotted by Malcolm Parsons. (NFC)	Eugene Zelenko	2017-01-23	1	-3/+4
\| \| \| \|	llvm-svn: 292853
*	Makes promoteIndirectCall an external function.	Dehao Chen	2017-01-23	2	-17/+21
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: promoteIndirectCall should be a utility function that could be invoked by other optimization passes. Reviewers: davidxl Reviewed By: davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D29051 llvm-svn: 292850
*	[Analysis] Add LibFunc_ prefix to enums in TargetLibraryInfo. (NFC)	David L. Jones	2017-01-23	23	-1369/+1367
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The LibFunc::Func enum holds enumerators named for libc functions. Unfortunately, there are real situations, including libc implementations, where function names are actually macros (musl uses "#define fopen64 fopen", for example; any other transitively visible macro would have similar effects). Strictly speaking, a conforming C++ Standard Library should provide any such macros as functions instead (via <cstdio>). However, there are some "library" functions which are not part of the standard, and thus not subject to this rule (fopen64, for example). So, in order to be both portable and consistent, the enum should not use the bare function names. The old enum naming used a namespace LibFunc and an enum Func, with bare enumerators. This patch changes LibFunc to be an enum with enumerators prefixed with "LibFFunc_". (Unfortunately, a scoped enum is not sufficient to override macros.) There are additional changes required in clang. Reviewers: rsmith Subscribers: mehdi_amini, mzolotukhin, nemanjai, llvm-commits Differential Revision: https://reviews.llvm.org/D28476 llvm-svn: 292848
*	AMDGPU: Custom lower more vector operations	Matt Arsenault	2017-01-23	6	-18/+618
\| \| \| \| \| \|	This avoids stack usage. llvm-svn: 292846
*	[RDF] Add registers to live set even if they are live already	Krzysztof Parzyszek	2017-01-23	1	-6/+5
\| \| \| \| \| \| \| \|	When calculating kills, a register may be considered live because a part of it is live, but if there is a use of that (whole) register, the whole register (and its subregisters) need to be added to the live set. llvm-svn: 292845
*	[libFuzzer] mutate empty input using the regular mutators (instead of a ↵	Kostya Serebryany	2017-01-23	1	-14/+5
\| \| \| \| \| \|	custom dummy one). This way when we mutate an empty input there is a chance we will get a dictionary word llvm-svn: 292843
*	DAG: Don't fold vector extract into load if target doesn't want to	Matt Arsenault	2017-01-23	2	-0/+36
\| \| \| \| \| \| \|	Fixes turning a 32-bit scalar load into an extending vector load for AMDGPU when dynamically indexing a vector. llvm-svn: 292842
*	[InstSimplify] add tests to show missing folds from 'icmp (add nsw)'; NFC	Sanjay Patel	2017-01-23	1	-0/+169
\| \| \| \|	llvm-svn: 292841
*	Revert "Refactor SampleProfile.cpp to move computation inside a branch. (NFC)"	Evgeniy Stepanov	2017-01-23	1	-2/+2
\| \| \| \| \| \|	Causes MSan failures on the buildbot. llvm-svn: 292840
*	[APFloat] Switch from (PPCDoubleDoubleImpl, IEEEdouble) layout to ↵	Tim Shen	2017-01-23	4	-199/+744
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	(IEEEdouble, IEEEdouble) Summary: This patch changes the layout of DoubleAPFloat, and adjust all operations to do either: 1) (IEEEdouble, IEEEdouble) -> (uint64_t, uint64_t) -> PPCDoubleDoubleImpl, then run the old algorithm. 2) Do the right thing directly. 1) includes multiply, divide, remainder, mod, fusedMultiplyAdd, roundToIntegral, convertFromString, next, convertToInteger, convertFromAPInt, convertFromSignExtendedInteger, convertFromZeroExtendedInteger, convertToHexString, toString, getExactInverse. 2) includes makeZero, makeLargest, makeSmallest, makeSmallestNormalized, compare, bitwiseIsEqual, bitcastToAPInt, isDenormal, isSmallest, isLargest, isInteger, ilogb, scalbn, frexp, hash_value, Profile. I could split this into two patches, e.g. use 1) for all operatoins first, then incrementally change some of them to 2). I didn't do that, because 1) involves code that converts data between PPCDoubleDoubleImpl and (IEEEdouble, IEEEdouble) back and forth, and may pessimize the compiler. Instead, I find easy functions and use approach 2) for them directly. Next step is to implement move multiply and divide from 1) to 2). I don't have plans for other functions in 1). Differential Revision: https://reviews.llvm.org/D27872 llvm-svn: 292839
*	AMDGPU: Combine fp16/fp64 subtarget features	Matt Arsenault	2017-01-23	13	-63/+117
\| \| \| \| \| \| \|	The same control register controls both, and are set to the same defaults. Keep the old names around as aliases. llvm-svn: 292837
*	[Hexagon] Explicitly reserve aliases of reserved registers	Krzysztof Parzyszek	2017-01-23	2	-13/+22
\| \| \| \|	llvm-svn: 292836
*	[libFuzzer] make sure we use the feedback from std::string operator ==	Kostya Serebryany	2017-01-23	4	-1/+31
\| \| \| \|	llvm-svn: 292835
*	Add support for the x86_thread_state32_t and	Kevin Enderby	2017-01-23	5	-2/+171
\| \| \| \| \| \| \| \| \| \| \| \| \|	in llvm-objdump for Mach-O files add the printing of the x86_thread_state32_t in the same format as otool-classic(1) on darwin. To do this the 32-bit x86 general tread state needed to be defined in include/llvm/Support/MachO.h . rdar://30110111 llvm-svn: 292829
*	[AArch64][GlobalISel] Legalize narrow scalar fp->int conversions.	Ahmed Bougacha	2017-01-23	3	-9/+28
\| \| \| \| \| \| \| \| \|	Since we're now avoiding operations using narrow scalar integer types, we have to legalize the integer side of the FP conversions. This requires teaching the legalizer how to do that. llvm-svn: 292828
*	[AArch64][GlobalISel] Legalize narrow scalar ops again.	Ahmed Bougacha	2017-01-23	12	-456/+43
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Since r279760, we've been marking as legal operations on narrow integer types that have wider legal equivalents (for instance, G_ADD s8). Compared to legalizing these operations, this reduced the amount of extends/truncates required, but was always a weird legalization decision made at selection time. So far, we haven't been able to formalize it in a way that permits the selector generated from SelectionDAG patterns to be sufficient. Using a wide instruction (say, s64), when a narrower instruction exists (s32) would introduce register class incompatibilities (when one narrow generic instruction is selected to the wider variant, but another is selected to the narrower variant). It's also impractical to limit which narrow operations are matched for which instruction, as restricting "narrow selection" to ranges of types clashes with potentially incompatible instruction predicates. Concerns were also raised regarding MIPS64's sign-extended register assumptions, as well as wrapping behavior. See discussions in https://reviews.llvm.org/D26878. Instead, legalize the operations. Should we ever revert to selecting these narrow operations, we should try to represent this more accurately: for instance, by separating a "concrete" type on operations, and an "underlying" type on vregs, we could move the "this narrow-looking op is really legal" decision to the legalizer, and let the selector use the "underlying" vreg type only, which would be guaranteed to map to a register class. In any case, we eventually should mitigate: - the performance impact by selecting no-op extract/truncates to COPYs (which we currently do), and the COPYs to register reuses (which we don't do yet). - the compile-time impact by optimizing away extract/truncate sequences in the legalizer. llvm-svn: 292827
*	Attempt to fix the testcase in r292824	Steven Wu	2017-01-23	1	-31/+8
\| \| \| \| \| \| \|	Try fix the testcase r292824 (failing on some bots) by reduce it to the minimal. If this fix doesn't work, I will revert this test. llvm-svn: 292826
*	[ARM] Classification Improvements to ARM Sched-Models. NFCI.	Javed Absar	2017-01-23	6	-115/+363
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is a series of patches to enable adding of machine sched models for ARM processors easier and compact. They define new sched-readwrites for groups of ARM instructions. This has been missing so far, and as a consequence, machine scheduler models for individual sub-targets have tended to be larger than they needed to be. The current patch focuses on floating-point instructions. Reviewers: Diana Picus (rovka), Renato Golin (rengolin) Differential Revision: https://reviews.llvm.org/D28194 llvm-svn: 292825
*	Add LC_BUILD_VERSION load command	Steven Wu	2017-01-23	14	-1/+407
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Add a new load command LC_BUILD_VERSION. It is a generic version of LC_*_VERSION_MIN load_command used on Apple platforms. Instead of having a seperate load command for each platform, LC_BUILD_VERSION is recording platform info as an enum. It also records SDK version, min_os, and tools that used to build the binary. rdar://problem/29781291 Reviewers: enderby Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D29044 llvm-svn: 292824
*	[SLP] Additional test with extra args in horizontal reductions.	Alexey Bataev	2017-01-23	1	-0/+63
\| \| \| \|	llvm-svn: 292821
*	AMDGPU: Propagate fast math flags in fneg combines	Matt Arsenault	2017-01-23	1	-3/+3
\| \| \| \| \| \|	Can't for fma/mad since it seems they can't have flags currently. llvm-svn: 292818
*	Add unittests for empty bitvectors.	Matthias Braun	2017-01-23	1	-0/+37
\| \| \| \| \| \|	Addendum to r292575 llvm-svn: 292817
*	AMDGPU: Remove unnecessary check	Matt Arsenault	2017-01-23	1	-3/+0
\| \| \| \| \| \|	There are no scalar FP types that can be extended. llvm-svn: 292816
*	[PGO] add debug option to view annotated cfg after prof use annotation	Xinliang David Li	2017-01-23	2	-2/+36
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D28967 llvm-svn: 292815
*	DAG: Allow legalization of fcanonicalize vector types	Matt Arsenault	2017-01-23	2	-0/+217
\| \| \| \|	llvm-svn: 292814
*	[libFuzzer] deflake a test	Kostya Serebryany	2017-01-23	1	-0/+1
\| \| \| \|	llvm-svn: 292813
*	[InstSimplify] refactor finding limits for icmp with binop; NFCI	Sanjay Patel	2017-01-23	1	-105/+146
\| \| \| \|	llvm-svn: 292812
*	Refactor SampleProfile.cpp to move computation inside a branch. (NFC)	Dehao Chen	2017-01-23	1	-2/+2
\| \| \| \|	llvm-svn: 292803
*	Post-commit review feedback from dblaikie	Chris Bieneman	2017-01-23	1	-1/+1
\| \| \| \| \| \|	Use ASSERT_* instead of EXPECT_* for error condition. llvm-svn: 292798
*	[MemorySSA] Add new tests for invariant.groups	Piotr Padlewski	2017-01-23	1	-0/+182
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Next round of extra tests for MSSA. I have a prototype invariant.group handling implementation that fixes all the FIXMEs, and I think it will be easier to see what is the difference if I firstly post this, and then only fix fixits. Reviewers: george.burgess.iv, dberlin Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D29022 llvm-svn: 292797
*	[InstCombine][X86] Add MULDQ/MULUDQ constant folding support	Simon Pilgrim	2017-01-23	2	-27/+52
\| \| \| \|	llvm-svn: 292793
*	Tweak ASCII art in Simplify CFG. NFC	Amaury Sechet	2017-01-23	1	-1/+1
\| \| \| \|	llvm-svn: 292792
*	[SystemZ] Mark vector immediate load instructions with useful flags.	Jonas Paulsson	2017-01-23	2	-19/+23
\| \| \| \| \| \| \| \| \|	Vector immediate load instructions should have the isAsCheapAsAMove, isMoveImm and isReMaterializable flags set. With them, these instruction will get hoisted out of loops. Review: Ulrich Weigand llvm-svn: 292790
*	RuntimeDyldELF: add LDST128_ABS_LO12_NC reloc	Eugene Leviant	2017-01-23	2	-0/+9
\| \| \| \|	llvm-svn: 292788
*	RuntimeDyldELF: add LDST8_ABS_LO12_NC and LDST16_ABS_LO12_NC relocs	Eugene Leviant	2017-01-23	2	-4/+22
\| \| \| \| \| \|	Differential revision: https://reviews.llvm.org/D28863 llvm-svn: 292785
*	[InstCombine][X86] MULDQ/MULUDQ undef -> zero	Simon Pilgrim	2017-01-23	2	-8/+8
\| \| \| \| \| \|	Match generic mul behaviour so that <X x i64> multiply and muldq/muludq pattern act the same llvm-svn: 292784
*	[SLP] Additional test for SLP vectorizer with 31 reduction elements.	Alexey Bataev	2017-01-23	1	-0/+196
\| \| \| \|	llvm-svn: 292783
*	[InstCombine][SSE] Tests showing missed opportunities to constant fold ↵	Simon Pilgrim	2017-01-23	1	-0/+58
\| \| \| \| \| \|	PMULDQ/PMULUDQ llvm-svn: 292782