bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[AMDGPU] Ported and adopted AMDLibCalls pass	Stanislav Mekhanoshin	2017-08-11	6	-6/+2975
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The pass does simplifications of well known AMD library calls. If given -amdgpu-prelink option it works in a pre-link mode which allows to reference new library functions which will be linked in later. In addition it also used to process traditional AMD option -fuse-native which allows to replace some of the functions with their fast native implementations from the library. The necessary glue to pass the prelink option and translate -fuse-native is to be added to the driver. Differential Revision: https://reviews.llvm.org/D36436 llvm-svn: 310731
*	[AVX512] Remove and autoupgrade many of the broadcast intrinsics	Craig Topper	2017-08-11	3	-42/+24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This autoupgrades most of the broadcast intrinsics. They've been unused in clang for some time. This leaves the 32x2 intrinsics because they are still used in clang. Reviewers: RKSimon, zvi, igorb Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D36606 llvm-svn: 310725
*	[x86] Enable some support for lowerVectorShuffleWithUndefHalf with AVX-512	Craig Topper	2017-08-11	1	-2/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This teaches 512-bit shuffles to detect unused halfs in order to reduce shuffle size. We may need to refine the 512-bit exit point. I couldn't remember if we had good cross lane shuffles for 8/16 bit with AVX-512 or not. I believe this is step towards being able to handle D36454 without a special case. From here we need to improve our ability to combine extract_subvector with insert_subvector and other extract_subvectors. And we need to support narrowing binary operations where we don't demand all elements. This may be improvements to DAGCombiner::narrowExtractedVectorBinOp(by recognizing an insert_subvector in addition to concat) or we may need a target specific combiner. Reviewers: RKSimon, zvi, delena, jbhateja Reviewed By: RKSimon, jbhateja Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D36601 llvm-svn: 310724
*	[x86] use more shift or LEA for select-of-constants (2nd try)	Sanjay Patel	2017-08-11	2	-64/+38
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The previous rev (r310208) failed to account for overflow when subtracting the constants to see if they're suitable for shift/lea. This version add a check for that and more test were added in r310490. We can convert any select-of-constants to math ops: http://rise4fun.com/Alive/d7d For this patch, I'm enhancing an existing x86 transform that uses fake multiplies (they always become shl/lea) to avoid cmov or branching. The current code misses cases where we have a negative constant and a positive constant, so this is just trying to plug that hole. The DAGCombiner diff prevents us from hitting a terrible inefficiency: we can start with a select in IR, create a select DAG node, convert it into a sext, convert it back into a select, and then lower it to sext machine code. Some notes about the test diffs: 1. 2010-08-04-MaskedSignedCompare.ll - We were creating control flow that didn't exist in the IR. 2. memcmp.ll - Choose -1 or 1 is the case that got me looking at this again. We could avoid the push/pop in some cases if we used 'movzbl %al' instead of an xor on a different reg? That's a post-DAG problem though. 3. mul-constant-result.ll - The trade-off between sbb+not vs. setne+neg could be addressed if that's a regression, but those would always be nearly equivalent. 4. pr22338.ll and sext-i1.ll - These tests have undef operands, so we don't actually care about these diffs. 5. sbb.ll - This shows a win for what is likely a common case: choose -1 or 0. 6. select.ll - There's another borderline case here: cmp+sbb+or vs. test+set+lea? Also, sbb+not vs. setae+neg shows up again. 7. select_const.ll - These are motivating cases for the enhancement; replace cmov with cheaper ops. Assembly differences between movzbl and xor to avoid a partial reg stall are caused later by the X86 Fixup SetCC pass. Differential Revision: https://reviews.llvm.org/D35340 llvm-svn: 310717
*	[globalisel][tablegen] Support zero-instruction emission.	Daniel Sanders	2017-08-11	1	-1/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Support the case where an operand of a pattern is also the whole of the result pattern. In this case the original result and all its uses must be replaced by the operand. However, register class restrictions can require a COPY. This patch handles both cases by always emitting the copy and leaving it for the register allocator to optimize. Depends on D35833 Reviewers: ab, t.p.northover, qcolombet, rovka, aditya_nandakumar Subscribers: javed.absar, kristof.beyls, igorb, llvm-commits Differential Revision: https://reviews.llvm.org/D36084 llvm-svn: 310716
*	[mips] Lift the assertion on the types that can be used with MipsGPRel	Simon Dardis	2017-08-11	4	-10/+27
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Post commit review of rL308619 highlighted the need for handling N64 with -fno-pic. Testing reveale a stale assert when generating a GP relative addressing mode. This patch removes that assert and adds the necessary patterns for MIPS64 to perform gp relative addressing with -fno-pic (and the implicit -mno-abicalls + -mgpopt). Reviewers: atanasyan, nitesh.jain Differential Revision: https://reviews.llvm.org/D36472 llvm-svn: 310713
*	[cmake] Expose the dependencies of ExecutionEngine as PUBLIC	Michal Gorny	2017-08-11	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Expose the dependencies of LLVMExecutionEngine library as PUBLIC rather than PRIVATE when building a shared library. This is necessary because the library is not contained but exposes API of other LLVM libraries via its headers. This causes other libraries to fail to link if the linker verifies for correctness of -l flags (i.e. fails on indirect dependencies). This e.g. happens when building LLDB against shared LLVM: lib64/liblldbExpression.a(IRExecutionUnit.cpp.o):(.data.rel.ro._ZTIN4llvm18MCJITMemoryManagerE[_ZTIN4llvm18MCJITMemoryManagerE]+0x10): undefined reference to `typeinfo for llvm::RuntimeDyld::MemoryManager' lib64/liblldbExpression.a(IRExecutionUnit.cpp.o):(.data.rel.ro._ZTVN4llvm18MCJITMemoryManagerE[_ZTVN4llvm18MCJITMemoryManagerE]+0x60): undefined reference to `llvm::RuntimeDyld::MemoryManager::anchor()' lib64/liblldbExpression.a(IRExecutionUnit.cpp.o):(.data.rel.ro._ZTVN12lldb_private15IRExecutionUnit13MemoryManagerE[_ZTVN12lldb_private15IRExecutionUnit13MemoryManagerE]+0x48): undefined reference to `llvm::RTDyldMemoryManager::deregisterEHFrames()' lib64/liblldbExpression.a(IRExecutionUnit.cpp.o):(.data.rel.ro._ZTVN12lldb_private15IRExecutionUnit13MemoryManagerE[_ZTVN12lldb_private15IRExecutionUnit13MemoryManagerE]+0x60): undefined reference to `llvm::RuntimeDyld::MemoryManager::anchor()' lib64/liblldbExpression.a(IRExecutionUnit.cpp.o):(.data.rel.ro._ZTVN12lldb_private15IRExecutionUnit13MemoryManagerE[_ZTVN12lldb_private15IRExecutionUnit13MemoryManagerE]+0xd0): undefined reference to `llvm::JITSymbolResolver::anchor()' collect2: error: ld returned 1 exit status Declaring the dependencies as PUBLIC guarantees that any package using the ExecutionEngine library will also get explicit -l flags for the dependent libraries guaranteeing that the symbols exposed in headers could be resolved. Patch originally written by NAKAMURA Takumi. Differential Revision: https://reviews.llvm.org/D36211 llvm-svn: 310712
*	Improve handling of insert_subvector of bitcast values	Nirav Dave	2017-08-11	1	-0/+35
\| \| \| \| \| \| \| \| \| \| \| \|	Fix insert_subvector / extract_subvector merges of bitcast values. Reviewers: efriedma, craig.topper, RKSimon Subscribers: RKSimon, llvm-commits Differential Revision: https://reviews.llvm.org/D34571 llvm-svn: 310711
*	[X86][DAG] Switch X86 Target to post-legalized store merge	Nirav Dave	2017-08-11	3	-1/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Move store merge to happen after intrinsic lowering to allow lowered stores to be merged. Some regressions due in MergeConsecutiveStores to missing insert_subvector that are addressed in follow up patch. Reviewers: craig.topper, efriedma, RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34559 llvm-svn: 310710
*	[AArch64] Enable ARMv8.3-A pointer authentication	Sam Parker	2017-08-11	8	-7/+294
\| \| \| \| \| \| \| \| \|	Add assembler and disassembler support for the ARMv8.3-A pointer authentication instructions. Differential Revision: https://reviews.llvm.org/D36517 llvm-svn: 310709
*	[ARM] Assembler support for the ARMv8.2a dot product instructions	Sjoerd Meijer	2017-08-11	8	-2/+85
\| \| \| \| \| \| \| \| \|	Commit r310480 added the AArch64 ARMv8.2a dot product instructions; this adds the AArch32 instructions. Differential Revision: https://reviews.llvm.org/D36575 llvm-svn: 310701
*	[DAGCombiner] Remove shuffle support from simplifyShuffleMask	Simon Pilgrim	2017-08-11	1	-2/+0
\| \| \| \| \| \| \| \|	rL310372 enabled simplifyShuffleMask to support undef shuffle mask inputs, but its causing hangs. Removing support until I can triage the problem llvm-svn: 310699
*	[IfConversion] Maintain the CFG when predicating/merging blocks in IfConvert*	Mikael Holmen	2017-08-11	1	-38/+21
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This fixes PR32721 in IfConvertTriangle and possible similar problems in IfConvertSimple, IfConvertDiamond and IfConvertForkedDiamond. In PR32721 we had a triangle EBB \| \ \| \| \| TBB \| / FBB where FBB didn't have any successors at all since it ended with an unconditional return. Then TBB and FBB were be merged into EBB, but EBB would still keep its successors, and the use of analyzeBranch and CorrectExtraCFGEdges wouldn't help to remove them since the return instruction is not analyzable (at least not on ARM). The edge updating code and branch probability updating code is now pushed into MergeBlocks() which allows us to share the same update logic between more callsites. This lets us remove several dependencies on analyzeBranch and completely eliminate RemoveExtraEdges. One thing that showed up with this patch was that IfConversion sometimes left a successor with 0% probability even if there was no branch or fallthrough to the successor. One such example from the test case ifcvt_bad_zero_prob_succ.mir. The indirect branch tBRIND can only jump to bb.1, but without the patch we got: bb.0: successors: %bb.1(0x80000000) bb.1: successors: %bb.1(0x80000000), %bb.2(0x00000000) tBRIND %r1, 1, %cpsr B %bb.1 bb.2: There is no way to jump from bb.1 to bb2, but still there is a 0% edge from bb.1 to bb.2. With the patch applied we instead get the expected: bb.0: successors: %bb.1(0x80000000) bb.1: successors: %bb.1(0x80000000) tBRIND %r1, 1, %cpsr B %bb.1 Since bb.2 had no predecessor at all, it was removed. Several testcases had to be updated due to this since the removed successor made the "Branch Probability Basic Block Placement" pass sometimes place blocks in a different order. Finally added a couple of new test cases: * PR32721_ifcvt_triangle_unanalyzable.mir: Regression test for the original problem dexcribed in PR 32721. * ifcvt_triangleWoCvtToNextEdge.mir: Regression test for problem that caused a revert of my first attempt to solve PR 32721. * ifcvt_simple_bad_zero_prob_succ.mir: Test case showing the problem where a wrong successor with 0% probability was previously left. * ifcvt_[diamond\|forked_diamond\|simple]_unanalyzable.mir Very simple test cases for the simple and (forked) diamond cases involving unanalyzable branches that can be nice to have as a base if wanting to write more complicated tests. Reviewers: iteratee, MatzeB, grosser, kparzysz Reviewed By: kparzysz Subscribers: kbarton, davide, aemerson, nemanjai, javed.absar, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D34099 llvm-svn: 310697
*	[PM] Switch the CGSCC debug messages to use the standard LLVM debug	Chandler Carruth	2017-08-11	2	-36/+27
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	printing techniques with a DEBUG_TYPE controlling them. It was a mistake to start re-purposing the pass manager `DebugLogging` variable for generic debug printing -- those logs are intended to be very minimal and primarily used for testing. More detailed and comprehensive logging doesn't make sense there (it would only make for brittle tests). Moreover, we kept forgetting to propagate the `DebugLogging` variable to various places making it also ineffective and/or unavailable. Switching to `DEBUG_TYPE` makes this a non-issue. llvm-svn: 310695
*	[MachineOutliner] Add RegState::Define to LDRXpost in insertOutlinedCall	Jessica Paquette	2017-08-10	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \|	This fixes a MachineVerifier failure in machine-outliner.mir. Not explicitly adding RegState::Define to the LR argument makes it unhappy because an explicit definition is marked as a use. Build failure: http://lab.llvm.org:8080/green/job/clang-stage1-cmake-RA-expensive/7496/testReport/junit/LLVM/CodeGen_AArch64/machine_outliner_mir/ llvm-svn: 310671
*	Revert "[AsmParser] Hash is not a comment on some targets"	Ahmed Bougacha	2017-08-10	2	-0/+18
\| \| \| \| \| \| \| \|	This reverts commit r310457. It causes clang-produced IR to fail llvm codegen. llvm-svn: 310662
*	Revert "[DAG] Cleanup unused nodes after store merge. NFCI."	Nirav Dave	2017-08-10	1	-11/+1
\| \| \| \| \| \|	This reverts commit r310648 which causes an unexpected assertion failure llvm-svn: 310659
*	[InstCombine] Make (X\|C1)^C2 -> X^(C1^C2) iff X&~C1 == 0 work for splat vectors	Craig Topper	2017-08-10	1	-23/+18
\| \| \| \| \| \| \| \|	This also corrects the description to match what was actually implemented. The old comment said X^(C1\|C2), but it implemented X^((C1\|C2)&~(C1&C2)). I believe ((C1\|C2)&~(C1&C2)) is equivalent to (C1^C2). Differential Revision: https://reviews.llvm.org/D36505 llvm-svn: 310658
*	[DAG] Relax type restriction for store merge	Nirav Dave	2017-08-10	1	-24/+64
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Allow stores of bitcastable types to be merged by peeking through BITCAST nodes and recasting stored values constant and vector extract nodes as necessary. Reviewers: jyknight, hfinkel, efriedma, RKSimon, spatel Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34569 llvm-svn: 310655
*	[CostModel][X86] Add SSE2 two-src shuffle costs	Simon Pilgrim	2017-08-10	1	-0/+2
\| \| \| \|	llvm-svn: 310654
*	[ARM] Clarify legal addressing modes for ARM and Thumb2. NFC	Eli Friedman	2017-08-10	1	-3/+11
\| \| \| \| \| \| \| \| \|	The existing code is very clever, but not clear, which seems like the wrong tradeoff here. Differential Revision: https://reviews.llvm.org/D36559 llvm-svn: 310653
*	[CostModel][X86] Add avx1 two-src shuffle costs	Simon Pilgrim	2017-08-10	1	-0/+9
\| \| \| \|	llvm-svn: 310650
*	[DAG] Cleanup unused nodes after store merge. NFCI.	Nirav Dave	2017-08-10	1	-1/+11
\| \| \| \|	llvm-svn: 310648
*	[CostModel][X86] Add avx2 two-src shuffle costs	Simon Pilgrim	2017-08-10	1	-2/+11
\| \| \| \|	llvm-svn: 310645
*	Make .file directive to have basename only	Taewook Oh	2017-08-10	1	-1/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Currently LLVM puts directory along with the filename in .file directive, but this behavior doesn't match gcc. There's a no clear description about which one is right (https://sourceware.org/binutils/docs/as/File.html#File), but one document (https://sourceware.org/gdb/current/onlinedocs/stabs/ELF-Linker-Relocation.html) suggests that STT_FILE symbol in elf file is expected to have basename only, which should have a same sting file .file directive according to (https://docs.oracle.com/cd/E26502_01/html/E28388/eoiyg.html). This also affects badly on the build system that uses hashing, as the directory info could be differnt from developer to developer even when they're working on same file. Reviewers: pcc, mehdi_amini Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D36018 llvm-svn: 310642
*	[InstCombine] Fix a crash in getSelectCondition if we happen to have two ↵	Craig Topper	2017-08-10	1	-2/+3
\| \| \| \| \| \| \| \|	inverse vectors of i1 constants. We used to try to truncate the constant vector to vXi1, but if it's already i1 this would fail. Instead we now use IRBuilder::getZExtOrTrunc which should check the type and only create a trunc if needed. I believe this should trigger constant folding in the IRBuilder and ultimately do the same thing just with the additional type check. llvm-svn: 310639
*	[InstCombine] Add a DEBUG_COUNTER to InstCombine to limit how many ↵	Craig Topper	2017-08-10	1	-0/+6
\| \| \| \| \| \| \| \| \| \| \| \|	instructions are visited for debug Sometimes it would be nice to stop InstCombine mid way through its combining to see the current IR. By using a debug counter we can place an upper limit on how many instructions to process. This will also allow skipping the first X combines, but that has the potential to change later combines since earlier canonicalizations might have been skipped. Differential Revision: https://reviews.llvm.org/D36553 llvm-svn: 310638
*	[DebugCounter] Move the semicolon out of the DEBUG_COUNTER macro and require ↵	Craig Topper	2017-08-10	2	-4/+5
\| \| \| \| \| \| \| \| \| \|	it to be placed at the end of each use. This make it consistent with STATISTIC which it will often appears near. While there move one DEBUG_COUNTER instance out of an anonymous namespace. It's already declaring a static variable so the namespace is unnecessary. llvm-svn: 310637
*	[CostModel][X86] Improve single src shuffle costs	Simon Pilgrim	2017-08-10	1	-11/+36
\| \| \| \| \| \|	Add missing SK_PermuteSingleSrc costs for AVX2 targets and earlier, also added some of the simpler SK_PermuteTwoSrc costs to support splitting of SK_PermuteSingleSrc shuffles llvm-svn: 310632
*	Add "Restored" flag to CalleeSavedInfo	Krzysztof Parzyszek	2017-08-10	23	-29/+39
\| \| \| \| \| \| \| \| \| \| \|	The liveness-tracking code assumes that the registers that were saved in the function's prolog are live outside of the function. Specifically, that registers that were saved are also live-on-exit from the function. This isn't always the case as illustrated by the LR register on ARM. Differential Revision: https://reviews.llvm.org/D36160 llvm-svn: 310619
*	[DAG] Rewrite expression. NFC.	Nirav Dave	2017-08-10	1	-2/+2
\| \| \| \|	llvm-svn: 310608
*	[X86] Keep dependencies when constructing loads in combineStore	Nirav Dave	2017-08-10	2	-9/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Preserve chain dependecies between old and new loads constructed to prevent loads from reordering below later stores. Fixes PR34088. Reviewers: craig.topper, spatel, RKSimon, efriedma Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D36528 llvm-svn: 310604
*	[Hexagon] Use isMetaInstruction instead of isDebugValue	Krzysztof Parzyszek	2017-08-10	3	-3/+4
\| \| \| \|	llvm-svn: 310601
*	[sanitizer-coverage] Change cmp instrumentation to distinguish const operands	Alexander Potapenko	2017-08-10	1	-4/+40
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This implementation of SanitizerCoverage instrumentation inserts different callbacks depending on constantness of operands: 1. If both operands are non-const, then a usual __sanitizer_cov_trace_cmp[1248] call is inserted. 2. If exactly one operand is const, then a __sanitizer_cov_trace_const_cmp[1248] call is inserted. The first argument of the call is always the constant one. 3. If both operands are const, then no callback is inserted. This separation comes useful in fuzzing when tasks like "find one operand of the comparison in input arguments and replace it with the other one" have to be done. The new instrumentation allows us to not waste time on searching the constant operands in the input. Patch by Victor Chibotaru. llvm-svn: 310600
*	[NewGVN] Add CL option to control the generation of phi-of-ops (disable by ↵	Chad Rosier	2017-08-10	1	-0/+6
\| \| \| \| \| \| \| \|	default). Differential Revision: https://reviews.llvm.org/D36478539 llvm-svn: 310594
*	[SelectionDAG] Allow constant folding for implicitly truncating BUILD_VECTOR ↵	Guy Blank	2017-08-10	1	-2/+16
\| \| \| \| \| \| \| \| \| \| \| \| \|	nodes. In FoldConstantArithmetic, handle BUILD_VECTOR nodes that do implicit truncation on the elements. This is similar to what is done in FoldConstantVectorArithmetic. Differential Revision: https://reviews.llvm.org/D36506 llvm-svn: 310593
*	[libFuzzer] Update LibFuzzer w.r.t. the new comparisons instrumentation API	Alexander Potapenko	2017-08-10	1	-0/+35
\| \| \| \| \| \| \| \| \| \|	Added the _sanitizer_cov_trace_const_cmp[1248] callbacks. For now they are implemented the same way as _sanitizer_cov_trace_cmp[1248]. For more details, please see https://reviews.llvm.org/D36465. Patch by Victor Chibotaru. llvm-svn: 310592
*	[ValueTracking] Enabling ValueTracking patch by default (recommit). Part 2.	Nikolai Bozhenov	2017-08-10	1	-9/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The original patch was an improvement to IR ValueTracking on non-negative integers. It has been checked in to trunk (D18777, r284022). But was disabled by default due to performance regressions. Perf impact has improved. The patch would be enabled by default. Reviewers: reames, hfinkel Differential Revision: https://reviews.llvm.org/D34101 Patch by: Olga Chupina <olga.chupina@intel.com> llvm-svn: 310583
*	[mips][microMIPS] Extending size reduction pass with XOR16	Zoran Jovanovic	2017-08-10	1	-5/+43
\| \| \| \| \| \| \| \| \| \|	Author: milena.vujosevic.janicic Reviewers: sdardis The patch extends size reduction pass for MicroMIPS. XOR instruction is transformed into 16-bit instruction XOR16, if possible. Differential Revision: https://reviews.llvm.org/D34239 llvm-svn: 310579
*	[AArch64] Assembler support for v8.3 RCpc	Sam Parker	2017-08-10	4	-1/+28
\| \| \| \| \| \| \| \| \| \|	Added assembler and disassembler support for the new Release Consistent processor consistent instructions, introduced with ARM v8.3-A for AArch64. Differential Revision: https://reviews.llvm.org/D36522 llvm-svn: 310575
*	[ARM][AArch64] ARMv8.3-A enablement	Sam Parker	2017-08-10	8	-0/+35
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The beta ARMv8.3 ISA specifications have been released for AArch64 and AArch32, these can be found at: https://developer.arm.com/products/architecture/a-profile/exploration-tools An introduction to this architecture update can be found at: https://community.arm.com/processors/b/blog/posts/armv8-a-architecture-2016-additions This patch is the first in a series which will add ARM v8.3-A support in LLVM and Clang. It adds the necessary changes that create targets for both the ARM and AArch64 backends. Differential Revision: https://reviews.llvm.org/D36514 llvm-svn: 310561
*	[SelectionDAG] When scalarizing vselect, don't assert on	Elad Cohen	2017-08-10	2	-2/+31
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	a legal cond operand. When scalarizing the result of a vselect, the legalizer currently expects to already have scalarized the operands. While this is true for the true/false operands (which have the same type as the result), it is not case for the condition operand. On X86 AVX512, v1i1 is legal - this leads to operations such as '< N x type> vselect < N x i1> < N x type> < N x type>' where < N x type > is illegal to hit an assertion during the scalarization. The handling is similar to r205625. This also exposes the fact that (v1i1 extract_subvector) should be legal and selectable on AVX512 - We do this by custom lowering to vector_extract_elt. This still leaves us in some cases with redundant dag nodes which will be combined in a separate soon to come patch. This fixes pr33349. Differential revision: https://reviews.llvm.org/D36511 llvm-svn: 310552
*	Revert part of r310296 to make it really NFC for instrumentation PGO.	Dehao Chen	2017-08-10	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Part of r310296 will disable PGOIndirectCallPromotion in ThinLTO backend if PGOOpt is None. However, as PGOOpt is not passed down to ThinLTO backend for instrumentation based PGO, that change would actually disable ICP entirely in ThinLTO backend, making it behave differently in instrumentation PGO mode. This change reverts that change, and only disable ICP there when it is SamplePGO. Reviewers: davidxl Reviewed By: davidxl Subscribers: sanjoy, mehdi_amini, eraman, llvm-commits Differential Revision: https://reviews.llvm.org/D36566 llvm-svn: 310550
*	[LCG] Fix an assert in a on-scope-exit lambda that checked the contents	Chandler Carruth	2017-08-10	1	-7/+9
\| \| \| \| \| \| \| \| \| \|	of the returned value. Checking the returned value from inside of a scoped exit isn't actually valid. It happens to work when NRVO fires and the stars align, which they reliably do with Clang but don't, for example, on MSVC builds. llvm-svn: 310547
*	[LVI] Fix LVI compile time regression around constantFoldUser()	Hiroshi Yamauchi	2017-08-10	1	-14/+28
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Avoid checking each operand and calling getValueFromCondition() before calling constantFoldUser() when the instruction type isn't supported by constantFoldUser(). This fixes a large compile time regression in an internal build. Reviewers: sanjoy Reviewed By: sanjoy Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D36552 llvm-svn: 310545
*	Linker: Create a function declaration when moving a non-prevailing alias of ↵	Peter Collingbourne	2017-08-10	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	function type. We were previously creating a global variable of function type, which is invalid IR. This issue was exposed by r304690, in which we started asserting that global variables were of a valid type. Fixes PR33462. Differential Revision: https://reviews.llvm.org/D36438 llvm-svn: 310543
*	[InstSimplify] Add test cases that show that simplifySelectWithICmpCond ↵	Craig Topper	2017-08-10	1	-0/+1
\| \| \| \| \| \|	doesn't work with non-canonical comparisons. llvm-svn: 310542
*	[AMDGPU] Fix some Clang-tidy modernize-use-using and Include What You Use ↵	Eugene Zelenko	2017-08-10	14	-142/+227
\| \| \| \| \| \|	warnings; other minor fixes (NFC). llvm-svn: 310541
*	Fix thinlto cache key computation for cfi-icall.	Evgeniy Stepanov	2017-08-09	1	-18/+55
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Fixed PR33966. CFI code generation for users (not just callers) of a function depends on whether this function has a jumptable entry or not. This information needs to be encoded in of thinlto cache key. We filter the jumptable list against functions that are actually referenced in the current module. Subscribers: mehdi_amini, inglorion, eraman, hiraditya Differential Revision: https://reviews.llvm.org/D36346 llvm-svn: 310536
*	ARM: Fix CMP_SWAP expansion	Matthias Braun	2017-08-09	2	-32/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Clean up after my misguided attempt in r304267 to "fix" CMP_SWAP returning an uninitialized status value. - I was always using tMOVi8 to zero the status register which cannot encode higher register numbers and llvm would silently miscompile) - Nobody was ever looking at that status value outside the expansion. ARMDAGToDAGISel::SelectCMP_SWAP() the only place creating CMP_SWAP instructions was not mapping anything to it. (The cmpxchg status value from llvm IR is lowered to a manual comparison after the CMP_SWAP) So this: - Renames the register from "status" to "temp" it make it obvious that it isn't used outside the expansion. - Remove the zeroing status/temp register. - Keep the live-in list improvements from r304267 Fixes http://llvm.org/PR34056 llvm-svn: 310534