bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[IR] Make Instruction::isAssociative method inline. Add LLVM_READONLY to the ↵	Craig Topper	2017-03-26	1	-11/+0
\| \| \| \| \| \|	static version. llvm-svn: 298826
*	[Target] Remove some code probably copy/pasted from another backend.	Davide Italiano	2017-03-26	1	-4/+0
\| \| \| \|	llvm-svn: 298825
*	[MachineScheduler] Reference the correct header.	Davide Italiano	2017-03-26	1	-1/+1
\| \| \| \|	llvm-svn: 298823
*	[LoopUnroll] Remap references in peeled iteration	Serge Pavlov	2017-03-26	1	-4/+5
\| \| \| \| \| \| \| \| \|	References in cloned blocks must be remapped prior to dominator calculation. Differential Revision: https://reviews.llvm.org/D31281 llvm-svn: 298811
*	[X86][SSE] Add computeKnownBitsForTargetNode support for (V)PSLL/(V)PSRL ↵	Simon Pilgrim	2017-03-26	1	-1/+26
\| \| \| \| \| \|	instructions llvm-svn: 298806
*	[X86][AVX512F] Fix reg class for VMOVSSZrr/VMOVSSZrrk and VMOVSDZrr/VMOVSDZrrk	Simon Pilgrim	2017-03-26	1	-11/+10
\| \| \| \| \| \| \| \| \| \|	Fixed -verify-machineinstrs errors in fast-isel-select-sse.ll (one of many in PR27481) The VMOVSSZrr/VMOVSSZrrk and VMOVSDZrr/VMOVSDZrrk instructions were assuming both source registers were V128X when the second is actually supposed to be FR32X/FR64X Differential Revision: https://reviews.llvm.org/D31200 llvm-svn: 298805
*	[GlobalISel][X86] support G_FRAME_INDEX instruction selection.	Igor Breger	2017-03-26	5	-5/+44
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Support G_FRAME_INDEX instruction selection. Reviewers: zvi, rovka, ab, qcolombet Reviewed By: ab Subscribers: llvm-commits, dberris, kristof.beyls, eladcohen, guyblank Differential Revision: https://reviews.llvm.org/D30980 llvm-svn: 298800
*	Split the SimplifyCFG pass into two variants.	Joerg Sonnenberger	2017-03-26	5	-23/+80
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The first variant contains all current transformations except transforming switches into lookup tables. The second variant contains all current transformations. The switch-to-lookup-table conversion results in code that is more difficult to analyze and optimize by other passes. Most importantly, it can inhibit Dead Code Elimination. As such it is often beneficial to only apply this transformation very late. A common example is inlining, which can often result in range restrictions for the switch expression. Changes in execution time according to LNT: SingleSource/Benchmarks/Misc/fp-convert +3.03% MultiSource/Benchmarks/ASC_Sequoia/CrystalMk/CrystalMk -11.20% MultiSource/Benchmarks/Olden/perimeter/perimeter -10.43% and a couple of smaller changes. For perimeter it also results 2.6% a smaller binary. Differential Revision: https://reviews.llvm.org/D30333 llvm-svn: 298799
*	[IR] Make SwitchInst::CaseIt almost a normal iterator.	Chandler Carruth	2017-03-26	4	-17/+27
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This moves it to the iterator facade utilities giving it full random access semantics, etc. It can also now be used with standard algorithms like std::all_of and std::any_of and range adaptors like llvm::reverse. Also make the semantics of iterating match what every other iterator uses and forbid decrementing past the begin iterator. This was used as a hacky way to work around iterator invalidation. However, every instance trying to do this failed to actually avoid touching invalid iterators despite the clear documentation that the removed and all subsequent iterators become invalid including the end iterator. So I've added a return of the next iterator to removeCase and rewritten the loops that were doing this to correctly follow the iterator pattern of either incremneting or removing and assigning fresh values to the iterator and the end. In one case we were trying to go backwards to make this cleaner but it doesn't actually work. I've made that code match the code we use everywhere else to remove cases as we iterate. This changes the order of cases in one test output and I moved that test to CHECK-DAG so it wouldn't care -- the order isn't semantically meaningful anyways. llvm-svn: 298791
*	[X86] Pull out repeated ScalarValueSizeInBits code. NFCI.	Simon Pilgrim	2017-03-25	1	-6/+4
\| \| \| \|	llvm-svn: 298783
*	[X86][SSE] Combine (VSRLI (VSRAI X, Y), (NumSignBits-1)) -> (VSRLI X, ↵	Simon Pilgrim	2017-03-25	1	-1/+9
\| \| \| \| \| \| \| \| \| \|	(NumSignBits-1)) Part 3 of 3. Differential Revision: https://reviews.llvm.org/D31347 llvm-svn: 298782
*	[X86][SSE] Added ComputeNumSignBitsForTargetNode support for (V)PSRAI	Simon Pilgrim	2017-03-25	1	-0/+9
\| \| \| \| \| \| \| \|	Part 2 of 3. Differential Revision: https://reviews.llvm.org/D31347 llvm-svn: 298780
*	[X86][SSE] Generalised CMP+AND1 combine to ZERO/ALLBITS+MASK	Simon Pilgrim	2017-03-25	1	-26/+22
\| \| \| \| \| \| \| \| \| \| \| \|	Patch to generalize combinePCMPAnd1 (for handling SETCC + ZEXT cases) to work for any input that has zero/all bits set masked with an 'all low bits' mask. Replaced the implicit assumption of shift availability with a call to SupportedVectorShiftWithImm. Part 1 of 3. Differential Revision: https://reviews.llvm.org/D31347 llvm-svn: 298779
*	[x86] use PMOVMSK to replace memcmp libcalls for 16-byte equality	Sanjay Patel	2017-03-25	3	-33/+63
\| \| \| \| \| \| \| \| \|	This is the payoff for D31156 - if a target has efficient comparison instructions for vector-sized equality, we can replace memcmp calls with inline code that is both smaller and faster. Differential Revision: https://reviews.llvm.org/D31290 llvm-svn: 298775
*	[InstCombine] Change the interface of SimplifyDemandedBits so that it takes ↵	Craig Topper	2017-03-25	3	-46/+46
\| \| \| \| \| \| \| \|	the instruction and operand instead of the Use. The first thing it did was get the User for the Use to get the instruction back. This requires looking through the Uses for the User using the waymarking walk. That's pretty fast, but its probably still better to just pass the Instruction we already had. llvm-svn: 298772
*	[AArch64] Refine Falkor Machine Model - Part1	Balaram Makam	2017-03-25	3	-88/+422
\| \| \| \|	llvm-svn: 298768
*	[NewGVN] Adjust NDEBUG markers.	Davide Italiano	2017-03-25	1	-2/+2
\| \| \| \| \| \| \|	This avoids 'used but not defined' warnings in Release builds with GCC. llvm-svn: 298760
*	[AMDGPU] Switch data layout by triple environment amdgiz	Yaxun Liu	2017-03-25	1	-1/+6
\| \| \| \| \| \| \| \| \| \| \| \|	Switch data layout by target triple environment amdgiz and amdgizcl indicating using of an address space mapping in which generic address space is 0. amdgiz is for non-OpenCL environment where generic address space is 0. amdgizcl is for OpenCL environment where generic address space is 0. Differential Revision: https://reviews.llvm.org/D31211 llvm-svn: 298758
*	[libFuzzer] Fix test on Windows.	Vitaly Buka	2017-03-25	1	-1/+1
\| \| \| \|	llvm-svn: 298757
*	[asan] Put ctor/dtor in comdat.	Evgeniy Stepanov	2017-03-25	1	-9/+41
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When possible, put ASan ctor/dtor in comdat. The only reason not to is global registration, which can be TU-specific. This is not the case when there are no instrumented globals. This is also limited to ELF targets, because MachO does not have comdat, and COFF linkers may GC comdat constructors. The benefit of this is a lot less __asan_init() calls: one per DSO instead of one per TU. It's also necessary for the upcoming gc-sections-for-globals change on Linux, where multiple references to section start symbols trigger quadratic behaviour in gold linker. llvm-svn: 298756
*	[libFuzzer] read asan's dedup_token while minimizing a crash and stop ↵	Kostya Serebryany	2017-03-25	2	-4/+53
\| \| \| \| \| \|	minimization if another bug was found during minimization (https://github.com/google/oss-fuzz/issues/452) llvm-svn: 298755
*	[ARM] Fix mixup between Lo and Hi in SMLALBB formation.	Eli Friedman	2017-03-25	1	-4/+4
\| \| \| \|	llvm-svn: 298752
*	Apply clang-format as commented in D31311. NFCI.	Simon Pilgrim	2017-03-24	1	-1/+2
\| \| \| \|	llvm-svn: 298751
*	[codeview] Don't assert when the user violates the ODR	Reid Kleckner	2017-03-24	1	-28/+2
\| \| \| \| \| \| \| \| \| \|	If we have an array of a user-defined aggregates for which there was an ODR violation, then the array size will not necessarily match the number of elements times the size of the element. Fixes PR32383 llvm-svn: 298750
*	[Outliner] Revert r298734.	Jessica Paquette	2017-03-24	1	-1/+1
\| \| \| \| \| \| \|	When I tested r298734, I thought that red zones were enabled by default like in X86. Since red zones are behind a flag on AArch64 the testing wasn't true. llvm-svn: 298747
*	[libFuzzer] be more careful when calling strlen of strcmp parameters, PR32357	Kostya Serebryany	2017-03-24	4	-3/+30
\| \| \| \|	llvm-svn: 298746
*	Revert r298711 "[InstCombine] Provide a way to calculate KnownZero/One for ↵	Craig Topper	2017-03-24	2	-40/+25
\| \| \| \| \| \| \| \|	Add/Sub in SimplifyDemandedUseBits without recursing into ComputeKnownBits" Tsan bot is failing. llvm-svn: 298745
*	[libFuzzer] honor -exact_artifact_path for all intermediate files during ↵	Kostya Serebryany	2017-03-24	2	-3/+6
\| \| \| \| \| \|	crash minimization (https://github.com/google/oss-fuzz/issues/250) llvm-svn: 298740
*	AMDGPU: Fix annotating loops with nested loop conditions	Matt Arsenault	2017-03-24	1	-9/+21
\| \| \| \| \| \| \| \|	If the branch condition for a loop was a phi which itself was fed from a phi from a loop, it isn't safe to try to delete the phi until after the loop is handled. llvm-svn: 298737
*	[MachineScheduler] Add missing machine pass dependency.	Davide Italiano	2017-03-24	1	-0/+1
\| \| \| \|	llvm-svn: 298736
*	Revert r298620: [LV] Vectorize GEPs	Ivan Krasin	2017-03-24	1	-117/+67
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Reason: breaks linking Chromium with LLD + ThinLTO (a pass crashes) LLVM bug: https://bugs.llvm.org//show_bug.cgi?id=32413 Original change description: [LV] Vectorize GEPs This patch adds support for vectorizing GEPs. Previously, we only generated vector GEPs on-demand when creating gather or scatter operations. All GEPs from the original loop were scalarized by default, and if a pointer was to be stored to memory, we would have to build up the pointer vector with insertelement instructions. With this patch, we will vectorize all GEPs that haven't already been marked for scalarization. The patch refines collectLoopScalars to more exactly identify the scalar GEPs. The function now more closely resembles collectLoopUniforms. And the patch moves vector GEP creation out of vectorizeMemoryInstruction and into the main vectorization loop. The vector GEPs needed for gather and scatter operations will have already been generated before vectoring the memory accesses. Original Differential Revision: https://reviews.llvm.org/D30710 llvm-svn: 298735
*	[Outliner] Remove no red zone requirment for AArch64	Jessica Paquette	2017-03-24	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \|	AArch64 doesn't require -mno-red-zone; stack fixups are sufficient here. This was unnecessarily copied over from the X86 target. (You can now outline with red zones! Yay!) Removing the requirement passes all Single/MultiSource tests. llvm-svn: 298734
*	[asan] Delay creation of asan ctor.	Evgeniy Stepanov	2017-03-24	2	-21/+26
\| \| \| \| \| \| \| \| \|	Create the constructor in the module pass. This in needed for the GC-friendly globals change, where the constructor can be put in a comdat in some cases, but we don't know about that in the function pass. llvm-svn: 298731
*	AMDGPU: Implement f16 fround	Matt Arsenault	2017-03-24	3	-14/+20
\| \| \| \|	llvm-svn: 298730
*	AMDGPU: Unify divergent function exits.	Matt Arsenault	2017-03-24	7	-15/+254
\| \| \| \| \| \| \| \| \| \|	StructurizeCFG can't handle cases with multiple returns creating regions with multiple exits. Create a copy of UnifyFunctionExitNodes that only unifies exit nodes that skips exit nodes with uniform branch sources. llvm-svn: 298729
*	AMDGPU: Fold rcp/rsq of undef to undef	Matt Arsenault	2017-03-24	1	-2/+15
\| \| \| \|	llvm-svn: 298725
*	TTI: Split IsSimple in MemIntrinsicInfo	Matt Arsenault	2017-03-24	2	-17/+9
\| \| \| \| \| \|	All this did before was assert in EarlyCSE. llvm-svn: 298724
*	[AMDGPU] Fold V_CNDMASK with identical source operands	Stanislav Mekhanoshin	2017-03-24	1	-0/+29
\| \| \| \| \| \| \| \|	Such instructions sometimes appear after lowering and folding. Differential Revision: https://reviews.llvm.org/D31318 llvm-svn: 298723
*	[AMDGPU] Rename Kind to ValueKind in metadata to be consistent	Konstantin Zhuravlyov	2017-03-24	2	-2/+2
\| \| \| \|	llvm-svn: 298722
*	[AMDGPU] Add AMDGPUAliasAnalysis to opt pipeline	Stanislav Mekhanoshin	2017-03-24	1	-1/+24
\| \| \| \| \| \| \| \|	Previously it was added only to the BE. Differential Revision: https://reviews.llvm.org/D31323 llvm-svn: 298721
*	[ThinLTO] Correct counting of functions in inliner stats	Teresa Johnson	2017-03-24	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: Declarations need to be filtered out when counting functions. Reviewers: eraman Subscribers: Prazek, llvm-commits Differential Revision: https://reviews.llvm.org/D31336 llvm-svn: 298720
*	[AMDGPU] Don't enforce constexpr, there are still old standard libraries ↵	Benjamin Kramer	2017-03-24	1	-4/+4
\| \| \| \| \| \|	around that don't have a constexpr std::pair. llvm-svn: 298719
*	[AMDGPU] Remove double map lookups in SI scheduler	Valery Pykhtin	2017-03-24	1	-25/+8
\| \| \| \| \| \| \| \|	Patch by Axel Davy (axel.davy@normalesup.org) Differential revision: https://reviews.llvm.org/D30382 llvm-svn: 298718
*	[PDB] Split item and type records when merging type streams	Reid Kleckner	2017-03-24	1	-53/+87
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: MSVC does this when producing a PDB. Reviewers: ruiu Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31316 llvm-svn: 298717
*	[InstCombine] Provide a way to calculate KnownZero/One for Add/Sub in ↵	Craig Topper	2017-03-24	2	-25/+40
\| \| \| \| \| \| \| \|	SimplifyDemandedUseBits without recursing into ComputeKnownBits SimplifyDemandedUseBits for Add/Sub already recursed down LHS and RHS for simplifying bits. If that didn't provide any simplifications we fall back to calling computeKnownBits which will recurse again. Instead just take the known bits for LHS and RHS we already have and call into a new function in ValueTracking that can calculate the known bits given the LHS/RHS bits. llvm-svn: 298711
*	[AMDGPU] Fix SGPR usage count in SI scheduler	Valery Pykhtin	2017-03-24	1	-2/+2
\| \| \| \| \| \| \| \|	Patch by Axel Davy (axel.davy@normalesup.org) Differential revision: https://reviews.llvm.org/D30149 llvm-svn: 298710
*	[AMDGPU] Add a new line after a debug message	Valery Pykhtin	2017-03-24	1	-0/+1
\| \| \| \| \| \| \| \|	Patch by Axel Davy (axel.davy@normalesup.org) Differential revision: https://reviews.llvm.org/D30146 llvm-svn: 298708
*	[X86][SSE] Generalised lowerTruncate by PACKSS to work with any 'zero/all ↵	Simon Pilgrim	2017-03-24	1	-17/+19
\| \| \| \| \| \| \| \| \| \|	bits' result, not just comparisons. Added vector compare opcodes to X86TargetLowering::ComputeNumSignBitsForTargetNode Covered by existing tests added for D22814. llvm-svn: 298704
*	Make GCC happy again.	Benjamin Kramer	2017-03-24	1	-2/+1
\| \| \| \|	llvm-svn: 298702
*	Don't build up std::vectors with constant sizes when an array suffices.	Benjamin Kramer	2017-03-24	1	-2/+6
\| \| \| \| \| \|	NFC. llvm-svn: 298701