bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[SampleFDO] Fix a bug in getOrCompHotCountThreshold/getOrCompColdCountThreshold	Wei Mi	2018-08-07	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \|	getOrCompHotCountThreshold/getOrCompColdCountThreshold introduced in https://reviews.llvm.org/D45377 contain a bad mistake and will only return 1 or 0 instead of the true hot/cold cutoff value. The patch fixes the mistake. But the mistake seems not causing big performance difference according to internal server benchmarks testing. Differential Revision: https://reviews.llvm.org/D50370 llvm-svn: 339162
*	[SelectionDAG] When splitting scatter nodes during DAGCombine, create a ↵	Craig Topper	2018-08-07	1	-12/+10
\| \| \| \| \| \| \| \| \| \|	serial chain dependency. Scatter could have multiple identical indices. We need to maintain sequential order. We get this right in LegalizeVectorTypes, but not in this code. Differential Revision: https://reviews.llvm.org/D50374 llvm-svn: 339157
*	[SelectionDAG][X86][SystemZ] Add a generic ↵	Craig Topper	2018-08-07	2	-8/+0
\| \| \| \| \| \| \| \|	nonvolatile_store/nonvolatile_load pattern fragment in TargetSelectionDAG.td Differential Revision: https://reviews.llvm.org/D50358 llvm-svn: 339156
*	[GVN,NewGVN] Keep nonnull if K does not move.	Florian Hahn	2018-08-07	1	-5/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In combineMetadata, we should be able to preserve K's nonnull metadata, if K does not move. This condition should hold for all replacements by NewGVN/GVN, but I added a bunch of assertions to verify that. Fixes PR35038. There probably are additional kinds of metadata that could be preserved using similar reasoning. This is follow-up work. Reviewers: dberlin, davide, efriedma, nlopes Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D47339 llvm-svn: 339149
*	[ARM] FP16: codegen support for VACGT	Sjoerd Meijer	2018-08-07	1	-1/+1
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D50236 llvm-svn: 339148
*	[DAG] Allow non-uniform constant vectors to call BuildSDIV	Simon Pilgrim	2018-08-07	1	-1/+2
\| \| \| \| \| \| \| \|	This was missed in D50185. NFC until we add actual non-uniform support to BuildSDIV (similar BuildUDIV support in D49248) - for now it just early outs. llvm-svn: 339147
*	[TargetLowering] Use pre-computed Shift value type in BuildUDIV (NFCI)	Simon Pilgrim	2018-08-07	1	-9/+5
\| \| \| \| \| \|	This was missed in D49248 llvm-svn: 339146
*	[InstSimplify] move minnum/maxnum with common op fold from instcombine	Sanjay Patel	2018-08-07	2	-30/+11
\| \| \| \|	llvm-svn: 339144
*	[SystemZ] Comment update.	Jonas Paulsson	2018-08-07	1	-1/+1
\| \| \| \| \| \| \|	Update the comment in nextGroup since the ProcResourceCounters are not anymore always decremented with '1'. llvm-svn: 339140
*	[SystemZ] NFC: Remove redundant check in SystemZHazardRecognizer.	Jonas Paulsson	2018-08-07	1	-4/+3
\| \| \| \| \| \| \| \|	Remove the redundant check against zero when updating ProcResourceCounters in nextGroup(), as pointed out in https://reviews.llvm.org/D50187. Review: Ulrich Weigand. llvm-svn: 339139
*	[GVN,NewGVN] Move patchReplacementInstruction to Utils/Local.h	Florian Hahn	2018-08-07	3	-62/+31
\| \| \| \| \| \| \| \| \| \| \| \| \|	This function is shared between both implementations. I am not sure if Utils/Local.h is the best place though. Reviewers: davide, dberlin, efriedma, xbolva00 Reviewed By: efriedma, xbolva00 Differential Revision: https://reviews.llvm.org/D47337 llvm-svn: 339138
*	Fix inconsistency with/without debug information (-g)	Jonas Devlieghere	2018-08-07	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \|	This fixes an inconsistency in code generation when compiling with or without debug information (-g). When debug information is available in an empty block, the original test would fail, resulting in possibly different code. Patch by: Jeroen Dobbelaere Differential revision: https://reviews.llvm.org/D49467 llvm-svn: 339129
*	[mips] Handle branch expansion corner cases	Aleksandar Beserminji	2018-08-07	2	-93/+160
\| \| \| \| \| \| \| \| \| \| \| \|	When potential jump instruction and target are in the same segment, use jump instruction with immediate field. In cases where offset does not fit immediate value of a bc/j instructions, offset is stored into register, and then jump register instruction is used. Differential Revision: https://reviews.llvm.org/D48019 llvm-svn: 339126
*	[DebugInfo] Reduce debug_str_offsets section size	Pavel Labath	2018-08-07	4	-14/+56
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The accelerator tables use the debug_str section to store their strings. However, they do not support the indirect method of access that is available for the debug_info section (DW_FORM_strx et al.). Currently our code is assuming that all strings can/will be referenced indirectly, and puts all of them into the debug_str_offsets section. This is generally true for regular (unsplit) dwarf, but in the DWO case, most of the strings in the debug_str section will only be used from the accelerator tables. Therefore the contents of the debug_str_offsets section will be largely unused and bloating the main executable. This patch rectifies this by teaching the DwarfStringPool to differentiate between strings accessed directly and indirectly. When a user inserts a string into the pool it has to declare whether that string will be referenced directly or not. If at least one user requsts indirect access, that string will be assigned an index ID and put into debug_str_offsets table. Otherwise, the offset table is skipped. This approach reduces the overall binary size (when compiled with -gdwarf-5 -gsplit-dwarf) in my tests by about 2% (debug_str_offsets is shrunk by 99%). Reviewers: probinson, dblaikie, JDevlieghere Subscribers: aprantl, mgrang, llvm-commits Differential Revision: https://reviews.llvm.org/D49493 llvm-svn: 339122
*	[TargetLowering] Add support for non-uniform vectors to BuildUDIV	Simon Pilgrim	2018-08-07	2	-61/+136
\| \| \| \| \| \| \| \| \| \|	This patch refactors the existing TargetLowering::BuildUDIV base implementation to support non-uniform constant vector denominators. It also includes a fold for MULHU by pow2 constants to SRL which can now more readily occur from BuildUDIV. Differential Revision: https://reviews.llvm.org/D49248 llvm-svn: 339121
*	[yaml2obj] - Add a support for changing EntSize.	George Rimar	2018-08-07	1	-0/+1
\| \| \| \| \| \| \| \| \| \|	I was trying to add a test case for LLD and found that it is impossible to set sh_entsize via yaml. The patch implements the missing part. Differential revision: https://reviews.llvm.org/D50235 llvm-svn: 339113
*	AMDGPU: Add feature vi-insts	Matt Arsenault	2018-08-07	3	-2/+10
\| \| \| \| \| \| \| \| \| \| \| \| \|	This is necessary to add a VI specific builtin, __builtin_amdgcn_s_dcache_wb. We already have an overly specific feature for one of these builtins, for s_memrealtime. I'm not sure whether it's better to add more of those, or to get rid of that and merge it with vi-insts. Alternatively, maybe this logically goes with scalar-stores? llvm-svn: 339104
*	[SelectionDAG][X86] Rename MaskedLoadSDNode::getSrc0 to getPassThru.	Craig Topper	2018-08-07	4	-32/+33
\| \| \| \| \| \|	Src0 doesn't really convey any meaning to what the operand is. Passthru matches what's used in the documentation for the intrinsic this comes from. llvm-svn: 339101
*	[SelectionDAG][X86] Rename getValue to getPassThru for gather SDNodes.	Craig Topper	2018-08-07	6	-41/+45
\| \| \| \| \| \|	getValue is more meaningful name for scatter than it is for gather. Split them and use getPassThru for gather. llvm-svn: 339096
*	[XRay] Improve error reporting when loading traces	Dean Michael Berris	2018-08-07	1	-152/+364
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This change uses a single offset pointer used throughout the implementation of the individual record parsers. This allows us to report where in a trace file parsing failed. We're still in an intermediate step here as we prepare to refactor this further into a set of types and use object-oriented design principles for a cleaner implementation. The next steps will be to allow us to parse/dump files in a streaming fashion and incrementally build up the structures in memory instead of the current all-or-nothing approach. Reviewers: kpw, eizan Reviewed By: kpw Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D50169 llvm-svn: 339092
*	[NFC] Factor out implicit control flow logic from GVN	Max Kazantsev	2018-08-07	3	-75/+103
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Logic for tracking implicit control flow instructions was added to GVN to perform PRE optimizations correctly. It appears that GVN is not the only optimization that sometimes does PRE, so this logic is required in other places (such as Jump Threading). This is an NFC patch that encapsulates all ICF-related logic in a dedicated utility class separated from GVN. Differential Revision: https://reviews.llvm.org/D40293 llvm-svn: 339086
*	[WebAssembly] Enable atomic expansion for unsupported atomicrmws	Heejin Ahn	2018-08-07	3	-4/+23
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Wasm does not have direct counterparts to some of LLVM IR's atomicrmw instructions (min, max, umin, umax, and nand). This enables atomic expansion using cmpxchg instruction within a loop for those atomicrmw instructions. Reviewers: dschuff Subscribers: sbc100, jgravelle-google, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D49440 llvm-svn: 339084
*	[WebAssembly] Replace SIMD expression types with V128	Derek Schuff	2018-08-06	3	-23/+13
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: The spec only defines a SIMD expression type of V128 and leaves interpretation of different vector types to the instructions. Differential Revision: https://reviews.llvm.org/D50367 Patch by Thomas Lively llvm-svn: 339082
*	AMDGPU: cvt_pk_rtz_f16 canonicalizes	Matt Arsenault	2018-08-06	2	-1/+15
\| \| \| \|	llvm-svn: 339078
*	AMDGPU: Handle some vector operations in isCanonicalized	Matt Arsenault	2018-08-06	1	-0/+20
\| \| \| \|	llvm-svn: 339077
*	AMDGPU: Push fcanonicalize through partially constant build_vector	Matt Arsenault	2018-08-06	1	-1/+37
\| \| \| \| \| \| \|	This usually avoids some re-packing code, and may help find canonical sources. llvm-svn: 339072
*	AMDGPU: Refactor fcanonicalize combine	Matt Arsenault	2018-08-06	2	-36/+32
\| \| \| \| \| \|	This will make more complex combines easier. llvm-svn: 339070
*	[LICM] Extract a helper function for readability [NFC]	Philip Reames	2018-08-06	1	-8/+12
\| \| \| \|	llvm-svn: 339069
*	MC: Redirect .addrsig directives referring to private (.L) symbols to the ↵	Peter Collingbourne	2018-08-06	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \|	section symbol. This matches our behaviour for regular (i.e. relocated) references to private symbols and therefore avoids needing to unnecessarily write address-significant .L symbols to the object file's symbol table, which can interfere with stack traces. Fixes check-cfi after r339050. llvm-svn: 339066
*	AMDGPU: Treat more custom operations as canonicalizing	Matt Arsenault	2018-08-06	2	-2/+21
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Everything should quiet, and I think everything should flush. I assume the min3/med3/max3 follow the same rules as regular min/max for flushing, which should at least be conservatively correct. There are still more operations that need to be handled. llvm-svn: 339065
*	AMDGPU: Conversions always produce canonical results	Matt Arsenault	2018-08-06	1	-7/+2
\| \| \| \| \| \| \| \| \|	Not sure why this was checking for denormals for f16. My interpretation of the IEEE standard is conversions should produce a canonical result, and the ISA manual says denormals are created when appropriate. llvm-svn: 339064
*	AMDGPU: Fix implementation of isCanonicalized	Matt Arsenault	2018-08-06	2	-46/+77
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If denormals are enabled, denormals are canonical. Also fix a few other issues. minnum/maxnum are supposed to canonicalize. Temporarily improve workaround for the instruction behavior change in gfx9. Handle selects and fcopysign. The tests were also largely broken, since they were checking for a flush used on some targets after the store of the result. llvm-svn: 339061
*	Fix a -Wsign-compare	Reid Kleckner	2018-08-06	1	-1/+1
\| \| \| \|	llvm-svn: 339059
*	[X86] Fix assertion in subreg extraction	Reid Kleckner	2018-08-06	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \|	This assert fires when attempting to extract a subregister from the global PIC base register. This virtual register SD node is not in the VRBaseMap, so we shouldn't call getVR to look it up there. If this is a RegisterSDNode, we should be able to use the virtual register directly. Fixes PR38385 llvm-svn: 339056
*	[SLC] Fix shrinking of pow()	Evandro Menezes	2018-08-06	1	-13/+17
\| \| \| \| \| \| \| \| \|	Properly shrink `pow()` to `powf()` as a binary function and, when no other simplification applies, do not discard it. Differential revision: https://reviews.llvm.org/D50113 llvm-svn: 339046
*	[llvm-pdbutil] Support PDBs without a DBI stream	Alexandre Ganea	2018-08-06	1	-1/+3
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D50258 llvm-svn: 339045
*	[X86] Recognize a splat of negate in isFNEG	Easwaran Raman	2018-08-06	1	-18/+77
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Expand isFNEG so that we generate the appropriate F(N)M(ADD\|SUB) instructions in more cases. For example, the following sequence a = _mm256_broadcast_ss(f) d = _mm256_fnmadd_ps(a, b, c) generates an fsub and fma without this patch and an fnma with this change. Reviewers: craig.topper Subscribers: llvm-commits, davidxl, wmi Differential Revision: https://reviews.llvm.org/D48467 llvm-svn: 339043
*	[X86] When using "and $0" and "orl $-1" to store 0 and -1 for minsize, make ↵	Craig Topper	2018-08-06	1	-6/+12
\| \| \| \| \| \| \| \| \| \|	sure the store isn't volatile If the store is volatile this might be a memory mapped IO access. In that case we shouldn't generate a load that didn't exist in the source Differential Revision: https://reviews.llvm.org/D50270 llvm-svn: 339041
*	[RegisterCoalescer] Delay live interval update work until the rematerialization	Wei Mi	2018-08-06	1	-6/+57
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	for all the uses from the same def is done. We run into a compile time problem with flex generated code combined with `-fno-jump-tables`. The cause is that machineLICM hoists a lot of invariants outside of a big loop, and drastically increases the compile time in global register splitting and copy coalescing. https://reviews.llvm.org/D49353 relieves the problem in global splitting. This patch is to handle the problem in copy coalescing. About the situation where the problem in copy coalescing happens. After machineLICM, we have several defs outside of a big loop with hundreds or thousands of uses inside the loop. Rematerialization in copy coalescing happens for each use and everytime rematerialization is done, shrinkToUses will be called to update the huge live interval. Because we have 'n' uses for a def, and each live interval update will have at least 'n' complexity, the total update work is n^2. To fix the problem, we try to do the live interval update work in a collective way. If a def has many copylike uses larger than a threshold, each time rematerialization is done for one of those uses, we won't do the live interval update in time but delay that work until rematerialization for all those uses are completed, so we only have to do the live interval update work once. Delaying the live interval update could potentially change the copy coalescing result, so we hope to limit that change to those defs with many (like above a hundred) copylike uses, and the cutoff can be adjusted by the option -mllvm -late-remat-update-threshold=xxx. Differential Revision: https://reviews.llvm.org/D49519 llvm-svn: 339035
*	Fix raw_fd_ostream::write_impl hang due to an infinite loop with large output	Owen Reynolds	2018-08-06	1	-4/+4
\| \| \| \| \| \| \| \| \| \|	On windows when raw_fd_ostream::write_impl calls write, a 32 bit input is required for character count. As a variable with size_t is used for this argument, on x64 integral demotion occurs. In the case of large files an infinite loop follows. See: https://bugs.llvm.org/show_bug.cgi?id=37926 This fix allows the output of files larger than the previous int32 limit. Differential Revision: https://reviews.llvm.org/D48948 llvm-svn: 339027
*	AMDGPU: Fold v_lshl_or_b32 with 0 src0	Matt Arsenault	2018-08-06	1	-0/+13
\| \| \| \| \| \|	Appears from expansion of some packed cases. llvm-svn: 339025
*	ValueTracking: Handle canonicalize in CannotBeNegativeZero	Matt Arsenault	2018-08-06	1	-0/+1
\| \| \| \| \| \| \|	Also fix apparently missing test coverage for any of the handling here. llvm-svn: 339023
*	[NFC] Fixed unused function warnings	David Bolvansky	2018-08-06	1	-0/+2
\| \| \| \|	llvm-svn: 339021
*	Revert unused function fix	David Bolvansky	2018-08-06	1	-1/+1
\| \| \| \|	llvm-svn: 339020
*	[NFC] Fixed unused function warning	David Bolvansky	2018-08-06	1	-1/+1
\| \| \| \|	llvm-svn: 339019
*	[AArch64] Fix assertion failure on widened f16 BUILD_VECTOR	Bryan Chan	2018-08-06	1	-0/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Ensure that NormalizedBuildVector returns a BUILD_VECTOR with operands of the same type. This fixes an assertion failure in VerifySDNode. Reviewers: SjoerdMeijer, t.p.northover, javed.absar Reviewed By: SjoerdMeijer Subscribers: kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D50202 llvm-svn: 339013
*	Fix modules build with different technique to suppress Knuth debugging	Tim Northover	2018-08-06	1	-37/+33
\| \| \| \| \| \| \| \| \| \|	Currently we use #pragma push_macro(LLVM_DEBUG) to fiddle with the LLVM_DEBUG macro so that we can silence debugging the Knuth division algorithm unless it's actually desired. Unfortunately this is incompatible with enabling modules while building LLVM (via LLVM_ENABLE_MODULES=ON), probably due to a bug being fixed by D33004. llvm-svn: 339009
*	ARM-MachO: don't add Thumb bit for addend to non-external relocation.	Tim Northover	2018-08-06	1	-0/+1
\| \| \| \| \| \| \| \| \|	ld64 supplies its own Thumb bit for Thumb functions, and intentionally zeroes out that part of any addend in an object file. But it only does that for symbols marked N_EXT -- i.e. external symbols. So LLVM should avoid setting that extra bit in other cases. llvm-svn: 339007
*	Re-enable "[ValueTracking] Teach isKnownNonNullFromDominatingCondition about ↵	Max Kazantsev	2018-08-06	1	-10/+33
\| \| \| \| \| \| \| \| \| \| \|	AND" The patch was reverted because of bug detected by sanitizer. The bug is fixed, respective tests added. Differential Revision: https://reviews.llvm.org/D50172 llvm-svn: 339005
*	Revert rL338990 to see if it causes sanitizer failures	Max Kazantsev	2018-08-06	1	-28/+10
\| \| \| \| \| \| \| \| \|	Multiple failues reported by sanitizer-x86_64-linux, seem to be caused by this patch. Reverting to see if they sustain without it. Differential Revision: https://reviews.llvm.org/D50172 llvm-svn: 338994