bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	Revert r237046. See the testcase on the thread where r237046 was committed.	Nick Lewycky	2015-05-13	4	-56/+52
\| \| \| \|	llvm-svn: 237317
*	[DebugInfo] Debug locations for constant SD nodes	Sergey Dmitrouk	2015-05-13	1	-41/+76
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Several updates for [DebugInfo] Add debug locations to constant SD nodes (r235989). Includes: * re-enabling the change (disabled recently); * missing change for FP constants; * resetting debug location of constant node if it's used more than at one place to prevent emission of wrong locations in case of coalesced constants; * a couple of additional tests. Now all look ups in CSEMap are wrapped by additional method. Comment in D9084 suggests that debug locations aren't useful for "target constants", so there might be one more change related to this API (namely, dropping debug locations for getTarget*Constant methods). Differential Revision: http://reviews.llvm.org/D9604 llvm-svn: 237237
*	[Statepoints] Support for "patchable" statepoints.	Sanjoy Das	2015-05-12	1	-6/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This change adds two new parameters to the statepoint intrinsic, `i64 id` and `i32 num_patch_bytes`. `id` gets propagated to the ID field in the generated StackMap section. If the `num_patch_bytes` is non-zero then the statepoint is lowered to `num_patch_bytes` bytes of nops instead of a call (the spill and reload code remains unchanged). A non-zero `num_patch_bytes` is useful in situations where a language runtime requires complete control over how a call is lowered. This change brings statepoints one step closer to patchpoints. With some additional work (that is not part of this patch) it should be possible to get rid of `TargetOpcode::STATEPOINT` altogether. PlaceSafepoints generates `statepoint` wrappers with `id` set to `0xABCDEF00` (the old default value for the ID reported in the stackmap) and `num_patch_bytes` set to `0`. This can be made more sophisticated later. Reviewers: reames, pgavlin, swaroop.sridhar, AndyAyers Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9546 llvm-svn: 237214
*	[Statepoints] Clean up statepoint argument accessors.	Pat Gavlin	2015-05-12	1	-22/+12
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D9622 llvm-svn: 237191
*	[Statepoints] Split the calling convention and statepoint flags operand to ↵	Pat Gavlin	2015-05-12	1	-22/+15
\| \| \| \| \| \| \| \|	STATEPOINT into two separate operands. Differential Revision: http://reviews.llvm.org/D9623 llvm-svn: 237166
*	Reverse ordering of base and derived pointer during safepoint lowering.	Igor Laevsky	2015-05-12	1	-10/+12
\| \| \| \| \| \| \| \|	According to the documentation in StackMap section for the safepoint we should have: "The first Location in each pair describes the base pointer for the object. The second is the derived pointer actually being relocated." But before this change we emitted them in reverse order - derived pointer first, base pointer second. llvm-svn: 237126
*	Migrate existing backends that care about software floating point	Eric Christopher	2015-05-12	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	to use the information in the module rather than TargetOptions. We've had and clang has used the use-soft-float attribute for some time now so have the backends set a subtarget feature based on a particular function now that subtargets are created based on functions and function attributes. For the one middle end soft float check go ahead and create an overloadable TargetLowering::useSoftFloat function that just checks the TargetSubtargetInfo in all cases. Also remove the command line option that hard codes whether or not soft-float is set by using the attribute for all of the target specific test cases - for the generic just go ahead and add the attribute in the one case that showed up. llvm-svn: 237079
*	propagate IR-level fast-math-flags to DAG nodes; 2nd try; NFC	Sanjay Patel	2015-05-11	4	-52/+56
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is a less ambitious version of: http://reviews.llvm.org/rL236546 because that was reverted in: http://reviews.llvm.org/rL236600 because it caused memory corruption that wasn't related to FMF but was actually due to making nodes with 2 operands derive from a plain SDNode rather than a BinarySDNode. This patch adds the minimum plumbing necessary to use IR-level fast-math-flags (FMF) in the backend without actually using them for anything yet. This is a follow-on to: http://reviews.llvm.org/rL235997 ...which split the existing nsw / nuw / exact flags and FMF into their own struct. llvm-svn: 237046
*	Fixing build warnings	Andrew Kaylor	2015-05-11	1	-2/+2
\| \| \| \|	llvm-svn: 237042
*	[WinEH] Update exception numbering to give handlers their own base state.	Andrew Kaylor	2015-05-11	1	-15/+74
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D9512 llvm-svn: 237014
*	[SelectionDAG] Fixed constant folding issue when legalised types are smaller ↵	Simon Pilgrim	2015-05-10	1	-2/+3
\| \| \| \| \| \| \| \|	then the folded type. Found when testing with llvm-stress on i686 targets. llvm-svn: 236954
*	Fix MergeConsecutiveStore for non-byte-sized memory accesses.	James Y Knight	2015-05-09	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The bug showed up as a compile-time assertion failure: Assertion `NumBits >= MIN_INT_BITS && "bitwidth too small"' failed when building msan tests on x86-64. Prior to r236850, this bug was masked due to a bogus alignment check, which also accidentally rejected non-byte-sized accesses. Afterwards, an invalid ElementSizeBytes == 0 got further into the function, and triggered the assertion failure. It would probably be a good idea to allow it to handle merging stores of unusual widths as well, but for now, to un-break it, I'm just making the minimal fix. Differential Revision: http://reviews.llvm.org/D9626 llvm-svn: 236927
*	[Fast-ISel] Don't mark the first use of a remat constant as killed.	Pete Cooper	2015-05-09	1	-4/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When emitting something like 'add x, 1000' if we remat the 1000 then we should be able to mark the vreg containing 1000 as killed. Given that we go bottom up in fast-isel, a later use of 1000 will be higher up in the BB and won't kill it, or be impacted by the lower kill. However, rematerialised constant expressions aren't generated bottom up. The local value save area grows downwards. This means that if you remat 2 constant expressions which both use 1000 then the first will kill it, then the second, which is lower in the BB will read a killed register. This is the case in the attached test where the 2 GEPs both need to generate 'add x, 6680' for the constant offset. Note that this commit only makes kill flag generation conservative. There's nothing else obviously wrong with the local value save area growing downwards, and in fact it needs to for handling arbitrarily complex constant expressions. However, it would be nice if there was a solution which would let us generate more accurate kill flags, or just kill flags completely. llvm-svn: 236922
*	Switch lowering: cluster adjacent fall-through cases even at -O0	Hans Wennborg	2015-05-08	1	-3/+5
\| \| \| \| \| \| \|	It's cheap to do, and codegen is much faster if cases can be merged into clusters. llvm-svn: 236905
*	[Fast-ISel] Clear kill flags on registers replaced by updateValueMap.	Pete Cooper	2015-05-08	1	-0/+7
\| \| \| \| \| \| \| \| \| \|	When selecting an extract instruction, we don't actually generate code but instead work out which register we are reading, and rewrite uses of the extract def to the source register. This is done via updateValueMap,. However, its possible that the source register we are rewriting to to also have uses. If those uses are after a kill of the value we are rewriting from then we have uses after a kill and the verifier fails. This code checks for the case where the to register is also used, and if so it clears all kill on the from register. This is conservative, but better that always clearing kills on the from register. llvm-svn: 236897
*	Extend the statepoint intrinsic to allow statepoints to be marked as ↵	Pat Gavlin	2015-05-08	2	-11/+93
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	transitions from GC-aware code to code that is not GC-aware. This changes the shape of the statepoint intrinsic from: @llvm.experimental.gc.statepoint(anyptr target, i32 # call args, i32 unused, ...call args, i32 # deopt args, ...deopt args, ...gc args) to: @llvm.experimental.gc.statepoint(anyptr target, i32 # call args, i32 flags, ...call args, i32 # transition args, ...transition args, i32 # deopt args, ...deopt args, ...gc args) This extension offers the backend the opportunity to insert (somewhat) arbitrary code to manage the transition from GC-aware code to code that is not GC-aware and back. In order to support the injection of transition code, this extension wraps the STATEPOINT ISD node generated by the usual lowering lowering with two additional nodes: GC_TRANSITION_START and GC_TRANSITION_END. The transition arguments that were passed passed to the intrinsic (if any) are lowered and provided as operands to these nodes and may be used by the backend during code generation. Eventually, the lowering of the GC_TRANSITION_{START,END} nodes should be informed by the GC strategy in use for the function containing the intrinsic call; for now, these nodes are instead replaced with no-ops. Differential Revision: http://reviews.llvm.org/D9501 llvm-svn: 236888
*	Fix alignment checks in MergeConsecutiveStores.	James Y Knight	2015-05-08	1	-36/+52
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	1) check whether the alignment of the memory is sufficient for the merged store or load to be efficient. Not doing so can result in some ridiculously poor code generation, if merging creates a vector operation which must be aligned but isn't. 2) DON'T check that the alignment of each load/store is equal. If you're merging 2 4-byte stores, the first might have 8-byte alignment, but the second certainly will have 4-byte alignment. We do want to allow those to be merged. llvm-svn: 236850
*	Fix coding standart based on post submit comments.	Igor Laevsky	2015-05-08	1	-4/+4
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D7760 llvm-svn: 236849
*	Switch lowering: handle zero-weight branch probabilities	Hans Wennborg	2015-05-07	1	-16/+7
\| \| \| \| \| \| \| \| \| \|	After r236617, branch probabilities are no longer guaranteed to be >= 1. This patch makes the swich lowering code handle that correctly, without bumping the branch weights by 1 which might cause overflow and skews the probabilities. Covered by @zero_weight_tree in test/CodeGen/X86/switch.ll. llvm-svn: 236739
*	Fix incorrect kill flags in fastisel.	Pete Cooper	2015-05-06	1	-2/+6
\| \| \| \| \| \| \| \|	If called twice in the same BB on the same constant, FastISel::fastEmit_ri_ was marking the materialized vreg as killed on each use, instead of only the last use. Change this to only mark the last use as killed by making earlier uses check if the vreg is already used elsewhere. llvm-svn: 236650
*	[SelectionDAG] Delete SelectionDAGBuilder::removeValue. NFC.	Sanjoy Das	2015-05-06	1	-6/+0
\| \| \| \| \| \|	SelectionDAGBuilder::removeValue is dead now, after rL236563. llvm-svn: 236618
*	Allow 0-weight branches in BranchProbabilityInfo.	Diego Novillo	2015-05-06	1	-1/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: When computing branch weights in BPI, we used to disallow branches with weight 0. This is a minor nuisance, because a branch with weight 0 is different to "don't have information". In the context of instrumentation, it may mean "never executed", in the context of sampling, it means "never or seldom executed". In allowing 0 weight branches, I ran into issues with the switch expansion code in selection DAG. It is currently hardwired to not handle branches with weight 0. To maintain the current behaviour, I changed it to use 1 when it finds 0, but perhaps the algorithm needs changes to tolerate branches with weight zero. Reviewers: hansw Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9533 llvm-svn: 236617
*	Reformat.	NAKAMURA Takumi	2015-05-06	2	-6/+5
\| \| \| \|	llvm-svn: 236601
*	Revert r236546, "propagate IR-level fast-math-flags to DAG nodes (NFC)"	NAKAMURA Takumi	2015-05-06	4	-65/+60
\| \| \| \| \| \|	It caused undefined behavior. llvm-svn: 236600
*	SelectionDAG: Handle out-of-bounds index in extract vector element	Pawel Bylica	2015-05-06	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch correctly handles undef case of EXTRACT_VECTOR_ELT node where the element index is constant and not less than vector size. Test Plan: CodeGen for X86 test included. Also one incorrect regression test fixed. Reviewers: qcolombet, chandlerc, hfinkel Reviewed By: hfinkel Subscribers: hfinkel, llvm-commits Differential Revision: http://reviews.llvm.org/D9250 llvm-svn: 236584
*	[Statepoint] Clean up StatepointLowering: symbolic constants.	Sanjoy Das	2015-05-06	1	-2/+3
\| \| \| \| \| \| \| \|	For accessors in the `Statepoint` class, use symbolic constants for offsets into the argument vector instead of literals. This makes the code intent clearer and simpler to change. llvm-svn: 236566
*	[Statepoint] Clean up Statepoint.h: accessor names.	Sanjoy Das	2015-05-06	1	-10/+10
\| \| \| \| \| \|	Use getFoo() as accessors consistently and some other naming changes. llvm-svn: 236564
*	[StatepointLowering] Don't create temporary instructions. NFCI.	Sanjoy Das	2015-05-06	1	-73/+69
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Instead of creating a temporary call instruction and lowering that, use SelectionDAGBuilder::lowerCallOperands. Reviewers: reames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9480 llvm-svn: 236563
*	[SelectionDAG] Make an argument optional in RFV::getCopyToRegs. NFC.	Sanjoy Das	2015-05-05	1	-5/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: We default the value argument to nullptr. The only use of the value is in diagnosePossiblyInvalidConstraint and that seems to be resilient to it being nullptr. Reviewers: atrick, reames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9479 llvm-svn: 236555
*	[SelectionDAG] Move RegsForValue into SelectionDAGBuilder.h. NFC.	Sanjoy Das	2015-05-05	2	-85/+90
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The exported class will be used in later change, in StatepointLowering.cpp. It is still internal to SelectionDAG (not exported via include/). Reviewers: reames, atrick Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9478 llvm-svn: 236554
*	[SelectionDAG] Pass explicit type to lowerCallOperands. NFC.	Sanjoy Das	2015-05-05	2	-5/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Currently this does not change anything, but change will be used in a later change to StatepointLowering.cpp Reviewers: reames, atrick Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9477 llvm-svn: 236553
*	[StatepointLowering] Rename variable, NFC.	Sanjoy Das	2015-05-05	1	-3/+3
\| \| \| \| \| \|	Rename LoweredArgs to LoweredMetaArgs to clarify intent. llvm-svn: 236552
*	propagate IR-level fast-math-flags to DAG nodes (NFC)	Sanjay Patel	2015-05-05	4	-60/+65
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch adds the minimum plumbing necessary to use IR-level fast-math-flags (FMF) in the backend without actually using them for anything yet. This is a follow-on to: http://reviews.llvm.org/rL235997 ...which split the existing nsw / nuw / exact flags and FMF into their own struct. There are 2 structural changes here: 1. The main diff is that we're preparing to extend the optimization flags to affect more than just binary SDNodes. Eg, IR intrinsics ( https://llvm.org/bugs/show_bug.cgi?id=21290 ) or non-binop nodes that don't even exist in IR such as FMA, FNEG, etc. 2. The other change is that we're actually copying the FP fast-math-flags from the IR instructions to SDNodes. Differential Revision: http://reviews.llvm.org/D8900 llvm-svn: 236546
*	[DAGCombiner] Account for getVectorIdxTy() when narrowing vector load	Ulrich Weigand	2015-05-05	1	-2/+3
\| \| \| \| \| \| \| \| \| \| \|	This patch makes ReplaceExtractVectorEltOfLoadWithNarrowedLoad convert the element number from getVectorIdxTy() to PtrTy before doing pointer arithmetic on it. This is needed on z, where element numbers are i32 but pointers are i64. Original patch by Richard Sandiford. llvm-svn: 236530
*	[DAGCombiner] Fix ReplaceExtractVectorEltOfLoadWithNarrowedLoad for BE	Ulrich Weigand	2015-05-05	1	-7/+0
\| \| \| \| \| \| \| \| \| \| \| \| \|	For little-endian, the function would convert (extract_vector_elt (load X), Y) to X + Ysizeof(elt). For big-endian it would instead use X + sizeof(vec) - Ysizeof(elt). The big-endian case wasn't right since vector index order always follows memory/array order, even for big-endian. (Note that the current handling has to be wrong for Y==0 since it would access beyond the end of the vector.) Original patch by Richard Sandiford. llvm-svn: 236529
*	[LegalizeVectorTypes] Allow single loads and stores for more short vectors	Ulrich Weigand	2015-05-05	1	-1/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When lowering a load or store for TypeWidenVector, the type legalizer would use a single load or store if the associated integer type was legal. E.g. it would load a v4i8 as an i32 if i32 was legal. This patch extends that behavior to promoted integers as well as legal ones. If the integer type for the full vector width is TypePromoteInteger, the element type is going to be TypePromoteInteger too, and it's still better to use a single promoting load or truncating store rather than N individual promoting loads or truncating stores. E.g. if you have a v2i8 on a target where i16 is promoted to i32, it's better to load the v2i8 as an i16 rather than load both i8s individually. Original patch by Richard Sandiford. llvm-svn: 236528
*	Masked gather and scatter intrinsics - enabled codegen for KNL.	Elena Demikhovsky	2015-05-03	3	-3/+182
\| \| \| \|	llvm-svn: 236394
*	[DAGCombiner] Enabled vector float/double -> int constant folding	Simon Pilgrim	2015-05-02	2	-4/+4
\| \| \| \|	llvm-svn: 236387
*	[SelectionDAG] Unary vector constant folding integer legality fixes	Simon Pilgrim	2015-05-01	1	-5/+25
\| \| \| \| \| \| \| \| \| \| \| \|	This patch fixes issues with vector constant folding not correctly handling scalar input operands if they require implicit truncation - this was tested with llvm-stress as recommended by Patrik H Hagglund. The patch ensures that integer input scalars from a build vector are correctly truncated before folding, and that constant integer scalar results are promoted to a legal type before inclusion in the new folded build vector. I have added another crash test case and also a test for UINT_TO_FP / SINT_TO_FP using an non-truncated scalar input, which was failing before this patch. Differential Revision: http://reviews.llvm.org/D9282 llvm-svn: 236308
*	Reinstate revisions r234755, r234759, r234760	Jan Vesely	2015-04-30	1	-0/+32
\| \| \| \| \| \| \| \| \|	changes: Don't apply on hexagon and NVPTX since they no longer claim to support UADDO/USUBO Add location to getConstant Drop comment about the ops being turned into expand llvm-svn: 236240
*	Inline local variable to silence unused warning.	Daniel Jasper	2015-04-30	1	-2/+1
\| \| \| \|	llvm-svn: 236212
*	Masked gather and scatter - added DAGCombine visitors	Elena Demikhovsky	2015-04-30	2	-0/+144
\| \| \| \| \| \| \| \| \|	and AVX-512 instruction selection patterns. All other patches, including tests will follow. http://reviews.llvm.org/D7665 llvm-svn: 236211
*	Semantically revert r236031, which is not a good idea for in-order targets.	Owen Anderson	2015-04-30	1	-31/+0
\| \| \| \| \| \| \| \| \| \| \| \|	At the least it should be guarded by some kind of target hook. It also introduced catastrophic compile time and code quality regressions on some out of tree targets (test case still being reduced/sanitized). Sanjay agreed with reverting this patch until these issues can be resolved. llvm-svn: 236199
*	Switch lowering: use profile info to build weight-balanced binary search trees	Hans Wennborg	2015-04-30	1	-7/+35
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This will cause hot nodes to appear closer to the root. The literature says building the tree like this makes it a near-optimal (in terms of search time given key frequencies) binary search tree. In LLVM's case, we can do up to 3 comparisons in each leaf node, so it might be better to opt for lower tree height in some cases; that's something to look into in the future. Differential Revision: http://reviews.llvm.org/D9318 llvm-svn: 236192
*	generalize binop reassociation; NFC	Sanjay Patel	2015-04-29	1	-17/+30
\| \| \| \| \| \| \| \| \| \| \| \| \|	Move the fold introduced in r236031: http://reviews.llvm.org/rL236031 to its own helper function, so we can use it for other binops. This is a preliminary step before partially solving: https://llvm.org/bugs/show_bug.cgi?id=21768 https://llvm.org/bugs/show_bug.cgi?id=23116 llvm-svn: 236171
*	Run StatepointLowering.{cpp,h} through clang-format.	Pat Gavlin	2015-04-29	2	-39/+28
\| \| \| \|	llvm-svn: 236166
*	tidy up; NFC	Sanjay Patel	2015-04-29	1	-41/+28
\| \| \| \|	llvm-svn: 236156
*	too much space again; NFC	Sanjay Patel	2015-04-29	1	-4/+0
\| \| \| \|	llvm-svn: 236150
*	too much space; NFC	Sanjay Patel	2015-04-29	1	-4/+0
\| \| \| \|	llvm-svn: 236147
*	IR: Give 'DI' prefix to debug info metadata	Duncan P. N. Exon Smith	2015-04-29	6	-16/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Finish off PR23080 by renaming the debug info IR constructs from `MD` to `DI`. The last of the `DIDescriptor` classes were deleted in r235356, and the last of the related typedefs removed in r235413, so this has all baked for about a week. Note: If you have out-of-tree code (like a frontend), I recommend that you get everything compiling and tests passing with the previous commit before updating to this one. It'll be easier to keep track of what code is using the `DIDescriptor` hierarchy and what you've already updated, and I think you're extremely unlikely to insert bugs. YMMV of course. Back to this commit: I did this using the rename-md-di-nodes.sh upgrade script I've attached to PR23080 (both code and testcases) and filtered through clang-format-diff.py. I edited the tests for test/Assembler/invalid-generic-debug-node-*.ll by hand since the columns were off-by-three. It should work on your out-of-tree testcases (and code, if you've followed the advice in the previous paragraph). Some of the tests are in badly named files now (e.g., test/Assembler/invalid-mdcompositetype-missing-tag.ll should be 'dicompositetype'); I'll come back and move the files in a follow-up commit. llvm-svn: 236120