bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message	Author	Age	Files	Lines
...
*	[ppc] Correctly compute the cost of loading 32/64 bit memory into VSR	Guozhi Wei	2016-12-03	1	-5/+14
\| \| \| \| \| \| \| \|	VSX has the instructions lxsiwax/lxsdx, which can load a 32/64-bit value into a VSX register cheaply. This patch makes that known to the memory cost model, so the vectorization of the test case in pr30990 is beneficial. Differential Revision: https://reviews.llvm.org/D26713 llvm-svn: 288560
*	IR: Change the gep_type_iterator API to avoid always exposing the "current" ↵	Peter Collingbourne	2016-12-02	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \|	type. Instead, expose whether the current type is an array or a struct, if an array what the upper bound is, and if a struct the struct type itself. This is in preparation for a later change which will make PointerType derive from Type rather than SequentialType. Differential Revision: https://reviews.llvm.org/D26594 llvm-svn: 288458
*	Move FrameInstructions from MachineModuleInfo to MachineFunction	Matthias Braun	2016-11-30	1	-9/+9
\| \| \| \| \| \| \| \| \| \| \|	This is per function data so it is better kept at the function instead of the module. This is a necessary step to have machine module passes work properly. Differential Revision: https://reviews.llvm.org/D27185 llvm-svn: 288291
*	[PowerPC] Preserve machine dominator tree in PPCVSXFMAMutate	Krzysztof Parzyszek	2016-11-30	1	-0/+4
\| \| \| \| \| \|	It is needed by LiveIntervalAnalysis. llvm-svn: 288243
*	[PowerPC] Improvements for BUILD_VECTOR Vol. 2	Nemanja Ivanovic	2016-11-29	1	-4/+98
\| \| \| \| \| \| \| \| \| \|	This patch corresponds to review: https://reviews.llvm.org/D26023 This patch adds support for converting a vector of loads into a single load if the loads are consecutive (in either direction). llvm-svn: 288219
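The consecutiveness check described above can be illustrated outside of LLVM. The following is a minimal sketch, not the patch's code: `LoadOrder` and `classifyLoads` are hypothetical names, and byte offsets stand in for the load addresses of the individual vector elements.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical classification of a group of element loads.
enum class LoadOrder { NotConsecutive, Ascending, Descending };

// Decide whether element loads at the given byte offsets touch consecutive
// memory, in either direction, so that one wide load could replace them.
inline LoadOrder classifyLoads(const std::vector<std::int64_t> &offsets,
                               std::int64_t elemSize) {
  assert(elemSize > 0 && "element size must be positive");
  if (offsets.size() < 2)
    return LoadOrder::NotConsecutive;
  bool ascending = true, descending = true;
  for (std::size_t i = 1; i < offsets.size(); ++i) {
    const std::int64_t delta = offsets[i] - offsets[i - 1];
    ascending = ascending && (delta == elemSize);
    descending = descending && (delta == -elemSize);
  }
  if (ascending)
    return LoadOrder::Ascending;
  if (descending)
    return LoadOrder::Descending;
  return LoadOrder::NotConsecutive;
}
```

In the descending case a single load would presumably have to be followed by an element reversal; that step is an assumption here and is not shown.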
*	[PowerPC] Improvements for BUILD_VECTOR Vol. 2	Nemanja Ivanovic	2016-11-29	2	-1/+98
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch corresponds to review: https://reviews.llvm.org/D25980 This is the 2nd patch in a series of 4 that improve the lowering and combining for BUILD_VECTOR nodes on PowerPC. This particular patch combines a build vector of fp-to-int conversions into an fp-to-int conversion of a build vector of fp values. For example, it converts (build_vector (fp_to_[su]i $A), (fp_to_[su]i $B), ...) into (fp_to_[su]i (build_vector $A, $B, ...)), which is a natural match for much cleaner code. llvm-svn: 288218
*	Revert https://reviews.llvm.org/rL287679	Nemanja Ivanovic	2016-11-29	2	-23/+6
\| \| \| \| \| \| \|	This commit caused some miscompiles that did not show up on any of the bots. Reverting until we can investigate the cause of those failures. llvm-svn: 288214
*	[PowerPC] Improvements for BUILD_VECTOR Vol. 1	Nemanja Ivanovic	2016-11-29	3	-58/+361
\| \| \| \| \| \| \| \| \| \|	This patch corresponds to review: https://reviews.llvm.org/D25912 This is the first patch in a series of 4 that improve the lowering and combining for BUILD_VECTOR nodes on PowerPC. llvm-svn: 288152
*	[PowerPC] Remove InstAlias definitions that cause incorrect assembly	Nemanja Ivanovic	2016-11-23	1	-11/+15
\| \| \| \| \| \| \| \| \| \| \| \| \|	In rL283190, I added some InstAlias definitions to generate extended mnemonics for some uses of the XXPERMDI instruction. However, when the assembler matches these extended mnemonics, it matches the new instruction in situations where it should match the old one. This patch removes these definitions and accomplishes that by defining these mnemonics with additional instructions that are isCodeGenOnly. Fixes PR31127. llvm-svn: 287765
*	[PowerPC] Emit VMX loads/stores for aligned ops to avoid adding swaps on LE	Nemanja Ivanovic	2016-11-22	2	-6/+23
\| \| \| \| \| \| \| \| \| \| \|	This patch corresponds to review: https://reviews.llvm.org/D26861 It also fixes PR30730. Committing on behalf of Lei Huang. llvm-svn: 287679
*	Fix comment typos. NFC.	Simon Pilgrim	2016-11-20	1	-2/+2
\| \| \| \| \| \|	Identified by Pedro Giffuni in PR27636. llvm-svn: 287486
*	Check that emitted instructions meet their predicates on all targets except ↵	Daniel Sanders	2016-11-19	1	-1/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	ARM, Mips, and X86. Summary: * ARM is omitted from this patch because this check appears to expose bugs in this target. * Mips is omitted from this patch because this check either detects bugs or deliberate emission of instructions that don't satisfy their predicates. One deliberate use is the SYNC instruction where the version with an operand is correctly defined as requiring MIPS32 while the version without an operand is defined as an alias of 'SYNC 0' and requires MIPS2. * X86 is omitted from this patch because it doesn't use the tablegen-erated MCCodeEmitter infrastructure. Patches for ARM and Mips will follow. Depends on D25617 Reviewers: tstellarAMD, jmolloy Subscribers: wdng, jmolloy, aemerson, rengolin, arsenm, jyknight, nemanjai, nhaehnle, tstellarAMD, llvm-commits Differential Revision: https://reviews.llvm.org/D25618 llvm-svn: 287439
*	[PPC] limit line width to 80 characters	Ehsan Amiri	2016-11-18	1	-1/+2
\| \| \| \| \| \|	NFC. Forgot to fix this in the original commit. llvm-svn: 287350
*	[Power9] Add patterns for vnegd, vnegw	Ehsan Amiri	2016-11-18	1	-2/+7
\| \| \| \| \| \| \|	Exploit new instructions by adding patterns to .td file. https://reviews.llvm.org/D26551 llvm-svn: 287334
*	[PPC][DAGCombine] Convert SETCC to subtract when the result is zero extended	Ehsan Amiri	2016-11-18	2	-1/+88
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When we see a SETCC whose only users are zero extend operations, we can replace it with a subtraction. This results in doing all calculations in GPRs and avoids CR use. Currently we do this only for ULT, ULE, UGT and UGE condition codes. There are ways that this can be extended. For example for signed condition codes. In that case we will be introducing additional sign extend instructions, so more careful profitability analysis may be required. Another direction to extend this is for equal, not equal conditions. Also when users of SETCC are any_ext or sign_ext, we might be able to do something similar. llvm-svn: 287329
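The arithmetic idea behind replacing a zero-extended unsigned compare with a subtraction can be sketched as follows. This is an illustration under assumptions, not the DAG combine itself: the helper names are hypothetical, and the exact instruction sequence the patch emits is not shown in this log.

```cpp
#include <cassert>
#include <cstdint>

// zext(a <u b) for 32-bit operands, computed without a comparison.
// Both operands are widened to 64 bits, so the difference wraps below zero
// (top bit set) exactly when a < b. Shifting the top bit down yields 0 or 1.
inline std::uint64_t zextULT(std::uint32_t a, std::uint32_t b) {
  const std::uint64_t wideA = a;
  const std::uint64_t wideB = b;
  return (wideA - wideB) >> 63;
}

// The other unsigned condition codes named in the commit follow by swapping
// the operands and/or flipping the result bit.
inline std::uint64_t zextUGT(std::uint32_t a, std::uint32_t b) {
  return zextULT(b, a);
}
inline std::uint64_t zextUGE(std::uint32_t a, std::uint32_t b) {
  return zextULT(a, b) ^ 1u;
}
inline std::uint64_t zextULE(std::uint32_t a, std::uint32_t b) {
  return zextULT(b, a) ^ 1u;
}
```

Everything stays in general-purpose registers, which matches the stated goal of avoiding condition-register use.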
*	Always use relative jump table encodings on PowerPC64.	Joerg Sonnenberger	2016-11-16	2	-0/+59
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	For the default, small and medium code model, use the existing difference from the jump table towards the label. For all other code models, set up the picbase and use the difference between the picbase and the block address. Overall, this results in smaller data tables at the expense of one or two more arithmetic operations at the jump site. Given that we only create jump tables with a lot more than two entries, it is a net win in size. For larger code models the assumption remains that individual functions are no larger than 2GB. Differential Revision: https://reviews.llvm.org/D26336 llvm-svn: 287059
*	vector load store with length (left justified) llvm portion	Zaara Syeda	2016-11-15	1	-4/+16
\| \| \| \|	llvm-svn: 286993
*	[PowerPC] Implement BE VSX load/store builtins - llvm portion.	Tony Jiang	2016-11-15	2	-0/+15
\| \| \| \| \| \| \| \| \| \| \| \|	This patch implements all the overloads for vec_xl_be and vec_xst_be. On BE, they behave exactly the same as vec_xl and vec_xst, therefore they are simply implemented by defining a matching macro. On LE, they are implemented by defining new builtins and intrinsics. For int/float/long long/double, it is just a load (lxvw4x/lxvd2x) or store (stxvw4x/stxvd2x). For char/char/short, we also need some extra shuffling before or after calling the builtins to get the desired BE order. For int128, simply call vec_xl or vec_xst. llvm-svn: 286967
*	[PPC] Add intrinsic mapping to the xscvhpsp instruction	Sean Fertile	2016-11-14	1	-0/+9
\| \| \| \| \| \| \| \| \|	add an intrinsic to expose the 'VSX Scalar Convert Half-Precision to Single-Precision' instruction. Differential review: https://reviews.llvm.org/D26536 llvm-svn: 286862
*	[PPC] add intrinsics for vec extract exp/significand and vec test data class.	Sean Fertile	2016-11-14	1	-6/+18
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D26272 llvm-svn: 286829
*	[PowerPC] Add remaining vector permute builtins in altivec.h - LLVM portion	Nemanja Ivanovic	2016-11-11	2	-5/+23
\| \| \| \| \| \| \| \| \| \|	This patch corresponds to review: https://reviews.llvm.org/D26480 Adds all the intrinsics used for various permute builtins that will be added to altivec.h. llvm-svn: 286638
*	[PowerPC] Add vector conversion builtins to altivec.h - LLVM portion	Nemanja Ivanovic	2016-11-11	1	-8/+16
\| \| \| \| \| \| \| \| \| \| \|	This patch corresponds to review: https://reviews.llvm.org/D26307 Adds all the intrinsics used for various conversion builtins that will be added to altivec.h. These are type conversions between various types of vectors. llvm-svn: 286596
*	Add a blank line for a test commit.	Sean Fertile	2016-11-11	1	-0/+1
\| \| \| \|	llvm-svn: 286550
*	[DAG Combiner] Fix the native computation of the Newton series for reciprocals	Evandro Menezes	2016-11-10	2	-6/+7
\| \| \| \| \| \| \| \| \| \| \| \|	The generic infrastructure to compute the Newton series for reciprocal and reciprocal square root was conceived to allow a target to compute the series itself. However, the original code did not properly consider this condition if returned by a target. This patch addresses the issues to allow a target to compute the series on its own. Differential revision: https://reviews.llvm.org/D22975 llvm-svn: 286523
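For readers unfamiliar with the Newton series mentioned above, the refinement step for a reciprocal is x' = x * (2 - a * x). The sketch below shows only that textbook iteration; `refineReciprocal` is a hypothetical name and this is not the DAG combiner code.

```cpp
#include <cassert>
#include <cmath>

// Refine an initial estimate of 1/a with Newton-Raphson steps.
// Each step roughly squares the relative error of the estimate.
inline double refineReciprocal(double a, double estimate, int steps) {
  assert(a != 0.0 && steps >= 0);
  double x = estimate;
  for (int i = 0; i < steps; ++i)
    x = x * (2.0 - a * x);
  return x;
}
```

A hardware estimate instruction would normally supply `estimate`; the number of steps then depends on the precision of that estimate and of the target type.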
*	Sink all of the code relying on the MachO MachineModuleInfo to live	Chandler Carruth	2016-11-03	1	-47/+51
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	behind the test that the MachineModuleInfo analysis was actually available and can be used. While the MachO bits may well be reasonable to assume in the darwin assembly printer, the analysis isn't constructively guaranteed anywhere I could find so it seems safest to avoid crashing here. This issue was found with PVS-Studio. Pretty sure the Clang Static Analyzer flags similar issues but we've probably never pointed it at this code effectively. llvm-svn: 285972
*	NFC - Test commit.	Tony Jiang	2016-11-03	1	-1/+0
\| \| \| \| \| \|	Delete an empty line at the end of README.txt file. llvm-svn: 285964
*	Create the virtual register for the global base in the intersection of	Joerg Sonnenberger	2016-11-02	1	-2/+2
\| \| \| \| \| \| \| \|	GPRC and GPRC_NOR0 (or the 64bit equivalent) and not just the latter. GPRC_NOR0 contains ZERO as alternative meaning of r0 and is therefore not a true subclass of GPRC. llvm-svn: 285813
*	[PowerPC] Implement vector shift builtins - llvm portion	Nemanja Ivanovic	2016-11-01	1	-2/+4
\| \| \| \| \| \| \|	This patch corresponds to review https://reviews.llvm.org/D26095. Committing on behalf of Tony Jiang. llvm-svn: 285681
*	[PPC] add absolute difference altivec instructions and matching intrinsics	Nemanja Ivanovic	2016-10-31	1	-0/+11
\| \| \| \| \| \| \|	This patch corresponds to review https://reviews.llvm.org/D26072. Committing on behalf of Sean Fertile. llvm-svn: 285627
*	Implement vector count leading/trailing bytes with zero lsb and vector parity	Nemanja Ivanovic	2016-10-28	1	-7/+14
\| \| \| \| \| \| \| \| \|	builtins - llvm portion This patch corresponds to review https://reviews.llvm.org/D26003. Committing on behalf of Zaara Syeda. llvm-svn: 285434
*	[PowerPC] - No SExt/ZExt needed for count trailing zeros	Nemanja Ivanovic	2016-10-27	1	-2/+4
\| \| \| \| \| \| \| \| \|	This patch corresponds to review: https://reviews.llvm.org/D25896 It just eliminates the redundant ZExt after a count trailing zeros instruction. llvm-svn: 285267
*	[PowerPC] Implement vec_insert_exp builtins - llvm portion	Nemanja Ivanovic	2016-10-26	1	-2/+2
\| \| \| \| \| \| \|	This revision corresponds to review: https://reviews.llvm.org/D25957. Committing on behalf of Zaara Syeda. llvm-svn: 285225
*	Target: Change various section classifiers in TargetLoweringObjectFile to ↵	Peter Collingbourne	2016-10-24	2	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	take a GlobalObject. These functions are about classifying a global which will actually be emitted, so it does not make sense for them to take a GlobalValue which may for example be an alias. Change the Mach-O object writer and the Hexagon, Lanai and MIPS backends to look through aliases before using TargetLoweringObjectFile interfaces. These are functional changes but all appear to be bug fixes. Differential Revision: https://reviews.llvm.org/D25917 llvm-svn: 285006
*	[PPC] Generate positive FP zero using xor insn instead of loading from ↵	Ehsan Amiri	2016-10-24	5	-0/+43
\| \| \| \| \| \| \| \| \| \| \|	constant area https://reviews.llvm.org/D23614 Currently we load +0.0 from constant area. That can change to be generated using XOR instruction. llvm-svn: 284995
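Why an XOR can stand in for the constant-pool load: XORing any bit pattern with itself gives all-zero bits, and in IEEE-754 the all-zero pattern is +0.0. A minimal sketch, assuming a 64-bit IEEE-754 `double`; the function names are hypothetical.

```cpp
#include <cassert>
#include <cmath>
#include <cstdint>
#include <cstring>

// Produce +0.0 from an arbitrary register value without loading a constant.
inline double zeroViaXor(std::uint64_t anyBits) {
  const std::uint64_t bits = anyBits ^ anyBits; // always the all-zero pattern
  double result;
  static_assert(sizeof(result) == sizeof(bits), "assumes a 64-bit double");
  std::memcpy(&result, &bits, sizeof(result));
  return result;
}

// True only for positive zero (distinguishes +0.0 from -0.0).
inline bool isPositiveZero(double value) {
  return value == 0.0 && !std::signbit(value);
}
```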
*	[PPC] Better codegen for AND, ANY_EXT, SRL sequence	Ehsan Amiri	2016-10-24	2	-0/+23
\| \| \| \| \| \| \| \|	https://reviews.llvm.org/D24924 This improves the code generated for a sequence of AND, ANY_EXT, SRL instructions. This is a targeted fix for this special pattern. The pattern is generated by the target-independent DAG combiner and so a more general fix may not be necessary. If we come across other similar cases, some ideas for handling it are discussed on the code review. llvm-svn: 284983
*	[Target] remove TargetRecip class; 2nd try	Sanjay Patel	2016-10-20	2	-50/+23
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is a retry of r284495 which was reverted at r284513 due to use-after-scope bugs caused by faulty usage of StringRef. This version also renames a pair of functions: getRecipEstimateDivEnabled() getRecipEstimateSqrtEnabled() as suggested by Eric Christopher. original commit msg: [Target] remove TargetRecip class; move reciprocal estimate isel functionality to TargetLowering This is a follow-up to https://reviews.llvm.org/D24816 - where we changed reciprocal estimates to be function attributes rather than TargetOptions. This patch is intended to be a structural, but not functional change. By moving all of the TargetRecip functionality into TargetLowering, we can remove all of the reciprocal estimate state, shield the callers from the string format implementation, and simplify/localize the logic needed for a target to enable this. If a function has a "reciprocal-estimates" attribute, those settings may override the target's default reciprocal preferences for whatever operation and data type we're trying to optimize. If there's no attribute string or specific setting for the op/type pair, just use the target default settings. As noted earlier, a better solution would be to move the reciprocal estimate settings to IR instructions and SDNodes rather than function attributes, but that's a multi-step job that requires infrastructure improvements. I intend to work on that, but it's not clear how long it will take to get all the pieces in place. Differential Revision: https://reviews.llvm.org/D25440 llvm-svn: 284746
*	Do a sweep over move ctors and remove those that are identical to the default.	Benjamin Kramer	2016-10-20	1	-7/+0
\| \| \| \| \| \| \| \| \| \|	All of these existed because MSVC 2013 was unable to synthesize default move ctors. We recently dropped support for it so all that error-prone boilerplate can go. No functionality change intended. llvm-svn: 284721
*	revert r284495: [Target] remove TargetRecip class	Sanjay Patel	2016-10-18	2	-23/+50
\| \| \| \| \| \|	There's something wrong with the StringRef usage while parsing the attribute string. llvm-svn: 284513
*	[Target] remove TargetRecip class; move reciprocal estimate isel ↵	Sanjay Patel	2016-10-18	2	-50/+23
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	functionality to TargetLowering This is a follow-up to D24816 - where we changed reciprocal estimates to be function attributes rather than TargetOptions. This patch is intended to be a structural, but not functional change. By moving all of the TargetRecip functionality into TargetLowering, we can remove all of the reciprocal estimate state, shield the callers from the string format implementation, and simplify/localize the logic needed for a target to enable this. If a function has a "reciprocal-estimates" attribute, those settings may override the target's default reciprocal preferences for whatever operation and data type we're trying to optimize. If there's no attribute string or specific setting for the op/type pair, just use the target default settings. As noted earlier, a better solution would be to move the reciprocal estimate settings to IR instructions and SDNodes rather than function attributes, but that's a multi-step job that requires infrastructure improvements. I intend to work on that, but it's not clear how long it will take to get all the pieces in place. Differential Revision: https://reviews.llvm.org/D25440 llvm-svn: 284495
*	[PPC] Shorter sequence to load 64bit constant with same hi/lo words	Guozhi Wei	2016-10-14	1	-2/+23
\| \| \| \| \| \| \| \| \| \| \| \|	This is a patch to implement pr30640. When a 64bit constant has the same hi/lo words, we can use rldimi to copy the low word into the high word of the same register. This optimization caused a failure of test case bperm.ll because of a suboptimal heuristic in function SelectAndParts64. It chooses AND or ROTATE to extract bit groups from a register, and ORs them together. This optimization lowers the cost of loading the 64bit constant mask used in the AND method, and causes a different code sequence. But the ROTATE method is actually better in this test case. The reason is that in the ROTATE method the final OR operation can be avoided, since rldimi can insert the rotated bits into the target register directly. So this patch also enhances SelectAndParts64 to prefer the ROTATE method when the two methods have the same cost and there are multiple bit groups that need to be ORed together. Differential Revision: https://reviews.llvm.org/D25521 llvm-svn: 284276
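The constant-materialization idea can be modeled on plain integers. This sketch only models the effect of the final insert step; `hasSameHalves` and `replicateLowWord` are hypothetical names and do not appear in the patch.

```cpp
#include <cassert>
#include <cstdint>

// True when the upper and lower 32-bit words of the constant are identical,
// which is the precondition for the shorter sequence.
inline bool hasSameHalves(std::uint64_t imm) {
  return (imm >> 32) == (imm & 0xFFFFFFFFull);
}

// Model of the insert step: copy the low word of a register into its high
// word, overwriting whatever the high word held before.
inline std::uint64_t replicateLowWord(std::uint64_t reg) {
  const std::uint64_t low = reg & 0xFFFFFFFFull;
  return (low << 32) | low;
}
```

Under this model only the 32-bit low word has to be built with the usual immediate instructions, and one more instruction completes the 64-bit value.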
*	[PPCMIPeephole] Fix splat elimination	Tim Shen	2016-10-12	1	-3/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: In PPCMIPeephole, when we see two splat instructions, we can't simply do the following transformation: "B = Splat A; C = Splat B" => "C = Splat A", because B may still be used between these two instructions. Instead, we should make the second Splat a PPC::COPY and let later passes decide whether to remove it or not: "B = Splat A; C = Splat B" => "B = Splat A; C = COPY B". Fixes PR30663. Reviewers: echristo, iteratee, kbarton, nemanjai Subscribers: mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D25493 llvm-svn: 283961
*	Revert r283690, "MC: Remove unused entities."	Peter Collingbourne	2016-10-10	1	-1/+1
\| \| \| \|	llvm-svn: 283814
*	Move the global variables representing each Target behind accessor function	Mehdi Amini	2016-10-09	7	-23/+38
\| \| \| \| \| \| \| \|	This avoids "static initialization order fiasco" Differential Revision: https://reviews.llvm.org/D25412 llvm-svn: 283702
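The accessor pattern that avoids the static initialization order fiasco looks roughly like this. It is a minimal sketch: `TargetInfo` and `getTheExampleTarget` are hypothetical stand-ins, not the names used in the tree.

```cpp
#include <cassert>
#include <string>

// Hypothetical stand-in for a per-target registry object.
struct TargetInfo {
  std::string name;
};

// A function-local static is constructed on first use rather than at an
// unspecified point during program start-up, so code in another translation
// unit can call this from its own static initializers safely.
inline TargetInfo &getTheExampleTarget() {
  static TargetInfo theTarget{"example"};
  return theTarget;
}
```

Since C++11 the initialization of a function-local static is also thread-safe.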
*	MC: Remove unused entities.	Peter Collingbourne	2016-10-09	1	-1/+1
\| \| \| \|	llvm-svn: 283691
*	Target: Remove unused entities.	Peter Collingbourne	2016-10-09	1	-1/+1
\| \| \| \|	llvm-svn: 283690
*	Revert "Revert "Add a static_assert to enforce that parameters to ↵	Mehdi Amini	2016-10-07	1	-1/+2
\| \| \| \| \| \| \| \| \|	llvm::format() are not totally unsafe"" This reverts commit r283510 and reapply r283509, with updates to clang-tools-extra as well. llvm-svn: 283525
*	Target: Remove unused patterns and transforms. NFC.	Peter Collingbourne	2016-10-07	1	-10/+0
\| \| \| \|	llvm-svn: 283515
*	Revert "Add a static_assert to enforce that parameters to llvm::format() are ↵	Mehdi Amini	2016-10-06	1	-2/+1
\| \| \| \| \| \| \| \|	not totally unsafe" This reverts commit r283509, clang is hitting the assert. llvm-svn: 283510
*	Add a static_assert to enforce that parameters to llvm::format() are not ↵	Mehdi Amini	2016-10-06	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	totally unsafe Summary: For the second time today I hit a bug where llvm::format("%s", Str) was called with Str being a StringRef. The Linux and MacOS bots were fine, but because Windows has a different calling convention, it printed garbage there. Instead we can catch this at compile time: I believe a C vararg printf-like function is never expected to be called with a non-scalar type. Reviewers: bogner, Bigcheese, dexonsmith Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25266 llvm-svn: 283509
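A compile-time guard of this kind can be sketched with a variadic template. The code below is an illustration under assumptions, not the llvm::format implementation: `AllScalar` and `checkedFormat` are hypothetical names.

```cpp
#include <cassert>
#include <cstdio>
#include <string>
#include <type_traits>

// True when every (decayed) argument type is a scalar: arithmetic, enum or
// pointer. Class types such as string wrappers are rejected.
template <typename... Ts> struct AllScalar : std::true_type {};
template <typename T, typename... Ts>
struct AllScalar<T, Ts...>
    : std::integral_constant<
          bool, std::is_scalar<typename std::decay<T>::type>::value &&
                    AllScalar<Ts...>::value> {};

// printf-style formatting that refuses non-scalar arguments at compile time
// instead of passing them through C varargs and printing garbage.
template <typename... Ts>
std::string checkedFormat(const char *fmt, const Ts &... vals) {
  static_assert(AllScalar<Ts...>::value,
                "printf-style arguments must be scalar types");
  char buffer[256];
  const int written = std::snprintf(buffer, sizeof(buffer), fmt, vals...);
  return written < 0 ? std::string() : std::string(buffer);
}
```

With this guard, passing a class type such as `std::string` fails to compile, while `checkedFormat("%s", text.c_str())` is accepted because a pointer is a scalar.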
*	[Target] move reciprocal estimate settings from TargetOptions to TargetLowering	Sanjay Patel	2016-10-04	2	-19/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The motivation for the change is that we can't have pseudo-global settings for codegen living in TargetOptions because that doesn't work with LTO. Ideally, these reciprocal attributes will be moved to the instruction-level via FMF, metadata, or something else. But making them function attributes is at least an improvement over the current state. The ingredients of this patch are: Remove the reciprocal estimate command-line debug option. Add TargetRecip to TargetLowering. Remove TargetRecip from TargetOptions. Clean up the TargetRecip implementation to work with this new scheme. Set the default reciprocal settings in TargetLoweringBase (everything is off). Update the PowerPC defaults, users, and tests. Update the x86 defaults, users, and tests. Note that if this patch needs to be reverted, the related clang patch checked in at r283251 should be reverted too. Differential Revision: https://reviews.llvm.org/D24816 llvm-svn: 283252