bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	revert rev. 252153 due to build failure on ubuntu	Asaf Badouh	2015-11-05	7	-309/+1
\| \| \| \| \| \|	[X86][AVX512] add comi with Sae llvm-svn: 252154
*	[X86][AVX512] add comi with Sae	Asaf Badouh	2015-11-05	7	-1/+309
\| \| \| \| \| \| \| \|	add builtin_ia32_vcomisd and builtin_ia32_vcomisd Differential Revision: http://reviews.llvm.org/D14331 llvm-svn: 252153
*	[SimplifyCFG] Tweak heuristic for merging conditional stores	James Molloy	2015-11-05	1	-7/+13
\| \| \| \| \| \|	We were correctly skipping dbginfo intrinsics and terminators, but the initial bailout wasn't, causing it to bail out on almost any block. llvm-svn: 252152
*	[X86][AVX512] small bugfix in VPBROADCASTM	Asaf Badouh	2015-11-05	2	-2/+17
\| \| \| \| \| \| \| \|	VPBROADCASTMW2D and VPBROADCASTMB2Q Differential Revision: http://reviews.llvm.org/D14335 llvm-svn: 252151
*	RuntimeDyld: fix -Wtype-limits	Saleem Abdulrasool	2015-11-05	1	-2/+2
\| \| \| \| \| \| \|	Adjust the casted type. By casting to the same size rather than just the signed-ness, we were asserting tautological statements. NFC. llvm-svn: 252150
*	Fix LoopAccessAnalysis when potentially nullptr check are involved	Mehdi Amini	2015-11-05	2	-1/+44
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: GetUnderlyingObjects() can return "null" among its list of objects, we don't want to deduce that two pointers can point to the same memory in this case, so filter it out. Reviewers: anemet Subscribers: dexonsmith, llvm-commits From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 252149
*	MCStreamer.h: Prune \return, corresponding to r252102. [-Wdocumentation]	NAKAMURA Takumi	2015-11-05	1	-1/+0
\| \| \| \|	llvm-svn: 252148
*	Fix a bug exposed by uses in CFE	Xinliang David Li	2015-11-05	1	-2/+4
\| \| \| \|	llvm-svn: 252146
*	AMDGPU: Also track whether SGPRs were spilled	Matt Arsenault	2015-11-05	3	-2/+20
\| \| \| \|	llvm-svn: 252145
*	AMDGPU: Print number user SGPRs	Matt Arsenault	2015-11-05	1	-0/+6
\| \| \| \| \| \| \|	This doesn't quite match how SC prints it, which doesn't put it in a comment. llvm-svn: 252144
*	AMDGPU: Disallow s[102:103] on VI in assembler	Matt Arsenault	2015-11-05	3	-18/+74
\| \| \| \|	llvm-svn: 252142
*	[FunctionAttrs] Remove a loop, NFC refactor	Sanjoy Das	2015-11-05	1	-16/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Remove the loop over the uses of the CallSite in ArgumentUsesTracker. Since we have the `Use *` for actual argument operand, we can just use pointer subtraction. The time complexity remains the same though (except for a vararg argument) -- `std::advance` is O(UseIndex) for the ArgumentList iterator. The real motivation is to make a later change adding support for operand bundles simpler. Reviewers: reames, chandlerc, nlewycky Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14363 llvm-svn: 252141
*	AMDGPU: Fix assert when legalizing atomic operands	Matt Arsenault	2015-11-05	4	-15/+111
\| \| \| \| \| \| \| \| \| \|	The operand layout is slightly different for the atomic opcodes from the usual MUBUF loads and stores. This should only fix it on SI/CI. VI is still broken because it still emits the addr64 replacement. llvm-svn: 252140
*	AMDGPU: Make addr64 atomic operand order consistent	Matt Arsenault	2015-11-05	1	-2/+2
\| \| \| \| \| \| \|	vaddr comes before srsrc in every other MUBUF instruction, and is the order it is printed. llvm-svn: 252139
*	Fix OSX build after r252118 (missing parameter for findModulesAndOffsets())	Mehdi Amini	2015-11-05	1	-1/+2
\| \| \| \| \|	From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 252137
*	Remove empty lines	Mehdi Amini	2015-11-05	1	-2/+2
\| \| \| \| \|	From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 252136
*	[WinEH] Fix establisher param reg in CLR funclets	Joseph Tremoulet	2015-11-05	2	-9/+28
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The CLR's personality routine passes the pointer to the establisher frame in RCX, not RDX. Reviewers: pgavlin, majnemer, rnk Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14343 llvm-svn: 252135
*	[IR] Add bounds checking to dataOperandHasImpliedAttr	Sanjoy Das	2015-11-05	1	-0/+8
\| \| \| \| \| \|	This is similar to the bounds check added to paramHasAttr in r252073. llvm-svn: 252130
*	[libFuzzer] print a bit fewer lines	Kostya Serebryany	2015-11-05	2	-2/+3
\| \| \| \|	llvm-svn: 252123
*	Go back to producing relocations for out of range symbols.	Rafael Espindola	2015-11-05	3	-13/+9
\| \| \| \| \| \| \| \|	This brings back the behavior from before r252090 for out of range symbols. Should bring some arm bots back. llvm-svn: 252119
*	[Windows] Symbolize with llvm-symbolizer instead of dbghelp in a self-host	Reid Kleckner	2015-11-05	3	-97/+227
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: llvm-symbolizer understands both PDBs and DWARF, so it is more likely to succeed at symbolization. If llvm-symbolizer is unavailable, we will fall back to dbghelp. This also makes our crash traces more similar between Windows and Linux. Reviewers: Bigcheese, zturner, chapuni Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D12884 llvm-svn: 252118
*	AMDGPU: Add missing v2f64 fadd tests	Matt Arsenault	2015-11-05	1	-10/+42
\| \| \| \|	llvm-svn: 252117
*	AMDGPU: Fix typo	Matt Arsenault	2015-11-05	1	-2/+2
\| \| \| \|	llvm-svn: 252116
*	[PGO] Use template file to define runtime structures	Xinliang David Li	2015-11-05	4	-71/+79
\| \| \| \| \| \| \| \| \| \| \|	With this change, instrumentation code and reader/write code related to profile data structs are kept strictly in-sync. THis will be extended to cfe and compile-rt references as well. Differential Revision: http://reviews.llvm.org/D13843 llvm-svn: 252113
*	Fix Abbrev emission in WriteIdentificationBlock	Mehdi Amini	2015-11-05	1	-1/+2
\| \| \| \| \| \| \|	This Abbrev was not emitted and basically unused, just leacking there. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 252110
*	Fix pr24832.	Rafael Espindola	2015-11-05	4	-11/+16
\| \| \| \| \| \|	It is pretty simple now that the yak is shaved. llvm-svn: 252105
*	Simplify now that emitValueToOffset always returns false.	Rafael Espindola	2015-11-04	6	-24/+8
\| \| \| \|	llvm-svn: 252102
*	Simplify .org processing and make it a bit more powerful.	Rafael Espindola	2015-11-04	4	-21/+15
\| \| \| \| \| \| \|	We now always create the fragment, which lets us handle things like .org after a .align. llvm-svn: 252101
*	Define portable macros for packed struct definitions:	Xinliang David Li	2015-11-04	1	-0/+28
\| \| \| \| \| \| \| \| \| \| \| \| \|	1. A macro with argument: LLVM_PACKED(StructDefinition) 2. A pair of macros defining scope of region with packing: LLVM_PACKED_START struct A { ... }; struct B { ... }; LLVM_PACKED_END Differential Revision: http://reviews.llvm.org/D14337 llvm-svn: 252099
*	[SimplifyLibCalls] New transformation: tan(atan(x)) -> x	Davide Italiano	2015-11-04	4	-1/+71
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is enabled only under -ffast-math. So, instead of emitting: 4007b0: 50 push %rax 4007b1: e8 8a fd ff ff callq 400540 <atanf@plt> 4007b6: 58 pop %rax 4007b7: e9 94 fd ff ff jmpq 400550 <tanf@plt> 4007bc: 0f 1f 40 00 nopl 0x0(%rax) for: float mytan(float x) { return tanf(atanf(x)); } we emit a single retq. Differential Revision: http://reviews.llvm.org/D14302 llvm-svn: 252098
*	[libFuzzer] when choosing the next unit to mutate, give some preference to ↵	Kostya Serebryany	2015-11-04	2	-26/+46
\| \| \| \| \| \|	the most recent units (they are more likely to be interesting) llvm-svn: 252097
*	fix typo; NFC	Sanjay Patel	2015-11-04	1	-1/+1
\| \| \| \|	llvm-svn: 252096
*	[CaptureTracking] Support operand bundles conservatively	Sanjoy Das	2015-11-04	2	-2/+32
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Earlier CaptureTracking would assume all "interesting" operands to a call or invoke were its arguments. With operand bundles this is no longer true. Note: an earlier change got `doesNotCapture` working correctly with operand bundles. This change uses DSE to test the changes to CaptureTracking. DSE is a vehicle for testing only, and is not directly involved in this change. Reviewers: reames, majnemer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14306 llvm-svn: 252095
*	[CMake] Bug 25059 - CMake libllvm.so.$MAJOR.$MINOR shared object name not ↵	Chris Bieneman	2015-11-04	2	-8/+53
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	compatible with ldconfig Summary: This change makes the CMake build system generate libraries for Linux and Darwin matching the makefile build system. Linux libraries follow the pattern lib${name}.${MAJOR}.${MINOR}.so so that ldconfig won't pick it up incorrectly. Darwin libraries are not versioned. Note: On linux the non-versioned symlink is generated at install-time not build time. I plan to fix that eventually, but I expect that is good enough for the purposes of fixing this bug. Reviewers: loladiro, tstellarAMD Subscribers: axw, llvm-commits Differential Revision: http://reviews.llvm.org/D13841 llvm-svn: 252093
*	Slightly saner handling of thumb branches.	Rafael Espindola	2015-11-04	3	-9/+42
\| \| \| \| \| \| \| \|	The generic infrastructure already did a lot of work to decide if the fixup value is know or not. It doesn't make sense to reimplement a very basic case: same fragment. llvm-svn: 252090
*	[x86] Teach the shrink-wrapping hooks to do the proper thing with Win64.	Quentin Colombet	2015-11-04	2	-0/+130
\| \| \| \| \| \| \| \| \| \|	Win64 has some strict requirements for the epilogue. As a result, we disable shrink-wrapping for Win64 unless the block that gets the epilogue is already an exit block. Fixes PR24193. llvm-svn: 252088
*	Fix some Clang-tidy modernize warnings, other minor fixes.	Eugene Zelenko	2015-11-04	17	-89/+82
\| \| \| \| \| \| \| \|	Fixed warnings are: modernize-use-override, modernize-use-nullptr and modernize-redundant-void-arg. Differential revision: http://reviews.llvm.org/D14312 llvm-svn: 252087
*	PM: Rephrase PrintLoopPass as a wrapper around a new-style pass. NFC	Justin Bogner	2015-11-04	3	-17/+36
\| \| \| \| \| \| \|	Splits PrintLoopPass into a new-style pass and a PrintLoopPassWrapper, much like we already do for PrintFunctionPass and PrintModulePass. llvm-svn: 252085
*	Add new interfaces to MBB for manipulating successors with probabilities ↵	Cong Hou	2015-11-04	3	-0/+132
\| \| \| \| \| \| \| \| \| \| \|	instead of weights. NFC. This is part-1 of the patch that replaces all edge weights in MBB by probabilities, which only adds new interfaces. No functional changes. Differential revision: http://reviews.llvm.org/D13908 llvm-svn: 252083
*	Warning fix.	Simon Pilgrim	2015-11-04	1	-2/+2
\| \| \| \|	llvm-svn: 252078
*	[IR] Add a `data_operand` abstraction	Sanjoy Das	2015-11-04	4	-8/+129
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Data operands of a call or invoke consist of the call arguments, and the bundle operands associated with the `call` (or `invoke`) instruction. The motivation for this change is that we'd like to be able to query "argument attributes" like `readonly` and `nocapture` for bundle operands naturally. This change also provides a conservative "implementation" for these attributes for any bundle operand, and an extension point for future work. Reviewers: chandlerc, majnemer, reames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14305 llvm-svn: 252077
*	llvm-config: Add --has-rtti option	Tom Stellard	2015-11-04	5	-0/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This prints NO if LLVM was built with -fno-rtti or an equivalent flag and YES otherwise. The reasons to add -has-rtti rather than adding -fno-rtti to --cxxflags are: 1. Building LLVM with -fno-rtti does not always mean that client applications need this flag. 2. Some compilers have a different flag for disabling rtti, and the compiler being used to build LLVM may not be the compiler being used to build the application. Reviewers: echristo, chandlerc, beanz Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11849 llvm-svn: 252075
*	[X86][SSE] Add general memory folding for (V)INSERTPS instruction	Simon Pilgrim	2015-11-04	8	-71/+141
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch improves the memory folding of the inserted float element for the (V)INSERTPS instruction. The existing implementation occurs in the DAGCombiner and relies on the narrowing of a whole vector load into a scalar load (and then converted into a vector) to (hopefully) allow folding to occur later on. Not only has this proven problematic for debug builds, it also prevents other memory folds (notably stack reloads) from happening. This patch removes the old implementation and moves the folding code to the X86 foldMemoryOperand handler. A new private 'special case' function - foldMemoryOperandCustom - has been added to deal with memory folding of instructions that can't just use the lookup tables - (V)INSERTPS is the first of several that could be done. It also tweaks the memory operand folding code with an additional pointer offset that allows existing memory addresses to be modified, in this case to convert the vector address to the explicit address of the scalar element that will be inserted. Unlike the previous implementation we now set the insertion source index to zero, although this is ignored for the (V)INSERTPSrm version, anything that relied on shuffle decodes (such as unfolding of insertps loads) was incorrectly calculating the source address - I've added a test for this at insertps-unfold-load-bug.ll Differential Revision: http://reviews.llvm.org/D13988 llvm-svn: 252074
*	[IR] Add bounds checking to paramHasAttr	Sanjoy Das	2015-11-04	2	-4/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This is intended to make a later change simpler. Note: adding this bounds checking required fixing `X86FastISel`. As far I can tell I've preserved original behavior but a careful review will be appreciated. Reviewers: reames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14304 llvm-svn: 252073
*	Orc: Streamline some lambda usage in a unit test	David Blaikie	2015-11-04	1	-9/+5
\| \| \| \|	llvm-svn: 252070
*	Relax the check for ninja.	Rafael Espindola	2015-11-04	1	-2/+2
\| \| \| \| \| \|	On fedora the ninja executable is called ninja-build :-( llvm-svn: 252062
*	Created new X86 FMA3 opcodes (FMA*_Int) that are used now for lowering of ↵	Andrew Kaylor	2015-11-04	4	-286/+1057
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	scalar FMA intrinsics. Patch by Slava Klochkov The key difference between FMA* and FMA_Int opcodes is that FMA_Int opcodes are handled more conservatively. It is illegal to commute the 1st operand of FMA*_Int instructions as the upper bits of scalar FMA intrinsic result must be taken from the 1st operand, but such commute transformation would change those upper bits and invalidate the intrinsic's result. Reviewers: Quentin Colombet, Elena Demikhovsky Differential Revision: http://reviews.llvm.org/D13710 llvm-svn: 252060
*	[ARM] Combine CMOV into BFI where possible	James Molloy	2015-11-04	3	-0/+129
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	If we have a CMOV, OR and AND combination such as: if (x & CN) y \|= CM; And: * CN is a single bit; * All bits covered by CM are known zero in y; Then we can convert this to a sequence of BFI instructions. This will always be a win if CM is a single bit, will always be no worse than the TST & OR sequence if CM is two bits, and for thumb will be no worse if CM is three bits (due to the extra IT instruction). llvm-svn: 252057
*	[ThinLTO] Always set linkage type to external when converting alias	Teresa Johnson	2015-11-04	2	-2/+15
\| \| \| \| \| \| \| \|	When converting an alias to a non-alias when the aliasee is not imported, ensure that the linkage type is set to external so that it is a valid linkage type. Added a test case that exposed this issue. llvm-svn: 252054
*	[SimplifyCFG] Merge conditional stores	James Molloy	2015-11-04	3	-4/+554
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We can often end up with conditional stores that cannot be speculated. They can come from fairly simple, idiomatic code: if (c & flag1) a = x; if (c & flag2) a = y; ... There is no dominating or post-dominating store to a, so it is not legal to move the store unconditionally to the end of the sequence and cache the intermediate result in a register, as we would like to. It is, however, legal to merge the stores together and do the store once: tmp = undef; if (c & flag1) tmp = x; if (c & flag2) tmp = y; if (c & flag1 \|\| c & flag2) *a = tmp; The real power in this optimization is that it allows arbitrary length ladders such as these to be completely and trivially if-converted. The typical code I'd expect this to trigger on often uses binary-AND with constants as the condition (as in the above example), which means the ending condition can simply be truncated into a single binary-AND too: 'if (c & (flag1\|flag2))'. As in the general case there are bitwise operators here, the ladder can often be optimized further too. This optimization involves potentially increasing register pressure. Even in the simplest case, the lifetime of the first predicate is extended. This can be elided in some cases such as using binary-AND on constants, but not in the general case. Threading 'tmp' through all branches can also increase register pressure. The optimization as in this patch is enabled by default but kept in a very conservative mode. It will only optimize if it thinks the resultant code should be if-convertable, and additionally if it can thread 'tmp' through at least one existing PHI, so it will only ever in the worst case create one more PHI and extend the lifetime of a predicate. This doesn't trigger much in LNT, unfortunately, but it does trigger in a big way in a third party test suite. llvm-svn: 252051