bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	Test commit access.	Nicola Zaghen	2018-05-14	1	-1/+1
\| \| \| \| \| \|	Remove trailing whitespace. llvm-svn: 332220
*	[LLVM-C] Add Bindings For Module Flags	Robert Widmann	2018-05-14	1	-0/+104
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The first foray into merging debug info into the echo tests. - Add bindings to Module::getModuleFlagsMetadata() in the form of LLVMCopyModuleFlagsMetadata - Add the opaque type LLVMModuleFlagEntry to represent Module::ModuleFlagEntry - Add accessors for LLVMModuleFlagEntry's behavior, key, and metadata node. Reviewers: whitequark, deadalnix Reviewed By: whitequark Subscribers: aprantl, JDevlieghere, llvm-commits, harlanhaskins Differential Revision: https://reviews.llvm.org/D46792 llvm-svn: 332219
*	Correct compatibility with the GNU Assembler's handling of comparison ops	Bill Wendling	2018-05-14	1	-2/+20
\| \| \| \| \| \| \| \|	GAS returns -1 for a comparison operator if the result is true and 0 if false. https://www.sourceware.org/binutils/docs-2.12/as.info/Infix-Ops.html#Infix%20Ops llvm-svn: 332215
*	[X86] Cleanup a multiclass that doesn't need as many parameters after recent ↵	Craig Topper	2018-05-14	1	-25/+12
\| \| \| \| \| \|	intrinsic removals. llvm-svn: 332207
*	[X86] Remove and autoupgrade the cvtusi2sd intrinsic. Use ↵	Craig Topper	2018-05-14	3	-8/+5
\| \| \| \| \| \|	uitofp+insertelement instead. llvm-svn: 332206
*	[X86] Add patterns for combining movss+uint_to_fp into the intrinsic ↵	Craig Topper	2018-05-13	1	-0/+40
\| \| \| \| \| \| \| \|	instructions under AVX512. This matches what we do for sint_to_fp. llvm-svn: 332205
*	[X86] Extend instcombine folds for pclmuldq intrinsics to the 256 and 512 ↵	Craig Topper	2018-05-13	1	-9/+12
\| \| \| \| \| \|	bit version. llvm-svn: 332202
*	[X86] Remove and autoupgrade masked vpermd/vpermps intrinsics.	Craig Topper	2018-05-13	3	-13/+13
\| \| \| \|	llvm-svn: 332198
*	AMDGPU: Rename OpenCL lowering pass to be R600 specific.	Matt Arsenault	2018-05-13	4	-10/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This pass is a) broken. b) r600 specific. Fixing (a) is a bit more non-trivial, but fixing (b) is easy. Move this pass to being R600 only for now. This pass does pass all the unit tests, however clang no longer generates code that looks like the unit test input, so fixing the pass requires fixing the tests and the pass as one, and checking it works with clang still. Patch by Dave Airlie llvm-svn: 332196
*	AMDGPU: Make undef legal for v2i16/v2f16	Matt Arsenault	2018-05-13	1	-0/+3
\| \| \| \| \| \| \|	This is apparently necessary to stop undef from being turned into a build_vector of 0s. llvm-svn: 332195
*	Fixing build bot error: adding const qualifiers to std::sort lambda.	Puyan Lotfi	2018-05-13	1	-1/+1
\| \| \| \| \| \|	Errors were not reproducible on clang-6.0 on ubuntu 16.04. llvm-svn: 332192
*	[NFC] MIR-Canon: switching to a stable string sorting of instructions.	Puyan Lotfi	2018-05-13	1	-2/+8
\| \| \| \|	llvm-svn: 332191
*	[X86] Add some load folding patterns for cvtsi2ss/sd into intrinsic ↵	Craig Topper	2018-05-13	2	-0/+60
\| \| \| \| \| \|	instructions. llvm-svn: 332189
*	[X86] Remove an autoupgrade legacy cvtss2sd intrinsics.	Craig Topper	2018-05-13	3	-14/+12
\| \| \| \|	llvm-svn: 332187
*	[X86] Remove and autoupgrade cvtsi2ss/cvtsi2sd intrinsics to match what ↵	Craig Topper	2018-05-12	4	-29/+24
\| \| \| \| \| \|	clang has used for a very long time. llvm-svn: 332186
*	[x86] Remove a comment obviated by r330269. Should have deleted the	Chandler Carruth	2018-05-12	1	-5/+0
\| \| \| \| \| \| \| \|	comment in the same revision but missed it. Thanks to Dimitry Andric for catching this! llvm-svn: 332177
*	Clear converters map after X86 Domain Reassignment to avoid crashes	Dimitry Andric	2018-05-12	1	-2/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: As reported in PR37264, in some cases the X86 Domain Reassignment `runOnMachineFunction()` is called twice. Because it only deletes the `.second` members of its `InstrConverterBaseMap`, and does not clean up the map itself, this can lead to double frees and crashes. Use `DeleteContainerSeconds()` instead, so the `Converters` map can safely be reinitialized and its members re-deleted for each X86 Domain Reassignment pass. Reviewers: guyblank, craig.topper Reviewed By: craig.topper Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D46425 llvm-svn: 332176
*	[NFC] Remove inaccurate comment	JF Bastien	2018-05-12	1	-3/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: r271558 moved getManagedStaticMutex's mutex from a function-local static to using call_once, but left a comment added in r211424. That comment is now erroneous, remove it. Reviewers: zturner, chandlerc Subscribers: aheejin, llvm-commits Differential Revision: https://reviews.llvm.org/D46784 llvm-svn: 332175
*	[X86] Add WriteFCMOV scheduler class for x87 CMOVs	Simon Pilgrim	2018-05-12	11	-16/+15
\| \| \| \|	llvm-svn: 332173
*	[mips] Initialize the long branch pass for testing purposes	Simon Dardis	2018-05-12	3	-2/+8
\| \| \| \|	llvm-svn: 332172
*	[X86] Remove some unused masked conversion intrinsics that can be replaced ↵	Craig Topper	2018-05-12	2	-21/+33
\| \| \| \| \| \| \| \|	with an older intrinsic and a select. This is what clang already uses. llvm-svn: 332170
*	Reapply "[PR16756] Use SSAUpdaterBulk in JumpThreading."	Michael Zolotukhin	2018-05-12	1	-26/+30
\| \| \| \| \| \| \| \| \|	Stage3/stage4 bootstrap miscompares should be fixed by a non-determinism fix in IDF (r332167). This reverts commit r330446. llvm-svn: 332168
*	[IDF] Enforce the returned blocks to be sorted.	Michael Zolotukhin	2018-05-12	1	-5/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Currently the order of blocks returned by `IDF::calculate` can be non-deterministic. This was discovered in several attempts to enable SSAUpdaterBulk for JumpThreading (which led to miscompare in bootstrap between stage 3 and stage4). Originally, the blocks were put into a priority queue with a depth level as their key, and this patch adds a DFSIn number as a second key to specify a deterministic order across blocks from one level. The solution was suggested by Daniel Berlin. Reviewers: dberlin, davide Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D46646 llvm-svn: 332167
*	[AMDGPU] Fix amdgpu-waves-per-eu accounting in scheduler	Stanislav Mekhanoshin	2018-05-12	2	-3/+7
\| \| \| \| \| \| \| \| \| \|	We cannot query this attribute from a subtarget given a machine function. At this point attribute itself is already unavailable and can only be obtained through MFI. Differential Revision: https://reviews.llvm.org/D46781 llvm-svn: 332166
*	[DAG] add convenience function to propagate FMF; NFC	Sanjay Patel	2018-05-11	1	-7/+1
\| \| \| \| \| \| \| \| \|	There's only one use of this currently, but that could change with D46563. Either way, we shouldn't have to update code outside of the flags struct when those flag definitions change. llvm-svn: 332155
*	AMDGPU/GlobalISel: Implement select() for >32-bit G_STORE	Tom Stellard	2018-05-11	2	-1/+28
\| \| \| \| \| \| \| \| \| \|	Reviewers: arsenm, nhaehnle Subscribers: kzhuravl, wdng, yaxunl, rovka, kristof.beyls, dstuttard, tpr, llvm-commits, t-tye Differential Revision: https://reviews.llvm.org/D46153 llvm-svn: 332154
*	[CodeExtractor] Allow extracting blocks with exception handling	Sergey Dmitriev	2018-05-11	1	-27/+91
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is a CodeExtractor improvement which adds support for extracting blocks which have exception handling constructs if that is legal to do. CodeExtractor performs validation checks to ensure that extraction is legal when it finds invoke instructions or EH pads (landingpad, catchswitch, or cleanuppad) in blocks to be extracted. I have also added an option to allow extraction of blocks with alloca instructions, but no validation is done for allocas. CodeExtractor caller has to validate it himself before allowing alloca instructions to be extracted. By default allocas are still not allowed in extraction blocks. Differential Revision: https://reviews.llvm.org/D45904 llvm-svn: 332151
*	[DAG] clean up flag propagation for binops; NFCI	Sanjay Patel	2018-05-11	2	-36/+22
\| \| \| \|	llvm-svn: 332150
*	AMDGPU/SI: Don't promote alloca to vector for AddrSpaceCast instruction.	Changpeng Fang	2018-05-11	1	-1/+0
\| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: We have no logic to promote alloca to vector for an AddrSpaceCast instruction. Reviewer: arsenm Differential Revision: https://reviews.llvm.org/D45993 llvm-svn: 332147
*	[X86] Remove and autoupgrade a bunch of FMA instrinsics that are no longer ↵	Craig Topper	2018-05-11	4	-34/+83
\| \| \| \| \| \|	used by clang. llvm-svn: 332146
*	[Split GEP] handle trunc() in separate-const-offset-from-gep pass.	Artem Belevich	2018-05-11	1	-2/+7
\| \| \| \| \| \| \| \| \| \| \|	Let separate-const-offset-from-gep pass handle trunc() when it calculates constant offset relative to base. The pass itself may insert trunc() instructions when it canonicalises array indices to pointer-size integers and needs to handle trunc() in order to evaluate the offset. Differential Revision: https://reviews.llvm.org/D46732 llvm-svn: 332142
*	[AMDGPU] Fix compilation failure when IR contains comdat	Yaxun Liu	2018-05-11	1	-2/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Remove a useless SwitchSection which also causes compilation failure when IR contains comdat. The SwitchSection is useless because the current section is already correct text section for the function therefore no need to switch. It causes compilation failure for comdat because functions with comdat has specific text section, not the default .text section. Since HIP uses comdat, this bug caused failures for HIP. Differential Revision: https://reviews.llvm.org/D46770 llvm-svn: 332137
*	[DAG] reduce code duplication; NFCI	Sanjay Patel	2018-05-11	1	-8/+7
\| \| \| \|	llvm-svn: 332133
*	[InstCombine] Handle atomic memset in the same way as regular memset	Daniel Neilson	2018-05-11	2	-4/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This change adds handling of the atomic memset intrinsic to the code path that simplifies the regular memset. In practice this means that we will now also expand a small constant-length atomic memset into a single unordered atomic store. Reviewers: apilipenko, skatkov, mkazantsev, anna, reames Reviewed By: reames Subscribers: reames, llvm-commits Differential Revision: https://reviews.llvm.org/D46660 llvm-svn: 332132
*	[DAGCombiner] Set the right SDLoc on extended SETCC uses (7/N)	Vedant Kumar	2018-05-11	1	-19/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	ExtendSetCCUses updates SETCC nodes which use a load (OriginalLoad) to reflect a simplification to the load (ExtLoad). Based on my reading, ExtendSetCCUses may create new nodes to extend a constant attached to a SETCC. It also creates fresh SETCC nodes which refer to any updated operands. ISTM that the location applied to the new constant and SETCC nodes should be the same as the location of the ExtLoad. This was suggested by Adrian in https://reviews.llvm.org/D45995. Part of: llvm.org/PR37262 Differential Revision: https://reviews.llvm.org/D46216 llvm-svn: 332119
*	[DAGCombiner] Set the right SDLoc on a newly-created sextload (6/N)	Vedant Kumar	2018-05-11	1	-38/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This teaches tryToFoldExtOfLoad to set the right location on a newly-created extload. With that in place, the logic for performing a certain ([s\|z]ext (load ...)) combine becomes identical for sexts and zexts, and we can get rid of one copy of the logic. The test case churn is due to dependencies on IROrders inherited from the wrong SDLoc. Part of: llvm.org/PR37262 Differential Revision: https://reviews.llvm.org/D46158 llvm-svn: 332118
*	[DAGCombiner] Factor out duplicated logic for an extload combine, NFC (5/N)	Vedant Kumar	2018-05-11	1	-38/+52
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Part of the logic for combining (zext (load ...)) and (sext (load ...)) is duplicated. This creates problems because bugs in one version have to be fixed again in the other version. To address this, as a first step, I've extracted the duplicate logic into a helper. I'll fix the debug location bug in the helper and eliminate the copy of its logic in a followup. Part of: llvm.org/PR37262 Differential Revision: https://reviews.llvm.org/D46157 llvm-svn: 332117
*	[InstCombine] snprintf optimizations	David Bolvansky	2018-05-11	1	-0/+90
\| \| \| \| \| \| \| \| \| \| \| \|	Reviewers: spatel, efriedma, majnemer, rja, bkramer Reviewed By: rja, bkramer Subscribers: mstorsjo, rja, llvm-commits Differential Revision: https://reviews.llvm.org/D46285 llvm-svn: 332110
*	[X86][BtVer2] Model ymm move as double pumped instructions	Simon Pilgrim	2018-05-11	1	-7/+7
\| \| \| \| \| \|	We still need to handle mmx/xmm moves as 'decode-only' no-pipe instructions llvm-svn: 332109
*	[RISCV] Support .option rvc and norvc assembler directives	Alex Bradbury	2018-05-11	6	-2/+123
\| \| \| \| \| \| \| \| \| \|	These directives allow the 'C' (compressed) extension to be enabled/disabled within a single file. Differential Revision: https://reviews.llvm.org/D45864 Patch by Kito Cheng llvm-svn: 332107
*	[Analysis] Validate the return type of s(n)printf like libcalls	Martin Storsjo	2018-05-11	1	-2/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	If the sprintf function is static (as on mingw-w64, where many stdio functions are static inline wrappers), earlier optimization passes could optimize out the return value altogether, and make it void, which could break optimizations of this libcall that touch the return value. This fixes the issue discussed in PR37408 for the sprintf function. Differential Revision: https://reviews.llvm.org/D46752 llvm-svn: 332106
*	[X86][MMX] Tag MMX Move/Load/Store as WriteVec schedule classes	Simon Pilgrim	2018-05-11	5	-17/+5
\| \| \| \| \| \|	Fixes an issue on SLM/Btver2 where we had instructions were being treated as scalar loads/stores llvm-svn: 332104
*	[AArch64] Fix performPostLD1Combine to check for constant lane index.	Geoff Berry	2018-05-11	1	-1/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: performPostLD1Combine in AArch64ISelLowering looks for vector insert_vector_elt of a loaded value which it can optimize into a single LD1LANE instruction. The code checking for the pattern was not checking if the lane index was a constant which could cause two problems: - an assert when lowering the LD1LANE ISD node since it assumes an constant operand - an assert in isel if the lane index value depends on the post-incremented base register Both of these issues are avoided by simply checking that the lane index is a constant. Fixes bug 35822. Reviewers: t.p.northover, javed.absar Subscribers: rengolin, kristof.beyls, mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D46591 llvm-svn: 332103
*	[mips] Rename Filler to MipsDelaySlotFiller and initialize the pass	Simon Dardis	2018-05-11	3	-26/+44
\| \| \| \|	llvm-svn: 332102
*	Use iteration instead of recursion in CFIInserter	Sanjoy Das	2018-05-11	1	-15/+26
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: This recursive step can overflow the stack. Reviewers: djokov, petarj Subscribers: mcrosier, jlebar, bixia, llvm-commits Differential Revision: https://reviews.llvm.org/D46671 llvm-svn: 332101
*	[Reassociate] Prevent infinite loops when processing PHIs.	Davide Italiano	2018-05-11	1	-1/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Phi nodes can reside in live blocks but one of their incoming arguments can come from a dead block. Dead blocks and reassociate don't play nice together. In fact, reassociate performs an RPO as a first step to avoid processing dead blocks. The reason why Reassociate might not fixpoint when examining dead blocks is that the following: %xor0 = xor i16 %xor1, undef %xor1 = xor i16 %xor0, undef is perfectly valid LLVM IR (if it appears in a dead block), so the worklist algorithm keeps pushing the two instructions for reexamination. Note that this is not Reassociate fault, at least not entirely. It's llvm that has a weird definition of dominance. Fixes PR37390. llvm-svn: 332100
*	Remove unused SyncExecutor and make it clearer that the whole file is only ↵	Nico Weber	2018-05-11	1	-14/+5
\| \| \| \| \| \|	used if LLVM_ENABLE_THREADS llvm-svn: 332098
*	[mips] Enable disassembly of fused (negative) multiply add/sub instructions	Simon Dardis	2018-05-11	2	-37/+45
\| \| \| \| \| \| \| \|	Reviewers: atanasyan, smaksimovic, abeserminji Differential Revision: https://reviews.llvm.org/D46392 llvm-svn: 332097
*	[X86][SLM] Vector stores only use the MEC port.	Simon Pilgrim	2018-05-11	1	-10/+10
\| \| \| \| \| \| \| \|	Confirmed by both Agner and Intel's AOM - the IEC/FPC are not required for pure load/stores (even if its a partial update). Can't fix WriteStore until all RMW instructions are cleaned up though.... llvm-svn: 332096
*	[X86] Split WriteF/WriteVec Move/Load/Store scheduler classes by vector width	Simon Pilgrim	2018-05-11	10	-80/+137
\| \| \| \| \| \|	Fixes a SNB issue that was missing vlddqu/vmovntdqa ymm instructions llvm-svn: 332094