bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[mlir] Update mlir/CMakeLists.txt to install *.def files	Kern Handa	2020-01-06	1	-0/+1
\| \| \| \| \| \| \|	This is needed to consume mlir after it has been installed of the source tree. Without this, consuming mlir results a build error. Differential Revision: https://reviews.llvm.org/D72232
*	Add ExternalAAWrapperPass to createLegacyPMAAResults.	Neil Henning	2020-01-06	1	-0/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Our out-of-tree custom aliasing solution for the HPC# Burst compiler here at Unity makes use of the `ExternalAAwrapperPass` infrastructure to insert our custom aliasing resolution into the core of LLVM. This is great for all cases except for function inlining, where because `createLegacyPMAAResults` does not make use of `ExternalAAWrapperPass`, when we have a definite no-alias result within a function it won't be propagated to the calling function during inlining. This commit just rectifies this oversight by adding the missing dependency. Differential Revision: https://reviews.llvm.org/D71348
*	[APFloat] Add recoverable string parsing errors to APFloat	Ehud Katz	2020-01-06	8	-196/+256
\| \| \| \| \| \|	Implementing the APFloat part in PR4745. Differential Revision: https://reviews.llvm.org/D69770
*	[Metadata] Add TBAA struct metadata to `AAMDNode`	Anton Afanasyev	2020-01-06	4	-21/+27
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Make `AAMDNodes`' `getAAMetadata()` and `setAAMetadata()` to take `!tbaa.struct` into account as well as `!tbaa`. This impacts llvm.org/pr42022. This is a temprorary fix needed to keep `!tbaa.struct` tag by SROA pass. New field `TBAAStruct` should be deleted when `!tbaa` tag replaces `!tbaa.struct`. Merging two `!tbaa.struct`'s to one is conservatively considered to be `nullptr` (giving `MayAlias`) -- this could be enhanced, but relying on the said future replacement. Reviewers: RKSimon, spatel, vporpo Subscribers: hiraditya, kosarev, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D70924
*	[Clang] Force rtlib=platform in test to avoid fails with CLANG_DEFAULT_RTLIB	Kristina Brooks	2020-01-06	1	-0/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Driver test `cross-linux.c` fails when CLANG_DEFAULT_RTLIB is "compiler-rt" as the it expects a GCC-style `"crtbegin.o"` after `"crti.o"` but instead receives something akin to this in the frontend invocation: ``` "crt1.o" "crti.o" "/o/b/llvm/bin/../lib/clang/10.0.0/lib/linux/clang_rt.crtbegin-x86_64.o" ``` This patch adds an override to `cross-linux.c` tests so the expected result is produced regardless of the compile-time default rtlib, as having tests fail due to that is fairly confusing. After applying the patch, the test passes regardless of the CLANG_DEFAULT_RTLIB setting. Differential Revision: https://reviews.llvm.org/D72236
*	[TargetLowering] Use SETCC input type to call getBooleanContents instead of ↵	Craig Topper	2020-01-05	1	-1/+1
\| \| \| \| \| \| \| \| \|	the setcc result type. This isn't a functonal change since we also check the bit width is the same and the input type is integer. This guarantees the input and output type are the same. But passing the input type makes the code more readable.
*	[mlir][spirv] Update SPIR-V documentation with information about	MaheshRavishankar	2020-01-05	1	-6/+113
\| \| \| \| \| \| \| \| \|	lowering to SPIR-V dialect. Add information about - SPIRVTypeConverter - SPIRVOpLowering - Utility functions used in lowering to SPIR-V dialect.
*	[MC] Reorder members of MCFragment's subclasses to decrease padding	Fangrui Song	2020-01-05	1	-54/+14
\| \| \| \| \| \| \| \| \|	On a 64-bit platform: sizeof(MCBoundaryAlignFragment): 64 -> 56 sizeof(MCOrgFragment): 72 -> 64 sizeof(MCFillFragment): 80 -> 72 sizeof(MCLEBFragment): 88 -> 80
*	[MC] Reorder MCFragment members to decrease padding	Fangrui Song	2020-01-05	2	-17/+8
\| \| \| \| \| \| \|	sizeof(MCFragment) does not change, but some if its subclasses do, e.g. on a 64-bit platform, sizeof(MCEncodedFragment) decreases from 64 to 56, sizeof(MCDataFragment) decreases from 224 to 216.
*	[DAGCombine] Don't check the legality of type when combine the SIGN_EXTEND_INREG	QingShan Zhang	2020-01-06	2	-6/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is the DAG node for SIGN_EXTEND_INREG : t21: v4i32 = sign_extend_inreg t18, ValueType:ch:v4i16 It has two operands. The first one is the value it want to extend, and the second one is the type to specify how to extend the value. For this example, it means that, it is signed extend the t18(v4i32) from v4i16 to v4i32. That is the semantics of c code: vector int foo(vector int m) { return m << 16 >> 16; } And it could be any vector type that hardware support the operation, though the type 'v4i16' is NOT legal for the target. When we are trying to combine the srl + sra, what we did now is calling the TLI.isOperationLegal(), which will also check the legality of the type. That doesn't make sense. Differential Revision: https://reviews.llvm.org/D70230
*	[MC] Delete MCFragment::isDummy. NFC	Fangrui Song	2020-01-05	2	-4/+1
\| \| \| \| \|	isa<...>, dyn_cast<...> and cast<...> are used by other fragments. Don't make MCDummyFragment special.
*	[X86] Improve v2i64->v2f32 and v4i64->v4f32 uint_to_fp on avx and avx2 targets.	Craig Topper	2020-01-05	5	-638/+514
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Based on Simon's D52965, but improved to handle strict fp and improve some of the shuffling. Rather than use v2i1/v4i1 and let type legalization continue, just generate all the code with legal types and use an explicit shuffle. I also added an explicit setcc to the v4i64 code to match the semantics of vselect which doesn't just use the sign bit. I'm also using a v4i64->v4i32 truncate instead of the shuffle in Simon's original code. With the setcc this will become a pack. Future work can look into using X86ISD::BLENDV and a different shuffle that only moves the sign bit. Reviewers: RKSimon, spatel Reviewed By: RKSimon Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D71956
*	[NFC] Modify the format:	Liu, Chen3	2020-01-06	1	-2/+1
\| \| \| \|	Drop the else since we alerady returned in the if.
*	[Coroutines] Remove corresponding phi values when apply ↵	Brian Gesiak	2020-01-05	3	-12/+133
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	simplifyTerminatorLeadingToRet Summary: In addMustTailToCoroResumes, we set musttail on those resume instructions that are followed by a ret instruction. This is done by simplifyTerminatorLeadingToRet which replace a sequence of branches leading to a ret with a clone of the ret. However it forgets to remove corresponding PHI values that come from basic block of replaced branch, and may cause jumpthreading pass hangs (https://bugs.llvm.org/show_bug.cgi?id=43720) This patch fix this issue Test Plan: cppcoro library with O3+flto check-llvm Reviewers: modocache, GorNishanov, lewissbaker Reviewed By: modocache Subscribers: mehdi_amini, EricWF, hiraditya, dexonsmith, jfb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D71826 Patch by junparser (JunMa)!
*	Clang-format previous commit	Stephen Kelly	2020-01-05	1	-3/+1
\|
*	[MC][ARM] Delete MCSection::HasData and move SHF_ARM_PURECODE logic to ↵	Fangrui Song	2020-01-05	4	-17/+6
\| \| \| \| \| \| \| \|	ARMELFObjectWriter::addTargetSectionFlags This simplifies the generic interface and also makes SHF_ARM_PURECODE more robust (fixes a TODO). Inspecting MCDataFragment contents covers more cases than MCObjectStreamer::EmitBytes.
*	Add missing test	Stephen Kelly	2020-01-05	1	-0/+12
\|
*	[Gnu toolchain] Look at standard GCC paths for libstdcxx by default	Kristina Brooks	2020-01-05	6	-86/+121
\| \| \| \| \| \| \| \| \| \| \|	Linux' current addLibCxxIncludePaths and addLibStdCxxIncludePaths are actually almost non-Linux-specific at all, and can be reused almost as such for all gcc toolchains. Only keep Android/Freescale/Cray hacks in Linux's version. Patch by sthibaul (Samuel Thibault) Differential Revision: https://reviews.llvm.org/D69758
*	[MC] Delete MCSection::{rbegin,rend}	Fangrui Song	2020-01-05	2	-8/+2
\|
*	Allow using traverse() with bindings	Stephen Kelly	2020-01-05	3	-0/+38
\|
*	Fix oversight in AST traversal helper	Stephen Kelly	2020-01-05	1	-1/+1
\|
*	[MC] Merge MCSymbol::getSectionPtr into getSection and simplify	Fangrui Song	2020-01-05	1	-9/+1
\|
*	[MC] Drop an unused rule about absolute temporary symbols	Fangrui Song	2020-01-05	1	-4/+0
\|
*	[X86][SSE] Combine combineLogicBlendIntoConditionalNegate for VSELECT nodes ↵	Simon Pilgrim	2020-01-05	3	-139/+117
\| \| \| \| \| \| \| \|	(PR43660) Attempt to use combineLogicBlendIntoConditionalNegate for (select M, (sub 0, X), X) -> (sub (xor X, M), M) We limit this to cases that can't easily replace the VSELECT with a shuffle (non-constant masks) or where a BLENDV is likely to occur (which tends to result in slower codegen).
*	[X86] Move combineLogicBlendIntoConditionalNegate before combineSelect. NFCI.	Simon Pilgrim	2020-01-05	1	-62/+62
\| \| \| \|	Updates function order in preparation of future fix for PR43660
*	[X86] Merge (identical) LowerGC_TRANSITION_START and LowerGC_TRANSITION_END ↵	Simon Pilgrim	2020-01-05	2	-27/+4
\| \| \| \| \| \|	(NFC) Silences a copy+paste analyzer warning - all they are doing are inserting NOOPs in exactly the same way.
*	[ARM] Use isFMAFasterThanFMulAndFAdd for scalars as well as MVE vectors	David Green	2020-01-05	10	-25/+52
\| \| \| \| \| \| \| \| \| \| \|	This adds extra scalar handling to isFMAFasterThanFMulAndFAdd, allowing the target independent code to handle more folds in more situations (for example if the fast math flags are present, but the global AllowFPOpFusion option isnt). It also splits apart the HasSlowFPVMLx into HasSlowFPVFMx, to allow VFMA and VMLA to be controlled separately if needed. Differential Revision: https://reviews.llvm.org/D72139
*	[ARM] Fill in FP16 FMA patterns	David Green	2020-01-05	2	-64/+65
\| \| \| \| \| \|	This adds fp16 variants of all the fma patterns in the ARM backend. Differential Revision: https://reviews.llvm.org/D72138
*	[ARM] Add and update FMA tests. NFC	David Green	2020-01-05	3	-31/+480
\|
*	[ParserTest] Move raw string literal out of macro	David Green	2020-01-05	1	-3/+3
\| \| \| \| \|	Some combinations of gcc and ccache do not deal well with raw strings in macros. Moving the string out to attempt to fix the bots.
*	[LegalizeVectorOps][X86] Enable expansion of vector fp_to_uint in ↵	Craig Topper	2020-01-04	4	-258/+113
\| \| \| \| \| \| \| \| \| \| \| \| \|	LegalizeVectorOps to avoid scalarization. The code here isn't great in all caess. Particularly v4f64->v4i32 on 64-bit AVX targets. But there is some improvement in some configurations. There's definitely some issues with computeNumSignBits with X86ISD::STRICT_FCMP. As well as not being able to propagate sign bits through merge_values nodes that get created during custom legalization.
*	[TargetLowering] In expandFP_TO_UINT, add proper extend or truncate for the ↵	Craig Topper	2020-01-04	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \|	condition to feed the DstVT select. Previously, for vectors we created a vselect with a condition that didn't match what the target wanted according to getSetCCResultType. To make up for this, X86 had a special DAG combine to detect if the condition was all sign bits and then insert its own truncate or extend. By adding the extend/truncate here explicitly we can avoid that.
*	[LegalizeVectorOps] Split most of ExpandStrictFPOp into a separate ↵	Craig Topper	2020-01-04	1	-6/+13
\| \| \| \| \| \| \| \| \| \| \|	UnrollStrictFPOp method. Call that method from ExpandUINT_TO_FLOAT. ExpandStrictFPOp calls ExpandUINT_TO_FLOAT. Previously, ExpandUINT_TO_FLOAT returned SDValue() if it wasn't able to handle and needed to unroll. Then ExpandStrictFPOp would detect his SDValue() and do the unroll. After this change, ExpandUINT_TO_FLOAT will directly call UnrollStrictFPOp and return the unrolled result.
*	[ELF] Drop const qualifier to fix -Wrange-loop-analysis. NFC	Fangrui Song	2020-01-04	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	``` lld/ELF/Relocations.cpp:1622:56: warning: loop variable 'ts' of type 'const std::pair<ThunkSection , uint32_t>' (aka 'const pair<lld::elf::ThunkSection , unsigned int>') creates a copy from type 'const std::pair<ThunkSection , uint32_t>' [-Wrange-loop-analysis] for (const std::pair<ThunkSection , uint32_t> ts : isd->thunkSections) ``` Drop const qualifier to fix -Wrange-loop-analysis. We can make -Wrange-loop-analysis warnings (DiagnoseForRangeConstVariableCopies) on `const A` more permissive on more types (e.g. POD -> trivially copyable), unfortunately it will not make std::pair good, because `constexpr pair& operator=(const pair& p);` is unfortunately user-defined. Reviewed By: Mordante Differential Revision: https://reviews.llvm.org/D72211
*	GlobalISel: Scalarize all division operations	Matt Arsenault	2020-01-04	6	-0/+1742
\| \| \| \| \| \|	This only handled G_SDIV, but they all are trivially scalarizable. Also define placeholder AMDGPU division legalizer rules.
*	Revert "[SCEV] Move ScalarEvolutionExpander.cpp to Transforms/Utils (NFC)."	Florian Hahn	2020-01-04	25	-946/+858
\| \| \| \| \|	This reverts commit 51ef53f3bd23559203fe9af82ff2facbfedc1db3, as it breaks some bots.
*	[SCEV] Move ScalarEvolutionExpander.cpp to Transforms/Utils (NFC).	Florian Hahn	2020-01-04	25	-858/+946
\| \| \| \| \| \| \| \| \| \| \| \|	SCEVExpander modifies the underlying function so it is more suitable in Transforms/Utils, rather than Analysis. This allows using other transform utils in SCEVExpander. Reviewers: sanjoy.google, efriedma, reames Reviewed By: sanjoy.google Differential Revision: https://reviews.llvm.org/D71537
*	[SCEV] Remove unused ScalarEvolutionExpander.h includes (NFC).	Florian Hahn	2020-01-04	4	-4/+0
\|
*	GlobalISel: Define G_READCYCLECOUNTER	Matt Arsenault	2020-01-04	6	-0/+27
\|
*	AMDGPU/GlobalISel: Refine SMRD selection rules	Matt Arsenault	2020-01-04	3	-36/+172
\| \| \| \| \|	Fix selecting these for volatile global loads, and ensure the loads are constant enough.
*	AMDGPU/GlobalISel: Legalize more odd sized loads	Matt Arsenault	2020-01-04	5	-307/+43
\| \| \| \| \|	The attempts to widen sufficently aligned, odd sized loads wasn't consistently applied.
*	AMDGPU/GlobalISel: Assume vcc phis for any vcc input	Matt Arsenault	2020-01-04	3	-86/+69
\| \| \| \| \| \| \|	This produces more intelligible looking results, more comparabble to the DAG output in the simplest cases. This is probably wrong in complex control flow, but RegBankSelect doesn't attempt analyzing if this is on a masked path for selecting the bank yet.
*	[Pass Registration] XFAIL load_extension.ll test on macOS.	Florian Hahn	2020-01-04	1	-0/+3
\| \| \| \| \| \| \| \| \| \| \|	This test fails on macOS, causing the following bots to fail http://green.lab.llvm.org/green/job/clang-stage1-cmake-RA-incremental/7438/ http://green.lab.llvm.org/green/job/clang-stage1-RA/5034/ Error: Error opening 'build/./lib/libBye.dylib': dlopen(build/./lib/libBye.dylib, 9): image not found -load request ignored.
*	AMDGPU/GlobalISel: Implement applyMappingImpl less incorrectly	Matt Arsenault	2020-01-04	1	-13/+23
\| \| \| \| \| \| \| \| \| \| \|	We're checking the current register bank of the registers in the instruction, but the mapping may have inserted cross bank copies and is expecting to replace the registers. We mostly get away with this currently, because VGPR->SGPR copies are illegal, and we assume this won't happen. In a future change, we'll start relying on more cross register bank copies being inserted, and this starts to break down.
*	[cmake] Remove install from add_llvm_example_library.	Florian Hahn	2020-01-04	1	-4/+3
\| \| \| \| \|	This should fix http://lab.llvm.org:8011/builders/llvm-clang-lld-x86_64-scei-ps4-windows10pro-fast/builds/30086
*	Re-apply "[Examples] Add IRTransformations directory to examples."	Florian Hahn	2020-01-04	17	-0/+1004
\| \| \| \| \| \|	This reverts commit 19fd8925a4afe6efd248688cce06aceff50efe0c. Should include a fix for PR44197.
*	NFC: Fix trivial typos in comments	Kazuaki Ishizaki	2020-01-04	68	-80/+80
\|
*	[AMDGPU] need to insert wait between the scalar load and vector store to the ↵	alex-t	2020-01-04	2	-0/+50
\| \| \| \| \| \| \| \| \| \|	same address to avoid WAR conflict. Reviewers: rampitec, vpykhtin, nhaehnle Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D71934
*	[NFCI][InstCombine] Refactor 'sink negation into select if that folds one ↵	Roman Lebedev	2020-01-04	1	-40/+35
\| \| \| \| \| \| \| \| \| \|	hand of select to 0' fold I would think it's better than having two practically identical folds next to eachother, but then generalization isn't all that pretty due to the fact that we need to produce different `sub` each time.. This change is no-functional-changes-intended refactoring.
*	[InstCombine] Sink sub into hands of select if one hand becomes zero. Part 2 ↵	Roman Lebedev	2020-01-04	3	-22/+43
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	(PR44426) This decreases use count of %Op0, makes one hand of select to be 0, and possibly exposes further folding potential. Name: sub %Op0, (select %Cond, %Op0, %FalseVal) -> select %Cond, 0, (sub %Op0, %FalseVal) %Op0 = %TrueVal %o = select i1 %Cond, i8 %Op0, i8 %FalseVal %r = sub i8 %Op0, %o => %n = sub i8 %Op0, %FalseVal %r = select i1 %Cond, i8 0, i8 %n Name: sub %Op0, (select %Cond, %TrueVal, %Op0) -> select %Cond, (sub %Op0, %TrueVal), 0 %Op0 = %FalseVal %o = select i1 %Cond, i8 %TrueVal, i8 %Op0 %r = sub i8 %Op0, %o => %n = sub i8 %Op0, %TrueVal %r = select i1 %Cond, i8 %n, i8 0 https://rise4fun.com/Alive/aHRt https://bugs.llvm.org/show_bug.cgi?id=44426