bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[COFF] Fix assembly output of comdat sections without an attached symbol	Martin Storsjo	2018-07-23	1	-10/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Since SVN r335286, the .xdata sections are produced without an attached symbol, which requires using a different syntax when printing assembly output. Instead of the usual syntax of '.section <name>,"dr",discard,<symbol>', use '.section <name>,"dr"' + '.linkonce discard' (which is what GCC uses for all assembly output). This fixes PR38254. Differential Revision: https://reviews.llvm.org/D49651 llvm-svn: 337756
*	[AArch64] Use MCAsmInfoMicrosoft and MCAsmInfoGNUCOFF as base classes	Martin Storsjo	2018-07-23	2	-9/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This matches the structure used on X86 and ARM. This requires a little bit of duplication of the parts that are equal in both AArch64 COFF variants though. Before SVN r335286, these classes didn't add anything that MCAsmInfoCOFF didn't, but now they do. This makes AArch64 match X86 in how comdat is used for float constants for MinGW. Differential Revision: https://reviews.llvm.org/D49637 llvm-svn: 337755
*	[SelectionDAG] Reduce DanglingDebugInfo memory traffic, NFC	Vedant Kumar	2018-07-23	1	-19/+23
\| \| \| \| \| \| \|	This avoids approx. 2 x 10^5 DenseMap insertions in both non-debug and debug -O2 builds of the sqlite3 amalgamation. llvm-svn: 337751
*	[ThinLTO] Ensure the TargetLibraryInfo is constructed early enough	Teresa Johnson	2018-07-23	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Without this change, the WholeProgramDevirt pass, which requires the TargetLibraryInfo, will construct one from the default triple. Fixes PR38139. Reviewers: pcc Subscribers: mehdi_amini, inglorion, steven_wu, dexonsmith, llvm-commits Differential Revision: https://reviews.llvm.org/D49278 llvm-svn: 337750
*	[DebugCounters] Keep track of total counts	George Burgess IV	2018-07-23	2	-11/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch makes debug counters keep track of the total number of times we've called `shouldExecute` for each counter, so it's easier to build automated tooling on top of these. A patch to print these counts is coming soon. Patch by Zhizhou Yang! Differential Revision: https://reviews.llvm.org/D49560 llvm-svn: 337748
*	ConstantFolding: Avoid a crash.	Manoj Gupta	2018-07-23	1	-6/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Check if the parent basic block and caller exists before calling CS.getCaller when constant folding strip.invariant.group instrinsic. This avoids a crash when the function containing the intrinsic is being inlined. The instruction is checked for any simplifiction but has not yet been added to a basic block. Reviewers: Prazek, rsmith, efriedma Reviewed By: efriedma Subscribers: eraman, llvm-commits Differential Revision: https://reviews.llvm.org/D49690 llvm-svn: 337742
*	Re-land r335297 "[X86] Implement more of x86-64 large and medium PIC code ↵	Reid Kleckner	2018-07-23	6	-29/+132
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	models" Don't try to generate large PIC code for non-ELF targets. Neither COFF nor MachO have relocations for large position independent code, and users have been using "large PIC" code models to JIT 64-bit code for a while now. With this change, if they are generating ELF code, their JITed code will truly be PIC, but if they target MachO or COFF, it will contain 64-bit immediates that directly reference external symbols. For a JIT, that's perfectly fine. llvm-svn: 337740
*	Fix RegScavenger::unprocess	David Greene	2018-07-23	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \|	RegScavenger::unprocess walks backward, so it should undo the effects of defs before undoing effects of kills. Previously it did things in the opposite order, leaving a register apparently unused (dead) in the case where an instruction both used (killed) and defined a register. Differential Revision: https://reviews.llvm.org/D42200 llvm-svn: 337735
*	[Hexagon] Handle unnamed globals in HexagonConstExpr	Krzysztof Parzyszek	2018-07-23	1	-3/+15
\| \| \| \| \| \|	Instead of comparing names, compare positions in the parent module. llvm-svn: 337723
*	[Demangle] Attempt to fix arena memory leak	Reid Kleckner	2018-07-23	1	-1/+3
\| \| \| \|	llvm-svn: 337720
*	[ARM] Use unique_ptr to fix memory leak introduced in r337701	Fangrui Song	2018-07-23	1	-11/+9
\| \| \| \|	llvm-svn: 337714
*	OpChain has subclasses, so add a virtual destructor.	Jordan Rupprecht	2018-07-23	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: OpChain has subclasses, so add a virtual destructor. This fixes an issue when deleting subclasses of OpChain (see MatchSMLAD() specifically) in r337701. Reviewers: javed.absar Subscribers: llvm-commits, SjoerdMeijer, samparker Differential Revision: https://reviews.llvm.org/D49681 llvm-svn: 337713
*	[ARM] Follow-up to r337709.	Matt Morehouse	2018-07-23	1	-2/+0
\| \| \| \| \| \|	Fix double-free. llvm-svn: 337711
*	[ARM] Add doFinalization() to ARMCodeGenPrepare pass.	Matt Morehouse	2018-07-23	1	-0/+6
\| \| \| \| \| \| \|	Attempt to fix the leak introduced in r337687 and make sanitizer buildbots green again. llvm-svn: 337709
*	[Legalize] Elide MERGE_VALUES created by scalarizeVectorLoad.	Nirav Dave	2018-07-23	2	-3/+10
\| \| \| \| \| \| \|	scalarizeVectorLoad creates MERGE_VALUES nodes which are immediately decomposed in expandLoad. Elide the node in these cases. llvm-svn: 337708
*	[ARM][NFC] ParallelDSP reorganisation	Sam Parker	2018-07-23	1	-88/+103
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In preparing to allow ARMParallelDSP pass to parallelise more than smlads, I've restructed some elements: - The ParallelMAC struct has been renamed to BinOpChain. - The BinOpChain struct holds two value lists: LHS and RHS, as well as inheriting from the OpChain base class. - The OpChain struct holds all the values of the represented chain and has had the memory locations functionality inserted into it. - ParallelMACList becomes OpChainList and it now holds pointers instead of objects. Differential Revision: https://reviews.llvm.org/D49020 llvm-svn: 337701
*	[SystemZ] Fix dumpSU() method in SystemZHazardRecognizer.	Jonas Paulsson	2018-07-23	1	-1/+5
\| \| \| \| \| \| \| \|	Two minor issues: The new MCD SchedWrite name does not contain "Unit" like all the others, so a check is needed. Also, print "LSU" instead of "LS". Review: Ulrich Weigand llvm-svn: 337700
*	[FPEnv] Legalize double width StrictFP vector operations	Cameron McInally	2018-07-23	2	-0/+70
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D48809 llvm-svn: 337698
*	[ARM] ARMCodeGenPrepare backend pass	Sam Parker	2018-07-23	4	-0/+757
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Arm specific codegen prepare is implemented to perform type promotion on icmp operands, which can enable the removal of uxtb and uxth (unsigned extend) instructions. This is possible because performing type promotion before ISel alleviates this duty from the DAG builder which has to perform legalisation, but has a limited view on data ranges. The pass visits any instruction operand of an icmp and creates a worklist to traverse the use-def tree to determine whether the values can simply be promoted. Our concern is values in the registers overflowing the narrow (i8, i16) data range, so instructions marked with nuw can be promoted easily. For add and sub instructions, we are able to use the parallel dsp instructions to operate on scalar data types and avoid overflowing bits. Underflowing adds and subs are also permitted when the result is only used by an unsigned icmp. Differential Revision: https://reviews.llvm.org/D48832 llvm-svn: 337687
*	[GVN] Don't use the eliminated load as an available value in phi construction	John Brawn	2018-07-23	1	-0/+9
\| \| \| \| \| \| \| \| \| \| \|	In ConstructSSAForLoadSet if an available value is actually the load that we're doing SSA construction to eliminate, then we can omit it as SSAUpdate will add in the value for the phi that will be replacing it anyway. This can result in simpler IR which can allow further optimisation. Differential Revision: https://reviews.llvm.org/D44160 llvm-svn: 337686
*	[MemorySSAUpdater] Update Phi operands after trivial Phi elimination	Alexandros Lamprineas	2018-07-23	1	-15/+13
\| \| \| \| \| \| \| \| \| \| \| \|	Bug fix for PR37445. The underlying problem and its fix are similar to PR37808. The bug lies in MemorySSAUpdater::getPreviousDefRecursive(), where PhiOps is computed before the call to tryRemoveTrivialPhi() and it ends up being out of date, pointing to stale data. We have now turned each of the PhiOps into a TrackingVH<MemoryAccess>. Differential Revision: https://reviews.llvm.org/D49425 llvm-svn: 337680
*	[Support] Add a UniqueStringSaver: like StringSaver, but deduplicating.	Sam McCall	2018-07-23	1	-0/+7
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: Clarify contract of StringSaver (it null-terminates, callers rely on it). Reviewers: hokein Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D49596 llvm-svn: 337677
*	[NFC][MCA] ZnVer1: Update RegisterFile to identify false dependencies on ↵	Roman Lebedev	2018-07-23	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	partially written registers. Summary: Pretty mechanical follow-up for D49196. As microarchitecture.pdf notes, "20 AMD Ryzen pipeline", "20.8 Register renaming and out-of-order schedulers": The integer register file has 168 physical registers of 64 bits each. The floating point register file has 160 registers of 128 bits each. "20.14 Partial register access": The processor always keeps the different parts of an integer register together. ... An instruction that writes to part of a register will therefore have a false dependence on any previous write to the same register or any part of it. Reviewers: andreadb, courbet, RKSimon, craig.topper, GGanesh Reviewed By: GGanesh Subscribers: gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D49393 llvm-svn: 337676
*	[GVNHoist] safeToHoistLdSt allows illegal hoisting	Alexandros Lamprineas	2018-07-23	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \|	Bug fix for PR36787. When reasoning if it's safe to hoist a load we want to make sure that the defining memory access dominates the new insertion point of the hoisted instruction. safeToHoistLdSt calls firstInBB(InsertionPoint,DefiningAccess) which returns false if InsertionPoint == DefiningAccess, and therefore it falsely thinks it's safe to hoist. Differential Revision: https://reviews.llvm.org/D49555 llvm-svn: 337674
*	[x86/SLH] Fix a bug where we would harden tail calls twice -- once as	Chandler Carruth	2018-07-23	1	-1/+5
\| \| \| \| \| \| \| \| \|	a call, and then again as a return. Also added a comment to try and explain better why we would be doing what we're doing when hardening the (non-call) returns. llvm-svn: 337673
*	[x86/SLH] Rename and comment the main hardening function. NFC.	Chandler Carruth	2018-07-23	1	-4/+21
\| \| \| \| \| \| \| \| \| \|	This provides an overview of the algorithm used to harden specific loads. It also brings this our terminology further in line with hardening rather than checking. Differential Revision: https://reviews.llvm.org/D49583 llvm-svn: 337667
*	Test commit, fix a minor typo.	Jiading Gai	2018-07-22	1	-1/+1
\| \| \| \|	llvm-svn: 337657
*	[X86] Remove the max vector width restriction from combineLoopMAddPattern ↵	Craig Topper	2018-07-22	1	-7/+1
\| \| \| \| \| \| \| \|	and rely splitOpsAndApply to handle splitting. This seems to be a net improvement. There's still an issue under avx512f where we have a 512-bit vpaddd, but not vpmaddwd so we end up doing two 256-bit vpmaddwds and inserting the results before a 512-bit vpaddd. It might be better to do two 512-bits paddds with zeros in the upper half. Same number of instructions, but breaks a dependency. llvm-svn: 337656
*	[ORE] Move loop invariant ORE checks outside the PM loop.	Xin Tong	2018-07-22	3	-17/+31
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This takes 22ms out of ~20s compiling sqlite3.c because we call it for every unit of compilation and every pass. Reviewers: paquette, anemet Subscribers: mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D49586 llvm-svn: 337654
*	[SelectionDAGBuilder] Use APInt::isZero instead of comparing ↵	Craig Topper	2018-07-22	1	-1/+1
\| \| \| \| \| \| \| \|	APInt::getZExtValue to 0 in a place where we can't be sure contents of the APInt fit in a uint64_t. This is used on an extract vector element index which is most cases is going to be an i32 or i64 and the element will be a valid element number. But it is possible to construct IR with a larger type and large out of range value. llvm-svn: 337652
*	[SelectionDAGBuilder] Restrict vector reduction check to types with a power ↵	Craig Topper	2018-07-22	1	-0/+4
\| \| \| \| \| \| \| \|	of 2 number of elements. The check for the shuffles usages probably isn't correct for non power of 2 vectors. llvm-svn: 337651
*	[mips] Factor out register class selection for global base register. NFC	Simon Atanasyan	2018-07-21	1	-18/+20
\| \| \| \| \| \| \|	Factor out register class selection for global base register into a separate function to escape long chain of ternary operators. llvm-svn: 337647
*	[mips] Move out the WrapperPat declaration from the NotInMicroMips predicate	Simon Atanasyan	2018-07-21	1	-5/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is a follow-up to the rL335185. Those commit adds some WrapperPat patterns for microMIPS target. But declaration of the WrapperPat class is under the NotInMicroMips predicate and microMIPS patterns cannot be selected because predicate (Subtarget->inMicroMipsMode()) && (!Subtarget->inMicroMipsMode()) is always false. This change move out the WrapperPat class declaration from the NotInMicroMips predicate and enables microMIPS WrapperPat patterns. Differential revision: https://reviews.llvm.org/D49533 llvm-svn: 337646
*	Early exit with cheaper checks	Aditya Kumar	2018-07-21	1	-13/+12
\| \| \| \| \| \| \|	Reviewers: sebpop,davide,fhahn,trentxintong Differential Revision: https://reviews.llvm.org/D49617 llvm-svn: 337643
*	[InstrSimplify] fold sdiv if two operands are negated and non-overflow	Chen Zheng	2018-07-21	2	-9/+17
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D49382 llvm-svn: 337642
*	[ORC] Re-apply r336760 with fixes.	Lang Hames	2018-07-21	3	-4/+38
\| \| \| \|	llvm-svn: 337637
*	Re-apply r337595 with fix for LLVM_ENABLE_THREADS=Off.	Lang Hames	2018-07-20	5	-281/+518
\| \| \| \|	llvm-svn: 337626
*	Change the cap on the amount of padding for each vtable to 32-byte ↵	Peter Collingbourne	2018-07-20	1	-4/+6
\| \| \| \| \| \| \| \| \| \| \| \| \|	(previously it was 128-byte) We tested different cap values with a recent commit of Chromium. Our results show that the 32-byte cap yields the smallest binary and all the caps yield similar performance. Based on the results, we propose to change the cap value to 32-byte. Patch by Zhaomo Yang! Differential Revision: https://reviews.llvm.org/D49405 llvm-svn: 337622
*	AMDGPU: Use existing function to check for VGPRs	Matt Arsenault	2018-07-20	1	-16/+7
\| \| \| \|	llvm-svn: 337621
*	Revert "[X86][AVX] Convert X86ISD::VBROADCAST demanded elts combine to use ↵	Benjamin Kramer	2018-07-20	2	-48/+17
\| \| \| \| \| \| \| \|	SimplifyDemandedVectorElts" This reverts commit r337547. It triggers an infinite loop. llvm-svn: 337617
*	[COFF] Use symbolic constants instead of hardcoded numbers. NFCI.	Martin Storsjo	2018-07-20	1	-1/+6
\| \| \| \| \| \|	Patch by Martell Malone. llvm-svn: 337614
*	[COFF] Adjust how we flag weak externals	Martin Storsjo	2018-07-20	2	-6/+5
\| \| \| \| \| \| \| \| \| \|	This fixes PR36096. Originally based on a patch by Martell Malone. Differential Revision: https://reviews.llvm.org/D44357 llvm-svn: 337613
*	Revert r337595 "[ORC] Add new symbol lookup methods to ExecutionSessionBase ↵	Reid Kleckner	2018-07-20	5	-518/+281
\| \| \| \| \| \| \| \|	in preparation for" Breaks the build with LLVM_ENABLE_THREADS=OFF. llvm-svn: 337608
*	Reapply "[LSV] Refactoring + supporting bitcasts to a type of different size"	Roman Tereshin	2018-07-20	1	-46/+65
\| \| \| \| \| \| \| \|	This reapplies commit r337489 reverted by r337541 Additionally, this commit contains a speculative fix to the issue reported in r337541 (the report does not contain an actionable reproducer, just a stack trace) llvm-svn: 337606
*	Remove a superfluous semicolon	Martin Storsjo	2018-07-20	1	-1/+1
\| \| \| \|	llvm-svn: 337599
*	[Demangler] Correctly factor in assignment when allocating.	Zachary Turner	2018-07-20	1	-24/+29
\| \| \| \| \| \| \| \| \|	Incidentally all allocations that we currently perform were properly aligned, but this was only an accident. Thanks to Erik Pilkington for catching this. llvm-svn: 337596
*	[ORC] Add new symbol lookup methods to ExecutionSessionBase in preparation for	Lang Hames	2018-07-20	5	-281/+518
\| \| \| \| \| \| \| \| \| \| \| \|	deprecating SymbolResolver and AsynchronousSymbolQuery. Both lookup overloads take a VSO search order to perform the lookup. The first overload is non-blocking and takes OnResolved and OnReady callbacks. The second is blocking, takes a boolean flag to indicate whether to wait until all symbols are ready, and returns a SymbolMap. Both overloads take a RegisterDependencies function to register symbol dependencies (if any) on the query. llvm-svn: 337595
*	[ORC] Simplify VSO::lookupFlags to return the flags map.	Lang Hames	2018-07-20	6	-31/+27
\| \| \| \| \| \| \| \| \| \| \|	This discards the unresolved symbols set and returns the flags map directly (rather than mutating it via the first argument). The unresolved symbols result made it easy to chain lookupFlags calls, but such chaining should be rare to non-existant (especially now that symbol resolvers are being deprecated) so the simpler method signature is preferable. llvm-svn: 337594
*	[ORC] Replace SymbolResolvers in the new ORC layers with search orders on VSOs.	Lang Hames	2018-07-20	5	-97/+148
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	A search order is a list of VSOs to be searched linearly to find symbols. Each VSO now has a search order that will be used when fixing up definitions in that VSO. Each VSO's search order defaults to just that VSO itself. This is a first step towards removing symbol resolvers from ORC altogether. In practice symbol resolvers tended to be used to implement a search order anyway, sometimes with additional programatic generation of symbols. Now that VSOs support programmatic generation of definitions via fallback generators, search orders provide a cleaner way to achieve the desired effect (while removing a lot of boilerplate). llvm-svn: 337593
*	[Demangler] Add missing overrides	Benjamin Kramer	2018-07-20	1	-3/+3
\| \| \| \| \| \|	-Winconsistent-missing-override complains about this. llvm-svn: 337592