bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[X86] EltsFromConsecutiveLoads - remove duplicate check for element size. NFCI.	Simon Pilgrim	2019-07-10	1	-6/+0
\| \| \| \| \| \|	We've already checked that each element is the correct contributory size for VT when we inspect the elements for Undef/Zero/Load. llvm-svn: 365656
*	[X86] EltsFromConsecutiveLoads - ensure element reg/store sizes are the same ↵	Simon Pilgrim	2019-07-10	1	-3/+5
\| \| \| \| \| \| \| \|	size. NFCI. This renames the type so it doesn't sound like its based off the load size - as we're moving towards supporting combining loads of different sizes. llvm-svn: 365655
*	AMDGPU: Serialize mode from MachineFunctionInfo	Matt Arsenault	2019-07-10	3	-1/+32
\| \| \| \|	llvm-svn: 365653
*	[PatternMatch] Generalize m_SpecificInt_ULT() to take ICmpInst::Predicate	Roman Lebedev	2019-07-10	2	-2/+4
\| \| \| \| \| \| \|	As discussed in the original review, this may be useful, so let's just do it. llvm-svn: 365652
*	[Remarks] Add cl::Hidden to -remarks-yaml-string-table	Francis Visoiu Mistrih	2019-07-10	1	-2/+3
\| \| \| \| \| \|	It was showing up in a lot of unrelated tools. llvm-svn: 365647
*	[AMDGPU] Allow abs/neg source modifiers on v_cndmask_b32	Jay Foad	2019-07-10	1	-7/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: D59191 added support for these modifiers in the assembler and disassembler. This patch just teaches instruction selection that it can use them. Reviewers: arsenm, tstellar Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D64497 llvm-svn: 365640
*	[InstCombine] pow(C,x) -> exp2(log2(C)*x)	David Bolvansky	2019-07-10	1	-1/+24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Transform pow(C,x) To exp2(log2(C)*x) if C > 0, C != inf, C != NaN (and C is not power of 2, since we have some fold for such case already). log(C) is folded by the compiler and exp2 is much faster to compute than pow. Reviewers: spatel, efriedma, evandro Reviewed By: evandro Subscribers: lebedev.ri, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D64099 llvm-svn: 365637
*	[X86] EltsFromConsecutiveLoads - cleanup Zero/Undef/Load element collection. ↵	Simon Pilgrim	2019-07-10	1	-12/+17
\| \| \| \| \| \|	NFCI. llvm-svn: 365628
*	[MIPS GlobalISel] Select float and double phi	Petar Avramovic	2019-07-10	1	-4/+25
\| \| \| \| \| \| \| \|	Select float and double phi for MIPS32. Differential Revision: https://reviews.llvm.org/D64420 llvm-svn: 365627
*	[MIPS GlobalISel] Select float and double load and store	Petar Avramovic	2019-07-10	1	-22/+44
\| \| \| \| \| \| \| \|	Select float and double load and store for MIPS32. Differential Revision: https://reviews.llvm.org/D64419 llvm-svn: 365626
*	[FileCheck] Simplify numeric variable interface	Thomas Preud'homme	2019-07-10	1	-9/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch simplifies 2 aspects in the FileCheckNumericVariable code. First, setValue() method is turned into a void function since being called only on undefined variable is an invariant and is now asserted rather than returned. This remove the assert from the callers. Second, clearValue() method is also turned into a void function since the only caller does not check its return value since it may be trying to clear the value of variable that is already cleared without this being noteworthy. Reviewers: jhenderson, chandlerc, jdenny, probinson, grimar, arichardson, rnk Subscribers: JonChesterfield, rogfer01, hfinkel, kristina, rnk, tra, arichardson, grimar, dblaikie, probinson, llvm-commits, hiraditya Tags: #llvm Differential Revision: https://reviews.llvm.org/D64231 > llvm-svn: 365249 llvm-svn: 365625
*	[FileCheck] Fix @LINE value after match failure	Thomas Preud'homme	2019-07-10	1	-1/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The value of the FileCheckNumericVariable class instance representing the @LINE numeric variable is set and cleared respectively before and after substitutions are made, if any. However, when a substitution fails, the value is not cleared. This causes the next substitution of @LINE later on to give the wrong value since setValue is a nop if the value is already set. This is what caused failures after commit r365249. Reviewers: jhenderson, chandlerc, jdenny, probinson, grimar, arichardson, rnk Subscribers: hiraditya, llvm-commits, probinson, dblaikie, grimar, arichardson, tra, rnk, kristina, hfinkel, rogfer01, JonChesterfield Tags: #llvm Differential Revision: https://reviews.llvm.org/D64449 llvm-svn: 365624
*	[NFC][ARM] Convert lambdas to static helpers	Sam Parker	2019-07-10	1	-57/+73
\| \| \| \| \| \| \|	Break up and convert some of the lambdas in ARMLowOverheadLoops into static functions. llvm-svn: 365623
*	[X86] EltsFromConsecutiveLoads - LDBase is non-null. NFCI.	Simon Pilgrim	2019-07-10	1	-6/+4
\| \| \| \| \| \|	Don't bother checking for LDBase != null - it should be (and we assert that it is). llvm-svn: 365622
*	[DAGCombine] visitINSERT_SUBVECTOR - use uint64_t subvector index. NFCI.	Simon Pilgrim	2019-07-10	1	-1/+1
\| \| \| \| \| \|	Keep the uint64_t type from getZExtValue() to stop truncation/extension overflow warnings in MSVC in subvector index math. llvm-svn: 365621
*	[X86] EltsFromConsecutiveLoads - store Loads on a per-element basis. NFCI.	Simon Pilgrim	2019-07-10	1	-9/+9
\| \| \| \| \| \|	Cache the LoadSDNode nodes so we can easily map to/from the element index instead of packing them together - this will be useful for future patches for PR16739 etc. llvm-svn: 365620
*	[ELF] Loose a condition for relocation with a symbol	Nikola Prica	2019-07-10	1	-5/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Deleted code was introduced as a work around for a bug in the gold linker (http://sourceware.org/PR16794). Test case that was given as a reason for this part of code, the one on previous link, now works for the gold. This condition is too strict and when a code is compiled with debug info it forces generation of numerous relocations with symbol for architectures that do not have relocation addend. Reviewers: arsenm, espindola Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D64327 llvm-svn: 365618
*	[X86][SSE] EltsFromConsecutiveLoads - add basic dereferenceable support	Simon Pilgrim	2019-07-10	1	-7/+15
\| \| \| \| \| \| \| \| \| \|	This patch checks to see if the vector element loads are based off a dereferenceable pointer that covers the entire vector width, in which case we don't need to have element loads at both extremes of the vector width - just the start (base pointer) of it. Another step towards partial vector loads...... Differential Revision: https://reviews.llvm.org/D64205 llvm-svn: 365614
*	Fix const/non-const lambda return type warning. NFCI.	Simon Pilgrim	2019-07-10	1	-1/+1
\| \| \| \|	llvm-svn: 365613
*	Fix "result of 32-bit shift implicitly converted to 64 bits" warning. NFCI.	Simon Pilgrim	2019-07-10	1	-1/+1
\| \| \| \|	llvm-svn: 365612
*	[SimpleLoopUnswitch] Don't consider unswitching `switch` insructions with ↵	Serguei Katkov	2019-07-10	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	one unique successor Only instructions with two or more unique successors should be considered for unswitching. Patch Author: Daniil Suchkov. Reviewers: reames, asbirlea, skatkov Reviewed By: skatkov Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D64404 llvm-svn: 365611
*	[ARM] Enable VPUSH/VPOP aliases when either MVE or VFP is present	Mikhail Maltsev	2019-07-10	2	-5/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Use the same predicates as VSTMDB/VLDMIA since VPUSH/VPOP alias to these. Patch by Momchil Velikov. Reviewers: ostannard, simon_tatham, SjoerdMeijer, samparker, t.p.northover, dmgreen Reviewed By: dmgreen Subscribers: javed.absar, kristof.beyls, hiraditya, dmgreen, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D64413 llvm-svn: 365604
*	[X86] Limit getTargetConstantFromNode to only work on NormalLoads not ↵	Craig Topper	2019-07-10	1	-1/+1
\| \| \| \| \| \| \| \| \|	extending loads. This seems to fix a failure reported by Jordan Rupprecht, but we don't have a reduced test case yet. llvm-svn: 365589
*	[Support] Move llvm::MemoryBuffer to sys::fs::file_t	Reid Kleckner	2019-07-10	12	-96/+193
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: On Windows, Posix integer file descriptors are a compatibility layer over native file handles provided by the C runtime. There is a hard limit on the maximum number of file descriptors that a process can open, and the limit is 8192. LLD typically doesn't run into this limit because it opens input files, maps them into memory, and then immediately closes the file descriptor. This prevents it from running out of FDs. For various reasons, I'd like to open handles to every input file and keep them open during linking. That requires migrating MemoryBuffer over to taking open native file handles instead of integer FDs. Reviewers: aganea, Bigcheese Reviewed By: aganea Subscribers: smeenai, silvas, mehdi_amini, hiraditya, steven_wu, dexonsmith, dang, llvm-commits, zturner Tags: #llvm Differential Revision: https://reviews.llvm.org/D63453 llvm-svn: 365588
*	AMDGPU/GlobalISel: Add support for wide loads >= 256-bits	Tom Stellard	2019-07-10	4	-37/+219
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This adds support for the most commonly used wide load types: <8xi32>, <16xi32>, <4xi64>, and <8xi64> Reviewers: arsenm Reviewed By: arsenm Subscribers: hiraditya, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, rovka, kristof.beyls, dstuttard, tpr, t-tye, volkan, Petar.Avramovic, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57399 llvm-svn: 365586
*	GlobalISel: Implement lower for G_FCOPYSIGN	Matt Arsenault	2019-07-09	2	-3/+52
\| \| \| \| \| \| \| \| \|	In SelectionDAG AMDGPU treated these as legal, but this was mostly because the bitcasts required for FP types were painful. Theoretically the bitpattern should eventually match to bfi, so don't bother trying to get the patterns to import. llvm-svn: 365583
*	[Bitcode] Explicitly include Bitstream/BitCodes.h and BitstreamWriter.h	Francis Visoiu Mistrih	2019-07-09	1	-0/+2
\| \| \| \| \| \|	This fixes a modules issue. llvm-svn: 365580
*	[X86] Don't form extloads in combineExtInVec unless the load extension is legal.	Craig Topper	2019-07-09	1	-7/+9
\| \| \| \| \| \| \| \| \| \|	This should prevent doing this on pre-sse4.1 targets or for 256 bit vectors without avx2. I don't know of a failure from this. Op legalization will probably take care of, but seemed better to be safe. llvm-svn: 365577
*	AMDGPU/GlobalISel: Fix legality for G_BUILD_VECTOR	Matt Arsenault	2019-07-09	1	-7/+4
\| \| \| \|	llvm-svn: 365575
*	[AMDGPU] gfx908 v_pk_fmac_f16 support	Stanislav Mekhanoshin	2019-07-09	2	-4/+10
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D64433 llvm-svn: 365573
*	GlobalISel: Combine unmerge of merge with intermediate cast	Matt Arsenault	2019-07-09	1	-3/+9
\| \| \| \| \| \| \|	This eliminates some illegal intermediate vectors when operations are scalarized. llvm-svn: 365566
*	[Profile] Support raw/indexed profiles larger than 4GB	Vedant Kumar	2019-07-09	1	-2/+2
\| \| \| \| \| \|	rdar://45955976 llvm-svn: 365565
*	[AMDGPU] gfx908 mAI instructions, MC part	Stanislav Mekhanoshin	2019-07-09	19	-18/+674
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D64446 llvm-svn: 365563
*	[SLP] Optimize getSpillCost(); NFCI	Nikita Popov	2019-07-09	1	-6/+10
\| \| \| \| \| \| \| \| \| \| \| \|	For a given set of live values, the spill cost will always be the same for each call. Compute the cost once and multiply it by the number of calls. (I'm not sure this spill cost modeling makes sense if there are multiple calls, as the spill cost will likely be shared across calls in that case. But that's how it currently works.) llvm-svn: 365552
*	hwasan: Improve precision of checks using short granule tags.	Peter Collingbourne	2019-07-09	2	-17/+152
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	A short granule is a granule of size between 1 and `TG-1` bytes. The size of a short granule is stored at the location in shadow memory where the granule's tag is normally stored, while the granule's actual tag is stored in the last byte of the granule. This means that in order to verify that a pointer tag matches a memory tag, HWASAN must check for two possibilities: * the pointer tag is equal to the memory tag in shadow memory, or * the shadow memory tag is actually a short granule size, the value being loaded is in bounds of the granule and the pointer tag is equal to the last byte of the granule. Pointer tags between 1 to `TG-1` are possible and are as likely as any other tag. This means that these tags in memory have two interpretations: the full tag interpretation (where the pointer tag is between 1 and `TG-1` and the last byte of the granule is ordinary data) and the short tag interpretation (where the pointer tag is stored in the granule). When HWASAN detects an error near a memory tag between 1 and `TG-1`, it will show both the memory tag and the last byte of the granule. Currently, it is up to the user to disambiguate the two possibilities. Because this functionality obsoletes the right aligned heap feature of the HWASAN memory allocator (and because we can no longer easily test it), the feature is removed. Also update the documentation to cover both short granule tags and outlined checks. Differential Revision: https://reviews.llvm.org/D63908 llvm-svn: 365551
*	[PoisonChecking] Flesh out complete todo list for full coverage	Philip Reames	2019-07-09	1	-8/+24
\| \| \| \| \|	Note: I don't actually plan to implement all of the cases at the moment, I'm just documenting them for completeness. There's a couple of cases left which are practically useful for me in debugging loop transforms, and I'll probably stop there for the moment. llvm-svn: 365550
*	[X86][AMDGPU][DAGCombiner] Move call to allowsMemoryAccess into ↵	Craig Topper	2019-07-09	5	-24/+31
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	isLoadBitCastBeneficial/isStoreBitCastBeneficial to allow X86 to bypass it Basically the problem is that X86 doesn't set the Fast flag from allowsMemoryAccess on certain CPUs due to slow unaligned memory subtarget features. This prevents bitcasts from being folded into loads and stores. But all vector loads and stores of the same width are the same cost on X86. This patch merges the allowsMemoryAccess call into isLoadBitCastBeneficial to allow X86 to skip it. Differential Revision: https://reviews.llvm.org/D64295 llvm-svn: 365549
*	Fix build error for VC STL, use llvm::make_unique	Reid Kleckner	2019-07-09	1	-1/+1
\| \| \| \|	llvm-svn: 365548
*	[AMDGPU] gfx908 register file changes	Stanislav Mekhanoshin	2019-07-09	6	-50/+621
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D64438 llvm-svn: 365546
*	[PoisonCheker] Support for out of bounds operands on shifts + ↵	Philip Reames	2019-07-09	1	-1/+41
\| \| \| \| \| \| \| \|	insert/extractelement These are sources of poison which don't come from flags, but are clearly documented in the LangRef. Left off support for scalable vectors for the moment, but should be easy to add if anyone is interested. llvm-svn: 365543
*	Boilerplate for producing XCOFF object files from the PowerPC backend.	Sean Fertile	2019-07-09	17	-3/+373
\| \| \| \| \| \| \| \| \| \|	Stubs out a number of the classes needed to produce a new object file format (XCOFF) for the powerpc-aix target. For testing input is an empty module which produces an object file with just a file header. Differential Revision: https://reviews.llvm.org/D61694 llvm-svn: 365541
*	[X86] LowerToHorizontalOp - use count_if to count non-UNDEF ops. NFCI.	Simon Pilgrim	2019-07-09	1	-5/+2
\| \| \| \|	llvm-svn: 365540
*	[PoisonChecking] Add validation rules for "exact" on sdiv/udiv	Philip Reames	2019-07-09	1	-0/+18
\| \| \| \| \| \|	As directly stated in the LangRef, no ambiguity here... llvm-svn: 365538
*	[ThinLTO] only emit used or referenced CFI records to index	Bob Haarman	2019-07-09	1	-8/+22
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: We emit CFI_FUNCTION_DEFS and CFI_FUNCTION_DECLS to distributed ThinLTO indices to implement indirect function call checking. This change causes us to only emit entries for functions that are either defined or used by the module we're writing the index for (instead of all functions in the combined index), which can make the indices substantially smaller. Fixes PR42378. Reviewers: pcc, vitalybuka, eugenis Subscribers: mehdi_amini, hiraditya, dexonsmith, arphaman, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63887 llvm-svn: 365537
*	Add a transform pass to make the executable semantics of poison explicit in ↵	Philip Reames	2019-07-09	5	-0/+288
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	the IR Implements a transform pass which instruments IR such that poison semantics are made explicit. That is, it provides a (possibly partial) executable semantics for every instruction w.r.t. poison as specified in the LLVM LangRef. There are obvious parallels to the sanitizer tools, but this pass is focused purely on the semantics of LLVM IR, not any particular source language. The target audience for this tool is developers working on or targetting LLVM from a frontend. The idea is to be able to take arbitrary IR (with the assumption of known inputs), and evaluate it concretely after having made poison semantics explicit to detect cases where either a) the original code executes UB, or b) a transform pass introduces UB which didn't exist in the original program. At the moment, this is mostly the framework and still needs to be fleshed out. By reusing existing code we have decent coverage, but there's a lot of cases not yet handled. What's here is good enough to handle interesting cases though; for instance, one of the recent LFTR bugs involved UB being triggered by integer induction variables with nsw/nuw flags would be reported by the current code. (See comment in PoisonChecking.cpp for full explanation and context) Differential Revision: https://reviews.llvm.org/D64215 llvm-svn: 365536
*	Try to appease the Windows build bots.	Sean Fertile	2019-07-09	1	-4/+12
\| \| \| \| \| \| \|	Several of the conditonal operators commited in llvm-svn: 365524 fail to compile on the windows buildbots. Converting to an if and early return to try to fix. llvm-svn: 365535
*	[BPF] Fix a typo in the file name	Yonghong Song	2019-07-09	2	-1/+1
\| \| \| \| \| \| \| \|	Fixed the file name from BPFAbstrctMemberAccess.cpp to BPFAbstractMemberAccess.cpp. Signed-off-by: Yonghong Song <yhs@fb.com> llvm-svn: 365532
*	[AMDGPU] gfx908 target	Stanislav Mekhanoshin	2019-07-09	7	-1/+104
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D64429 llvm-svn: 365525
*	[Object][XCOFF] Add support for 64-bit file header and section header dumping.	Sean Fertile	2019-07-09	5	-150/+260
\| \| \| \| \| \| \| \| \| \| \|	Adds a readobj dumper for 32-bit and 64-bit section header tables, and extend support for the file-header dumping to include 64-bit object files. Also refactors the binary file parsing to be done in a helper function in an attempt to cleanup error handeling. Differential Revision: https://reviews.llvm.org/D63843 llvm-svn: 365524
*	Revert "[HardwareLoops] NFC - move hardware loop checking code to ↵	Jinsong Ji	2019-07-09	2	-44/+34
\| \| \| \| \| \| \| \|	isHardwareLoopProfitable()" This reverts commit d95557306585404893d610784edb3e32f1bfce18. llvm-svn: 365520