bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[WebAssembly] Add wasm symbol table support to llvm-objdump	Sam Clegg	2017-05-04	1	-21/+47
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D32760 llvm-svn: 302185
*	[PPC] When restoring R30 (PIC base pointer), mark it as <def>	Krzysztof Parzyszek	2017-05-04	1	-2/+1
\| \| \| \| \| \| \| \| \|	This happened on the PPC32/SVR4 path and was discovered when building FreeBSD on PPC32. It was a typo-class error in the frame lowering code. This fixes PR26519. llvm-svn: 302183
*	Don't return an invalid line table if the DW_AT_stmt_list value is not in ↵	Greg Clayton	2017-05-04	1	-0/+4
\| \| \| \| \| \|	the .debug_line section. llvm-svn: 302180
*	[ms-inline-asm] Use the frontend size only for ambiguous instructions	Reid Kleckner	2017-05-04	2	-66/+39
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This avoids problems on code like this: char buf[16]; __asm { movups xmm0, [buf] mov [buf], eax } The frontend size in this case (1) is wrong, and the register makes the instruction matching unambiguous. There are also enough bytes available that we shouldn't complain to the user that they are potentially using an incorrectly sized instruction to access the variable. Supersedes D32636 and D26586 and fixes PR28266 llvm-svn: 302179
*	[InstSimplify] move logic-of-icmps helper functions; NFC	Sanjay Patel	2017-05-04	1	-80/+80
\| \| \| \| \| \| \| \|	Putting these next to each other should make it easier to see what's missing from each side. Patch to plug one of those holes should be posted soon. llvm-svn: 302178
*	Re-apply r302108, "IR: Use pointers instead of GUIDs to represent edges in ↵	Peter Collingbourne	2017-05-04	9	-136/+133
\| \| \| \| \| \| \| \|	the module summary. NFCI." with a fix for the clang backend. llvm-svn: 302176
*	Fix a typo.	Michael Zolotukhin	2017-05-04	1	-2/+2
\| \| \| \|	llvm-svn: 302175
*	[NewGVN] Remove unneeded newline and format assertions. NFCI.	Davide Italiano	2017-05-04	1	-5/+4
\| \| \| \|	llvm-svn: 302173
*	[APInt] Reduce number of allocations involved in multiplying. Reduce worst ↵	Craig Topper	2017-05-04	1	-110/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	case multiply size Currently multiply is implemented in operator=. Operator makes a copy and uses operator= to modify the copy. Operator= itself allocates a temporary buffer to hold the multiply result as it computes it. Then copies it to the buffer in this. Operator= attempts to bound the size of the result based on the number of active bits in its inputs. It also has a couple special cases to handle 0 inputs without any memory allocations or multiply operations. The best case is that it calculates a single word regardless of input bit width. The worst case is that it calculates the a 2x input width result and drop the upper bits. Since operator* uses operator= it incurs two allocations, one for a copy of this and one for the temporary allocation. Neither of these allocations are kept after the method operation is done. The main usage in the backend appears to be ConstantRange::multiply which uses operator* rather than operator=. This patch moves the multiply operation to operator and implements operator= using it. This avoids the copy in operator. operator* now allocates a result buffer sized the same width as its inputs no matter what. This buffer will be used as the buffer for the returned APInt. Finally, we reuse tcMultiply to implement the multiply operation. This function is capable of not calculating additional upper words that will be discarded. This change does lose the special optimizations for the inputs using less words than their size implies. But it also removed the getActiveBits calls from all multiplies. If we think those optimizations are important we could look at providing additional bounds to tcMultiply to limit the computations. Differential Revision: https://reviews.llvm.org/D32830 llvm-svn: 302171
*	[PM] Add ProfileSummaryAnalysis as a required pass in the new pipeline.	Easwaran Raman	2017-05-04	1	-0/+4
\| \| \| \| \| \|	Differential revision: https://reviews.llvm.org/D32768 llvm-svn: 302170
*	[DAGCombine] Transform (fadd A, (fmul B, -2.0)) -> (fsub A, (fadd B, B)).	Chad Rosier	2017-05-04	1	-0/+18
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D32596 llvm-svn: 302153
*	Refactoring with range-based for, NFC	Krzysztof Parzyszek	2017-05-04	3	-119/+94
\| \| \| \| \| \| \| \|	Patch by Wei-Ren Chen. Differential Revision: https://reviews.llvm.org/D32682 llvm-svn: 302148
*	[SystemZ] Make copyPhysReg() add impl-use operands of super reg.	Jonas Paulsson	2017-05-04	1	-1/+7
\| \| \| \| \| \| \| \| \|	When a 128 bit COPY is lowered into two instructions, an impl-use operand of the super-reg should be added to each new instruction in case one of the sub-regs is undefined. Review: Ulrich Weigand llvm-svn: 302146
*	Revert "IR: Use pointers instead of GUIDs to represent edges in the module ↵	Eric Liu	2017-05-04	9	-133/+136
\| \| \| \| \| \| \| \| \| \|	summary. NFCI." This reverts commit r302108. This causes crash in clang bootstrap with LTO. Contacted the auther in the original commit. llvm-svn: 302140
*	[mips][XRay] Use the base version of emitXRayTable	Simon Dardis	2017-05-04	1	-34/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Follow up rL290858 by removing the MIPS specific version of XRayTable emission in favour of the basic version. This resolves a buildbot failure where the ELF sections were malformed causing the linker to reject the object files with xray related sections. Reviewers: dberris, slthakur Differential Revision: https://reviews.llvm.org/D32808 llvm-svn: 302138
*	[ArgPromotion] Fix a truncated variable	Martin Storsjo	2017-05-04	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \|	This fixes a regression since SVN rev 273808 (which was supposed to not change functionality). The regression caused miscompilations (noted in the wild when targeting AArch64) on platforms with 32 bit long. Differential Revision: https://reviews.llvm.org/D32850 llvm-svn: 302137
*	[LLVM][inline-asm][Altmacor] Altmacro string delimiter '<..>'	Michael Zuckerman	2017-05-04	1	-4/+38
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In this patch, I introduce a new altmacro string delimiter. This review is the second review in a series of four reviews. (one for each altmacro feature: LOCAL, string delimiter, string '!' escape sign and absolute expression as a string '%' ). In the alternate macro mode, you can delimit strings with matching angle brackets <..> when using it as a part of calling macro arguments. As described in the https://sourceware.org/binutils/docs-2.27/as/Altmacro.html "<string> You can delimit strings with matching angle brackets." assumptions: 1. If an argument begins with '<' and ends with '>'. The argument is considered as a string. 2. Except adding new string mark '<..>', a regular macro behavior is expected. 3. The altmacro cannot affect the regular less/greater behavior. 4. If a comma is present inside an angle brackets it considered as a character and not as a separator. Differential Revision: https://reviews.llvm.org/D32701 llvm-svn: 302135
*	[X86][AVX-512] Allow EVEX encoded instruction selection when available for ↵	Igor Breger	2017-05-04	1	-2/+2
\| \| \| \| \| \| \| \|	mul v8i32. Differential Revision: https://reviews.llvm.org/D32679 llvm-svn: 302127
*	[ARM] ACLE Chapter 9 intrinsics	Sam Parker	2017-05-04	4	-139/+338
\| \| \| \| \| \| \| \| \| \| \| \|	Added the integer data processing intrinsics from ACLE v2.1 Chapter 9 but I have missed out the saturation_occurred intrinsics for now. For the instructions that read and write the GE bits, a chain is included and the only instruction that reads these flags (sel) is only selectable via the implemented intrinsic. Differential Revision: https://reviews.llvm.org/D32281 llvm-svn: 302126
*	[X86] Disabling PLT in Regcall CC Functions	Oren Ben Simhon	2017-05-04	1	-2/+8
\| \| \| \| \| \| \| \| \| \|	According to psABI, PLT stub clobbers XMM8-XMM15. In Regcall calling convention those registers are used for passing parameters. Thus we need to prevent lazy binding in Regcall. Differential Revision: https://reviews.llvm.org/D32430 llvm-svn: 302124
*	[AVX] Fix vpcmpeqq predicate.	Igor Breger	2017-05-04	1	-2/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Fix vpcmpeqq predicate. AVX512 version of vpcmpeqq is not equivalent to AVX one. Split from https://reviews.llvm.org/D32679 Reviewers: craig.topper, zvi, aymanmus Reviewed By: craig.topper Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32786 llvm-svn: 302119
*	Use right function in LoopVectorize.	Jonas Paulsson	2017-05-04	1	-1/+1
\| \| \| \| \| \| \| \|	- unsigned AS = getMemInstAlignment(I); + unsigned AS = getMemInstAddressSpace(I); Review: Hal Finkel llvm-svn: 302114
*	[XRay] Use wordsize-dependent alignment for index	Dean Michael Berris	2017-05-04	1	-3/+3
\| \| \| \| \| \| \| \| \| \|	This makes it simpler for the runtime to consistently handle the entries in the function sled index in both 32 and 64 bit platforms where the XRay runtime works. Follow-up on D32693. llvm-svn: 302111
*	[SelectionDAG] Improve known bits support for CTPOP.	Craig Topper	2017-05-04	1	-1/+4
\| \| \| \| \| \|	This is based on the same concept from ValueTracking's version of computeKnownBits. llvm-svn: 302110
*	[XRay] Create an Index of sleds per function	Dean Michael Berris	2017-05-04	1	-13/+39
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This change adds a new section to the xray-instrumented binary that stores an index into ranges of the instrumentation map, where sleds associated with the same function can be accessed as an array. At runtime, we can get access to this index by function ID offset allowing for selective patching and unpatching by function ID. Each entry in this new section (xray_fn_idx) will include two pointers indicating the start and one past the end of the sleds associated with the same function. These entries will be 16 bytes long on x86 and aarch64. On arm, we align to 16 bytes anyway so the runtime has to take that into consideration. __{start,stop}_xray_fn_idx will be the symbols that the runtime will look for when we implement the selective patching/unpatching by function id APIs. Because XRay synthesizes the function id's in a monotonically increasing manner at runtime now, implementations (and users) can use this table to look up the sleds associated with a specific function. This is useful in implementations that want to do things like: - Implement coverage mode for functions by patching everything pre-main, then as functions are encountered, the installed handler can unpatch the function that's been encountered after recording that it's been called. - Do "learning mode", so that the implementation can figure out some statistical information about function calls by function id for a time being, and then determine which functions are worth uninstrumenting at runtime. - Do "selective instrumentation" where an implementation can specifically instrument only certain function id's at runtime (either based on some external data, or through some other heuristics) instead of patching all the instrumented functions at runtime. Reviewers: dblaikie, echristo, chandlerc, javed.absar Subscribers: pelikan, aemerson, kpw, llvm-commits, rengolin Differential Revision: https://reviews.llvm.org/D32693 llvm-svn: 302109
*	IR: Use pointers instead of GUIDs to represent edges in the module summary. ↵	Peter Collingbourne	2017-05-04	9	-136/+133
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	NFCI. When profiling a no-op incremental link of Chromium I found that the functions computeImportForFunction and computeDeadSymbols were consuming roughly 10% of the profile. The goal of this change is to improve the performance of those functions by changing the map lookups that they were previously doing into pointer dereferences. This is achieved by changing the ValueInfo data structure to be a pointer to an element of the global value map owned by ModuleSummaryIndex, and changing reference lists in the GlobalValueSummary to hold ValueInfos instead of GUIDs. This means that a ValueInfo will take a client directly to the summary list for a given GUID. Differential Revision: https://reviews.llvm.org/D32471 llvm-svn: 302108
*	[XRay] Detect loops in functions being lowered	Dean Michael Berris	2017-05-04	1	-16/+30
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This is an implementation of the loop detection logic that XRay needs to determine whether a function might take time at runtime. Without this heuristic, XRay will tend to not instrument short functions that have loops that might have runtime dependent on inputs or external values. While this implementation doesn't do any further analysis than just figuring out whether there is a loop in the MachineFunction being code-gen'ed, we're paving the way for being able to perform more sophisticated analysis of the function in the future (for example to determine whether the trip count for the loop might be constant, and make a decision on that instead). This enables us to cover more functions with the default heuristics, and potentially identify ones that have variable runtime latency just by looking for the presence of loops. Reviewers: chandlerc, rnk, pelikan Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32274 llvm-svn: 302103
*	[SCEV] createAddRecFromPHI: Optimize for the most common case.	Michael Zolotukhin	2017-05-03	1	-2/+58
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The existing implementation creates a symbolic SCEV expression every time we analyze a phi node and then has to remove it, when the analysis is finished. This is very expensive, and in most of the cases it's also unnecessary. According to the data I collected, ~60-70% of analyzed phi nodes (measured on SPEC) have the following form: PN = phi(Start, OP(Self, Constant)) Handling such cases separately significantly speeds this up. Reviewers: sanjoy, pete Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32663 llvm-svn: 302096
*	[InstCombine][KnownBits] Use KnownBits better to detect nsw adds	Craig Topper	2017-05-03	1	-32/+44
\| \| \| \| \| \| \| \| \| \| \|	Change checkRippleForAdd from a heuristic to a full check - if it is provable that the add does not overflow return true, otherwise false. Patch by Yoav Ben-Shalom Differential Revision: https://reviews.llvm.org/D32686 llvm-svn: 302093
*	Mark functions as not having CFI once we finalize an x86 stack frame	Reid Kleckner	2017-05-03	1	-0/+4
\| \| \| \| \| \| \| \| \|	We'll set it back to true in emitPrologue if it gets called. It doesn't get called for naked functions. Fixes PR32912 llvm-svn: 302092
*	[KnownBits] Add methods for determining if KnownBits is a constant value	Craig Topper	2017-05-03	3	-11/+10
\| \| \| \| \| \| \| \|	This patch adds isConstant and getConstant for determining if KnownBits represents a constant value and to retrieve the value. Use them to simplify code. Differential Revision: https://reviews.llvm.org/D32785 llvm-svn: 302091
*	[ValueTracking] Remove handling for BitWidth being 0 in ComputeSignBit and ↵	Craig Topper	2017-05-03	1	-12/+5
\| \| \| \| \| \| \| \|	isKnownNonZero. I don't believe its possible to have non-zero values here since DataLayout became required. The APInt constructor inside of the KnownBits object will assert if this ever happens. llvm-svn: 302089
*	[KnownBits] Add zext, sext, and trunc methods to KnownBits	Craig Topper	2017-05-03	6	-66/+33
\| \| \| \| \| \| \| \|	This patch adds zext, sext, and trunc methods to KnownBits and uses them where possible. Differential Revision: https://reviews.llvm.org/D32784 llvm-svn: 302088
*	[TargetLowering] use isSubsetOf in SimplifyDemandedBits; NFCI	Sanjay Patel	2017-05-03	1	-18/+14
\| \| \| \| \| \| \| \| \|	This is the DAG equivalent of https://reviews.llvm.org/D32255 , which will hopefully be committed again. The functionality (preferring a 'not' op) is already here in the DAG, so this is just intended to be a clean-up and performance improvement. llvm-svn: 302087
*	clang-format and restyle DWARFFormValue before working on it. NFC	Paul Robinson	2017-05-03	1	-280/+301
\| \| \| \|	llvm-svn: 302086
*	DebugInfo: elide type index entries for synthetic types	Saleem Abdulrasool	2017-05-03	1	-4/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Compiler emitted synthetic types may not have an associated DIFile (translation unit). In such a case, when generating CodeView debug type information, we would attempt to compute an absolute filepath which would result in a segfault due to a NULL DIFile*. If there is no source file associated with the type, elide the type index entry for the type and record the type information. This actually results in higher fidelity debug information than clang/C2 as of this writing. Resolves PR32668! llvm-svn: 302085
*	[AArch64] Fix variable name ambiguity in r302078.	Ahmed Bougacha	2017-05-03	1	-2/+4
\| \| \| \| \| \|	ArchKind is passed to the function, but it's also a type. llvm-svn: 302081
*	[TailCallElim] Remove an unused argument. NFCI	Xin Tong	2017-05-03	1	-9/+6
\| \| \| \|	llvm-svn: 302080
*	[AArch64] Make the TargetParser add CPU exts provided by the arch.	Ahmed Bougacha	2017-05-03	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Otherwise, each CPU has to manually specify the extensions it supports, even though they have to be a superset of the base arch extensions. And when there's redundant data there's stale data, so most of the CPUs lie about the features they support (almost none lists AEK_FP). Instead, do the saner thing: add the optional extensions on top of the base extensions provided by the architecture. The ARM TargetParser has the same behavior. Differential Revision: https://reviews.llvm.org/D32780 llvm-svn: 302078
*	[AArch64] armv8-A doesn't have CRC.	Ahmed Bougacha	2017-05-03	1	-1/+0
\| \| \| \| \| \| \| \| \| \| \|	That's only a required extension as of v8.1a. Remove it from the "generic" CPU as well: it should only support the base ISA (and binutils agrees). Also unify the MC tests into crc.s and arm64-crc32.s llvm-svn: 302077
*	[libFuzzer] exit without running atexit handlers in libfuzzer's crash handler	Vitaly Buka	2017-05-03	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: It's not safe to assume that atexit handlers can be run once the app crashed. Patch by Jochen Eisinger. Reviewers: kcc, vitalybuka Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32640 llvm-svn: 302076
*	[Hexagon] Use automatically-generated scheduling information for HVX	Krzysztof Parzyszek	2017-05-03	27	-8023/+9998
\| \| \| \| \| \|	Patch by Jyotsna Verma. llvm-svn: 302073
*	Remove unused private field.	Zachary Turner	2017-05-03	1	-3/+2
\| \| \| \|	llvm-svn: 302069
*	Avoid warning of unused variable in release builds. NFC	Anna Thomas	2017-05-03	1	-0/+2
\| \| \| \|	llvm-svn: 302068
*	Fix typos in comment	Sanjoy Das	2017-05-03	1	-4/+4
\| \| \| \|	llvm-svn: 302063
*	Break verification down into smaller functions to keep code clean.	Greg Clayton	2017-05-03	1	-142/+167
\| \| \| \| \| \| \| \|	Adrian requested that we break things down to make things clean in the DWARFVerifier. This patch breaks everything down into nice individual functions and cleans up the code quite a bit and prepares us for the next round of verifiers. Differential Revision: https://reviews.llvm.org/D32812 llvm-svn: 302062
*	Fix PPC64 warning for missing parantheses. NFC.	Anna Thomas	2017-05-03	1	-3/+4
\| \| \| \|	llvm-svn: 302061
*	[IR] Abstract away ArgNo+1 attribute indexing as much as possible	Reid Kleckner	2017-05-03	19	-223/+242
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Do three things to help with that: - Add AttributeList::FirstArgIndex, which is an enumerator currently set to 1. It allows us to change the indexing scheme with fewer changes. - Add addParamAttr/removeParamAttr. This just shortens addAttribute call sites that would otherwise need to spell out FirstArgIndex. - Remove some attribute-specific getters and setters from Function that take attribute list indices. Most of these were only used from BuildLibCalls, and doesNotAlias was only used to test or set if the return value is malloc-like. I'm happy to split the patch, but I think they are probably easier to review when taken together. This patch should be NFC, but it sets the stage to change the indexing scheme to this, which is more convenient when indexing into an array: 0: func attrs 1: retattrs 2...: arg attrs Reviewers: chandlerc, pete, javed.absar Subscribers: david2050, llvm-commits Differential Revision: https://reviews.llvm.org/D32811 llvm-svn: 302060
*	[CodeView] Remove constructor initialization of a removed field.	Davide Italiano	2017-05-03	1	-2/+2
\| \| \| \| \| \|	I should've staged this with my last commit. llvm-svn: 302059
*	[RuntimeLoopUnroller] Add assert that we dont unroll non-rotated loops	Anna Thomas	2017-05-03	1	-0/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Cloning basic blocks in the loop for runtime loop unroller depends on loop being in rotated form (i.e. loop latch target is the exit block). Assert that this is true, so that callers of runtime loop unroller pass in canonical loops. The single caller of this function has that check recently added: https://reviews.llvm.org/rL301239 Reviewers: davide Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32801 llvm-svn: 302058