bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[Clang][AVX512][BuiltIn] Adding support to intrinsics of VPERMD and VPERMW ↵	Michael Zuckerman	2016-04-25	13	-1/+802
\| \| \| \| \| \| \| \|	instruction set Differential Revision: http://reviews.llvm.org/D19195 llvm-svn: 267380
*	Fixing wrong mask size error. From __mmask8 to __mmask16.	Michael Zuckerman	2016-04-25	2	-6/+6
\| \| \| \| \| \| \|	Was reviewed over the shoulder by AsafBadouh. Connected to review http://reviews.llvm.org/D19195. llvm-svn: 267379
*	[Support/ELFRelocs] Add R_386_GOT32X.	Davide Italiano	2016-04-25	1	-1/+2
\| \| \| \| \| \| \| \|	The new relocation recently defined in the Intel386 psABI was still missing from this file. A subsequent commit will add support for GOT32X in MC, together with a test. llvm-svn: 267378
*	[X86] Replace a SmallVector used to pass 2 values to an ArrayRef parameter ↵	Craig Topper	2016-04-25	1	-3/+1
\| \| \| \| \| \|	with a fixed size array. NFC llvm-svn: 267377
*	[esan] Fix uninitialized warning from interception context	Derek Bruening	2016-04-25	1	-0/+2
\| \| \| \| \| \| \| \|	The interception context is not used by esan, but the compiler complains about it being uninitialized all the same. We set it to null to avoid the warning. llvm-svn: 267376
*	Minor code cleanups. NFC.	Junmo Park	2016-04-25	4	-23/+23
\| \| \| \|	llvm-svn: 267375
*	[llgo] llgoi: separate evaluation from printing	Andrew Wilkins	2016-04-25	7	-94/+124
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Separate the evaluation of expressions from printing of results. This is in preparation for splitting the core of the interpreter out for use in alternative interpreter frontends. At the same time, the output is made less noisy in response to comments on the golang-nuts announcement. We would ideally print out values using Go syntax, but this is impractical until we have libgo based on Go 1.5. When that happens, fmt's %#v will handle reflect.Value better, and so we can fix/filter type names to remove automatically generated package names. Reviewers: pcc Subscribers: llvm-commits, axw Differential Revision: http://reviews.llvm.org/D13761 llvm-svn: 267374
*	[X86] Add a complete set of tests for all operand sizes of cttz/ctlz with ↵	Craig Topper	2016-04-25	1	-6/+123
\| \| \| \| \| \|	and without zero undef being lowered to bsf/bsr. llvm-svn: 267373
*	Add a --element-count option to the expression command	Enrico Granata	2016-04-25	14	-13/+294
\| \| \| \| \| \| \| \| \| \| \| \| \|	This option evaluates an expression and, if the result is of pointer type, treats it as if it was an array of that many elements and displays such elements This has a couple subtle points but is mostly as straightforward as it sounds Add a parray N <expr> alias for this new mode Also, extend the --object-description mode to do the moral equivalent of the above but display each element in --object-description mode Add a poarray N <expr> alias for this llvm-svn: 267372
*	Add a note to the test explaining why it doesn't match gold's behaviour.	Peter Collingbourne	2016-04-25	1	-0/+3
\| \| \| \|	llvm-svn: 267371
*	Verifier: Verify that each inlinable callsite of a debug-info-bearing function	Adrian Prantl	2016-04-24	4	-2/+75
\| \| \| \| \| \| \| \| \| \| \| \| \|	in a debug-info-bearing function has a debug location attached to it. Failure to do so causes an "!dbg attachment points at wrong subprogram for function" assertion failure when the inliner sets up inline scope info. rdar://problem/25878916 This reaplies r267320 without changes after fixing an issue in the OpenMP IR generator in clang. llvm-svn: 267370
*	Debug info: Apply an empty debug location for global OpenMP destructors.	Adrian Prantl	2016-04-24	2	-2/+5
\| \| \| \| \| \| \| \| \| \| \| \|	LLVM really wants a debug location on every inlinable call in a function with debug info, because it otherwise cannot set up inlining debug info. This change applies an artificial line 0 debug location (which is how DWARF marks automatically generated code that has no corresponding source code) to the .__kmpc_global_dtor_. functions to avoid the LLVM Verifier complaining. llvm-svn: 267369
*	clang-format: [JS] generator and async functions.	Martin Probst	2016-04-24	5	-11/+69
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	For generators, see: https://developer.mozilla.org/en-US/docs/Web/JavaScript/Guide/Iterators_and_generators async functions are not quite in the spec yet, but stage 3 and already widely used: http://tc39.github.io/ecmascript-asyncawait/ Reviewers: djasper Subscribers: klimek Differential Revision: http://reviews.llvm.org/D19204 llvm-svn: 267368
*	Also check the IR.	Rafael Espindola	2016-04-24	1	-0/+4
\| \| \| \|	llvm-svn: 267367
*	Add a test for how we handle protected visibility.	Rafael Espindola	2016-04-24	2	-0/+22
\| \| \| \|	llvm-svn: 267366
*	unwind: remove unnecessary header	Saleem Abdulrasool	2016-04-24	1	-3/+0
\| \| \| \| \| \| \|	Availablity.h is not used within config.h. The locations which use the availability infrastructure already include the necessary header(s). NFC. llvm-svn: 267365
*	unwind: unify _LIBUNWIND_ABORT	Saleem Abdulrasool	2016-04-24	1	-18/+8
\| \| \| \| \| \| \| \| \| \| \|	Rather than use the `__assert_rtn` on libSystem based targets and a local `assert_rtn` function on others, expand the function definition into a macro which will perform the writing to stderr and then abort. This unifies the definition and behaviour across targets. Ensure that we flush stderr prior to aborting. llvm-svn: 267364
*	Fix unwind failures when PC points beyond the end of a function	Ulrich Weigand	2016-04-24	2	-6/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	RegisterContextLLDB::InitializeNonZerothFrame already has code to attempt to detect and handle the case where the PC points beyond the end of a function, but there are certain cases where this doesn't work correctly. In fact, there are two different places where this detection is attempted, and the failure is in fact a result of an unfortunate interaction between those two separate attempts. First, the ResolveSymbolContextForAddress routine is called with the resolve_tail_call_address flag set to true. This causes the routine to internally accept a PC pointing beyond the end of a function, and still resolving the PC to that function symbol. Second, the InitializeNonZerothFrame routine itself maintains a "decr_pc_and_recompute_addr_range" flag and, if that turns out to be true, itself decrements the PC by one and searches again for a symbol at that new PC value. Both approaches correctly identify the symbol associated with the PC. However, the problem is now that later on, we also need to find the DWARF CFI record associated with the PC. This is done in the RegisterContextLLDB::GetFullUnwindPlanForFrame routine, and uses the "m_current_offset_backed_up_one" member variable. However, that variable only actually contains the PC "backed up by one" if the second approach above was taken. If the function was already identified via the first approach above, that member variable is not backed up by one but simply points to the original PC. This in turn causes GetEHFrameUnwindPlan to not correctly identify the DWARF CFI record associated with the PC. Now, in many cases, if the first method had to back up the PC by one, we still use the second method too, because of this piece of code: // Or if we're in the middle of the stack (and not "above" an asynchronous event like sigtramp), // and our "current" pc is the start of a function... if (m_sym_ctx_valid && GetNextFrame()->m_frame_type != eTrapHandlerFrame && GetNextFrame()->m_frame_type != eDebuggerFrame && addr_range.GetBaseAddress().IsValid() && addr_range.GetBaseAddress().GetSection() == m_current_pc.GetSection() && addr_range.GetBaseAddress().GetOffset() == m_current_pc.GetOffset()) { decr_pc_and_recompute_addr_range = true; } In many cases, when the PC is one beyond the end of the current function, it will indeed then be exactly at the start of the next function. But this is not always the case, e.g. if there happens to be alignment padding between the end of one function and the start of the next. In those cases, we may sucessfully look up the function symbol via ResolveSymbolContextForAddress, but not set decr_pc_and_recompute_addr_range, and therefore fail to find the correct DWARF CFI record. A very simple fix for this problem is to just never use the first method. Call ResolveSymbolContextForAddress with resolve_tail_call_address set to false, which will cause it to fail if the PC is beyond the end of the current function; or else, identify the next function if the PC is also at the start of the next function. In either case, we will then set the decr_pc_and_recompute_addr_range variable and back up the PC anyway, but this time also find the correct DWARF CFI. A related problem is that the ResolveSymbolContextForAddress sometimes returns a "symbol" with empty name. This turns out to be an ELF section symbol. Now, usually those get type eSymbolTypeInvalid. However, there is code in ObjectFileELF::ParseSymbols that tries to change the type of invalid symbols to eSymbolTypeCode or eSymbolTypeData if the symbol lies within the code or data section. Unfortunately, this check also hits the symbol for the code section itself, which is then marked as eSymbolTypeCode. While the size of the section symbol is 0 according to the ELF file, LLDB considers this size invalid and attempts to figure out the "correct" size. Depending on how this goes, we may end up with a symbol that overlays part of the code section, even outside areas covered by real function symbols. Therefore, if we call ResolveSymbolContextForAddress with PC pointing beyond the end of a function, we may get this bogus section symbol. This again means InitializeNonZerothFrame thinks we have a valid PC, but then we don't find any unwind info for it. The fix for this problem is me to simply always leave ELF section symbols as type eSymbolTypeInvalid. Differential Revision: http://reviews.llvm.org/D18975 llvm-svn: 267363
*	[X86][AVX] Added PR24935 test case	Simon Pilgrim	2016-04-24	1	-0/+39
\| \| \| \|	llvm-svn: 267362
*	ARM: fix __chkstk Frame Setup on WoA	Saleem Abdulrasool	2016-04-24	6	-11/+13
\| \| \| \| \| \| \| \| \| \| \| \|	This corrects the MI annotations for the stack adjustment following the __chkstk invocation. We were marking the original SP usage as a Def rather than Kill. The (new) assigned value is the definition, the original reference is killed. Adjust the ISelLowering to mark Kills and FrameSetup as well. This partially resolves PR27480. llvm-svn: 267361
*	Tweak comments to make it clear that these combines are for SSE scalar ↵	Simon Pilgrim	2016-04-24	1	-4/+5
\| \| \| \| \| \|	instructions. llvm-svn: 267360
*	[InstCombine][SSE] Reduce DIVSS/DIVSD to FDIV if only first element is required	Simon Pilgrim	2016-04-24	3	-11/+11
\| \| \| \| \| \|	As discussed on D19318, if we only demand the first element of a DIVSS/DIVSD intrinsic, then reduce to a FDIV call. This matches the existing FADD/FSUB/FMUL patterns. llvm-svn: 267359
*	[ELF] Reinstate 'else' which was previously removed.	Davide Italiano	2016-04-24	1	-1/+2
\| \| \| \| \| \|	It turns out it's actually needed. llvm-svn: 267358
*	[InstCombine][SSE] Demanded vector elements for scalar intrinsics (Part 2 of 2)	Simon Pilgrim	2016-04-24	5	-183/+98
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Split from D17490. This patch improves support for determining the demanded vector elements through SSE scalar intrinsics: 1 - demanded vector element support for unary and some extra binary scalar intrinsics (RCP/RSQRT/SQRT/FRCZ and ADD/CMP/DIV/ROUND). 2 - addss/addsd get simplified to a fadd call if we aren't interested in the pass through elements 3 - if we don't need the lowest element of a scalar operation then just use the first argument (the pass through elements) directly We can add support for propagating demanded elements through any equivalent packed SSE intrinsics in a future patch (these wouldn't use the pass through patterns). Differential Revision: http://reviews.llvm.org/D19318 llvm-svn: 267357
*	[InstCombine][SSE] Demanded vector elements for scalar intrinsics (Part 1 of 2)	Simon Pilgrim	2016-04-24	4	-145/+126
\| \| \| \| \| \| \| \| \| \| \| \|	This patch improves support for determining the demanded vector elements through SSE scalar intrinsics: 1 - recognise that we only need the lowest element of the second input for binary scalar operations (and all the elements of the first input) 2 - recognise that the roundss/roundsd intrinsics use the lowest element of the second input and the remaining elements from the first input Differential Revision: http://reviews.llvm.org/D17490 llvm-svn: 267356
*	[InstCombine] Avoid updating argument demanded elements in separate passes.	Simon Pilgrim	2016-04-24	1	-7/+15
\| \| \| \| \| \|	As discussed on D17490, we should attempt to update an intrinsic's arguments demanded elements in one pass if we can. llvm-svn: 267355
*	Fix typo in comment. NFC	Nick Lewycky	2016-04-24	1	-1/+1
\| \| \| \|	llvm-svn: 267354
*	Remove emacs mode markers from .cpp files. NFC	Nick Lewycky	2016-04-24	2	-2/+2
\| \| \| \| \| \|	.cpp files are unambiguously C++, you only need the mode markers on .h files. llvm-svn: 267353
*	[X86][InstCombine] Tidyup VPERMILVAR -> shufflevector conversion to helper ↵	Simon Pilgrim	2016-04-24	1	-36/+47
\| \| \| \| \| \|	function. NFCI. llvm-svn: 267352
*	[X86][InstCombine] Tidyup PSHUFB -> shufflevector conversion to helper ↵	Simon Pilgrim	2016-04-24	1	-40/+45
\| \| \| \| \| \|	function. NFCI. llvm-svn: 267351
*	[X86][SSE] getTargetShuffleMaskIndices - dropped (unused) UNDEF handling	Simon Pilgrim	2016-04-24	1	-5/+0
\| \| \| \| \| \|	We aren't currently making use of this in any successful mask decode and its actually incorrect as it inserts the wrong number of SM_SentinelUndef mask elements. llvm-svn: 267350
*	[X86][SSE] Use range loop. NFCI.	Simon Pilgrim	2016-04-24	1	-3/+2
\| \| \| \|	llvm-svn: 267349
*	[Lanai] Use EVT::getEVTString() to print a type as a string instead of an ↵	Craig Topper	2016-04-24	1	-1/+1
\| \| \| \| \| \|	enum encoding value. llvm-svn: 267348
*	[X86][SSE] Added SSSE3/AVX/AVX2 BITREVERSE tests	Simon Pilgrim	2016-04-24	1	-52/+14603
\| \| \| \| \| \|	Codegen is pretty bad at the moment but could use PSHUFB quite efficiently llvm-svn: 267347
*	[X86][XOP] Fixed VPPERM permute op decoding (PR27472).	Simon Pilgrim	2016-04-24	2	-2/+2
\| \| \| \| \| \|	Fixed issue with VPPERM target shuffle mask decoding that was incorrectly masking off the 3-bit permute op with a 2-bit mask. llvm-svn: 267346
*	BitcodeReader: Delay metadata parsing until reading a function body	Duncan P. N. Exon Smith	2016-04-24	1	-3/+7
\| \| \| \| \| \| \| \| \| \| \| \|	There's hardly any functionality change here. Instead of calling materializeMetadata on the first call to materialize(GlobalValue*), wait until the first one that's actually going to do something. Noticed by inspection; I don't have a concrete case where this makes a difference. Added an assertion in materializeMetadata to be sure this (or a future change) doesn't delay materializeMetadata after function-level metadata. llvm-svn: 267345
*	[ThinLTO] Remove GlobalValueInfo class from index	Teresa Johnson	2016-04-24	8	-266/+197
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Remove the GlobalValueInfo and change the ModuleSummaryIndex to directly reference summary objects. The info structure was there to support lazy parsing of the combined index summary objects, which is no longer needed and not supported. Reviewers: joker.eph Subscribers: joker.eph, llvm-commits Differential Revision: http://reviews.llvm.org/D19462 llvm-svn: 267344
*	[X86][SSE] Improved support for decoding target shuffle masks through bitcasts	Simon Pilgrim	2016-04-24	3	-33/+29
\| \| \| \| \| \| \| \|	Reused the ability to split constants of a type wider than the shuffle mask to work with masks generated from scalar constants transfered to xmm. This fixes an issue preventing PSHUFB target shuffle masks decoding rematerialized scalar constants and also exposes the XOP VPPERM bug described in PR27472. llvm-svn: 267343
*	ModuleSummaryIndex: Avoid enum bitfields for MSVC portability	Duncan P. N. Exon Smith	2016-04-24	1	-2/+4
\| \| \| \| \| \| \| \|	Enum bitfields have crazy portability issues with MSVC. Use unsigned instead of LinkageTypes here in the ModuleSummaryIndex to address Takumi's concerns from r267335. llvm-svn: 267342
*	Revert "Declare GlobalValue::LinkageTypes based on unsigned."	Duncan P. N. Exon Smith	2016-04-24	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \|	This reverts commit r267335. The build has been broken for hours because of it: http://lab.llvm.org:8080/green/job/clang-stage1-cmake-RA-incremental_build/23352/ The correct fix is avoid using any enum in a bitfield. llvm-svn: 267341
*	[SystemZ] [SSP] Add support for LOAD_STACK_GUARD.	Marcin Koscielnicki	2016-04-24	5	-0/+87
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This fixes PR22248 on s390x. The previous attempt at this was D19101, which was before LOAD_STACK_GUARD existed. Compared to the previous version, this always emits a rather ugly block of 4 instructions, involving a thread pointer load that can't be shared with other potential users. However, this is necessary for SSP - spilling the guard value (or thread pointer used to load it) is counter to the goal, since it could be overwritten along with the frame it protects. Differential Revision: http://reviews.llvm.org/D19363 llvm-svn: 267340
*	[X86][SSE] Demonstrate issue with decoding shuffle masks that have been ↵	Simon Pilgrim	2016-04-24	2	-0/+37
\| \| \| \| \| \| \| \|	lowered as rematerialized constants on scalar unit Found whilst investigating PR27472 llvm-svn: 267339
*	Improve diagnostic checking for va_start to also warn on other instances of ↵	Aaron Ballman	2016-04-24	6	-19/+44
\| \| \| \| \| \| \| \|	undefined behavior, such as a parameter declared with the register keyword in C, or a parameter of a type that undergoes default argument promotion. This helps cover some more of the CERT secure coding rule EXP58-CPP. Pass an object of the correct type to va_start (https://www.securecoding.cert.org/confluence/display/cplusplus/EXP58-CPP.+Pass+an+object+of+the+correct+type+to+va_start). llvm-svn: 267338
*	Silence two C4806 warnings ('\|': unsafe operation: no value of type 'bool' ↵	Aaron Ballman	2016-04-24	1	-2/+2
\| \| \| \| \| \|	promoted to type 'const unsigned int' can equal the given constant). The fact that they trigger with this code seems like it may be a bug, but the warning itself is still generally useful enough to retain it for now. llvm-svn: 267337
*	Remove unused iterators [NFC]	Johannes Doerfert	2016-04-24	1	-13/+0
\| \| \| \|	llvm-svn: 267336
*	Declare GlobalValue::LinkageTypes based on unsigned.	NAKAMURA Takumi	2016-04-24	1	-1/+1
\| \| \| \| \| \|	Or, "LinkageTypes Linkage : 4;" might be sign-extended on msc. llvm-svn: 267335
*	llvm/test/tools/gold/X86/thinlto.ll: Possible fix corresponding to r267318.	NAKAMURA Takumi	2016-04-24	1	-0/+1
\| \| \| \|	llvm-svn: 267334
*	[ELF] Simplify. Remove unneeded else. NFC.	Davide Italiano	2016-04-24	1	-2/+1
\| \| \| \|	llvm-svn: 267333
*	BitcodeReader: Fix some holes in upgrade from r267296	Duncan P. N. Exon Smith	2016-04-24	3	-9/+31
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Add tests for some missing cases to bitcode upgrade in r267296. - DICompositeType with an 'elements:' field, which will cause it to be involved in a cycle after the upgrade. - A DIDerivedType that references a class in 'extraData:'. I updated test/Bitcode/dityperefs-3.8.ll with the missing cases and regenerated test/Bitcode/dityperefs-3.8.ll.bc. llvm-svn: 267332
*	[X86] Merge LowerCTLZ and LowerCTLZ_ZERO_UNDEF into a single function that ↵	Craig Topper	2016-04-24	1	-38/+16
\| \| \| \| \| \|	branches internally for the one difference, allowing the rest of the code to be common. NFC llvm-svn: 267331