bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[SDAG] Make the new zext-vector-inreg node default to expand so targets	Chandler Carruth	2014-07-09	2	-2/+4
\| \| \| \| \| \| \| \| \| \| \|	don't need to set it manually. This is based on feedback from Tom who pointed out that if every target needs to handle this we need to reach out to those maintainers. In fact, it doesn't make sense to duplicate everything when anything other than expand seems unlikely at this stage. llvm-svn: 212661
*	Recommit r212203: Don't try to construct debug LexicalScopes hierarchy for ↵	David Blaikie	2014-07-09	4	-4/+33
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	functions that do not have top level debug information. Reverted by Eric Christopher (Thanks!) in r212203 after Bob Wilson reported LTO issues. Duncan Exon Smith and Aditya Nandakumar helped provide a reduced reproduction, though the failure wasn't too hard to guess, and even easier with the example to confirm. The assertion that the subprogram metadata associated with an llvm::Function matches the scope data referenced by the DbgLocs on the instructions in that function is not valid under LTO. In LTO, a C++ inline function might exist in multiple CUs and the subprogram metadata nodes will refer to the same llvm::Function. In this case, depending on the order of the CUs, the first intance of the subprogram metadata may not be the one referenced by the instructions in that function and the assertion will fail. A test case (test/DebugInfo/cross-cu-linkonce-distinct.ll) is added, the assertion removed and a comment added to explain this situation. Original commit message: If a function isn't actually in a CU's subprogram list in the debug info metadata, ignore all the DebugLocs and don't try to build scopes, track variables, etc. While this is possibly a minor optimization, it's also a correctness fix for an incoming patch that will add assertions to LexicalScopes and the debug info verifier to ensure that all scope chains lead to debug info for the current function. Fix up a few test cases that had broken/incomplete debug info that could violate this constraint. Add a test case where this occurs by design (inlining a debug-info-having function in an attribute nodebug function - we want this to work because /if/ the nodebug function is then inlined into a debug-info-having function, it should be fine (and will work fine - we just stitch the scopes up as usual), but should the inlining not happen we need to not assert fail either). llvm-svn: 212649
*	Decouple llvm::SpecialCaseList text representation and its LLVM IR semantics.	Alexey Samsonov	2014-07-09	4	-56/+57
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Turn llvm::SpecialCaseList into a simple class that parses text files in a specified format and knows nothing about LLVM IR. Move this class into LLVMSupport library. Implement two users of this class: * DFSanABIList in DFSan instrumentation pass. * SanitizerBlacklist in Clang CodeGen library. The latter will be modified to use actual source-level information from frontend (source file names) instead of unstable LLVM IR things (LLVM Module identifier). Remove dependency edge from ClangCodeGen/ClangDriver to LLVMTransformUtils. No functionality change. llvm-svn: 212643
*	Add trunc (select c, a, b) -> select c (trunc a), (trunc b) combine.	Matt Arsenault	2014-07-09	1	-0/+14
\| \| \| \| \| \|	Do this if the truncate is free and the select is legal. llvm-svn: 212640
*	AArch64: Better codegen for storing to __fp16.	Jim Grosbach	2014-07-09	1	-0/+40
\| \| \| \| \| \| \| \| \| \| \| \| \|	Storing will generally be immediately preceded by rounding from an f32 or f64, so make sure to match those patterns directly to convert into the FPR16 register class directly rather than going through the integer GPRs. This also eliminates an extra step in the convert-from-f64 path which was first converting to f32 and then to f16 from there. rdar://17594379 llvm-svn: 212638
*	TargetRegisterInfo: Remove function that fell out of use years ago.	Benjamin Kramer	2014-07-09	2	-19/+0
\| \| \| \|	llvm-svn: 212636
*	[X86] AVX512: Enable it in the Loop Vectorizer	Adam Nemet	2014-07-09	1	-1/+5
\| \| \| \| \| \| \| \| \| \|	This lets us experiment with 512-bit vectorization without passing force-vector-width manually. The code generated for a simple integer memset loop is properly vectorized. Disassembly is still broken for it though :(. llvm-svn: 212634
*	Make AArch64FastISel::EmitIntExt explicitly check its source and destination ↵	Louis Gerbarg	2014-07-09	1	-3/+8
\| \| \| \| \| \| \| \| \| \| \| \|	types This is a follow up to r212492. There should be no functional difference, but this patch makes it clear that SrcVT must be an i1/i8/16/i32 and DestVT must be an i8/i16/i32/i64. rdar://17516686 llvm-svn: 212633
*	Fix for PR20059 (instcombine reorders shufflevector after instruction that ↵	Sanjay Patel	2014-07-09	1	-0/+6
\| \| \| \| \| \| \| \| \| \| \| \|	may trap) In PR20059 ( http://llvm.org/pr20059 ), instcombine eliminates shuffles that are necessary before performing an operation that can trap (srem). This patch calls isSafeToSpeculativelyExecute() and bails out of the optimization in SimplifyVectorOp() if needed. Differential Revision: http://reviews.llvm.org/D4424 llvm-svn: 212629
*	Add Imagination Technologies to the vendors in llvm::Triple	Daniel Sanders	2014-07-09	1	-0/+2
\| \| \| \| \| \| \| \|	Summary: This is a pre-requisite for supporting the mips-img-linux-gnu triple in clang. Differential Revision: http://reviews.llvm.org/D4435 llvm-svn: 212626
*	Generic: add range-adapter for option parsing.	Tim Northover	2014-07-09	1	-17/+13
\| \| \| \| \| \|	I want to use it in lld, but while I'm here I'll update LLVM uses. llvm-svn: 212615
*	[x86] Fix a bug in my new zext-vector-inreg DAG trickery where we were	Chandler Carruth	2014-07-09	2	-0/+36
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	not widening the input type to the node sufficiently to let the ext take place in a register. This would in turn result in a mysterious bitcast assertion failure downstream. First change here is to add back the helpful assert I had in an earlier version of the code to catch this immediately. Next change is to add support to the type legalization to detect when we have widened the operand either too little or too much (for whatever reason) and find a size-matched legal vector type to convert it to first. This can also fail so we get a new fallback path, but that seems OK. With this, we no longer crash on vec_cast2.ll when using widening. I've also added the CHECK lines for the zero-extend cases here. We still need to support sign-extend and trunc (or something) to get plausible code for the other two thirds of this test which is one of the regression tests that showed the most scalarization when widening was force-enabled. Slowly closing in on widening being a viable legalization strategy without it resorting to scalarization at every turn. =] llvm-svn: 212614
*	Sink two variables only used in an assert into the assert itself. Should	Chandler Carruth	2014-07-09	1	-3/+3
\| \| \| \| \| \|	fix the release builds with Werror. llvm-svn: 212612
*	X86: When lowering v8i32 himuls use the correct shuffle masks for AVX2.	Benjamin Kramer	2014-07-09	1	-5/+13
\| \| \| \| \| \| \| \| \| \|	Turns out my trick of using the same masks for SSE4.1 and AVX2 didn't work out as we have to blend two vectors. While there remove unecessary cross-lane moves from the shuffles so the backend can lower it to palignr instead of vperm. Fixes PR20118, a miscompilation of vector sdiv by constant on AVX2. llvm-svn: 212611
*	[x86] Add a ZERO_EXTEND_VECTOR_INREG DAG node and use it when widening	Chandler Carruth	2014-07-09	6	-1/+73
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	vector types to be legal and a ZERO_EXTEND node is encountered. When we use widening to legalize vector types, extend nodes are a real challenge. Either the input or output is likely to be legal, but in many cases not both. As a consequence, we don't really have any way to represent this situation and the prior code in the widening legalization framework would just scalarize the extend operation completely. This patch introduces a new DAG node to represent doing a zero extend of a vector "in register". The core of the idea is to allow legal but different vector types in the input and output. The output vector must have fewer lanes but wider elements. The operation is defined to zero extend the low elements of the input to the size of the output elements, and drop all of the high elements which don't have a corresponding lane in the output vector. It also includes generic expansion of this node in terms of blending a zero vector into the high elements of the vector and bitcasting across. This in turn yields extremely nice code for x86 SSE2 when we use the new widening legalization logic in conjunction with the new shuffle lowering logic. There is still more to do here. We need to support sign extension, any extension, and potentially int-to-float conversions. My current plan is to continue using similar synthetic nodes to model each of these transitions with generic lowering code for each one. However, with this patch LLVM already reaches performance parity with GCC for the core C loops of the x264 code (assuming you disable the hand-written assembly versions) when compiling for SSE2 and SSE3 architectures and enabling the new widening and lowering logic for vectors. Differential Revision: http://reviews.llvm.org/D4405 llvm-svn: 212610
*	[mips][mips64r6] Correct select patterns that have the condition or ↵	Daniel Sanders	2014-07-09	2	-26/+26
\| \| \| \| \| \| \| \| \| \|	true/false values backwards Summary: This bug caused SingleSource/Regression/C/uint64_to_float and SingleSource/UnitTests/2002-05-02-CastTest3 to fail (among others). Differential Revision: http://reviews.llvm.org/D4388 llvm-svn: 212608
*	[mips][mips64r6] Correct cond names in the cmp.cond.[ds] instructions	Daniel Sanders	2014-07-09	2	-41/+42
\| \| \| \| \| \| \| \| \| \|	Summary: It seems we accidentally read the wrong column of the table MIPS64r6 spec and used the names for c.cond.fmt instead of cmp.cond.fmt. Differential Revision: http://reviews.llvm.org/D4387 llvm-svn: 212607
*	[x86] Initialize a pointer to null to fix a bug in r212602.	Chandler Carruth	2014-07-09	1	-1/+1
\| \| \| \| \| \| \|	This should restore GCC hosts (which happen to put the bad stuff into the pointer) and MSan, etc. llvm-svn: 212606
*	[mips][mips64r6] Use JALR for indirect branches instead of JR (which is not ↵	Daniel Sanders	2014-07-09	5	-25/+47
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	available on MIPS32r6/MIPS64r6) Summary: This completes the change to use JALR instead of JR on MIPS32r6/MIPS64r6. Reviewers: jkolek, vmedic, zoran.jovanovic, dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D4269 llvm-svn: 212605
*	[mips][mips64r6] Use JALR for returns instead of JR (which is not available ↵	Daniel Sanders	2014-07-09	10	-26/+103
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	on MIPS32r6/MIPS64r6) Summary: RET, and RET_MM have been replaced by a pseudo named PseudoReturn. In addition a version with a 64-bit GPR named PseudoReturn64 has been added. Instruction selection for a return matches RetRA, which is expanded post register allocation to PseudoReturn/PseudoReturn64. During MipsAsmPrinter, this PseudoReturn/PseudoReturn64 are emitted as: - (JALR64 $zero, $rs) on MIPS64r6 - (JALR $zero, $rs) on MIPS32r6 - (JR_MM $rs) on microMIPS - (JR $rs) otherwise On MIPS32r6/MIPS64r6, 'jr $rs' is an alias for 'jalr $zero, $rs'. To aid development and review (specifically, to ensure all cases of jr are updated), these aliases are temporarily named 'r6.jr' instead of 'jr'. A follow up patch will change them back to the correct mnemonic. Added (JALR $zero, $rs) to MipsNaClELFStreamer's definition of an indirect jump, and removed it from its definition of a call. Note: I haven't accounted for MIPS64 in MipsNaClELFStreamer since it's doesn't appear to account for any MIPS64-specifics. The return instruction created as part of eh_return expansion is now expanded using expandRetRA() so we use the right return instruction on MIPS32r6/MIPS64r6 ('jalr $zero, $rs'). Also, fixed a misuse of isABI_N64() to detect 64-bit wide registers in expandEhReturn(). Reviewers: jkolek, vmedic, mseaborn, zoran.jovanovic, dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D4268 llvm-svn: 212604
*	[x86] Re-apply a variant of the x86 side of r212324 now that the rest	Chandler Carruth	2014-07-09	1	-74/+70
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	has settled without incident, removing the x86-specific and overly strict 'isVectorSplat' routine in favor of generic and more powerful splat detection. The primary motivation and result of this is that the x86 backend can now see through splats which contain undef elements. This is essential if we are using a widening form of legalization and I've updated a test case to also run in that mode as before this change the generated code for the test case was completely scalarized. This version of the patch much more carefully handles the undef lanes. - We aren't overly conservative about them in the shift lowering (where we will never use the splat itself). - One place where the splat would have been re-used by the existing code now explicitly constructs a new constant splat that will be safe. - The broadcast lowering is much more reasonable with undefs by doing a correct check of whether the splat is the only user of a loaded value, checking that the splat actually crosses multiple lanes before using a broadcast, and handling broadcasts of non-constant splats. As a consequence of the last bullet, the weird usage of vpshufd instead of vbroadcast is gone, and we actually can lower an AVX splat with vbroadcastss where before we emitted a really strange pattern of a vector load and a manual splat across the vector. llvm-svn: 212602
*	[ASan/Win] Don't instrument COMDAT globals. Properly fixes PR20244.	Timur Iskhodzhanov	2014-07-09	1	-8/+4
\| \| \| \|	llvm-svn: 212596
*	SourceMgr: consistently use 'unsigned' for the memory buffer ID type	Dmitri Gribenko	2014-07-09	1	-3/+3
\| \| \| \|	llvm-svn: 212595
*	[SDAG] At the suggestion of Hal, switch to an output parameter that	Chandler Carruth	2014-07-09	3	-22/+27
\| \| \| \| \| \| \| \| \| \|	tracks which elements of the build vector are in fact undef. This should make actually inpsecting them (likely in my next patch) reasonably pretty. Also makes the output parameter optional as it is clear now that most users are happy with undefs in their splats. llvm-svn: 212581
*	MipsTargetStreamer.h: Avoid "using" to appease msc17.	NAKAMURA Takumi	2014-07-08	1	-1/+1
\| \| \| \|	llvm-svn: 212577
*	AArch64: Better codegen for loading from __fp16.	Jim Grosbach	2014-07-08	1	-0/+35
\| \| \| \| \| \| \| \| \| \| \| \| \|	Loading will generally extend to an f32 or an 64, so make sure to match those patterns directly to load into the FPR16 register class directly rather than going through the integer GPRs. This also eliminates an extra step in the convert-to-f64 path which was first converting to f32 and then to f64 from there. rdar://17594379 llvm-svn: 212573
*	Improve BasicAA CS-CS queries	Hal Finkel	2014-07-08	3	-126/+142
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	BasicAA contains knowledge of certain intrinsics, such as memcpy and memset, and uses that information to form more-accurate answers to CallSite vs. Loc ModRef queries. Unfortunately, it did not use this information when answering CallSite vs. CallSite queries. Generically, when an intrinsic takes one or more pointers and the intrinsic is marked only to read/write from its arguments, the offset/size is unknown. As a result, the generic code that answers CallSite vs. CallSite (and CallSite vs. Loc) queries in AA uses UnknownSize when forming Locs from an intrinsic's arguments. While BasicAA's CallSite vs. Loc override could use more-accurate size information for some intrinsics, it did not do the same for CallSite vs. CallSite queries. This change refactors the intrinsic-specific logic in BasicAA into a generic AA query function: getArgLocation, which is overridden by BasicAA to supply the intrinsic-specific knowledge, and used by AA's generic implementation. This allows the intrinsic-specific knowledge to be used by both CallSite vs. Loc and CallSite vs. CallSite queries, and simplifies the BasicAA implementation. Currently, only one function, Mac's memset_pattern16, is handled by BasicAA (all the rest are intrinsics). As a side-effect of this refactoring, BasicAA's getModRefBehavior override now also returns OnlyAccessesArgumentPointees for this function (which is an improvement). llvm-svn: 212572
*	Add support for BSD format Archive map symbols (aka the table of contents	Kevin Enderby	2014-07-08	1	-6/+63
\| \| \| \| \| \|	from a __.SYMDEF or "__.SYMDEF SORTED" archive member). llvm-svn: 212568
*	Revert "GlobalDCE: Delete available_externally initializers if it allows ↵	Pete Cooper	2014-07-08	1	-40/+3
\| \| \| \| \| \| \| \| \| \|	removing the value the initializer is referring to." This reverts commit 5b55a47e94e28fbb56d0cd5d72c3db9105c15b4c. A test case was found to crash after this was applied. I'll file a bug to track fixing this with the test case needed. llvm-svn: 212550
*	[PowerPC] Implement atomic NAND operations as actual NAND	Ulrich Weigand	2014-07-08	1	-4/+4
\| \| \| \| \| \| \| \| \| \| \|	This changes the implementation of atomic NAND operations from "a & ~b" (compatible with GCC < 4.4) to actual "~(a & b)" (compatible with GCC >= 4.4). This is in line with the common-code and ARM back-end change implemented in r212433. llvm-svn: 212547
*	[DAG] Teach how to combine a pair of shuffles into a single shuffle if the ↵	Andrea Di Biagio	2014-07-08	1	-3/+21
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	resulting mask is legal. This patch teaches how to fold a shuffle according to rule: shuffle (shuffle (x, undef, M0), undef, M1) -> shuffle(x, undef, M2) We do this only if the resulting mask M2 is legal; this is to avoid introducing illegal shuffles that are potentially expanded into a sub-optimal sequence of target specific dag nodes. This patch has the advantage of being target independent, since it works on ISD nodes. Therefore, all targets (not only x86) can take advantage of this rule. The idea behind this patch is that most shuffle pairs can be safely combined before we run the legalizer on vector operations. This allows us to combine/simplify dag nodes earlier in the process and not only immediately before instruction selection stage. That said. This patch is not meant to replace any existing target specific combine rules; backends might still introduce new shuffles during legalization stage. Also, this rule is very simple and avoids to aggressively optimize shuffles. llvm-svn: 212539
*	Fix some Twine locals.	Benjamin Kramer	2014-07-08	3	-17/+18
\| \| \| \| \| \|	Two of those are use after frees. Found by clang-tidy, fixed by me. llvm-svn: 212537
*	[ASan/Win] Don't instrument private COMDAT globals until PR20244 is properly ↵	Timur Iskhodzhanov	2014-07-08	1	-0/+7
\| \| \| \| \| \|	fixed llvm-svn: 212530
*	[mips] Fixed struct/class mismatch introduced in r212522.	Daniel Sanders	2014-07-08	1	-1/+1
\| \| \| \| \| \|	Clang emits a warning about this. llvm-svn: 212528
*	Fix r212522 - [mips] Improve encapsulation of the .MIPS.abiflags ↵	Daniel Sanders	2014-07-08	1	-0/+3
\| \| \| \| \| \| \| \|	implementation and limit scope of related enums Added two lines that should have been in r212522. llvm-svn: 212523
*	[mips] Improve encapsulation of the .MIPS.abiflags implementation and limit ↵	Daniel Sanders	2014-07-08	7	-298/+353
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	scope of related enums Summary: Follow on to r212519 to improve the encapsulation and limit the scope of the enums. Also merged two very similar parser functions, fixed a bug where ASE's were not being reported, and marked CPR1's as being 128-bit when MSA is enabled. Differential Revision: http://reviews.llvm.org/D4384 llvm-svn: 212522
*	Revert "Refactor ARM subarchitecture parsing"	Renato Golin	2014-07-08	2	-101/+82
\| \| \| \| \| \|	This reverts commit 7b4a6882467e7fef4516a0cbc418cbfce0fc6f6d. llvm-svn: 212521
*	Truncate the immediate in logical operation to the register width	Arnaud A. de Grandmaison	2014-07-08	1	-2/+7
\| \| \| \| \| \|	And continue to produce an error if the 32 most significant bits are not all ones or zeros. llvm-svn: 212520
*	Mips.abiflags is a new implicitly generated section that will be present on ↵	Vladimir Medic	2014-07-08	6	-65/+528
\| \| \| \| \| \|	all new modules. The section contains a versioned data structure which represents essentially information to allow a program loader to determine the requirements of the application. This patch implements mips.abiflags section and provides test cases for it. llvm-svn: 212519
*	[x86,SDAG] Sink the logic for folding shuffles of splats more	Chandler Carruth	2014-07-08	2	-46/+35
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	aggressively from the x86 shuffle lowering to the generic SDAG vector shuffle formation code. This code already tried to fold away shuffles of splats! It just had lots of bugs and couldn't handle the case my new x86 shuffle lowering needed. First, it failed to correctly compute whether N2 was undef because it pre-computed this, then did transformations which could make N2 undef, then failed to ever re-consider the precomputed state. Second, it didn't look through bitcasts at all, even in the safe cases where they are just element-type bitcasts with no change to the number of elements. Third, it didn't handle all-zero bit casts nicely the way my code in the x86 side of things did, which is essential to getting good zext-shuffle lowerings. But all of these are generic. I just ported the code down to this layer and fixed the surrounding bugs. Tests exercising this in the x86 backend still pass and some silly code in widen_cast-6.ll gets better. I updated that test to be a bit more precise but it's still pretty unclear what the value of the test is in this day and age. llvm-svn: 212517
*	[SDAG] Actually check for a non-constant splat and clarify comments	Chandler Carruth	2014-07-08	1	-4/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	around the handling of UNDEF lanes in boolean vector content analysis. The code before my changes here also failed to check for non-constant splats in a buildvector. I have no idea how to trigger this, I just spotted by inspection when trying to understand the code. It seems extremely unlikely to be worth the trouble to teach the only caller of this code (DAG combining setcc patterns) how to cleverly handle undef lanes, so I've just commented more thoroughly that we're giving up there. llvm-svn: 212515
*	[SDAG] Build up a more rich set of APIs for querying build-vector SDAG	Chandler Carruth	2014-07-08	3	-13/+47
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	nodes about whether they are splats. This is factored out and improved from r212324 which got reverted as it was far too aggressive. The new API should help more conservatively handle buildvectors that are a mixture of splatted and undef values. No functionality change at this point. The hope is to slowly re-introduce the undef-tolerant optimization of splats, but each time being forced to make a concious decision about how to handle the undefs in a way that doesn't lead to contradicting assumptions about the collapsed value. Hal has pointed out in discussions that this may not end up being the desired API and instead it may be more convenient to get a mask of the undef elements or something similar. I'm starting simple and will expand the API as I adapt actual callers and see exactly what they need. llvm-svn: 212514
*	[ASan] Completely remove sanitizer blacklist file from instrumentation pass.	Alexey Samsonov	2014-07-08	1	-17/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	All blacklisting logic is now moved to the frontend (Clang). If a function (or source file it is in) is blacklisted, it doesn't get sanitize_address attribute and is therefore not instrumented. If a global variable (or source file it is in) is blacklisted, it is reported to be blacklisted by the entry in llvm.asan.globals metadata, and is not modified by the instrumentation. The latter may lead to certain false positives - not all the globals created by Clang are described in llvm.asan.globals metadata (e.g, RTTI descriptors are not), so we may start reporting errors on them even if "module" they appear in is blacklisted. We assume it's fine to take such risk: 1) errors on these globals are rare and usually indicate wild memory access 2) we can lazily add descriptors for these globals into llvm.asan.globals lazily. llvm-svn: 212505
*	[X86] AVX512: Only allow k1-k7 as predicates to vpcmp*	Adam Nemet	2014-07-08	1	-7/+7
\| \| \| \| \| \| \| \| \|	As destination k0 is allowed but not as predicate/writemask. I also modified the test to allow checking of error messages by the assembler. I applied a similar approach to the test ret.s in the same directory. llvm-svn: 212504
*	Kill unnecessary include	Alexey Samsonov	2014-07-08	1	-1/+0
\| \| \| \|	llvm-svn: 212503
*	[x86] Fix assertion failure caused by a wrong combine of PSHUFD nodes with ↵	Andrea Di Biagio	2014-07-07	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	different types. When combining a sequence of two PSHUFD dag nodes into a single PSHUFD, make sure that we assign the correct type to the resulting PSHUFD. X86ISD::PSHUFD dag nodes can be either MVT::v4i32 or MVT::v4f32. Before this change, an assertion failure was triggered in method 'DAGCombinerInfo::CombineTo' when trying to combine the shuffles from the test below into a single PSHUFD. define <4 x float> @test1(<4 x float> %V) { %1 = shufflevector <4 x float> %V, <4 x float> undef, <4 x i32> <i32 3, i32 0, i32 2, i32 1> %2 = shufflevector <4 x float> %1, <4 x float> undef, <4 x i32> <i32 3, i32 0, i32 2, i32 1> ret <4 x float> %2 } llvm-svn: 212498
*	fixed some typos	Sanjay Patel	2014-07-07	1	-4/+4
\| \| \| \|	llvm-svn: 212495
*	[FastISel][X86] Fix smul.with.overflow.i8 lowering.	Juergen Ributzka	2014-07-07	1	-3/+19
\| \| \| \| \| \| \| \| \| \| \|	Add custom lowering code for signed multiply instruction selection, because the default FastISel instruction selection for ISD::MUL will use unsigned multiply for the i8 type and signed multiply for all other types. This would set the incorrect flags for the overflow check. This fixes <rdar://problem/17549300> llvm-svn: 212493
*	Allow AArch64FastISel to degrade graceully in the presence of an MVT::i128	Louis Gerbarg	2014-07-07	1	-0/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Currently AArch64FastISel crashes if it tries to extend an integer into an MVT::i128. This can happen by creating 128 bit integers like so: typedef unsigned int uint128_t __attribute__((mode(TI))); typedef int sint128_t __attribute__((mode(TI))); This patch makes EmitIntExt check for their presence and then falls back to SelectionDAG. Tests included. rdar://17516686 llvm-svn: 212492
*	Fix for PR17073 ( http://llvm.org/pr17073 ), simplifycfg illegally hoists an ↵	Sanjay Patel	2014-07-07	1	-3/+20
\| \| \| \| \| \| \| \| \| \|	operation in a phi node that can trap. This patch adds to an existing loop over phi nodes in SimplifyCondBranchToCondBranch() to check for trapping ops and bails out of the optimization if we find one of those. The test cases verify that trapping ops are not hoisted and non-trapping ops are still optimized as expected. llvm-svn: 212490