bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	Fix some build errors and warnings.	Zachary Turner	2017-05-18	1	-1/+1
\| \| \| \|	llvm-svn: 303391
*	[CodeView] Raise the source to ID map out of the TypeStreamMerger.	Zachary Turner	2017-05-18	1	-4/+8
\| \| \| \| \| \| \|	This map will be needed to rewrite symbol streams after re-writing the corresponding type streams. llvm-svn: 303390
*	[llvm-pdbdump] Add the ability to merge PDBs.	Zachary Turner	2017-05-18	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Merging PDBs is a feature that will be used heavily by the linker. The functionality already exists but does not have deep test coverage because it's not easily exposed through any tools. This patch aims to address that by adding the ability to merge PDBs via llvm-pdbdump. It takes arbitrarily many PDBs and outputs a single PDB. Using this new functionality, a test is added for merging type records. Future patches will add the ability to merge symbol records, module information, etc. llvm-svn: 303389
*	[CodeView] Provide a common interface for type collections.	Zachary Turner	2017-05-18	14	-282/+541
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Right now we have multiple notions of things that represent collections of types. Most commonly used are TypeDatabase, which is supposed to keep mappings from TypeIndex to type name when reading a type stream, which happens when reading PDBs. And also TypeTableBuilder, which is used to build up a collection of types dynamically which we will later serialize (i.e. when writing PDBs). But often you just want to do some operation on a collection of types, and you may want to do the same operation on any kind of collection. For example, you might want to merge two TypeTableBuilders or you might want to merge two type streams that you loaded from various files. This dichotomy between reading and writing is responsible for a lot of the existing code duplication and overlapping responsibilities in the existing CodeView library classes. For example, after building up a TypeTableBuilder with a bunch of type records, if we want to dump it we have to re-invent a bunch of extra glue because our dumper takes a TypeDatabase or a CVTypeArray, which are both incompatible with TypeTableBuilder. This patch introduces an abstract base class called TypeCollection which is shared between the various type collection like things. Wherever we previously stored a TypeDatabase& in some common class, we now store a TypeCollection&. The advantage of this is that all the details of how the collection are implemented, such as lazy deserialization of partial type streams, is completely transparent and you can just treat any collection of types the same regardless of where it came from. Differential Revision: https://reviews.llvm.org/D33293 llvm-svn: 303388
*	[NewGVN] Replace predicate info leftovers.	Davide Italiano	2017-05-18	1	-0/+6
\| \| \| \| \| \| \|	This time with an additional fix, i.e. we remove the dead @llvm.ssa.copy instruction. llvm-svn: 303385
*	[InstCombine] add helper to foldXorOfICmps(); NFCI	Sanjay Patel	2017-05-18	2	-41/+48
\| \| \| \| \| \| \| \| \| \| \|	Also, fix the old-style capitalization of the related functions and move them to the 'private' section of the class since they are just helpers of the visit* functions. As shown in the post-commit comments for D32143, we are missing folds for xor-of-icmps. llvm-svn: 303381
*	Revert r303375 "LLVM_FALLTHROUGH instead of fall-through comment."	Rui Ueyama	2017-05-18	1	-1/+1
\| \| \| \| \| \|	This reverts commit r303375 since it didn't compile. llvm-svn: 303377
*	LLVM_FALLTHROUGH instead of fall-through comment.	Galina Kistanova	2017-05-18	1	-1/+1
\| \| \| \|	llvm-svn: 303375
*	Revert r302938 "Add LiveRangeShrink pass to shrink live range within BB."	Hans Wennborg	2017-05-18	4	-214/+0
\| \| \| \| \| \| \| \| \|	This also reverts follow-ups r303292 and r303298. It broke some Chromium tests under MSan, and apparently also internal tests at Google. llvm-svn: 303369
*	Reduce gcc-7 warnings by fall-through comments.	Galina Kistanova	2017-05-18	1	-1/+1
\| \| \| \|	llvm-svn: 303365
*	[IR] De-virtualize ~Value to save a vptr	Reid Kleckner	2017-05-18	24	-120/+69
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Implements PR889 Removing the virtual table pointer from Value saves 1% of RSS when doing LTO of llc on Linux. The impact on time was positive, but too noisy to conclusively say that performance improved. Here is a link to the spreadsheet with the original data: https://docs.google.com/spreadsheets/d/1F4FHir0qYnV0MEp2sYYp_BuvnJgWlWPhWOwZ6LbW7W4/edit?usp=sharing This change makes it invalid to directly delete a Value, User, or Instruction pointer. Instead, such code can be rewritten to a null check and a call Value::deleteValue(). Value objects tend to have their lifetimes managed through iplist, so for the most part, this isn't a big deal. However, there are some places where LLVM deletes values, and those places had to be migrated to deleteValue. I have also created llvm::unique_value, which has a custom deleter, so it can be used in place of std::unique_ptr<Value>. I had to add the "DerivedUser" Deleter escape hatch for MemorySSA, which derives from User outside of lib/IR. Code in IR cannot include MemorySSA headers or call the MemoryAccess object destructors without introducing a circular dependency, so we need some level of indirection. Unfortunately, no class derived from User may have any virtual methods, because adding a virtual method would break User::getHungOffOperands(), which assumes that it can find the use list immediately prior to the User object. I've added a static_assert to the appropriate OperandTraits templates to help people avoid this trap. Reviewers: chandlerc, mehdi_amini, pete, dberlin, george.burgess.iv Reviewed By: chandlerc Subscribers: krytarowski, eraman, george.burgess.iv, mzolotukhin, Prazek, nlewycky, hans, inglorion, pcc, tejohnson, dberlin, llvm-commits Differential Revision: https://reviews.llvm.org/D31261 llvm-svn: 303362
*	[LSR] Call canonicalize after we generate a new Formula in ↵	Wei Mi	2017-05-18	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	GenerateTruncates. Fix PR33077. The testcase in PR33077 generates a LSR Use Formula with two SCEVAddRecExprs for the same loop. Such uncommon formula will become non-canonical after GenerateTruncates adds sign extension to the ScaledReg of the Formula, and it will break the assertion that every Formula to be inserted is canonical. The fix is to call canonicalize for the raw Formula generated by GenerateTruncates before inserting it. llvm-svn: 303361
*	[LegacyPassManager] Remove TargetMachine constructors	Francis Visoiu Mistrih	2017-05-18	45	-293/+293
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This provides a new way to access the TargetMachine through TargetPassConfig, as a dependency. The patterns replaced here are: * Passes handling a null TargetMachine call `getAnalysisIfAvailable<TargetPassConfig>`. * Passes not handling a null TargetMachine `addRequired<TargetPassConfig>` and call `getAnalysis<TargetPassConfig>`. * MachineFunctionPasses now use MF.getTarget(). * Remove all the TargetMachine constructors. * Remove INITIALIZE_TM_PASS. This fixes a crash when running `llc -start-before prologepilog`. PEI needs StackProtector, which gets constructed without a TargetMachine by the pass manager. The StackProtector pass doesn't handle the case where there is no TargetMachine, so it segfaults. Related to PR30324. Differential Revision: https://reviews.llvm.org/D33222 llvm-svn: 303360
*	Fix some minor issues in PDB parsing library.	Zachary Turner	2017-05-18	2	-11/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	1) Until now I'd never seen a valid PDB where the DBI stream and the PDB Stream disagreed on the "Age" field. Because of that, we had code to assert that they matched. Recently though I was given a PDB where they disagreed, so this assumption has proven to be incorrect. Remove this check. 2) We were walking the entire list of hash values for types up front and then throwing away the values. For large PDBs this was a significant slow down. Remove this. With this patch, I can dump the list of all compilands from a 1.5GB PDB file in just a few seconds. llvm-svn: 303351
*	[JumpThreading] Dont RAUW condition incorrectly	Anna Thomas	2017-05-18	1	-17/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: We have a bug when RAUWing the condition if experimental.guard or assumes is a use of that condition. This is because LazyValueInfo may have used the guards/assumes to identify the value of the condition at the end of the block. RAUW replaces the uses at the guard/assume as well as uses before the guard/assume. Both of these are incorrect. For now, disable RAUW for conditions and fix the logic as a next step: https://reviews.llvm.org/D33257 Reviewers: sanjoy, reames, trentxintong Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D33279 llvm-svn: 303349
*	[AMDGPU] SDWA operands should not intersect with potential MIs	Sam Kolton	2017-05-18	1	-13/+32
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: There should be no intesection between SDWA operands and potential MIs. E.g.: ``` v_and_b32 v0, 0xff, v1 -> src:v1 sel:BYTE_0 v_and_b32 v2, 0xff, v0 -> src:v0 sel:BYTE_0 v_add_u32 v3, v4, v2 ``` In that example it is possible that we would fold 2nd instruction into 3rd (v_add_u32_sdwa) and then try to fold 1st instruction into 2nd (that was already destroyed). So if SDWAOperand is also a potential MI then do not apply it. Reviewers: vpykhtin, arsenm Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye Differential Revision: https://reviews.llvm.org/D32804 llvm-svn: 303347
*	[MVT] add v1i1 MVT	Guy Blank	2017-05-18	1	-0/+2
\| \| \| \| \| \| \| \|	Adds the v1i1 MVT as a preparation for another commit (https://reviews.llvm.org/D32273) Differential Revision: https://reviews.llvm.org/D32540 llvm-svn: 303346
*	[GlobalISel][X86] G_ADD/G_SUB vector legalizer/selector support.	Igor Breger	2017-05-18	1	-1/+18
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: G_ADD/G_SUB vector legalizer/selector support. Reviewers: zvi, guyblank Reviewed By: guyblank Subscribers: rovka, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D33232 llvm-svn: 303345
*	[X86][AVX512] Add 512-bit vector ctpop costs + tests	Simon Pilgrim	2017-05-18	1	-0/+6
\| \| \| \|	llvm-svn: 303342
*	Re-commit: [globalisel][tablegen] Import rules containing intrinsic_wo_chain.	Daniel Sanders	2017-05-18	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: As of this patch, 1018 out of 3938 rules are currently imported. Depends on D32275 Reviewers: qcolombet, kristof.beyls, rovka, t.p.northover, ab, aditya_nandakumar Reviewed By: qcolombet Subscribers: dberris, igorb, llvm-commits Differential Revision: https://reviews.llvm.org/D32278 The previous commit failed on test-suite/Bitcode/simd_ops/AArch64_halide_runtime.bc because isImmOperandEqual() assumed MO was a register operand and that's not always true. llvm-svn: 303341
*	[SCEV][NFC] Remove duplication of isLoopInvariant code	Max Kazantsev	2017-05-18	1	-2/+2
\| \| \| \| \| \| \| \| \|	Replace two places that duplicate the code of isLoopInvariant method with the invocation of this method. Differential Revision: https://reviews.llvm.org/D33313 llvm-svn: 303336
*	[DWARF] - Simplify RelocVisitor implementation.	George Rimar	2017-05-18	1	-2/+2
\| \| \| \| \| \| \| \| \|	We do not need to store relocation width field. Patch removes relative code, that simplifies implementation. Differential revision: https://reviews.llvm.org/D33274 llvm-svn: 303335
*	[X86] Replace slow LEA instructions in X86	Lama Saba	2017-05-18	5	-43/+238
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	According to Intel's Optimization Reference Manual for SNB+: " For LEA instructions with three source operands and some specific situations, instruction latency has increased to 3 cycles, and must dispatch via port 1: - LEA that has all three source operands: base, index, and offset - LEA that uses base and index registers where the base is EBP, RBP,or R13 - LEA that uses RIP relative addressing mode - LEA that uses 16-bit addressing mode " This patch currently handles the first 2 cases only. Differential Revision: https://reviews.llvm.org/D32277 llvm-svn: 303333
*	[lib/Object] - Minor API update for llvm::Decompressor.	George Rimar	2017-05-18	2	-6/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	I revisited Decompressor API (issue with it was triggered during D32865 review) and found it is probably provides more then we really need. Issue was about next method's signature: Error decompress(SmallString<32> &Out); It is too strict. At first I wanted to change it to decompress(SmallVectorImpl<char> &Out), but then found it is still not flexible because sticks to SmallVector. During reviews was suggested to use templating to simplify code. Patch do that. Differential revision: https://reviews.llvm.org/D33200 llvm-svn: 303331
*	[BPI] Reduce the probability of unreachable edge to minimal value greater than 0	Serguei Katkov	2017-05-18	1	-40/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The probability of edge coming to unreachable block should be as low as possible. The change reduces the probability to minimal value greater than zero. The bug https://bugs.llvm.org/show_bug.cgi?id=32214 show the example when the probability of edge coming to unreachable block is greater than for edge coming to out of the loop and it causes incorrect loop rotation. Please note that with this change the behavior of unreachable heuristic is a bit different than others. Specifically, before this change the sum of probabilities coming to unreachable blocks have the same weight for all branches (it was just split over all edges of this block coming to unreachable blocks). With this change it might be slightly different but not to much due to probability of taken branch to unreachable block is really small. Reviewers: chandlerc, sanjoy, vsk, congh, junbuml, davidxl, dexonsmith Reviewed By: chandlerc, dexonsmith Subscribers: reames, llvm-commits Differential Revision: https://reviews.llvm.org/D30633 llvm-svn: 303327
*	[ThinLTO] Do not assert when adding a module with a different but	Akira Hatanaka	2017-05-18	3	-44/+38
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	compatible target triple Currently, an assertion fails in ThinLTOCodeGenerator::addModule when the target triple of the module being added doesn't match that of the one stored in TMBuilder. This patch relaxes the constraint and makes changes to allow target triples that only differ in their version numbers on Apple platforms, similarly to what r228999 did. rdar://problem/30133904 Differential Revision: https://reviews.llvm.org/D33291 llvm-svn: 303326
*	[Target/X86] Remove unneeded return. NFCI.	Davide Italiano	2017-05-18	1	-3/+1
\| \| \| \|	llvm-svn: 303323
*	[Statistics] Add a method to atomically update a statistic that contains a ↵	Craig Topper	2017-05-18	3	-10/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	maximum Summary: There are several places in the codebase that try to calculate a maximum value in a Statistic object. We currently do this in one of two ways: MaxNumFoo = std::max(MaxNumFoo, NumFoo); or MaxNumFoo = (MaxNumFoo > NumFoo) ? MaxNumFoo : NumFoo; The first version reads from MaxNumFoo one time and uncontionally rwrites to it. The second version possibly reads it twice depending on the result of the first compare. But we have no way of knowing if the value was changed by another thread between the reads and the writes. This patch adds a method to the Statistic object that can ensure that we only store if our value is the max and the previous max didn't change after we read it. If it changed we'll recheck if our value should still be the max or not and try again. This spawned from an audit I'm trying to do of all places we uses the implicit conversion to unsigned on the Statistics objects. See my previous thread on llvm-dev https://groups.google.com/forum/#!topic/llvm-dev/yfvxiorKrDQ Reviewers: dberlin, chandlerc, hfinkel, dblaikie Reviewed By: chandlerc Subscribers: llvm-commits, sanjoy Differential Revision: https://reviews.llvm.org/D33301 llvm-svn: 303318
*	CodeGen: BlockPlacement: Add Message strings to asserts. NFC	Kyle Butt	2017-05-17	1	-16/+29
\| \| \| \| \| \| \| \|	Add message strings to all the unlabeled asserts in the file. Differential Revision: https://reviews.llvm.org/D33078 llvm-svn: 303316
*	[Statistics] Use Statistic::operator+= instead of adding and assigning ↵	Craig Topper	2017-05-17	2	-2/+2
\| \| \| \| \| \| \| \|	separately. I believe this technically fixes a multithreaded race condition in this code. But my primary concern was as part of looking at removing the ability to treat Statistics like a plain unsigned. There are many weird operations on Statistics in the codebase. llvm-svn: 303314
*	[InstCombine] handle icmp i1 X, C early to avoid creating an unknown pattern	Sanjay Patel	2017-05-17	1	-0/+23
\| \| \| \| \| \| \| \| \| \| \|	The missing optimization for xor-of-icmps still needs to be added, but by being more efficient (not generating unnecessary logic ops with constants) we avoid the bug. See discussion in post-commit comments: https://reviews.llvm.org/D32143 llvm-svn: 303312
*	[InstCombine] move icmp bool canonicalizations to helper; NFC	Sanjay Patel	2017-05-17	1	-43/+54
\| \| \| \| \| \| \| \|	As noted in the post-commit comments in D32143, we should be catching the constant operand cases sooner to be more efficient and less likely to expose a missing fold. llvm-svn: 303309
*	AMDGPU: Start defining a calling convention	Matt Arsenault	2017-05-17	22	-116/+461
\| \| \| \| \| \| \| \|	Partially implement callee-side for arguments and return values. byval doesn't work properly, and most likely sret or other on-stack return values most as well. llvm-svn: 303308
*	CodeGen: Power: Add lowering for shifts of v1i128.	Kyle Butt	2017-05-17	2	-0/+23
\| \| \| \| \| \| \| \| \| \| \| \|	When legalizing vector operations on vNi128, they will be split to v1i128 because that is a legal type on ppc64, but then the compiler will crash in selection dag because it fails to select for these operations. This patch fixes shift operations. Logical shift right and left shift can be performed in the vector unit, but algebraic shift right requires being split. Differential Revision: https://reviews.llvm.org/D32774 llvm-svn: 303307
*	Fix PR33028	Michael Liao	2017-05-17	2	-13/+39
\| \| \| \| \| \| \| \| \| \| \| \|	- '-verify-mahcineinstrs' starts to complain allocatable live-in physical registers on non-entry or non-landing-pad basic blocks. - Refactor the XBEGIN translation to define EAX on a dedicated fallback code path due to XABORT. Add a pseudo instruction to define EAX explicitly to avoid add physical register live-in. Differential Revision: https://reviews.llvm.org/D33168 llvm-svn: 303306
*	AMDGPU: Expand frame indexes to be relative to scratch wave offset	Matt Arsenault	2017-05-17	1	-6/+71
\| \| \| \| \| \| \| \| \| \| \| \|	In order for an arbitrary callee to access an object in a caller's stack frame, the 32-bit offset used as the private pointer needs to be relative to the kernel's scratch wave offset register. Convert to this by finding the difference from the current stack frame and scaling by the wavefront size. llvm-svn: 303303
*	AMDGPU: Change mubuf soffset register when SP relative	Matt Arsenault	2017-05-17	2	-15/+53
\| \| \| \| \| \| \| \| \| \|	Check the MachinePointerInfo for whether the access is supposed to be relative to the stack pointer. No tests because this is used in later commits implementing calls. llvm-svn: 303301
*	[X86][AVX512] Add 512-bit vector ctlz costs + tests	Simon Pilgrim	2017-05-17	1	-0/+24
\| \| \| \|	llvm-svn: 303300
*	[llvm-pdbdump] in yaml2pdb, generate default output filename if none given	Bob Haarman	2017-05-17	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: llvm-pdbdump yaml2pdb used to fail with a misleading error message ("An I/O error occurred on the file system") if no output file was specified. This change adds an assert to PDBFileBuilder to check that an output file name is specified, and makes llvm-pdbdump generate an output file name based on the input file name if no output file name is explicitly specified. Reviewers: amccarth, zturner Reviewed By: zturner Subscribers: fhahn, llvm-commits Differential Revision: https://reviews.llvm.org/D33296 llvm-svn: 303299
*	Add some helpers for manipulating BinaryStreamRefs.	Zachary Turner	2017-05-17	1	-0/+5
\| \| \| \|	llvm-svn: 303297
*	AMDGPU: Make better use of op_sel with high components	Matt Arsenault	2017-05-17	2	-8/+57
\| \| \| \| \| \|	Handle more general swizzles. llvm-svn: 303296
*	[InstSimplify] handle all icmp i1 X, C in one place; NFCI	Sanjay Patel	2017-05-17	1	-28/+39
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	We already handled all of the new tests identically, but several of those went through a lot of unnecessary processing before getting folded. Another motivation for grouping these cases together is that InstCombine needs a similar fold. Currently, it handles the 'not' cases inefficiently which can lead to bugs as described in the post-commit comments of: https://reviews.llvm.org/D32143 llvm-svn: 303295
*	[BinaryStream] Reduce the amount of boiler plate needed to use.	Zachary Turner	2017-05-17	4	-4/+158
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Often you have an array and you just want to use it. With the current design, you have to first construct a `BinaryByteStream`, and then create a `BinaryStreamRef` from it. Worse, the `BinaryStreamRef` holds a pointer to the `BinaryByteStream`, so you can't just create a temporary one to appease the compiler, you have to actually hold onto both the `ArrayRef` as well as the `BinaryByteStream` AND the `BinaryStreamReader` on top of that. This makes for very cumbersome code, often requiring one to store a `BinaryByteStream` in a class just to circumvent this. At the cost of some added complexity (not exposed to users, but internal to the library), we can do better than this. This patch allows us to construct `BinaryStreamReaders` and `BinaryStreamWriters` directly from source data (e.g. `StringRef`, `MutableArrayRef<uint8_t>`, etc). Not only does this reduce the amount of code you have to type and make it more obvious how to use it, but it solves real lifetime issues when it's inconvenient to hold onto a `BinaryByteStream` for a long time. The additional complexity is in the form of an added layer of indirection. Whereas before we simply stored a `BinaryStream` in the ref, we now store both a `BinaryStream` and a `std::shared_ptr<BinaryStream>`. When the user wants to construct a `BinaryStreamRef` directly from an `ArrayRef` etc, we allocate an internal object that holds ownership over a `BinaryByteStream` and forwards all calls, and store this in the `shared_ptr<>`. This also maintains the ref semantics, as you can copy it by value and references refer to the same underlying stream -- the one being held in the object stored in the `shared_ptr`. Differential Revision: https://reviews.llvm.org/D33293 llvm-svn: 303294
*	[X86][AVX512] Add 512-bit vector cttz costs + tests	Simon Pilgrim	2017-05-17	1	-0/+6
\| \| \| \|	llvm-svn: 303293
*	Only enable LiveRangeShrink for x86.	Dehao Chen	2017-05-17	2	-3/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Moving LiveRangeShrink to x86 as this pass is mostly useful for archtectures with great register pressure. Reviewers: MatzeB, qcolombet Reviewed By: qcolombet Subscribers: jholewinski, jyknight, javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D33294 llvm-svn: 303292
*	AMDGPU: Try to use op_sel when selecting packed instructions	Matt Arsenault	2017-05-17	1	-1/+29
\| \| \| \| \| \| \| \| \| \| \| \|	Avoids instructions to pack a vector when the source is really a scalar being broadcast. Also be smarter and look for per-component fneg. Doesn't yet handle scalar from upper half of register or other swizzles. llvm-svn: 303291
*	[WebAssembly][NFC] Update expected testsuite failures for newly passing tests	Jacob Gravelle	2017-05-17	1	-3/+0
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: r303050 fixes crashes when calling scalarizeMaskedMemIntrin pass from WebAssembly backend. This updates expected test failures for that. Reviewers: sbc100 Subscribers: jfb, llvm-commits, dschuff Differential Revision: https://reviews.llvm.org/D33295 llvm-svn: 303288
*	AMDGPU: Use appropriate soffset for spilling	Matt Arsenault	2017-05-17	2	-20/+20
\| \| \| \| \| \| \|	This needs to be the frame offset register, and not the global scratch wave offset register. For kernels, these are the same. llvm-svn: 303287
*	Revert r303015, because it has the unintended side effect of breaking	Dimitry Andric	2017-05-17	1	-24/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	driver-mode recognition in clang (this is because the sysctl method always returns one and only one executable path, even for an executable with multiple links): Fix DynamicLibraryTest.cpp on FreeBSD and NetBSD Summary: After rL301562, on FreeBSD the DynamicLibrary unittests fail, because the test uses getMainExecutable("DynamicLibraryTests", Ptr), and since the path does not contain any slashes, retrieving the main executable will not work. Reimplement getMainExecutable() for FreeBSD and NetBSD using sysctl(3), which is more reliable than fiddling with relative or absolute paths. Also add retrieval of the original argv[] from the GoogleTest framework, to use as a fallback for other OSes. Reviewers: emaste, marsupial, hans, krytarowski Reviewed By: krytarowski Subscribers: krytarowski, llvm-commits Differential Revision: https://reviews.llvm.org/D33171 llvm-svn: 303285
*	AMDGPU: Fix min3/max3 combines for f16/i16	Matt Arsenault	2017-05-17	3	-2/+25
\| \| \| \| \| \|	Fix missing instruction definitions for min3/max3. llvm-svn: 303284