bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[MachineOutliner] Recommit r312194, missed optimization remarks	Jessica Paquette	2017-08-31	1	-0/+73
\| \| \| \| \| \| \| \| \| \| \| \| \|	Before, this commit caused a buildbot failure: http://bb.pgr.jp/builders/test-llvm-i686-linux-RA/builds/6026/steps/test_llvm/logs/LLVM%20%3A%3A%20CodeGen__AArch64__machine-outliner-remarks.ll This was caused by the Key value in DiagnosticInfoOptimizationBase being deallocated before emitting the remarks defined in MachineOutliner.cpp. As of r312277 this should no longer be an issue. llvm-svn: 312280
*	[x86] add more tests for horizontal ops; NFC	Sanjay Patel	2017-08-31	2	-19/+159
\| \| \| \|	llvm-svn: 312279
*	[codeview] Generalize DIExpression parsing to handle load chains	Reid Kleckner	2017-08-31	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Hopefully this also clarifies exactly when and why we're rewriting certiain S_LOCALs using reference types: We're using the reference type to stand in for a zero-offset load. Reviewers: inglorion Subscribers: llvm-commits, hiraditya Differential Revision: https://reviews.llvm.org/D37309 llvm-svn: 312247
*	Revert r311525: "[XRay][CodeGen] Use PIC-friendly code in XRay sleds; remove ↵	Daniel Jasper	2017-08-31	11	-61/+85
\| \| \| \| \| \| \| \|	synthetic references in .text" Breaks builds internally. Will forward repo instructions to author. llvm-svn: 312243
*	[X86] Added run line to intrinsics upgrade test. NFC.	Yael Tsafrir	2017-08-31	1	-0/+1
\| \| \| \|	llvm-svn: 312241
*	AMD family 17h (znver1) scheduler model update.	Ashutosh Nema	2017-08-31	19	-654/+654
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch enables the following: 1) Regex based Instruction itineraries for integer instructions. 2) The instructions are grouped as per the nature of the instructions (move, arithmetic, logic, Misc, Control Transfer). 3) FP instructions and their itineraries are added which includes values for SSE4A, BMI, BMI2 and SHA instructions. Patch by Ganesh Gopalasubramanian Reviewers: RKSimon, craig.topper Subscribers: vprasad, shivaram, ddibyend, andreadb, javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D36617 llvm-svn: 312237
*	[AArch64] Support COFF linker directives	Martin Storsjo	2017-08-31	1	-0/+74
\| \| \| \| \| \| \| \| \| \|	This is similar to what was done for ARM in SVN r269574; the code and the test are straight copypaste to the corresponding AArch64 code and test directory. Differential revision: https://reviews.llvm.org/D37204 llvm-svn: 312223
*	Revert r312194: "[MachineOutliner] Add missed optimization remarks for the ↵	Daniel Jasper	2017-08-31	1	-73/+0
\| \| \| \| \| \| \| \| \|	outliner." Breaks on buildbot: http://bb.pgr.jp/builders/test-llvm-i686-linux-RA/builds/6026/steps/test_llvm/logs/LLVM%20%3A%3A%20CodeGen__AArch64__machine-outliner-remarks.ll llvm-svn: 312219
*	Temporarily revert "Update branch coalescing to be a PowerPC specific pass"	Eric Christopher	2017-08-31	2	-44/+21
\| \| \| \| \| \| \| \|	From comments and code review it wasn't intended to be enabled by default yet. This reverts commit r311588. llvm-svn: 312214
*	[MachineOutliner] Add missed optimization remarks for the outliner.	Jessica Paquette	2017-08-30	1	-0/+73
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This adds missed optimization remarks which report viable candidates that were not outlined because they would increase code size. Other remarks will come in separate commits. This will help to diagnose code size regressions and changes in outliner behaviour in projects using the outliner. https://reviews.llvm.org/D37085 llvm-svn: 312194
*	Revert r312154 "Re-enable "[MachineCopyPropagation] Extend pass to do COPY ↵	Hans Wennborg	2017-08-30	72	-302/+319
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	source forwarding"" It caused PR34387: Assertion failed: (RegNo < NumRegs && "Attempting to access record for invalid register number!") > Issues identified by buildbots addressed since original review: > - Fixed ARMLoadStoreOptimizer bug exposed by this change in r311907. > - The pass no longer forwards COPYs to physical register uses, since > doing so can break code that implicitly relies on the physical > register number of the use. > - The pass no longer forwards COPYs to undef uses, since doing so > can break the machine verifier by creating LiveRanges that don't > end on a use (since the undef operand is not considered a use). > > [MachineCopyPropagation] Extend pass to do COPY source forwarding > > This change extends MachineCopyPropagation to do COPY source forwarding. > > This change also extends the MachineCopyPropagation pass to be able to > be run during register allocation, after physical registers have been > assigned, but before the virtual registers have been re-written, which > allows it to remove virtual register COPY LiveIntervals that become dead > through the forwarding of all of their uses. llvm-svn: 312178
*	[ARM] Use Swift error registers on non-Darwin targets	Brian Gesiak	2017-08-30	1	-0/+54
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Remove a check for `ARMSubtarget::isTargetDarwin` when determining whether to use Swift error registers, so that Swift errors work properly on non-Darwin ARM32 targets (specifically Android). Before this patch, generated code would save and restores ARM register r8 at the entry and returns of a function that throws. As r8 is used as a virtual return value for the object being thrown, this gets overwritten by the restore, and calling code is unable to catch the error. In turn this caused Swift code that used `do`/`try`/`catch` to work improperly on Android ARM32 targets. Addresses Swift bug report https://bugs.swift.org/browse/SR-5438. Patch by John Holdsworth. Reviewers: manmanren, rjmccall, aschwaighofer Reviewed By: aschwaighofer Subscribers: srhines, aschwaighofer, aemerson, javed.absar, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D35835 llvm-svn: 312164
*	[GISel]: Add a clean up combiner during legalization.	Aditya Nandakumar	2017-08-30	16	-98/+217
\| \| \| \| \| \| \| \| \| \| \|	Added a combiner which can clean up truncs/extends that are created in order to make the types work during legalization. Also moved the combineMerges to the LegalizeCombiner. https://reviews.llvm.org/D36880 llvm-svn: 312158
*	Re-enable "[MachineCopyPropagation] Extend pass to do COPY source forwarding"	Geoff Berry	2017-08-30	72	-319/+302
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Issues identified by buildbots addressed since original review: - Fixed ARMLoadStoreOptimizer bug exposed by this change in r311907. - The pass no longer forwards COPYs to physical register uses, since doing so can break code that implicitly relies on the physical register number of the use. - The pass no longer forwards COPYs to undef uses, since doing so can break the machine verifier by creating LiveRanges that don't end on a use (since the undef operand is not considered a use). [MachineCopyPropagation] Extend pass to do COPY source forwarding This change extends MachineCopyPropagation to do COPY source forwarding. This change also extends the MachineCopyPropagation pass to be able to be run during register allocation, after physical registers have been assigned, but before the virtual registers have been re-written, which allows it to remove virtual register COPY LiveIntervals that become dead through the forwarding of all of their uses. llvm-svn: 312154
*	[WebAssembly] Add target feature for atomics	Derek Schuff	2017-08-30	1	-0/+19
\| \| \| \| \| \| \| \| \| \|	Summary: This tracks the WebAssembly threads feature proposal at https://github.com/WebAssembly/threads/blob/master/proposals/threads/Overview.md Differential Revision: https://reviews.llvm.org/D37300 llvm-svn: 312145
*	Canonicalize the representation of empty an expression in ↵	Adrian Prantl	2017-08-30	18	-142/+142
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	DIGlobalVariableExpression This change simplifies code that has to deal with DIGlobalVariableExpression and mirrors how we treat DIExpressions in debug info intrinsics. Before this change there were two ways of representing empty expressions on globals, a nullptr and an empty !DIExpression(). If someone needs to upgrade out-of-tree testcases: perl -pi -e 's/(!DIGlobalVariableExpression\(var: ![0-9]*)\)/\1, expr: !DIExpression())/g' <MYTEST.ll> will catch 95%. llvm-svn: 312144
*	[AVX512] Don't use 32-bit elements version of AND/OR/XOR/ANDN during isel ↵	Craig Topper	2017-08-30	9	-32/+32
\| \| \| \| \| \| \| \| \| \| \| \|	unless we're matching a masked op or broadcast Selecting 32-bit element logical ops without a select or broadcast requires matching a bitconvert on the inputs to the and. But that's a weird thing to rely on. It's entirely possible that one of the inputs doesn't have a bitcast and one does. Since there's no functional difference, just remove the extra patterns and save some isel table size. Differential Revision: https://reviews.llvm.org/D36854 llvm-svn: 312138
*	[GlobalISel][X86] Support variadic function call.	Igor Breger	2017-08-30	2	-0/+163
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Support variadic function call. Port the implementation from X86FastISel. Reviewers: zvi, guyblank, oren_ben_simhon Reviewed By: guyblank Subscribers: rovka, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D37261 llvm-svn: 312130
*	Re-land MachineInstr: Reason locally about some memory objects before going ↵	Balaram Makam	2017-08-30	13	-64/+62
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	to AA. Summary: Reverts r311008 to reinstate r310825 with a fix. Refine alias checking for pseudo vs value to be conservative. This fixes the original failure in builtbot unittest SingleSource/UnitTests/2003-07-09-SignedArgs. Reviewers: hfinkel, nemanjai, efriedma Reviewed By: efriedma Subscribers: bjope, mcrosier, nhaehnle, javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D36900 llvm-svn: 312126
*	[MIPS] Add support to match more patterns for BBIT instruction	Strahinja Petrovic	2017-08-30	1	-0/+56
\| \| \| \| \| \| \| \| \| \|	This patch supports one more pattern for bbit0 and bbit1 instructions, CBranchBitNum class is expanded so it can take 32 bit immidate. Differential Revision: https://reviews.llvm.org/D36222 llvm-svn: 312111
*	[AArch64] allow v4f16 types when FullFP16 is supported	Sjoerd Meijer	2017-08-30	2	-334/+641
\| \| \| \| \| \| \| \| \| \|	Support for scalars was committed in r311154, this adds support for allowing v4f16 vector types (thus avoiding conversions from/to single precision for these types). Differential Revision: https://reviews.llvm.org/D37145 llvm-svn: 312104
*	[X86][Skylake] Fixing duplicated prefixes in the run command of Code Gen ↵	Gadi Haber	2017-08-30	10	-10/+2465
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	regression tests NFC. Replaced duplicated HASWELL prefixes in run commands in the X86 Code Gen regression tests by the SKYLAKE prefix when the -mcpu is set to skylake. The fix is needed in preparation of an upcoming patch containing the Skylake scheduling info. Reviewers: zvi, RKSimon, aymanmus, igorb Differential Revision: https://reviews.llvm.org/D37258 llvm-svn: 312103
*	[AVX512] Correct isel patterns to support selecting masked ↵	Craig Topper	2017-08-30	1	-0/+155
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	vbroadcastf32x2/vbroadcasti32x2 Summary: This patch adjusts the patterns to make the result type of the broadcast node vXf64/vXi64. Then adds a bitcast to vXi32 after that. Intrinsic lowering was also adjusted to generate this new pattern. Fixes PR34357 We should probably just drop the intrinsic entirely and use native IR, but I'll leave that for a future patch. Any idea what instruction we should be lowering the floating point 128-bit result version of this pattern to? There's a 128-bit v2i32 integer broadcast but not an fp one. Reviewers: aymanmus, zvi, igorb Reviewed By: aymanmus Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D37286 llvm-svn: 312101
*	[AVX512] Use 256-bit extract instructions for extracting bits [255:128] from ↵	Craig Topper	2017-08-30	13	-151/+151
\| \| \| \| \| \| \| \| \| \|	a 512-bit register This enables the use of a smaller encoding by using a VEX instruction when possible. Differential Revision: https://reviews.llvm.org/D37092 llvm-svn: 312100
*	[X86] Apply SlowIncDec feature to Sandybridge/Ivybridge CPUs as well	Craig Topper	2017-08-30	2	-3/+3
\| \| \| \| \| \| \| \|	Currently we start applying this on Haswell and newer. I don't believe anything changed in the Haswell architecture to make this the right cutoff point. The partial flag handling around this has been roughly the same since Sandybridge. Differential Revision: https://reviews.llvm.org/D37250 llvm-svn: 312099
*	[X86] Provide a separate feature bit for macro fusion support instead of ↵	Craig Topper	2017-08-30	5	-8/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	basing it on the AVX flag Summary: Currently we determine if macro fusion is supported based on the AVX flag as a proxy for the processor being Sandy Bridge". This is really strange as now AMD supports AVX. It also means if user explicitly disables AVX we disable macro fusion. This patch adds an explicit macro fusion feature. I've also enabled for the generic 64-bit CPU (which doesn't have AVX) This is probably another candidate for being in the MI layer, but for now I at least wanted to correct the overloading of the AVX feature. Reviewers: spatel, chandlerc, RKSimon, zvi Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D37280 llvm-svn: 312097
*	[AMDGPU] Use v_max_f* for fcanonicalize	Stanislav Mekhanoshin	2017-08-30	3	-32/+109
\| \| \| \| \| \| \| \| \| \|	If denorms are not flushed we can use max instead of multiplication by 1. For double that is simply faster, while for float and half it is shorter, because mul uses constant bus and VOP3. Differential Revision: https://reviews.llvm.org/D36856 llvm-svn: 312095
*	AMDGPU: Select clamp pattern with v2f16	Matt Arsenault	2017-08-30	1	-34/+190
\| \| \| \|	llvm-svn: 312087
*	[dwarfdump] Pretty print location expressions and location lists	Reid Kleckner	2017-08-29	7	-48/+18
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Based on Fred's patch here: https://reviews.llvm.org/D6771 I can't seem to commandeer the old review, so I'm creating a new one. With that change the locations exrpessions are pretty printed inline in the DIE tree. The output looks like this for debug_loc entries: DW_AT_location [DW_FORM_data4] (0x00000000 0x0000000000000001 - 0x000000000000000b: DW_OP_consts +3 0x000000000000000b - 0x0000000000000012: DW_OP_consts +7 0x0000000000000012 - 0x000000000000001b: DW_OP_reg0 RAX, DW_OP_piece 0x4 0x000000000000001b - 0x0000000000000024: DW_OP_breg5 RDI+0) And like this for debug_loc.dwo entries: DW_AT_location [DW_FORM_sec_offset] (0x00000000 Addr idx 2 (w/ length 190): DW_OP_consts +0, DW_OP_stack_value Addr idx 3 (w/ length 23): DW_OP_reg0 RAX, DW_OP_piece 0x4) Simple locations without ranges are printed inline: DW_AT_location [DW_FORM_block1] (DW_OP_reg4 RSI, DW_OP_piece 0x4, DW_OP_bit_piece 0x20 0x0) The debug_loc(.dwo) dumping in changed accordingly to factor the code. Reviewers: dblaikie, aprantl, friss Subscribers: mgorny, javed.absar, hiraditya, llvm-commits, JDevlieghere Differential Revision: https://reviews.llvm.org/D37123 llvm-svn: 312042
*	Reland r311957 [codeview] support more DW_OPs for more complete debug info	Bob Haarman	2017-08-29	1	-0/+253
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Some variables show up in Visual Studio as "optimized out" even in -O0 -Od builds. This change fixes two issues that would cause this to happen. The first issue is that not all DIExpressions we generate were recognized by the CodeView writer. This has been addressed by adding support for DW_OP_constu, DW_OP_minus, and DW_OP_plus. The second issue is that we had no way to encode DW_OP_deref in CodeView. We get around that by changinge the type we encode in the debug info to be a reference to the type in the source code. This fixes PR34261. The reland adds two extra checks to the original: It checks if the DbgVariableLocation is valid before checking any of its fields, and it only emits ranges with nonzero registers. Reviewers: aprantl, rnk, zturner Reviewed By: rnk Subscribers: mgorny, llvm-commits, aprantl, hiraditya Differential Revision: https://reviews.llvm.org/D36907 llvm-svn: 312034
*	[X86] Add a test cases to demonstrate selecting GPR instructions when	Guy Blank	2017-08-29	1	-0/+365
\| \| \| \| \| \|	using mask based ones are more appropriate. llvm-svn: 311996
*	[X86] Adding a test to demonstrate aggressive folding for LEA facotrization.	Jatin Bhateja	2017-08-29	1	-0/+148
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D37257 llvm-svn: 311994
*	[ARM] GlobalISel: Select globals in PIC mode	Diana Picus	2017-08-29	3	-2/+121
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Support the selection of G_GLOBAL_VALUE in the PIC relocation model. For simplicity we use the same pseudoinstructions for both Darwin and ELF: (MOV\|LDRLIT)_ga_pcrel(_ldr). This is new for ELF, so it requires a small update to the ARM pseudo expansion pass to make sure it adds the correct constant pool modifier and add-current-address in the case of ELF. Differential Revision: https://reviews.llvm.org/D36507 llvm-svn: 311992
*	[ARM] GlobalISel: Rename tests. NFC.	Diana Picus	2017-08-29	2	-0/+0
\| \| \| \| \| \| \|	The checks are complicated enough as it is, there's no use cramming PIC in there as well... llvm-svn: 311989
*	Mark Knights Landing as having slow two memory operand instructions	Craig Topper	2017-08-29	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Knights Landing, because it is Atom derived, has slow two memory operand instructions. Mark the Knights Landing CPU model accordingly. Patch by David Zarzycki. Reviewers: craig.topper Reviewed By: craig.topper Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D37224 llvm-svn: 311979
*	Revert "[codeview] support more DW_OPs for more complete debug info"	Bob Haarman	2017-08-29	1	-253/+0
\| \| \| \| \| \|	This reverts commit e160912f53f047bc97e572add179e08e33f4df48. llvm-svn: 311977
*	[codeview] support more DW_OPs for more complete debug info	Bob Haarman	2017-08-29	1	-0/+253
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Some variables show up in Visual Studio as "optimized out" even in -O0 -Od builds. This change fixes two issues that would cause this to happen. The first issue is that not all DIExpressions we generate were recognized by the CodeView writer. This has been addressed by adding support for DW_OP_constu, DW_OP_minus, and DW_OP_plus. The second issue is that we had no way to encode DW_OP_deref in CodeView. We get around that by changinge the type we encode in the debug info to be a reference to the type in the source code. This fixes PR34261. Reviewers: aprantl, rnk, zturner Reviewed By: rnk Subscribers: mgorny, llvm-commits, aprantl, hiraditya Differential Revision: https://reviews.llvm.org/D36907 llvm-svn: 311957
*	[AArch64][Falkor] Avoid generating STRQro* instructions	Geoff Berry	2017-08-28	1	-0/+47
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: STRQro* instructions are slower than the alternative ADD/STRQui expanded instructions on Falkor, so avoid generating them unless we're optimizing for code size. Reviewers: t.p.northover, mcrosier Subscribers: aemerson, rengolin, javed.absar, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D37020 llvm-svn: 311931
*	Fix ARMv4 support	Joerg Sonnenberger	2017-08-28	5	-11/+26
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	ARMv4 doesn't support the "BX" instruction, which has been introduced with ARMv4t. Adjust the call lowering and tail call implementation accordingly. Further changes are necessary to ensure that presence of the v4t feature is correctly set. Most importantly, the "generic" CPU for thumb-* triples should include ARMv4t, since thumb mode without thumb support would naturally be pointless. Add a couple of asserts to ensure thumb instructions are not emitted without CPU support. Differential Revision: https://reviews.llvm.org/D37030 llvm-svn: 311921
*	[ARM] Fix bug in ARMLoadStoreOptimizer when kill flags are missing.	Geoff Berry	2017-08-28	1	-0/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: ARMLoadStoreOpt::FixInvalidRegPairOp() was only checking if one of the load destination registers to be split overlapped with the base register if the base register was marked as killed. Since kill flags may not always be present, this can lead to incorrect code. This bug was exposed by my MachineCopyPropagation change D30751 breaking the sanitizer-x86_64-linux-android buildbot. Also clean up some dead code and add an assert that a register offset is never encountered by this code, since it does not handle them correctly. Reviewers: MatzeB, qcolombet, t.p.northover Subscribers: aemerson, javed.absar, kristof.beyls, mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D37164 llvm-svn: 311907
*	[Hexagon] Check for potential bank conflicts in post-RA scheduling	Krzysztof Parzyszek	2017-08-28	1	-0/+28
\| \| \| \| \| \| \|	Insert artificial edges between loads that could cause a cache bank conflict. llvm-svn: 311901
*	[AMDGPU] Fix regression in AMDGPULibCalls allowing native for doubles	Stanislav Mekhanoshin	2017-08-28	1	-0/+11
\| \| \| \| \| \| \| \| \| \| \| \| \|	Under -cl-fast-relaxed-math we could use native_sqrt, but f64 was allowed to produce HSAIL's nsqrt instruction. HSAIL is not here and we stick with non-existing native_sqrt(double) as a result. Add check for f64 to not return native functions and also remove handling of f64 case for fold_sqrt. Differential Revision: https://reviews.llvm.org/D37223 llvm-svn: 311900
*	[AMDGPU] computeKnownBitsForTargetNode for 24 bit mul	Stanislav Mekhanoshin	2017-08-28	1	-13/+47
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D37168 llvm-svn: 311896
*	[DAGCombiner] Teach visitEXTRACT_SUBVECTOR to turn extracts of BUILD_VECTOR ↵	Craig Topper	2017-08-28	2	-11/+5
\| \| \| \| \| \| \| \| \| \|	into smaller BUILD_VECTORs Only do this before operations are legalized of BUILD_VECTOR is Legal for the target. Differential Revision: https://reviews.llvm.org/D37186 llvm-svn: 311892
*	[X86][Haswell] Updating HSW instruction scheduling information	Gadi Haber	2017-08-28	39	-11252/+10009
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch completely replaces the instruction scheduling information for the Haswell architecture target by modifying the file X86SchedHaswell.td located under the X86 Target. We used the scheduling information retrieved from the Haswell architects in order to replace and modify the existing scheduling. The patch continues the scheduling replacement effort started with the SNB target in r307529 and r310792. Information includes latency, number of micro-Ops and used ports by each HSW instruction. Please expect some performance fluctuations due to code alignment effects. Reviewers: RKSimon, zvi, aymanmus, craig.topper, m_zuckerman, igorb, dim, chandlerc, aaboud Differential Revision: https://reviews.llvm.org/D36663 llvm-svn: 311879
*	[mips] Generate NMADD and NMSUB instructions when fneg node is present	Petar Jovanovic	2017-08-27	1	-0/+83
\| \| \| \| \| \| \| \| \| \| \| \|	This patch enables generation of NMADD and NMSUB instructions when fneg node is present. These instructions are currently only generated if fsub node is present. Patch by Stanislav Ocovaj. Differential Revision: https://reviews.llvm.org/D34507 llvm-svn: 311862
*	[AVX512] Add more patterns for using masked moves for subvector extracts of ↵	Craig Topper	2017-08-27	1	-0/+228
\| \| \| \| \| \|	the lowest subvector. This time with bitcasts between the vselect and the extract. llvm-svn: 311856
*	[DAGCombiner] allow undef shuffle operands when eliminating bitcasts (PR34111)	Sanjay Patel	2017-08-27	1	-7/+2
\| \| \| \| \| \| \| \|	As noted in the FIXME, this could be improved more, but this is the smallest fix that helps: https://bugs.llvm.org/show_bug.cgi?id=34111 llvm-svn: 311853
*	[x86] add haddps test for PR34111; NFC	Sanjay Patel	2017-08-27	1	-0/+25
\| \| \| \|	llvm-svn: 311852
*	[X86] Adding more tests for horizontal [F]HADD/[F]SUB for AVX512 vectors types	Jatin Bhateja	2017-08-27	1	-2/+82
\| \| \| \|	llvm-svn: 311847