bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[llvm-objcopy] Refactor llvm-objcopy to use reader and writer objects	Jake Ehrlich	2018-01-25	3	-1/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	While writing code for input and output formats in llvm-objcopy it became apparent that there was a code health problem. This change attempts to solve that problem by refactoring the code to use Reader and Writer objects that can read in different objects in different formats, convert them to a single shared internal representation, and then write them to any other representation. New classes: Reader: the base class used to construct instances of the internal representation Writer: the base class used to write out instances of the internal representation ELFBuilder: a helper class for ELFWriter that takes an ELFFile and converts it to a Object SectionVisitor: it became necessary to remove writeSection from SectionBase because, under the new Reader/Writer scheme, it's possible to convert between ELF Types such as ELF32LE and ELF32BE. This isn't possible with writeSection because it (dynamically) depends on the underlying section type and (statically) depends on the ELF type. Bad things would happen if the underlying sections for ELF32LE were used for writing to ELF64BE. To avoid this code smell (which would have compiled, run, and output some nonsesnse) I decoupled writing of sections from a class. SectionWriter: This is just the ELFT templated implementation of SectionVisitor. Many classes now have this class as a friend so that the writing methods in this class can write out private data. ELFWriter: This is the Writer that outputs to ELF BinaryWriter: This is the Writer that outputs to Binary ElfType: Because the ELF Type is not a part of the Object anymore we need a way to construct the correct default Writer based on properties of the Reader. This enum just keeps track of the ELF type of the input so it can be used as the default output type as well. Object has correspondingly undergone some serious changes as well. It now has more generic methods for building and manipulating ELF binaries. This interface makes ELFBuilder easy enough to use and will make the BinaryReader/Builder easy to create as well. Most changes in this diff are cosmetic and deal with the fact that a method has been moved from one class to another or a change from a pointer to a reference. Almost no changes should result in a functional difference (this is after all a refactor). One minor functional change was made and the result can be seen in remove-shstrtab-error.test. The fact that it fails hasn't changed but the error message has changed because that failure is detected at a later point in the code now (because WriteSectionHeaders is a property of the ElfWriter not a property of the Object). I'd say roughly 80-90% of this code is cosmetically different, 10-19% is different but functionally the same, and 1-5% is functionally different despite not causing a change in tests. Differential Revision: https://reviews.llvm.org/D42222 llvm-svn: 323480
*	Add testcase accidentally left out from r323460.	Easwaran Raman	2018-01-25	1	-0/+35
\| \| \| \|	llvm-svn: 323478
*	[llvm-objcopy] Add --add-gnu-debuglink	Jake Ehrlich	2018-01-25	1	-0/+27
\| \| \| \| \| \| \| \|	This change adds support for --add-gnu-debuglink to llvm-objcopy Differential Revision: https://reviews.llvm.org/D41731 llvm-svn: 323477
*	[DWARFv5] Support DW_FORM_line_strp in llvm-dwarfdump.	Paul Robinson	2018-01-25	1	-3/+9
\| \| \| \| \| \| \| \| \| \| \|	This form is like DW_FORM_strp, but points to .debug_line_str instead of .debug_str as the string section. It's intended to be used from the line-table header, and allows string-pooling of directory and filenames across compilation units. Differential Revision: https://reviews.llvm.org/D42553 llvm-svn: 323476
*	[Debug] Add dbg.value intrinsics for PHIs created during LCSSA.	Vedant Kumar	2018-01-25	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch is an enhancement to propagate dbg.value information when Phis are created on behalf of LCSSA. I noticed a case where a value carried across a loop was reported as <optimized out>. Specifically this case: int bar(int x, int y) { return x + y; } int foo(int size) { int val = 0; for (int i = 0; i < size; ++i) { val = bar(val, i); // Both val and i are correct } return val; // <optimized out> } In the above case, after all of the interesting computation completes our value is reported as "optimized out." This change will add a dbg.value to correct this. This patch also moves the dbg.value insertion routine from LoopRotation.cpp into Local.cpp, so that we can share it in both places (LoopRotation and LCSSA). Patch by Matt Davis! Differential Revision: https://reviews.llvm.org/D42551 llvm-svn: 323472
*	[X86] Teach Intel syntax InstPrinter to print lock prefixes that have been ↵	Craig Topper	2018-01-25	1	-0/+6
\| \| \| \| \| \| \| \|	parsed from the asm parser. The asm parser puts the lock prefix in the MCInst flags so we need to check that in addition to TSFlags. This matches what the ATT printer does. llvm-svn: 323469
*	Revert r322132; it appears to be an accidental commit, based on the commit ↵	Aaron Ballman	2018-01-25	1	-27/+0
\| \| \| \| \| \|	message. The original author of the commit has not commented on whether this was accidental or purposeful, so if this revert is in error, the author can re-commit with an actual commit message. llvm-svn: 323466
*	Reverting r323463 as it appears to be an accidental commit. Regardless, it ↵	Aaron Ballman	2018-01-25	3	-3/+1
\| \| \| \| \| \| \| \| \| \|	broke a lot of build bots, so reverting back to green. http://lab.llvm.org:8011/builders/lldb-amd64-ninja-netbsd8/builds/9294 http://lab.llvm.org:8011/builders/llvm-clang-lld-x86_64-scei-ps4-ubuntu-fast/builds/24084 http://lab.llvm.org:8011/builders/clang-ppc64le-linux-lnt/builds/9567 llvm-svn: 323465
*	tmp	Jake Ehrlich	2018-01-25	3	-1/+3
\| \| \| \|	llvm-svn: 323463
*	Revert "asan: add kernel inline instrumentation test"	Vedant Kumar	2018-01-25	1	-27/+0
\| \| \| \| \| \| \| \|	This reverts commit r323451. It breaks this bot: http://lab.llvm.org:8011/builders/llvm-clang-lld-x86_64-scei-ps4-ubuntu-fast/builds/24077 llvm-svn: 323454
*	[Hexagon] SETEQ and SETNE are valid integer condition codes	Krzysztof Parzyszek	2018-01-25	1	-0/+21
\| \| \| \|	llvm-svn: 323452
*	asan: add kernel inline instrumentation test	Vedant Kumar	2018-01-25	1	-0/+27
\| \| \| \| \| \| \| \|	Patch by Andrey Konovalov! Differential Revision: https://reviews.llvm.org/D42473 llvm-svn: 323451
*	Revert "[SLP] Fix for PR32086: Count InsertElementInstr of the same elements ↵	Alexey Bataev	2018-01-25	3	-24/+24
\| \| \| \| \| \| \| \|	as shuffle." This reverts commit r323441 to fix buildbots. llvm-svn: 323447
*	[SLP] Fix for PR32086: Count InsertElementInstr of the same elements as shuffle.	Alexey Bataev	2018-01-25	3	-24/+24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: If the same value is going to be vectorized several times in the same tree entry, this entry is considered to be a gather entry and cost of this gather is counter as cost of InsertElementInstrs for each gathered value. But we can consider these elements as ShuffleInstr with SK_PermuteSingle shuffle kind. Reviewers: spatel, RKSimon, mkuper, hfinkel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D38697 llvm-svn: 323441
*	[InstCombine] narrow masked zexted binops (PR35792)	Sanjay Patel	2018-01-25	1	-48/+84
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is guarded by shouldChangeType(), so the tests show that we don't do the fold if the narrower type is not legal. Note that there is a proposal (D42424) that would change the results for the specific cases shown in these tests. That difference is also discussed in PR35792: https://bugs.llvm.org/show_bug.cgi?id=35792 Alive proofs for the cases handled here as well as the bitwise logic binops that we should already do better on: https://rise4fun.com/Alive/c97 https://rise4fun.com/Alive/Lc5E https://rise4fun.com/Alive/kdf llvm-svn: 323437
*	[InstCombine] add tests for PR35792; NFC	Sanjay Patel	2018-01-25	1	-0/+192
\| \| \| \|	llvm-svn: 323436
*	Revert "[SLP] Fix for PR32086: Count InsertElementInstr of the same elements ↵	Alexey Bataev	2018-01-25	3	-24/+24
\| \| \| \| \| \| \| \|	as shuffle." This reverts commit r323430 to fix buildbots. llvm-svn: 323432
*	[SLP] Fix for PR32086: Count InsertElementInstr of the same elements as shuffle.	Alexey Bataev	2018-01-25	3	-24/+24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: If the same value is going to be vectorized several times in the same tree entry, this entry is considered to be a gather entry and cost of this gather is counter as cost of InsertElementInstrs for each gathered value. But we can consider these elements as ShuffleInstr with SK_PermuteSingle shuffle kind. Reviewers: spatel, RKSimon, mkuper, hfinkel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D38697 llvm-svn: 323430
*	[X86][SSE] Add tests for vector truncation with signed saturation	Simon Pilgrim	2018-01-25	1	-0/+3520
\| \| \| \| \| \|	AVX512 isn't using X86ISD::VTRUNCS and SSE/AVX isn't using PACKSS/PACKUS llvm-svn: 323428
*	[X86][SSE] Add tests for vector truncation with unsigned saturation	Simon Pilgrim	2018-01-25	1	-0/+2352
\| \| \| \| \| \|	AVX512 tends to do a good job, but there are some missed opportunities with SSE/AVX llvm-svn: 323422
*	X86 Tests: Add AVX+XOP config to SDIV combine tests	Zvi Rackover	2018-01-25	1	-0/+622
\| \| \| \| \| \| \|	As pointed out in D42479, XOP also needs to be covered as it supports vector shifts with variable shift amount. llvm-svn: 323418
*	Another try to commit 323321 (aggressive instruction combine).	Amjad Aboud	2018-01-25	5	-2/+299
\| \| \| \|	llvm-svn: 323416
*	[GlobalOpt] Emit fragments using field offsets from struct layout	Mikael Holmen	2018-01-25	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: When creating the debug fragments for a SRA'd struct, use the fields' offsets, taken from the struct layout, as the offsets for the resulting fragments. This fixes an issue where GlobalOpt would emit fragments with incorrect offsets for padded fields. This should solve PR36016. Patch by David Stenberg. Reviewers: aprantl Reviewed By: aprantl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D42489 llvm-svn: 323411
*	[IRMover] Add comment and fix test case	Eugene Leviant	2018-01-25	1	-1/+5
\| \| \| \|	llvm-svn: 323407
*	[X86] Expand IMUL/MUL instregexs in Intel scheduler models. Add load latency ↵	Craig Topper	2018-01-25	1	-29/+29
\| \| \| \| \| \| \| \| \| \| \| \|	to some of them in SkylakeClient model. The regular expressions and the imul names caused some instructions to be matched by multiple regexs creating unpredictable results. This changes them all to use explicit instrs instead. While doing this I also found that some instructions in Skylake were missing load latency so I fixed that too. llvm-svn: 323406
*	[X86] Remove 64/128/256 from MMX/SSE/AVX instruction names for overall ↵	Craig Topper	2018-01-25	1	-1/+1
\| \| \| \| \| \| \| \| \| \|	consistency. NFC MMX instrutions all start with MMX_ so the 64 isn't needed for disambigutation. SSE/AVX1 instructions are assumed 128-bit so we don't need to say 128. AVX2 instructions should use a Y to indicate 256-bits. llvm-svn: 323402
*	[GlobalISel] Add a requires: asserts to a test.	Amara Emerson	2018-01-24	1	-0/+1
\| \| \| \|	llvm-svn: 323384
*	[InstCombine] fix datalayout in test file	Sanjay Patel	2018-01-24	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	The only part of the datalayout that should matter for these tests is the part that specifies the legal int widths ('n*'). But there was a bug - that part of the string was not correctly separated with the expected '-' character, so we were testing as if there were no legal int widths at all. Removed the leading cruft so we have some legal ints to test with. I noticed this while testing a potential change to the way we transform shifts and sexts in D42424. llvm-svn: 323377
*	[AArch64][GlobalISel] Fall back during AArch64 isel if we have a volatile load.	Amara Emerson	2018-01-24	1	-0/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The tablegen imported patterns for sext(load(a)) don't check for single uses of the load or delete the original after matching. As a result two loads are left in the generated code. This particular issue will be fixed by adding support for a G_SEXTLOAD opcode in future. There are however other potential issues around this that wouldn't be fixed by a G_SEXTLOAD, so until we have a proper solution we don't try to handle volatile loads at all in the AArch64 selector. Fixes/works around PR36018. llvm-svn: 323371
*	[GlobalISel] Don't fall back to FastISel.	Amara Emerson	2018-01-24	1	-0/+10
\| \| \| \| \| \| \|	Apparently checking the pass structure isn't enough to ensure that we don't fall back to FastISel, as it's set up as part of the SelectionDAGISel. llvm-svn: 323369
*	[X86][SSE] Aggressively use PMADDWD for v4i32 multiplies with 17 or more ↵	Simon Pilgrim	2018-01-24	3	-170/+374
\| \| \| \| \| \| \| \| \| \| \| \|	leading zeros As discussed in D41484, PMADDWD for 'zero extended' vXi32 is nearly always a better option than PMULLD: On SNB it will result in code that isn't any faster, but not any slower so we may as well keep it. On KNL it only has half the throughput, so I've disabled it on there - ideally there'd be a better way than this. Differential Revision: https://reviews.llvm.org/D42258 llvm-svn: 323367
*	[X86][SSE] Add slow-pmulld attribute (silvermont-style) test	Simon Pilgrim	2018-01-24	1	-247/+505
\| \| \| \| \| \|	Requested by @zvi on D42258 llvm-svn: 323364
*	Revert "[SLP] Fix for PR32086: Count InsertElementInstr of the same elements ↵	Alexey Bataev	2018-01-24	3	-24/+24
\| \| \| \| \| \| \| \|	as shuffle." This reverts commit r323348 because of the broken buildbots. llvm-svn: 323359
*	Revert "[ThinLTO] Add call edges' relative block frequency to per-module ↵	Easwaran Raman	2018-01-24	1	-35/+0
\| \| \| \| \| \| \| \|	summary." Causes buildbot regressions. llvm-svn: 323358
*	Revert r321751, "StructurizeCFG: Fix broken backedge detection"	Nicolai Haehnle	2018-01-24	5	-144/+123
\| \| \| \| \| \| \| \| \| \| \|	It causes regressions in various OpenGL test suites. Keep the test cases introduced by r321751 as XFAIL, and add a test case for the regression. Change-Id: I90b4cc354f68cebe5fcef1f2422dc8fe1c6d3514 Bugzilla: https://bugs.llvm.org/show_bug.cgi?id=36015 llvm-svn: 323355
*	[ARM] Expand long shifts for Thumb1 to __aeabi_ calls	Weiming Zhao	2018-01-24	1	-0/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: For long shifts, the inlined version takes about 20 instructions on Thumb1. To avoid the code bloat, expand to __aeabi_ calls if target is Thumb1. Reviewers: samparker Reviewed By: samparker Subscribers: samparker, aemerson, javed.absar, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D42401 llvm-svn: 323354
*	[X86] Fix some inconsistencies in the itineraries and Sched for ↵	Craig Topper	2018-01-24	2	-2/+2
\| \| \| \| \| \| \| \|	(V)PEXTRW/(V)PINSRW The weirdest being that PEXTRWrr was tagged as a memory operation. llvm-svn: 323353
*	[X86] Adjust names of PINSRW/PEXTRW intructions between MMX/SSE/AVX/AVX512 ↵	Craig Topper	2018-01-24	2	-5/+5
\| \| \| \| \| \|	for consistency and to maybe enable more regular expression compaction in the scheduler models. NFCI llvm-svn: 323352
*	[ThinLTO] Add call edges' relative block frequency to per-module summary.	Easwaran Raman	2018-01-24	1	-0/+35
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This allows relative block frequency of call edges to be passed to the thinlink stage where it will be used to compute synthetic entry counts of functions. Reviewers: tejohnson, pcc Subscribers: mehdi_amini, llvm-commits, inglorion Differential Revision: https://reviews.llvm.org/D42212 llvm-svn: 323349
*	[SLP] Fix for PR32086: Count InsertElementInstr of the same elements as shuffle.	Alexey Bataev	2018-01-24	3	-24/+24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: If the same value is going to be vectorized several times in the same tree entry, this entry is considered to be a gather entry and cost of this gather is counter as cost of InsertElementInstrs for each gathered value. But we can consider these elements as ShuffleInstr with SK_PermuteSingle shuffle kind. Reviewers: spatel, RKSimon, mkuper, hfinkel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D38697 llvm-svn: 323348
*	[Hexagon] Run late copy propagation and dead code elimination passes	Krzysztof Parzyszek	2018-01-24	10	-37/+43
\| \| \| \|	llvm-svn: 323346
*	InstSimplify: If divisor element is undef simplify to undef	Zvi Rackover	2018-01-24	2	-0/+32
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: If any vector divisor element is undef, we can arbitrarily choose it be zero which would make the div/rem an undef value by definition. Reviewers: spatel, reames Reviewed By: spatel Subscribers: magabari, llvm-commits Differential Revision: https://reviews.llvm.org/D42485 llvm-svn: 323343
*	[ValueTracking] add recursion depth param to matchSelectPattern	Sanjay Patel	2018-01-24	1	-0/+46
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We're getting bug reports: https://bugs.llvm.org/show_bug.cgi?id=35807 https://bugs.llvm.org/show_bug.cgi?id=35840 https://bugs.llvm.org/show_bug.cgi?id=36045 ...where we blow up the stack in value tracking because other passes are sending in selects that have an operand that is itself the select. We don't currently have a reliable way to avoid analyzing dead code that may take non-standard forms, so bail out when things go too far. This mimics the recursion depth limitations in other parts of value tracking. Unfortunately, this pushes the underlying problems for other passes (jump-threading, simplifycfg, correlated-propagation) into hiding. If someone wants to uncover those again, the first draft of this patch on Phab would do that (it would assert rather than bail out). Differential Revision: https://reviews.llvm.org/D42442 llvm-svn: 323331
*	X86 Tests: Add more sdiv combine cases. NFC	Zvi Rackover	2018-01-24	1	-9/+3160
\| \| \| \| \| \|	Add cases with vector non-splat pow2 contant divider. llvm-svn: 323329
*	Regenerate shuffle sink test	Simon Pilgrim	2018-01-24	1	-28/+39
\| \| \| \|	llvm-svn: 323328
*	Reverted 323321.	Amjad Aboud	2018-01-24	4	-220/+2
\| \| \| \|	llvm-svn: 323326
*	[AArch64] Avoid unnecessary vector byte-swapping in big-endian	Pablo Barrio	2018-01-24	1	-55/+70
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Loads/stores of some NEON vector types are promoted to other vector types with different lane sizes but same vector size. This is not a problem in little-endian but, when in big-endian, it requires additional byte reversals required to preserve the lane ordering while keeping the right endianness of the data inside each lane. For example: %1 = load <4 x half>, <4 x half>* %p results in the following assembly: ld1 { v0.2s }, [x1] rev32 v0.4h, v0.4h This patch changes the promotion of these loads/stores so that the actual vector load/store (LD1/ST1) takes care of the endianness correctly and there is no need for further byte reversals. The previous code now results in the following assembly: ld1 { v0.4h }, [x1] Reviewers: olista01, SjoerdMeijer, efriedma Reviewed By: efriedma Subscribers: aemerson, rengolin, javed.absar, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D42235 llvm-svn: 323325
*	[DebugInfo] Emit DWARF reference for DIVariable 'count' in DISubrange	Sander de Smalen	2018-01-24	1	-0/+50
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch implements the codegen of DWARF debug info for non-constant 'count' fields for DISubrange. This is patch [2/3] in a series to extend LLVM's DISubrange Metadata node to support debugging of C99 variable length arrays and vectors with runtime length like the Scalable Vector Extension for AArch64. It is also a first step towards representing more complex cases like arrays in Fortran. Reviewers: echristo, pcc, aprantl, dexonsmith, clayborg, kristof.beyls, dblaikie Reviewed By: aprantl Subscribers: fhahn, aemerson, rengolin, JDevlieghere, llvm-commits Differential Revision: https://reviews.llvm.org/D41696 llvm-svn: 323323
*	[InstCombine] Introducing Aggressive Instruction Combine pass ↵	Amjad Aboud	2018-01-24	4	-2/+220
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	(-aggressive-instcombine). Combine expression patterns to form expressions with fewer, simple instructions. This pass does not modify the CFG. For example, this pass reduce width of expressions post-dominated by TruncInst into smaller width when applicable. It differs from instcombine pass in that it contains pattern optimization that requires higher complexity than the O(1), thus, it should run fewer times than instcombine pass. Differential Revision: https://reviews.llvm.org/D38313 llvm-svn: 323321
*	[Metadata] Extend 'count' field of DISubrange to take a metadata node	Sander de Smalen	2018-01-24	5	-0/+131
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch extends the DISubrange 'count' field to take either a (signed) constant integer value or a reference to a DILocalVariable or DIGlobalVariable. This is patch [1/3] in a series to extend LLVM's DISubrange Metadata node to support debugging of C99 variable length arrays and vectors with runtime length like the Scalable Vector Extension for AArch64. It is also a first step towards representing more complex cases like arrays in Fortran. Reviewers: echristo, pcc, aprantl, dexonsmith, clayborg, kristof.beyls, dblaikie Reviewed By: aprantl Subscribers: rnk, probinson, fhahn, aemerson, rengolin, JDevlieghere, llvm-commits Differential Revision: https://reviews.llvm.org/D41695 llvm-svn: 323313