summaryrefslogtreecommitdiffstats
Commit message (Collapse)AuthorAgeFilesLines
* R600/SI: Add an s_mov_b32 to patterns which use the M0RegClassTom Stellard2014-11-212-24/+8
| | | | | | | We need to use a s_mov_b32 rather than a copy, so that CSE will eliminate redundant moves to the m0 register. llvm-svn: 222584
* R600/SI: Emit s_mov_b32 m0, -1 before every DS instructionTom Stellard2014-11-217-41/+30
| | | | | | | | | | | | This s_mov_b32 will write to a virtual register from the M0Reg class and all the ds instructions now take an extra M0Reg explicit argument. This change is necessary to prevent issues with the scheduler mixing together instructions that expect different values in the m0 registers. llvm-svn: 222583
* Per off-list feedback, this API returns the *first* value with a given name, ↵Enrico Granata2014-11-213-5/+5
| | | | | | not the *only* one. Rename it to reflect that llvm-svn: 222582
* R600/SI: Add SIFoldOperands passTom Stellard2014-11-217-28/+266
| | | | | | | This pass attempts to fold the source operands of mov and copy instructions into their uses. llvm-svn: 222581
* [mips][microMIPS] This patch implements functionality in MIPS delay slotJozef Kolek2014-11-215-12/+73
| | | | | | | | | | | filler such as if delay slot filler have to put NOP instruction into the delay slot of microMIPS BEQ or BNE instruction which uses the register $0, then instead of emitting NOP this instruction is replaced by the corresponding microMIPS compact branch instruction, i.e. BEQZC or BNEZC. Differential Revision: http://reviews.llvm.org/D3566 llvm-svn: 222580
* R600/SI: Mark s_mov_b32 and s_mov_b64 as rematerializableTom Stellard2014-11-211-0/+2
| | | | llvm-svn: 222579
* R600/SI: Use hex notation for constant in testTom Stellard2014-11-211-1/+1
| | | | llvm-svn: 222578
* [Hexagon] Adding sxth instruction.Colin LeMahieu2014-11-214-8/+20
| | | | llvm-svn: 222577
* Add an API on SBValueList to find the first value with a given name stored ↵Enrico Granata2014-11-213-0/+31
| | | | | | in the list llvm-svn: 222576
* [Hexagon] Adding sxtb instruction. Renaming some identically named classes ↵Colin LeMahieu2014-11-213-15/+97
| | | | | | that will be removed after converting referencing defs. llvm-svn: 222575
* [Mips] Use endian::read() for relocation addendum readingSimon Atanasyan2014-11-211-3/+5
| | | | llvm-svn: 222574
* [asan] add statistic counter to dynamic alloca instrumentationKostya Serebryany2014-11-211-0/+3
| | | | llvm-svn: 222573
* [ASan] Get fake stack code working with GCC 4.8.2.Jay Foad2014-11-213-12/+18
| | | | | | | | | | | | | | | | | | | | | Summary: TestCases/Linux/heavy_uar_test.cc was failing on my PowerPC64 box with GCC 4.8.2, because the compiler recognised a memset-like loop and turned it into a call to memset, which got intercepted by __asan_memset, which got upset because it was being called on an address in high shadow memory. Use break_optimization to stop the compiler from doing this. Reviewers: kcc, samsonov Reviewed By: kcc Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D6266 llvm-svn: 222572
* [Hexagon] Removing SUB_rr and replacing with A2_sub.Colin LeMahieu2014-11-214-23/+6
| | | | llvm-svn: 222571
* Objective-C ARC. Fixes a crash when checking for 'weak' propery Fariborz Jahanian2014-11-212-0/+16
| | | | | | whose base is not an expression. rdar://19053620 llvm-svn: 222570
* Sema: Don't permit variably modified types in typeidDavid Majnemer2014-11-213-0/+14
| | | | | | | | | | GCC and ICC both reject this and the 'Runtime-sized arrays with automatic storage duration' (N3639) paper forbade this as well. Previously, we would crash on our way to mangling. This fixes PR21632. llvm-svn: 222569
* Implement -Wcast-qual, fixing #13772.Roman Divacky2014-11-214-5/+75
| | | | | | Many thanks to dblaikie for his advices and suggestions! llvm-svn: 222568
* Remove duplication of relocation names in lib/Object/ELFYAML.cppTim Northover2014-11-211-330/+7
| | | | | | | | | | We can now use the ELF relocation .def files to create the mapping of relocation numbers to names and avoid having to duplicate the list of relocations. Patch by Will Newton. llvm-svn: 222567
* Remove duplication of relocation names in lib/Object/ELF.cppTim Northover2014-11-211-742/+14
| | | | | | | | | | We can now use the ELF relocation .def files to create the mapping of relocation numbers to names and avoid having to duplicate the list of relocations. Patch by Will Newton. llvm-svn: 222566
* Split ELF relocation defintions into per-architecture .def filesTim Northover2014-11-2111-763/+826
| | | | | | | | | | This should allow the list of relocations for a particular architecture to be kept in a single header rather than duplicated whenever we need to enumerate all the relocations. Patch by Will Newton. llvm-svn: 222565
* MS ABI: Mangle char16_t and char32_t string literalsDavid Majnemer2014-11-212-10/+11
| | | | | | | | | | | | | | | | We previously had support for char and wchar_t string literals. VS 2015 added support for char16_t and char32_t. String literals must be mangled in the MS ABI in order for them to be deduplicated across translation units: their linker has no notion of mergeable section. Instead, they use the mangled name to make a COMDAT for the string literal; the COMDAT will merge with other COMDATs in other object files. This allows strings in object files generated by clang to get merged with strings in object files generated by MSVC. llvm-svn: 222564
* Debug Info: revert r222195, r222210 and r222239.Manman Ren2014-11-215-81/+7
| | | | | | | This is no longer needed after David's fix at r222377 + r222485. rdar://18958417 llvm-svn: 222563
* Disable header duplication at -Oz in loop-rotate pass.Roman Divacky2014-11-211-1/+2
| | | | llvm-svn: 222562
* Debug Info: add an assertion that the context field of a global variable can notManman Ren2014-11-211-1/+8
| | | | | | | | | | | be a DIType with identifier. This makes sure that there is no need to use DIScopeRef for global variable's context. rdar://18958417 llvm-svn: 222561
* Use isl_schedule_get_ctxTobias Grosser2014-11-211-5/+1
| | | | llvm-svn: 222560
* Fix typoTobias Grosser2014-11-211-1/+1
| | | | llvm-svn: 222559
* [Objective-C] Support a new special module flag that will be put into theManman Ren2014-11-211-1/+2
| | | | | | | | objc_imageinfo struct. rdar://17954668 llvm-svn: 222558
* LazyValueInfo: range'ify some for-loops. No functional change.Hans Wennborg2014-11-211-34/+19
| | | | llvm-svn: 222557
* Add params() to FunctionType. NFC.Rafael Espindola2014-11-212-6/+7
| | | | | | While at it, also use makeArrayRef in elements(). llvm-svn: 222556
* Don't repeat class/function/variable names in comments. NFC.Sanjay Patel2014-11-211-47/+35
| | | | llvm-svn: 222555
* LazyValueInfo: fix some typos and indentation, etc. NFC.Hans Wennborg2014-11-211-10/+12
| | | | llvm-svn: 222554
* Add and use a helper elements() to StructType. NFC.Rafael Espindola2014-11-212-3/+6
| | | | llvm-svn: 222553
* Add an assertion for detecting missed/uncorrected TypoExprs.Kaelyn Takata2014-11-211-0/+2
| | | | llvm-svn: 222552
* Enable ActOnIdExpression to use delayed typo correction for non-C++ codeKaelyn Takata2014-11-219-22/+68
| | | | | | when calling DiagnoseEmptyLookup. llvm-svn: 222551
* Properly correct initializer expressions based on whether they would be valid.Kaelyn Takata2014-11-212-0/+51
| | | | llvm-svn: 222550
* Use the full-Expr filter to disambiguate equidistant correctionKaelyn Takata2014-11-212-26/+52
| | | | | | candidates. llvm-svn: 222549
* Do some cleanup of DumpValueObjectOptions. The whole concept of raw printing ↵Enrico Granata2014-11-213-81/+62
| | | | | | was split between feature-specific flags, and an m_be_raw flag, which then drove some other changes in printing behavior. Clean that up, so that each functionality has its own flag .. oh, and make the bools all go in a bitfield since I may want to add more of those over time llvm-svn: 222548
* Allow multiple -debug-only argsMatthias Braun2014-11-211-6/+17
| | | | | | Debug output is shown if any of the -debug-only arguments match. llvm-svn: 222547
* Less space; NFCSanjay Patel2014-11-211-8/+4
| | | | llvm-svn: 222546
* Fix formatting. NFC.Rafael Espindola2014-11-211-19/+18
| | | | llvm-svn: 222545
* Add a feature flag for slow 32-byte unaligned memory accesses [x86].Sanjay Patel2014-11-216-23/+65
| | | | | | | | | | | | | | This patch adds a feature flag to avoid unaligned 32-byte load/store AVX codegen for Sandy Bridge and Ivy Bridge. There is no functionality change intended for those chips. Previously, the absence of AVX2 was being used as a proxy to detect this feature. But that hindered codegen for AVX-enabled AMD chips such as btver2 that do not have the 32-byte unaligned access slowdown. Performance measurements are included in PR21541 ( http://llvm.org/bugs/show_bug.cgi?id=21541 ). Differential Revision: http://reviews.llvm.org/D6355 llvm-svn: 222544
* Revert "Allow FDE references outside the +/-2GB range supported by PC ↵Duncan P. N. Exon Smith2014-11-211-11/+0
| | | | | | | | | | | | | | | | | relative offsets for code models other than small/medium. For JIT application, memory layout is less controlled and can result in truncations otherwise." This reverts commit r222538. It's causing test failures for CFI, at least on Darwin: http://lab.llvm.org:8080/green/job/clang-stage1-cmake-RA-incremental/1189/ http://lab.llvm.org:8080/green/job/clang-stage1-configure-RA_check/1391/ Note that the previous incremental build was on r222537, and the CFI tests weren't failing: http://lab.llvm.org:8080/green/job/clang-stage1-cmake-RA-incremental/1188/ llvm-svn: 222542
* Extend PipePosix with child_processes_inherit support - to control whether ↵Oleksiy Vyalov2014-11-213-3/+40
| | | | | | | | pipe handles should be inherited by a child process. http://reviews.llvm.org/D6348 llvm-svn: 222541
* [x86] Restructure the checking patterns for v16 and v32 avx2 vectorChandler Carruth2014-11-214-64/+47
| | | | | | | | | | shuffle lowering to allow much better blend matching. Specifically, with the new structure the code seems clearer to me and we correctly can hit the cases where merging two 128-bit lanes is a clear win and can be shuffled cheaply afterward. llvm-svn: 222539
* Allow FDE references outside the +/-2GB range supported by PC relativeJoerg Sonnenberger2014-11-211-0/+11
| | | | | | | | | | | | offsets for code models other than small/medium. For JIT application, memory layout is less controlled and can result in truncations otherwise. Patch from Akos Kiss. Differential Revision: http://reviews.llvm.org/D6079 llvm-svn: 222538
* [x86] Make the previous logic significantly less conservative and getChandler Carruth2014-11-213-94/+42
| | | | | | | | | | | | | a bunch more improvements. Non-lane-crossing is fine, the key is that lane merging only makes sense for single-input shuffles. Not sure why I got so turned around here. The code all works, I was just using the wrong model for it. This only updates v4 and v8 lowering. The v16 and v32 lowering requires restructuring the entire check sequence. llvm-svn: 222537
* [DAG] Teach how to turn a build_vector into a shuffle if some of the ↵Andrea Di Biagio2014-11-212-23/+88
| | | | | | | | | | | | | | operands are zero. Before this patch, the DAGCombiner only tried to convert build_vector dag nodes into shuffles if all operands were either extract_vector_elt or undef. This patch improves that logic and teaches the DAGCombiner how to deal with build_vector dag nodes where one or more operands are zero. A build_vector dag node with some zero operands is turned into a shuffle only if the resulting shuffle mask is legal for the target. llvm-svn: 222536
* [ASan] Allow the users of SymbolizationLoop to make use of the --dsym_hint ↵Alexander Potapenko2014-11-211-11/+39
| | | | | | | | | | option in llvm-symbolizer Let the users of SymbolizationLoop define a function that produces the list of .dSYM hints (possible path to the .dSYM bundle) for the given binary. Because the hints can't be added to an existing llvm-symbolizer process, we spawn a new symbolizer process ones each time a new hint appears. Those can only appear for binaries that we haven't seen before. llvm-svn: 222535
* clang-format: Understand more lambda return types.Daniel Jasper2014-11-213-1/+10
| | | | | | | | | | Before: auto a = [&b, c ](D * d) -> D * {} After: auto a = [&b, c](D* d) -> D* {} llvm-svn: 222534
* [x86] Teach the x86 vector shuffle lowering to detect mergable 128-bitChandler Carruth2014-11-218-205/+236
| | | | | | | | | | | | | | | | | | | lanes. By special casing these we can often either reduce the total number of shuffles significantly or reduce the number of (high latency on Haswell) AVX2 shuffles that potentially cross 128-bit lanes. Even when these don't actually cross lanes, they have much higher latency to support that. Doing two of them and a blend is worse than doing a single insert across the 128-bit lanes to blend and then doing a single interleaved shuffle. While this seems like a narrow case, it kept cropping up on me and the difference is *huge* as you can see in many of the test cases. I first hit this trying to perfectly fix the interleaving shuffle patterns used by Halide for AVX2. llvm-svn: 222533
OpenPOWER on IntegriCloud