|  | Commit message (Collapse) | Author | Age | Files | Lines | 
|---|
| | 
| 
| 
| 
| 
| | Fixes PR21607
llvm-svn: 222385 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| | symbolizer.
FYI, removed the unused MCInstrAnalysis as it does not exist for 64-bit ARM and
was causing a “couldn't initialize disassembler for target” error.
llvm-svn: 222045 | 
| | 
| 
| 
| 
| 
| 
| | Don't assert if we can return an error code, reuse existing
functionality like is64Bit().
llvm-svn: 221915 | 
| | 
| 
| 
| | llvm-svn: 221782 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | With this patch MCDisassembler::getInstruction takes an ArrayRef<uint8_t>
instead of a MemoryObject.
Even on X86 there is a maximum size an instruction can have. Given
that, it seems way simpler and more efficient to just pass an ArrayRef
to the disassembler instead of a MemoryObject and have it do a virtual
call every time it wants some extra bytes.
llvm-svn: 221751 | 
| | 
| 
| 
| 
| 
| | Thanks to Aaron Ballman for noticing this!
llvm-svn: 221696 | 
| | 
| 
| 
| | llvm-svn: 221502 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | add the code and test cases for 32-bit ARM symbolizer.
Also fixed the printing of data in code as it was not using the table correctly
and needed to fix one of the test cases too.
This will break lld’s test/mach-o/arm-interworking-movw.yaml till the tweak
for that is made. Which I’ll be committing immediately after this commit.
llvm-svn: 221470 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| | DiceTableEntry is 24 bytes on my machine, it's probably better to pass
them by reference.
This fixes PR21464.
llvm-svn: 221247 | 
| | 
| 
| 
| 
| 
| | symbolizer.
llvm-svn: 221211 | 
| | 
| 
| 
| | llvm-svn: 220875 | 
| | 
| 
| 
| | llvm-svn: 220833 | 
| | 
| 
| 
| | llvm-svn: 220518 | 
| | 
| 
| 
| 
| 
| | Should fix the build bot issues from commit r220500.
llvm-svn: 220504 | 
| | 
| 
| 
| 
| 
| 
| | This prints disassembly comments for Objective-C references to CFStrings,
Selectors, Classes and method calls.
llvm-svn: 220500 | 
| | 
| 
| 
| | llvm-svn: 219945 | 
| | 
| 
| 
| 
| 
| | bad library ordinals
llvm-svn: 219746 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | There are two methods in SectionRef that can fail:
* getName: The index into the string table can be invalid.
* getContents: The section might point to invalid contents.
Every other method will always succeed and returning and std::error_code just
complicates the code. For example, a section can have an invalid alignment,
but if we are able to get to the section structure at all and create a
SectionRef, we will always be able to read that invalid alignment.
llvm-svn: 219314 | 
| | 
| 
| 
| | llvm-svn: 218649 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | stubs.
So in fully linked images when a call is made through a stub it now gets a
comment like the following in the disassembly:
    callq	0x100000f6c             ## symbol stub for: _printf
indicating the call is to a symbol stub and which symbol it is for.  This is
done for branch reference types and seeing if the branch target is in a stub
section and if so using the indirect symbol table entry for that stub and
using that symbol table entries symbol name.
llvm-svn: 218546 | 
| | 
| 
| 
| 
| 
| | accepts a const data pointer. This silences a -Wcast-qual warning.
llvm-svn: 218454 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | files to
get the literal string “Hello world” printed as a comment on the instruction
that loads the pointer to it. For now this is just for x86_64. So for object
files with relocation entries it produces things like:
	leaq	L_.str(%rip), %rax      ## literal pool for: "Hello world\n"
and similar for fully linked images like executables:
	leaq	0x4f(%rip), %rax        ## literal pool for: "Hello world\n"
Also to allow testing against darwin’s otool(1), I hooked up the existing 
-no-show-raw-insn option to the Mach-O parser code, added the new Mach-O
only -full-leading-addr option to match otool(1)'s printing of addresses and
also added the new -print-imm-hex option.
llvm-svn: 218423 | 
| | 
| 
| 
| 
| 
| | getLibraryShortNameByIndex() error handling.
llvm-svn: 217930 | 
| | 
| 
| 
| | llvm-svn: 217909 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | First step done in this commit is to get flush out enough of the
SymbolizerGetOpInfo() routine to symbolic an X86_64 hello world .o and
its loading of the literal string and call to printf.  Also the code to
symbolicate the X86_64_RELOC_SUBTRACTOR relocation and a test is also
added to show a slightly more complicated case.
Next will be to flush out enough of SymbolizerSymbolLookUp() to get the
literal string “Hello world” printed as a comment on the instruction that load
the pointer to it.
llvm-svn: 217893 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | This finishes the ability of llvm-objdump to print out all information from
the LC_DYLD_INFO load command.
The -bind option prints out symbolic references that dyld must resolve 
immediately.
The -lazy-bind option prints out symbolc reference that are lazily resolved on 
first use.
The -weak-bind option prints out information about symbols which dyld must
try to coalesce across images.
llvm-svn: 217853 | 
| | 
| 
| 
| | llvm-svn: 217724 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| | Similar to my previous -exports-trie option, the -rebase option dumps info from
the LC_DYLD_INFO load command. The rebasing info is a list of the the locations
that dyld needs to adjust if a mach-o image is not loaded at its preferred 
address. Since ASLR is now the default, images almost never load at their
preferred address, and thus need to be rebased by dyld.
llvm-svn: 217709 | 
| | 
| 
| 
| | llvm-svn: 217433 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| | executable Mach-O files.
This adds the printing of more load commands, so that the normal load commands
in a typical X86 Mach-O executable can all be printed.
llvm-svn: 217172 | 
| | 
| 
| 
| | llvm-svn: 217005 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | The code is buggy and barely tested. It is also mostly boilerplate.
(This includes MCObjectDisassembler, which is the interface to that
functionality)
Following an IRC discussion with Jim Grosbach, it seems sensible to just
nuke the whole lot of functionality, and dig it up from VCS if
necessary (I hope not!).
All of this stuff appears to have been added in a huge patch dump (look
at the timeframe surrounding e.g. r182628) where almost every patch
seemed to be untested and not reviewed before being committed.
Post-review responses to the patches were never addressed. I don't think
any of it would have passed pre-commit review.
I doubt anyone is depending on this, since this code appears to be
extremely buggy. In limited testing that Michael Spencer and I did, we
couldn't find a single real-world object file that wouldn't crash the
CFG reconstruction stuff. The symbolizer stuff has O(n^2) behavior and
so is not much use to anyone anyway. It seemed simpler to remove them as
a whole. Most of this code is boilerplate, which is the only way it was
able to scrape by 60% coverage.
HEADSUP: Modules folks, some files I nuked were referenced from
include/llvm/module.modulemap; I just deleted the references. Hopefully
that is the right fix (one was a FIXME though!).
llvm-svn: 216983 | 
| | 
| 
| 
| | llvm-svn: 216931 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | MachOObjectFile in lib/Object currently has no support for parsing the rebase, 
binding, and export information from the LC_DYLD_INFO load command in final 
linked mach-o images. This patch adds support for parsing the exports trie data
structure. It also adds an option to llvm-objdump to dump that export info.
I did the exports parsing first because it is the hardest. The information is 
encoded in a trie structure, but the standard ObjectFile way to inspect content 
is through iterators. So I needed to make an iterator that would do a 
non-recursive walk through the trie and maintain the concatenation of edges 
needed for the current string prefix.
I plan to add similar support in MachOObjectFile and llvm-objdump to 
parse/display the rebasing and binding info too.
llvm-svn: 216808 | 
| | 
| 
| 
| 
| 
| 
| | This adds the printing of the LC_SEGMENT load command and sections,
LC_SYMTAB and LC_DYSYMTAB load commands.
llvm-svn: 216795 | 
| | 
| 
| 
| 
| 
| 
| 
| | reason.
The switch statement would never fire due to the preceding break statement. Also, the switch statement has a default label with no case labels. Simplified the code, and allow it to execute.
llvm-svn: 216346 | 
| | 
| 
| 
| 
| 
| 
| 
| | Mach-O files.
This adds the printing of the mach header. Load command printing will be next.
llvm-svn: 216285 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | Owning the buffer is somewhat inflexible. Some Binaries have sub Binaries
(like Archive) and we had to create dummy buffers just to handle that. It is
also a bad fit for IRObjectFile where the Module wants to own the buffer too.
Keeping this ownership would make supporting IR inside native objects
particularly painful.
This patch focuses in lib/Object. If something elsewhere used to own an Binary,
now it also owns a MemoryBuffer.
This patch introduces a few new types.
* MemoryBufferRef. This is just a pair of StringRefs for the data and name.
  This is to MemoryBuffer as StringRef is to std::string.
* OwningBinary. A combination of Binary and a MemoryBuffer. This is needed
  for convenience functions that take a filename and return both the
  buffer and the Binary using that buffer.
The C api now uses OwningBinary to avoid any change in semantics. I will start
a new thread to see if we want to change it and how.
llvm-svn: 216002 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| | file with -macho, the Mach-O specific object file parser option.
After some discussion I chose to do this implementation contained in the logic
of llvm-objdump’s MachODump.cpp using a second disassembler for thumb when
needed and with updates mostly contained in the MachOObjectFile class.
llvm-svn: 215931 | 
| | 
| 
| 
| 
| 
| | same time. NFC.
llvm-svn: 215643 | 
| | 
| 
| 
| | llvm-svn: 215437 | 
| | 
| 
| 
| 
| 
| | Third time lucky. This should finally fix the ARM (& MIPS, I think) bots.
llvm-svn: 215349 | 
| | 
| 
| 
| | llvm-svn: 215198 | 
| | 
| 
| 
| 
| 
| 
| 
| | ARM bots (& others, I think, now that I look) were failing because we
were using incorrect printf-style format specifiers. They were wrong
on almost any platform, actually, just mostly harmlessly so.
llvm-svn: 215196 | 
| | 
| 
| 
| 
| 
| 
| | Also make the disassembler created with the Mach-O parser (the -m option)
pick up the Target specific attributes specified with -mattr option.
llvm-svn: 215032 | 
| | 
| 
| 
| | llvm-svn: 214509 | 
| | 
| 
| 
| 
| 
| | This makes using a std::unique_ptr in the caller more convenient.
llvm-svn: 214433 | 
| | 
| 
| 
| | llvm-svn: 214377 | 
| | 
| 
| 
| | llvm-svn: 212405 | 
| | 
| 
| 
| 
| 
| 
| 
| | This makes the buffer ownership on error conditions very natural. The buffer
is only moved out of the argument if an object is constructed that now
owns the buffer.
llvm-svn: 211546 |