bin2llvm

A binary to LLVM translator

Github星跟踪图

The bin2llvm Project Build Status

This is an S2E based binary-to-LLVM
translator. It converts any binary code to LLVM code. The resulting LLVM module
contains functions. Some, control flow details are recovered.

Overview

The idea is to reuse components from S2E to achieve the translation to LLVM.
Rougly, qemu translates from binary to TCG and S2E translates from TCG to LLVM.
Plugins were added to perform the recursive disassembly of the binary. The
raw LLVM code is then fed to a set of external LLVM passes. The purpose of
these step is to add more details about the extracted code, concretely, basic
blocks are grouped in functions.
It is mainly tested on the ARM architecture.
bin2llvm is a best effort tool, it will try to translate as much as possible
and then link the LLVM code in a final file.

Running the Docker image

$ docker pull docker.io/cojocar/bin2llvm
$ # run one example binary
$ docker run --rm -t docker.io/cojocar/bin2llvm /bin/bash -c "/usr/local/bin2llvm/bin/bin2llvm.py --file /usr/local/bin2llvm/bin/ls-example"
$ # run the tests
$ docker run --rm -t docker.io/cojocar/bin2llvm /bin/bash -c "cd /usr/local/bin2llvm/tests; BIN2LLVM_INSTALL_DIR=/usr/local/bin2llvm make;"

How to build, install & run from the source tree

Dependencies

Consult the Dockerfile for the list of dependencies.

Building (outside Docker)

$ ./scripts/setup.sh # this will copy some dependencies in the third_party directory
$ ./scripts/build.sh ../bin2llvm-build
$ ./scripts/install.sh ../bin2llvm-build ../bin2llvm-install

(optionally) Building the Docker image

$ ./scripts/build_docker.sh

This will result in bin2llvm-dev and in bin2llvm-release-squashed images.

Running

$ cd ../bin2llvm-install && ./bin/bin2llvm.py --file ./bin/ls-example
Press Ctrl+C
INFO:bin2llvm:Using /tmp/bin2llvm-W4yJvU as temp_dir
INFO:bin2llvm:Use entry: 0x00009a74
INFO:bin2llvm:Use entry: 0x00009fa8
INFO:bin2llvm:Use entry: 0x0000c470
INFO:bin2llvm:Use entry: 0x0000c4d0
INFO:bin2llvm:Use entry: 0x0000c514
INFO:bin2llvm:Use entry: 0x0000c560
....
INFO:bin2llvm:Use entry: 0x00000000
WARNING:bin2llvm:(passes) crashed with entry: 0x00000000
INFO:bin2llvm:FINAL output is in /tmp/bin2llvm-W4yJvU/final.bc (370 functions)

The final bit code is ${OUT_DIR}/final.bc

Testing

$ cd ./tests && BIN2LLVM_INSTALL_DIR=$(realpath ../../bin2llvm-install) make

See the test directory for more details.


bin2llvm in practice

The following works are using bin2llvm:

主要指标

概览
名称与所有者cojocar/bin2llvm
主编程语言C++
编程语言Python (语言数: 10)
平台
许可证Apache License 2.0
所有者活动
创建于2017-05-13 09:17:36
推送于2018-06-05 12:46:08
最后一次提交2018-06-05 14:43:23
发布数0
用户参与
星数148
关注者数10
派生数18
提交数12
已启用问题?
问题数5
打开的问题数2
拉请求数0
打开的拉请求数0
关闭的拉请求数0
项目设置
已启用Wiki?
已存档?
是复刻?
已锁定?
是镜像?
是私有?