- PR#700 LLVM-4.0 is now supported
- PR#688 Primus - the microexecution framework
- PR#688 BIL type checker and normalizer
- PR#688 New constant folding with effect analysis
- PR#688 SSA transformation plugin
- PR#688 Dead code elimination plugin
- PR#688 New pretty-printer for BIL and bitvectors
- PR#684 Support for cmdliner 1.0
- PR#682 Severely reduces memory consumption
- PR#680 New thin bitvector representation
- PR#663 Adds topic tags to bundle
- PR#654 New OGRE based loader that supports:
- Linux Kernel Modules
- MachO Kernel Bundles (LLVM 3.8+ only)
- Plain object files
- Shared libraries
- PR#630 Enhancments in IDA plugin
- PR#688 Type errors in the x86 lifter
- PR#688 Type errors in the ARM lifter
- PR#688 Fixes constant folding
- PR#672 Fixes demangler plugin
- PR#665 Fixes symbol cases in objdump
- PR#644 Robustness fixes in x86 lifter
- PR#609 compilation with ocaml 4.04.0
- PR#621 don't store api files in api plugin
- PR#627 beagle - obfuscated string solver
- binary release (deb, rpm, tgs)
- c-bindings
- PR#586 segfault with short or damaged files fed to bap.
- PR#590 llvm 3.8 specific issues
- PR#592 a bug in lifting x86 PSHUFD/PSHUFB instructions
- PR#595 bap exit status
- PR#596 most of the compilation warnings
- PR#593 bapbundle: it is no longer needed to specify the .plugin extension
- PR#597 API pass will stop processing in case of the error
- PR#599 print backtraces from passes
- PR#600 documented memory interface
- A powerful plugin system
- Split Bap.Std into several libraries.
- The disassembler layer is severly rewritten
- Made project storable and loadable
- Added new injection points
- Added BIL interpreters
- Removed bap-server
- New python interface (see https://BinaryAnalysisPlatform/bap-python)
- New ida integration, that works in both directions
- Multipass disassembling
- llvm-3.8 support (#546)
- new x86 lifter (#549)
- new testsuite with functional tests (#520)
- extensible API/ABI (#448)
-
Graphlib library
Graphlib
is a generic library that extends a well known OCamlGraph library.Graphlib
uses its own, more reach and modern,Graph
interface that is isomorphic to OCamlGraph'sSigs.P
signature for persistant graphs. This interface is developed according to the Janestreet's style guidlines and depends on Core_kernel library. Other than the new interface,Graphlib
provides several graph implementations, and generic algorithms. To make our algorithms polymorphic over chosen graph representation we use first-class modules, instead of functors, that makes library syntactically more light weight. -
Refined IR.
phi and arg terms were refined. a phi term now is a discriminated set of expressions, and arguments are made more like a defintions.
-
SSA form
A transformation to a semipruned SSA form was added to a library
-
Changed Var interface.
Variables can now be versioned, that plays well with SSA form. A version is used to represent the same variable but at different point of time.
-
Extended and fixed Trie data structure
Several bug-fixes to a Trie data structure were made. Added
walk
function, that allows to perform generic queries over a trie. Also added prefix and suffix variants of String tries, as well as provided a functor to create tries with an array keys. -
Redesigned
abi
interface.Now
abi
works on an IR level. -
Added argument reconstruction algorithms.
Two argument reconstructions algorithm, based on a new
abi
interface, were added - for ARM gnueabi ABI and System V amd64 ABI. Currently only parameters passed via registers are reconstructed. -
Added
free_vars
algorithm to BIL and BIR terms.
-
BAP IR is introduced
BAP Intermediate Representation is based on BAP Instruction Language and is a semigraphical representation of a program.
See documentation and following PR's for more information. a2a4621df7c5b25d85c04665732423992e8def98 74cdee48818225e8b43d39803c97471903ef6d1f
-
Refactored structure of the Project Module
Project
now a proper entry point to the library. Many stuff from bap utility moved there. See 96bd334a0d8af17a6dfd21eff9ec710d448f13e8 for more details.This is a breaking change. It hides
project
record and removes access to some information, that was previously marked as deprecated:- symbols as a mapping from memory to string
- base as a memory.
Instead of old symbols table we now have a better interface, see below. Instead of base, we now represent all memory as an interval map (Memmap).
-
New model for symbols
Previosly symbols were modeled as contiguous chunk of memory, marked with name. Moreover, data sharing between different symbols weren't allowed. Since this release, symbols can be a noncontiguous, and share data. A new interface is implemented in
Symtab
module. -
Plugins dependency and autoloading
Plugins now can now specify dependencies to other plugins, that may be auto-loaded by the library. See db2a175ba8e6708753a06a2428940c857a1910ec
-
Extended BIL helpers See 65f472c08d27020a6570b7992b93397346251d1e
-
Exposed ELF library
-
Fixed segment/section/region name hell See 9a574498392c6a13606c9d202037daf137bb780c
-
New universal values library
The library is based on Core_kernel's Univ, but with addition of serialization, comparison and pretty-printing. See 383003d60baa3434dd4cd8c894e1d8c2e889b4a2
-
Added bap-fsi-benchmark utility
80382114f395bcf45925ae2e4bc5b9aac5bba4e7
-
Fixed BIL piqi serialization
2a5c4671468c5a2699b6007a8af3fda8867e8eb8
- Fixed installation on more recent ubuntu
By defaulting LLVM version to 3.4 (and more clever searching procedure)
- Lot's of bugfixes and small extensions
-
BAP structure is refactored
Complex hierarchy is now hidden under one umbrella
bap.mli
.Bap_*
modules are marked as internal and is no more installable and, thus, they do not polute the namespace. This will of course, break the code that used this internal modules. It is intended behavior. -
New documentation generator
camlp4
messes withmli
files, rendering autogenerated API unreadable. To mitigate this, a smallbapdoc
tool was written that preprocess file and removes all syntax extensions, and then runocamldoc
on a preprocessed file. The tool was integrated with build system. -
Disabled peer checking when downloading signatures
-
Fixed assembly pretty-printing
-
Fixed reading PE-64
-
Fixed
Block.dfs
exhaustiveness issue. AlsoBlock.dfs
now has anorder
parameter, allowing to choose between post-order and pre-order traversal. -
Fixed
to_graph
issue, i.e., #181 -
Fixed
bapbuild
double linking, see #193 -
Ordered block destinations, see #191
-
Fixed an issue in x86 lifter, see #198
-
Fixed interaction with IDA, see #189
-
New loader backed with LLVM BAP now have another loader (image reader), that supports MACH-O, ELF, COFF, PE. This loader is backed with LLVM library.
-
Online plugin system
New extension point is added - "bap.project". Plugins marked with this plugin system will not be loaded automatically when
Plugins.load
is called, instead, they can be loaded dynamically (or online, hence the title), by using-l
option to thebap
utility. After being loaded the plugin is applied to aproject
data structure that contains all information about disassembled binary. Plugin can functionally update this data structure, to push information to other plugins or back to thebap
utility.In addition to a common way of creating plugins with
oasis
, we extendedbapbuild
utility with a new rule the will product aplugin
file. This is just a shared library underneath the hood, and you can load a plugin, created with this method directly, without installing it anywhere.bap
utility will try to find the plugin, specified with-l
option in a current folder, then in all folders specified inBAP_PLUGIN_PATH
environment variable, and, finally in the system, usingocamlfind
.In order to provide a typesafe way of interacting between plugins, we added extensible variants to BAP. But instead of using one from the 4.02, we're using universal types, based on that one, that Core library provides. First of all this is more portable, second it is more explicit and a little bit more safe.
-
New ABI and CPU interfaces
Modules that implements
CPU
interface are used to describe particular CPU in BIL terminology, e.g., it tells which variable corresponds to which register, flag, etc. To obtain such module, one should usetarget_of_cpu
function.ABI is used to capture the procedure abstraction, starting from calling convetions and stack frame structure and ending with special function handling and support for different data-types.
See d5cab1a5e122719b4a3b1ece2b1bc44f3f93095a for more information and examples.
-
Bap-objdump renamed to bap
bap-objdump has outgrown its name. Actually it was never really a bap-objdump at all. From now, it is just an entry point to the
bap
as platform. We will later unitebap
with other utilities, to make them subcommands, e.g.bap byteweight
. -
Cleanup of BIL modules
Now there is a separation between BIL fur uns, and BIL fur OCaml. For writing BIL programs (as EDSL in OCaml) one should use
Bil
module, e.g.Bil.(x = y)
will evaluate to a BIL expression. For using BIL entities as OCaml values, one should use corresponding module, e.g.Exp.(x = y)
will compare to expressions and evaluate to a value of typebool
. -
Enhanced IDA integration
IDA intergation is now more robust. We switched to
IDA-32
by default, since 64-bit version doesn't support decompiler. Alsobap
utility can now output IDA python scripts. Andbap
plugins can annotate project withpython
commands, that later will be dumped into the script. -
In ARM switched to ARMv7 by default
-
Introduce LNF algorithm and Sema library
A new layer of BAP is started in this release. This would be a third pass of decompilation, where the semantic model of program will be built. Currently, there is nothing really interesting here, e.g., an implementation of the Loop nesting forest, that is not very usable right now. But the next release, will be dedicated to this layer. So, stay tuned.
-
Add support for OCamlGraph
Now we provide a helper utilities for those who would like to use ocamlgraph library for analysis.
-
Extended bap-mc utility
bap-mc
utility now prints results in plethora of formats,
including protocol buffers, from the piqi library, that was revived
by Kenneth Miller.
- Interval trees, aka memory maps
For working with arbitrary overlapping memory regions we now have a memory map data structure, aka interval trees, segment trees, etc. It is based on AVL trees, and performs logarithmic searches.
- Simplified CI
We put Travis on a diet. Now only 4 machines with 20 ETA for all test suites to pass. (Instead of 8 * 40).
- removed tag warnings from the ocamlbuild
- fixed #114
- moved Bap_plugins out of Bap library
- plugin library can now load arbitrary files
- bap-objdump is now pluggable
- added new extension point in the plugin system
- updated BAP LICENSE, baptop is now QPLed
- IDA can now work in a headless mode
- enhanced symbol resolution algorithm
- cleaned up image backend interface
- constraint OPAM file
- x86 and x86_64 lifter #106
- New byteweight implementation #99
- Intra-procedure CFG reconstruction #102
- IDA integration #103
- Binary release #108
- Man pages and documentation #107
- Unconstraint opam file and extended it with system dependents #109
-
Bitvector (aka Word, aka Addr) now provides all Integer interface without any monads right at the toplevel of the module. In other words, now you can write: Word.(x + y).
-
Bitvector.Int is renamed to Bitvector.Int_exn so that it don't clobber the real Int module
-
All BIL is now consolidated in one module named Bil. This module contains everything, including constructors for statements, expressions casts, binary and unary operations. It also includes functional constructors, that are now written by hand and, thus, don't suffer from syntactic clashes with keywords. There're also a plenty of other functions and new operators, available from the new Bap_helpers module, see later. Old modules, like Expr, Stmt, etc are still available, they implement Regular interface for corresponding types.
-
New feature: visitor classes to traverse and transform the AST. Writing a pattern matching code every time you need to traverse or map the BIL AST is error prone and time-consuming. This visitors, do all the traversing for you, allowing you to override default behavior. Some handy algorithms, that use visitors are provided in an internal Bap_helpers module, that is included into resulting Bil module. Several optimizations were added to bap-objdump utility, like constant propogation, inlining, pruning unused variables and resolving addresses to symbols.
-
Insn interface now provides predicates to query insn classes, this predicates use BIL if available.
-
Disam interface now provides linear_sweep function.
- Recursive descent disassembler
- High-level simple to use interface to BAP
- New utility
bap-objdump
- Enhanced pretty-printing
- Lots of small fixes and new handy functions
- Automatically generated documentation.
First release of a new BAP.