As stated in the high-level goals, the first release aims at being a Minimum Viable Product (MVP). This means that there are important features we know we want and need, but are post-MVP; these are in a separate essential post-MVP features document. The MVP will contain features which are available today in modern web browsers and which perform well even on mobile devices, which leads to roughly the same functionality as asm.js.
This document explains the contents of the MVP at a high-level. There are also separate docs with more precise descriptions of:
- Polyfill to JavaScript;
- AST semantics;
- Binary encoding;
- Implementation in the browser and outside the browser.
Note: This content is still in flux and open for discussion.
- The primary unit of loadable, executable code is a module.
- A module can declare a subset of its functions and global variables to be
exports. The meaning of exports (how and when they are called) is defined
by the host environment. For example,
_start
andinit
can be the only meaningful exports. - A module can declare a set of imports. An import is a tuple containing a module name, export name, and the type to use for the import within the module. The host environment controls the mapping from module name to which module is loaded.
- The spec defines the semantics of loading and calling exports of a single
module. The meaning of a call to an import is defined by the host environment.
- In a minimal shell environment, imports could be limited to builtin modules (implemented by the shell) and/or shell scripts.
- The dynamic linking post-MVP feature would extend the semantics to include multiple modules and thus allow heap and pointer sharing. Dynamic linking would be semantically distinct from importing, though.
- When compiling from C++, imports would be generated for unresolved
extern
functions and calls to thoseextern
functions would call the import. - Host environments can define builtin modules that are implemented natively but
can otherwise be imported like other modules. As examples:
- A WebAssembly shell might define a builtin
stdio
library with an exportputs
. - In the browser, the WebIDL support mentioned in future features.
- A WebAssembly shell might define a builtin
- Any ABI for statically linked libraries will be specific to your source language compiler. In the future, standard ABIs may be defined to allow for compatibility between compilers and versions of compilers.
- TODO: there is more to discuss here concerning APIs.
- At the top level, a module is ELF-like: a sequence of sections which declare their type and byte-length.
- Sections with unknown types would be skipped without error.
- Standardized section types:
- module import section;
- globals section (constants, signatures, variables);
- code section;
- heap initialization section.
- The code section begins with a table of functions containing the signatures and offsets of each function followed by the list of function bodies. This allows parallel and streaming decoding, validation and compilation.
- A function body consists of a set of typed variable bindings and an AST closed under these bindings.
- The Abstract Syntax Tree is composed of two primary kinds of nodes: statements and expressions.
- Control flow is structured (no
goto
).
- A binary format provides efficiency: it reduces download size and accelerates decoding, thus enabling even very large codebases to have quick startup times. Towards that goal, the binary format will be natively decoded by browsers.
- The binary format has an equivalent and isomorphic text format. Conversion from one format to the other is both straightforward and causes no loss of information in either direction.
The text format provides readability to developers, and is isomorphic to the binary format.
- In the MVP, when a WebAssembly module is loaded, it creates a new heap which isn't directly accessible from other modules.
- The dynamic linking feature will be necessary for two WebAssembly modules to share the same heap.
- Modules can specify heap size and initialization data (
data
,rodata
,bss
) in the heap-initialization section. - Modules can specify whether the heap is growable (via
sbrk
). - Modules can optionally export the heap, allowing it to be aliased by the
embedder, such as JavaScript:
- JavaScript sees the exported heap as an
ArrayBuffer
. - To keep an
ArrayBuffer
's length immutable, resizing a module's heap detaches any existentArrayBuffer
.
- JavaScript sees the exported heap as an
- See the AST Semantics heap section for more details.
WebAssembly MVP will be no looser from a security point-of-view than if the module was JavaScript or asm.js.