This specification provides an API for interoperability with native binary data. It defines a generic fixed-length buffer type, as well as accessor types that allow access to the data stored within the buffer.
The functionality described here originated in the WebGL specification [WEBGL].
This document is an editor's draft. Do not cite this document as other than work in progress. Public discussion of this specification is welcome on the (archived) WebGL mailing list public_webgl@khronos.org (see instructions).
The latest version of this document, including all revision history, may be obtained via Subversion:
svn co https://cvs.khronos.org/svn/repos/registry/trunk/public/typedarray
ECMAScript [ECMA-262] has traditionally been used in contexts where there is no access to binary data. Where binary data has needed to be manipulated, it is often stored as a String and accessed using charCodeAt(), or stored as an Array with conversion to and from base64 for transmission. Both of these methods are slow and error-prone. For example, reading binary data as 32-bit integers requires manual conversion of 4 source bytes to and from the target type. Reading floating-point data is even more expensive.
As web applications gain access to new functionality, working with binary data has become a much-demanded feature. Current specifications such as the File API [FILEAPI] and Web Sockets [WEBSOCKETS] would benefit from being able to read and write binary data directly in its native form. Specifications such as WebGL [WEBGL] require this functionality to meet acceptable performance characteristics.
This specification defines a minimal set of functionality for accessing binary data from ECMAScript.
This specification defines an ArrayBuffer type, representing a generic fixed-length binary buffer. It is not possible to manipulate the contents of an ArrayBuffer directly. Instead, a group of types are used to create views of the ArrayBuffer. For example, to access the buffer as an array of 32-bit signed integers, an Int32Array would be created that refers to the ArrayBuffer.
Multiple typed array views can refer to the same ArrayBuffer, of different types, lengths, and offsets. This allows for complex data structures to be built up in the ArrayBuffer. As an example, given the following code:
// create an 8-byte ArrayBuffer
var b = new ArrayBuffer(8);
// create a view v1 referring to b, of type Int32, starting at
// the default byte index (0) and extending until the end of the buffer
var v1 = new Int32Array(b);
// create a view v2 referring to b, of type Uint8, starting at
// byte index 2 and extending until the end of the buffer
var v2 = new Uint8Array(b, 2);
// create a view v3 referring to b, of type Int16, starting at
// byte index 2 and having a length of 2
var v3 = new Int16Array(b, 2, 2);
The following buffer and view layout is created:
| var | index | |||||||
|---|---|---|---|---|---|---|---|---|
| bytes (not indexable) | ||||||||
| b = | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 |
| indices | ||||||||
| v1 = | 0 | 1 | ||||||
| v2 = | 0 | 1 | 2 | 3 | 4 | 5 | ||
| v3 = | 0 | 1 | ||||||
This defines an 8-byte buffer b, and three views of
that buffer, v1, v2, and v3. Each of
the views refers to the same buffer -- so v1[0] refers
to bytes 0..3 as a signed 32-bit integer, v2[0] refers
to byte 2 as a unsigned 8-bit integer, and v3[0] refers
to bytes 2..3 as a signed 16-bit integer. Any modification to
one view is immediately visible in the other: for example,
after v2[0] = 0xff; v2[1] = 0xff; then v3[0] ==
-1 (where -1 is represented as 0xffff).
Computer memory is fundamentally organized as a linear series of numbers. On all contemporary computers, each of these values is an 8-bit binary number (a "byte"). Larger numbers are represented by interpreting multiple bytes at a time as a single value. For example, this specification discusses 16-bit and 32-bit signed and unsigned integers, and 32-bit and 64-bit floating-point numbers.
There exists an ambiguity in the interpretation of such multi-byte values: specifically, which order the bytes are assembled. Little-endian architectures treat the byte with the lowest address as the least significant byte; big-endian architectures treat the byte with the lowest address as the most significant byte. More possibilities exist, but are not commonly used. Little-endian architectures are currently used in most consumer computing devices, but big-endian architectures are still used in important use cases.
The following diagram illustrates the difference in storage of values between little-endian and
big-endian architectures. If the value 305419896 (0x12345678 in
hexadecimal) were stored into an ArrayBuffer using a Uint32Array, then depending on the host
computer's endianness, the component bytes of the ArrayBuffer would contain the following (organized
from low address to high address):
| Little-endian: | 78 | 56 | 34 | 12 |
| Big-endian: | 12 | 34 | 56 | 78 |
In this specification, differences in endianness only become apparent in certain situations: for example, when overlaying multiple types of views on the same region of an ArrayBuffer. The following normative rules apply to implementations of the typed array views.
The typed array view types operate with the endianness of the host computer.
The DataView type operates upon data with a specified endianness (big-endian or little-endian).
The typed array view types (Uint8Array, Float32Array, etc.) are designed for in-memory assembly of large blocks of data to be sent to the graphics card, audio system, etc. For these use cases, it is required to use the native endianness of the host machine when writing the data to main memory. If this specification had mandated for consistency that the typed array view types used a specified endianness (for example, little-endian), the data would either be misinterpreted by the graphics card on some processor architectures (in this example, big-endian architectures), or impose an unacceptably high performance overhead on such architectures.
When receiving data from other computers, or reading files from disk, it is critical to specify the byte order of the data so that it can be properly interpreted regardless of the endianness of the host computer. Essentially every file format or network protocol in existence has a clearly specified data format, including the byte order of all multi-byte data contained in the file or network payload. The DataView view type is designed for input/output tasks and therefore operates upon data with a specified byte order. The host computer's endianness is irrelevant when working with DataView; it always reads or writes data assuming that data is stored with a particular endianness.
This article on the design and usage of the typed array views may provide more background and in-depth discussion.
The specification below implicitly references certain type conversion rules; for example,
conversion of floating-point values to integer values of various sizes. The Web IDL
specification [WEBIDL] defines these rules, and references the
ECMA-262 specification [ECMA-262] for conversion algorithms such
as ToInt32.
The Web IDL specification does not currently define all of the numerical types referenced in
this specification; for example, byte, which is a signed 8-bit integer type. For
these types, the rules for the type of the closest size and signedness shall be extrapolated.
As a hint to implementors of the algorithms in the ECMA-262 specification, conversion of floating-point numbers to integer values uses the truncate, or round-to-zero, rounding mode.
When the not-a-number (NaN) value is stored into a Float32Array
or Float64Array, or into a DataView using the setFloat32
or setFloat64 methods, the bit pattern written into the underlying ArrayBuffer is
not specified, but shall be one of the IEEE 754 bit patterns that represent
NaN [IEEE-754].
When a bit pattern representing an IEEE 754 NaN is loaded from a Float32Array
or Float64Array, or from a DataView using the getFloat32
or getFloat64 methods, the language binding (for example, ECMAScript) may use an
alternative bit pattern to represent the NaN value.
The Web IDL [WEBIDL] and ECMA-262 [ECMA-262] specifications govern all other handling of NaN values, in particular the conversion to 0 when converting NaN to an integer value.
ArrayBuffer TypeThe ArrayBuffer type describes a buffer used to store
data for the array buffer views. An ArrayBuffer has the following methods and properties:
[ Constructor(unsigned long length) ]
interface ArrayBuffer {
readonly attribute unsigned long byteLength;
ArrayBuffer slice(long begin, optional long end);
static boolean isView(any value);
};
ArrayBuffer implements Transferable;
| Constructors | |
|---|---|
|
|
| Properties | |
| unsigned long byteLength |
Read-only property. The length of the ArrayBuffer in bytes, as fixed at construction time. Reading this property returns 0 if this |
| Methods | |
|
|
|
|
ArrayBufferView TypeThe ArrayBufferView type holds information shared among all of the types of views of ArrayBuffers. An ArrayBufferView has the following properties:
[NoInterfaceObject]
interface ArrayBufferView {
readonly attribute ArrayBuffer buffer;
readonly attribute unsigned long byteOffset;
readonly attribute unsigned long byteLength;
};
| Constructors | |
|---|---|
| None | |
| Properties | |
| ArrayBuffer buffer |
Read-only property. The |
| unsigned long byteOffset |
Read-only property. The offset of this Reading this property returns 0 if the referenced |
| unsigned long byteLength |
Read-only property. The length of the Reading this property returns 0 if the referenced |
| Methods | |
| None | |
The typed array view types represent a view of
an ArrayBuffer that allows for indexing and manipulation.
The length of each of these is fixed. Each of the typed array
view types follows the same template.
The following typed array view types are defined by this specification. The size below is given in bytes, and corresponds to the BYTES_PER_ELEMENT constant for the given type.
| Type | Size | Description | Web IDL type | Equivalent C Type |
|---|---|---|---|---|
Int8Array | 1 | 8-bit 2's complement signed integer | byte | signed char |
Uint8Array | 1 | 8-bit unsigned integer | octet | unsigned char |
Uint8ClampedArray | 1 | 8-bit unsigned integer (clamped) | octet | unsigned char |
Int16Array | 2 | 16-bit 2's complement signed integer | short | short |
Uint16Array | 2 | 16-bit unsigned integer | unsigned short | unsigned short |
Int32Array | 4 | 32-bit 2's complement signed integer | long | int |
Uint32Array | 4 | 32-bit unsigned integer | unsigned long | unsigned int |
Float32Array | 4 | 32-bit IEEE floating point | unrestricted float | float |
Float64Array | 8 | 64-bit IEEE floating point | unrestricted double | double |
Each of the typed array types has the following constructors, properties, constants and
methods. In the descriptions below, the generic term TypedArray is used to indicate that
any valid typed array view type is allowed. Uint8ClampedArray is defined in the
next section.
An object array implementing one of the TypedArray
interfaces supports
indexed properties [WEBIDL] with indices in the range 0 ≤ index
< array.length.
Typed array views operating upon multi-byte values use the host computer's endianness when reading or writing values from or to main memory.
[
Constructor(unsigned long length),
Constructor(TypedArray array),
Constructor(type[] array),
Constructor(ArrayBuffer buffer,
optional unsigned long byteOffset, optional unsigned long length)
]
interface TypedArray {
const unsigned long BYTES_PER_ELEMENT = element size in bytes;
readonly attribute unsigned long length;
getter type get(unsigned long index);
setter void set(unsigned long index, type value);
void set(TypedArray array, optional unsigned long offset);
void set(type[] array, optional unsigned long offset);
TypedArray subarray(long begin, optional long end);
};
TypedArray implements ArrayBufferView;
| Constructors | |
|---|---|
|
|
|
|
|
|
| Constants | |
| unsigned long BYTES_PER_ELEMENT | The size in bytes of each element in the array. |
| Properties | |
| unsigned long length |
Read-only property. The length of the TypedArray in elements, as fixed at construction time. Reading this property returns 0 if the referenced |
| Methods | |
|
|
|
|
|
|
|
|
Uint8ClampedArray
Uint8ClampedArray is defined in order to replace CanvasPixelArray. It
behaves identically to the other typed array views, except that the setters and constructor use
clamping [WEBIDL]
rather than modulo arithmetic when converting incoming number values. The IDL
for Uint8ClampedArray follows.
// The 'unsigned byte' type does not currently exist in Web IDL, though
// 'octet' is equivalent.
[
Constructor(unsigned long length),
Constructor(Uint8ClampedArray array),
Constructor(octet[] array),
Constructor(ArrayBuffer buffer,
optional unsigned long byteOffset, optional unsigned long length)
]
interface Uint8ClampedArray {
const unsigned long BYTES_PER_ELEMENT = 1;
readonly attribute unsigned long length;
getter octet get(unsigned long index);
setter void set(unsigned long index, [Clamp] octet value);
void set(Uint8ClampedArray array, optional unsigned long offset);
void set(octet[] array, optional unsigned long offset);
Uint8ClampedArray subarray(long begin, optional long end);
};
Uint8ClampedArray implements ArrayBufferView;
DataView View TypeAn ArrayBuffer is a useful object for representing an arbitrary chunk of data. In many cases, such data will be read from disk or from the network, and will not follow the alignment restrictions that are imposed on the typed array views described earlier. In addition, the data will often be heterogeneous in nature and have a defined byte order. The DataView view provides a low-level interface for reading such data from and writing it to an ArrayBuffer.
Regardless of the host computer's endianness, DataView reads or writes values to or from main memory with a specified endianness: big or little.
[
Constructor(ArrayBuffer buffer,
optional unsigned long byteOffset,
optional unsigned long byteLength)
]
interface DataView {
// Gets the value of the given type at the specified byte offset
// from the start of the view. There is no alignment constraint;
// multi-byte values may be fetched from any offset.
//
// For multi-byte values, the optional littleEndian argument
// indicates whether a big-endian or little-endian value should be
// read. If false or undefined, a big-endian value is read.
//
// These methods raise an exception if they would read
// beyond the end of the view.
byte getInt8(unsigned long byteOffset);
octet getUint8(unsigned long byteOffset);
short getInt16(unsigned long byteOffset,
optional boolean littleEndian);
unsigned short getUint16(unsigned long byteOffset,
optional boolean littleEndian);
long getInt32(unsigned long byteOffset,
optional boolean littleEndian);
unsigned long getUint32(unsigned long byteOffset,
optional boolean littleEndian);
float getFloat32(unsigned long byteOffset,
optional boolean littleEndian);
double getFloat64(unsigned long byteOffset,
optional boolean littleEndian);
// Stores a value of the given type at the specified byte offset
// from the start of the view. There is no alignment constraint;
// multi-byte values may be stored at any offset.
//
// For multi-byte values, the optional littleEndian argument
// indicates whether the value should be stored in big-endian or
// little-endian byte order. If false or undefined, the value is
// stored in big-endian byte order.
//
// These methods raise an exception if they would write
// beyond the end of the view.
void setInt8(unsigned long byteOffset,
byte value);
void setUint8(unsigned long byteOffset,
octet value);
void setInt16(unsigned long byteOffset,
short value,
optional boolean littleEndian);
void setUint16(unsigned long byteOffset,
unsigned short value,
optional boolean littleEndian);
void setInt32(unsigned long byteOffset,
long value,
optional boolean littleEndian);
void setUint32(unsigned long byteOffset,
unsigned long value,
optional boolean littleEndian);
void setFloat32(unsigned long byteOffset,
float value,
optional boolean littleEndian);
void setFloat64(unsigned long byteOffset,
double value,
optional boolean littleEndian);
};
DataView implements ArrayBufferView;
The following constructors, properties, and methods are available on a DataView:
| Constructors | |
|---|---|
|
|
| Properties | |
| None | |
| Methods | |
|
|
|
In order to enable repeated transfer of large amounts of data between Web
Workers [WEBWORKERS], ArrayBuffer implements
the Transferable
interface [HTML]. This section defines the behavior of ArrayBuffers and
views under
the structured
cloning [HTML] and transfer algorithms.
When a user agent is asked to clone an ArrayBuffer object old, it must run the following steps, which return a new ArrayBuffer object. These steps must be run atomically.
ArrayBuffer object new buffer pointing at a copy of the
underlying data from old, and with the same byteLength property.
Add the following to the list of Transferable types:
ArrayBuffer
To transfer [HTML] an ArrayBuffer object old, a user agent must run the following steps.
ArrayBuffer object new buffer pointing at the same
underlying data as old, and with the same byteLength property.
When a user agent is asked to clone an ArrayBufferView object old, it must run the following steps, which return a new object. These steps must be run atomically.
ArrayBufferView subclass of the
same type as old, referring to buffer, and with the
same byteOffset, byteLength, and any subclass-specific properties
as old.
The above sections define the following behavior:
ArrayBuffer is transferred during a postMessage call, then
any ArrayBufferView instances which refer to that ArrayBuffer and
which are cloned during that call refer to the newly allocated ArrayBuffer
object. After the postMessage call, the ArrayBufferView instances
referring to the old ArrayBuffer can no longer be used to reference the
buffer's data. Further attempts to clone them or the old ArrayBuffer will
cause a DATA_CLONE_ERR exception to be thrown.
ArrayBuffer and
any ArrayBufferView instances which refer to it is the same as for any other
object type.
ArrayBuffer type is transferable. It is not possible to transfer
certain ArrayBufferView instances, and clone others, if they refer to the same
underlying ArrayBuffer.
var f32s = new Float32Array(128);
for (var i = 0; i < 128/8; ++i) {
var sub_f32s = f32s.subarray(i, i+8);
for (var j = 0; j < 8; ++j) {
sub_f32s[j] = j;
}
}
Note that this code uses subarray() to create a new Float32Array
that references the same data as the original, so that it can always index the sub-array using 0..7.
Some APIs, in particular WebGL [WEBGL], can benefit from being able to use a single contiguous buffer, with interleaved data types. For example, a point might have coordinate data (3 Float32 values) followed by color data (4 Uint8 values).
For 4 points and their associated colors, this can be set up in the following way:
var elementSize = 3 * Float32Array.BYTES_PER_ELEMENT + 4 * Uint8Array.BYTES_PER_ELEMENT;
var buffer = new ArrayBuffer(4 * elementSize);
var coords = new Float32Array(buffer, 0);
var colors = new Uint8Array(buffer, 3 * Float32Array.BYTES_PER_ELEMENT);
However, typed arrays don't have a way to explicitly encode the desired per-point structure, so some manual arithmetic must be done to correctly index into the right values. Note that the colors Uint8Array view is created with an explicit offset (which is given in bytes), so that the [0] element points at the 13th byte in the underlying buffer.
In this example, each set of packed data is 16 bytes in size (3 4-byte floats followed by 4 bytes). 16 / Float32Array.BYTES_PER_ELEMENT is 4, so from any given Float32 element, to skip to the same Float32 element in the next point, 4 must be added to the index. Similarly, to skip from any given Uint8 element to the same in the next point, 16 must be added:
var coordOffset = elementSize / Float32Array.BYTES_PER_ELEMENT;
var colorOffset = elementSize / Uint8Array.BYTES_PER_ELEMENT;
coords[0] = coords[1] = coords[2] = 1.0; // The first point's three coordinate values
colors[0] = colors[1] = colors[2] = colors[3] = 255; // The first point's four colors
coords[0+N*coordOffset] = 5.0; // The Nth point's first coordinate value
colors[0+N*colorOffset] = 128; // The Nth point's first color value
coords[i+N*coordOffset] = 6.0; // The Nth point's i coordinate value;
colors[j+N*colorOffset] = 200; // The Nth point's j color value
In the above example, note that for keeping the data consistent, i must be one of 0, 1, or 2; and j must be one of 0, 1, 2, or 3. Any higher values will result in data segments that are reserved for 32-bit floats or for 8-bit integers being overwritten with incorrect data.
Another usage similar to the above is allocating one large buffer, and then using different regions of it for different purposes:
var buffer = new ArrayBuffer(1024);
Carve out 128 floats, 128*4 = 512 bytes in size:
var floats = new Float32Array(buffer, 0, 128);
Then 128 shorts, 128*2 = 256 bytes in size, immediately
following the floats. Note that the 512 byte offset
argument is equal to floats.byteOffset + floats.byteLength.
var shorts = new Uint16Array(buffer, 512, 128);
Finally, 256 unsigned bytes. We can write the byte offset in the form suggested above to simplify the chaining. We also let this array extend until the end of the ArrayBuffer by not explicitly specifying a length.
var bytes = new Uint8Array(buffer, shorts.byteOffset + shorts.byteLength);
If the data is no longer needed, the entire 1024-byte array can be repurposed without causing additional allocations simply by creating new views and discarding the old.
DataView have methods for reading arrays of data?While such methods would be useful, they are not present in this version of the specification to reduce the API footprint. Such methods may be added in the future.
The editors would like to thank Erik Arvidsson (Google), Joshua Bell (Linden Lab), Mark Callow (HI), Brendan Eich (Mozilla), Andreas Gal (Mozilla), Daniel Gessel (Apple), Dave Herman (Mozilla), Oliver Hunt (Apple), Tim Johansson (Opera), Vangelis Kokkevis (Google), Chris Marrin (Apple), Glenn Maynard, Cameron McCormack, Shiki Okasaka (Google), Arun Ranganathan (Mozilla), Alex Russell (Google), Gregg Tavares (Google), Ben Vanik (Google), Cedric Vivier (Mozilla), and the members of the WebGL working group for their contributions to this specification.
The editors would especially like to thank Vladimir Vukicevic for co-editing earlier versions of this specification.