Some user agents have music devices, such as synthesizers, keyboard and other controllers, and drum machines connected to their host computer or device. The widely adopted Musical Instrument Digital Interface (MIDI) protocol enables electronic musical instruments, controllers and computers to communicate and synchronize with each other. MIDI does not transmit audio signals: instead, it sends event messages about musical notes, controller signals for parameters such as volume, vibrato and panning, cues and clock signals to set the tempo, and system-specific MIDI communications (e.g. to remotely store synthesizer-specific patch data). This same protocol has become a standard for non-musical uses, such as show control, lighting and special effects control.
This specification defines an API supporting the MIDI protocol, enabling web applications to enumerate and select MIDI input and output devices on the client system and send and receive MIDI messages. It is intended to enable non-music MIDI applications as well as music ones, by providing low-level access to the MIDI devices available on the users' systems. The Web MIDI API is not intended to describe music or controller inputs semantically; it is designed to expose the mechanics of MIDI input and output interfaces, and the practical aspects of sending and receiving MIDI messages, without identifying what those actions might mean semantically (e.g., in terms of "modulate the vibrato by 20Hz" or "play a G#7 chord", other than in terms of changing a controller value or sending a set of note-on messages that happen to represent a G#7 chord).
To some users, "MIDI" has become synonymous with Standard MIDI Files and General MIDI. That is not the intent of this API; the use case of simply playing back a .SMF file is not within the purview of this specification (it could be considered a different format to be supported by the HTML5 <audio>
element, for example). The Web MIDI API is intended to enable direct access to devices that respond to MIDI - external synthesizers or lighting systems, for example, or even the software synthesizers that are built in to many common operating systems. The Web MIDI API is also explicitly designed to enable a new class of applications on the web that can respond to MIDI controller inputs - using external hardware controllers with physical buttons, knobs and sliders (as well as musical controllers like keyboard, guitar or wind instrument controllers) to control web applications.
The Web MIDI API is also expected to be used in conjunction with other APIs and elements of the web platform, notably the Web Audio API. This API is also intended to be familiar to users of MIDI APIs on other systems, such as Apple's CoreMIDI and Microsoft's Windows MIDI API.
The Web MIDI API specification defines a means for web developers to enumerate, manipulate and access MIDI devices - for example interfaces that may provide hardware MIDI ports with other devices plugged in to them and USB devices that support the USB-MIDI specification. Having a Web API for MIDI enables web applications that use existing software and hardware synthesizers, hardware music controllers and light systems and other mechanical apparatus controlled by MIDI. This API has been defined with this wide variety of use cases in mind.
The approaches taken by this API are similar to those taken in Apple's CoreMIDI API and Microsoft's Windows MIDI API; that is, the API is designed to represent the low-level software protocol of MIDI, in order to enable developers to build powerful MIDI software on top. The API enables the developer to enumerate input and output interfaces, and send and receive MIDI messages, but (similar to the aforementioned APIs) it does not attempt to semantically define or interpret MIDI messages beyond what is necessary to robustly support current devices.
The Web MIDI API is not intended to directly implement high-level concepts such as sequencing; it does not directly support Standard MIDI Files, for example, although a Standard MIDI File player can be built on top of the Web MIDI API. It is also not intended to semantically capture patches or controller assignments, as General MIDI does; such interpretation is outside the scope of the Web MIDI API (though again, General MIDI can easily be utilized through the Web MIDI API).
This specification defines conformance criteria that apply to a single product: the user agent that implements the interfaces that it contains.
Implementations that use ECMAScript to implement the APIs defined in this specification MUST implement them in a manner consistent with the ECMAScript Bindings defined in the Web IDL specification [[!WEBIDL]], as this specification uses that specification and terminology.
The concepts queue a task and fires a simple event are defined in [[!HTML5]].
The terms
event handlers and
event handler event types and corresponding EventHandler
interface are defined in [[!HTML5]].
The Uint8Array interface is defined in [[!TYPED-ARRAYS]].
The term octet is defined in [[!WEBIDL]].
The Event interface is defined in [[!DOM4]].
The DOMHighResTimeStamp interface is defined in [[!HIGHRES-TIME]].
The terms MIDI, MIDI device, MIDI input port, MIDI output port, MIDI interface, MIDI message, MIDI System Real-Time message and system exclusive are defined in [[!MIDI]].
The Promise interface is currently defined in the WHATWG DOM specification.
When invoked, returns a Promise object representing a request for access to MIDI devices on the user's system.
Requesting MIDI access SHOULD prompt the user for access to MIDI devices,
particularly if system exclusive access is requested. In some
scenarios, this permission may have already been implicitly or
explicitly granted, in which case this prompt may not appear.
If the user gives express permission or the call is otherwise
approved, the vended Promise's resolveCallback is invoked, as a MIDISuccessCallback
(i.e., with a
MIDIAccess
object and a MIDIOptions
object as its arguments. The
underlying system may choose to allow the user to select
specific MIDI interfaces to expose to this API (i.e. pick
and choose interfaces on an individual basis), although
this is not required. The system may also choose to prompt
(or not) based on whether system exclusive support is
requested, as system exclusive has greater privacy and
security implications.
If the user declines or the call is denied for any other reason, the Promise's
rejectCallback (if any) is invoked as a MIDIErrorCallback
.
When the
requestMIDIAccess
method is called, the user
agent MUST run the algorithm to request MIDI Access:
Let promise be a new Promise object and resolver be its associated resolver.
Return promise and run the following steps asynchronously.
Optionally, e.g. based on a previously-established user preference, for security reasons, or due to platform limitations, jump to the step labeled failure below.
Optionally, e.g. based on a previously-established user preference, jump to the step labeled success below.
Prompt the user in a user-agent-specific manner for
permission to provide the entry script's origin with a
MIDIAccess
object representing
control over user's MIDI devices. This prompt may
be contingent upon whether system exclusive support was
requested, and may allow the user to enable or disable
that access.
If permission is denied, jump to the step labeled failure below. If the user never responds, this algorithm will never progress beyond this step. If permission is granted, continue the following steps.
success: Let access be a new MIDIAccess
object. (It is possible to call requestMIDIAccess() multiple times; this may prompt the user multiple times, so it may not be best practice, and the same instance of MIDIAccess will not be returned each time.)
Call resolver's accept(value)
method with access as value argument.
Terminate these steps.
failure: Let error be a new DOMError
.
This should be of type "SecurityError"
if the
user or their security settings denied the application from creating a MIDIAccess
instance with the requested options, or otherwise it should be of type "NotSupportedError"
.
Call resolver's reject(value)
method with error as value argument.
This dictionary contains optional settings that may be provided to the requestMIDIAccess request.
This member informs the system whether the ability to send and receive system exclusive messages is requested or allowed on a given MIDIAccess object. On the option passed to requestMIDIAccess
, if this member is set to true, but system exclusive support is denied (either by policy or by user action), the access request will fail with a "SecurityError"
error. If this support is not requested (and allowed), the system will throw exceptions if the user tries to send system exclusive messages, and will silently mask out any system exclusive messages received on the port.
In the options
parameter passed to the resolveCallback, this member indicates whether system exclusive is allowed on the MIDIAccess.
A MIDIAccess
object created to provide
script access to the user's MIDI devices. This object is used
to enumerate and obtain access to individual MIDI devices.
Note: The term "MIDI device" in this specification refers to a MIDI interface available to the host system; for example, if a hardware MIDI adapter is connected to the host system, it will be enumerated as a single device, even if several MIDI-supporting devices (such as synthesizers or drum machines) are plugged into hardware MIDI ports on the adapter.
This parameter describes the options enabled on this MIDIAccess object.
DOMError
object representing the reason
why creating getting a MIDIAccess
failed.
This interface provides the methods to list MIDI input and output devices, and obtain access to an individual device.
Returns a list of the MIDI input ports available on the system. This list is not a live list (i.e., if a device is added or removed after calling getInputs(), the list is not changed.)
Returns a list of the MIDI output ports available on the system. This list is not a live list (i.e., if a device is added or removed after calling getOutputs(), the list is not changed.)
The handler called when a new port is connected.
This event handler, of type MIDIConnectionEvent
,
MUST be supported by all objects implementing the
MIDIAccess
interface.
The handler called when a previously-available port is disconnected.
This event handler, of type MIDIConnectionEvent
,
MUST be supported by all objects implementing the
MIDIAccess
interface.
Whenever a previously unavailable MIDI port becomes available for use, the user agent SHOULD run the following steps:
Let port
be the MIDIPort
corresponding to the newly-available port.
Let event
be a newly constructed
MIDIConnectionEvent
, with the port
attribute set to the port.
Fire an event named connect
at the MIDIAccess
, using the event
as the event object.
Whenever a previously available MIDI port becomes unavailable for use, the user agent SHOULD run the following steps:
Let port
be the MIDIPort
corresponding to the newly-unavailable port.
Let event
be a newly constructed
MIDIConnectionEvent
, with the port
attribute set to the port.
Fire an event named disconnect
at the MIDIAccess
, using the event
as the event object.
This interface represents a MIDI input or output port.
A unique ID of the port. This can be used by developers to
remember ports the user has chosen for their application. The
User Agent MUST ensure that the id
is unique to only that port. The User Agent SHOULD ensure that
the id is maintained across instances of the
application - e.g., when the system is rebooted - and when a
device is removed from the system. Applications may want to
cache these ids locally to re-create a MIDI setup.
Some systems may not support completely unique persistent
identifiers; in such cases, it will be more challenging to
maintain identifiers when another interface is added or removed
from the system. (This might throw off the index of the
requested port.) It is expected that the system will do the
best it can to match a port across instances of the MIDI API:
for example, storing the port interface manufacturer, name and
index in the id, and attempting to find any ports with
that name to consider as a match. Applications may use the
comparison of id of MIDIPorts to test for equality.
The manufacturer of the port.
The system name of the port.
A descriptor property to distinguish whether the port is an
input or an output port.
For MIDIOutput
,
this MUST be "output"
.
For MIDIInput
,
this MUST be "input"
.
The version of the port.
The handler called when a previously-available port is disconnected.
This event handler, of type disconnect
,
MUST be supported by all objects implementing
MIDIPort
interface.
Whenever the MIDI port corresponding to the
MIDIPort
becomes unavailable for use, the user agent SHOULD
run the following steps:
Let port
be the MIDIPort
.
Let event
be a newly constructed
MIDIConnectionEvent
, with the port
attribute set to the port.
Fire an event named disconnect
at the port
, using the event
as the event object.
This event handler, of type MIDIMessage
,
MUST be supported by all objects implementing
MIDIInput
interface.
Whenever the MIDI port corresponding to the
MIDIInput
finishes receiving one or more MIDI messages, the user agent MUST
run the following steps:
Let port
be the MIDIInput
.
If the MIDIAccess
did not enable system exclusive access, and the message is a system exclusive message, abort this process.
Let event
be a newly constructed
MIDIMessageEvent
, with the timestamp
attribute set to the time the message was received by the system, and
with the data
attribute set to a Uint8Array of MIDI data
bytes representing a single MIDI message.
Fire an event named midimessage
at the port
, using the event
as the event object.
It is specifically noted that MIDI System Real-Time Messages may actually occur in the middle of system exclusive messages in the input stream; in this case, the System Real-Time messages will be dispatched as they occur, while the system exclusive message will be buffered until it is complete (and then dispatched).
Enqueues the message to be sent to the corresponding MIDI port. The underlying implementation will (if necessary) coerce each member of the sequence to an unsigned 8-bit integer. The use of sequence rather than a Uint8Array enables developers to use the convenience of output.send( [ 0x90, 0x45, 0x7f ] );
rather than having to create a Uint8Array, e.g. output.send( new Uint8Array( [ 0x90, 0x45, 0x7f ] ) );
- while still enabling use of Uint8Arrays for efficiency in large MIDI data scenarios (e.g. reading Standard MIDI Files and sending sysex messages).
The data contains one or more valid, complete MIDI messages. Running status is not allowed in the data, as underlying systems may not support it.
If the port is disconnected, throw an InvalidStateError
exception.
If data is not a valid sequence or does not contain a valid MIDI message, throw a TypeError
exception.
If data is a system exclusive message, and the MIDIAccess
did not enable system exclusive access, throw an InvalidAccessError
exception.
timestamp
is not present or is set to zero (or another time in the past), the data is to be sent as soon as possible.
An event object implementing this interface is passed to a MIDIInput's onmidimessage handler when MIDI messages are received.
A DOMHighResTimeStamp
specifying when the event occurred.
The DOM4 Event object has a timeStamp member in the event object that will be filled out with the current time, but that it is lower precision (DOMTimeStamp is defined as an integer number of milliseconds), has a different zero reference (DOMTimeSTamp is the number of milliseconds that has passed since 00:00:00 UTC on 1 January 1970), and therefore is less suitable for MIDI applications.
A Uint8Array containing the MIDI data bytes of a single MIDI message.
A DOMHighResTimeStamp
specifying when the event occurred.
A Uint8Array containing the MIDI data bytes of a single MIDI message.
An event object implementing this interface is passed to a MIDIAccess' ondisconnect handler, and (if present) to any MIDIPorts referencing the port when a previously-available port becomes unavailable (for example, when a MIDI interface is disconnected), and is also passed to a MIDIAccess' onconnect handler when a new port becomes available (for example, when a MIDI interface that has been disconnected is plugged in to the computer).
Connection events are not required to be dispatched, as some underlying systems do not support them as notification events; or, such systems may have long time delays as they poll for new devices infrequently.
The port that has been connected or disconnected.
The port that has been connected or disconnected.
The following are some examples of common MIDI usage in JavaScript.
This example shows how to request access to the MIDI system.
var midi = null; // global MIDIAccess object function onMIDISuccess( midiAccess ) { console.log( "MIDI ready!" ); midi = midiAccess; // store in the global (in real usage, would probably keep in an object instance) } function onMIDIFailure(msg) { console.log( "Failed to get MIDI access - " + msg ); } navigator.requestMIDIAccess().then( onMIDISuccess, onMIDIFailure );
This example shows how to request access to the MIDI system, including the ability to send and receive system exclusive messages.
var midi = null; // global MIDIAccess object function onMIDISuccess( midiAccess ) { console.log( "MIDI ready!" ); midi = midiAccess; // store in the global (in real usage, would probably keep in an object instance) } function onMIDIFailure(msg) { console.log( "Failed to get MIDI access - " + msg ); } navigator.requestMIDIAccess( { sysexEnabled: true } ).then( onMIDISuccess, onMIDIFailure );
This example gets the list of the input and output ports and prints their information to the console log.
function listInputsAndOutputs( midiAccess ) { var inputs = midiAccess.getInputs(); var outputs = midiAccess.getOutputs(); var i; for (i=0; i<inputs.length; i++) { console.log( "Input port #" + i + ": type:'" + inputs[i].type + "' id:'" + inputs[i].id + "' manufacturer:'" + inputs[i].manufacturer + "' name:'" + inputs[i].name + "' version:'" + inputs[i].version + "'" ); } for (i=0; i<outputs.length; i++) { console.log( "Output port #" + i + ": type:'" + outputs[i].type + "' id:'" + outputs[i].id + "' manufacturer:'" + outputs[i].manufacturer + "' name:'" + outputs[i].name + "' version:'" + outputs[i].version + "'" ); } }
This example prints incoming MIDI messages on a supplied input port to the console log.
function onMIDIMessage( event ) { var str = "MIDI message received at timestamp " + event.timestamp + "[" + event.data.length + " bytes]: "; for (var i=0; i<event.data.length; i++) { str += "0x" + event.data[i].toString(16) + " "; } console.log( str ); } function startLoggingMIDIInput( midiAccess, indexOfPort ) { var input = midiAccess.getInput( indexOfPort ); input.onmidimessage = onMIDIMessage; }
This example sends a middle C note on message immediately on MIDI channel 1 (MIDI channels are 0-indexed, but generally referred to as channels 1-16), and queues a corresponding note off message for 1 second later.
function sendMiddleC( midiAccess, indexOfPort ) { var noteOnMessage = [0x90, 60, 0x7f]; // note on, middle C, full velocity var output = midiAccess.getOutput( indexOfPort ); output.send( noteOnMessage ); //omitting the timestamp means send immediately. output.send( [0x80, 60, 0x40], window.performance.now() + 1000.0 ); // Inlined array creation- note off, middle C, // release velocity = 64, timestamp = now + 1000ms. }
This example loops all input messages on the first input port to the first output port - including system exclusive messages.
var midi = null; // global MIDIAccess object var output = null; function echoMIDIMessage( event ) { if (output) { output.send( event.data, event.timestamp ); } } function onMIDISuccess( midiAccess ) { console.log( "MIDI ready!" ); try { var input = midiAccess.getInput( 0 ); input.onmidimessage = echoMIDIMessage; output = midiAccess.getOutput( 0 ); } catch (e) { console.error("Exception! Couldn't get i/o ports." + e ); } } function onMIDIFailure(msg) { console.log( "Failed to get MIDI access - " + msg ); } navigator.requestMIDIAccess().then( onMIDISuccess, onMIDIFailure );
There are two primary security and privacy concerns with adding the Web MIDI API to the web platform:
Few systems will have significant numbers of MIDI devices attached; those systems that do will typically use hardware MIDI interfaces, not fanning out a dozen USB-MIDI connections through USB hubs. In this case, of course, enumerating the MIDI “devices” will only see the hardware MIDI interface(s), not the synthesizers, samplers, etc. plugged into it on the other side. Given the few number of devices plugged in, the amount of information exposed here is fairly symmetric with the fingerprinting concern exposed by other APIs such as the Gamepad API. The vast majority of systems have relatively few MIDI interfaces attached.
In brief, the general categories of things you can do with MIDI ports are:
The impact of each of these is:
It's also useful to examine what scenarios are enabled by MIDI, mapped against these features:
In short: the additional fingerprinting exposure of enumerating MIDI devices is directly analogous to the Gamepad API’s additional fingerprinting exposure through gamepad enumeration; typical users will only have at most a few devices connected, their configuration may change, and the information exposed is about the interface itself (i.e., no user-configured data).
The additional security concern for receiving short messages is also small - it’s analogous to listening to keyboard, mouse, mobile/laptop accelerometer, touch input or gamepad events; there is no additional information exposed, and all messages other than clock signals must be initiated by the user.
The additional concerns about sending short messages are analogous to any audio output - you cannot overwrite user information or expose use information, but you can make sounds happen, change patches, or (in rare configurations) toggle lights - but non-destructively, and not persistently.
System Exclusive, on the other hand, has a much less bounded potential, and it seems that distinguishing requests for SysEx separately in the API is a good idea, in order to more carefully provide user security hooks. The suggested security model explicitly allows user agents to require the user's approval before giving access to MIDI devices, although it is not currently required to prompt the user for this approval - but it also detailed that system exclusive support must be requested as part of that request.