summaryrefslogtreecommitdiffstats
path: root/README.md
diff options
context:
space:
mode:
Diffstat (limited to 'README.md')
-rw-r--r--README.md646
1 files changed, 9 insertions, 637 deletions
diff --git a/README.md b/README.md
index 1a29648..4293bbd 100644
--- a/README.md
+++ b/README.md
@@ -1,4 +1,4 @@
-Copyright 2016 IBM
+Copyright 2017 IBM
Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
@@ -12,642 +12,14 @@ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
-## Intro
+## MBOX
-This document describes a protocol for host to BMC communication via the
-mailbox registers present on the Aspeed 2400 and 2500 chips.
-This protocol is specifically designed to allow a host to request and manage
-access to the flash with the specifics of how the host is required to control
-this described below.
+This repo contains the protocol definition for the host to BMC mailbox
+communication specification which can be found in
+Documentation/mbox_procotol.md.
-## Version
+There is also a reference implementation of a BMC mailbox daemon, the details
+of which can be found in Documentation/mboxd.md.
-Both version 1 and version 2 of the protocol are described below with version 2
-specificities represented with V2 in brackets - (V2).
-
-## Problem Overview
-
-"mbox" is the name we use to represent a protocol we have established between
-the host and the BMC via the Aspeed mailbox registers. This protocol is used
-for the host to control the flash.
-
-Prior to the mbox protocol, the host uses a backdoor into the BMC address space
-(the iLPC-to-AHB bridge) to directly manipulate the BMCs own flash controller.
-
-This is not sustainable for a number of reasons. The main ones are:
-
-1. Every piece of the host software stack that needs flash access (HostBoot,
- OCC, OPAL, ...) has to have a complete driver for the flash controller,
- update it on each BMC generation, have all the quirks for all the flash
- chips supported etc... We have 3 copies on the host already in addition to
- the one in the BMC itself.
-
-2. There are serious issues of access conflicts to that controller between the
- host and the BMC.
-
-3. It's very hard to support "BMC reboots" when doing that
-
-4. It's slow
-
-5. Last but probably most important, having that backdoor open is a security
- risk. It means the host can access any address on the BMC internal bus and
- implant malware in the BMC itself. So if the host is a "bare metal" shared
- system in some kind of data center, not only the host flash needs to be
- reflashed when switching from one customer to another, but the entire BMC
- flash too as nothing can be trusted. So we want to disable it.
-
-To address all these, we have implemented a new mechanism that we call mbox.
-
-When using this mechanism, the BMC is solely responsible for directly accessing
-the flash controller. All flash erase and write operations are performed by the
-BMC and the BMC only. (We can allow direct reads from flash under some
-circumstances but we tend to prefer going via memory).
-
-The host uses the mailbox registers to send "commands" to the BMC, which
-responds via the same mechanism. Those commands allow the host to control a
-"window" (which is the LPC -> AHB FW space mapping) that is either a read
-window or a write window onto the flash.
-
-When set for writing, the BMC makes the window point to a chunk of RAM instead.
-When the host "commits" a change (via MBOX), then the BMC can perform the
-actual flashing from the data in the RAM window.
-
-The idea is to have the LPC FW space be routed to an active "window". That
-window can be a read or a write window. The commands allow to control which
-window and which offset into the flash it maps.
-
-* A read window can be a direct window to the flash controller space (ie.
- 0x3000\_0000) or it can be a window to a RAM image of a flash. It doesn't have
- to be the full size of the flash per protocol (commands can be use to "slide"
- it to various parts of the flash) but if its set to map the actual flash
- controller space at 0x3000\_0000, it's probably simpler to make it the full
- flash. The host makes no assumption, it's your choice what to provide. The
- simplest implementation is to just route to the flash read/only.
-
-* A write window has to be a chunk of BMC memory. The minimum size is not
- defined in the spec, but it should be at least one block (4k for now but it
- should support larger block sizes in the future). When the BMC receive the
- command to map the write window at a given offset of the flash, the BMC should
- copy that portion of the flash into a reserved memory buffer, and modify the
- LPC mapping to point to that buffer.
-
-The host can then write to that window directly (updating the BMC memory) and
-send a command to "commit" those updates to flash.
-
-Finally there is a `RESET_STATE`. It's the state in which the bootloader in the
-SEEPROM of the POWER9 chip will find what it needs to load HostBoot. The
-details are still being ironed out: either mapping the full flash read only or
-reset to a "window" that is either at the bottom or top of the flash. The
-current implementation resets to point to the full flash.
-
-## Where is the code?
-
-The mbox userspace is available [on GitHub](https://github.com/openbmc/mboxbridge)
-This is Apache licensed but we are keen to see any enhancements you may have.
-
-The kernel driver is still in the process of being upstreamed but can be found
-in the OpenBMC Linux kernel staging tree:
-
-https://github.com/openbmc/linux/commit/85770a7d1caa6a1fa1a291c33dfe46e05755a2ef
-
-## Building
-
-The autotools of this requires the autoconf-archive package for your
-system
-
-## The Hardware
-
-The Aspeed mailbox consists of 16 (8 bit) data registers see Layout for their
-use. Mailbox interrupt enabling, masking and triggering is done using a pair
-of control registers, one accessible by the host the other by the BMC.
-Interrupts can also be raised per write to each data register, for BMC and
-host. Write tiggered interrupts are configured using two 8 bit registers where
-each bit represents a data register and if an interrupt should fire on write.
-Two 8 bit registers are present to act as a mask for write triggered
-interrupts.
-
-### Layout
-
-```
-Byte 0: COMMAND
-Byte 1: Sequence
-Byte 2-12: Arguments
-Byte 13: Response code
-Byte 14: Host controlled status reg
-Byte 15: BMC controlled status reg
-```
-
-## Low Level Protocol Flow
-
-What we essentially have is a set of registers which either the host or BMC can
-write to in order to communicate to the other which will respond in some way.
-There are 3 basic types of communication.
-
-1. Commands sent from the Host to the BMC
-2. Responses sent from the BMC to the Host in response to commands
-3. Asyncronous events raised by the BMC
-
-### General Use
-
-Messages usually originate from the host to the BMC. There are special
-cases for a back channel for the BMC to pass new information to the
-host which will be discussed later.
-
-To initiate a request the host must set a command code (see
-Commands) into mailbox data register 0. It is also the hosts
-responsibility to generate a unique sequence number into mailbox
-register 1. After this any command specific data should be written
-(see Layout). The host must then generate an interrupt to the BMC by
-using bit 0 of its control register and wait for an interrupt on the
-response register. Generating an interrupt automatically sets bit 7 of the
-corresponding control register. This bit can be used to poll for
-messages.
-
-On receiving an interrupt (or polling on bit 7 of its Control
-Register) the BMC should read the message from the general registers
-of the mailbox and perform the necessary action before responding. On
-responding the BMC must ensure that the sequence number is the same as
-the one in the request from the host. The BMC must also ensure that
-mailbox data regsiter 13 is a valid response code (see Responses). The
-BMC should then use its control register to generate an interrupt for
-the host to notify it of a response.
-
-### Asynchronous BMC to Host Events
-
-BMC to host communication is also possible for notification of events
-from the BMC. This requires that the host have interrupts enabled on
-mailbox data register 15 (or otherwise poll on bit 7 of mailbox status
-register 1). On receiving such a notification the host should read
-mailbox data register 15 to determine the event code which was set by the
-BMC (see BMC Event notifications in Commands for detail). Events which are
-defined as being able to be acknowledged by the host must be with a
-BMC_EVENT_ACK command.
-
-## High Level Protocol Flow
-
-When a host wants to communicate with the BMC via the mbox protocol the first
-thing it should do it call MBOX_GET_INFO in order to establish the protocol
-version which each understands. Before this the only other commands which are
-allowed are RESET_STATE and BMC_EVENT_ACK.
-
-After this the host can open and close windows with the CREATE_READ_WINDOW,
-CREATE_WRITE_WINDOW and CLOSE_WINDOW commands. Creating a window is how the
-host requests access to a section of flash. It is worth noting that the host
-can only ever have one window that it is accessing at a time - hence forth
-referred to as the active window.
-
-When the active window is a write window the host can perform MARK_WRITE_DIRTY,
-MARK_WRITE_ERASED and WRITE_FLUSH commands to identify changed blocks and
-control when the changed blocks are written to flash.
-
-Independently, and at any point not during an existing mbox command
-transaction, the BMC may raise raise asynchronous events with the host to
-communicate a change in state.
-
-### Version Negotiation
-
-Given that a majority of command and response arguments are specified as a
-multiple of block size it is necessary for the host and BMC to agree on a
-protocol version as this determines the block size. In V1 it is hard coded at
-4K and in V2 the BMC chooses and specifies this to the host as a response
-argument to MBOX_GET_INFO. Thus the host must always call MBOX_GET_INFO before
-any other command which specifies an argument in block size.
-
-The host must tell the BMC the highest protocol level which it supports. The
-BMC will then respond with a protocol level. If the host doesn't understand
-the protocol level specified by the BMC then it must not continue to
-communicate with the BMC. Otherwise the protocol level specified by the
-BMC is taken to be the protocol level used for further communication and can
-only be changed by another call to MBOX_GET_INFO. The BMC should use the
-request from the host to influence its protocol version choice.
-
-### Window Management
-
-In order to access flash contents the host must request a window be opened at
-the flash offset it would like to access. The host may give a hint as to how
-much data it would like to access or otherwise set this argument to zero. The
-BMC must respond with the lpc bus address to access this window and the
-window size. The host must not access past the end of the active window.
-
-There is only ever one active window which is the window created by the most
-recent CREATE_READ_WINDOW or CREATE_WRITE_WINDOW call which succeeded. Even
-though there are two types of windows there can still only be one active window
-irrespective of type. A host must not write to a read window. A host may read
-from a write window and the BMC must guarantee that the window reflects what
-the host has written there.
-
-A window can be closed by calling CLOSE_WINDOW in which case there is no active
-window and the host must not access the LPC window after it has been closed.
-If the host closes an active write window then the BMC must perform an
-implicit flush. If the host tries to open a new window with an already active
-window then the active window is closed (and implicitly flushed if it was a
-write window). If the new window is successfully opened then it is the new
-active window, if the command fails then there is no active window and the
-previous active window must no longer be accessed.
-
-The host must not access an lpc address other than that which is contained by
-the active window. The host must not use write management functions (see below)
-if the active window is a read window or if there is no active window.
-
-### Write Management
-
-The BMC has no method for intercepting writes that occur over the LPC bus. Thus
-the host must explicitly notify the BMC of where and when a write has
-occured. The host must use the MARK_WRITE_DIRTY command to tell the BMC where
-within the write window it has modified. The host may also use the
-MARK_WRITE_ERASED command to erase large parts of the active window without the
-need to write 0xFF. The BMC must ensure that if the host
-reads from an area it has erased that the read values are 0xFF. Any part of the
-active window marked dirty/erased is only marked for the lifetime of the current
-active write window and does not persist if the active window is closed either
-implicitly or explicitly by the host or the BMC. The BMC may at any time
-or must on a call to WRITE_FLUSH flush the changes which it has been notified
-of back to the flash, at which point the dirty or erased marking is cleared
-for the active window. The host must not assume that any changes have been
-written to flash unless an explicit flush call was successful, a close of an
-active write window was successful or a create window command with an active
-write window was successful - otherwise consistency between the flash and memory
-contents cannot be guaranteed.
-
-The host is not required to perform an erase before a write command and the
-BMC must ensure that a write performs as expected - that is if an erase is
-required before a write then the BMC must perform this itself.
-
-### BMC Events
-
-The BMC can raise events with the host asynchronously to communicate to the
-host a change in state which it should take notice of. The host must (if
-possible for the given event) acknowledge it to inform the BMC it has been
-received.
-
-If the BMC raises a BMC Reboot event then the host must renegotiate the
-protocol version so that both the BMC and the host agree on the block size.
-A BMC Reboot event implies a BMC Windows Reset event.
-If the BMC raises a BMC Windows Reset event then the host must
-assume that there is no longer an active window - that is if there was an
-active window it has been closed by the BMC and if it was a write window
-then the host must not assume that it was flushed unless a previous explicit
-flush call was successful.
-
-The BMC may at some points require access to the flash and the BMC daemon must
-set the BMC Flash Control Lost event when the BMC is accessing the flash behind
-the BMC daemons back. When this event is set the host must assume that the
-contents of the active window could be inconsistent with the contents of flash.
-
-## Protocol Definition
-
-### Commands
-
-```
-RESET_STATE 0x01
-GET_MBOX_INFO 0x02
-GET_FLASH_INFO 0x03
-CREATE_READ_WINDOW 0x04
-CLOSE_WINDOW 0x05
-CREATE_WRITE_WINDOW 0x06
-MARK_WRITE_DIRTY 0x07
-WRITE_FLUSH 0x08
-BMC_EVENT_ACK 0x09
-MARK_WRITE_ERASED 0x0a (V2)
-```
-
-### Sequence
-
-The host must ensure a unique sequence number at the start of a
-command/response pair. The BMC must ensure the responses to
-a particular message contain the same sequence number that was in the
-command request from the host.
-
-### Responses
-
-```
-SUCCESS 1
-PARAM_ERROR 2
-WRITE_ERROR 3
-SYSTEM_ERROR 4
-TIMEOUT 5
-BUSY 6 (V2)
-WINDOW_ERROR 7 (V2)
-```
-
-#### Description:
-
-SUCCESS - Command completed successfully
-
-PARAM_ERROR - Error with parameters supplied or command invalid
-
-WRITE_ERROR - Error writing to the backing file system
-
-SYSTEM_ERROR - Error in BMC performing system action
-
-TIMEOUT - Timeout in performing action
-
-BUSY - Daemon in suspended state (currently unable to access flash)
- - Retry again later
-
-WINDOW_ERROR - Command not valid for active window or no active window
- - Try opening an appropriate window and retrying the command
-
-### Information
-- All multibyte messages are LSB first (little endian)
-- All responses must have a valid return code in byte 13
-
-
-### Commands in detail
-
-Note in V1 block size is hard coded to 4K, in V2 it is variable and must be
-queried with GET_MBOX_INFO.
-Sizes and addresses are specified in either bytes - (bytes)
- or blocks - (blocks)
-Sizes and addresses specified in blocks must be converted to bytes by
-multiplying by the block size.
-```
-Command:
- RESET_STATE
- Implemented in Versions:
- V1, V2
- Arguments:
- -
- Response:
- -
- Notes:
- This command is designed to inform the BMC that it should put
- host LPC mapping back in a state where the SBE will be able to
- use it. Currently this means pointing back to BMC flash
- pre mailbox protocol. Final behavour is still TBD.
-
-Command:
- GET_MBOX_INFO
- Implemented in Versions:
- V1, V2
- Arguments:
- V1:
- Args 0: API version
-
- V2:
- Args 0: API version
-
- Response:
- V1:
- Args 0: API version
- Args 1-2: default read window size (blocks)
- Args 3-4: default write window size (blocks)
-
- V2:
- Args 0: API version
- Args 1-2: reserved
- Args 3-4: reserved
- Args 5: Block size as power of two (encoded as a shift)
-
-Command:
- GET_FLASH_INFO
- Implemented in Versions:
- V1, V2
- Arguments:
- -
- Response:
- V1:
- Args 0-3: Flash size (bytes)
- Args 4-7: Erase granule (bytes)
-
- V2:
- Args 0-1: Flash size (blocks)
- Args 2-3: Erase granule (blocks)
-
-Command:
- CREATE_{READ/WRITE}_WINDOW
- Implemented in Versions:
- V1, V2
- Arguments:
- V1:
- Args 0-1: Window location as offset into flash (blocks)
-
- V2:
- Args 0-1: Window location as offset into flash (blocks)
- Args 2-3: Requested window size (blocks)
-
- Response:
- V1:
- Args 0-1: LPC bus address of window (blocks)
-
- V2:
- Args 0-1: LPC bus address of window (blocks)
- Args 2-3: Actual window size (blocks)
- Args 4-5: Actual window location as offset into flash (blocks)
- Notes:
- Window location is always given as an offset into flash as
- taken from the start of flash - that is it is an absolute
- address.
-
- LPC bus address is always given from the start of the LPC
- address space - that is it is an absolute address.
-
- The requested window size is only a hint. The response
- indicates the actual size of the window. The BMC may
- want to use the requested size to pre-load the remainder
- of the request. The host must not access past the end of the
- active window.
-
- The actual window location indicates the absolute flash offset
- that the window actually maps and is not required to be equal
- to the flash offset requested by the host, but however must be
- less than or equal to it. Thus the first block of the window at
- the lpc address in the response will map the first block at the
- actual flash offset also contained in the response. It is the
- responsibility of the host to use this information to access
- any offset which is required.
-
- The requested window size may be zero. In this case the
- BMC is free to create any sized window but it must contain
- atleast the first block of data requested by the host. A large
- window is of course preferred and should correspond to
- the default size returned in the GET_MBOX_INFO command.
-
- If this command returns successfully then the window which the
- host requested is the active window. If it fails then there is
- no active window.
-
-Command:
- CLOSE_WINDOW
- Implemented in Versions:
- V1, V2
- Arguments:
- V1:
- -
-
- V2:
- Args 0: Flags
- Response:
- -
- Notes:
- Closes the active window. Any further access to the LPC bus
- address specified to address the previously active window will
- have undefined effects. If the active window is a
- write window then the BMC must perform an implicit flush.
-
- The Flags argument allows the host to provide some
- hints to the BMC. Defined Values:
- 0x01 - Short Lifetime:
- The window is unlikely to be accessed
- anytime again in the near future. The effect of
- this will depend on BMC implementation. In
- the event that the BMC performs some caching
- the BMC daemon could mark data contained in a
- window closed with this flag as first to be
- evicted from the cache.
-
-Command:
- MARK_WRITE_DIRTY
- Implemented in Versions:
- V1, V2
- Arguments:
- V1:
- Args 0-1: Flash offset to mark from base of flash (blocks)
- Args 2-5: Number to mark dirty at offset (bytes)
-
- V2:
- Args 0-1: Window offset to mark (blocks)
- Args 2-3: Number to mark dirty at offset (blocks)
-
- Response:
- -
- Notes:
- The BMC has no method for intercepting writes that
- occur over the LPC bus. The host must explicitly notify
- the daemon of where and when a write has occured so it
- can be flushed to backing storage.
-
- Offsets are given as an absolute (either into flash (V1) or the
- active window (V2)) and a zero offset refers to the first
- block. If the offset + number exceeds the size of the active
- window then the command must not succeed.
-
-Command
- WRITE_FLUSH
- Implemented in Versions:
- V1, V2
- Arguments:
- V1:
- Args 0-1: Flash offset to mark from base of flash (blocks)
- Args 2-5: Number to mark dirty at offset (bytes)
-
- V2:
- -
-
- Response:
- -
- Notes:
- Flushes any dirty/erased blocks in the active window to
- the backing storage.
-
- In V1 this can also be used to mark parts of the flash
- dirty and flush in a single command. In V2 the explicit
- mark dirty command must be used before a call to flush
- since there are no longer any arguments. If the offset + number
- exceeds the size of the active window then the command must not
- succeed.
-
-
-Command:
- BMC_EVENT_ACK
- Implemented in Versions:
- V1, V2
- Arguments:
- Args 0: Bits in the BMC status byte (mailbox data
- register 15) to ack
- Response:
- *clears the bits in mailbox data register 15*
- Notes:
- The host should use this command to acknowledge BMC events
- supplied in mailbox register 15.
-
-Command:
- MARK_WRITE_ERASED
- Implemented in Versions:
- V2
- Arguments:
- V2:
- Args 0-1: Window offset to erase (blocks)
- Args 2-3: Number to erase at offset (blocks)
- Response:
- -
- Notes:
- This command allows the host to erase a large area
- without the need to individually write 0xFF
- repetitively.
-
- Offset is the offset within the active window to start erasing
- from (zero refers to the first block of the active window) and
- number is the number of blocks of the active window to erase
- starting at offset. If the offset + number exceeds the size of
- the active window then the command must not succeed.
-```
-
-### BMC Events in Detail:
-
-If the BMC needs to tell the host something then it simply
-writes to Byte 15. The host should have interrupts enabled
-on that register, or otherwise be polling it.
-
-#### Bit Definitions:
-
-Events which should be ACKed:
-```
-0x01: BMC Reboot
-0x02: BMC Windows Reset (V2)
-```
-
-Events which cannot be ACKed (BMC will clear when no longer
-applicable):
-```
-0x40: BMC Flash Control Lost (V2)
-0x80: BMC MBOX Daemon Ready (V2)
-```
-
-#### Event Description:
-
-Events which must be ACKed:
-The host should acknowledge these events with BMC_EVENT_ACK to
-let the BMC know that they have been received and understood.
-```
-0x01 - BMC Reboot:
- Used to inform the host that a BMC reboot has occured.
- The host must perform protocol verison negotiation again and
- must assume it has no active window. The host must not assume
- that any commands which didn't respond as such succeeded.
-0x02 - BMC Windows Reset: (V2)
- The host must assume that its active window has been closed and
- that it no longer has an active window. The host is not
- required to perform protocol version negotiation again. The
- host must not assume that any commands which didn't respond as such
- succeeded.
-```
-
-Events which cannot be ACKed:
-These events cannot be acknowledged by the host and a call to
-BMC_EVENT_ACK with these bits set will have no effect. The BMC
-will clear these bits when they are no longer applicable.
-```
-0x40 - BMC Flash Control Lost: (V2)
- The BMC daemon has been suspended and thus no longer
- controls access to the flash (most likely because some
- other process on the BMC required direct access to the
- flash and has suspended the BMC daemon to preclude
- concurrent access).
- The BMC daemon must clear this bit itself when it regains
- control of the flash (the host isn't able to clear it
- through an acknowledge command).
- The host must not assume that the contents of the active window
- correctly reflect the contents of flash while this bit is set.
-0x80 - BMC MBOX Daemon Ready: (V2)
- Used to inform the host that the BMC daemon is ready to
- accept command requests. The host isn't able to clear
- this bit through an acknowledge command, the BMC daemon must
- clear it before it terminates (assuming it didn't
- terminate unexpectedly).
- The host should not expect a response while this bit is
- not set.
- Note that this bit being set is not a guarantee that the BMC daemon
- will respond as it or the BMC may have crashed without clearing
- it.
-```
+Finally there is also an implementation of a mailbox daemon control program, the
+details of which can be found in Documentation/mboxctl.md.
OpenPOWER on IntegriCloud