From Wikipedia, the free encyclopedia
Audio Video Bridging (AVB) is a common name for the set of standards in development by the IEEE 802.1 Audio Video Bridging Task Group[1]. The charter of this organization is to "provide the specifications that will allow time-synchronized low latency streaming services through IEEE 802 networks". Currently, these consist of:
IEEE P802.1Qat and P802.1Qav are amendments to the base IEEE 802.1Q document, which specifies the operation of "Virtual Bridged Local Area Networks", which are implemented by network devices typically (and not very accurately) called "Ethernet switches".
To help ensure interoperability between devices that implement the AVB standards, AVnu Alliance develops device certification for the automotive, consumer, and professional A/V markets.
Historical background

There have been several attempts to get around these problems:
The specialized A/V technologies for professional, home, and automotive use were too specialized, in that they did not have any easy kind of interoperability with regular IT networks like Ethernet. This limited their market to those applications that needed those particular services, like video cameras or professional audio equipment. The adaptations of standard IT networks had the opposite problem: they were built from commodity equipment, but getting a professional level of service required very tight control over how the equipment was used and managed. The introduction of even one “unmanaged” device could cause the whole network to fail.
Requirements for A/V streaming
The problems with existing approaches
Almost all current network equipment is based on information technology requirements: move the data through the network as quickly as possible with minimum cost and minimal management. This is an excellent approach as long as there are no hard limits on delay or synchronization requirements. IT-oriented networks do not always, however, meet the requirements of the previous section:
The typical way these last two problems are handled today is with buffering ... but excessive buffering can cause delays that are annoying in the consumer environment, and completely unacceptable in a professional one. Another way to allow existing IT-oriented networks to be used for A/V streams is to “manage” the network at a higher layer or to impose very strictly defined, inflexible configurations. For example, in the professional market, there are a few systems in place that can provide adequate delays and guaranteed bandwidth, but they require a single proprietary solution, are initially configured using a special “system generation” program, and need to be reconfigured every time a new device is added. CobraNet is an example of this kind of architecture.
IEEE standardization
Several years ago, an effort was started within the IEEE 802.3 (Ethernet) working group to define a “residential Ethernet” which would directly address the challenges of A/V streaming. It quickly moved over to the IEEE 802.1 working group, since that is where all the major work needed to be done, and because that group is responsible for all the “cross network” bridging specifications. In particular, the group wanted to ensure that the technology was scalable from consumer applications (home/auto) to very high professional standards.
Summary of Audio Video Bridging

An “Audio Video Bridging” network is one that implements a set of protocols being developed by the IEEE 802.1 Audio/Video Bridging Task Group. There are four primary differences between the proposed Audio Video Bridging architecture and existing 802 architectures (from now on the term “AVB” will be used instead of “Audio Video Bridging”):
These are implemented using relatively small extensions to standard layer-2 MACs and bridges. This “minimal change” philosophy allows non-AVB and AVB devices to communicate using standard 802 frames. However, as shown in Figure 2, only AVB devices are able to: i) reserve a portion of network resources through the use of admission control and traffic shaping and ii) send and receive the new timing-based frames.
Precise synchronization
AVB devices periodically exchange timing information that allows both ends of the link to synchronize their time base reference clock very precisely. This precise synchronization has two purposes:

An 802.1AS network timing domain is formed when all devices follow the requirements of the 802.1AS standard and communicate with each other using the IEEE 802.1AS protocol. Within the timing domain there is a single device that provides a master timing signal called the “Grand Master Clock”. All other devices synchronize their clocks with this master as shown in Figure 3.
The device acting as Grand Master can either be auto selected or can be specifically assigned (e.g., if the network is used in a professional environment that needs “house clock” (audio), or “genlock” (video), or if the timing hierarchy needs to be specified for other reasons). AVB devices typically exchange capability information after physical link establishment. If peer devices on a link are network synchronization capable they will start to exchange clock synchronization frames. If not, then an AVB timing domain boundary is determined (as shown in Figure 2).
Traffic shaping for AV streams
In order to provide professional AV services, the AVB architecture implements traffic shaping using existing 802.1Q forwarding and priority mechanisms but also defines a particular relationship between priority tags and frame forwarding behavior at endpoints and bridges. Traffic shaping is the process of smoothing out the traffic for a stream so that the packets making up the stream are evenly distributed in time. If traffic shaping is not done at sources and bridges, then the packets tend to “bunch” into bursts of traffic that can overwhelm the buffers in subsequent bridges, switches and other infrastructure devices (“bunching” is described in greater detail in the following sections).
Tagging requirements at the stream source and the bridge
AVB streams consist of 802 frames with priority tagging, with normal restrictions on format and length. The default 802.1 tagging for a particular market segment will be chosen to avoid potential conflict with existing uses of the 802.1 priority tags within that market segment.
Traffic shaping at the stream source
Endpoint devices are required to very evenly transmit frames for a particular stream based on the AVB traffic class and the specific QoS parameters that were used when the stream was OK’d by the network (see “Admission controls” below). The specific rules for traffic shaping are described in the IEEE 802.1Qav specification, and are a simple form of what is known as “leaky bucket” credit-based shaping where the bandwidth reserved for a stream controls the time between the packets that make up the stream.,

Traffic shaping at an AVB bridge
The traffic shaping mechanism used by stream sources is also employed by AVB bridges. AVB frames are forwarded with precedence over Best Effort traffic (i.e., reserved AVB stream traffic traversing an AVB bridge has forwarding precedence over non-reserved traffic) and will be subjected to traffic shaping rules (they may need to wait for sufficient credits). Just like for stream sources, the traffic shaping rules for bridges require that frames be distributed very evenly in time, but only on an aggregate class basis rather than on a per-stream basis. This means that all the AVB traffic being transmitted out a particular port is distributed evenly in time measured using the QoS parameters of that class (this is the sum of the bandwidths of all the reservations for a particular AVB class for the particular port as made by the admission control process described below). This has the effect of smoothing out the delivery times (preventing “bunching” of frames ) as a stream propagates through a network. The limited “bunching” has the very useful benefit of placing a relatively small upper limit on the size of the AVB output buffers needed at all egress ports on a bridge, independent of the number of hops in the path. This bounded buffer size is a key attribute that enables bounded delay and eliminates network congestion for admitted AV streams in AVB networks even if non-admitted traffic does experience congestion.
Admission controls
Even though the preceding mechanism can reliably deliver data with a deterministic low latency and low jitter, it will only do so if the network resources (e.g., throughput on a port, buffer space in a bridge) are available along the entire path from the talker to the listener(s). In the AVB protocols, the term ‘talker’ is used to denote a stream source and ‘listener’ denotes a stream destination. In this architecture, it is both the talker’s and the listener's responsibility to guarantee the path is available and to reserve the resources. The process to do this is specified by the 802.1Qat “Stream Reservation Protocol” (SRP). This protocol registers a stream and reserves the resources required through the entire path taken by the stream: Talkers initiate by sending an SRP “talker advertise” message. This message includes a Stream ID composed of the MAC address of the stream source plus a talker-specific 16-bit unique ID and the MAC address of the stream destination. Additionally, the “talker advertise” message includes QoS requirements (e.g., AVB traffic class and data rate information), and accumulated worst case latency. Even though the address and QoS requirements are originated by the talker, the worst case latency is recalculated at every bridge so that the listener can communicate this information to higher layers to do media synchronization.

Figure 5 – Successful reservation (talker advertise) All AVB intermediate bridges receiving a “talker advertise” message check for bandwidth availability on their output ports. If the bridge has sufficient resources available on that port, then the “talker advertise” is propagated to the next station. If the resources are not available, rather than propagating the advertise message, the bridge sends a “talker failed” message. Included in this message is a failure code and bridge identification such that a higher-layer application can provide error checking or notification. An intermediate bridge receiving a “talker failed” will just pass on the message out towards the listener. When a listener receives a "talker advertise” message, it will know whether the resources are available, and if so, the latency for the path. It can then respond with a “listener ready” message that is forwarded back towards the talker. Intermediate bridges use the “ready” message to lock down the resources needed by the stream and to make the appropriate entries in their forwarding database to allow the stream to be sent on the port that received the “ready” message. When the talker receives a “ready” message, it can start transmitting the stream.

The talker can explicitly tear down a stream by de-registering the “talker advertise”, and a listener can disconnect by de-registering the “listener ready”. A de-registration message propagates through the network in the same manner as the original registration. There are also implicit methods used to tear down a connection and release the allocated resources. For example, the listener must periodically resend registrations and “ready” messages, and talkers must periodically resend “advertise” messages. That way any receiving device (including intermediate bridges) could automatically release assigned resources and notify higher layers if the appropriate registrations and reservations were not received due to a system that, for example, suddenly lost power.
LAN-specific considerations
Although the intent of the AVB Task Group is to provide a LAN-technology-independent method for requesting and providing streaming services, the characteristics and architectures of different LAN technologies require specific ways of implementing those services as outlined in the next few sections.
IEEE 802.3 / Ethernet Links
Today Ethernet devices predominantly support full-duplex operation at 100Mbps or greater. Thus, since the total available bandwidth available over such an Ethernet link is both known and constant, an AVB reservation over those Ethernet links combined with the appropriate traffic shaping assures both throughput and delivery latency parameters are met for packets of reserved streams. Since the bandwidth and delivery timing cannot be assured between two devices in an older shared CSMA/CD Ethernet using hubs, these older technologies are not supported by AVB. AVB’s Ethernet time synchronization standard, 802.1AS, leverages and simplifies deployed IEEE 1588-2008.
IEEE 802.11 / Wireless LAN
To date, the AVB support planned for 802.11 is limited to time synchronization. 802.1AS provides for accurate time synchronization over 802.11 links, in part by invoking MAC-specific timestamp-reporting primitives defined in draft 802.11v. The time synchronization protocol defined by 802.1AS has been designed to be resilient to the transmission characteristics that are possible on wireless medium.
Coordinated Shared Network Links
Several MAC/PHY specifications and standards are currently deployed or being developed which operate over existing wires within the home (e.g. AC power lines, coax cabling,). These wires are electrically “Shared” between multiple devices (not point-to-point like Ethernet), so to provide predictable performance, transmission of information onto the wire is “Coordinated” to avoid collisions by one of the devices on the network. Such networking technologies are typically called Coordinated Shared Networks (CSN). If the CSN provides an access method with bounded latency (as most do), and if accurate link-specific time stamping or clock distribution is available for the CSN, then extensions can be defined to take advantage of them.
Identification of participating devices
Since the whole AVB scheme depends on the participation of all devices between the talker and listener, any network element that does not support AVB (including so-called “unmanaged bridges”) must be identified and flagged. The process to do this is described in the developing IEEE 802.1BA “Audio Video Bridging Systems” standard, which specifies the default configuration for AVB devices in a network. For Ethernet, the method specified by 802.1BA to determine if its peer is AVB capable is a combination of 802.3 link capabilities (determined during Ethernet link establishment) and the link delay measurements done by IEEE 802.1AS. An AVB capable Ethernet port uses AVB operation if:
Other layer 2 connections will have their own specific methods to identify cooperating peers. Even though a port may be enabled for AVB operation, there is a possibility that a complete end-to-end AVB connection cannot be made to another endpoint device that is AVB enabled. For example, in Figure 2 above, devices in AVB domain 1 cannot establish an AVB connection to devices in AVB domain 2. An AVB connection can only be assured if a successful reservation is made using SRP and SRP “talker advertise” messages will not be propagated by a non-AVB bridge.
Higher layer protocols
P1722 and P1722.1
Applications using IEC 61883 formats can use procedures defined in IEEE 1722[3] to sample the 802.1AS clock at the start of an A/V data block and then add the worst case transport delay to the sample time to get a presentation time which is inserted into the 1722 packet.
IEEE 1722.1[4] is a standard being developed to allow Discovery, Enumeration, Configuration and Control (DECC) of devices using the P1722 standard.
P1733
If an application uses the IETF Real-Time Transport Protocol (RTP), it can use a new RTCP payload format defined in IEEE 1733[5] that correlates the RTP timestamp with the 802.1AS presentation time. The applications at the renderer(s) then use that correlation to translate the RTP timestamp to the presentation time stamp allowing the renderer(s) to start playing at the same time and keep playing at the same rate.
HTTP
Applications using HTTP can also take advantage of AVB’s time synchronization by carrying a presentation time. E.g., MPEG2 transport streams that require clock synchronization between the server and client can include Transport Time Stamps (TTS) as defined by ARIB TTS (ARIB STD-B24) which are derived from the 802.1AS clock. Similarly, an application could utilize clock synchronization through methods described in ISO 13818-1 Annex J which includes a discussion of various clock recovery schemes proposed for MPEG2 Transport Streams over jitter inducing networks, and figure J.2 illustrates a simple way to use the 802.1AS clock for this purpose.
If the media source is not a real-time source (e.g. a media file on a mass storage device), the presentation times can be generated based on the nominal media rate. If the media source is a real-time source (e.g. a microphone), the presentation time can be constructed by the talker based on its observation of the 802.1AS time in relation to the microphone’s sample clock. Other higher layer services can use AVB in a similar way. Existing connection management schemes, for example, can use the AVB SRP reservation services by mapping their internal stream identifiers with the SRP stream ID. Once a connection is established, streaming can start. E.g., applications using RTP transmit RTCP packets defined by IEEE 1733 that correlate the SSRC to the SRP stream ID. Furthermore, listener applications using 1722 use the SRP stream ID to discriminate between different streams.
Standards Status
| Standard | Status | Date |
| IEEE 802.1Qav | Ratified and published | 19-Jan-2010 |
| IEEE 802.1Qat | Draft 5.0 | 20-Jan-2010 |
| IEEE 802.1AS | Draft 7.0 | 23-Mar-2010 |
| IEEE 802.1BA | Draft 1.0 | 08-Feb-2010 |
| IEEE P1722 | Draft 2.2 | 1-Mar-2010 |
| IEEE P1733 | Draft 2.2 | 20-Apr-2009 |
| IEEE P1722.1 | Draft 0.02 | 7-Apr-2010 |
References