6. BGP

This chapter provides information about configuring BGP.

Note:

BGP is supported on all 7210 SAS platforms as described in this document, except those operating in access-uplink mode.

6.1. BGP overview

Border Gateway Protocol (BGP) is an inter-autonomous system routing protocol. An autonomous system (AS) is a network or group of routers logically organized and controlled by a common network administration. BGP enables routers to exchange network reachability information, including information about other ASs that traffic must traverse to reach other routers in other ASs. To implement BGP, the ASN must be specified in the config>router context. A 7210 SAS BGP configuration must contain at least one group and include information about at least one 7210 SAS neighbor (peer).

AS paths are the routes to each destination. Other attributes, such as the origin of the path, the multiple exit discriminator (MED), the local preference, and communities included with the route are called path attributes. When BGP interprets routing and topology information, loops can be detected and eliminated. Route preference for routes learned from the configured peers can be enabled among groups of routes to enforce administrative preferences and routing policy decisions.

Note:

MP-BGP (family IPv4 and IPv6) for use in Layer 3 VPN services (also known as VPRN services) is supported on all platforms as described in this document.
BGP (family IPv4 and IPv6) is not available for use in the base routing instance. It is only available for use as a PE-CE routing protocol.
The VPN-IPv4, VPN-IPv6, and L2-VPN (BGP-AD) BGP address families are supported on the 7210 SAS-Mxp (standalone), 7210 SAS-Sx/S 1/10GE (standalone and standalone-VC), 7210 SAS-Sx 10/100GE (standalone and standalone-VC), and 7210 SAS-T (network).
The MVPN-IPv4 BGP address family is supported on the 7210 SAS-Mxp, 7210 SAS-R6, 7210 SAS-R12, 7210 SAS-Sx/S 1/10GE (standalone and standalone-VC), and 7210 SAS-T (network).
The EVPN BGP address family is supported only on the 7210 SAS-Mxp (standalone), 7210 SAS-R6, and 7210 SAS-R12.

6.2. BGP communication

There are two types of BGP peers: internal BGP (iBGP) and external BGP (eBGP) (Figure 24):

iBGP is used to communicate with peers in the same autonomous system. Routes received from an iBGP peer in the same autonomous system are not advertised to other iBGP peers (unless the router is a route reflector) but can be advertised to an eBGP peer.
eBGP is used to communicate with peers in different autonomous systems. Routes received from an router in a different AS can be advertised to both eBGP and iBGP peers.

Autonomous systems share routing information, such as routes to each destination and information about the route or AS path, with other ASs using BGP. Routing tables contain lists of known routers, reachable addresses, and associated path cost metrics to each router. BGP uses the information and path attributes to compile a network topology.

6.2.1. Message types

Four message types are used by BGP to negotiate parameters, exchange routing information and indicate errors. They are:

Open message
After a transport protocol connection is established, the first message sent by each side is an Open message. If the Open message is acceptable, a Keepalive message confirming the Open is sent back. When the Open is confirmed, Update, Keepalive, and Notification messages can be exchanged.
Open messages consist of the BGP header and the following fields:
1. version
  The current BGP version number is 4.
2. local ASN
  The autonomous system number is configured in the config>router context.
3. hold time
  Configure the maximum time BGP will wait between successive messages (either keep alive or update) from its peer, before closing the connection. Configure the local hold time with in the config>router>bgp context.
4. BGP identifier
  IP address of the BGP system or the router ID. The router ID must be a valid host address.
Update Message
Update messages are used to transfer routing information between BGP peers. The information contained in the packet can be used to construct a graph describing the relationships of the various autonomous systems. By applying rules, routing information loops and some other anomalies can be detected and removed from the inter-AS routing,
The update messages consist of a BGP header and the following optional fields:
1. unfeasible routes length
  The field length which lists the routes being withdrawn from service because they are considered unreachable.
2. withdrawn routes
  The associated IP address prefixes for the routes withdrawn from service.
3. total path attribute length
  The total length of the path field that provides the attributes for a possible route to a destination.
4. path attributes
  The path attributes presented in variable length TLV format.
5. network layer reachability information (NLRI)
  IP address prefixes of reachability information.
keepalive Message
Keepalive messages, consisting of only a 19 octet message header, are exchanged between peers frequently so hold timers do not expire. The keepalive messages determine whether a link is unavailable.
notification
A Notification message is sent when an error condition is detected. The peering session is terminated and the BGP connection (TCP connection) is closed immediately after sending it.
The following figure shows BGP configuration.
Figure 24: BGP configuration

6.3. Group configuration and peers

To enable BGP routing, participating routers must have BGP enabled and be assigned to an autonomous system and the neighbor (peer) relationships must be specified. A router typically belongs to only one AS. TCP connections must be established in order for neighbors to exchange routing information and updates. Neighbors exchange BGP open messages that includes information such as ASNs, BGP versions, router IDs, and hold-time values. Keepalive messages determine whether a connection is established and operational. The hold-time value specifies the maximum time BGP will wait between successive messages (either keep alive or update) from its peer, before closing the connection.

In BGP, peers are arranged into groups. A group must contain at least one neighbor. A neighbor must belong to a group. Groups allow multiple peers to share similar configuration attributes.

Although neighbors do not have to belong to the same AS, they must be able to communicate with each other. If TCP connections are not established between two neighbors, the BGP peering will not be established and updates will not be exchanged.

Peer relationships are defined by configuring the IP address of the routers that are peers of the local BGP system. When neighbor and peer relationships are configured, the BGP peers exchange update messages to advertise network reachability information.

6.4. Hierarchical levels

BGP parameters are initially applied on the global level. These parameters are inherited by the group and neighbor (peer) levels. Parameters can be modified and overridden on a level-specific basis. BGP command hierarchy consists of three levels:

global level
group level
neighbor level

Many of the hierarchical BGP commands can be modified on different levels. The most specific value is used. That is, a BGP group-specific command takes precedence over a global BGP command. A neighbor-specific statement takes precedence over a global BGP and group-specific command; for example, if you modify a BGP neighbor-level command default, the new value takes precedence over group- and global- level settings.

Note:

Careful planning is essential to implement commands that can affect the behavior of global, group, and neighbor-levels. Because the BGP commands are hierarchical, analyze the values that can disable features on the global or group levels that must be enabled at the neighbor level. For example, if you enable the damping command on the global level but want it disabled only for a specific neighbor (not for all neighbors within the group), you cannot configure a double no command (no no damping) to enable the feature.

6.5. Route reflection

In a standard BGP configuration, all BGP speakers within an AS must have full BGP mesh to ensure that all externally learned routes are redistributed through the entire AS. iBGP speakers do not re-advertise routes learned from one iBGP peer to another iBGP peer. If a network grows, scaling issues could emerge because of the full mesh configuration requirement. Instead of peering with all other iBGP routers in the network, each iBGP router only peers with a router configured as a route reflector.

Route reflection circumvents the full mesh requirement but maintains the full distribution of external routing information within an AS. Route reflection is effective in large networks because it is manageable, scalable, and easy to implement. Route reflection is implemented in autonomous systems with a large internal BGP mesh to reduce the number of iBGP sessions required within an AS.

Note:

7210 SAS-R6 and 7210 SAS-R12 devices can be configured as route reflector clients or servers. These devices support route reflector server functionality for VPN-IPv4, VPN-IPv6, and BGP LU routes. All other 7210 SAS devices can be configured only as route reflector clients.

A route reflector (RR) provides route reflection services to iBGP peers called clients. Other iBGP peers of the RR are called non-clients. An RR and its client peers form a cluster. A large AS can be subdivided into multiple clusters, each identified by a unique 32-bit cluster ID. Each cluster contains at least one route reflector, which is responsible for redistributing route updates to all clients. Route reflector clients do not need to maintain a full peering mesh between each other. They only require peering to the route reflectors in their cluster. The route reflectors must maintain a full peering mesh between all non-clients within the AS.

Each route reflector must be assigned a cluster ID and specify which neighbors are clients and which are non-clients to determine which neighbors should receive reflected routes and which should be treated as a standard iBGP peer. Additional configuration is not required for the route reflector besides the typical BGP neighbor parameters.

The following figure shows a simple full-mesh configuration with several BGP routers. When SR-A receives a route from SR-1 (an external neighbor), it must advertise route information to all of its iBGP peers (SR-B, SR-C, SR-D, SR-E, SR-F). To prevent loops, iBGP learned routes are not re-advertised to other iBGP peers.

Figure 25: Fully meshed BGP configuration

When route reflectors are configured, the routers within a cluster do not need to be fully meshed. Figure 25 shows a fully meshed network and Figure 26 shows the same network but with route reflectors configured to minimize the iBGP mesh between SR-A, SR-B, SR-C, and SR-D. SR-A, configured as the route reflector, is responsible for redistributing route updates to clients SR-B, SR-C, and SR-D. iBGP peering between SR-B, SR-C and SR-D is not necessary because even iBGP learned routes are reflected to the route reflector clients.

In the following figure, SR-E and SR-F are shown as non-clients of the route reflector. As a result, a full mesh of iBGP peerings must be maintained between SR-A, SR-E, and SR-F.

Figure 26: BGP configuration with route reflectors

A route reflector enables communication between the clients and non-client peers. Clients of a route reflector do not need to be fully meshed but non-client peers need to be fully meshed within an AS.

A grouping (cluster) is composed of a route reflector (or a redundant pair of route reflectors configured with the same cluster ID) and its client peers. Each route reflector is assigned a cluster ID and this defines the cluster that it and its clients belong to. Multiple route reflectors can be configured within a cluster for redundancy. A router assumes the role as a route reflector by configuring the cluster cluster-id command. No other command is required unless the operator needs to disable reflection to specific clients.

When a route reflector receives an advertised route, it selects the best path. If the best path was received from an eBGP peer, then it is typically advertised, with next hop unchanged, to all clients and non-client peers of the route reflector. If the best path was received from a non-client peer, then it is advertised to all clients of the route reflector. If the best path was received from a client, then it is advertised to all clients and non-client peers.

A router becomes a route reflector whenever it has one or more client iBGP sessions. A client iBGP session is created with the cluster command, which also indicates the cluster ID of the client. The router ID is typically used as the cluster ID, but this is not necessary.

Basic route reflection operation (without add-path configured) can be summarized as follows:

If the best and valid path for an NLRI is learned from a client and disable-client-reflect is not configured, then that route is advertised to all clients, non-clients, and eBGP peers (as allowed by policy). If the client that advertised the best and valid path is a neighbor to which the split-horizon command (at the bgp, group, or neighbor level) applies, then the route is not advertised back to the sending client. In the route that is reflected to clients and non-clients:
1. The route reflector adds an ORIGINATOR_ID attribute if it does not already exist; the ORIGINATOR_ID indicates the BGP identifier (router ID) of the client that originated the route.
2. The route reflector prepends the cluster ID of the client that advertised the route and then the cluster ID of the client receiving the route (if applicable) to the CLUSTER_LIST attribute, creating the attribute if it does not already exist.
If the best and valid path for an NLRI is learned from a client and disable-client-reflect is configured, then that route is advertised to all clients in other clusters, non-clients, and eBGP peers (as allowed by policy). In the route that is reflected to clients in other clusters and non-clients:
1. The route reflector adds an ORIGINATOR_ID attribute if it does not already exist; the ORIGINATOR_ID indicates the BGP identifier (router ID) of the client that originated the route.
2. The route reflector prepends the cluster ID of the client that advertised the route and then the cluster ID of the client receiving the route (if applicable) to the CLUSTER_LIST attribute, creating the attribute if it does not already exist.
If the best and valid path for an NLRI is learned from a non-client, then that route is advertised to all clients and eBGP peers (as allowed by policy). In the route that is reflected to clients:
1. The route reflector adds an ORIGINATOR_ID attribute if it does not already exist; the ORIGINATOR_ID indicates the BGP identifier (router ID) of the non-client that originated the route.
2. The route reflector prepends the cluster ID of the client receiving the route to the CLUSTER_LIST attribute, creating the attribute if it already exists.
If the best and valid path for an NLRI is learned from an eBGP peer, then that route is advertised to all clients, non-clients, and other eBGP peers (as allowed by policy). The ORIGINATOR_ID and CLUSTER_LIST attributes are not added to the route.
If the best and valid path for an NLRI is locally originated by the RR — that is, it was learned through means other than BGP — then that route is advertised to all clients, non-clients, and eBGP peers (as allowed by policy). The ORIGINATOR_ID and CLUSTER_LIST attributes are not added to the route.

The ORIGINATOR_ID and CLUSTER_LIST attributes allow BGP to detect the looping of a route within the AS. If any router receives a BGP route with an ORIGINATOR_ID attribute containing its own BGP identifier, the route is considered invalid. In addition, if a route reflector receives a BGP route with a CLUSTER_LIST attribute containing a locally configured cluster ID, the route is considered invalid. Invalid routes are not installed in the route table and are not advertised to other BGP peers.

If VPN-IPv4 and VPN-IPv6 routes are advertised by a 7210 SAS router, the BGP next-hop address is set as follows for a reflected route:

When a route is reflected from one iBGP peer to another iBGP peer, the RR server does not modify the next hop by default; however, if the next-hop-self command is applied to the RR and enable-rr-vpn-forwarding is configured, then this combination of commands changes the next hop to the local address of the RR used with the peer.
If BGP 3107 LU service optimization has been configured using the config>router>bgp>group>neighbor>advertise-label ipv4 use-svc-routes command along with the enable-rr-vpn-forwarding command, only those BGP 3107 LU routes for which the node installs a swap entry for a VPN label are installed in the FIB.

Note:

On the 7210 SAS, the next-hop-self command can be used only in conjunction with the enable-rr-vpn-forwarding command or for BGP 3107 LU routes.

6.6. Fast external failover

Fast external failover on a group and neighbor basis is supported. For eBGP neighbors, this feature controls whether the router should drop an eBGP session immediately upon an interface-down event, or whether the BGP session should be kept up until the hold-time expires.

When fast external failover is disabled, the eBGP session stays up until the hold-time expires or the interface comes back up. If the BGP routes become unreachable as a result of the down IP interface, BGP withdraws the unavailable route immediately from other peers.

6.7. Sending of BGP communities

The capability to explicitly enable or disable the sending of the BGP community attribute to BGP neighbors, other than through the use of policy statements, is supported.

This feature allows an administrator to enable or disable the sending of BGP communities to an associated peer. This feature overrides communities that are already associated with a specific route or that may have been added via an export route policy. That is, even if the export policies leave BGP communities attached to a specific route, when the disable-communities feature is enabled, no BGP communities are advertised to the associated BGP peers.

6.8. ECMP and BGP route tunnels

Note:

ECMP is not supported for BGP route tunnels.

ECMP is not available for BGP route tunnels and also not on the transport LSP that is used to resolve BGP next-hop. If multiple LSP next-hops are available, only then the first next-hop is used and the rest ignored.

6.9. Next-hop resolution of BGP labeled routes to tunnels

The user enables the resolution of RFC 3107 BGP label route prefixes using tunnels to BGP next hops in the TTM with using following commands:

CLI Syntax:

config>router>bgp>next-hop-res

labeled-route-transport-tunnel

[no] family family

resolution {any | disabled | filter}

resolution-filter

[no] ldp

[no] rsvp

[no] sr-isis

[no] sr-ospf

Note:

The 7210 SAS-Mxp does not support the use of RLFA with BGP 3107 labeled routes.

The transport-tunnel context allows the user to configure different types of BGP label routes: label-IPv4 and VPN routes (which includes both VPN-IPv4 and VPN-IPv6 routes). By default, all labeled routes resolve to LDP, even if the preceding CLI commands are not configured in the system.

If the resolution command is set to disabled, the default binding to LDP tunnels resumes. If resolution is set to any, the supported tunnel type selection is based on the TTM preference. The order of preference of TTM tunnels is the following:

RSVP
LDP
segment routing OSPF
segment routing IS-IS

If the rsvp option is enabled, BGP searches for the best metric RSVP LSP to the address of the BGP next-hop. The address can correspond to the system interface or to another loopback used by the BGP instance on the remote node. MPLS provides the LSP metric in the tunnel table. In the case of multiple RSVP LSPs with the same lowest metric, BGP selects the LSP with the lowest tunnel ID. When using RSVP LSP, only FRR one-to-one is supported for services using BGP 3107 labelled routes.

If the ldp option is enabled, BGP searches for an LDP LSP with a FEC prefix corresponding to the address of the BGP next-hop.

If the sr-isis or sr-ospf option is enabled, an SR tunnel to the BGP next-hop is selected in the TTM from the lowest preference IS-IS or OSPF instance. If many instances have the same lowest preference, the lowest numbered IS-IS or OSPF instance is chosen.

Note:

RLFA used with SR is not supported for BGP 3107 tunnels when services are resolved to use BGP 3107 tunnels.

If one or more explicit tunnel types are specified using the resolution-filter option, only these tunnel types are selected again following the TTM preference. The resolution command must be set to filter to activate the list of tunnel types configured in the resolution-filter context.

6.9.1. VPN-IPv4 and VPN-IPv6 route resolution

The user enables the resolution of VPN-IPv4 and VPN-IPv6 prefixes using tunnels to MP-BGP peers using the following commands:

CLI Syntax:

config>service>vprn

auto-bind-tunnel

resolution {any|disabled|filter}

resolution-filter

[no] ldp

[no] rsvp

[no] sr-isis

[no] sr-ospf

The auto-bind-tunnel context configures the binding of VPRN routes to tunnels. The user must configure the resolution command to enable auto-bind resolution to tunnels in the TTM. If the resolution command is set to disabled, auto-binding to a tunnel is removed.

If the resolution command is set to any, any supported tunnel type in the vprn context is selected following the TTM preference. If one or more explicit tunnel types are specified using the resolution-filter command, only these tunnel types are selected again following the TTM preference. The following tunnel types are supported in a vprn context in order of preference: RSVP, LDP, and segment routing (SR).

If the rsvp command is enabled, BGP searches for the best metric RSVP LSP to the address of the BGP next-hop. This address can correspond to the system interface or to another loopback that the BGP instance uses on the remote node. MPLS provides the LSP metric in the tunnel table. In the case of multiple RSVP LSPs with the same lowest metric, BGP selects the LSP with the lowest tunnel ID.

If the ldp command is enabled, BGP searches for an LDP LSP with a FEC prefix corresponding to the address of the BGP next-hop.

If the sr-isis or sr-ospf command is configured, an SR tunnel to the BGP next-hop is selected in the TTM from the lowest preference ISIS or OSPF instance. If many instances have the same lowest preference, the lowest numbered IS-IS or OSPF instance is chosen.

The BGP tunnel type is not explicitly configured in VPRN resolution and is therefore implicit. It is always preferred over any other tunnel type enabled in the auto-bind-tunnel context. However, the BGP tunnel type is configurable as a new tunnel type for BGP EVPN prefixes. The user must enable the BGP tunnel type to ensure that inter-area or inter-as prefixes are resolved.

The user must set the resolution command to filter to activate the list of tunnel types configured under resolution-filter.

When configured in a VPRN service (using the configure>service>vprn>spoke-sdp command), an explicit SDP to a BGP next-hop overrides the auto-bind-tunnel selection for that BGP next-hop only. There is no support for reverting automatically to the auto-bind-tunnel selection if the explicit SDP goes down. The user must delete the explicit spoke-SDP in the VPRN service to resume using the auto-bind-tunnel selection for the BGP next-hop.

6.10. Route selection criteria

For each prefix in the routing table, the routing protocol selects the best path. Then, the best path is compared to the next path in the list until all paths in the list are exhausted. The following parameters are used to determine the best path:

Routes are not considered if they are unreachable.
An RTM preference is lowered as well as the hierarchy of routes from a different protocol. The lower the preference the higher the chance of the route being the active route.
Routes with higher local preference have preference.
Routes with the shorter AS path have preference.
Routes with the lower origin have preference. IGP = 0 EGP = 1 INCOMPLETE = 2
Routes with the lowest MED metric have preference. Routes with no MED value are exempted from this step unless always-compare-med is configured.
Routes learned by an eBGP peer rather than those learned from an iBGP peer are preferred.
Routes with the lowest IGP cost to the next-hop path attribute are preferred.
Routes with the lowest BGP-ID are preferred.
Routes with shortest cluster list are preferred.
Routes with lowest next-hop IP address are preferred.

Notes:

Note:

7210 SAS devices do not support BGP ECMP (multipath). An ECMP value is set to 1 and is assumed in the following notes.

For BGP-VPN routes with the same prefix but a different Route Distinguisher (RD) that are imported in a VRF, if ECMP is not enabled in that VRF, the preceding selection criteria are used until parameter point 8. If all selection criteria are still the same after that point, the last updated route will be selected.
For BGP-VPN routes with the same prefix but a different Route Distinguisher (RD) that reach parameter point 8 in the selection criteria, all routes will be flagged as BEST and USED although the actual number of used routes will depend on the ECMP value configured in the VRF.
For BGP-VPN routes with the same prefix and same Route Distinguisher (RD) that reach parameter point 8 in the selection criteria, such routes will be flagged as BEST but parameter points 9-11 will determine which routes are submitted to the VRF and marked as USED in accordance to the ECMP value configured in the VRF.

6.11. Enabling best external

Enabling the best-external feature is supported only at the config>router>bgp level. This feature can be enabled/disabled on a per address family basis, with IPv4 and IPv6 as the only options supported initially. Enabling best-external for IPv4 causes the new advertisement rules to apply to both regular IPv4 unicast routes as well as labeled-IPv4 (SAFI4) routes. Similarly, enabling best-external for IPv6 causes the new advertisement rules to apply to both regular IPv6 unicast routes as well as labeled-IPv6 (SAFI4) routes.

The advertise-external command cannot be applied to a route reflector unless client-to-client reflection is disabled (disable-client-reflect in the CLI).

6.11.1. BGP decision process with best external

When best-external is enabled for an address family, all routes belonging to that address family must be classified internally as either “internal” or “external”. A route is “internal” if:

It was received from an iBGP peer in the same AS.
It was originated by a router in the same or a different RR cluster of the same AS.

A route is “external” if it was received from an eBGP peer in a different AS.

The tie-breaking steps of the decision process are run as usual on all of the routes (both “internal” and “external”) for a particular destination until only one path, the best path, is left. If this is an external route then the decision process must be rerun on only the “internal” routes to find the single best path in that subset. This “best internal” route is advertised to confed-eBGP peers, as described in Advertisement rules with best external.

If the overall best path found by the first run of the decision process is an internal route with NEXT_HOP n the decision process must be rerun on only the “external” routes with NEXT_HOP not equal to n to find the single best path in that subset. This “best external” route is advertised to iBGP peers, as described in Advertisement rules with best external.

6.11.2. Advertisement rules with best external

The advertisement rules when advertise-external is enabled can be summarized as follows:

If a router has advertise-external enabled and its best overall route is an internal route then this best route should be advertised to:
1. all iBGP RR clients (if the route came from a non-client peer) or all iBGP non-clients (if the route came from a client peer).
2. all eBGP peers
3. If there is a best external route, it should be sent to iBGP client and non-client peers instead of the best overall route.
If a router has advertise-external enabled and its best overall route is an external route then this best route should be advertised to all iBGP peers and all eBGP peers.

6.11.3. Displaying best-external routes

BGP show commands display the following information for this feature:

For each RIB-IN entry in the output of the show router bgp routes prefix hunt command there is a Flags field that indicates the origin of the route and whether it is valid, best, used, etc. This feature reflects an “Advertised” value in the Flags field. This indicates that the route was advertised to one or more peers. If the “Advertised” flag is present but the “Best” flag is not the operator can determine that the route was probably a best-external.
The show router bgp neighbor advertised-routes command displays all advertised routes to that peer, including routes that were overall best, best-external and best-internal.
The advertise-external configuration (specifically the address families for which it is enabled) is displayed as part of the show router bgp output.

Note that the overall best, best-external and best-internal routes for a prefix can be determined from the output of the show router bgp routes prefix command. The first external route to be displayed in the output is always be the best-external route and the first internal route to be displayed in the output is always be the best-internal route. Only one of these routes will have the “Best” flag set, and this will be the overall best route.

6.12. BGP path attributes

A BGP route for a specific NLRI is distinguished from other BGP routes for the same NLRI by its set of path attributes. Each path attribute is encoded as a TLV in the Path Attributes field of the Update message, and describes a property of the path. The type field of the TLV identifies the path attribute and the value field carries data specific to the attribute type.

The 7210 SAS supports the following path attributes:

ORIGIN (well-known mandatory)
AS_PATH (well-known mandatory)
NEXT_HOP (well-known; required only in Update messages with IPv4 prefixes in the NLRI field); see Next-hop indirection for information about the NEXT_HOP attribute
MED (optional non-transitive)
LOCAL_PREF (well-known; required only in Update messages sent to iBGP peers)
ATOMIC_AGGR (well-known discretionary)
AGGREGATOR (optional transitive)
COMMUNITY (optional transitive)
ORIGINATOR_ID (optional non-transitive)
CLUSTER_LIST (optional non-transitive)
MP_REACH_NLRI (optional non-transitive)
MP_UNREACH_NLRI (optional non-transitive)
EXT_COMMUNITY (optional transitive)
AS4_PATH (optional transitive)
AS4_AGGREGATOR (optional transitive)
CONNECTOR (optional transitive)
PMSI_TUNNEL (supported only on platforms that support NG-MVPN with BGP signaling; refer to the 7210 SAS Software Release Notes 22.x.Rx for more information about NG-MVPN with BGP signaling)

6.12.1. NEXT_HOP attribute

The NEXT_HOP attribute indicates the IPv4 address of the BGP router that is the next hop to reach the IPv4 prefixes in the NLRI field. If the Update message is advertising routes other than IPv4 unicast routes, next hop of these routes is encoded in the MP_REACH_NLRI attribute and the NEXT_HOP attribute is not included in the Update message.

6.12.1.1. Next-hop indirection

The 7210 SAS supports next-hop indirection for most types of BGP routes. Next-hop indirection means that BGP next hops are logically separated from resolved next hops in the forwarding plane (IOMs). The separation allows the grouping of routes that share the same BGP next hops such that if the method of BGP next-hop resolution changes, only one forwarding plane update is required, instead of one update for each route in the group. The convergence time after the next-hop resolution change is uniform, and not linear, with the number of prefixes. The next-hop indirection technology supports Prefix-Independent Convergence (PIC). The 7210 SAS uses next-hop indirection whenever possible; there is no option to disable the functionality.

The following families support next-hop indirection on the 7210 SAS:

label-IPv4
MVPN-IPv4 (not supported on the 7210 SAS-Sx/S 1/10GE and 7210 SAS-Sx 10/100GE)
VPN-IPv4
label-IPv6
VPN-IPv6
L2-VPN
PW route

6.13. BGP Routing Information Base

The entire set of BGP routes learned and advertised by a BGP router make up its BGP Routing Information Base (RIB). Conceptually, the BGP RIB contains three parts:

RIB-IN
LOC-RIB
RIB-OUT

The RIB-IN (or Adj-RIBs-In, as defined in RFC 4271) contains the BGP routes received from peers that the router has stored in its memory.

The LOC-RIB contains modified versions of the BGP routes in the RIB-IN. The path attributes of a RIB-IN route can be modified using BGP import policies. All LOC-RIB routes for the NLRI are compared using the BGP decision process, which selects the best path for each NLRI. The local router uses the best paths in the LOC-RIB for forwarding, filtering, auto-discovery, and other tasks.

The RIB-OUT (or Adj-RIBs-Out, as defined in RFC 4271) contains the BGP routes selected for advertisement to peers. A BGP route is generally not advertised to a peer; that is, the router is not held in the RIB-OUT unless it is used locally, but there are exceptions. BGP export policies modify the path attributes of a LOC-RIB route to create the path attributes of the RIB-OUT route. A specific LOC-RIB route can be advertised with different path attribute values to different peers, and a 1:N relationship may exist between LOC-RIB and RIB-OUT routes.

When a VPN route is rejected by an import policy or not imported by any services, it is deleted from the RIB-IN. For VPN-IPv4 and VPN-IPv6 routes, this behavior can be changed by configuring the mp-bgp-keep command. Configuring this option maintains rejected VPN-IP routes in the RIB-IN and issuing a Route Refresh message is not required for an import policy change.

6.13.1. LOC-RIB features

The 7210 SAS implements the following LOC-RIB processing features:

BGP decision process
BGP route installation in the route table
BGP route installation in the tunnel table
BGP fast reroute
route flap damping (RFD)

The BGP fast reroute feature is described in BGP fast reroute and BGP fast reroute in a VPRN.

6.13.2. BGP fast reroute

BGP fast reroute uses indirection techniques in the forwarding plane and BGP backup path precomputation in the control plane to support the fast reroute of BGP traffic around unreachable or failed BGP next hops. BGP fast reroute is supported for label-IPv4, VPN-IPv4, and VPN-IPv6 routes. For information about BGP fast reroute in a VPRN, see BGP fast reroute in a VPRN.

The following table describes the scenarios supported in the base router BGP context.

Table 79: BGP fast reroute scenarios (base router context)

Ingress packet	Primary route	Backup route	PIC
IPv4	IPv4 route with next-hop A resolved by an IPv4 route or any shortcut tunnel	IPv4 route with next-hop B resolved by an IPv4 route or any shortcut tunnel	No
IPv4	Label-IPv4 route with next-hop A resolved by any transport tunnel	Label-IPv4 route with next-hop B resolved by any transport tunnel	Yes
IPv4	Label-IPv4 route with next-hop A resolved by a local route	Label-IPv4 route with next-hop B resolved by a local route	Yes
IPv4	Label-IPv4 route with next-hop A resolved by a static route	Label-IPv4 route with next-hop B resolved by a static route	Yes

6.13.2.1. Calculating backup paths

BGP fast reroute is optional on the 7210 SAS. Use the bgp backup-path command to enable the feature.

Note:

On the 7210 SAS, the backup-path command is supported only for label-IPv4 routes.

In the base router context, the backup-path command is used to control fast reroute on a per-RIB basis (labeled IPv4 routes). When the command specifies a particular family, BGP attempts to find a backup path for every prefix learned by the associated BGP RIB.

A prefix supports ECMP paths or a backup path, but not both. The backup path is the best path after the primary path and any paths using the same BGP next hop as the primary path have been removed.

6.13.2.2. Failure detection and switchover to the backup path

When BGP fast reroute is enabled, BGP decides when a primary path is no longer usable and notifies the IOM. Based on BGP input, the IOM immediately reroutes affected traffic to the backup path.

When BGP fast reroute is enabled, the IOM reroutes traffic onto a backup path based on input from BGP. When BGP decides that a primary path is no longer usable, it notifies the IOM and affected traffic is immediately switched to the backup path.

The following events trigger failure notifications to the IOM and traffic rerouting to backup paths:

peer IP address is unreachable and peer tracking is enabled
BFD session associated with the BGP peer goes down
BGP session is terminated with the peer (for example, send or receive NOTIFICATION)
there is no longer any route (allowed by the next-hop resolution policy, if configured) that can resolve the BGP next-hop address
BGP tunnel that resolves the next hop goes down because the BGP label-IPv4 route is withdrawn by the peer or becomes invalid due to an unresolved next hop

6.13.3. BGP fast reroute in a VPRN

In a VPRN context, BGP fast reroute is supported for VPN-IPv4 and VPN-IPv6 routes. On the 7210 SAS, BGP fast reroute is not supported for unlabeled IPv4 and unlabeled IPv6 routes.

The following table describes the supported VPRN scenarios.

Table 80: BGP fast reroute scenarios (VPRN context)

Ingress packet	Primary route	Backup route	PIC
IPv4 (ingress PE)	IPv4 route with next-hop A resolved by an IPv4 route	IPv4 route with next-hop B resolved by an IPv4 route	No
IPv4 (ingress PE)	VPN-IPv4 route with next-hop A resolved by an LDP, RSVP, or BGP tunnel	VPN-IPv4 route with next-hop A resolved by an LDP, RSVP, or BGP tunnel	Yes
MPLS (egress PE)	IPv4 route with next-hop A resolved by an IPv4 route	IPv4 route with next-hop B resolved by an IPv4 route	No
MPLS (egress PE)	IPv4 route with next-hop A resolved by an IPv4 route	VPN-IPv4 route with next-hop B resolved by an LDP, RSVP, or BGP tunnel	No

6.13.3.1. BGP fast reroute in a VPRN configuration

In a VPRN context, BGP fast reroute is optional and must be enabled. Fast reroute can be applied to all IPv4 prefixes, all IPv6 prefixes, all IPv4 and IPv6 prefixes, or to a specific set of IPv4 and IPv6 prefixes.

If all IP prefixes require backup path protection, use a combination of the BGP instance-level backup-path and VPRN-level enable-bgp-vpn-backup commands. The VPRN BGP backup-path command enables BGP fast reroute for all IPv4 prefixes and/or all IPv6 prefixes that have a best path through a VPRN BGP peer. The VPRN-level enable-bgp-vpn-backup command enables BGP fast reroute for all IPv4 prefixes and/or all IPv6 prefixes that have a best path through a remote PE peer.

6.13.4. RIB-OUT features

This section describes features related to RIB-OUT processing.

6.13.4.1. BGP export policies

The export command is used to apply one or more policies (up to 15) to a neighbor or group, or to the entire BGP context. The export command that is most specific to a peer is applied. An export policy command applied at the neighbor level takes precedence over the same command applied at the group or global level. An export policy command applied at the group level takes precedence over the same command specified on the global level. The export policies applied at different levels are not cumulative. The policies listed in an export command are evaluated in the order in which they are specified in the configuration.

Note:

The export command can reference a policy before the policy has been created as a policy-statement.

The most common uses for BGP export policies are the following:

BGP export policies can be used to locally originate a BGP route by exporting (or redistributing) a non-BGP route that is installed in the route table and actively used for forwarding. The non-BGP route is most frequently a direct, static, or aggregate route (exporting IGP routes into BGP is generally not recommended).
BGP export policies can be used to block the advertisement of certain BGP routes towards specific BGP peers. The routes may be blocked on the basis of IP prefix, communities, and so on.
BGP export policies can be used to modify the attributes of BGP routes advertised to specific BGP peers. The following path attribute modifications are possible using BGP export policies:
1. change the ORIGIN value
2. add a sequence of AS numbers to the start of the AS_PATH. When a route is advertised to an eBGP peer, the addition of the local-AS or global-AS numbers to the AS_PATH is always the final step (done after export policy).
3. replace the AS_PATH with a new AS_PATH. When a route is advertised to an eBGP peer, the addition of the local-AS or global-AS numbers to the AS_PATH is always the final step (done after export policy).
4. prepend an AS number multiple times to the start of the AS_PATH. When a route is advertised to an eBGP peer, the addition of the local-AS or global-AS numbers to the AS_PATH is always the final step (done after export policy). The add or replace action on the AS_PATH supersedes the prepend action if both are specified in the same policy entry.
5. change the NEXT_HOP to a specific IP address. When a route is advertised to an eBGP peer, the next hop cannot be changed from the local-address.
6. change the NEXT_HOP to the local-address used with the peer (next-hop-self)
7. add a value to the MED. If the MED attribute does not exist, it is added.
8. subtract a value from the MED. If the MED attribute does not exist, it is added with a value of 0. If the result of the subtraction is a negative number, the MED metric is set to 0.
9. set the MED to a specific value
10. set the MED to the cost of the IP route (or tunnel) used to resolve the BGP next hop
11. set LOCAL_PREF to a specific value when advertising to an iBGP peer
12. add, remove, or replace standard communities
13. add, remove, or replace extended communities
14. add a static value to the AIGP metric when advertising the route to an AIGP-enabled peer with a modified BGP next hop. The static value is incremental to the automatic adjustment of the LOC-RIB AIGP metric to reflect the distance between the local router and the received BGP next hop.
15. increment the AIGP metric by a fixed amount when advertising the route to an AIGP-enabled peer with a modified BGP next hop. The static value is a substitute for the dynamic value of the distance between the local router and the received BGP next hop.

6.13.4.2. Outbound Route Filtering

The ORF mechanism allows the ORF-sending router to signal to a peer, the ORF-receiving router, a set of route filtering rules (ORF entries) that the ORF-receiving router should apply to its route advertisements toward the ORF-sending router. The ORF entries are encoded in Route Refresh messages.

The use of ORF on a session must be negotiated; that is, both routers must advertise the ORF capability in their Open messages. The ORF capability describes the address families that support ORF, and for each address family, the ORF types that are supported and the ability to send and receive each type. 7210 SAS routers support ORF type 3, which is ORF based on extended communities, for only the following address families:

VPN-IPv4
VPN-IPv6
MVPN-IPv4

On the 7210 SAS, the send and receive capability for ORF type 3 is configurable using the send-orf and accept-orf commands, but the setting applies to all supported address families.

The 7210 SAS support for ORF type 3 allows a PE router that imports VPN routes with a particular set of route target extended communities to indicate to a peer (for example, a route reflector) that it only wants to receive VPN routes that contain one or more of these extended communities. When the PE router wants to inform its peer about a new RT extended community, it sends a Route Refresh message to the peer containing an ORF type 3 entry instructing the peer to add a permit entry for the 8-byte extended community value. When the PE router wants to inform its peer about an RT extended community that is no longer needed, it sends a Route Refresh message to the peer containing an ORF type 3 entry instructing the peer to remove the permit entry for the 8-byte extended community value.

On the 7210 SAS, the type-3 ORF entries that are sent to a peer can be generated dynamically (if no route target extended communities are specified with the send-orf command) or specified statically. Dynamically generated ORF entries are based on the route targets that are imported by all locally-configured VPRNs.

A router that has installed ORF entries received from a peer can still apply BGP export policies to the session. If the evaluation of a BGP export policy results in a reject action for a VPN route that matches a permit ORF entry, the route is not advertised; that is, the export policy has the final word.

Note:

The 7210 SAS implementation of ORF filtering is efficient. It takes less time to filter a large number of VPN routes with ORF than it does to reject non-matching VPN routes using a conventional BGP export policy.

Despite the advantages of ORF compared to manually configured BGP export policies, RTC is the better technology when it comes to dynamic filtering based on route target extended communities. See RT constrained route distribution for more information about RTC.

6.13.4.3. RT constrained route distribution

The RTC mechanism allows a router to advertise an RTC route, which is a special type of MP-BGP route, to specific peers; the associated AFI is 1 and the SAFI is 132. The NLRI of an RTC route encodes an origin AS and a route target extended community with prefix-type encoding (for example, if there is a prefix-length and host bits after the prefix-length are set to zero). A peer receiving RTC routes does not advertise VPN routes to the RTC-sending router unless they contain a route target extended community that matches one of the received RTC routes. As with any other type of BGP route, RTC routes are propagated loop-free throughout and between ASs. If multiple RTC routes exist for the same NLRI, the BGP decision process selects one as the best path. The propagation of the best path installs RIB-OUT filter rules as it travels from one router to the next, and this process creates an optimal VPN route distribution tree rooted at the source of the RTC route.

Note:

RTC and extended community-based ORF mechanisms are similar in that they both allow a router to signal to a peer the route target extended communities they want to receive in VPN routes from that peer. However, RTC has distinct advantages over extended community-based ORF because it is more widely supported, it is simpler to configure, and its distribution scope is not limited to a direct peer.

The capability to exchange RTC routes is advertised when the route-target keyword is added to the relevant family command. RTC is supported on eBGP and iBGP sessions of the base router instance. On a specific session, either ORF or RTC may be used, but not both; if RTC is configured, the ORF capability is not announced to the peer.

RTC is supported for the following BGP address families:

VPN-IPv4
VPN-IPv6
MVPN-IPv4
L2-VPN (BGP-AD)
EVPN

Note:

BGP address family support varies per 7210 SAS platform. RTC is supported only for the BGP families that the specific 7210 SAS platform supports. See BGP overview for more information.

When RTC is negotiated with one or more peers, the software automatically originates and advertises to these peers one /96 RTC route (the origin AS and route target extended community are fully specified) for every route target imported by a locally-configured VPRN or BGP-based Layer 2 VPN. Route targets are supported for all BGP families in the preceding list.

Note:

When route-target is enabled, it is activated for all address families configured on the node under BGP. Per-family activation is not supported.

The 7210 SAS also supports the group or neighbor level default-route-target command, which causes routers to generate and send a 0:0:0/0 default RTC route to one or more peers. Sending the default RTC route to a peer conveys a request to receive all VPN routes from that peer. The default-route-target command is typically configured on sessions that a route reflector has established with its PE clients. A received default RTC route is never propagated to other routers.

The route reflector advertises RTC routes in accordance with the rules described in RFC 4684. These rules ensure that RTC routes for the same NLRI that are originated by different PE routers in the same AS are correctly distributed within the AS.

When a BGP session comes up and RTC is enabled on the session (both peers advertised the MP-BGP capability), routers delay sending any VPN-IPv4 and VPN-IPv6 routes until either the session has been up for 60 seconds or the end-of-RIB marker is received for the RTC address family. When the VPN-IPv4 and VPN-IPv6 routes are sent, they are filtered to include only those with a route target extended community that matches an RTC route from the peer. VPN-IP routes matching an RTC route originated in the local AS are advertised to any iBGP peer that advertises a valid path for the RTC NLRI. That is, route distribution is not constrained to only the iBGP peer advertising the best path. However, VPN-IP routes matching an RTC route originated outside the local AS are only advertised to the eBGP or iBGP peer that advertises the best path.

Note:

The 7210 SAS does not support an equivalent of BGP-Multipath for RTC routes. There is no way to distribute VPN routes across more than one “almost” equal set of inter-AS paths.

6.13.4.4. Minimum Route Advertisement Interval

In accordance with RFC 4271, a BGP router should not send updated NLRI reachability information to a BGP peer until a specific period of time (the minimum route advertisement interval (MRAI)) has elapsed since the last update. The RFC suggests that the MRAI should be configurable per peer, but does not propose a specific algorithm; consequently, MRAI implementation details vary from one router operating system to another.

On the 7210 SAS, the MRAI is configurable on a per-session basis using the min-route-advertisement command. This CLI command can be configured with any value between 1 and 255 seconds, and the configuration applies to all address families. The default value is 30 seconds, regardless of the session type (eBGP or iBGP). The MRAI timer is started at the configured value when the session is established and counts down continuously. When the timer reaches zero, it resets to the configured value and all pending RIB-OUT routes are sent to the peer.

To send Update messages that advertise new NLRI reachability information more frequently for some address families than others, use the rapid-update command to overrides the remaining time on a peer MRAI timer and immediately send routes belonging to specified address families (and all other pending updates) to the peers receiving these routes. The following address families support rapid-update:

EVPN
L2-VPN
MVPN-IPv4

In many cases, the default MRAI is appropriate for all address families (or at least those not included in the preceding list) when it applies to Update messages that advertise reachable NLRI, but it is not the best option for Update messages that advertise unreachable NLRI (route withdrawals). Fast reconvergence after some types of failures requires route withdrawals to propagate to other routers as quickly as possible so that they can calculate and start using new best paths, which would be impeded by the effect of the MRAI timer at each router hop. This is facilitated by the rapid-withdrawal configuration command.

When rapid-withdrawal is configured, Update messages containing withdrawn NLRI are sent immediately to a peer without waiting for the MRAI timer to expire. Update messages containing reachable NLRI continue to wait for the MRAI timer to expire, or for a rapid-update trigger, if it applies. When rapid-withdrawal is enabled, it applies to all address families.

6.13.4.5. Advertise-inactive

BGP does not allow a route to be advertised unless it is the best path in the RIB and an export policy allows the advertisement.

In some cases, it may be useful to advertise the best BGP path to peers despite the fact that the BGP path is inactive, for example, if the path is inactive because there are one or more preferred non-BGP routes to the same destination and one of these other routes is the active route. The 7210 SAS supports this flexibility using the advertise-inactive command; other supported methods include Add-paths.

When the BGP advertise-inactive command is configured on a BGP session, it has the following effect on the IPv4, IPv6, label-IPv4, and label-IPv6 routes advertised to that peer:

If the active route for the IP prefix is a BGP route, that route is advertised.If the active route for the IP prefix is a non-BGP route and there is at least one valid but inactive BGP route for the same destination, the best of the inactive and valid BGP routes is advertised unless the non-BGP active route is matched and accepted by an export policy applied to the session.
If the active route for the IP prefix is a non-BGP route and there are no (valid) BGP routes for the same destination, no route is advertised for the prefix unless the non-BGP active route is matched and accepted by an export policy applied to the session.

6.13.4.6. Split-horizon

Split-horizon refers to the action taken by a router to avoid advertising a route back to the peer from which it was received. By default, the 7210 SAS applies split-horizon behavior only to routes received from iBGP non-client peers, and split-horizon only works for routes to non-imported routes within a RIB. Split-horizon functionality, which can never be disabled, prevents a route learned from a non-client iBGP peer from being advertised to the sending peer or any other non-client peer.

To apply split-horizon behavior to routes learned from RR clients or eBGP peers, configure the split-horizon command in either the global BGP, group or neighbor contexts. When split-horizon is enabled on these types of sessions, it only prevents the advertisement of a route back to its originating peer; for example, the software does not prevent the advertisement of a route learned from one eBGP peer back to a different eBGP peer in the same neighbor AS.

6.14. Add-paths

6.14.1. Receiving multiple paths per prefix from a BGP peer

If the 7210 SAS receives an advertisement of an NLRI and path from a specific peer and that peer subsequently advertises the same NLRI with different path information (a different next-hop or different path attributes), the new path overwrites the existing path.

However, when the add-path has been negotiated with the peer, the newly advertised path is stored in the RIB-IN along with all paths previously advertised (and not withdrawn) by the peer.

For router A to receive multiple paths per NLRI from peer B for a specific address family (AFI x, SAFI y), the BGP capabilities advertisement during session setup must indicate that peer B must send multiple paths for (AFI x, SAFI y) and router A is willing to receive multiple paths for (AFI x, SAFI y).

When the add-path receive capability for (AFI x, SAFI y) has been negotiated with a peer, all advertisements and withdrawals of NLRI within that address family by that peer include a path identifier.

If the add-path has been negotiated with a peer and a path identifier is expected but missing, or if the add-path has not been negotiated with a peer and a path identifier is present but not expected, a Notification message is sent with the error subcode indicating Invalid Network Field, in accordance with standard BGP error handling procedures.

The path identifiers have no significance to the receiving peer. If the combination of NLRI and path identifier in an advertisement from a peer is unique (does not match an existing route in the RIB-IN from that peer), the route is added to the RIB-IN. If the combination of NLRI and path identifier in a received advertisement is the same as an existing route in the RIB-IN from the peer, the new route replaces the existing one. If the combination of NLRI and path identifier in a received withdrawal matches an existing route in the RIB-IN from the peer, that route is removed from the RIB-IN.

A BGP Update message from an add-path peer may advertise and withdraw more than one NLRI belonging to one or more address families. In this case, the add-path may be supported for some address families and not others. In this situation, the receiving BGP router should not require that all path identifiers in the Update message be the same.

The following figure shows an Update message carrying a VPN-IPv4 NLRI with a path identifier.

Figure 27: BGP Update message with path identifier for VPN-IPv4 NLRI

The following figure shows an Update message carrying an IPv4 NLRI with a path identifier.

Figure 28: BGP Update message with path identifier for IPv4 NLRI

Currently, add-path is only supported by the iBGP sessions with other add-path capable peers. The add-path capability is not supported for eBGP sessions or for native IPv4 and IPv6 routes (that is, IPv4 and IPv6 routes advertised without a label) in iBGP sessions.The ability to receive multiple paths per prefix from an add-path peer is configurable per route type. The supported route types are the following:

label-IPv4
label-IPv6
VPN-IPv4
VPN-IPv6

6.14.2. Path selection with add-paths

The LOC-RIB may have multiple paths for a prefix. The path selection mode refers to the algorithm used to decide which of these paths to advertise to an add-paths peer. SR OS supports the Add-N path selection algorithm described in draft-ietf-idr-add-paths-guidelines. The Add-N algorithm selects, as candidates for advertisement, the N best paths with unique BGP next-hops. In the SR OS implementation, the default value of N is configurable, per address-family, at the BGP instance, group and neighbor levels, however, this default value can be overridden, for specific prefixes, using route policies. The maximum number of paths to advertise for a prefix to an add-paths neighbor is the value N assigned by a BGP import policy to the best path for P, otherwise it defaults to the neighbor, group or instance level configuration of N for the address family to which P belongs.

Add-paths allows non-best paths to be advertised to a peer, but it still complies with basic BGP advertisement rules such as the iBGP split horizon rule: a route learned from an iBGP neighbor cannot be readvertised to another iBGP neighbor unless the router is configured as a route reflector.

6.14.3. BGP decision process with ADD-PATH

To use multiple paths per NLRI for forwarding and to advertise multiple paths per NLRI to add-path peers, a router implementing an add-path must run a modified version of the BGP decision process. The existing BGP decision algorithm selects the one best path for any particular NLRI. Paths that are second best or third best remain in the RIB-IN but are not installed in the LOC-RIB and not advertised to peers.

The system automatically changes its BGP decision process for routes belonging to a particular address family whenever either of the following applies:

BGP edge PIC is enabled for the address family
the add-path send capability is enabled for that address family on one or more peering sessions

When BGP PIC is enabled, the BGP decision process selects a backup path per prefix or NLRI to install in the LOC_RIB. The algorithm is summarized as follows:

Select the single best path based on a full evaluation of all the BGP tie-breaking rules, as described in the following examples:
1. Select the route with the highest route preference.
2. From all routes with an AIGP metric, select the route with the lowest sum of the AIGP metric value stored with the RIB-IN copy of the route and the iteratively resolved distance between the calculating router and the BGP NEXT_HOP of the route.
3. Select the route with the highest local preference (LOCAL_PREF).
4. Select the route with the shortest AS path.
5. Select the route with the lowest origin.
6. Among routes advertised by the same neighbor AS (unless always-compare-med is configured). Select the route with the lowest MED.
7. Prefer routes learned from eBGP peers over routes learned from iBGP peers.
8. Select the route with the lowest IGP cost (unless ignore-nh-metric is configured).
9. Select the route received by the peer with the lowest originator ID or BGP identifier.
10. Select the route with the shortest cluster list.
11. Select the route received from the lowest peer IP address.
Select up to one additional second best path among the paths remaining after removing from consideration all paths with a NEXT_HOP or BGP identifier (or originator ID) in common with any of the previously-selected best paths. A full evaluation of all the BGP tie-breaking rules is required to find this single second-best path, as shown in the following examples:
1. Select the route with the highest route preference.
2. Select the route with the highest local preference (LOCAL_PREF).
3. Select the route with the shortest AS path (unless as-path-ignore is configured).
4. Select the route with the lowest origin.
5. Among routes advertised by the same neighbor AS (unless always-compare-med is configured) select the route with the lowest MED.
6. Prefer routes learned from eBGP peers over routes learned from iBGP peers.
7. Select the route with the lowest IGP cost.
8. Select the route received by the peer with the lowest originator ID or BGP identifier.
9. Select the route with the shortest cluster list.
10. Select the route received from the lowest peer IP address.

6.14.4. Advertising multiple paths using ADD-PATH

For router A to send multiple paths per NLRI to peer B for a particular address family (AFI x, SAFI y), the BGP capability advertisement during session setup must indicate that router A must send multiple paths for (AFI x, SAFI y), and peer B is willing to receive multiple paths for (AFI x, SAFI y).

By default, unless changed through configuration, all paths for a particular NLRI in the LOC-RIB are advertised to all add-path peers with which the send capability has been negotiated. All such advertisements (and any subsequent withdrawals) include a path identifier. Each advertised path for a specific NLRI must have a unique path identifier. When a path is reflected or propagated from one peer to another, the path identifier is expected to change, even if there has been no change in the next-hop. A BGP Update message sent to an add-path peer may advertise and withdraw more than one NLRI belonging to one or more address families. In this case, the add-path may be supported for some address families and not others, and the path identifiers associated with different NLRI in the Update message may be the same or different.

In the current implementation, the add-path is only supported by the iBGP sessions it forms with other add-path capable peers. The add-path capability is not supported for eBGP sessions or for native IPv4 and IPv6 routes (that is, IPv4 and IPv6 routes advertised without a label) in iBGP sessions. The ability to receive multiple paths per prefix from an add-path peer is configurable per route type. Route type support is as follows:

label-IPv4
label-IPv6
VPN-IPv4
VPN-IPv6

6.14.5. Limiting the number of paths per prefix

Advertising multiple paths per prefix to a peer means that the peer must maintain more entries in its RIB-IN than would be the case without add-path. The memory and CPU resources associated with these extra paths may not be justified if the peer cannot take advantage of them. Operators may therefore want precise control over the number of paths per prefix to send to particular peers.

The new add-paths CLI node (BGP, group or neighbor level) has address family-specific commands to set the maximum number of paths to send per prefix.

To ensure routing consistency in cases where an add-path speaking router has a mix of add-path and non add-path peers and where the number of paths to send for a particular prefix can vary by add-path peer, the following behavior should be enforced: if the advertising router advertises n paths for prefix XYZ to peer1 and m paths to peer2, and n < m, then all the paths advertised to peer1 must be included in the paths advertised to peer2. Suppose the LOC-RIB has N paths for prefix XYZ. The preceding behavior can be guaranteed if:

the N paths are sorted in strict order of their preference by the BGP decision process: p1 (overall best path found during step 1 of BGP decision process with ADD-PATH), p2, p3, …, pN (a path found during step 2 or 3 of BGP decision process with ADD-PATH)
p1 (only) is advertised to non add-path peers, add-path peers that indicate a send-only capability and add-path peers for which the configured path-limit is 1
(p1, p2) is advertised to add-path peers for which the configured path-limit is 2
(p1, p2, p3, …, pN) is advertised to add-path peers for which the configured path-limit is N, or else the path-limit is configured as max (default)

6.15. AIGP metric

Note:

Accumulated Interior Gateway Protocol (AIGP) is only supported on 7210 SAS-Mxp, 7210 SAS-R6, and 7210 SAS-R12.

The AIGP metric is an optional, non-transitive attribute that can be attached to selected routes using route policies. In networks that use AIGP, BGP paths with a lower end-to-end IGP cost are preferred, even if the compared paths span more than one AS or IGP instance. AIGP differs from MED in the following important ways:

AIGP is not transitive between completely distinct autonomous systems. It is only transitive across internal AS boundaries.
AIGP is always compared in paths that have the AIGP attribute, regardless of whether they are located in different neighbor ASs.
AIGP is more important than MED in the BGP decision process.
AIGP is automatically incremented every time there is a BGP next-hop change, so that the system can track the end-to-end IGP cost. All arithmetic operations on MED attributes must be performed manually, such as by using route policies.

On the 7210 SAS, AIGP is supported only in the base router BGP instance and only for label-IPv4 and 6PE routes. The AIGP attribute is only sent to peers configured using the aigp command. If the attribute is received from a peer that is not configured for AIGP, or if the attribute is received in a non-supported route type, the attribute is discarded and not propagated to other peers. The AIGP attribute is still displayed in BGP show command output.

When the 7210 SAS receives a route with an AIGP attribute and it re-advertises the route to an AIGP-enabled peer without changes to the BGP next hop, the AIGP metric value is unchanged by the advertisement (RIB-OUT) process. However, if the route is re-advertised with a new BGP next hop, the AIGP metric value is automatically incremented, either by the route table or tunnel table cost to reach the received BGP next hop, or by a value configured using route policies.

6.16. Command interactions and dependencies

This section describes the BGP command interactions and dependencies that apply to the configuration or operational maintenance of 7210 SAS routers.

6.16.1. Changing the ASN

If the autonomous system number (ASN) is changed on a router with an active BGP instance, the new ASN is not used until the BGP instance is restarted, either by administratively disabling or enabling the BGP instance or by rebooting the system with the new configuration.

6.16.2. BGP advertisement

BGP advertisement allows a BGP router to indicate to a peer, using the Optional Parameter, the features that it supports so that the router and peer can coordinate and use only the features that both support. Each capability in the Optional Parameter is TLV-encoded with a unique type code. The 7210 SAS supports the following capability codes:

Multiprotocol BGP (code 1)
Route refresh (code 2)
Outbound route filtering (code 3)
Graceful restart (code 64)
4-octet ASN (code 65)
Add-path (code 69)

6.16.3. Changing the local ASN

Changing the local AS of an active BGP instance:

at the global level, causes the BGP instance to restart with the new local ASN
at the group level, causes BGP to reestablish the peer relationships with all peers in the group with the new local ASN
at the neighbor level, causes BGP to reestablish the peer relationship with the new local ASN

6.16.4. Changing the router ID at the configuration level

If you configure a new router ID in the config>router context, protocols are not automatically restarted with the new router ID. The updated router ID is only used the next time the protocol is initialized or reinitialized. An interim period can occur when the protocols use different router IDs.

6.16.5. Hold time and keep alive timer dependencies

The BGP hold time specifies the maximum time BGP will wait between successive messages (either keep alive or update) from its peer, before closing the connection. This configuration parameter can be set at three levels. The most specific value is used.

global level - applies to all peers
group level - applies to all peers in group
neighbor level - only applies to specified peer

Although the keep alive time can be user specified, the configured keep alive timer is overridden by the value of hold time under the following circumstances:

If the hold time specified is less than the configured keep alive time, then the operational keep alive time is set to one third of the specified hold time; the configured keep alive time is unchanged.
If the hold time is set to zero, then the operational value of the keep alive time is set to zero; the configured keep alive time is unchanged. This means that the connection with the peer will be up permanently and no keep alive packets are sent to the peer.

If the hold time or keep alive values are changed, the changed timer values take effect when the new peering relationship is established. Changing the values cause the peerings to restart. The changed timer values are used when renegotiating the peer relationship.

6.16.6. Import and export route policies

Import and export route policy statements are specified for BGP on the global, group, and neighbor level. Up to five unique policy statement names can be specified in the command line per level. The most specific command is applied to the peer. Defining the policy statement name is not required before being applied. Policy statements are evaluated in the order in which they are specified within the command context.

The import and export policies configured on different levels are not cumulative. The most specific value is used. An import or export policy command specified on the neighbor level takes precedence over the same command specified on the group or global level. An import or export policy command specified on the group level takes precedence over the same command specified on the global level.

6.16.7. Route damping and route policies

To prevent BGP systems from sending excessive route changes to peers, BGP route damping can be implemented. Damping can reduce the number of update messages sent between BGP peers, to reduce the load on peers, without adversely affecting the route convergence time for stable routes.

The damping profile defined in the policy statement is applied to control route damping parameters. Route damping characteristics are specified in a route damping profile and are referenced in the action for the policy statement or in the action for a policy entry. Damping can be specified at the global, group, or neighbor level with the most specific command applied to the peer.

6.16.8. AS Override

The BGP-4 Explicit AS Override simplifies the use of the same ASN across multiple RFC 2547 VPRN sites.

The Explicit AS Override feature can be used in VPRN scenarios where a customer is running BGP as the PE-CE protocol and some or all of the CE locations are in the same Autonomous System (AS). With normal BGP, two sites in the same AS would not be able to reach each other directly since there is an apparent loop in the ASPATH.

With AS Override enabled on an egress eBGP session, the Service Provider network can rewrite the customer ASN in the ASPATH with its own ASN as the route is advertised to the other sites within the same VPRN.

6.17. Configuration guideline for BGP

The following are the configuration guidelines for BGP:

The 7210 SAS platforms can act only as a route reflector clients, except for the 7210 SAS-R6 and 7210 SAS-R12, which can act as route reflector servers.
The 7210 SAS platforms support IPv4 family for PE-CE eBGP instance and for RFC 3107 labeled IPv4 routes. It does not support the IPv4 family in the base routing instance to exchange IPv4 routes.
The 7210 SAS platforms support IPv6 family for PE-CE eBGP instance and for RFC 3107 labeled IPv6 routes. It does not support the IPv6 family in the base routing instance to exchange IPv6 routes.

6.18. BGP configuration process overview

The following figure shows the process to provision basic BGP parameters.

Figure 29: BGP configuration and implementation flow

6.19. Configuration notes

This section describes BGP configuration caveats.

6.19.1. General

Before BGP can be configured, the router ID (a valid host address, not the MAC address default) and autonomous system global parameters must be configured.
BGP instances must be explicitly created on each BGP peer. There are no default BGP instances on a 7210 SAS.

6.19.1.1. BGP defaults

The following list summarizes the BGP configuration defaults:

By default, the 7210 SAS is not assigned to an AS.
A BGP instance is created in the administratively enabled state.
A BGP group is created in the administratively enabled state.
A BGP neighbor is created in the administratively enabled state.
No BGP router ID is specified. If no BGP router ID is specified, BGP uses the router system interface address.
The 7210 SAS BGP timer defaults are the values recommended in IETF drafts and RFCs (see BGP MIB notes)
If no import route policy statements are specified, then all BGP routes are accepted.
If no export route policy statements specified, then all best and used BGP routes are advertised and non-BGP routes are not advertised.

6.19.1.2. BGP MIB notes

The 7210 SAS implementation of the RFC 1657 MIB variables listed in the following table differs from the IETF MIB specification.

Table 81: 7210 SAS and IETF MIB variations

MIB variable	Description	RFC 1657 allowed values	Allowed values
bgpPeerMinASOriginationInterval	Time interval in seconds for the MinASOriginationInterval timer. The suggested value for this timer is 15 seconds.	1 to 65535	2 to 255
bgpPeerMinRouteAdvertisementInterval	Time interval in seconds for the MinRouteAdvertisementInterval timer. The suggested value for this timer is 30.	1 to 65535	1 to 255 1

Note:

A value of 0 is supported when the rapid-update command is applied to an address family that supports it.

If SNMP is used to set a value of X to the MIB variable in the following table, there are three possible results:

Table 82: MIB variable with SNMP

Condition

Result

X is within IETF MIB values

and

X is within 7210 SAS values

SNMP set operation does not return an error

MIB variable set to X

X is within IETF MIB values

and

X is outside 7210 SAS values

SNMP set operation does not return an error

MIB variable set to “nearest” 7210 SAS supported value (e.g. 7210 SAS range is 2 - 255 and X = 65535, MIB variable will be set to 255)

Log message generated

X is outside IETF MIB values

and

X is outside 7210 SAS values

SNMP set operation returns an error

When the value set using SNMP is within the IETF allowed values and outside the 7210 SAS values as specified in the preceding tables, a log message is generated. The log messages that display are similar to the following log messages:

Sample log message for setting bgpPeerMinASOriginationInterval to 65535

576 2006/11/12 19:45:48 [Snmpd] BGP-4-bgpVariableRangeViolation: Trying to set

bgpPeerMinASOrigInt to 65535 - valid range is [2-255] - setting to 255

Sample log message for setting bgpPeerMinASOriginationInterval to 1

594 2006/11/12 19:48:05 [Snmpd] BGP-4-bgpVariableRangeViolation: Trying to set

bgpPeerMinASOrigInt to 1 - valid range is [2-255] - setting to 2

Sample log message for setting bgpPeerMinRouteAdvertisementInterval to 256

535 2006/11/12 19:40:53 [Snmpd] BGP-4-bgpVariableRangeViolation: Trying to set

bgpPeerMinRouteAdvInt to 256 - valid range is [2-255] - setting to 255

Sample log message for setting bgpPeerMinRouteAdvertisementInterval to 1

566 2006/11/12 19:44:41 [Snmpd] BGP-4-bgpVariableRangeViolation: Trying to set

bgpPeerMinRouteAdvInt to 1 - valid range is [2-255] - setting to 2

6. BGP

6.1. BGP overview

6.2. BGP communication

6.2.1. Message types

Figure 24: BGP configuration

6.3. Group configuration and peers

6.4. Hierarchical levels

6.5. Route reflection

Figure 25: Fully meshed BGP configuration

Figure 26: BGP configuration with route reflectors

6.6. Fast external failover

6.7. Sending of BGP communities

6.8. ECMP and BGP route tunnels

6.9. Next-hop resolution of BGP labeled routes to tunnels

6.9.1. VPN-IPv4 and VPN-IPv6 route resolution

6.10. Route selection criteria

6.11. Enabling best external

6.11.1. BGP decision process with best external

6.11.2. Advertisement rules with best external

6.11.3. Displaying best-external routes

6.12. BGP path attributes

6.12.1. NEXT_HOP attribute

6.12.1.1. Next-hop indirection

6.13. BGP Routing Information Base

6.13.1. LOC-RIB features

6.13.2. BGP fast reroute

Table 79: BGP fast reroute scenarios (base router context)

6.13.2.1. Calculating backup paths

6.13.2.2. Failure detection and switchover to the backup path

6.13.3. BGP fast reroute in a VPRN

Table 80: BGP fast reroute scenarios (VPRN context)

6.13.3.1. BGP fast reroute in a VPRN configuration

6.13.4. RIB-OUT features

6.13.4.1. BGP export policies

6.13.4.2. Outbound Route Filtering

6.13.4.3. RT constrained route distribution

6.13.4.4. Minimum Route Advertisement Interval

6.13.4.5. Advertise-inactive

6.13.4.6. Split-horizon

6.14. Add-paths

6.14.1. Receiving multiple paths per prefix from a BGP peer

Figure 27: BGP Update message with path identifier for VPN-IPv4 NLRI

Figure 28: BGP Update message with path identifier for IPv4 NLRI

6.14.2. Path selection with add-paths

6.14.3. BGP decision process with ADD-PATH

6.14.4. Advertising multiple paths using ADD-PATH

6.14.5. Limiting the number of paths per prefix

6.15. AIGP metric

6.16. Command interactions and dependencies

6.16.1. Changing the ASN

6.16.2. BGP advertisement

6.16.3. Changing the local ASN

6.16.4. Changing the router ID at the configuration level

6.16.5. Hold time and keep alive timer dependencies

6.16.6. Import and export route policies

6.16.7. Route damping and route policies

6.16.8. AS Override

6.17. Configuration guideline for BGP

6.18. BGP configuration process overview

Figure 29: BGP configuration and implementation flow

6.19. Configuration notes

6.19.1. General

6.19.1.1. BGP defaults

6.19.1.2. BGP MIB notes

Table 81: 7210 SAS and IETF MIB variations

Table 82: MIB variable with SNMP