This chapter provides information about Network Address Translation (NAT) and implementation notes.
Topics in this chapter include:
BNG Subscriber — A broader term than the ESM subscriber, independent of the platform on which the subscriber is instantiated. It includes ESM subscribers on the 7750 SR as well as subscribers instantiated on third-party BNGs. Some NAT functions, such as Subscriber Aware Large Scale NAT44 utilizing standard RADIUS attributes, work with subscribers independently of the platform on which they are instantiated.
Deterministic NAT — A mode of operation where mappings between the NAT subscriber and the outside IP address and port range are allocated at configuration time. Each subscriber is permanently mapped to an outside IP address and a dedicated port block, referred to as the deterministic port block. Logging is not needed because the reverse mapping can be obtained using a known formula. The subscriber's ports can be expanded by allocating a dynamic port block if all ports in the deterministic port block are exhausted. In that case, logging of the dynamic port block allocation and de-allocation is required.
Enhanced Subscriber Management (ESM) subscriber — A host or a collection of hosts instantiated on the 7750 SR Broadband Network Gateway (BNG). The ESM subscriber represents a household or a business entity for which various services with committed Service Level Agreements (SLAs) can be delivered. The NAT function is not part of basic ESM functionality.
L2-Aware NAT — In the context of the 7750 SR platform, L2-Aware NAT combines the Enhanced Subscriber Management (ESM) subscriber ID and the inside IP address to perform translation into a unique outside IP address and outside port. This is in contrast with the classical NAT technique, where only the inside IP address is considered for address translation. Because the subscriber ID alone is sufficient to make the address translation unique, L2-Aware NAT allows many ESM subscribers to share the same inside IP address. The scalability, performance, and reliability requirements are the same as in LSN.
Large Scale NAT (LSN) — Refers to a collection of network address translation techniques used in service provider networks, implemented on highly scalable, high-performance hardware that facilitates various intra- and inter-node redundancy mechanisms. The purpose of the LSN term is to delineate the high-scale, high-performance NAT functions found in service provider networks from enterprise NAT, which usually serves a much smaller customer base at lower speeds. The following NAT techniques can be grouped under the LSN name:
Each distinct NAT technique is referred to by its corresponding name (Large Scale NAT44 [or CGN], DS-Lite, and NAT64), with the understanding that in the context of the 7750 SR platform they are all part of LSN (and not enterprise-based NAT).
The term Large Scale NAT44 can be used interchangeably with the term Carrier Grade NAT (CGN), which by its name implies high reliability, high scale, and high performance. These are typical requirements found in service provider (carrier) networks.
The term L2-Aware NAT refers to a separate category of NAT, defined outside of LSN.
NAT RADIUS accounting — Reporting (or logging) of address-translation-related events (port block allocation and de-allocation) via the RADIUS accounting facility. NAT RADIUS accounting uses regular RADIUS accounting messages (start/interim-update/stop) as defined in RFC 2866, RADIUS Accounting, with NAT-specific VSAs.
NAT RADIUS accounting — Can be used interchangeably with the term NAT RADIUS logging.
NAT Subscriber — In NAT terminology, a NAT subscriber is an inside entity whose true identity is hidden from the outside. There are a few types of NAT implementations in the 7750 SR, and the subscribers for each implementation are defined as follows:
Non-deterministic NAT — A mode of operation where all outside IP address and port block allocations are made dynamically at the time of subscriber instantiation. Logging is required in this case.
Port block — A collection of ports assigned to a subscriber. A deterministic LSN subscriber can have only one deterministic port block, which can be extended by multiple dynamic port blocks. A non-deterministic LSN subscriber can be assigned only dynamic port blocks. All port blocks for an LSN subscriber must be allocated from a single outside IP address.
Port-range — A collection of ports that can span multiple port blocks of the same type. For example, the deterministic port range includes all ports reserved for deterministic consumption. Similarly, the dynamic port range is the total collection of ports that can be allocated in the form of dynamic port blocks. Other types of port ranges are well-known ports and static port forwards.
The 7750 SR supports Network Address (and port) Translation (NAPT) to provide continuity of legacy IPv4 services during the migration to native IPv6. By equipping the multi-service ISA (MS ISA) in an IOM3-XP, the 7750 SR can operate in two different modes, known as:
These two modes both perform source address and port translation, as commonly deployed for shared Internet access. The 7750 SR with NAT is used to provide consumer broadband or business Internet customers with access to IPv4 Internet resources using a shared pool of IPv4 addresses, such as may occur around the forecast IPv4 exhaustion. During this time, it is expected that native IPv6 services will still be growing and a significant amount of Internet content will remain IPv4.
Network Address Translation devices modify the IP headers of packets between a host and server, changing some or all of the source address, destination address, source port (TCP/UDP), destination port (TCP/UDP), or ICMP query ID (for ping). The 7750 SR in both NAT modes performs Source Network Address and Port Translation (S-NAPT). S-NAPT devices are commonly deployed in residential gateways and enterprise firewalls to allow multiple hosts to share one or more public IPv4 addresses to access the Internet. In the context of NAT, the common terms inside and outside refer to devices inside the NAT (that is, behind or masqueraded by the NAT) and outside the NAT, on the public Internet.
TCP/UDP connections use ports for multiplexing, with 65,536 ports available for every IP address. When many hosts try to share a single public IP address, there is a chance of port collision, where two different hosts use the same source port for a connection. S-NAPT devices avoid this collision by translating the source port and tracking it in a stateful manner. All S-NAPT devices are stateful in nature and must monitor connection establishment and traffic to maintain translation mappings. The 7750 SR NAT implementation does not use the well-known port range (1 to 1023).
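The stateful port-translation behavior above can be sketched as follows. This is a minimal illustration, not the 7750 SR implementation; the class and data structures are assumptions.

```python
# Minimal sketch of stateful S-NAPT source-port selection. Two inside hosts
# that pick the same source port are given distinct outside ports, and an
# existing mapping is reused for subsequent packets of the same flow.

class Snapt:
    def __init__(self, outside_ip):
        self.outside_ip = outside_ip
        self.mappings = {}     # (inside_ip, inside_port, proto) -> (outside_ip, outside_port)
        self.used_ports = set()

    def translate(self, inside_ip, inside_port, proto):
        key = (inside_ip, inside_port, proto)
        if key in self.mappings:          # stateful: reuse the existing mapping
            return self.mappings[key]
        # Skip the well-known range 1-1023, as the text above describes.
        port = next(p for p in range(1024, 65536) if p not in self.used_ports)
        self.used_ports.add(port)
        self.mappings[key] = (self.outside_ip, port)
        return self.mappings[key]

nat = Snapt("192.0.2.1")
# Two inside hosts using the same source port no longer collide:
a = nat.translate("10.0.0.1", 5000, "udp")
b = nat.translate("10.0.0.2", 5000, "udp")
```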
In most circumstances, S-NAPT requires the inside host to establish a connection to the public Internet host or server before a mapping and translation occurs. With the initial outbound IP packet, the S-NAPT knows the inside IP, inside port, remote IP, remote port, and protocol. With this information, the S-NAPT device can select an IP and port combination (referred to as the outside IP and outside port) from its pool of addresses and create a unique mapping for this flow of data.
Any traffic returned from the server uses the outside IP and outside port in the destination IP/port fields, matching the unique NAT mapping. The mapping then provides the inside IP and inside port for translation.
The requirement to create a mapping with inside port and IP, outside port and IP, and protocol generally prevents new connections from being established from the outside to the inside, as may occur when an inside host wishes to act as a server.
Applications which operate as servers (such as HTTP or SMTP) or peer-to-peer applications can have difficulty operating behind an S-NAPT, because traffic from the Internet can reach the NAT without a mapping in place.
Different methods can be employed to overcome this, including:
The 7750 SR supports all three methods, following the best-practice RFCs for TCP (RFC 5382, NAT Behavioral Requirements for TCP) and UDP (RFC 4787, Network Address Translation (NAT) Behavioral Requirements for Unicast UDP). Port Forwarding is supported on the 7750 SR to allow servers which operate on well-known ports below 1024 (such as HTTP and SMTP) to request the appropriate outside port for permanent allocation.
STUN is facilitated by the support of Endpoint-Independent Filtering and Endpoint-Independent Mapping (RFC 4787) in the NAT device, allowing STUN-capable applications to detect the NAT and allow inbound P2P connections for that specific application. Many new SIP clients and IM chat applications are STUN capable.
An Application Layer Gateway (ALG) allows the NAT to monitor the application running over TCP or UDP and make appropriate changes to the NAT translations. The 7750 SR has an FTP ALG enabled, following the recommendation of the IETF BEHAVE RFC for NAT (RFC 5382).
Even with these three mechanisms, some applications will still experience difficulty operating behind a NAT. As this is an industry-wide issue, forums such as UPnP, the IETF, and operator and vendor communities are seeking technical alternatives for application developers to traverse NAT (including STUN support). In many cases, the alternative of an IPv6-capable application will give better long-term support without the cost or complexity associated with NAT.
Large Scale NAT represents the most common deployment of S-NAPT in carrier networks today; it is already employed by mobile operators around the world for handset access to the Internet.
A Large Scale NAT is typically deployed in a central network location with two interfaces, the inside towards the customers, and the outside towards the Internet. A Large Scale NAT functions as an IP router and is located between two routed network segments (the ISP network and the Internet).
Traffic can be sent to the Large Scale NAT function on the 7750 SR using IP filters (ACLs) applied to SAPs or by installing static routes with a next-hop of the NAT application. These two methods allow for increased flexibility in deploying the Large Scale NAT, especially in environments where IP/MPLS VPNs are used; in that case, the NAT function can be deployed on a single PE and perform NAT for any number of other PEs by simply exporting the default route.
The 7750 SR NAT implementation supports NAT in the base routing instance and in VPRNs; through NAT, traffic may originate in one VPRN (the inside) and leave through another VPRN or the base routing instance (the outside). This technique can be employed to provide customers of an IP/MPLS VPN with Internet access by introducing a default static route in the customer VPRN and NATing it into the Internet routing instance.
As Large Scale NAT is deployed between two routed segments, the IP addresses allocated to hosts on the inside must be unique to each host within the VPRN. While RFC 1918 private addresses have typically been used for this in enterprise or mobile environments, challenges can occur in fixed residential environments where a subscriber has an existing S-NAPT in their residential gateway. In these cases, the RFC 1918 private address in the home network may conflict with the address space assigned to the residential gateway WAN interface. Some of these issues are documented in draft-shirasaki-nat444-isp-shared-addr-02. Should a conflict occur, many residential gateways will fail to forward IP traffic.
The S-NAPT service on the 7750 SR BNG incorporates a port range block feature to address the scalability of a NAT mapping solution. With a single BNG capable of hundreds of thousands of NAT mappings every second, logging each mapping as it is created and destroyed for later retrieval (as may be required by law enforcement) could quickly overwhelm the fastest of databases and messaging protocols. Port range blocks address the issues of logging and customer location functions by allocating a block of contiguous outside ports to a single subscriber. Rather than logging each NAT mapping, a single log entry is created when the first mapping is created for a subscriber and a final log entry when the last mapping is destroyed. This can reduce the number of log entries by 5000x or more. An added benefit is that, as the range is allocated on the first mapping, external applications or customer location functions may be populated with this data to perform real-time subscriber identification, rather than having to query the NAT for the subscriber identity in real time and possibly delay applications.
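The logging reduction above is simple arithmetic; the numbers below are illustrative assumptions chosen to show how a block-level log can replace thousands of per-mapping logs.

```python
# Back-of-the-envelope sketch: per-mapping logging vs. per-block logging.
# Values are illustrative, not measured.

mappings_per_block = 5000          # mappings a subscriber creates over a block's lifetime
per_mapping_logs = 2 * mappings_per_block   # a create log and a destroy log per mapping
per_block_logs = 2                 # one log when the block is allocated, one when freed

reduction = per_mapping_logs // per_block_logs
```

With these assumed numbers, the reduction factor is 5000, matching the order of magnitude quoted in the text.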
Port range blocks are configurable as part of outside pool configuration, allowing the operator to specify the number of ports allocated to each subscriber when a mapping is created. Once a range is allocated to the subscriber, these ports are used for all outbound dynamic mappings and are assigned in a random manner to minimize the predictability of port allocations (draft-ietf-tsvwg-port-randomization-05).
Port range blocks also serve another useful function in a Large Scale NAT environment, and that is to manage the fair allocation of the shared IP resources among different subscribers.
When a subscriber exhausts all ports in their block, further mappings are prohibited. As with any enforcement system, some exceptions are allowed, and the NAT application can be configured with reserved ports to allow high-priority applications access to outside port resources even when the block has been exhausted by low-priority applications.
Reserved ports allow an operator to set aside a small number of ports for designated applications should a port range block be exhausted. Such a scenario may occur when a subscriber is unwittingly subjected to a virus or engaged in extreme cases of P2P file transfers. In these situations, rather than blocking all new mappings indiscriminately, the 7750 SR NAT application allows operators to nominate a number of reserved ports and then designate a 7750 SR forwarding class as containing high-priority traffic for the NAT application. Whenever traffic matching a priority-session forwarding class reaches the NAT application, reserved ports are consumed to improve the chances of success. Priority sessions could be used by the operator for services such as DNS, web portal, e-mail, and VoIP, to permit these applications even when a subscriber has exhausted their ports.
The outside IP address is always shared between the subscriber's port forwards (static or via PCP) and its dynamically allocated port blocks, provided that the port from the port forward is in the range above 1023. This behavior can lead to starvation of dynamic port blocks for the subscriber. An example of this scenario is shown in Figure 52.
Eventually the PCs in Home 1 come online and try to connect to the Internet. Due to the dynamic port block exhaustion for the IP address 3.3.3.1 (which is mandated by the static port forward for the Web Server), the dynamic port block allocation fails and consequently the PCs cannot access the Internet. There is no additional attempt within the CGN to allocate another outside IP address. In the CGN there is no distinction between the PCs in Home 1 and the Web Server when it comes to the source IP address; they both share the same source IP address 2.2.2.1 on the CPE.
To prevent starvation of dynamic port blocks for subscribers that utilize port forwards, a dynamic port block (or blocks) is reserved during the lifetime of the port forward. Those reserved dynamic port blocks are associated with the same subscriber that created the port forward. However, a log is not generated until the dynamic port block is actually used and mappings within that block are created.
At the time of the port forward creation, the dynamic port block will be reserved in the following fashion:
The reserved dynamic port block (even without any mappings) continues to be associated with the subscriber as long as a port forward for the subscriber is present. The log (syslog or RADIUS) is generated only when there is no active mapping within the dynamic port block and all port forwards for the subscriber have been deleted.
Additional considerations with dynamic port block reservation:
Creating a NAT mapping is only one half of the problem; removing a NAT mapping at the appropriate time maximizes the shared port resource. Having ports mapped when an application is no longer active reduces solution scale and may impact the customer experience should they exhaust their port range block. The NAT application provides timeout configuration for TCP, UDP, and ICMP.
TCP state is tracked for all TCP connections, supporting both three-way handshake and simultaneous TCP SYN connections. Separate and configurable timeouts exist for TCP SYN, TCP transition (between SYN and Open), established and time-wait state. Time-wait assassination is supported and enabled by default to quickly remove TCP mappings in the TIME WAIT state.
UDP does not have the concept of connection state and is subject to a simple inactivity timer. Company-sponsored research into applications and NAT behavior suggested that some applications, like the BitTorrent Distributed Hash Table (DHT) protocol, can make a large number of outbound UDP connections that are unsuccessful. Rather than waiting the default five (5) minutes to time these out, the 7750 SR NAT application supports a udp-initial timeout which defaults to 15 seconds. When the first outbound UDP packet is sent, the 15-second timer starts; only after subsequent packets (inbound or outbound) does the default UDP timer become active, greatly reducing the number of UDP mappings.
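The two-stage UDP timeout above can be sketched as a small state machine. This is an illustrative model only; the class, field names, and time handling are assumptions, not the SR OS implementation.

```python
# Sketch of the two-stage UDP timeout: a short initial timeout (15 s) applies
# until any subsequent packet is seen, after which the normal inactivity
# timeout (5 min) applies. Times are plain seconds for simplicity.

UDP_INITIAL = 15    # udp-initial timeout default
UDP_IDLE = 300      # default UDP inactivity timeout (5 minutes)

class UdpMapping:
    def __init__(self, now):
        self.last_seen = now
        self.established = False    # becomes True on any subsequent packet

    def packet(self, now):
        self.last_seen = now
        self.established = True

    def expired(self, now):
        timeout = UDP_IDLE if self.established else UDP_INITIAL
        return now - self.last_seen > timeout

m = UdpMapping(now=0)
# A one-shot outbound probe with no reply times out after 15 s, not 5 min:
unanswered_gone = m.expired(now=20)
```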
It is possible to define watermarks to monitor the actual usage of sessions and/or ports.
For each watermark, a high and a low value must be set. Once the high value is reached, a notification is sent. As soon as the usage drops below the low watermark, another notification is sent.
Watermarks can be defined at the nat-group, pool, and policy levels.
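The high/low watermark pair described above amounts to simple hysteresis, which avoids a flood of notifications when usage oscillates around a single threshold. A minimal sketch, with illustrative threshold values and message strings (assumptions, not SR OS output):

```python
# Hysteresis sketch of session/port usage watermarks: one notification when
# usage crosses the high value, one when it falls back below the low value,
# and nothing while it moves in between.

class Watermark:
    def __init__(self, high, low):
        self.high, self.low = high, low
        self.raised = False

    def update(self, usage):
        """Return a notification string when a threshold is crossed, else None."""
        if not self.raised and usage >= self.high:
            self.raised = True
            return "high-watermark reached"
        if self.raised and usage <= self.low:
            self.raised = False
            return "usage below low-watermark"
        return None

wm = Watermark(high=90, low=70)
events = [wm.update(u) for u in (50, 95, 92, 85, 69)]
```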
NAT is supported for DHCP, PPPoE, and L2TP hosts; static and ARP hosts are not supported.
In an effort to address issues of conflicting address space raised in draft-shirasaki-nat444-isp-shared-addr-02, an enhancement to Large Scale NAT was co-developed to give every broadband subscriber their own NAT mapping table, yet still share a common outside pool of IPs.
Layer-2 Aware (or subscriber aware) NAT is combined with Enhanced Subscriber Management on the 7750 SR BNG to overcome the issues of colliding address space between home networks and the inside routed network between the customer and Large Scale NAT.
Layer-2 Aware NAT permits every broadband subscriber to be allocated the exact same IPv4 address on their residential gateway WAN link and then proceeds to translate this into a public IP through the NAT application. In doing so, L2-Aware NAT avoids the issues of colliding address space raised in draft-shirasaki without any change to the customer gateway or CPE.
Layer-2-Aware NAT is supported on any of the ESM access technologies, including PPPoE, IPoE (DHCP), and L2TP LNS. For IPoE, both n:1 (VLAN per service) and 1:1 (VLAN per subscriber) models are supported. A subscriber device operating with L2-Aware NAT needs no modification or enhancement: existing address mechanisms (DHCP or PPP/IPCP) are identical to a public IP service, and the 7750 SR BNG simply translates all IPv4 traffic into a pool of IPv4 addresses, allowing many L2-Aware NAT subscribers to share the same IPv4 address.
More information on L2-Aware NAT can be found in draft-miles-behave-l2nat-00.
In 1:1 NAT, each source IP address is translated in 1:1 fashion to a corresponding outside IP address. However, the source ports are passed transparently without translation.
The mapping between the inside IP addresses and outside IP addresses in 1:1 NAT supports two modes:
The dynamic version of 1:1 NAT is protocol dependent. Only the TCP, UDP, and ICMP protocols are allowed to traverse such a NAT. All other protocols are discarded, with the exception of PPTP with ALG; in this case, only GRE traffic associated with PPTP is allowed through dynamic 1:1 NAT.
The static version of 1:1 NAT is protocol agnostic. This means that all IP-based protocols are allowed to traverse static 1:1 NAT.
The following is applicable to 1:1 NAT:
In static 1:1 NAT, inside IP addresses are statically mapped to the outside IP addresses. In this fashion, devices on the outside can predictably initiate traffic to the devices on the inside.
Static configuration is based on the CLI concepts used in deterministic NAT. For example:
Static mappings are configured according to the map statements. A map statement can be configured manually by the operator or automatically by the system. IP addresses from the automatically generated map statements are sequentially mapped onto available outside IP addresses in the pool:
The following mappings apply to the example from above:
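The sequential mapping behavior can be sketched as follows. The prefixes and outside pool addresses below are purely hypothetical illustrations (they are not taken from the CLI example this text refers to).

```python
# Sketch of automatic 1:1 map generation: each inside address in a prefix is
# mapped sequentially onto consecutive available outside pool addresses.

import ipaddress

def sequential_map(inside_prefix, outside_first):
    """Map every address in inside_prefix onto outside addresses starting
    at outside_first, in order."""
    inside = list(ipaddress.ip_network(inside_prefix))
    first = ipaddress.ip_address(outside_first)
    return {str(ip): str(first + i) for i, ip in enumerate(inside)}

# Hypothetical example: a /30 of inside hosts onto an outside pool at 192.0.2.0
mapping = sequential_map("10.10.10.0/30", "192.0.2.0")
```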
Although static 1:1 NAT is protocol agnostic, state maintenance for TCP and UDP traffic is still required in order to support ALGs. Because of this, the existing scaling limits related to the number of supported flows still apply.
Protocol agnostic behavior in 1:1 NAT is a property of a NAT pool:
The application agnostic command is a pool create-time parameter. This command automatically pre-sets the following pool parameters:
Once pre-set, these parameters cannot be changed while the pool is operating in protocol-agnostic mode.
The deterministic port-reservation 65536 command configures the pool to operate in static (or deterministic) mode.
Parameters in static 1:1 NAT can be changed according to the following rules:
For best traffic distribution over ISAs, the value of the classic-lsn-max-subscriber-limit parameter should be set to 1.
This means that traffic is load-balanced over ISAs based on inside IP addresses. In static 1:1 NAT this is possible because the subscriber-limit parameter at the pool level is preset to a fixed value of 1.
However, if 1:1 static NAT is used simultaneously with regular (many-to-one) deterministic NAT, where the subscriber-limit parameter can be set to a value greater than 1, then the classic-lsn-max-subscriber-limit parameter also has to be set to a value greater than 1. The consequence of this is that traffic is load-balanced based on consecutive blocks of IP addresses (subnets) rather than individual IP addresses. Further information on this topic is provided in the sections describing deterministic NAT behavior.
Traffic match criteria used in the selection of a specific NAT policy in static 1:1 NAT (the deterministic part of the configuration) must not overlap with the traffic match criteria used in the selection of a specific NAT policy in filters or in the destination-prefix statement (these are used for traffic diversion to NAT). Otherwise, traffic is dropped in the ISA.
A specific NAT policy in this context refers to a non-default NAT policy, that is, a NAT policy directly referenced in a filter, in a destination-prefix command, or in a deterministic prefix command.
The following example is used to clarify this point further:
Traffic is diverted to NAT using the specific nat-policy pol-2:
The deterministic (source) prefix 10.10.10.0/30 is configured to be mapped to the specific nat-policy pol-1, which points to a protocol-agnostic 1:1 NAT pool.
A packet received in the ISA has srcIP 10.10.10.1 and destIP 192.168.10.10.
If no NAT mapping for this traffic exists in the ISA, a NAT policy (and with it the NAT pool) needs to be determined in order to create the mapping. Traffic is diverted to NAT using nat-policy pol-2, while the deterministic mapping says that nat-policy pol-1 should be used (and thus a different pool from the one referenced in nat-policy pol-2). Due to this specific NAT policy conflict, the traffic is dropped in the ISA.
In order to successfully pass traffic between two subnets through NAT while simultaneously using static 1:1 NAT and regular LSN44, a default (non-specific) NAT policy can be used for regular LSN44.
For example:
In this case, the four hosts from the prefix 10.10.10.0/30 are mapped in 1:1 fashion to four IP addresses from the pool referenced in the specific nat-policy pol-1, while all other hosts from the 10.10.10.0/24 network are mapped to the NAPT pool referenced by the default nat-policy pol-2. In this fashion, a nat-policy conflict is avoided.
In summary, a specific nat-policy (in a filter, a destination-prefix command, or a deterministic prefix command) always takes precedence over the default nat-policy. However, traffic that matches classification criteria (in a filter, a destination-prefix command, or a deterministic prefix command) leading to multiple specific nat-policies is dropped.
Static 1:1 NAT mappings are explicitly configured, and therefore their lifetime is tied to the configuration.
The logging mechanism for static mappings is the same as in deterministic NAT: configuration changes are logged via syslog, enhanced with reverse querying on the system.
Static 1:1 NAT is supported only for LSN44 (no support for DS-Lite/NAT64 or L2-aware NAT).
In 1:1 NAT, certain ICMP messages contain an additional IP header embedded in the ICMP header. For example, when an ICMP message is sent to the source due to the inability to deliver a datagram to its destination, the ICMP-generating node includes the original IP header of the packet plus 64 bits of the original datagram. This information helps the source node match the ICMP message to the process associated with it.
When such messages are received in the downstream direction (on the outside), 1:1 NAT recognizes them and changes the IP address not only in the outside header but also in the IP header embedded within the ICMP message. In other words, a lookup in the downstream direction is performed in the ISA to determine whether the packet is an ICMP packet of a specific type. Depending on the outcome, the IP address in the embedded header is changed (reverted to the original source IP address).
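The embedded-header fix-up described above can be sketched with a simplified packet model. The dictionary-based packet structure and addresses here are assumptions for illustration; a real implementation operates on raw IP/ICMP headers.

```python
# Conceptual sketch of the downstream ICMP fix-up in 1:1 NAT: translate the
# outer destination (outside -> inside) and also revert the source address in
# the embedded original IP header, so the inside host can match the error to
# the datagram it originally sent.

def translate_icmp_error(pkt, outside_ip, inside_ip):
    assert pkt["dst"] == outside_ip
    pkt["dst"] = inside_ip                  # outer header: outside -> inside
    embedded = pkt.get("embedded")          # original datagram's IP header, if present
    if embedded and embedded["src"] == outside_ip:
        embedded["src"] = inside_ip         # revert to the original source IP
    return pkt

# Hypothetical ICMP "destination unreachable" arriving on the outside:
pkt = {"dst": "192.0.2.1",
       "embedded": {"src": "192.0.2.1", "dst": "198.51.100.7"}}
translate_icmp_error(pkt, "192.0.2.1", "10.0.0.5")
```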
Messages which carry the original IP header within the ICMP header are:
In deterministic NAT, the subscriber is deterministically mapped into an outside IP address and a port block. The algorithm that performs this deterministic mapping is reversible, which means that a NAT subscriber can be uniquely derived from the outside IP address and the outside port (and the routing instance). Thus, logging in deterministic NAT is not needed.
The deterministic [subscriber <-> outside-ip, deterministic-port-block] mapping can be automatically extended by a dynamic port block in case the deterministic port block becomes exhausted of ports. Extending the original deterministic port block of the NAT subscriber by a dynamic port block yields a satisfactory compromise between deterministic and non-deterministic NAT. There is no logging as long as the translations are in the domain of the deterministic NAT. Once a dynamic port block is allocated for port extension, logging is automatically activated.
NAT subscribers in deterministic NAT are not assigned an outside IP address and deterministic port block on a first-come-first-served basis. Instead, deterministic mappings are pre-created at configuration time, regardless of whether the NAT subscriber is active or not. In other words, overbooking of the outside address pool is not supported in deterministic NAT. Consequently, all configured deterministic subscribers (for example, inside IP addresses in LSN44 or IPv6 addresses/prefixes in DS-Lite) are guaranteed access to NAT resources.
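A reversible mapping of this kind can be sketched with simple modular arithmetic. All parameter values, names, and the pool layout below are illustrative assumptions, not the actual SR OS formula.

```python
# Sketch of a reversible deterministic mapping: each subscriber index maps by
# formula to an (outside IP, deterministic port block), and the subscriber can
# be recovered from any (outside IP, port) pair without logging.

SUBS_PER_IP = 128     # subscriber-limit: must be a power of 2
BLOCK_SIZE = 400      # deterministic port-reservation (ports per block)
DET_BASE = 4024       # first deterministic port (after well-known + static range)

def forward(sub_index, outside_ips):
    """Subscriber index -> (outside IP, first port, last port) of its block."""
    ip = outside_ips[sub_index // SUBS_PER_IP]
    start = DET_BASE + (sub_index % SUBS_PER_IP) * BLOCK_SIZE
    return ip, start, start + BLOCK_SIZE - 1

def reverse(ip, port, outside_ips):
    """(outside IP, outside port) -> subscriber index, by formula alone."""
    return outside_ips.index(ip) * SUBS_PER_IP + (port - DET_BASE) // BLOCK_SIZE

ips = ["192.0.2.1", "192.0.2.2"]
mapping = forward(200, ips)
subscriber = reverse(mapping[0], mapping[1], ips)
```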
The routers support deterministic LSN44 and deterministic DS-Lite. The basic deterministic NAT principle applies equally to both NAT flavors. The difference between the two stems from the difference in the interpretation of the subscriber: in LSN44 a subscriber is an IPv4 address, whereas in DS-Lite the subscriber is an IPv6 address or prefix (configuration dependent).
With the exception of classic-lsn-max-subscriber-limit and dslite-max-subscriber-limit commands in the inside routing context, the deterministic NAT configuration blocks are for the most part common to LSN44 and DS-Lite.
Deterministic DS-Lite, covered at the end of this section, focuses on the features specific to DS-Lite.
The outside pools in deterministic NAT can contain an arbitrary number of address ranges, where each address range can contain an arbitrary number of IP addresses (up to the ISA maximum).
The maximum number of NAT subscribers that can be mapped to a single outside IP address is configurable using the subscriber-limit command under the pool hierarchy. For deterministic NAT, this number is restricted to a power of 2 (2^n). The consequence of this is that the number of NAT subscribers must be organized, configuration-wise, in ranges whose boundaries are powers of 2.
For example, in LSN44, where the NAT subscriber is an IP address, the deterministic subscribers would be configured with prefixes (for example, 10.10.10.0/24 for 256 subscribers) rather than an IP address range that could contain an arbitrary number of addresses (for example, 10.10.10.10 to 10.10.10.50).
On the other hand, in DS-Lite the deterministic subscribers are for the most part already determined by the prefix length set with the subscriber-prefix-length command under the DS-Lite configuration node.
The number of subscribers per outside IP (the subscriber-limit command, 2^n) multiplied by the number of IP addresses over all address ranges in an outside pool determines the maximum number of subscribers that a deterministic pool can support.
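The capacity rule above is a single multiplication; the pool sizes below are illustrative assumptions.

```python
# Deterministic pool capacity: subscribers-per-IP (a power of 2) multiplied by
# the total number of outside IP addresses across all address ranges in the pool.

subscriber_limit = 128                # must be 2^n in deterministic NAT
address_range_sizes = [256, 64]       # IPs in each address-range of the pool (assumed)

max_subscribers = subscriber_limit * sum(address_range_sizes)
```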
In deterministic NAT, the outside pool can be shared amongst subscribers from multiple routing instances. Also, NAT subscribers from a single routing instance can be selectively mapped to different outside pools.
The number of deterministic mappings that a single outside IP address can sustain is determined through the configuration of the outside pool.
The port allocation per outside IP address is shown in Figure 54.
The well-known ports are predetermined and are in the range 0 to 1023.
The upper limit of the port range for static port forwards (wildcard range) is determined by the existing port-forwarding-range command.
The range of ports allocated for deterministic mappings (DetP) is determined by multiplying the number of subscribers per outside IP (the subscriber-limit command) by the number of ports per deterministic block (the deterministic>port-reservation command). The number of subscribers per outside IP in deterministic NAT must be a power of 2 (2^n).
The remaining ports, extending from the end of the deterministic port range to the end of the total port range (65,535), are used for dynamic port allocation. The size of each dynamic port block is determined with the existing port-reservation command.
The deterministic>port-reservation command enables the deterministic mode of operation for the pool.
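The port partitioning described above (well-known ports, static port-forwarding range, deterministic range, dynamic range) can be computed directly. The parameter values below are illustrative assumptions.

```python
# Sketch of the per-outside-IP port partitioning: well-known ports, the static
# port-forwarding range, the deterministic range (DetP = subscribers x ports
# per deterministic block), and the remaining dynamic range.

TOTAL_PORTS = 65536
WELL_KNOWN = 1024              # ports 0-1023
forwarding_range_end = 4023    # port-forwarding-range upper limit (assumed)
subscriber_limit = 128         # subscribers per outside IP, must be 2^n
det_port_reservation = 400     # ports per deterministic block (assumed)

det_start = forwarding_range_end + 1
det_size = subscriber_limit * det_port_reservation
dyn_start = det_start + det_size
dyn_ports = TOTAL_PORTS - dyn_start    # ports left for dynamic blocks
```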
Examples:
The following shows three examples of deterministic Large Scale NAT44 where the requirements are:
In the first case, the ideal case is examined, where an arbitrary number of subscribers per outside IP address is allocated according to the requirements outlined above. Then the limitation that the number of subscribers must be a power of 2 is factored in.
Well-Known Ports* | Static Port Range* | Number of Ports in Deterministic Block* | Number of Deterministic Blocks | Number of Ports in Dynamic Block* | Number of Dynamic Blocks | Number of Inside IP Addresses per Outside IP Address* | Block Limit per Inside IP Address* | Wasted Ports |
0-1023 | 1024-4023 | 300 | 153 | 100 | 153 | 153 | 5 | 312 |
0-1023 | 1024-4023 | 500 | 102 | 100 | 102 | 102 | 5 | 312 |
0-1023 | 1024-4023 | 700 | 76 | 100 | 76 | 76 | 5 | 712 |
The example in Table 30 shows how port ranges would be carved out in an ideal scenario.
* — Signifies the fixed parameters (requirements).
The other values are calculated according to the fixed requirements.
port-block-limit includes the deterministic port block plus all dynamic port-blocks.
Next, a more realistic example is considered, in which the number of subscribers is equal to 2^n. The ratios between deterministic and dynamic ports per port-block from the example above (3:1, 5:1 and 7:1) are preserved. In this case, the number of ports per port-block is dictated by the number of subscribers per outside IP address.
Well-Known Ports* | Static Port Range* | Number of Ports in Deterministic Block* | Number of Deterministic Blocks | Number of Ports in Dynamic Block* | Number of Dynamic Blocks | Number of Inside IP Addresses per Outside IP Address* | Block Limit per Inside IP Address* | Wasted Ports |
0-1023 | 1024-4023 | 180 | 256 | 60 | 256 | 256 | 5 | 72 |
0-1023 | 1024-4023 | 400 | 128 | 80 | 128 | 128 | 5 | 72 |
0-1023 | 1024-4023 | 840 | 64 | 120 | 64 | 64 | 5 | 72 |
* — Signifies the fixed parameters (requirements).
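The arithmetic behind these rows can be sketched as follows. This is an illustrative sketch only: the function name and arguments are hypothetical, and the 61,512-port budget assumes the 0-4023 well-known/static range described above.

```python
def block_sizes(total_ports, subscribers, det_ratio):
    """Split the per-subscriber port budget into deterministic and dynamic
    blocks while preserving a det_ratio:1 deterministic-to-dynamic split."""
    per_sub = total_ports // subscribers      # ports available to each subscriber
    dyn = per_sub // (det_ratio + 1)          # dynamic block size
    det = dyn * det_ratio                     # deterministic block size
    wasted = total_ports - subscribers * (det + dyn)
    return det, dyn, wasted

# 65,536 total ports minus the 0-4023 well-known/static range
budget = 65536 - 4024                         # 61,512 det+dyn ports

for subs, ratio in ((256, 3), (128, 5), (64, 7)):
    print(subs, block_sizes(budget, subs, ratio))
```

Running this reproduces the three table rows above, including the 72 wasted ports per outside IP address in each case.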
The final example is similar to Table 30, with the difference that the number of ports per deterministic block is kept fixed, as in the original example (300, 500 and 700).
Well-Known Ports | Static Port Range | Number of Ports in Deterministic Block | Number of Deterministic Blocks | Number of Ports in Dynamic Block | Number of Dynamic Blocks | Number of Inside IP Addresses per Outside IP Address | Block Limit per Inside IP Address | Wasted Ports |
0-1023 | 1024-4023 | 300 | 128 | 180 | 128 | 128 | 5 | 72 |
0-1023 | 1024-4023 | 500 | 64 | 461 | 64 | 64 | 5 | 8 |
0-1023 | 1024-4023 | 700 | 64 | 261 | 64 | 64 | 5 | 8 |
The three examples above give a perspective on the size of deterministic and dynamic port blocks in relation to the number of subscribers (2^n) per outside IP address. Operators should run a similar dimensioning exercise before configuring deterministic NAT.
The CLI for the highlighted case in Table 30 is displayed:
Where:
128 subs * 300 ports = 38,400 deterministic port range
128 subs * 180 ports = 23,040 dynamic port range
Det+dyn available ports = 65,536 - 4,024 = 61,512 ports
Det+dyn usable ports = 128*300 + 128*180 = 61,440 ports
72 ports per outside IP are wasted.
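The arithmetic above can be reproduced as a sanity check. The variable names below are illustrative, not CLI keywords:

```python
subscriber_limit = 128          # subscribers per outside IP (2^7)
det_block = 300                 # deterministic ports per subscriber
dyn_block = 180                 # ports per dynamic block
port_block_limit = 5            # 1 deterministic + 4 dynamic blocks

available = 65536 - 4024                        # det+dyn ports: 61,512
det_range = subscriber_limit * det_block        # 38,400 deterministic ports
dyn_range = subscriber_limit * dyn_block        # 23,040 dynamic ports
wasted = available - (det_range + dyn_range)    # 72 wasted ports

# maximum ports a single subscriber can hold under this configuration
ports_per_sub = det_block + (port_block_limit - 1) * dyn_block   # 1,020 ports
```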
This configuration allows 128 subscribers (inside IP addresses in LSN44) for each outside address (a compression ratio of 128:1), with each subscriber being assigned up to 1020 ports (300 deterministic ports and 720 dynamic ports over 4 dynamic port blocks).
The outside IP addresses in the pool and their corresponding port ranges are organized as shown in Figure 55.
Assuming that the above graph depicts an outside deterministic pool, the number of subscribers that can be accommodated by this deterministic pool is represented by purple squares (number of IP addresses in an outside pool * subscriber-limit). The number of subscribers across all configured prefixes on the inside that are mapped to the same deterministic pool must be less than the outside pool can accommodate. In other words, an outside address pool in deterministic NAT cannot be oversubscribed.
The following is a CLI representation of a deterministic pool definition including the outside IP ranges:
The common building block on the inside in the deterministic LSN44 configuration is an IPv4 prefix. The NAT subscribers (inside IPv4 addresses) from the configured prefix are deterministically mapped to the outside IP addresses and corresponding deterministic port-blocks. Any inside prefix in any routing instance can be mapped to any pool in any routing instance (including the one in which the inside prefix is defined).
The mapping between the inside prefix and the deterministic pool is achieved through a NAT policy that can be referenced per each individual inside IPv4 prefix. IPv4 addresses from the prefixes on the inside will be distributed over the IP addresses defined in the outside pool referenced by the NAT policy.
The mapping itself is represented by the map command under the prefix hierarchy:
The purpose of the map statement is to split the number of subscribers within the configured prefix over available sequences of outside IP addresses. The key parameter that governs mappings between the inside IPv4 addresses and outside IPv4 addresses in deterministic LSN44 is defined by the outside>pool>subscriber-limit command. This parameter must be a power of 2 and it limits the maximum number of NAT subscribers that can be mapped to the same outside IP address.
The following rules govern the configuration of the map statement:
If the number of subscribers (IP addresses in LSN44) in the map statement is larger than the subscriber-limit per outside IP, the subscribers must be split over a block of consecutive outside IP addresses, where the outside-ip-address in the map statement represents only the first outside IP address in that block.
The number of subscribers (range of inside IP addresses in LSN44) in the map statement does not have to be a power of 2. Rather, it must be a multiple of a power of two, m * 2^n, where m is the number of consecutive outside IP addresses to which the subscribers are mapped and 2^n is the subscriber-limit per outside IP.
An example of the map statement is given below:
In this case, the configured 10.0.0.0/24 prefix is represented by the range of IP addresses in the map statement (10.0.0.0-10.0.0.255). Since the range of 256 IP addresses in the map statement cannot be mapped into a single outside IP address (subscriber-limit=128), this range must be further implicitly split within the system and mapped into multiple outside IP addresses. The implicit split will create two IP address ranges, each with 128 IP addresses (10.0.0.0/25 and 10.0.0.128/25) so that addresses from each IP range are mapped to one outside IP address. The hosts from the range 10.0.0.0-10.0.0.127 will be mapped to the first IP address in the pool (128.251.0.1) as explicitly stated in the map statement (to statement). The hosts from the second range, 10.0.0.128-10.0.0.255 will be implicitly mapped to the next consecutive IP address (128.251.0.2).
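Under the stated assumptions (subscriber-limit of 128, 300-port deterministic blocks starting immediately after the 0-4023 static range), the implicit split can be sketched as follows. `det_map` is a hypothetical helper, and the exact ordering of deterministic port blocks within an outside IP address is an assumption made for illustration:

```python
import ipaddress

def det_map(inside_ip, range_start, outside_base, sub_limit, det_start, block_size):
    """Map an inside IPv4 host to its outside IP address and deterministic
    port block, following the implicit 2^n split described above."""
    offset = int(ipaddress.IPv4Address(inside_ip)) - int(ipaddress.IPv4Address(range_start))
    # every sub_limit consecutive inside hosts share one outside IP
    outside = ipaddress.IPv4Address(int(ipaddress.IPv4Address(outside_base)) + offset // sub_limit)
    block = offset % sub_limit                       # subscriber's slot within that outside IP
    first = det_start + block * block_size           # first port of the deterministic block
    return str(outside), (first, first + block_size - 1)
```

For example, host 10.0.0.127 maps to 128.251.0.1 while 10.0.0.128 maps to 128.251.0.2, matching the implicit split described above.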
Alternatively, the map statement can be configured as:
In this case the IP address range in the map statement is split into two non-consecutive outside IP addresses. This gives the operator more freedom in configuring the mappings.
However, the following configuration is not supported:
Considering that the subscriber-limit is 128 (2^n, where n=7), the lower n bits of the start address in the second map statement (map start 10.0.0.64 end 10.0.0.127 to 128.251.0.3) are not 0. This violates rule #1 governing the provisioning of the map statement.
Assuming that we use the same pool with 128 subscribers per outside IP address, the following scenario is also not supported (configured prefix in this example is different than in previous example):
Although the lower n bits in both map statements are 0, both statements reference the same outside IP address (128.251.0.1). This violates rule #2 governing the provisioning of the map statement. Each of the prefixes in this case has to be mapped to a different outside IP address, which leads to underutilization of outside IP addresses (half of the deterministic port-blocks in each of the two outside IP addresses will not be utilized).
In conclusion, considering that the number of subscribers per outside IP (subscriber-limit) must be 2^n, the inside IP addresses from the configured prefix are split on the 2^n boundary so that every deterministic port-block of an outside IP is utilized. If the originally configured prefix contains fewer subscribers (IP addresses in LSN44) than an outside IP address can accommodate (2^n), all subscribers from that prefix are mapped to a single outside IP. Since the outside IP cannot be shared with NAT subscribers from other prefixes, some of the deterministic port-blocks for this particular outside IP address are not utilized.
Each configured prefix can evaluate into multiple map commands. The number of map commands will depend on the length of the configured prefix, the subscriber-limit command and fragmentation of outside address-range within the pool with which the prefix is associated.
Support for multiple MS-ISAs in the nat-group calls for traffic hashing on the inside in the ingress direction. This ensures fair load balancing of the traffic amongst multiple MS-ISAs. While hashing in non-deterministic LSN44 can be performed per source IP address, hashing in deterministic LSN44 is based on subnets instead of individual IP addresses. The length of the hashing subnet is common for all configured prefixes within an inside routing instance. If prefixes from an inside routing instance reference multiple pools, the common hashing prefix length is chosen according to the pool with the highest number of subscribers per outside IP address. This ensures that subscribers mapped to the same outside IP address are always hashed to the same MS-ISA.
In general, load distribution based on hashing is dependent on the sample. A larger and more diverse sample ensures better load balancing. Therefore, the efficiency of load distribution between the MS-ISAs depends on the number and diversity of subnets that the hashing algorithm takes into consideration within the inside routing context.
A simple rule for good load balancing is to configure a large number of subscribers relative to the largest subscriber-limit parameter in any pool that is referenced from the inside routing instance.
The configuration example shown in Figure 56 depicts a case in which prefixes from multiple routing instances are mapped to the same outside pool and, at the same time, prefixes from a single inside routing instance are mapped to different pools (the latter is not supported with non-deterministic NAT).
Note: In this example, the inside prefix 10.10.10.0/24 is present in both VPRN 1 and VPRN 2. In both VPRNs, this prefix is mapped to the same pool, pool-1, with a subscriber-limit of 64. Four outside IP addresses per prefix per VPRN (eight in total) are allocated to accommodate the mappings for all hosts in prefix 10.10.10.0/24. However, the hashing prefix length in VPRN 1 is based on the subscriber-limit 64 (VPRN 1 references only pool-1), while the hashing prefix length in VPRN 2 is based on the subscriber-limit 256 in pool-2 (VPRN 2 references both pool-1 and pool-2, so the larger subscriber-limit must be selected). The consequence is that traffic from subnet 10.10.10.0/24 in VPRN 1 can be load balanced over 4 MS-ISAs (hashing prefix length is 26), while traffic from the subnet 10.10.10.0/24 in VPRN 2 is always sent to the same MS-ISA (hashing prefix length is 24).
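The hashing prefix lengths quoted in this example follow from the largest subscriber-limit of any pool referenced within the routing instance. A minimal sketch (the function name is hypothetical):

```python
import math

def hashing_prefix_length(referenced_subscriber_limits):
    """Inside hashing is performed on subnets large enough to cover all
    subscribers that share one outside IP address: 32 minus log2 of the
    largest subscriber-limit referenced from the routing instance."""
    return 32 - int(math.log2(max(referenced_subscriber_limits)))

# VPRN 1 references only pool-1 (subscriber-limit 64)  -> /26 hashing subnets
# VPRN 2 references pool-1 (64) and pool-2 (256)       -> /24 hashing subnets
vprn1 = hashing_prefix_length([64])
vprn2 = hashing_prefix_length([64, 256])
```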
Distribution of outside IP addresses across the MS-ISAs is dependent on the ingress hashing algorithm. Since traffic from the same subscriber is always pre-hashed to the same MS-ISA, the corresponding outside IP address also must reside on the same ISA. CPM runs the hashing algorithm in advance to determine on which MS-ISA the traffic from particular inside subnet will land and then the corresponding outside IP address (according to deterministic NAT mapping algorithm) will be configured in that particular MS-ISA.
Sharing of the deterministic pools between LSN44 and DS-Lite is supported.
Simultaneous support for deterministic and non-deterministic NAT inside of the same routing instance is supported. However, an outside pool can only be deterministic (although expandable by dynamic port blocks) or non-deterministic at any given time.
Ingress hashing for all NATed traffic within the VRF will in this case be performed based on the subnets driven by the classic-lsn-max-subscriber-limit parameter.
Deterministic NAT does not change the way traffic is selected for the NAT function; it only defines a predictable way for translating subscribers into outside IP addresses and port-blocks.
Traffic is still diverted to NAT using the existing methods:
The inverse mapping can be performed with a MIB locally on the node or externally via a script sourced in the router. In both cases, the input parameters are <outside routing instance, outside IP, outside port>. The output from the mapping is the subscriber and the inside routing context in which the subscriber resides.
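Assuming the same example configuration as above (inside prefix 10.0.0.0/24 mapped to outside 128.251.0.1, subscriber-limit 128, 300-port deterministic blocks starting at port 4024), the known formula for the inverse mapping can be sketched as follows. `det_reverse` is a hypothetical helper, and the block ordering within an outside IP address is an assumption made for illustration:

```python
import ipaddress

def det_reverse(outside_ip, outside_port, outside_base, range_start,
                sub_limit, det_start, block_size):
    """Recover the inside IPv4 host from <outside IP, outside port>."""
    ip_off = int(ipaddress.IPv4Address(outside_ip)) - int(ipaddress.IPv4Address(outside_base))
    # subscriber index = outside IP offset * subscribers-per-IP + port-block index
    sub_idx = ip_off * sub_limit + (outside_port - det_start) // block_size
    return str(ipaddress.IPv4Address(int(ipaddress.IPv4Address(range_start)) + sub_idx))
```

For example, outside address 128.251.0.2 with port 4500 resolves back to inside host 10.0.0.129 under these assumptions.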
Reverse mapping information can be obtained using the following command:
Example:
Output:
Inside router 10 ip 20.0.5.171 -- outside router Base ip 85.0.0.2 port 2333 at Mon Jan 7 10:02:02 PST 2013
Instead of querying the system directly, a Python script can be generated on the router and exported to an external node. This Python script contains the mapping logic for the deterministic NAT configured in the router. The script can then be queried off-line to obtain mappings in either direction. The external node must have Python installed, along with the following modules: getopt, math, os, socket and sys.
The purpose of this off-line approach is to provide fast queries without accessing the router. Exporting the Python script for reverse querying is a manual operation that needs to be repeated every time there is a configuration change in deterministic NAT.
The script is exported outside of the box to a remote location (assuming that writing permissions on the external node are correctly set). The remote location is specified with the following command:
The status of the script is shown using the following command:
Once the script location is specified, the script can be exported to that location with the following command:
This needs to be repeated manually every time the configuration affecting deterministic NAT changes.
The script itself can be run to obtain mapping in forward or backward direction:
The following displays an example in which source addresses are mapped in the following manner:
The forward query for this example will be performed as:
user@external-server:/home/ftp/pub/det-nat-script$ ./det-nat.py -f -s 10 -a 20.0.5.10
Output:
The reverse query for this example will be performed as:
Output:
Every configuration change concerning the deterministic pool will be logged and the script (if configured for export) will be automatically updated (although not exported). This is needed to keep current track of deterministic mappings. In addition, every time a deterministic port-block is extended by a dynamic block, the dynamic block will be logged just as it is today in non-deterministic NAT. The same logic is followed when the dynamic block is de-allocated.
All static port forwards (including PCP) are also logged.
PCP allocates static port forwards from the wildcard-port range.
A subscriber in non-deterministic DS-Lite is defined as a v6 prefix, with the prefix length being configured under the DS-Lite NAT node:
All incoming IPv6 traffic with source IPv6 addresses falling under a unique v6 prefix configured with the subscriber-prefix-length command is considered a single subscriber. As a result, all source IPv4 addresses carried within that IPv6 prefix are mapped to the same outside IPv4 address.
The concept of deterministic DS-Lite is very similar to deterministic LSN44. The DS-lite subscribers (IPv6 addresses/prefixes) are deterministically mapped to outside IPv4 addresses and corresponding deterministic port-blocks.
Although the subscriber in DS-Lite is considered to be either a B4 element (IPv6 address) or the aggregation of B4 elements (IPv6 prefix determined by the subscriber-prefix-length command), only the IPv4 source addresses and ports carried inside of the IPv6 tunnel are actually translated.
The prefix statement for deterministic DS-lite remains under the same deterministic CLI node as for the deterministic LSN44. However, the prefix statement parameters for deterministic DS-Lite differ from the one for deterministic LSN44 in the following fashion:
Example:
In this case, 16 v6 prefixes (from ABCD:FF::/60 to ABCD:FF:00:F0::/60) are considered DS-Lite subscribers. The source IPv4 addresses/ports inside the IPv6 tunnels are mapped into respective deterministic port blocks within an outside IPv4 address according to the map statement.
The map statement contains minor modifications as well. It maps DS-Lite subscribers (IPv6 address or prefix) to corresponding outside IPv4 addresses. Continuing on the previous example:
map start ABCD:FF::/60 end ABCD:FF:00:F0::/60 to 128.251.1.1
The prefix length (/60) in this case MUST be the same as configured subscriber-prefix-length. If we assume that the subscriber-limit in the corresponding pool is set to 8 and outside IP address range is 128.251.1.1 - 128.251.1.10, then the actual mapping is the following:
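Continuing the example (sixteen /60 subscriber prefixes starting at ABCD:FF::/60, subscriber-limit 8, outside range starting at 128.251.1.1), the subscriber-to-outside-IP mapping can be sketched as follows; `dslite_map` is a hypothetical helper:

```python
import ipaddress

def dslite_map(b4_prefix, map_start, outside_base, sub_limit, prefix_len=60):
    """Map a DS-Lite subscriber (an IPv6 /60 prefix) to its outside IPv4
    address: every sub_limit consecutive /60 prefixes share one outside IP."""
    step = 1 << (128 - prefix_len)   # distance between consecutive /60 subscribers
    idx = (int(ipaddress.IPv6Address(b4_prefix)) -
           int(ipaddress.IPv6Address(map_start))) // step
    return str(ipaddress.IPv4Address(int(ipaddress.IPv4Address(outside_base)) + idx // sub_limit))
```

Under these assumptions, the first eight /60 prefixes (ABCD:FF::/60 through ABCD:FF:0:70::/60) map to 128.251.1.1, and the next eight map to 128.251.1.2.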
The ingress hashing and load distribution between the ISAs in Deterministic DS-Lite is governed by the highest number of configured subscribers per outside IP address in any pool referenced within the given inside routing context.
This limit is configured under:
While ingress hashing in non-deterministic DS-Lite is governed by the subscriber-prefix-length command, in deterministic DS-Lite the ingress hashing is governed by the combination of the dslite-max-subscriber-limit and subscriber-prefix-length commands. This ensures that all DS-Lite subscribers that are mapped to a single outside IP address are always sent to the same MS-ISA (on which that outside IPv4 address resides). In essence, as soon as deterministic DS-Lite is enabled, ingress hashing is performed on aggregated sets of 2^n contiguous subscribers, where n = log2(dslite-max-subscriber-limit) is the number of bits used to represent the largest number of subscribers within an inside routing context that is mapped to the same outside IP address in any pool referenced from this inside routing context (referenced through the NAT policy).
Once deterministic DS-Lite is enabled (a prefix command under the deterministic CLI node is configured), the ingress hashing influenced by the dslite-max-subscriber-limit is in effect for both flavors of DS-Lite (deterministic and non-deterministic) within the inside routing context, assuming that both flavors are configured simultaneously.
With introduction of deterministic DS-lite, the configuration of the subscriber-prefix-length must adhere to the following rule:
This can be clarified by the two following examples:
This means that 64 DS-Lite subscribers will be mapped to the same outside IP address. Consequently the prefix length of those subscribers must be reduced by 6 bits for hashing purposes (so that chunks of 64 subscribers are always hashed to the same ISA).
According to our rule, the prefix of those subscribers (subscriber-prefix-length) can be only in the range of [38..64], and no longer in the range [32..64, 128].
This means that each DS-lite subscriber is mapped to its own outside IPv4 address. Consequently, there is no need to aggregate subscribers for hashing purposes, since each DS-lite subscriber is mapped to an entire outside IPv4 address (with all ports). Since the subscriber prefix length is not contracted in this case, the prefix length can be configured in the range [32..64, 128].
In other words, the shortest prefix length that can be configured for a deterministic DS-lite subscriber is 32+n, where n = log2(dslite-max-subscriber-limit). The subscriber prefix length can extend up to 64 bits. Beyond 64 bits, only one value is allowed for the subscriber prefix length: 128. In that case, n must be 0, which means that the mapping between B4 elements (or IPv6 addresses) and outside IPv4 addresses is a 1:1 ratio (no sharing of outside IPv4 addresses).
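The rule above can be sketched as follows (a hypothetical helper; the two branches mirror the two examples above):

```python
import math

def allowed_subscriber_prefix_lengths(dslite_max_subscriber_limit):
    """Allowed subscriber-prefix-length values for a given
    dslite-max-subscriber-limit: the shortest usable prefix is 32+n,
    and /128 is only allowed when no aggregation is needed (n == 0)."""
    n = int(math.log2(dslite_max_subscriber_limit))
    if n == 0:
        # 1:1 mapping to outside IPv4 addresses; /128 is also allowed
        return list(range(32, 65)) + [128]
    # aggregation of 2^n subscribers needed for hashing; /128 not allowed
    return list(range(32 + n, 65))
```

For example, with dslite-max-subscriber-limit of 64 (n=6), the allowed range is [38..64], and 128 is no longer permitted.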
The dependency between the subscriber definition in DS-Lite (based on the subscriber-prefix-length) and the subscriber hashing mechanism on ingress (based on the dslite-max-subscriber-limit value), will influence the order in which deterministic DS-lite is configured.
Configure deterministic DS-Lite in the following order.
Modifying the dslite-max-subscriber-limit requires that all nat-policies be removed from the inside routing context.
To migrate a non-deterministic DS-Lite configuration to a deterministic DS-Lite configuration, the non-deterministic DS-Lite configuration must be first removed from the system. The following steps should be followed:
NAT Pool
NAT Policy
NAT Group
Deterministic Mappings (prefix and map statements)
Similarly, the map statements can be added or removed only if the prefix node is in a shutdown state.
The outside-ip-address in the map statements must be unique amongst all map statements referencing the same pool. In other words, two map statements cannot reference the same <outside-ip-address> in a pool.
Configuration Parameters
Miscellaneous
Destination NAT (DNAT) in SR OS is supported for LSN44 and L2-Aware NAT. DNAT can be used for traffic steering where the destination IP address of the packet is rewritten. In this fashion traffic can be redirected to an appliance or set of servers that are in control of the operator, without the need for a separate transport service (for example, PBR plus LSP). Applications utilizing traffic steering via DNAT normally require some form of inline traffic processing, such as inline content filtering (parental control, antivirus/spam, firewalling), video caching, etc.
Once the destination IP address of the packet is translated, traffic is naturally routed based on the destination IP address lookup. DNAT will translate the destination IP address in the packet while leaving the original destination port untranslated.
Similar to source based NAT (Source Network Address and Port Translation (SNAPT)), the SR OS will maintain state of DNAT translations so that the source IP address in the return (downstream) packet is translated back to the original address.
Traffic selection for DNAT processing in MS-ISA is performed via a NAT classifier.
In certain cases SNAPT is required along with DNAT. In other cases only DNAT is required without SNAPT. The following table shows the supported combinations of SNAPT and DNAT in SR OS.
 | SNAPT | DNAT-Only | SNAPT + DNAT |
LSN44 | X | X | X |
L2-Aware | X | | X |
The SNAPT/DNAT address translations are shown in Figure 57.
NAT forwarding in SR OS is implemented in two stages:
As part of the NAT state maintenance, the SR OS maintains the following fields for each DNATed flow:
<inside host/port, outside IP/port, foreign IP address/port, destination IP address/port, protocol (TCP, UDP, ICMP)> Note that the inside host in LSN is the inside IP address, while in L2-Aware NAT it is the <inside IP address + subscriber-index>. The subscriber index is carried in the session-id of the L2TP.
The foreign IP address represents the destination IP address in the original packet, while the destination IP address represents the DNAT address (translated destination IP address).
Traffic intended for DNAT processing is selected via a nat classifier. The nat classifier has configurable protocol and destination ports. The inclusion of the classifier in the NAT policy is the trigger for performing DNAT. The configuration of the nat classifier determines whether:
The classifier cannot drop traffic (there is no drop action). However, a non-reachable destination IP address in DNAT causes traffic to be black-holed.
DNAT is enabled in the config>service>nat>nat-policy context.
The DNAT function is triggered by the presence of the nat classifier (nat-classifier command) referenced in the NAT policy. The dnat-only option is configured in cases where SNAPT is not required. This command is necessary to determine the outside routing context and the nat-group when SNAPT is not configured. The pool (relevant to SNAPT) and dnat-only configuration options within the NAT policy are mutually exclusive.
DNAT traffic selection is performed via a nat-classifier. The nat-classifier is defined under the config>service>nat hierarchy and is referenced within the nat-policy.
default-dnat-ip-address is used in all match criteria that contain a DNAT action without a specific destination IP address. However, the default-dnat-ip-address is ignored in cases where an IP address is explicitly configured as part of the action within the match criteria.
default-action is applied to all packets that do not satisfy any match criteria.
forward (forwarding action) has no effect on the packets and will transparently forward packets through the nat-classifier.
By default, packets that do not match any matching criteria are transparently passed through the classifier.
To forward upstream and downstream traffic for the same NAT binding to the same MS-ISA, the original source IP address space must be known in advance and consequently hashed on the inside ingress towards the MS-ISAs and micro-netted on the outside. This is performed with the following CLI:
The classic-lsn-max-subscriber-limit parameter was introduced by deterministic NAT and is reused here. This parameter affects the distribution of traffic across multiple MS-ISAs in the upstream direction. A hashing mechanism based on source IPv4 addresses/prefixes is used to distribute incoming traffic on the inside (private side) across the MS-ISAs. Hashing based on the entire IPv4 address produces the most granular traffic distribution, while hashing based on the IPv4 prefix (determined by prefix length) produces less granular hashing. For further details about this command, consult the CLI command description. The source IP prefix is defined in the nat-prefix-list and then applied under the dnat-only node in the inside routing context. This instructs the SR OS to create micro-nets in the outside routing context. The number of routes installed in this fashion is limited by the following configuration:
The configurable range is 1-128K with a default value of 32K. The DNAT provisioning concept is shown in Figure 58.
The following restrictions apply to multiple NAT policies per inside routing context:
The selection of the NAT pool and the outside routing context is performed through the NAT policy. Multiple NAT policies can be used within an inside routing context. This feature effectively allows selective mapping of the incoming traffic within an inside routing context to different NAT pools (with different mapping properties, such as port-block size, subscriber-limit per pool, address-range, port-forwarding-range, deterministic vs non-deterministic behavior, port-block watermarks, etc.) and to different outside routing contexts. NAT policies can be configured:
The concept of the NAT pool selection mechanism based on the destination of the traffic via routing is shown in Figure 59.
Diversion of the traffic to NAT based on the source of the traffic is shown in Figure 60.
Only filter-based diversion solution is supported for this case. The filter-based solution can be extended to a 5 tuple matching criteria.
The following considerations must be taken into account when deploying multiple NAT policies per inside routing context:
The routing approach relies on upstream traffic being directed (or diverted) to the NAT function based on the destination-prefix command in the configure>service>vprn/router>nat>inside CLI context. In other words, the upstream traffic will be NATed only if it matches a preconfigured destination IP prefix. The destination-prefix command creates a static route in the routing table of the inside routing context. This static route will divert all traffic with the destination IP address that matches the created entry, towards the MS-ISA. The NAT function itself will be performed once the traffic is in the proper context in the MS-ISA.
The CLI for multiple NAT policies per inside routing context with routing based diversion to NAT is the following:
or, for example:
Different destination prefixes can reference a single NAT policy (policy-1 in this case).
If the destination-prefix command does not directly reference a NAT policy, the default NAT policy is used. The default NAT policy is configured directly in the vprn/router>nat>inside context.
Once a destination-prefix command referencing the NAT policy is configured, an entry is created in the routing table that directs the traffic to the MS-ISA.
A filter-based approach will divert traffic to NAT based on the ip matching criteria shown in the CLI below.
The CLI for the filter-based diversion in conjunction with multiple NAT policies is shown below:
The association with the NAT policy is made once the filter is applied to the SAP.
DS-Lite and NAT64 diversion to NAT with multiple nat-policies is supported only through IPv6 filters:
Where the nat-type parameter can be either dslite or nat64.
The DS-Lite AFTR address and NAT64 destination prefix configuration under the corresponding (DS-Lite or NAT64) router/vprn>nat>inside context is mandatory. This is required even when only filters are used for traffic diversion to NAT.
For example, every AFTR address and NAT64 prefix that is configured as a match criteria in the filter, must also be duplicated in the router/vprn>nat>inside context. However, the opposite is not required.
IPv6 traffic with the destination address outside of the AFTR/NAT64 address/prefix will follow normal IPv6 routing path within the 7750 SR.
The default nat-policy is always mandatory and must be configured under the router/vprn>nat>inside context. This default NAT policy can reference any configured pool in the desired ISA group. The pool referenced in the default NAT policy can be then overridden by the NAT policy associated with the destination-prefix in LSN44 or by the NAT policy referenced in the ipv4/ipv6-filter used for NAT diversion in LSN44/DS-Lite/NAT64.
The NAT CLI nodes will fail to activate (be brought out of the no shutdown state), unless a valid NAT policy is referenced in the router/vprn>nat>inside context.
Each subscriber using multiple policies is counted as 1 subscriber for the inside resources scaling limits (such as the number of subscribers per MS-ISA), and counted as 1 subscriber per (subscriber + policy combination) for the outside limits (subscriber-limit subscribers per IP; port-reservation port/block reservations per subscriber).
Any given Static Port Forward (SPF) can be created only in one pool. This pool, which is referenced through the NAT policy, has to be specified at the SPF creation time, either explicitly through the configuration request or implicitly via defaults.
Explicit request will be submitted either via SAM or via CLI:
In the absence of the NAT policy referenced in the SPF creation request, the default nat-policy command under the vprn/router>nat>inside context will be used.
The consequence of this is that the operator must know the NAT policy in which the SPF is to be created. The SPF cannot be created via PCP outside of the pool referenced by the default NAT policy, since PCP does not provide the means to communicate a NAT policy name in the SPF creation request.
The static port forward creation and their use by the subscriber types must follow these rules:
When the last relevant policy for a certain subscriber type is removed from the virtual router, the associated port forwards are automatically deleted.
Figure 61 and Figure 62 describe certain scenarios that are more theoretical and are less likely to occur in reality. However, they are described here for the purpose of completeness.
Figure 61 represents the case where traffic from the WEB server 1.1.1.1 is initiated toward the destination network 11.0.0.0/8. Such traffic is translated in Pool B and forwarded to the 11.0.0.0/8 network even though the static port forward was created in Pool A. In this case, the NAT policy rule (dest 11.0.0.0/8 pool B) determines the pool selection in the upstream direction (even though the SPF for the WEB server already exists in Pool A).
The next example in Figure 62 shows a case where the Flow 1 is initiated from the outside. Since the partial mapping matching this flow already exist (created by SPF) and there is no more specific match (FQF) present, the downstream traffic will be mapped according to the SPF (through Pool A to the Web server). At the same time, a more specific entry (FQF) will be created (initiated by the very same outside traffic). This FQF will now determine the forwarding path for all traffic originating from the inside that is matching this flow. This means that the Flow 2 (reverse of the Flow 1) will not be mapped to an IP address from the pool B (as the policy dictates) but instead to the Pool A which has a more specific match.
A more specific match would be in this case fully qualified flows (FQF) that contains information about the foreign host: <host, inside IP/port, outside IP/port, foreign IP address/port, protocol>.
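The precedence between a fully qualified flow and a partial SPF mapping described above can be sketched as a two-step lookup; the table layout and names are hypothetical simplifications:

```python
# Illustrative sketch of the lookup order described above: a fully qualified
# flow (FQF, which includes the foreign host) is a more specific match than
# a partial SPF mapping, so it wins when both exist.

def select_mapping(fqf_table, spf_table, flow):
    # flow: (inside_ip, inside_port, foreign_ip, foreign_port, protocol)
    inside_ip, inside_port, foreign_ip, foreign_port, proto = flow
    # 1. Most specific first: fully qualified flow including the foreign host.
    fqf = fqf_table.get(flow)
    if fqf is not None:
        return fqf
    # 2. Fallback: partial mapping created by the static port forward.
    return spf_table.get((inside_ip, inside_port, proto))

spf = {("1.1.1.1", 80, "tcp"): "pool-A"}
fqf = {}
flow = ("1.1.1.1", 80, "9.9.9.9", 4000, "tcp")
print(select_mapping(fqf, spf, flow))  # no FQF yet -> SPF mapping, pool-A
fqf[flow] = "pool-A"                   # FQF created by outside-initiated traffic
print(select_mapping(fqf, spf, flow))  # FQF now takes precedence for this flow
```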
When multiple NAT policies per inside routing context are deployed, a new policy-id parameter is added to certain syslog messages. The format of the policy-id is:
where XX is an arbitrary unique number per inside routing context assigned by the router. This number represents the corresponding NAT policy. Since the maximum number of NAT policies in an inside routing context is 8, the policy-id value is also a numerical value in the range 1 — 8.
Introduction of the policy-id in logs is necessary due to the bulk operations associated with multiple NAT policies per inside routing context. A bulk operation, for example, represents the removal of the nat-policy from the configuration, shutting down the NAT pool, or removing an IP address range from the pool. Removing a NAT accounting policy in case of RADIUS NAT logging will not trigger a summarization log since an acct-off message is sent. Such operations have a tendency to be heavy on NAT logging since they affect a large number of NAT subscribers at once. Summarization logs are introduced to prevent excessive logging during bulk operations. For example, the NAT policy deletion can be logged with a single (summarized) entry containing the policy-id of the NAT policy that was removed and the inside srvc-id. Since all logs contain the policy-id, a single summarization free log can be compared to all map logs containing the same policy-id to determine for which subscribers the NAT mappings have ceased. Map and Free logs are generated when port-blocks for the subscribers are allocated and de-allocated.
Summarization log is always generated on the CPM, regardless of whether the RADIUS logging is enabled or not. A summarization log simply cannot be generated via RADIUS logging since the RADIUS accounting message streams (start/interim-updates/stop) are always generated per subscriber. In other words, for RADIUS logging, the summarization log would need to be sent to each subscriber, which defeats the purpose of the summarization logs.
A summarization log on the CPM is generated:
With multiple NAT policies per inside routing context, the inside srvc-id and the policy-id are included in the summarization log (no outside IPs, outside srvc-id, port-block or source IP).
A log search based on the policy-id and inside srvc-id should reveal all subscribers whose mappings were affected by the NAT policy removal.
A log search based on the outside IP address and outside srvc-id should reveal all subscribers for which the NAT mappings have ceased.
A log search based on the outside IP addresses in the range and the outside srvc id should reveal all subscribers for which the NAT mappings have ceased.
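The log searches described above can be sketched as a simple correlation between one summarization free log and the per-subscriber map logs; the record format here is a hypothetical dictionary, whereas a real deployment would parse the actual log lines:

```python
# Sketch: correlating a single summarization free log with per-subscriber
# map logs. The summarization free log carries only the policy-id and the
# inside srvc-id; matching map logs identify the affected subscribers.

def subscribers_affected(map_logs, summary_free):
    return sorted({log["subscriber"] for log in map_logs
                   if log["policy_id"] == summary_free["policy_id"]
                   and log["inside_svc"] == summary_free["inside_svc"]})

maps = [
    {"subscriber": "10.0.0.1", "policy_id": 2, "inside_svc": 10},
    {"subscriber": "10.0.0.2", "policy_id": 2, "inside_svc": 10},
    {"subscriber": "10.0.0.3", "policy_id": 5, "inside_svc": 10},
]
print(subscribers_affected(maps, {"policy_id": 2, "inside_svc": 10}))
# -> ['10.0.0.1', '10.0.0.2']
```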
Summarization logs in RADIUS logging
The summarization log for a bulk operation while RADIUS logging is in effect is generated only on the CPM (syslog). This means that for bulk operations with RADIUS logging, the operator has to rely on RADIUS logging as well as on CPM logging.
An open log sequence in RADIUS, for example a map for the <inside IP 1, outside IP 1,port-block 1> followed at some later time with a map for <inside IP 2, outside IP 1, port-block 1>, is an indication that the free log for <inside IP 1, outside IP 1,port-block 1> is missing. This means that either the free log for <inside IP 1, outside IP 1,port-block 1> was lost or that a policy/pool/address-range was removed from the configuration. In the latter case, the operator should look in the CPM log for the summarization message.
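The open-sequence check described above can be sketched as follows; the record format is hypothetical, and a real implementation would parse the actual RADIUS accounting records:

```python
# Sketch of the consistency check described above: a second map log for the
# same <outside IP, port-block> without an intervening free log indicates
# either a lost free log or a bulk configuration change (in the latter case
# the operator should look for the CPM summarization message).

def find_open_sequences(records):
    """records: ordered list of ("map"|"free", inside_ip, outside_ip, block)."""
    active = {}      # (outside_ip, block) -> inside_ip currently holding it
    suspicious = []
    for kind, inside_ip, outside_ip, block in records:
        key = (outside_ip, block)
        if kind == "map":
            if key in active and active[key] != inside_ip:
                # Block re-mapped while still held: a free log is missing.
                suspicious.append((active[key],) + key)
            active[key] = inside_ip
        else:  # "free"
            active.pop(key, None)
    return suspicious

logs = [
    ("map", "10.0.0.1", "80.0.0.1", "1024-1055"),
    # free log for 10.0.0.1 missing here
    ("map", "10.0.0.2", "80.0.0.1", "1024-1055"),
]
print(find_open_sequences(logs))
# -> [('10.0.0.1', '80.0.0.1', '1024-1055')]
```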
The summarization logs are enabled via the event control 2021 tmnxNatLsnSubBlksFree, which is suppressed by default. Event control 2021 is also used to report when all blocks for the subscriber are freed.
Multiple NAT policies for a L2-aware subscriber can be selected based on the destination IP address of the packet. This allows the operator to assign different NAT pools and outside routing contexts based on the traffic destinations.
The mapping between the destination IP prefix and the NAT policy is defined in a nat-prefix-list. This nat-prefix-list is applied to the L2-aware subscriber via a sub-profile. Once the subscriber traffic arrives at the MS-ISA where NAT is performed, an additional lookup based on the destination IP address of the packet is executed to select the specific NAT policy (and consequently the outside NAT pool). Failure to find a specific NAT policy based on the destination IP address lookup results in selection of the default NAT policy referenced in the sub-profile.
CLI example:
As displayed in the example, multiple IP prefixes can be mapped to the same NAT policy.
The NAT prefix list cannot reference the default NAT policy. The default NAT policy is the one that is referenced directly under the sub-profile.
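The destination-based selection described above can be sketched as a prefix match with fallback to the default NAT policy. A longest-prefix match is assumed here (the text only states a destination lookup), and the prefixes and policy names are illustrative:

```python
# Sketch: select a NAT policy based on the packet's destination IP, falling
# back to the default NAT policy when no prefix in the list matches.
import ipaddress

def select_nat_policy(prefix_list, default_policy, dest_ip):
    dest = ipaddress.ip_address(dest_ip)
    best = None
    for prefix, policy in prefix_list.items():
        net = ipaddress.ip_network(prefix)
        # Keep the most specific matching prefix (assumption: longest match).
        if dest in net and (best is None or net.prefixlen > best[0].prefixlen):
            best = (net, policy)
    return best[1] if best else default_policy

prefix_list = {"10.10.0.0/16": "nat-pol-2", "10.0.0.0/8": "nat-pol-3"}
print(select_nat_policy(prefix_list, "nat-pol-1", "10.10.1.1"))  # nat-pol-2
print(select_nat_policy(prefix_list, "nat-pol-1", "192.0.2.1"))  # default
```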
In L2-aware NAT with multiple nat-policies, the NAT resources are allocated in each pool associated with the subscriber. This NAT resource allocation is performed at the time when the ESM subscriber is instantiated. Each NAT resource allocation will be followed by log generation.
For example, if RADIUS logging is enabled, one Alc-NAT-Port-Range VSA per NAT policy will be included in the acct START/STOP message.
Alc-Nat-Port-Range = "192.168.20.1 1024-1055 router base nat-pol-1"
Alc-Nat-Port-Range = "193.168.20.1 1024-1055 router base nat-pol-2"
Alc-Nat-Port-Range = "194.168.20.1 1024-1055 router base nat-pol-3"
Nat-policy change for L2-aware NAT is supported via a sub-profile change triggered by CoA. However, a change of sub-profile alone via CoA will not trigger generation of a new RADIUS accounting message, and thus NAT events related to the NAT policy change will not be promptly logged. For this reason, each CoA initiating a sub-profile change in a NAT environment should:
Note that the sla-profile must be changed and not just refreshed. In other words, replacing the existing sla-profile with the same one will not trigger a new accounting message.
Both of the above events will trigger an accounting update at the time when CoA is processed. This will keep NAT logging current. The information about NAT resources for logging purposes is conveyed in the following RADIUS attributes:
NAT logging behavior due to CoA will depend on the deployed accounting mode of operation. This is described in Table 1. Note that interim-update keyword must be configured for host/session accounting in order for Interim-Update messages to be triggered:
Table Legend:
AATR — Alc-Acct-Triggered-Reason VSA. This VSA is optionally carried in Interim-Update messages that are triggered by CoA.
ATAI — Alc-Trigger-Acct-Interim VSA. This VSA can be carried in CoA to trigger an Interim-Update message. The string carried in this VSA is reflected in the triggered Interim-Update message.
I-U – Interim-Update Message
 | Host or session accounting | Queue-instance accounting | Comments |
CoA Sub-prof change + ATAI VSA | Single I-U with: — released NAT info — unchanged NAT info — new NAT info — AATR — ATAI | Single I-U with: — released NAT info — unchanged NAT info — new NAT info — AATR — ATAI | Single I-U message is triggered by CoA. |
CoA Sub-profile change + Sla-profile change | First I-U: — released NAT info — unchanged NAT info — new NAT info Second I-U: — unchanged NAT info — new NAT info | Acct Stop: — released NAT info — unchanged NAT info — new NAT info Acct Start: — unchanged NAT info — new NAT info | Two accounting messages are triggered in succession. |
CoA Sub-profile change | — | — | No accounting messages are triggered by CoA. The next regular I-U messages will contain: — old (released) NAT info — unchanged NAT info — new NAT info. |
CoA Sub-profile change + Sla-profile-change + ATAI VSA | First I-U: — released NAT info — unchanged NAT info — new NAT info Second I-U: — unchanged NAT info — new NAT info — AATR — ATAI | Acct Stop: — released NAT info — unchanged NAT info — new NAT info Acct Start: — unchanged NAT info — new NAT info | Two accounting messages are triggered in succession. |
For example, the second CoA row describes the outcome triggered by a CoA carrying new sub- and sla-profiles. In host/session accounting mode this creates two Interim-Update messages. The first Interim-Update message carries information about:
The second Interim-Update message will carry information about the NAT resources that are in use (existing and new) once CoA is activated.
From this, the operator can infer which NAT resources are released by CoA and which NAT resources continue to be in use once CoA is activated.
Nat-policy change induced by CoA will trigger immediate log generation (for example acct STOP or INTERIM-UPDATE) indicating that the nat resources have been released. However, the NAT resources (outside IP addresses and port-blocks) in SR OS will not be released for another five seconds. This delay is needed to facilitate proper termination of traffic flow between the NAT user and the outside server during the NAT policy transition. A typical example of this scenario is the following:
Stale port forwards, similar to other stale dynamic mappings, are released after five seconds. Note that static port forwards are kept on the CPM. New CoAs related to NAT will be rejected (NAK'd) in case the previous change is still in progress (during the 5-second interval until the stale mappings are purged).
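The transition behavior just described (immediate logging, a 5-second hold on stale mappings, and a NAK for overlapping changes) can be sketched as a small state machine; timings and return codes are deliberately simplified:

```python
# Sketch: a NAT-related CoA is ACK'd and starts a 5-second purge window for
# stale mappings; a further NAT-related CoA within that window is NAK'd.
import time

class NatPolicyTransition:
    HOLD_SECONDS = 5

    def __init__(self):
        self._pending_until = 0.0

    def apply_coa(self, now=None):
        now = time.monotonic() if now is None else now
        if now < self._pending_until:
            return "NAK"   # previous change still purging stale mappings
        self._pending_until = now + self.HOLD_SECONDS
        return "ACK"       # stale flows cleared after HOLD_SECONDS

t = NatPolicyTransition()
print(t.apply_coa(now=100.0))  # ACK: change accepted
print(t.apply_coa(now=102.0))  # NAK: within the 5-second purge window
print(t.apply_coa(now=106.0))  # ACK: stale mappings already purged
```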
Unless a specific NAT policy is provided during Static Port Forward (SPF) creation (SPF creation command), the port forward is created in the pool referenced in the default NAT policy. A nat-policy can be part of the command used to modify or delete an SPF. If the NAT policy is not provided, the behavior is as follows:
A match is considered when at least these parameters from the modify/delete command are matched (mandatory parameters in the spf command):
UPnP will use the default NAT policy.
RADIUS Change of Authorization (CoA) can be used in subscriber management (ESM) to modify the NAT behavior of the subscriber. This can be performed by:
The behavior for NAT policy changes via CoA for LSN and L2-Aware NAT is summarized in Table 35.
Action | Outcome | Remarks | |
L2-Aware | LSN | ||
CoA - replacing NAT policy | Stale flows using the old NAT policy are cleared after 5 seconds. New flows immediately start using a new NAT policy. Restrictions:
| Stale flows using the old NAT policy continue to exist and are used for traffic forwarding until they naturally time out or are TCP-terminated. The exception to this is when the reference to the NAT policy in the filter was the last one for the inside VRF. In this scenario, the flows from the removed NAT policy are cleared immediately. New flows immediately start using the new NAT policy. | A NAT policy change via CoA is performed by changing the sub-profile for the ESM subscriber or by changing the ESM subscriber filter in the LSN case. 1 A sub-profile change alone does not trigger accounting messages in L2-aware NAT and consequently the logging information is lost. To ensure timely RADIUS logging of the NAT policy change in L2-aware NAT, each CoA must, in addition to the sub-profile change, also:
Both of the above events will trigger an accounting update at the time when CoA is processed. This keeps NAT logging current. |
1. In non-ESM environments, the NAT policy can be changed by replacing the interface filter via CLI for LSN case.
2. The SLA profile will have to be changed and not just refreshed. In other words, replacing the existing SLA profile with the same one will not trigger a new accounting message.
Adding, removing, or replacing DNAT parameters in LSN44 can be achieved through NAT policy manipulation in an IP filter for an ESM subscriber. The rules for NAT policy manipulation via CoA are given in Table 35. In L2-Aware NAT, CoA can be used to:
Once the DNAT configuration is modified via CoA (enable or disable DNAT, or change the default DNAT IP address), the existing flows affected by the change remain active for 5 more seconds, while new flows are created in accordance with the new configuration. After the 5-second timeout, the stale flows are cleared from the system.
The RADIUS attribute used to perform DNAT modifications is a composite attribute with the following format:
Alc-DNAT-Override (234) = “{<DNAT_state> | <DNAT-ip-addr>},[nat-policy]”
where: DNAT-state = none | disable — mutually exclusive with the DNAT-ip-addr parameter.
DNAT-ip-addr = the destination IPv4 address in dotted format (a.b.c.d); provides an implicit enable — mutually exclusive with the DNAT-state parameter.
nat-policy = NAT policy name — an optional parameter. If it is not present, the default NAT policy is assumed.
For example:
Alc-DNAT-Override = none — re-enables DNAT functionality in the default NAT policy, assuming that DNAT was previously disabled via the Alc-DNAT-Override=disable attribute, submitted either in an Access-Accept message or in a previous CoA. If the none value is received while DNAT is already enabled, a CoA ACK is sent back to the originator.
Alc-DNAT-Override = none,nat-pol-1 — re-enables DNAT functionality in the specific NAT policy named nat-pol-1.
Alc-DNAT-Override = none,1.1.1.1 — a CoA NAK is returned. The DNAT-state and DNAT-ip-addr parameters are mutually exclusive within the same Alc-DNAT-Override attribute.
Alc-DNAT-Override = 1.1.1.1 — changes the default DNAT IP address to 1.1.1.1 in the default NAT policy. If DNAT was disabled prior to receiving this CoA, it is implicitly enabled.
Alc-DNAT-Override = 1.1.1.1,nat-pol-1 — changes the default DNAT IP address to 1.1.1.1 in the specific NAT policy named nat-pol-1. DNAT is implicitly enabled if it was disabled prior to receiving this CoA.
The combination of sub-fields with the Alc-DNAT-Override RADIUS attribute and the corresponding actions are shown in Table 36.
DNAT-State | DNAT-ip-addr | NAT Policy | DNAT Action in L2-Aware NAT |
none | - | - | Re-enable DNAT in the default NAT policy. If DNAT was enabled prior to receiving this CoA, then no specific action will be carried out by the SR OS with the exception of sending the CoA ACK back to the CoA server. If a DNAT classifier is not present in the default NAT policy when this CoA is received, then CoA NAK (error) will be sent to the CoA server. |
none | - | nat-pol-name | Re-enable DNAT in the referenced NAT policy. If DNAT was enabled prior to receiving this CoA, no specific action will be carried out by the SR OS with the exception of sending the CoA ACK back to the CoA server. If the DNAT classifier is not present in the referenced NAT policy, a CoA NAK (error) will be sent to the CoA server. |
none | a.b.c.d | - | CoA NAK (error) is generated. These two parameters are mutually exclusive in the same Alc-DNAT-Override attribute. |
none | a.b.c.d | nat-pol-name | CoA NAK (error) is generated. DNAT-state and DNAT-ip-address parameters are mutually exclusive in the same Alc-DNAT-Override attribute. |
disable | - | - | Disable DNAT in the default NAT policy. If a DNAT classifier is not present in the default NAT policy, a CoA NAK (error) will be sent to the CoA server. |
disable | - | nat-pol-name | Disable DNAT in the referenced NAT policy. If a DNAT classifier is not present in the referenced NAT policy, a CoA NAK (error) will be sent to the CoA server. |
disable | a.b.c.d | - | CoA NAK (error) is generated. DNAT-state and DNAT-ip-address parameters are mutually exclusive in the same Alc-DNAT-Override attribute. |
disable | a.b.c.d | nat-pol-name | CoA NAK (error) is generated. DNAT-state and DNAT-ip-address parameters are mutually exclusive in the same Alc-DNAT-Override attribute. |
- | a.b.c.d | - | The default destination IP address is changed in the default NAT policy. |
- | a.b.c.d | nat-pol-name | The default destination IP address is changed in the referenced NAT policy. |
- | - | - or nat-pol-name | A CoA NAK (error) is generated. Either DNAT-state or DNAT-ip-address parameters must be present in the Alc-DNAT-Override attribute. |
If multiple Alc-DNAT-Override attributes with conflicting actions are received in the same CoA or Access-Accept, the action that occurred last will take precedence.
For example, if the following two Alc-DNAT-Override attributes are received in the same CoA, the last one takes effect and consequently DNAT will be disabled in the default NAT policy:
Alc-DNAT-Override = “1.1.1.1“
Alc-DNAT-Override = “disable“
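The parsing rules above (state and IP mutually exclusive, optional policy name, last conflicting action wins) can be sketched as follows; the tuple representation and "NAK" return value are illustrative, not the actual implementation:

```python
# Sketch: parse Alc-DNAT-Override values and apply them with last-wins
# semantics, per the rules in Table 36 above.
import ipaddress

def _is_ip(s):
    try:
        ipaddress.ip_address(s)
        return True
    except ValueError:
        return False

def parse_dnat_override(value):
    parts = [p.strip() for p in value.split(",")]
    if len(parts) > 2 or not parts[0]:
        return "NAK"
    policy = parts[1] if len(parts) == 2 else "default"
    if len(parts) == 2 and _is_ip(policy):
        return "NAK"   # DNAT-state/IP and a second IP are mutually exclusive
    if parts[0] == "none":
        return (policy, "enable", None)
    if parts[0] == "disable":
        return (policy, "disable", None)
    if _is_ip(parts[0]):
        return (policy, "set-dnat-ip", parts[0])   # implicit enable
    return "NAK"

def apply_overrides(values):
    result = {}
    for v in values:
        parsed = parse_dnat_override(v)
        if parsed == "NAK":
            return "NAK"
        policy, action, ip = parsed
        result[policy] = (action, ip)   # last attribute wins per policy
    return result

print(apply_overrides(["1.1.1.1", "disable"]))  # last action wins: disabled
print(apply_overrides(["none,1.1.1.1"]))        # state + IP together -> NAK
```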
The following table describes the outcome when the active NAT prefix list or NAT classifier is modified via CLI.
Action | Outcome | Remarks | |
L2-Aware | LSN | ||
CLI – Modifying prefix in the NAT prefix list | Existing flows are always checked whether they comply with the NAT prefix list that is currently applied in the subscriber profile for the subscriber. If the flows do not comply with the current NAT prefix list, they are cleared after 5 seconds. The new flows immediately start using the updated settings. | Changing the prefix in the NAT prefix list will internally re-subnet the outside IP address space. | A NAT prefix list is used with multiple NAT policies in L2-aware NAT and for downstream Internal subnet in DNAT-only scenario for LSN. The prefix can be modified (added, removed, remapped) at any time in the NAT prefix list, while the NAT policy must be first shut down via CLI. |
CLI – Modifying or replacing the NAT classifier | Existing flows are always checked whether they comply with the NAT classifier that is currently applied in the active NAT policy for the subscriber. If the flows do not comply with the current NAT classifier, they are cleared after 5 seconds. The new flows immediately start using the updated settings. | Changing the NAT classifier has the same effect as in L2-aware NAT; all existing flows using the NAT classifier are checked whether they comply with this classifier or not. | The NAT classifier is used for DNAT. The NAT classifier is referenced in the NAT policy. |
CLI - Removing or adding NAT policy in NAT prefix list | Blocked | Not applicable | |
CLI - Removing or adding NAT policy in the subscriber profile | Blocked | Not applicable | |
CLI - Removing, adding or replacing NAT prefix list under the rtr/nat/inside/DNAT-only | Not applicable | This action triggers internal re-subnetting of the source address space according to the new NAT prefix list. However, the current flows in the MS-ISA are not affected by this change. In other words, they are not removed if the associated prefix is removed from the prefix list. |
PCP is a protocol that operates between subscribers and the NAT directly. This makes the protocol similar to DHCP or PPP in that the subscriber has a limited but direct control over the NAT behavior.
PCP is designed to allow the configuration of static port forwards, to obtain information about existing port forwards, and to obtain the outside IP address from software running in the home network or on the CPE.
PCP runs on each MS-ISA as its own process and makes use of the same source-IP hash algorithm as the NAT mappings themselves. The protocol itself is UDP-based and request/response in nature, in some ways similar to UPnP.
PCP operates on a specified loopback interface in a similar way to the local DHCP server. It operates over UDP on a port specified in the CLI. As the Epoch is used to help recover mappings, a unique PCP service must be configured for each NAT group.
When the epoch is lowered, there is no mechanism to inform the clients to refresh their mappings en masse. External synchronization of mappings is possible between two chassis (the epoch does not need to be synchronized). If the epoch is unsynchronized, the result is that clients re-create their mappings on the next communication with the PCP server.
The R-bit indicates a request (0) or a response (1). As this is a request, it is set to (0).
OpCode defined as:
Requested Lifetime: Lifetime 0 means delete.
As this is a response, R = (1).
The Epoch field increments by 1 every second and can be used by the client to determine whether state needs to be restored. On any failure of the PCP server or the NAT with which it is associated, the Epoch must restart from zero (0).
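The client-side epoch check implied above can be sketched as follows: the Epoch ticks once per second, so a value lower than what local elapsed time predicts means the server (or its NAT) restarted from zero and the mappings must be re-created. The tolerance handling here is a simplification, not the exact protocol rule:

```python
# Sketch: detect PCP server state loss by comparing the received Epoch with
# the value expected from locally elapsed time.

class EpochTracker:
    def __init__(self):
        self.last_epoch = None
        self.last_local = None

    def server_lost_state(self, epoch, local_time):
        """Return True if the client should re-create its mappings."""
        lost = False
        if self.last_epoch is not None:
            elapsed = local_time - self.last_local
            # Epoch should advance roughly in step with local time; a value
            # well below the expectation (e.g. restart from 0) means loss.
            lost = epoch < self.last_epoch + elapsed - 1
        self.last_epoch, self.last_local = epoch, local_time
        return lost

t = EpochTracker()
print(t.server_lost_state(1000, 0))   # first observation: nothing to compare
print(t.server_lost_state(1010, 10))  # advanced as expected: state intact
print(t.server_lost_state(3, 20))     # restarted near 0: restore mappings
```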
Result Codes:
0 SUCCESS, success.
1 UNSUPP_VERSION, unsupported version.
2 MALFORMED_REQUEST, a general catch-all error.
3 UNSUPP_OPCODE, unsupported OpCode.
4 UNSUPP_OPTION, unsupported option. Only if the Option was mandatory.
5 MALFORMED_OPTION, malformed option.
6 UNSPECIFIED_ERROR, server encountered an error.
7 MISORDERED_OPTIONS, options not in correct order.
Creating a Mapping
Client Sends
MAP4 opcode is (1). Protocols: 0 – all; 1 – ICMP; 6 – TCP; 17 – UDP.
MAP4 (1), PEER4 (3) and PREFER_FAILURE are supported. FILTER and THIRD_PARTY are not supported.
Universal Plug and Play (UPnP) is a set of specifications defined by the UPnP Forum. One specification, called Internet Gateway Device (IGD), defines a protocol for clients to automatically configure port mappings on a NAT device. Today, many gaming, P2P, and VoIP applications support the UPnP IGD protocol. The SR OS supports the following UPnP version 1 InternetGatewayDevice version 1 features:
PPTP is defined in RFC 2637, Point-to-Point Tunneling Protocol (PPTP), and is used to provide a VPN connection for home/mobile users to gain secure access to the enterprise network. The encrypted payload is transported over a GRE tunnel that is negotiated over a TCP control channel. In order for PPTP traffic to pass through the NAT, the NAT device must correlate the TCP control channel with the corresponding GRE tunnel. This mechanism is referred to as a PPTP ALG.
There are two components of PPTP:
The control connection is established from the PPTP clients (for example, home users behind the NAT) to the PPTP server, which is located on the outside of the NAT. Each session that carries data between the two endpoints can be referred to as a call. Multiple sessions (or calls) can carry data in a multiplexed fashion over a tunnel. The tunnel protocol is defined by a modified version of GRE. The Call ID in the GRE header is used to multiplex sessions over the tunnel. The Call ID is negotiated during the session/call establishment phase.
Control Connection Management — The following messages are used to maintain the control connection.
The remaining control message types are sent over the established TCP session to open/maintain sessions and to convey information about the link state:
Call Management — Call management messages are used to establish/terminate a session/call and to exchange information about the multiplexing field (Call-id). Call-IDs must be captured and translated by the NAT. The call management messages are:
Error Reporting — This message is sent by the client to indicate WAN error conditions that occur on the interface supporting PPP.
PPP Session Control — This message is sent in both directions to setup PPP-negotiated options.
Once the Call ID is negotiated by both endpoints, it is inserted in the GRE header and used as the multiplexing field in the tunnel that carries data traffic.
A GRE tunnel is used to transport data between two PPTP endpoints. The packet transmitted over this tunnel has the following general structure:
The GRE header contains the Call ID of the peer for the session for which the GRE packet belongs.
The PPTP ALG is aware of the control session (Start-Control-Connection-Request/Reply) and consequently captures the Call ID field in all PPTP messages that carry that field. In addition to translating the inside IP and TCP port, the PPTP ALG processes data beyond the TCP header in order to extract the Call ID field and translate it inside the Outgoing-Call-Request messages initiated from the inside of the NAT.
The GRE packets with corresponding Call IDs are translated through the NAT as follows:
In addition, the following applies:
The basic principle of PPTP NAT ALG is shown in Figure 63.
The scenario where multiple clients behind the NAT are terminated to the same PPTP server is shown in Figure 64. In this case, it is possible that the source IP addresses of the two PPTP clients are mapped to the same outside address of the NAT. Since the endpoints of the GRE tunnel from the NAT to the PPTP server will be the same for both PPTP clients (although their real source IP addresses are different), the NAT must ensure the uniqueness of the Call-IDs in the outbound data connection. This is where Call-ID translation in the NAT becomes crucial.
The router supports a deployment scenario where multiple calls (or tunnels) are established from a single PPTP node within a single control connection. In this case, there is only one set of Start-Control-Connection-Request/Reply messages (one control channel) and multiple sets of Outgoing-Call-Request/Reply messages.
Call IDs are taken from the same pool as the ICMP port ranges. Port ranges and Call IDs are both 16-bit values. The Call ID selection mechanism is the same as the outside TCP/UDP port selection mechanism (random with parity).
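The uniqueness requirement described above (two inside clients sharing one outside IP toward the same server must get distinct outside Call-IDs) can be sketched as follows. Preserving the parity of the inside Call-ID mirrors the "random with parity" port selection mentioned above, but the exact selection algorithm here is an assumption:

```python
# Sketch: allocate a unique 16-bit outside Call-ID, random with parity,
# so that GRE sessions sharing an outside IP remain distinguishable.
import random

def allocate_call_id(inside_call_id, in_use, rng=random):
    # Assumption: the outside Call-ID keeps the parity of the inside one.
    parity = inside_call_id & 1
    while True:
        candidate = (rng.randrange(1, 0x10000) & ~1) | parity
        if candidate and candidate not in in_use:
            in_use.add(candidate)
            return candidate

used = set()
a = allocate_call_id(100, used)   # client 1, inside Call-ID 100
b = allocate_call_id(100, used)   # client 2 picked the same inside Call-ID
print(a != b, a % 2 == 0, b % 2 == 0)   # distinct outside IDs, parity kept
```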
LSN logging is extremely important to Service Providers (SPs), who are required by government agencies to track the source of suspicious Internet activities back to the users hidden behind the LSN device.
The 7750 SR supports several modes of logging for LSN applications. Choosing the right logging model will depend on the required scale, simplicity of deployment and granularity of the logged data.
For most purposes, logging the allocation/de-allocation of outside port blocks and outside IP addresses, along with the corresponding LSN subscriber and inside service-id, will suffice.
In certain cases port-block based logging is not satisfactory and per flow logging is required.
The simplest form of LSN and L2-Aware NAT logging is via the logging facility in the 7750 SR, commonly called the logger. Each port-block allocation/de-allocation event is recorded and sent to the system logging facility (logger). Such an event can be:
In this mode of logging, all applications in the system share the same logger.
Syslog/SNMP/Local-File logging on LSN is mutually exclusive with NAT RADIUS-based logging.
Syslog/SNMP/local-file logging must be separately enabled for LSN and L2-Aware NAT in log event-control. The following displays relevant MIB events:
In this example, a single port block [1884-1888] is allocated/de-allocated for the inside IP address 5.5.5.5, which is mapped to the outside IP address 80.0.0.1. Consequently, the event is logged in memory.
Once the desired LSN events are enabled for logging via event-control configuration, they can be logged to memory via standard log-id 99 or be filtered via a custom log-id, such as in this example (log-id 5):
Configuration:
The event description is given below:
In this case, the destination of log-id 5 in the following example would be a local file instead of memory:
The events will be logged to a local file on the compact flash cf3 in a file under the /log directory.
In the case of SNMP logging to a remote node, the log destination should be set to the SNMP destination. Allocation/de-allocation of each port block triggers an SNMP trap message to the trap destination.
NAT logs can be sent to a syslog remote facility. A separate syslog message is generated for every port-block allocation/de-allocation.
Severity level for this event can be changed via CLI:
LSN RADIUS logging (or accounting) is based on RADIUS accounting messages as defined in RFC 2866. It requires an operator to have RADIUS accounting infrastructure in place. For that reason, LSN RADIUS logging and LSN RADIUS accounting terms can be used interchangeably.
This mode of logging operation is introduced so that the shared logging infrastructure in 7750 SR can be offloaded by disabling syslog/SNMP/Local-file LSN logging. The result is increased performance and higher scale, particularly in cases when multiple BB-ISA cards within the same system are deployed to perform aggregated LSN functions.
An additional benefit of LSN RADIUS logging over syslog/SNMP/local-file logging is reliable transport. Although RADIUS accounting relies on unreliable UDP transport, each accounting message from the RADIUS client must be acknowledged on the application level by the receiving end (accounting server).
Each port-block allocation or de-allocation is reported to an external accounting (logging) server in the form of start, interim-update or stop messages. The type of accounting messages generated depends on the mode of operation:
The accounting messages are generated and reported directly from the BB-ISA card, therefore bypassing accounting infrastructure residing on the Control Plane Module (CPM).
LSN RADIUS logging is enabled per nat-group. To achieve the required scale, each BB-ISA card in the nat-group group with LSN RADIUS logging enabled runs a RADIUS client with its own IP address. Accounting messages can be distributed to up to five accounting servers that can be accessed in round-robin fashion. Alternatively, in direct access mode, only one accounting server in the list is used. When this server fails, the next one in the list is used.
Configuration steps:
Each BB-ISA card is assigned one IPv4 address from the source-address-range command, and this IPv4 address must be accessible from the accounting server. In the following example there is only one BB-ISA card in nat-group 1. Its source IP address is 114.0.1.20.
It is possible to load-balance accounting messages over multiple logging servers by configuring the access-algorithm to round-robin mode. After the LSN RADIUS accounting policy is defined, it must be applied to a nat-group:
The RADIUS accounting messages for the case where a Large Scale NAT44 subscriber has allocated two port blocks, in a logging mode where an accounting start/stop is generated per port block, are shown below.
Port-block allocation for the NAT44 subscriber:
Port-block de-allocation:
The inclusion of acct-multi-session-id in the NAT accounting policy enables generation of start/stop messages for each allocation/de-allocation of a port block within the subscriber. Otherwise, only the first and last port blocks for the subscriber generate a pair of start/stop messages; all port blocks in between trigger interim-update messages.
The User-Name attribute in accounting messages is set to app-name@inside-ip-address, where the app-name can be any of the following: LSN44, DS-Lite or NAT64.
Currently-allocated NAT resources (such as a public IP address and a port block for a NAT subscriber) can be periodically refreshed via Interim-Update (I-U) accounting messages. This functionality is enabled by the periodic RADIUS logging facility. Its primary purpose is to preserve logging information for long-lived sessions in environments where NAT logs are periodically and deliberately deleted from the service provider’s network. This is typically the case in countries where privacy laws impose a limit on the amount of time that information about a customer’s traffic can be retained in the service provider’s network.
Periodic RADIUS logging for NAT is enabled by the following command:
The configurable interval dictates the frequency of I-U messages that are generated for each currently allocated NAT resource (such as a public IP address and a port block).
By default, the I-U messages are sent in rapid succession for a subscriber, without any intentional delay inserted by SR OS. For example, a NAT subscriber with 8 NAT policies, each configured with 40 port ranges, generates 320 consecutive I-U messages at the expiration of the configured interval. This can create a surge in I-U message generation when intervals are synchronized for multiple NAT subscribers, which can have adverse effects on logging behavior; for example, the logging server can drop messages because it cannot process the high rate of incoming I-U messages.
To prevent this, the rate of I-U message generation can be controlled by a rate-limit CLI parameter.
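The burst arithmetic and the effect of pacing can be sketched as follows. This is an illustrative Python calculation using the example numbers from the text; the rate-limit value of 50 messages per second is an assumption, not a default:

```python
def iu_burst(nat_policies, port_ranges_per_policy):
    """I-U messages generated per subscriber per interval:
    one message per currently allocated NAT resource (port range)."""
    return nat_policies * port_ranges_per_policy

def drain_time_s(messages, rate_limit_per_s):
    """Seconds needed to pace a burst at a given rate-limit."""
    return messages / rate_limit_per_s

burst = iu_burst(8, 40)  # the example from the text: 320 messages
print(burst, drain_time_s(burst, 50))  # paced over 6.4 seconds at 50 msg/s
```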
The periodic logging is applicable to both modes of RADIUS logging in NAT:
Periodic I-U message output can be paced in order to avoid congestion at the logging server. Pacing is controlled by the rate-limit option of the periodic-update command. As an example, consider the following hypothetical case:
In the case of an MS-ISA switchover or a NAT multi-chassis redundancy switchover, a large number of subscribers may become active at approximately the same time on the newly active MS-ISA (or chassis). This causes a large number of logs to be sent in a relatively short amount of time, which may overwhelm the logging server. The rate-limit parameter is designed to help in such situations.
Logging of L2-Aware NAT is supported via accounting policy associated with the ESM subscriber (outside of NAT). In addition to ESM subscriber specific attributes, the NAT port-ranges and outside IP address (nat-port-range command in regular ESM accounting policy) are reported in the same accounting messages.
RADIUS accounting initiated by BB-ISA card is not supported for L2-Aware NAT.
Syslog/SNMP/Local-file logging can be enabled simultaneously with L2-aware NAT RADIUS accounting (which is in this case regular ESM RADIUS accounting).
LSN and L2-Aware NAT Flow logging is a facility that allows each BB-ISA card to export the creation and deletion of NAT flows to an external server. A NAT flow or a Fully Qualified Flow consists of the following parameters: Inside IP, inside port, outside IP, outside port, foreign IP, foreign port, protocol (UDP, TCP, ICMP).
In addition, the inside/outside service ID and subscriber string are added to the flow record.
Flow logging can be deployed as an alternative to the port-range logging or can be complementary (providing a more granular log for offline reporting or compliance). Certain operators have legal and compliance requirements that require extremely detailed logs, created per flow, to be exportable from the NAT node.
Because the setup rate of new flows is excessive, logging to an internal facility (such as compact flash) is not possible except in a debugging mode (which must specify match criteria down to the inside-IP and service level).
Flow logging can be enabled per NAT policy and consequently it is initiated from each BB-ISA card independently as a UDP stream, unlike a centralized Netflow/Cflowd application.
Flows are formatted according to IETF RFC 5101, Specification of the IP Flow Information Export (IPFIX) Protocol for the Exchange of IP Traffic Flow Information. Data structures are defined in RFC 5102, Information Model for IP Flow Information Export. NAT flow logging can be sent to up to two different IP addresses, both of which must be unicast IPv4 destinations. These UDP streams are stateless due to the significant volume of transactions; however, they contain sequence numbers so that packet loss can be identified. They egress the chassis at FC NC.
IPFIX defines two different types of messages that are sent from the IPFIX exporter (the 7750 SR NAT node). The first contains a Template Set: an IPFIX message that defines fields for subsequent IPFIX messages but contains no actual data of its own. The second type contains Data Sets: here, the data is passed using a previous Template Set message to define the fields. This means an IPFIX message is not passed as a set of TLVs; instead, data is encoded with a scheme defined through the Template Set message.
While an IPFIX message can contain both a Template Set and a Data Set, the 7750 SR sends Template Set messages periodically without any data, whereas Data Set messages are sent on demand and as required. When IPFIX is used over UDP, the retransmission frequency of Template Set messages defaults to 10 minutes. The retransmission interval is configurable in the CLI, with a minimum of 1 minute and a maximum of 10 minutes. When the exporter first initializes, or when a configuration change occurs, the Template Set is sent out three times, one second apart. Templates are sent before any Data Sets (assuming the collector is enabled) so that an IPFIX collector can establish the data template set.
Although the UDP transport is unreliable, the IPFIX Sequence Number is a 32-bit number that contains the total number of IPFIX Data Records sent for the UDP transport session prior to the receipt of the new IPFIX message. The sequence number starts at 0 and wraps around at 2^32 (modulo 4,294,967,296).
The default packet size is 1500 bytes unless another value is configured (the range is 512 to 9212 bytes inclusive). Traffic is originated from a random high port to the collector on port 4739. Multiple create/delete flow records are packed into a single IPFIX packet (although mapping creates are not delayed) until stuffing an additional data record would exceed the MTU or a timer expires. The timer is not configurable and is set to 250 ms; that is, should any mapping occur, a packet is sent within 250 ms of that mapping being created.
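The record-stuffing logic described above can be approximated with the following Python sketch. The function name and the byte-counting model are our illustration; the 250 ms timer is fixed in the implementation, while the packet size is the configurable value:

```python
DEFAULT_PACKET_SIZE = 1500  # bytes; configurable between 512 and 9212
FLUSH_TIMER_MS = 250        # fixed, not configurable

def should_flush(buffered_bytes, next_record_bytes,
                 packet_size=DEFAULT_PACKET_SIZE, elapsed_ms=0):
    """Send the pending IPFIX packet when the next data record would
    no longer fit, or when the flush timer has expired."""
    return (buffered_bytes + next_record_bytes > packet_size
            or elapsed_ms >= FLUSH_TIMER_MS)

print(should_flush(1400, 200))                # record does not fit: flush
print(should_flush(100, 100))                 # still room and time: keep buffering
print(should_flush(100, 100, elapsed_ms=250)) # timer expired: flush
```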
Each collector has buffering space for 50 packets. If excessive logging exhausts the buffering space, new flows are denied and the deletion of flows is delayed until buffering space becomes available.
Two collector nodes can be defined in the same IPFIX export policy for redundancy purposes.
This section provides an example of how to configure large scale NAT44 flow logging.
Define a collector node along with other local transport parameters through an IPFIX export-policy.
To export flow records via a UDP stream, the BB-ISA card must be configured with an appropriate IPv4 address within a designated VPRN. This address (/32) acts as the source for sending all IPFIX records and is shared by all ISAs.
After the IPFIX export policy is defined, apply it within the NAT policy:
Captures of an IPFIX packet for an ICMP flow creation and deletion are shown in the following examples.
Flow creation:
Flow deletion:
Table 39 lists the values and descriptions of the fields in the example flow creation and deletion templates.
Field | Size (B) | Description |
Export Timestamp | N/A | Timestamp derived from chassis NTP, per RFC 5101 |
Sequence Id | N/A | Total number of IPFIX data records sent for the UDP transport session prior to the receipt of the new IPFIX message (modulo 2^32), per RFC 5101 |
Observation Domain ID | N/A | Unique ID set per ISA in the 7750 SR chassis |
FlowID | 8 | Unique ID (per observation domain ID) for this flow used for tracking purposes only (opaque value); flow ID in a create and a delete mapping record must be the same for a specific NAT mapping |
IP_SRC_ADDR | 4 | Outside IP address used in the NAT mapping |
IP_DST_ADDR | 4 | Destination or remote IP address used in the NAT mapping |
L4_SRC_PORT | 2 | Outside source port used in the NAT mapping |
L4_DST_PORT | 2 | Destination or remote port used in the NAT mapping |
flowStartMilliseconds 1 | 8 | Timestamp when the flow was created (chassis NTP derived) in milliseconds from epoch, per RFC 5102 |
flowEndMilliseconds 2 | 8 | Timestamp when the flow was destroyed (chassis NTP derived) in milliseconds from epoch, per RFC 5102 |
PROTOCOL | 1 | Protocol ID (TCP, UDP or ICMP), per RFC 5102 |
PADDING | 1 | N/A |
flowEndReason | 1 | Supported flow end reasons:
|
aluInsideServiceID | 2 | 16-bit service ID representing the inside service ID |
aluOutsideServiceID | 2 | 16-bit service ID representing the outside service ID |
aluNatSubString | var | A variable, 8-byte aligned string that represents the NAT subscriber construct (as currently used in the tools dump service nat session commands) |
Notes:
In general, fragmentation functionality is invoked when the size of a fragmentation eligible packet exceeds the size of the MTU of the egress interface/tunnel. Packets eligible for fragmentation are:
The best practice is to avoid fragmentation in the network by ensuring an adequate MTU size on the transit/source nodes. Drawbacks of fragmentation are:
Fragmentation can be particularly deceiving in a tunneled environment whereby the tunnel encapsulation adds extra overhead to the original packet. This extra overhead could tip the size of the resulting packet over the egress MTU limit.
Fragmentation can be a solution in cases where a restriction in the MTU size on the packet’s path from the source to the destination cannot be avoided. Routers support IPv6 fragmentation in DS-Lite and NAT64 with some enriched capabilities, such as optional IPv6 fragmentation even in cases where the DF bit in the corresponding IPv4 packet is set.
In general, the lengths of the fragments must be chosen so that the resulting fragment packets fit within the MTU of the path to the packets’ destination(s).
In the downstream direction, fragmentation can be implemented in two ways:
In the upstream direction, IPv4 packets can be fragmented after they are de-capsulated in DS-Lite or translated in NAT64. The fragmentation occurs in the IOM.
In the downstream direction, the IPv6 packet carrying the IPv4 packet (IPv4-in-IPv6) is fragmented in the ISA when the configured DS-Lite tunnel-mtu is smaller than the size of the IPv4 packet that is to be tunneled inside the IPv6 packet. The maximum IPv6 fragment size is 48 bytes larger than the value set by the tunnel-mtu. The additional 48 bytes are added by the IPv6 header fields: 40 bytes for the basic IPv6 header plus 8 bytes for the IPv6 fragmentation extension header. The NAT implementation in the routers does not insert any IPv6 extension headers other than the fragmentation header.
If the IPv4 packet is larger than the value set by the tunnel-mtu, the fragmentation action depends on the configuration options and the DF bit setting in the received IPv4 header:
If the IPv4 packet is dropped because fragmentation is not allowed, an ICMPv4 Datagram Too Big message is returned to the source. This message carries information about the supported MTU size, in essence notifying the source to reduce its MTU size to the requested value (tunnel-mtu).
The maximum number of supported fragments per IPv6 packet is 8. Considering that the minimum standards-based size for an IPv6 packet is 1280 bytes, 8 fragments are enough to cover jumbo Ethernet frames.
Downstream fragmentation in NAT64 works in a similar fashion. The difference from DS-Lite is that in NAT64 the configured ipv6-mtu represents the MTU size of the IPv6 packet (as opposed to the payload of the IPv6 tunnel in DS-Lite). In addition, the IPv4 packet in NAT64 is not tunneled; instead, the IPv4/v6 headers are translated. Consequently, the fragmented IPv6 packet is 28 bytes larger than the translated IPv4 packet: a 20-byte difference in basic IP header sizes (40-byte IPv6 header versus 20-byte IPv4 header) plus 8 bytes for the IPv6 fragmentation extension header. The only extension IPv6 header that NAT64 generates is the fragmentation header.
If the IPv4 packet is dropped because fragmentation is not allowed, the returned ICMP message contains an MTU size of ipv6-mtu minus 28 bytes.
Otherwise the fragmentation options are the same as in DS-lite.
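The header-overhead arithmetic above can be summarized in a short Python sketch. The constants follow directly from the header sizes stated in the text; the function names are ours:

```python
IPV6_HEADER = 40      # basic IPv6 header
FRAG_EXT_HEADER = 8   # IPv6 fragmentation extension header
IPV4_HEADER = 20      # basic IPv4 header

def dslite_max_fragment(tunnel_mtu):
    """DS-Lite downstream: each IPv6 fragment is at most
    tunnel-mtu plus 48 bytes of IPv6 headers."""
    return tunnel_mtu + IPV6_HEADER + FRAG_EXT_HEADER

def nat64_icmp_mtu(ipv6_mtu):
    """NAT64: the ICMP Datagram Too Big message advertises
    ipv6-mtu minus 28 bytes (20-byte header-size delta plus
    the 8-byte fragmentation extension header)."""
    return ipv6_mtu - ((IPV6_HEADER - IPV4_HEADER) + FRAG_EXT_HEADER)

print(dslite_max_fragment(1500))  # 1548-byte IPv6 fragments
print(nat64_icmp_mtu(1500))       # 1472 bytes advertised to the source
```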
The NAT command histogram displays compartmentalized port distribution per protocol for an aggregated number of subscribers. This allows operators to trend port usage over time and consequently adjust the configuration as the port demand per subscriber increases or decreases. For example, an operator may find that the port usage in a pool has increased over a period of time. Accordingly, the operator may plan to increase the number of ports per port block.
The feature is not applicable to pools which operate in one-to-one mode.
The output is organized in port buckets with the number of subscribers in each bucket.
For example:
The output of the histogram command can be periodically exported to an external destination via cron. The following is an example:
The nat-histogram.txt file contains the command execution line. For example:
This command is executed every 10 minutes (600 seconds), and the output of the command is written into a set of files on an external FTP server:
The output of this command displays the port usage in a given pool per protocol per subscriber. The output is organized in a configurable number of port-buckets.
In the following example there is 1 subscriber that is using between 20 and 39 UDP ports in the pool named det. The pool is configured in the Base routing instance.
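The bucketing used in the output above can be illustrated with a small Python sketch. The bucket width of 20 ports matches the [20-39] bucket in the example; the function name and input format are our assumptions:

```python
from collections import Counter

def port_histogram(ports_in_use_per_sub, bucket_width=20):
    """Group subscribers into port-usage buckets, returning a mapping
    of bucket start (e.g. 20 for the [20-39] bucket) to the number of
    subscribers whose port usage falls in that bucket."""
    return Counter(
        (used // bucket_width) * bucket_width
        for used in ports_in_use_per_sub
    )

# One subscriber using 25 UDP ports lands in the [20-39] bucket,
# as in the example output above
print(port_histogram([25]))
```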
Multi-chassis stateless NAT redundancy is based on a switchover of the NAT pool that can assume active (master) or standby state. The inside/outside routes that attract traffic to the NAT pool are always advertised from the active node (the node on which the pool is active).
This dual-homed redundancy based on the pool mastership state works well in scenarios where each inside routing context is configured with a single NAT policy (NATed traffic within this inside routing context will be mapped to a single NAT pool).
However, when inside traffic is mapped to multiple pools (with deterministic NAT, or when multiple NAT policies are configured per inside routing context), the basic per-pool multi-chassis redundancy mode can cause inside traffic within the same routing instance to fail, because some pools referenced from the routing instance might be active on one node while other pools are active on the other node.
Consider a case where traffic ingressing the same inside routing instance is mapped as follows (this mapping can be achieved via filters):
Traffic for the same destination is normally attracted only to one NAT node (the destination route is advertised only from a single NAT node). Let us assume that this node is Node 1 in our example. When the traffic arrives at the NAT node, it is mapped to the corresponding pool according to the mapping criteria (routing-based or filter-based). But if active pools are not co-located, traffic destined for a pool that is active on the neighboring node fails. In our example, traffic from source IP address B arrives at Node 1 while the corresponding Pool 2 is inactive on that node; consequently, the traffic forwarding fails.
To remedy this situation, a group of pools targeted from the same inside routing context must be active on the same node simultaneously. In other words, the active pools referenced from the same inside routing instance must be co-located. This group of pools is referred to as a Pool Fate Sharing Group (PFSG). A PFSG is defined as the group of all NAT pools referenced by inside routing contexts whereby at least one of those pools is shared by those inside routing contexts. This is shown in Figure 66.
Even though only Pool 2 is shared between subscribers in VRF 1 and VRF 2, the remaining pools in VRF 1 and VRF 2 must be made part of PFSG 1 as well.
This ensures that the inside traffic is always mapped to pools that are active on a single node.
There is always one lead pool in a PFSG. The lead pool is the only pool that exports/monitors routes. The other pools in the PFSG reference the lead pool and inherit its activity state. If any of the pools in the PFSG fails, all the pools in the PFSG switch activity; in other words, they share the fate of the lead pool (active/standby/disabled).
There is one lead pool per PFSG per node in a dual-homed environment. Each lead pool in a PFSG has its own export route that must match the monitoring route of the lead pool in the corresponding PFSG on the peering node.
PFSG is implicitly enabled by configuring multiple pools to follow the same lead pool.
Attracting traffic to the active NAT node (from inside and outside) is based on the routing.
On the outside, the active pool address range is advertised. On the inside, the destination prefix or steering route (in the case of filter-based diversion to the NAT function) is advertised by the node with the active pool.
The advertisement of the routes is driven by the activity of the pools in the pool fate sharing group:
For example:
A pool can be one of the following:
Both sets of options are thus mutually exclusive.
For a lead pool, redundancy is enabled only when the redundancy node is in a no shutdown state. For a following pool, the administrative state has no effect; redundancy is enabled only when the lead pool is enabled.
Before a lead pool is enabled, a consistency check is performed to make sure that the PFSG is properly configured and that all pools in the given PFSG belong to the same NAT isa-group. A PFSG is implicitly enabled by configuring multiple pools to follow the same lead pool. Adding or removing pools from the fate-share-group is only possible when the lead pool is disabled.
For example, in the following case, the consistency check fails because pool 1 is not part of PFSG 1 (where it should be).
The following command displays the state of the leading pool (dual-homing section towards the bottom of the command output):
The following command displays the state of the follower pool (dual-homing section towards the bottom of the command output):
The following command lists all the pools that are configured along with the NAT inside/outside routing context.
NAT ISA redundancy helps protect against Integrated Service Adapter (ISA) failures. This protection mechanism relies on the CPM maintaining a configuration copy of each ISA. If an ISA fails, the CPM restores the NAT configuration from the failed ISA to the remaining ISAs in the system. The NAT configuration copy of each ISA, as maintained by the CPM, is concerned with the configuration of outside IP addresses and port forwards on each ISA. However, the CPM does not maintain the state of dynamically created translations on each ISA. This causes an interruption in traffic until the translations are re-initiated by the devices behind the NAT.
Two modes of operation are supported:
By reserving memory resources, it can be ensured that failed traffic is recovered by the remaining ISAs, potentially with some bandwidth reduction if the remaining ISAs operated at or close to full speed before the failure occurred. The active-active ISA redundancy model is shown in Figure 69.
If an ISA fails, the member-id of the member ISA that failed is contained in the FREE log. This information is used to find the corresponding MAP log, which also contains the member-id field.
In the case of RADIUS logging, a CPM summarization trap is generated (because the RADIUS log is sent from the ISA, which has failed).
In active-active ISA redundancy, each ISA is subdivided into multiple logical ISAs. These logical sub-entities are referred to as members. The NAT configuration of each member is saved in the CPM. If any one ISA fails, its members are downloaded by the CPM to the remaining active ISAs. Memory resources on each ISA are reserved to accommodate additional traffic from the failed ISAs. The amount of resources reserved per ISA depends on the number of ISAs in the system and the number of simultaneously supported ISA failures. The number of simultaneous ISA failures per system is configurable. Memory reservation affects NAT scale per ISA.
Traffic received on the inside is forwarded by the ingress forwarding complex to a predetermined member ISA for further NAT processing. Each ingress forwarding complex maintains an internal link per member. The number of these internal links determines, among other factors, the maximum number of members per system and, with this, the granularity of traffic distribution over the remaining ISAs in case of an ISA failure. The segmentation of ISAs into members for a single-failure scenario is shown in Figure 70. The protection mechanism in this example is designed to cover one physical ISA failure. Each ISA is divided into four members. Three of those carry traffic during normal operation, while the fourth has resources reserved to accommodate traffic from one of the members in case of failure. When an ISA failure occurs, three members are delegated to the remaining ISAs. Each member from the failed ISA is mapped to a corresponding reserved member on the remaining ISAs.
The active-active ISA redundancy model supports multiple simultaneous failures. The protection mechanism shown in Figure 71 is designed to protect against two simultaneous ISA failures. In this case, each ISA is divided into six members, three of which carry traffic under normal circumstances, while the remaining three members have reserved memory resources.
Table 40 shows resource utilization for a single ISA failure in relation to the total number of ISAs in the system. The resource utilization affects only the scale of each ISA. However, bandwidth per ISA is not reserved, and each ISA can operate at full speed at any given time (with or without failures).
Number of Physical ISAs per System | Number of Member ISAs per Physical ISA (active/reserved) | Resource Utilization Per System in Non-Failed Condition | Resource Utilization Per System With One Failed ISA |
2 | 1A 1R | 50% | 100% |
3 | 2A 1R | 67% | 100% |
4 | 3A 1R | 75% | 100% |
5 | 3A 1R | 75% | 95% |
6 | 2A 1R | 66% | 83% |
7 | 2A 1R | 66% | 80% |
8 | 2A 1R | 66% | 79% |
9 | 1A 1R | 50% | 61% |
10 | 1A 1R | 50% | 60% |
11 | 1A 1R | 50% | 59% |
12 | 1A 1R | 50% | 58% |
13 | 1A 1R | 50% | 58% |
14 | 1A 1R | 50% | 57% |
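The simple arithmetic behind the table can be sketched as follows. For small systems these formulas reproduce the table rows exactly; for larger systems the published values differ slightly, presumably due to rounding and member-distribution granularity, so treat this as an approximation. The function names are ours:

```python
def utilization_no_failure(active, reserved):
    """Fraction of each ISA's resources usable in a non-failed state:
    only the active members carry traffic."""
    return active / (active + reserved)

def utilization_one_failure(num_isas, active, reserved):
    """After one ISA fails, its active members are re-homed onto the
    reserved members of the surviving ISAs."""
    return (num_isas * active) / ((num_isas - 1) * (active + reserved))

# 4 ISAs with 3 active + 1 reserved members each:
# 75% utilization normally, 100% after a single failure (as in Table 40)
print(utilization_no_failure(3, 1), utilization_one_failure(4, 3, 1))
```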
During the first five minutes of system boot-up or nat-group activation, the system behaves as if all ISAs are operational. Consequently, ISAs are segmented into their members according to the configured maximum number of supported failures.
Upon expiration of this initial five-minute interval, the system is re-evaluated. If one or more ISAs are found in a faulty state during re-evaluation, the members of the failed ISAs are distributed to the remaining operational ISAs.
When a failed ISA recovers, the system automatically accepts it and traffic is assigned to it. Traffic that is moved to the recovered ISA is interrupted.
Adding ISAs to an operational nat-group requires reconfiguration of the active mda-limit for the nat-group (or the failed mda-limit, for that matter). This is only possible when the nat-group is in an administratively shutdown state.
This section describes the interaction between MS-ISA applications and other system features.
All MS-ISA applications support service mirroring with no feature interactions or impacts. For example, any service diverted to AA, IPsec, NAT, LNS, or supported combinations of MS-ISA applications also supports service mirroring simultaneously.
Multiple uses of MS-ISAs can be combined at one time by daisy-chaining the MS-ISAs. Services and subscribers terminated on the LNS ISA are fully supported by Application Assurance per-AA-subscriber and service capabilities, and by the full NAT capabilities.
When Application Assurance and NAT are used in combination (for both ESM and SAP service contexts):
Subscriber aware Large Scale NAT44 attempts to combine the positive attributes of Large Scale NAT44 and L2-Aware NAT, namely:
Subscriber awareness in Large Scale NAT44 facilitates the release of NAT resources immediately after the BNG subscriber is terminated, without having to wait for the last flow of the subscriber to expire on its own (the TCP timeout is 4 hours by default).
The subscriber-aware Large Scale NAT44 function leverages the RADIUS accounting proxy built into the 7750 SR. The RADIUS accounting proxy allows the 7750 SR to inform the Large Scale NAT44 application about individual BNG subscribers from the RADIUS accounting messages generated by a remote BNG, and to use this information in the management of Large Scale NAT44 subscribers. The combination of the two allows, for example, the 7750 SR running as a Large Scale NAT44 to correlate the BNG subscriber (represented in the Large Scale NAT44 by the inside IP address) with RADIUS attributes such as User-Name, Alc-Sub-Ident-String, Calling-Station-Id or Class. These attributes can subsequently be used either for management of the Large Scale NAT44 subscriber or in the NAT RADIUS accounting messages generated by the 7750 SR Large Scale NAT44 application. Doing so simplifies both the administration of the Large Scale NAT44 and the logging function for port-range blocks.
As BNG subscribers authenticate and come online, the RADIUS accounting messages are snooped by the RADIUS accounting proxy, which creates a cache of attributes for each BNG subscriber. BNG subscribers are correlated with NAT subscribers via the framed IP address and one of the following attributes, which must be present in the accounting messages generated by the BNG:
The framed IP address must also be present in the accounting messages generated by the BNG.
The Large Scale NAT44 subscriber-aware application receives a number of cached attributes, which are then used for appropriate management of Large Scale NAT44 subscribers, for example:
Creation and removal of RADIUS accounting proxy cache entries related to a BNG subscriber are triggered by the receipt of accounting start/stop messages sourced by the BNG subscriber. Modification of entries can be triggered by interim-update messages carrying updated attributes. Cached entries can also be purged via the CLI.
In addition to passing one of the above attributes in Large Scale NAT44 RADIUS accounting messages, a set of opaque BNG subscriber RADIUS attributes can optionally be passed in Large Scale NAT44 RADIUS accounting messages. Up to 128 bytes of such opaque attributes are accepted; the remaining attributes are truncated.
Large Scale NAT44 subscriber instantiation can optionally be denied if the corresponding BNG subscriber cannot be identified in Large Scale NAT44 via the RADIUS accounting proxy.
Configuration guidelines:
Configure the RADIUS accounting proxy functionality in a routing instance that receives accounting messages from the remote or local BNG. Optionally, forward accounting messages received by the RADIUS accounting proxy to the final accounting destination (the accounting server).
Point the BNG RADIUS accounting destination to the RADIUS accounting proxy; this way, the RADIUS accounting proxy receives and snoops the BNG RADIUS accounting data.
A BNG subscriber can be associated with two accounting policies and can therefore point to two different accounting destinations: for example, one to the RADIUS accounting proxy and the other to the real accounting server.
Configure subscriber-aware Large Scale NAT44. From the Large Scale NAT44 subscriber-aware application, reference the RADIUS accounting proxy server and define the string that is used to correlate the BNG subscriber with the Large Scale NAT44 subscriber.
Optionally, enable NAT RADIUS accounting that includes BNG subscriber relevant data.
(1)
The RADIUS accounting proxy listens for accounting messages on the interface ‘rad-proxy-loopback’.
The name ‘proxy-acct’, as defined by the server command, is used to reference this proxy accounting server from Large Scale NAT44.
Received accounting messages can be relayed further from the RADIUS accounting proxy to the accounting server, which can be indirectly referenced in the default-accounting-policy ‘lsn-policy’.
This lsn-policy can then reference an external RADIUS accounting server with its own security credentials. This external accounting server can be configured in any routing instance.
(2)
Two RADIUS accounting policies can be configured in the BNG: one pointing to the real RADIUS server and the other to the RADIUS accounting proxy.
(3)
Sub-aware Large Scale NAT44 references the RADIUS accounting proxy server ‘proxy-acct’ and defines the calling-station-id attribute from the BNG subscriber as the matching attribute:
(4)
Optionally, RADIUS NAT accounting can be enabled:
Such a setup produces a stream of Large Scale NAT44 RADIUS accounting messages such as the following:
The matching accounting stream generated on the BNG is given below:
Note: The MAP-T feature and commands described in this section apply to the Nokia Virtualized Service Router (VSR) only.
MAP-T is a NAT technique defined in RFC 7599. Its key advantage is the decentralization of stateful NAT while enabling the sharing of public IPv4 addresses among the customer edge (CE) devices. In a nutshell, the CE performs the stateful NAT44 function and translates the resulting IPv4 packet into an IPv6 packet. The IPv6 packet is transported over the IPv6 network to the Border Router (BR), which then translates the IPv6 packet to IPv4 and sends it into the public domain.
As multiple CEs can share a single public IPv4 address, MAP-T must rely on an algorithm (the A+P algorithm running on the CEs and the BR) to ensure that each CE is assigned a unique port range on a shared public IPv4 address. In this way, each CE can be uniquely identified at the BR by the combination of the shared public IPv4 address and its unique port range. A set of CEs and a BR that share a common set of MAP algorithm rules constitutes a MAP domain. Figure 72 shows a network-level view of MAP-T.
MAP-T offers the following advantages mainly as a result of its stateless BR operation:
Mapping of address and port (MAP) is a generic function, regardless of the underlying transport mechanism (MAP-T or MAP-E) used. Each MAP CE is assigned as follows:
The CE and BR perform the following functions in the MAP-T domain:
Each device (CE and BR) is also responsible for fragmentation handling and ICMP error reporting (MTU too small, TTL expired, and so on).
MAP-T rules control the address translation in a MAP-T domain. The mapping rules can be delivered to the devices in the MAP domain using RADIUS or DHCP, or be statically provisioned.
The MAP-T rules are:
The public IPv4 address and the port-range information of the CE is encoded in its assigned IPv6 delegated prefix (IA-PD). The BMR holds the key to decode this information from the IA-PD of the CE. The BMR identifies the portion of bits of the IA-PD that contain the suffix of the IPv4 address and the port-set ID (PSID). These bits are called the EA bits. The PSID represents the port-range assigned to the CE.
The public IPv4 address of the CE is constructed by concatenating the IPv4 prefix carried in the BMR and the suffix, which is extracted from the EA bits within the IA-PD. The port-range is identified by the remaining EA bits (PSID portion). The EA bits uniquely identify the CE within the IPv6 network in a given MAP domain.
The psid-offset value must be set to a value greater than 0. It represents ports that are omitted from the mapping (for example, well-known ports).
An IPv4 address and port on the private side of the CE must be statefully translated to the public IPv4 address and a port within the port-set assigned to the public side of the CE. This ensures that the BR, using the same MAP rules, can extrapolate the IPv4 source of the packet for verification (anti-spoofing) purposes in the upstream direction and, conversely, determine the destination IPv6 MAP address (CE address) in the downstream direction (based on the destination IPv4 address and port).
The IPv6 MAP address is constructed by setting the subnet-id in the delegated IPv6 prefix to 0. In this way, the subnet-id of 0 is reserved for the MAP function. The remaining subnets can be delegated on the LAN side of the CE.
The interface-id is set to the IPv4 public address and PSID. This is described in RFC 7599, §6.
In this way, the IPv4 and IPv6 addresses of the CE are defined and easily converted to each other based on the BMR and the port information in the packet. Figure 73 shows the A+P mapping algorithm.
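The address construction above can be shown with a short Python sketch. This is a hypothetical helper following RFC 7597, §5.2 (subnet-id 0 in the delegated prefix, interface-id carrying the public IPv4 address and the left-zero-padded 16-bit PSID); the input values are illustrative.

```python
import ipaddress

def map_ce_ipv6(delegated_prefix, public_v4, psid):
    """Build the CE's MAP IPv6 address: subnet-id 0, interface-id =
    16 zero bits | 32-bit public IPv4 address | 16-bit PSID."""
    pd = ipaddress.IPv6Network(delegated_prefix)
    v4 = int(ipaddress.IPv4Address(public_v4))
    iid = (v4 << 16) | psid    # IPv4 and PSID occupy the low 48 bits
    return ipaddress.IPv6Address(int(pd.network_address) | iid)

# A CE with IA-PD 2001:db8:0:10::/60, public address 11.11.11.5, PSID 5:
print(map_ce_ipv6('2001:db8:0:10::/60', '11.11.11.5', 5))
# -> 2001:db8:0:10:0:b0b:b05:5
```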
Figure 74 shows a MAP-T deployment scenario.
The routes related to MAP-T are:
Routes related to MAP-T are advertised through IPv4 and IPv6 routing protocols. MAP-T routes in the VSR are owned by “protocol nat” with a metric of 50. This can be used to configure an export routing policy when advertising MAP-T routes in IGP or BGP.
Multiple MAP-T domains can be supported in the same routing context.
Note: IPv6 IA-PD end-user prefixes are carved out of the IPv6 rule prefix. Aside from MAP-T, the IA-PD is used for native IPv6 end-to-end traffic outside of MAP-T. Although the IPv6 rule prefix is not marked as a NAT route in the routing table, it is nonetheless advertised in the upstream direction.
In the upstream direction, when the BR receives an IPv6 packet destined for the BR prefix, a source-based IPv6 address lookup (anti-spoofing) is performed to verify that the packet was sent by a legitimate CE.
In the downstream direction, a destination-based IPv4 lookup is performed. This leads to the MAP-T rule entry, which provides the information necessary to derive the IPv6 address of the destination CE.
The MAP-T forwarding function in the VSR is also responsible for:
In address-sharing scenarios, address translation is performed for TCP/UDP and a subset of ICMP traffic; everything else is dropped. In contrast, 1:1 and prefix-sharing scenarios are protocol agnostic.
An IPv6 address of the MAP-T node is constructed according to RFC 7597, §5.2 and RFC 7599, §6. Figure 75 shows the IPv6 address of the MAP-T node.
The subnet ID for a MAP node (CE) is set to 0. Figure 76 shows the node interface (PSID is left-padded with zeros to create a 16-bit field and the IPv4 address is the public IPv4 address assigned to the CE).
This constructed IPv6 address represents the source IPv6 address of traffic sent from the CE to BR (upstream direction), and the destination IPv6 address in the opposite direction (downstream traffic sent from the BR to the CE).
The source IPv6 address in the downstream direction is a combination of the BR IPv6 prefix and the source IPv4 address (per RFC 7599, §5.1) received in the original packet.
The destination IPv6 address in the upstream direction is a combination of the BR IPv6 prefix and the IPv4 destination address (RFC 7599, §5.1) in the original packet.
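The BR-side address mapping can be sketched the same way. RFC 7599 refers to RFC 6052 for embedding an IPv4 address in the BR prefix; the /64 case is shown below, and the prefix and address values are illustrative.

```python
import ipaddress

def br_map_addr(br_prefix, v4):
    """Embed an IPv4 address in a /64 BR prefix (RFC 6052 embedding, as
    used for the BR prefix in MAP-T): the IPv4 address occupies bits
    72-103 and the 'u' octet (bits 64-71) is zero."""
    p = ipaddress.IPv6Network(br_prefix)
    assert p.prefixlen == 64    # other prefix lengths shift the IPv4 bits
    return ipaddress.IPv6Address(
        int(p.network_address) | (int(ipaddress.IPv4Address(v4)) << 24))

# Downstream source / upstream destination for IPv4 host 192.0.2.1:
print(br_map_addr('2001:db8:ffff::/64', '192.0.2.1'))
# -> 2001:db8:ffff:0:c0:2:100:0
```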
1:1 translations refer to the case in which each CE is assigned a distinct public IPv4 address; that is, there is no public IPv4 address sharing between the CEs. In this case, the PSID field is 0 and the sum of lengths for the IPv4 rule prefix and EA bits is 32. In other words, all the EA bits represent the IPv4 suffix. The public IPv4 address of the CE is created by concatenating the Rule IPv4 Prefix and the EA bits.
IPv4 Prefix translations refer to the case where an IPv4 prefix is assigned to a CE. In this case, the PSID field is 0 and the sum of the lengths for IPv4 rule prefix and EA bits is less than 32.
In both preceding cases, the translations are protocol agnostic; all protocols, not just TCP/UDP and ICMP, are translated.
The BR supports hub-and-spoke topology, which means that the BR facilitates communication between MAP-T CEs.
Rule prefix overlap is not supported because it can cause lookup ambiguity. Figure 77 shows a rule prefix overlap example.
When rule IPv6 prefix 1 is a subset of rule IPv6 prefix 2, the overlap between the EA bits in end-user prefix 2 and rule prefix 1 (the shaded sections in Figure 77) could render end-user prefixes 1 and 2 indistinguishable (everything else being equal) when the anti-spoof lookup is performed in the upstream direction, resulting in an incorrect anti-spoofing decision.
Similar logic applies to overlapping IPv4 prefixes in the downstream direction, where the longest prefix match always leads to the same CE, while the shorter match (leading to a different CE) is never evaluated.
This section examines an example MAP-T deployment with three MAP rules. The deployment assumes the following:
The 12,000 private IPv4 addresses (CEs) in a 16:1 sharing scenario can be covered using three /24 subnets as follows:
(3 * 2^8 * 16 = 12,288)
The IPv4 rule prefix and EA bits length per rule in this scenario are:
With a psid-offset of 6, ports whose first 6 bits of the 16-bit port number are all 0 are excluded from the mapping (ports 0 to 1023 are reserved); therefore, the user-allocated port space per CE is calculated as follows:
65,536 / 16 - 64 = 4096 - 64 = 4032 ports
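The sizing above can be checked with simple arithmetic (Python; the values come from this example):

```python
# Three /24 IPv4 rule prefixes at a 16:1 sharing ratio cover the
# 12,000 CEs in the example:
assert 3 * 2 ** 8 * 16 == 12_288

# Per-CE port space: 65,536 ports / 16 CEs = 4,096, minus the 64 ports
# per CE that fall in the excluded psid-offset range (ports 0-1023):
assert 65536 // 16 - 64 == 4032
```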
The IPv6 rule prefix is the next parameter in the MAP rule. Figure 78 shows the relevant bits in the IPv6 address: only bits /32 to /64 are considered; the irrelevant bits of the IPv6 addresses are ignored in this example.
The following three rules are created in this example:
In each of the three cases, the EA bits occupy the bits between the IPv6 rule prefix length (/48) and the PD length (/60).
The IPv6 rule prefix length is determined for each of the three rules. However, the IPv6 rule prefixes must not overlap (see section 7.22.4.4 for more information). Non-overlapping IPv6 rule prefixes ensure that each CE is assigned a unique IA-PD. Table 41 describes the rules.
| Rule 1 | Rule 2 | Rule 3
IPv6 rule prefix | 2001:db8:0000::/48 | 2001:db8:0001::/48 | 2001:db8:0002::/48
IPv4 rule prefix | 11.11.11.0/24 | 12.12.12.0/24 | 13.13.13.0/24
EA bits | 12 | 12 | 12
psid-offset | 6 | 6 | 6
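Decoding an IA-PD back to its public IPv4 address and PSID under rule 1 can be sketched as follows. This is a hypothetical helper, not SR OS code; the IA-PD value is illustrative, while the rule values come from this example.

```python
import ipaddress

def decode_ia_pd(ia_pd, rule_v6, rule_v4):
    """Extract the public IPv4 address and PSID from a delegated prefix
    using the BMR: the EA bits sit between the IPv6 rule prefix length
    and the IA-PD length, with the IPv4 suffix first and the PSID last."""
    pd = ipaddress.IPv6Network(ia_pd)
    r6 = ipaddress.IPv6Network(rule_v6)
    r4 = ipaddress.IPv4Network(rule_v4)
    ea_len = pd.prefixlen - r6.prefixlen             # 60 - 48 = 12 EA bits
    ea = (int(pd.network_address) >> (128 - pd.prefixlen)) & ((1 << ea_len) - 1)
    psid_len = ea_len - (32 - r4.prefixlen)          # 12 - 8 = 4-bit PSID
    suffix, psid = ea >> psid_len, ea & ((1 << psid_len) - 1)
    return str(ipaddress.IPv4Address(int(r4.network_address) | suffix)), psid

# IA-PD 2001:db8:0:150::/60 under rule 1 (EA bits = 0x015):
print(decode_ia_pd('2001:db8:0:150::/60', '2001:db8::/48', '11.11.11.0/24'))
# -> ('11.11.11.1', 5)
```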
The final step is to ensure that the DHCPv6 server hands out proper end-user prefixes (IA-PD) and that the MAP rules are also delivered.
In this example, each /48 IPv6 rule prefix supports 4,000 MAP-T CEs, where each CE can further delegate 15 IPv6 “subnets” on the LAN side and each CE is allocated about 4,000 ports to use in stateful NAT44.
Note: The VSR-BR supports only IPv6 rule prefixes of the same length within a given domain. To accommodate a different prefix length assignment for the IA-PD (for example, /56), create another domain with a different IPv6 rule prefix length (/44 instead of /48).
The following ICMPv4 messages are supported in MAP-T on the VSR; other types of ICMP messages are not supported:
The ICMP Query messages and ICMP Error messages are supported regardless of whether they are just passing through a VSR (transit messages), or are terminated or generated in or from a VSR.
The NAT-related ICMPv4 behavior is described in RFC 5508. The following NAT messages are supported in the MAP-T VSR (RFC 5508, §7, Requirement 10a):
The IPv6 header of the IPv4-translated packet in MAP-T can be up to 28 bytes larger than the IPv4 header (40-byte IPv6 header plus 8-byte fragmentation header versus a 20-byte IPv4 header). When the IPv4-to-IPv6 translated packet would be larger than the IPv6 MTU, the original IPv4 packet is fragmented so that each translated IPv6 packet stays within the IPv6 MTU. IPv6 packets are never fragmented themselves, although they may contain the fragmentation header that carries fragmentation information related to the original IPv4 packet or fragment.
The IPv6 MTU in the VSR is configurable for each MAP-T domain. The L2 header is excluded from the IPv6 MTU.
All fragments of the same IPv4 packet are translated and sent toward the same CE. Because the second and subsequent fragments do not contain any port information, their translation is based on <SA, DA, Prot, Ident> flow records cached from the IPv4 header.
Note that the VSR may further fragment an IPv4 fragment that it has received in order to fit it within the IPv6 MTU.
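The downstream flow-record cache described above can be modeled minimally. This is illustrative Python only; the function name and field values are assumptions, not SR OS internals.

```python
# Only the first IPv4 fragment carries the destination port, so the BR
# caches the translation decision keyed on <SA, DA, Prot, Ident> and
# applies it to the remaining fragments of the same packet.
flow_cache = {}

def classify_fragment(sa, da, prot, ident, dport=None):
    key = (sa, da, prot, ident)
    if dport is not None:           # first fragment: learn the flow
        flow_cache[key] = dport
    return flow_cache.get(key)      # later fragments reuse the record

# The first fragment carries destination port 1344; the second fragment
# has no ports but still resolves to the same CE:
classify_fragment('198.51.100.7', '11.11.11.1', 17, 0x1234, dport=1344)
assert classify_fragment('198.51.100.7', '11.11.11.1', 17, 0x1234) == 1344
```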
Figure 79 shows downstream fragmentation scenarios.
In the upstream direction, the received IPv6 fragments are artifacts of IPv4 packets that were fragmented on the CE side before being translated into IPv6. No flow caching is performed in the upstream direction. The BR performs an anti-spoof check on each fragment and, if the check succeeds, translates the fragment to IPv4. Figure 80 shows the upstream fragmentation scenario.
Fragmentation statistics can be cleared using the clear nat map frag-stats command. The following fragmentation statistics are available:
The MSS adjust feature is used to prevent fragmentation of TCP traffic. TCP synchronize (SYN) packets are intercepted and their MSS option is inspected to ensure that it conforms to the configured MSS value. If the inspected value is greater than the value configured in the VSR BR, the MSS value in the packet is lowered to the configured value before the TCP SYN packet is forwarded.
As the end nodes governing the MSS value are IPv4 nodes, this feature is supported for IPv4 packets only.
An MSS adjust is performed in both the upstream and downstream directions.
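The adjustment itself reduces to clamping the SYN's MSS option, as the sketch below shows. The 1432-byte figure is an assumed example: a 1500-byte IPv6 MTU minus the 40-byte IPv6 header, 8-byte fragmentation header, and 20-byte TCP header.

```python
def clamp_mss(advertised_mss, configured_mss):
    """MSS adjust: lower the MSS option in a TCP SYN only when it
    exceeds the configured ceiling; smaller values pass unchanged."""
    return min(advertised_mss, configured_mss)

assert clamp_mss(1460, 1432) == 1432   # rewritten down to the ceiling
assert clamp_mss(1400, 1432) == 1400   # already conformant, untouched
```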
The VSR BR maintains a count of forwarded and dropped packets and octets per MAP-T domain, per direction. The statistics are collected on ingress (upstream IPv6 and downstream IPv4) and stored in 64-bit counters.
As with any NAT operation where the identity of the user is hidden behind the NAT identity, logging of the NAT translation information is required. In the MAP-T domain, NAT logging is based on configuration changes because the user identity can be derived from the configured rules.
A system can have a large number of rules and each configured MAP rule generates a separate log. As a result, the amount of logs generated can be substantial. Logging is explicitly enabled using a log event.
A NAT log contains information about the following:
A MAP rule log is generated when both of the following conditions are met:
Example:
A valid MAP-T license is required to enable the MAP-T functionality in the VSR BR. A MAP-T domain can only be instantiated with the appropriate license, which enables the following CLI command:
The MAP-T configuration consists of defining MAP-T parameters within a template. The MAP-T domain is then instantiated by applying (referencing) this template within a routing (router or VPRN) context.
Defining a MAP Domain Template
MAP-T Domain Instantiation
MAP Domain Example Template
The following example shows the MAP domain template for the BMRs defined in section 7.22.5.
MAP-T Domain Instantiation Example
The following example shows the MAP-T domain instantiation for the BMRs defined in section 7.22.5.
You can add new rules to an existing MAP-T domain while the MAP-T domain is instantiated and forwarding traffic. However, each rule must be in the shutdown state before any of its parameters are modified.
A MAP-T domain must be in the shutdown state to modify the dmr-prefix parameter. The remaining parameters (tcp-mss-adjust, mtu, ip-fragmentation) can be modified while the domain is active.
A MAP domain does not have to be in a shutdown state when rule modification is in progress.
MAP natively provides multi-chassis redundancy through the use of the anycast BR prefix that is advertised from multiple nodes.
Because no state is maintained in the MAP-T BR, any BR node can process traffic for the same domain at any time. The only traffic interrupted during a switchover is downstream fragmented traffic being handled at the moment of the switchover (the flow record cache is not synchronized between nodes).