Elsevier

Computer Networks

Volume 54, Issue 1, 15 January 2010, Pages 133-149
Computer Networks

Tuning SCTP failover for carrier grade telephony signaling

https://doi.org/10.1016/j.comnet.2009.08.016Get rights and content

Abstract

The Stream Control Transmission Protocol (SCTP) has not only been selected as the signaling transport protocol of choice in IETF SIGTRAN, the architecture that bridges circuit-switched and IP-based mobile core networks, but also plays a pivotal role in SAE/LTE, the next-generation UMTS/HSPA networks. To meet the redundancy requirements of telecom signaling traffic, SCTP includes a failover mechanism that enables rerouting of traffic from an unreachable network path to a backup path. However, the recommendations provided by IETF on how to configure the SCTP failover mechanism to meet telecom signaling requirements are kept quite general and leave much of the tuning to the telecom equipment vendor and/or operator. Several works by us and others have been carried out to study the effect of different SCTP parameters on the failover performance. The main contribution of this paper is that it gives a coherent treatment of how to configure the SCTP failover mechanism for carrier-grade telephony signaling, and provides practically usable configuration recommendations. The paper also discusses an alternate or complementary way of optimizing the SCTP failover mechanism by relaxing the exponential backoff that foregoes a retransmission timeout in SCTP. Some results showing significantly reduced failover times by use of this mechanism, with only marginal deteriorating effects on a signaling network, are discussed and analyzed in the paper.

Introduction

A large number of technology market research firms forecasts that future telecommunication will, in terms of revenue, be dominated by mobile broadband networks. Today, the mobile broadband market is dominated by 3GPP-based technologies such as UMTS and HSPA. Informa, an independent market analyst firm, forecasts that by year end 2012, there will be 1.3 billion UMTS/HSPA connections representing 78% of the 3G mobile broadband market [24]. In comparison, 56.8 million total connections to mobile WiMAX are anticipated by year end 2012 [24].

A large part of current mobile core networks are built upon legacy TDM- and ATM-based circuit-switched technologies. However, future-generation core networks are expected to be all IP. To make an incremental transition possible, IETF introduced the SIGnaling TRANsport (SIGTRAN) architecture [25]. The SIGTRAN architecture separates voice traffic from control traffic,1 and specifies how circuit-switched signaling, i.e., Signaling System Nr. 7 (SS7) signaling, is transported over, or terminated in, an IP core network. Currently, around 50% of all shipped SS7 stacks are SIGTRAN stacks, however, this is anticipated to rise up to more than 75% by the end of 2012 [7].

To be able to comply with the requirements of SS7, SIGTRAN introduced a new transport protocol, the Stream Control Transmission Protocol (SCTP). Since its introduction in SIGTRAN, SCTP has also been selected as one of the transport protocols to be used in SAE/LTE, the next-generation UMTS/HSPA network. A key feature of SCTP that sets it apart from TCP and UDP, the two standard Internet protocols, is multihoming. Normally, traffic is sent on a primary network path. However, if this path goes down, traffic is diverted to a secondary, backup, path. To meet the delay and timing requirements of SS7, ITU-T implicitly prescribes that a failover from the primary to a secondary path should take no more than 2 s [16]. However, being considered as a general transport protocol by IETF, SCTP configured as recommended by RFC 4960 [30] exhibits a minimum failover time of 63 s. Several works [14], [20], including IETF [6], have given recommendations on how to configure SCTP for telecom applications, but they are mostly quite vague in their recommendations [6], or focus on one or a few parameters and factors [14], [20]. Many of the findings presented in this paper go back to earlier work by us and others, however, this paper provides a coherent treatment of this subject and translates the results into concrete recommendations. The key factors behind these recommendations are illustrated with experimental results. The paper also reports on a proposal by us that can substantially decrease SCTP failover times by making the backoff factor for the retransmission timer configurable. The possible performance benefit of this proposal is verified by simulation results.

The recommendations in the paper point out that a strict configuration of the SCTP protocol parameters is crucial to reach the signaling application timing requirements. On the other hand, the paper acknowledges the fact that a strict tuning increases the risk of spurious failovers. This is the reason why this tuning has to be carefully conducted. The proposal for a modified RTO policy is motivated as a way to enable a slightly more liberal tuning of the protocol parameters. In the paper, it is shown that by reducing the backoff factor from its default value of 2 to a value in between 1.5 and 2 it is possible to meet the demands of signaling applications using a less strict parameter tuning, and this without violating network stability or performance.

The remainder of the paper is organized as follows: Section 2 gives an overview of SCTP, focusing on multihoming and the failover mechanism. Some of the results presented in this paper are illustrated with experiments, and Section 3 describes the common parts of the experiment setups used in these experiments. The next two sections constitute the core of the paper; Section 4 discusses how to tune SCTP protocol parameters to make SCTP meet telecom signaling failover requirements, while Section 5 considers the impact of external factors, such as network and traffic characteristics, on the SCTP failover performance. Next, Section 6 surveys our proposal for a new backoff mechanism. The paper concludes in Section 7 by summarizing the configuration recommendations given in the paper and recommending avenues for future work.

Section snippets

Background on SCTP and SCTP failover

As already mentioned, the Stream Control Transmission Protocol (SCTP) was initiated as a transport protocol to meet the robustness, reliability and timing demands set by telephony signaling applications. To fulfill requirements on robustness, multihoming is one of the new features of the protocol. Further, to distinguish temporary congestion from path failure a tunable failover mechanism is part of the protocol. This section presents the SCTP protocol with an emphasis on its multihoming and

Experiment setup

Several of the results presented in this study refer to results earlier presented in other papers. To illustrate some of these results, we have repeated and extended some experiments. We employed a similar network topology for all experiments presented in the paper, and this topology is outlined in Fig. 2.

A dual-homed association was setup between a traffic source and a traffic sink. The traffic source was able to generate traffic with different intensities and distributions. The bandwidth

SCTP protocol parameters impacting failover performance

This section focuses on possible optimizations in failover performance through tuning of the different protocol parameters mentioned in Section 2.2. One of the tunable parameters present in SCTP, PMR, defines the acceptable number of consecutive timeouts before considering the path unavailable. Furthermore, the failover time is tightly connected to the RTO timer, which is tuned dynamically during the transfer, smoothly adapting to the current RTT times. This parameter is, however, partly

Traffic and network properties impacting the failover performance

To give a complete view of the mechanisms impacting the failover performance, we in this section consider some studies performed on the impact of network and traffic properties. Although network properties usually are relatively constant during the transfer of data, experiences from this aspect are important when configuring the network as well as to dimension the traffic load. Further, these results may be important when tuning protocol parameters to detect plausible interaction effects.

Tuning of the RTO backoff factor

In the previous sections, the importance of careful tuning of protocol parameters together with concerns about traffic and network properties when designing the network have been pointed out. As discussed in Section 4, a restriction of the number of consecutive retransmission timeouts is one of the crucial tunings to meet the ITU-T timing requirements. On the other hand, the tight restriction, necessary to meet the timing demands, reduces the resilience of the protocol against congestion, and

Conclusions

The SCTP transport protocol plays an important role, not only in the transition from circuit-switched to IP-based technologies in current mobile core networks, but also in the future mobile core networks. One of the most important features of SCTP is its support for multihoming which provides for network path redundancy; when the currently used path, the primary path, becomes unreachable, SCTP includes a failover mechanism which reroutes the traffic to a backup path. Being standardized by IETF,

Acknowledgements

The authors would like to thank Dr. Rainer Ludwig at Ericsson Research for valuable input and support during parts of this work. Furthermore, some of the experiments presented in this paper have been conducted on the Emulab [32] testbed. We thank the Flux Research group at the University of Utah for providing and supporting this facility. Parts of the work in this paper has been supported by grants from the Knowledge Foundation of Sweden with Tieto and Ericsson as industrial partners.

Johan Eklund (1966) received his MSc degree in Computer Science from Karlstad University(KaU), Sweden in 2004. Currently he works on his PhD at the department of Computer Science at KaU. His fields of interests are performance and robustness issues in signaling traffic over IP.

References (32)

  • H. Al-Ammal et al.

    Binary exponential backoff is stable for high arrival rates

    Lecture Notes in Computer Science (LNCS)

    (2000)
  • D.J. Aldous

    Ultimate instability of exponential backoff protocol for acknowledgement-based transmission control of random access communication channels

    IEEE Transactions on Information Theory

    (1987)
  • M. Allman, V. Paxson, W. Stevens, RFC 2581: TCP Congestion Control, April...
  • A.T. Andersen, Modelling of Packet Traffic with Mastrix Analytic Methods, PhD thesis, Technical University of Denmark...
  • R.Braden, RFC 1122: Requirements for Internet Hosts – Communication Layers, October...
  • L. Coene, J. Pastor-Balbas, RFC 4166: Telephony Signaling Transport over Stream Control Transmission Protocol (SCTP)...
  • Venture Development Corporation, SIGTRAN Stacks Displacing SS7 in the Market, May...
  • J. Eklund, A. Brunstrom, Impact of SACK delay and link delay on failover performance in SCTP, in: The Third IASTED...
  • J. Eklund, A. Brunstrom, On the relation between SACK delay and SCTP failover performance for different traffic...
  • L.A. Goldberg et al.

    Analysis of practical backoff protocols for contention resolution with multiple servers

    Journal of Computer Systems Sciences

    (1999)
  • J. Goodman et al.

    Stability of binary exponential backoff

    Journal of the ACM

    (1988)
  • K.-J. Grinnemo, Transport Services for Soft Real-Time Applications in IP Networks, PhD thesis, Karlstad University,...
  • K.-J. Grinnemo, A. Brunstrom, Impact of traffic load on SCTP failovers in SIGTRAN, in: International Conference on...
  • K.-J. Grinnemo, A. Brunstrom, Performance of SCTP-controlled failovers in M3UA-based SIGTRAN networks, in: Advanced...
  • J. Hastad et al.

    Analysis of backoff protocols for multiple access channels

    SIAM Journal of Computing

    (1996)
  • ITU-T. Q.706: Specifications of Signaling System No. 7 – Message Transfer Part Signaling Performance, ITU-T, March...
  • Cited by (0)

    Johan Eklund (1966) received his MSc degree in Computer Science from Karlstad University(KaU), Sweden in 2004. Currently he works on his PhD at the department of Computer Science at KaU. His fields of interests are performance and robustness issues in signaling traffic over IP.

    Karl-Johan Grinnemo (1968) received his M.Sc. in Computer Science and Engineering from the Linköping Institute of Technology, Sweden, in 1994. In 2006, he received his Ph.D. in Computer Science from Karlstad University.

    Karl-Johan has worked almost fifteen years as engineer in the telecom industry; first at Ericsson and then as consultant at Tieto. Currently, he is on leave from Tieto and works as acting associate professor at the Shool of Information and Communication Technology, KTH. Karl-johan has published more than a dozen conference and journal papers.

    Stephan Baucke received his Master degree in computer science in 1997 from the Technical University of Aachen, Germany. He is currently working as a Senior Researcher at Ericsson Research, Germany. His research interests include wireless IP networking and protocol optimization, IP signaling, and virtualization technologies.

    Anna Brunstrom (1967) received a B.Sc. in Computer Science and Mathematics from Pepperdine University, CA, in 1991, and a M.Sc. and Ph.D. in Computer Science from College of William & Mary, VA, in 1993 and 1996, respectively.

    She joined the Department of Computer Science at Karlstad University, Sweden, in 1996, where she is currently a Full Professor and Research Manager for the Distributed Systems and Communications Research Group. She has a background in distributed systems, but her main area of work over the last years has been in computer networking with a focus on transport protocol design, QoS issues, cross-layer interactions, wireless communication and network security. She has authored/coauthored ten book chapters and over 80 international journal and conference papers.

    View full text