An analysis of the impact of network device buffers on packet schedulers through experiments and simulations

https://doi.org/10.1016/j.simpat.2017.09.008

Abstract

Keeping the delay experienced by packets travelling from a source to a destination below certain thresholds is nowadays essential to successfully deliver a number of Internet services. Most of the packet delay can usually be ascribed to the time spent in the many queues encountered along the path. In this context, the term bufferbloat has recently been coined to denote the uncontrolled growth of the queuing time due, among other causes, to excessively large buffers and to the tendency of TCP to increase its sending rate until a packet is dropped. In this paper, we focus on the queues employed by the traffic control infrastructure and by the network device drivers. Reducing the queuing time due to the former has been the objective of a plethora of scheduling algorithms developed in the past years and referred to as Active Queue Management (AQM) algorithms. The impact of the additional queuing in the buffer of the network device driver on performance, and on the effectiveness of AQM algorithms, has instead received much less attention. In this paper, we report the results of an experimental analysis conducted to gain better insight into the impact that network device buffers (and their size) have on performance. We also give an in-depth presentation of Dynamic Queue Limits (DQL), an algorithm recently introduced in the Linux kernel to dynamically adapt the size of the buffers held by network device drivers. Our experiments show that DQL not only reduces the queuing time in the network device buffers, which is essential to ensure the effectiveness of AQM algorithms, but also keeps latency stable, which is important to reduce jitter. To faithfully reproduce through simulations the dynamics revealed by our experimental study, we implemented DQL for the popular ns-3 network simulator. We describe the design of this implementation and report the results of a simulation study showing the ability of the simulator to accurately reproduce the experimental results.

Introduction

Network devices make wide use of buffers to temporarily store packets waiting to be transmitted. The size of such buffers heavily influences network performance, primarily packet delay and loss probability. The buffer size should be large enough to accommodate packet bursts, but not excessively large, so as to prevent the latency experienced by packets from growing in an uncontrolled manner, a problem commonly referred to as bufferbloat [1]. A common rule-of-thumb is to make the buffer size proportional to the link bandwidth (the so-called Bandwidth-Delay Product). However, the use of this rule has been questioned, e.g., in [2], where the authors argue that much smaller buffers can be employed. In practice, the availability of memory chips at low cost often leads to the use of oversized buffers.
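The rule-of-thumb and the small-buffer refinement of [2] are simple to work out numerically. The sketch below uses illustrative figures (a 1 Gbps link and a 100 ms round-trip time are assumptions, not values taken from this paper's testbed):

```python
import math

def bdp_bytes(bandwidth_bps: float, rtt_s: float) -> float:
    """Buffer size suggested by the classic Bandwidth-Delay Product rule."""
    return bandwidth_bps * rtt_s / 8  # convert bits to bytes

# Classic rule-of-thumb: 1 Gbps link, 100 ms RTT.
buf = bdp_bytes(1e9, 0.100)
print(f"BDP buffer: {buf / 1e6:.1f} MB")  # 12.5 MB

# The small-buffer argument of [2]: with n long-lived flows,
# a buffer of BDP / sqrt(n) is claimed to suffice.
small = buf / math.sqrt(10_000)
print(f"with 10000 flows: {small / 1e3:.1f} KB")  # 125.0 KB
```

The three orders of magnitude between the two answers illustrate why buffer sizing remains contentious in practice.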

Limiting the queuing delay resulting from the use of large buffers has been the objective of a plethora of Active Queue Management (AQM) algorithms defined by the research community over the last decades. RED (Random Early Detection) [3] was one of the first AQM algorithms to be proposed and has been followed by a number of variants, such as Adaptive RED [4], Gentle RED [5] and Nonlinear RED [6]. RED and its derivatives can be classified [7] as queue-based schemes, since they aim to maintain the queue length at a target level. Other algorithms, such as BLUE [8] and AVQ (Adaptive Virtual Queue) [9], instead use the packet arrival rate as a congestion measure and can hence be classified as rate-based schemes. A combination of the two approaches is adopted by algorithms such as REM (Random Exponential Marking) [10]. More recently, CoDel (Controlled Delay) [11] has been proposed to specifically address the bufferbloat problem. CoDel aims to keep the queuing delay below a target value and drops, at dequeue time, packets whose queuing delay exceeds that threshold. Furthermore, a few AQM algorithms, such as SFB (Stochastic Fair Blue) [12], AFCD (Approximated-Fair and Controlled Delay) [13] and FQ-CoDel [14], were proposed with the additional aim of providing fairness among different flows.

The interaction with the TCP congestion control mechanism has been taken into account in the design of some AQM algorithms. The PI (Proportional Integral) controller [15] is among the earliest of such approaches and is based on a linearized dynamic model of a TCP/AQM system. PIE (Proportional Integral controller Enhanced) [16] computes the drop probability based on both the departure rate and the queue length; its stability is demonstrated using a TCP fluid model. PI2 [17] extends PIE with the objective of improving the coexistence between classic congestion controls (TCP Reno, Cubic, etc.) and scalable congestion controls (Data Center TCP). In [18], the authors evaluate the interaction between AQM algorithms (such as CoDel) and low-priority congestion control techniques (such as LEDBAT [19]) through both experiments and simulations. A theoretical analysis is instead presented in [20] to study the interaction between Compound TCP and REM.

AQM algorithms have been implemented in real systems, such as the Linux operating system, as queuing disciplines within the Traffic Control (TC) infrastructure [21]. As such, AQM algorithms manage packets before they are handed to the device driver. The latter employs its own buffer, usually called the transmission ring, to avoid starving the network interface. Thus, packets are enqueued again after being scheduled by the queuing discipline. In addition, a flow control strategy regulates the passing of packets from the queuing discipline to the transmission ring. While the interactions between TCP congestion control and AQM algorithms have been addressed by a number of papers, the impact of this additional queuing and of the related flow control remains largely unexplored. To the best of our knowledge, only [22] recognizes the presence of the transmission rings, but its authors model them simply as FIFO queues to evaluate their impact on the service guarantees of fair-queuing schedulers.

The goal of this paper is to fill this gap by evaluating the impact of transmission rings and flow control on network performance in general, and on the effectiveness of AQM algorithms in particular. We conducted a thorough experimental campaign, which provided us with a number of insights. For instance, if the queuing discipline does not differentiate among traffic flows and its capacity is not saturated, the size of the transmission ring has no impact on network performance; when the queuing discipline assigns different priority levels to flows, the latency experienced by prioritized flows increases with the size of the transmission ring. It turns out, however, that sizing the transmission ring is not trivial: too small a ring causes throughput loss, while too large a ring causes high packet delay. To complicate things further, for a given transmission ring size, the packet delay is affected by multiple factors, including packet size and CPU load. Also, rather unexpectedly, latency may exhibit a large variance and exceed by far the sum of the waiting times in the queuing discipline and in the transmission ring.

Another contribution of this paper is an in-depth description and experimental evaluation of Dynamic Queue Limits (DQL), a mechanism recently introduced in the Linux kernel with the goal of dynamically computing the amount of data to store in the transmission ring in order to prevent starvation. Our experiments showed that DQL is effective in keeping latency low when AQM algorithms or priority packet schedulers are used, while it has no impact when a queuing discipline that enqueues all packets in a single queue is used below its capacity. Also, when DQL is used, latency turns out to be rather stable and equal to the sum of the waiting times in the queuing discipline and in the transmission ring.

Furthermore, to allow simulation studies to faithfully reproduce the dynamics revealed by our experimental campaign, we implemented DQL inside the ns-3 network simulator and present the design of this implementation in this paper. This work builds upon our previous effort to introduce a traffic control layer resembling the Linux TC infrastructure within ns-3 [23]. With the introduction of the traffic control layer and DQL, ns-3 is now a reliable tool to evaluate the performance of AQM algorithms and higher layer protocols in terms of delay, packet loss and throughput. To this end, we present a simulation study we conducted to show that ns-3 simulations reproduce the experimental results rather accurately. We note that having DQL implemented in ns-3 also makes it possible to evaluate its performance in contexts, such as Wi-Fi networks, that could not be tested with real experiments because of the lack of support from the Linux drivers.

The rest of this paper is structured as follows. Section 2 provides some background information about how queuing works in Linux, while Section 3 presents an in-depth description of DQL. Section 4 illustrates the design of our implementation of DQL in ns-3. Section 5 presents the results of our experimental campaign, while Section 6 presents the results of the simulation study we conducted with ns-3. Finally, Section 7 concludes the paper.

Section snippets

Background

Applications write data into their socket buffers to have it delivered to the destination. On Linux systems, the size of a socket buffer is kept small by a mechanism called TCP Small Queues (TSQ). Basically, an application is prevented from writing data into a socket buffer if that socket already has (approximately) two packets not yet transferred to the network interface card. The goal of this mechanism is to reduce round-trip times and hence mitigate the bufferbloat issue. Data is
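The two-packet throttling just described can be sketched as a toy model. The class and method names below are illustrative, and the fixed two-packet cap is a simplification (real TSQ uses a byte budget):

```python
class TsqSocket:
    """Toy model of TCP Small Queues throttling for a single socket."""

    TSQ_LIMIT_PACKETS = 2  # simplified: the kernel actually budgets bytes

    def __init__(self) -> None:
        # Packets handed to the stack but not yet transmitted by the NIC.
        self.in_flight = 0

    def try_send(self) -> bool:
        """Return True if the socket may queue one more packet now."""
        if self.in_flight >= self.TSQ_LIMIT_PACKETS:
            return False  # throttled: must wait for a TX completion
        self.in_flight += 1
        return True

    def tx_completed(self) -> None:
        """The NIC signalled that one packet left the device."""
        self.in_flight -= 1

s = TsqSocket()
assert s.try_send() and s.try_send()  # two packets admitted
assert not s.try_send()               # a third is throttled
s.tx_completed()
assert s.try_send()                   # a completion frees one slot
```

The point of the model is that back-pressure reaches the application as soon as two packets are outstanding, so data queues up in the application rather than in the kernel.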

Dynamic queue limits

Dynamic Queue Limits is a mechanism recently introduced in the Linux kernel to dynamically determine the maximum number of bytes (denoted as the “limit”) that may be stored in the transmission ring of a device. The goal is to keep this amount around the minimum value that avoids starvation. The computed limit is enforced by exploiting the existing flow control mechanism. Given that the limit is computed in bytes, the algorithm is also known as BQL (Byte Queue Limits).
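The core idea can be sketched as follows. This is loosely modelled on `lib/dql.c` in the Linux kernel, but the adjustment rules are condensed for illustration and are not the exact kernel algorithm:

```python
class Dql:
    """Simplified sketch of Dynamic (Byte) Queue Limits."""

    def __init__(self, limit: int = 1500, max_limit: int = 64 * 1024) -> None:
        self.limit = limit        # current byte limit for the transmission ring
        self.queued = 0           # bytes handed to the device, not yet completed
        self.max_limit = max_limit

    def avail(self) -> int:
        """Bytes the driver may still enqueue before flow control stops it."""
        return self.limit - self.queued

    def queue(self, nbytes: int) -> None:
        """Record bytes pushed into the transmission ring."""
        self.queued += nbytes

    def completed(self, nbytes: int, starved: bool) -> None:
        """On TX completion, retire bytes and adapt the limit.

        starved: the device ran dry while packets were still waiting
        above, i.e. the current limit proved too small.
        """
        self.queued -= nbytes
        if starved:
            # Limit too small: grow aggressively so the NIC never idles.
            self.limit = min(self.limit + nbytes, self.max_limit)
        elif self.queued < self.limit // 2:
            # Persistent slack: shrink slowly to keep queuing delay low.
            self.limit = max(self.limit - 1, nbytes)

dql = Dql(limit=1500)
dql.queue(1500)
dql.completed(1500, starved=True)
assert dql.limit == 3000  # starvation grew the headroom
```

The asymmetry (grow fast, shrink slowly) reflects the stated goal: starvation is costly in throughput, while a slightly oversized limit costs only a little extra delay.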

Implementing DQL in ns-3

Introducing DQL in ns-3 was made possible by the traffic-control module we previously introduced [23], modelled after the Traffic Control infrastructure in Linux. Along the transmission path, packets are passed by the IPv4 and IPv6 protocols to the traffic-control module, where they are enqueued into the root queue disc installed on the outgoing NetDevice. Queued packets are scheduled by the queue disc and sent to the NetDevice for transmission, following the same approach of the Linux TC
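The stop/wake flow control between the queue disc and the device can be modelled in a few lines. The names below are illustrative and do not reflect the actual ns-3 API; the sketch only captures the mechanism: the device stops its queue when the ring fills, and wakes the queue disc on a TX completion:

```python
from collections import deque

class Device:
    """Toy NetDevice with a fixed-size transmission ring."""

    def __init__(self, ring_size: int) -> None:
        self.ring = deque()
        self.ring_size = ring_size
        self.stopped = False  # flow-control flag seen by the queue disc

    def send(self, pkt) -> None:
        assert len(self.ring) < self.ring_size  # guaranteed by the stopped flag
        self.ring.append(pkt)
        if len(self.ring) == self.ring_size:
            self.stopped = True  # ring full: tell the queue disc to hold packets

    def tx_complete(self, qdisc: "QueueDisc") -> None:
        self.ring.popleft()
        if self.stopped:
            self.stopped = False
            qdisc.run(self)  # wake: resume dequeuing from the queue disc

class QueueDisc:
    """Toy root queue disc (FIFO) feeding a Device."""

    def __init__(self) -> None:
        self.q = deque()

    def enqueue(self, pkt, dev: Device) -> None:
        self.q.append(pkt)
        self.run(dev)

    def run(self, dev: Device) -> None:
        # Dequeue until the queue disc empties or the device stops us.
        while self.q and not dev.stopped:
            dev.send(self.q.popleft())
```

With DQL, the `stopped` condition would depend on the byte limit rather than on a fixed descriptor count, but the stop/wake hand-off is the same.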

Experimental results

We used two desktop computers running the Linux 4.11 kernel, each equipped with an Intel 1 Gbps Ethernet network interface card using the Linux e1000 driver. The default size of the transmission ring, as set by the driver, is 256 descriptors. The two computers are connected back to back with an Ethernet crossover cable. One computer acts as the traffic sender, the other as the traffic receiver. Netperf is used to generate network traffic. In the following, we present the insights that we gained
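For reference, transmission ring sizes and BQL state are typically inspected and tuned on Linux with standard tools. The commands below are an illustrative configuration sketch, not the exact commands used in this campaign; the interface name `eth0` and the receiver address are assumptions:

```shell
# Inspect and resize the transmission ring (256 descriptors is the
# e1000 default mentioned above):
ethtool -g eth0            # show current and maximum ring sizes
ethtool -G eth0 tx 64      # shrink the TX ring to 64 descriptors

# BQL state is exposed per TX queue under sysfs:
cat /sys/class/net/eth0/queues/tx-0/byte_queue_limits/limit     # current limit
cat /sys/class/net/eth0/queues/tx-0/byte_queue_limits/inflight  # bytes queued

# Generate bulk TCP traffic towards the receiver:
netperf -H 192.168.1.2 -t TCP_STREAM -l 60
```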

Simulation study

We carried out a simulation study to verify that our implementation of BQL within ns-3 behaves like the Linux implementation, so that users can trust that they obtain realistic results. The simulation setup recreates, to some extent, the router scenario used for the experiments of the previous section, given that TSO and TSQ are not currently implemented in ns-3. Two nodes are connected by a point-to-point link with a capacity of 1 Gbps and a delay of 0.1 ms. Network traffic consists of

Conclusions and future work

In this paper, we illustrated in detail the operation of the DQL algorithm, recently introduced in the Linux kernel to dynamically limit the number of bytes stored in a transmission ring. We then reported the results of a thorough experimental campaign conducted to evaluate the performance of various schedulers, including a simple FIFO scheduler, a priority scheduler and an AQM algorithm, with DQL enabled or disabled and with varying transmission ring sizes. Experimental results

References (24)

  • W.-C. Feng et al., The BLUE active queue management algorithms, IEEE/ACM Trans. Netw. (2002)
  • S. Kunniyur et al., Analysis and design of an adaptive virtual queue (AVQ) algorithm for active queue management, SIGCOMM Comput. Commun. Rev. (2001)