Loading [a11y]/accessibility-menu.js
Distributed Network Telemetry With Resource Efficiency and Full Accuracy | IEEE Journals & Magazine | IEEE Xplore

Distributed Network Telemetry With Resource Efficiency and Full Accuracy


Abstract:

Network telemetry is essential for administrators to monitor massive data traffic in a network-wide manner. Existing telemetry solutions often face the dilemma between re...Show More

Abstract:

Network telemetry is essential for administrators to monitor massive data traffic in a network-wide manner. Existing telemetry solutions often face the dilemma between resource efficiency (i.e., low CPU, memory, and bandwidth overhead) and full accuracy (i.e., error-free and holistic measurement). We break this dilemma via a network-wide architectural design OmniMon, which simultaneously achieves resource efficiency and full accuracy in flow-level telemetry for large-scale data centers. OmniMon carefully coordinates the collaboration among different types of entities in the whole network to execute telemetry operations, such that the resource constraints of each entity are satisfied without compromising full accuracy. It further addresses consistency in network-wide epoch synchronization and accountability in error-free packet loss inference. We prototype OmniMon in DPDK and P4. Testbed experiments on commodity servers and Tofino switches demonstrate the effectiveness of OmniMon over state-of-the-art solutions.
Published in: IEEE/ACM Transactions on Networking ( Volume: 32, Issue: 3, June 2024)
Page(s): 1857 - 1872
Date of Publication: 09 January 2024

ISSN Information:

Funding Agency:


Contact IEEE to Subscribe

References

References is not available for this document.