research-article

Practical issues with using network tomography for fault diagnosis

Published: 30 September 2008

Abstract

This paper investigates the practical issues in applying network tomography to monitor failures. We outline an approach for selecting paths to monitor, detecting and confirming the existence of a failure, correlating multiple independent observations into a single failure event, and applying existing binary network tomography algorithms to identify failures. We evaluate the ability of network tomography algorithms to correctly detect and identify failures in a controlled environment on the VINI testbed.
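
As a rough illustration of the last step in that pipeline, the sketch below shows one common way to frame binary tomography: links on any working path are assumed healthy, and the remaining links are blamed greedily, preferring the link that explains the most failed paths. This is not the paper's algorithm or any specific published one; the function name `localize_failures`, the path names, and the link labels are hypothetical, and the greedy rule is only a heuristic for finding a small set of links that covers all failed paths.

```python
from typing import Dict, List, Set


def localize_failures(paths: Dict[str, Set[str]], failed: Set[str]) -> List[str]:
    """Greedy sketch of binary tomography (hypothetical helper).

    paths  maps a monitored path name to the set of links it traverses.
    failed is the set of path names currently observed as down.
    Returns links blamed for the failures, in the order they were picked.
    """
    # Assumption: a path that works proves every link on it is healthy.
    good_links: Set[str] = set()
    for name, links in paths.items():
        if name not in failed:
            good_links |= links

    # Candidate links appear on failed paths and on no working path.
    candidates: Set[str] = set()
    for name in failed:
        candidates |= paths[name] - good_links

    unexplained = set(failed)
    blamed: List[str] = []
    while unexplained:
        # Pick the candidate that lies on the most unexplained failed paths.
        # Sorting first makes tie-breaking deterministic.
        best = max(
            sorted(candidates),
            key=lambda link: sum(1 for p in unexplained if link in paths[p]),
            default=None,
        )
        if best is None:
            break  # no candidate left; some failures stay unexplained
        covered = {p for p in unexplained if best in paths[p]}
        if not covered:
            break
        blamed.append(best)
        unexplained -= covered
        candidates.discard(best)
    return blamed


# Hypothetical example: P1 and P2 fail and share link "l1"; P3 works.
example_paths = {
    "P1": {"l1", "l2"},
    "P2": {"l1", "l3"},
    "P3": {"l4", "l5"},
}
print(localize_failures(example_paths, {"P1", "P2"}))
```

In this example the shared link is the single most economical explanation for both failed paths. A practical system would also need the detection, confirmation, and correlation steps the abstract lists before this inference runs, since the input set of failed paths must describe a single failure event.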

          • Published in

            ACM SIGCOMM Computer Communication Review  Volume 38, Issue 5
            October 2008
            68 pages
            ISSN: 0146-4833
            DOI: 10.1145/1452335
            Copyright © 2008 Authors

            Publisher

            Association for Computing Machinery

            New York, NY, United States
