Abstract

Replicating content across a geographically distributed set of servers and redirecting clients to the closest server in terms of latency has emerged as a common paradigm for improving client performance. In this paper, we analyze latencies measured from servers in Google's content distribution network (CDN) to clients all across the Internet to study the effectiveness of latency-based server selection. Our main result is that redirecting every client to the server with least latency does not suffice to optimize client latencies. First, even though most clients are served by a geographically nearby CDN node, a sizeable fraction of experience latencies several tens of milliseconds higher than other in the same region. Second, we find that queueing delays often override the benefits of a client interacting with a nearby server. To help the administrators of Google's CDN cope with these problems, we have built a system called WhyHigh. First, WhyHigh measures client latencies across all nodes in the CDN and correlates measurements to identify the prefixes affected by inflated latencies. Second, since clients in several thousand prefixes have poor latencies, WhyHigh prioritizes problems based on the impact that solving them would have, e.g., by identifying either an AS path common to several inflated prefixes or a CDN node where path inflation is widespread. Finally, WhyHigh diagnoses the causes for inflated latencies using active measurements such as traceroutes and pings, in combination with datasets such as BGP paths and flow records. Typical causes discovered include lack of peering, routing misconfigurations, and side-effects of traffic engineering. We have used WhyHigh to diagnose several instances of inflated latencies, and our efforts over the course of a year have significantly helped improve the performance offered to clients by Google's CDN.


Original document

The different versions of the original document can be found in:

http://www.sysnet.ucsd.edu/sysnet/miscpapers/harshaimc09.pdf,
http://cse.ucsd.edu/sites/cse/files/cse/assets/research/biblio/harshaimc09.pdf,
http://web.eecs.umich.edu/~harshavm/papers/imc09.pdf,
https://research.google/pubs/pub35590,
https://ai.google/research/pubs/pub35590,
http://research.google.com/pubs/pub35590.html,
https://core.ac.uk/display/21741296,
http://static.googleusercontent.com/media/research.google.com/en/us/pubs/archive/35590.pdf,
https://dblp.uni-trier.de/db/conf/imc/imc2009.html#KrishnanMSJKAG09,
http://www-bcf.usc.edu/~katzbass/teaching/csci599-sp13/papers/04/why_high.pdf,
https://www.researchgate.net/profile/Jie_Gao3/publication/221611925_Moving_beyond_end-to-end_path_information_to_optimize_CDN_performance/links/00b49524e2cf232ee5000000.pdf,
https://academic.microsoft.com/#/detail/2130384722
http://dx.doi.org/10.1145/1644893.1644917
Back to Top

Document information

Published on 01/01/2009

Volume 2009, 2009
DOI: 10.1145/1644893.1644917
Licence: CC BY-NC-SA license

Document Score

0

Views 0
Recommendations 0

Share this document

Keywords

claim authorship

Are you one of the authors of this document?