Best utilizing fat long pipes and large file transfer

Hi,

I'm looking for input on the best practices for sending large files over a long fat pipe between facilities (gigabit private circuit, ~20ms RTT).
I'd like to avoid modifying TCP windows and options on end hosts where possible (I have a lot of them). I've seen products that work as "transfer stations" using "reliable UDP" to get around the windowing problem.

I'm thinking of setting up servers with optimized TCP settings to push big files around data centers but I'm curious to know how others deal with LFN+large transfers.

thanks,
Sean

In our experience, you can't get to line speed with over 20-30ms of latency using TCP regardless of how much you tweak it. We transfer files across the US with 60-70ms at line speeds with UDP based file transfer programs. There are a number of open source projects out there designed for this purpose.

-Robert

Tellurian Networks - Global Hosting Solutions Since 1995
http://www.tellurian.com | 888-TELLURIAN | 973-300-9211
"Well done is better than well said." - Benjamin Franklin

Take a look at some of the stuff from Aspera.

Mark

Hi,

I'm looking for input on the best practices for sending large
files

There are both commercial products (fastcopy)
and various "free"(*) products (bbcp, bbftp,
gridftp) that will send large files. While
they can take advantage of larger windows
they also have the capability of using multiple
streams (dealing with the inability to tune the
tcp stack). There are, of course, competitors
to these products which you should look into.
As always, YMWV.

Some references:
http://www.softlink.com/fastcopy_techie.html
  (Some parts of NASA seem to like fastcopy)
http://nccs.gov/user-support/general-support/data-transfer/bbcp/
  (Full disclosure, bbcp was written by someone who sits
  about 3 meters from where I am sitting, but I cannot find
  a nice web reference from him about the product, so I am
  showing a different sites documentation)
http://doc.in2p3.fr/bbftp/
  (One of the first to use multistream for BaBar)
http://www.globus.org/grid_software/data/gridftp.php
  (Part of the globus toolkit. Somewhat heavier weight
  if all you want is file transfer.)
http://fasterdata.es.net/tools.html
  (A reference I am surprised Kevin did not point to :slight_smile:

  (A few year old performance evaluation)
www.triumf.ca/canarie/amsterdam-test/References/010402-ftp.ppt
  (Another older performance evaluation)

Gary

(*) Some are GPL, and some (modified) BSD licenses.
    Which one is "free enough" depends on some strongly
    held beliefs of the validator.

I'm looking for input on the best practices for sending large files over
a long fat pipe between facilities (gigabit private circuit, ~20ms RTT).

providing you have RFC1323 type extensions enabled on a semi-decent OS, a 4MB
TCP window should be more than sufficient to fill a GbE pipe over 30msec.

with a modified TCP stack, that uses TCP window sizes up to 32MB, i've worked
with numerous customers to achieve wire-rate GbE async replication for
storage-arrays with FCIP.

the modifications to TCP were mostly to adjust how it reacts to packet loss,
e.g. don't "halve the window".
the intent of those modifications is that it doesn't use the "greater internet"
but is more suited for private connections within an enterprise customer
environment.

that is used in production today on many Cisco MDS 9xxx FC switch environments.

I'd like to avoid modifying TCP windows and options on end hosts where
possible (I have a lot of them). I've seen products that work as
"transfer stations" using "reliable UDP" to get around the windowing
problem.

given you don't want to modify all your hosts, you could 'proxy' said TCP
connections via 'socat' or 'netcat++'.

cheers,

lincoln.

Does anybody heard if comcast is having problems today?

Thank you,
Taeko

Since I got on shift two hours ago, I've done nothing but stare at
traceroutes into and out of Comcast space trying to reassure dozens of
customers that we're not down, Comcast is having problems. Our upstream
claims they've been dealing with Comcast customers all (US) day. I'm pretty
sure there's some serious weirdness going on in there.

(Oh, and don't reply to an existing message to start a new thread)

- Matt

Does anybody heard if comcast is having problems today?

lucy was having problems in eugene orygun. she diagnosed and then gave
up and went to dinner.

randy

I have a comcast business line in Western WA and have seen no hiccups
so far today. Main IP is 75.146.60.201.

If someone that is seeing issues can send me an IP, I can traceroute
to see if I can reach it from within Comcast.

I've got a customer in 73.72.92.0/24, and I don't see the prefix on the net.

Just to confirm from here (Toronto):

core2-rtr-to#sh ip bgp 73.72.92.0
% Network not in table

Paul

I have Smokeping running from behind my Comcast (Eastern MA / New
England) and have alarms of latency from 6:28P-7:18pm EST. Not sure if
attachments make it through - but doc of last 3 hr graph showing .13%
loss avg and 4.57% max loss. Seems clean otherwise. On Tuesday I had
alarms going all day long.....I run monitors to my Corp Network and
Legacy Genuity DNS and the results are the same for both.

Eric

Tom,
Where would that be located. From my house my UUNet/MCI/Verizon Business
Link doesn't have it. My speakeasy link doesn't have it either. All of
Comcast was out in my neighborhood (Alexandria, VA) yesterday at 7pm when I
got home, was still out at 11pm when I went to bed, up and running fine this
AM.

David

when was the last time you saw this prefix reachable?

i dont see anything announced from comcasts 73.0.0.0/8 allocation within the
past 2 weeks...

FYI: Internally within Comcast it does route:

$ mtr --report -c 1 73.0.0.1
HOST: blue Loss% Snt Last Avg Best Wrst StDev
  1. linksys 0.0% 1 0.9 0.9 0.9 0.9 0.0
  2. c-24-98-192-1.hsd1.ga.comcas 0.0% 1 7.3 7.3 7.3 7.3 0.0
  3. ge-2-5-ur01.a2atlanta.ga.atl 0.0% 1 8.0 8.0 8.0 8.0 0.0
  4. te-9-1-ur02.a2atlanta.ga.atl 0.0% 1 6.0 6.0 6.0 6.0 0.0
  5. te-9-3-ur01.b0atlanta.ga.atl 0.0% 1 6.5 6.5 6.5 6.5 0.0
  6. 68.85.232.62 0.0% 1 6.4 6.4 6.4 6.4 0.0
  7. po-15-ar01.b0atlanta.ga.atla 0.0% 1 7.6 7.6 7.6 7.6 0.0
  8. te-4-1-cr01.atlanta.ga.cbone 0.0% 1 9.0 9.0 9.0 9.0 0.0
  9. te-1-1-cr01.charlotte.nc.cbo 0.0% 1 11.8 11.8 11.8 11.8 0.0
10. te-1-1-cr01.richmond.va.cbon 0.0% 1 19.5 19.5 19.5 19.5 0.0
11. te-1-1-cr01.mclean.va.cbone. 0.0% 1 24.0 24.0 24.0 24.0 0.0
12. te-1-1-cr01.philadelphia.pa. 0.0% 1 25.6 25.6 25.6 25.6 0.0
13. be-40-crs01.401nbroadst.pa.p 0.0% 1 26.5 26.5 26.5 26.5 0.0
14. be-50-crs01.ivyland.pa.panjd 0.0% 1 28.8 28.8 28.8 28.8 0.0
15. po-10-ar01.verona.nj.panjde. 0.0% 1 41.7 41.7 41.7 41.7 0.0
16. po-10-ar01.eatontown.nj.panj 0.0% 1 33.5 33.5 33.5 33.5 0.0
17. po-10-ur01.middletown.nj.pan 0.0% 1 34.4 34.4 34.4 34.4 0.0
18. po-10-ur01.burlington.nj.pan 0.0% 1 48.0 48.0 48.0 48.0 0.0

-Jim P.

interesting enough mine goes the other way

brokenrobot:~ christian$ traceroute 73.72.92.1
traceroute to 73.72.92.1 (73.72.92.1), 64 hops max, 40 byte packets
1 192.168.2.1 (192.168.2.1) 12.130 ms 1.135 ms 1.262 ms
2 * * *
3 ge-2-3-ur01.jerseycity.nj.panjde.comcast.net (68.86.220.185) 10.710 ms
9.638 ms 12.299 ms
4 po-10-ur02.jerseycity.nj.panjde.comcast.net (68.86.209.246) 15.747 ms
13.280 ms 10.082 ms
5 po-10-ur01.narlington.nj.panjde.comcast.net (68.86.209.250) 11.373 ms
10.847 ms 11.873 ms
6 po-10-ur02.narlington.nj.panjde.comcast.net (68.86.158.178) 36.034 ms
11.622 ms 16.032 ms
7 po-70-ar01.verona.nj.panjde.comcast.net (68.86.209.254) 17.642 ms
16.334 ms 11.358 ms
8 te-0-7-0-7-crs01.plainfield.nj.panjde.comcast.net (68.86.72.18) 25.986
ms 13.774 ms 12.524 ms
9 te-4-1-cr01.newyork.ny.cbone.comcast.net (68.86.72.17) 22.172 ms
24.545 ms 17.774 ms
10 te-9-1-cr01.philadelphia.pa.cbone.comcast.net (68.86.68.5) 16.040 ms
14.670 ms 14.232 ms
11 te-9-1-cr01.mclean.va.cbone.comcast.net (68.86.68.1) 32.585 ms 20.303
ms 28.397 ms
12 te-9-1-cr02.pittsburgh.pa.cbone.comcast.net (68.86.68.125) 24.909 ms
24.261 ms 34.669 ms
13 te-8-1-cr01.cleveland.oh.cbone.comcast.net (68.86.68.117) 28.253 ms
26.949 ms 27.105 ms
14 te-9-1-cr01.chicago.il.cbone.comcast.net (68.86.68.22) 37.728 ms
59.564 ms 36.835 ms
15 te-9-1-cr01.omaha.ne.cbone.comcast.net (68.86.68.30) 53.805 ms 54.945
ms 53.015 ms
16 te-9-1-cr01.denver.co.cbone.comcast.net (68.86.68.42) 62.957 ms 74.028
ms 63.913 ms
17 te-9-1-cr01.denverqwest.co.cbone.comcast.net (68.86.68.146) 62.570 ms
68.635 ms 62.655 ms
18 te-2-1-cr01.santateresa.tx.cbone.comcast.net (68.86.68.150) 74.011 ms
74.431 ms 76.615 ms
19 te-8-1-cr01.losangeles.ca.cbone.comcast.net (68.86.68.81) 92.442 ms
93.805 ms 93.965 ms
20 te-9-1-cr01.sacramento.ca.cbone.comcast.net (68.86.68.69) 105.415 ms
102.575 ms 103.331 ms
21 te-0-2-0-1-ar01.oakland.ca.sfba.comcast.net (68.87.192.134) 111.564 ms
111.281 ms 106.768 ms
22 te-9-3-ur02.pittsburg.ca.sfba.comcast.net (68.87.192.133) 105.908 ms
105.997 ms 108.539 ms
23 GE-6-1-ur01.pittsburg.ca.sfba.comcast.net (68.87.199.173) 107.303 ms
108.250 ms 110.819 ms
24 ge-0-1-ubr02.pittsburg.ca.sfba.comcast.net (68.87.197.22) 119.945 ms *
107.782 ms

I didn't get online tonight till ~8pm, but I haven't noticed any
problems at all. (I'm on 74/8.)

    --Steve Bellovin, http://www.cs.columbia.edu/~smb

They've been fine in my area (atlanta), though there was a fair bit of
downtime last week. I did, however, notice today that my port 25 blocks
are gone... which wasn't the case last week.

23 114 ms 122 ms 113 ms ge-0-1-ubr02.pittsburg.ca.sfba.comcast.net [68.8
7.197.22]

Für eine Weile hatten wir Zugang durch eine Hong Kong basiert Piraten-Netzwerk

-M<

Yeah, I've saw similar in traces a few days back, I wondered wtf.

-Jim P.