Analysing intermittent problems with DNS forwarder m0n0wall

Question

Hi, I've set up a 'private subnet' in my home network. I have a 'server' with two NICs: eth0 is connected to the ADSL modem/router, and eth1 is connected to a switch that connects all PCs on the subnet.

The server runs VMware Server with 3 VMs:

- Ubuntu, bridged to eth0
- Ubuntu, bridged to eth1
- m0n0wall, which acts as the firewall/gateway/DHCP server for the subnet. Its "WAN" interface is bridged to eth0 and its "LAN" interface to eth1.

At this stage only the DHCP server and DNS forwarding are enabled, so that machines on the subnet can access the internet.

The setup works: DNS names are resolved and most websites work fine. Pinging various hosts gives the same results via the m0n0wall as directly through the modem/router gateway.

However, some sites take ages to load or don't load at all, though not every time. www.bbc.co.uk is one example.

I'm using Wireshark to get to the bottom of the problem, but so far I've only found that, when the request fails, the server (BBC) sends a window update in response to the packet with the GET request. After that the client retransmits two more times and the server finally responds with 'bad tcp'.

What I'm trying to find out is whether I've got a config problem in the m0n0wall or whether it's something more fundamental, e.g. packets getting mangled.

Any pointers would be appreciated.

Cheers, Michael

Accepted Answer

If some sites work directly, but not over the m0n0wall firewall, then I would make traces on both sides of the m0n0wall and compare them. Look at the TCP options for things like:

Window scale adjustments (or deletion)
SACK option deletion
Incorrect SACK translations
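
One way to do this comparison without eyeballing every packet is to export the packet list from each capture as text and diff the TCP options shown in the Info column of the SYN and SYN/ACK. Below is a minimal Python sketch, assuming the Info strings look like the usual Wireshark summary (`MSS=1460 WS=2 SACK_PERM=1 ...`). The names `tcp_options` and `diff_options` are made up for this example, and the two sample strings are hypothetical, not taken from a real capture.

```python
import re

# Option fields as Wireshark prints them in the Info column of a SYN or SYN/ACK.
OPTION_RE = re.compile(r"\b(MSS|WS|SACK_PERM|TSval|TSecr)=(\d+)")


def tcp_options(info):
    """Return the TCP options found in one Info string as a dict of strings."""
    return dict(OPTION_RE.findall(info))


def diff_options(lan_info, wan_info):
    """Return {option: (lan_value, wan_value)} for every option that differs.

    A value of None means the option is missing on that side.
    """
    lan, wan = tcp_options(lan_info), tcp_options(wan_info)
    return {
        name: (lan.get(name), wan.get(name))
        for name in sorted(set(lan) | set(wan))
        if lan.get(name) != wan.get(name)
    }


# Hypothetical example: the same SYN seen before and after a firewall that
# strips the window scale option.
lan_syn = "[SYN] Seq=0 Win=65535 Len=0 MSS=1460 WS=2 SACK_PERM=1"
wan_syn = "[SYN] Seq=0 Win=65535 Len=0 MSS=1460 SACK_PERM=1"
print(diff_options(lan_syn, wan_syn))
```

An empty result means the options listed in `OPTION_RE` came through unchanged. It does not check SACK blocks in later packets, so those would still need a manual look.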

What exactly do you mean by "the server responds with 'bad tcp'"? Is that an HTTP message, or is Wireshark reporting a packet as bad TCP?

Answer 2

I used tcpdump to monitor both physical interfaces and imported the results into Wireshark, where I used "Follow TCP Stream".

On the eth0 side there are more packets being sent; I was expecting them to be one-to-one. I looked at the packets (not all of them) and they don't seem to be corrupted or to have their options reset.
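
To put numbers on "more packets" rather than counting by hand, the text export of each stream can be tallied per source address and protocol. This is only a rough sketch, assuming the export has the default columns (No., Time, Source, Destination, Protocol, Length, Info) separated by whitespace; `tally` is a name invented here.

```python
from collections import Counter


def tally(listing):
    """Count packets per (source, protocol) in a Wireshark text export."""
    counts = Counter()
    for line in listing.splitlines():
        fields = line.split(None, 6)
        # Skip the column header and anything that is not a packet row.
        if len(fields) < 6 or not fields[0].isdigit():
            continue
        source, protocol = fields[2], fields[4]
        counts[(source, protocol)] += 1
    return counts


# The column header plus three rows copied from the eth0 listing in this post.
sample = """\
No.     Time        Source                Destination           Protocol Length Info
     26 2.934798    192.168.1.95          212.58.244.66         HTTP     70     GET / HTTP/1.1
     27 2.934805    192.168.1.95          212.58.244.66         TCP      70     [TCP Retransmission] [TCP segment of a reassembled PDU]
     28 2.935617    192.168.1.1           192.168.1.95          ICMP     590    Destination unreachable (Fragmentation needed)
"""
for key, count in sorted(tally(sample).items()):
    print(key, count)
```

Running it over both full listings and comparing the two tallies shows which sources and protocols account for the extra rows on one side.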

eth1:

No.     Time        Source                Destination           Protocol Length Info
      7 6.001700    192.168.2.2           212.58.244.66         TCP      78     dyna-lm > http [SYN] Seq=0 Win=65535 Len=0 MSS=1460 WS=2 TSval=0 TSecr=0 SACK_PERM=1
      8 6.048679    212.58.244.66         192.168.2.2           TCP      66     http > dyna-lm [SYN, ACK] Seq=0 Ack=1 Win=5840 Len=0 MSS=1460 SACK_PERM=1 WS=128
      9 6.048698    212.58.244.66         192.168.2.2           TCP      66     http > dyna-lm [SYN, ACK] Seq=0 Ack=1 Win=5840 Len=0 MSS=1460 SACK_PERM=1 WS=128
     10 6.049136    192.168.2.2           212.58.244.66         TCP      60     dyna-lm > http [ACK] Seq=1 Ack=1 Win=128000 Len=0
     11 6.049460    192.168.2.2           212.58.244.66         TCP      1514   [TCP segment of a reassembled PDU]
     12 6.049483    192.168.2.2           212.58.244.66         HTTP     70     GET / HTTP/1.1 
     13 6.095471    212.58.244.66         192.168.2.2           TCP      66     [TCP Window Update] http > dyna-lm [ACK] Seq=1 Ack=1 Win=5888 Len=0 SLE=1461 SRE=1477
     14 6.095490    212.58.244.66         192.168.2.2           TCP      66     [TCP Dup ACK 13#1] http > dyna-lm [ACK] Seq=1 Ack=1 Win=5888 Len=0 SLE=1461 SRE=1477
     15 8.952247    192.168.2.2           212.58.244.66         TCP      1514   [TCP Retransmission] dyna-lm > http [ACK] Seq=1 Ack=1 Win=128000 Len=1460
     16 14.987273   192.168.2.2           212.58.244.66         TCP      1514   [TCP Retransmission] dyna-lm > http [ACK] Seq=1 Ack=1 Win=128000 Len=1460
     17 17.573181   212.58.244.66         192.168.2.2           TCP      66     http > dyna-lm [FIN, ACK] Seq=1 Ack=1 Win=5888 Len=0 SLE=1461 SRE=1477
     18 17.573203   212.58.244.66         192.168.2.2           TCP      66     http > dyna-lm [FIN, ACK] Seq=1 Ack=1 Win=5888 Len=0 SLE=1461 SRE=1477
     19 17.573729   192.168.2.2           212.58.244.66         TCP      60     dyna-lm > http [ACK] Seq=1461 Ack=2 Win=128000 Len=0
     20 17.573833   192.168.2.2           212.58.244.66         TCP      70     [TCP Retransmission] [TCP segment of a reassembled PDU]
     21 17.619934   212.58.244.66         192.168.2.2           TCP      60     http > dyna-lm [RST] Seq=2 Win=0 Len=0
     22 17.619952   212.58.244.66         192.168.2.2           TCP      60     http > dyna-lm [RST] Seq=2 Win=0 Len=0

eth0:

No.     Time        Source                Destination           Protocol Length Info
     19 2.887053    192.168.1.95          212.58.244.66         TCP      78     11277 > http [SYN] Seq=0 Win=65535 Len=0 MSS=1460 WS=2 TSval=0 TSecr=0 SACK_PERM=1
     20 2.887073    192.168.1.95          212.58.244.66         TCP      78     11277 > http [SYN] Seq=0 Win=65535 Len=0 MSS=1460 WS=2 TSval=0 TSecr=0 SACK_PERM=1
     21 2.933395    212.58.244.66         192.168.1.95          TCP      66     http > 11277 [SYN, ACK] Seq=0 Ack=1 Win=5840 Len=0 MSS=1460 SACK_PERM=1 WS=128
     22 2.934360    192.168.1.95          212.58.244.66         TCP      60     11277 > http [ACK] Seq=1 Ack=1 Win=128000 Len=0
     23 2.934376    192.168.1.95          212.58.244.66         TCP      60     [TCP Dup ACK 22#1] 11277 > http [ACK] Seq=1 Ack=1 Win=128000 Len=0
     24 2.934717    192.168.1.95          212.58.244.66         TCP      1514   [TCP segment of a reassembled PDU]
     25 2.934731    192.168.1.95          212.58.244.66         TCP      1514   [TCP Retransmission] 11277 > http [ACK] Seq=1 Ack=1 Win=128000 Len=1460
     26 2.934798    192.168.1.95          212.58.244.66         HTTP     70     GET / HTTP/1.1 
     27 2.934805    192.168.1.95          212.58.244.66         TCP      70     [TCP Retransmission] [TCP segment of a reassembled PDU]
     28 2.935617    192.168.1.1           192.168.1.95          ICMP     590    Destination unreachable (Fragmentation needed)
     29 2.980166    212.58.244.66         192.168.1.95          TCP      66     [TCP Window Update] http > 11277 [ACK] Seq=1 Ack=1 Win=5888 Len=0 SLE=1461 SRE=1477
     48 5.837664    192.168.1.95          212.58.244.66         TCP      1514   [TCP Retransmission] 11277 > http [ACK] Seq=1 Ack=1 Win=128000 Len=1460
     49 5.837685    192.168.1.95          212.58.244.66         TCP      1514   [TCP Retransmission] 11277 > http [ACK] Seq=1 Ack=1 Win=128000 Len=1460
     50 5.838493    192.168.1.1           192.168.1.95          ICMP     590    Destination unreachable (Fragmentation needed)
     87 11.872608   192.168.1.95          212.58.244.66         TCP      1514   [TCP Retransmission] 11277 > http [ACK] Seq=1 Ack=1 Win=128000 Len=1460
     88 11.872630   192.168.1.95          212.58.244.66         TCP      1514   [TCP Retransmission] 11277 > http [ACK] Seq=1 Ack=1 Win=128000 Len=1460
     89 11.873432   192.168.1.1           192.168.1.95          ICMP     590    Destination unreachable (Fragmentation needed)
    108 14.457945   212.58.244.66         192.168.1.95          TCP      66     http > 11277 [FIN, ACK] Seq=1 Ack=1 Win=5888 Len=0 SLE=1461 SRE=1477
    109 14.459048   192.168.1.95          212.58.244.66         TCP      60     11277 > http [ACK] Seq=1461 Ack=2 Win=128000 Len=0
    110 14.459062   192.168.1.95          212.58.244.66         TCP      60     11277 > http [ACK] Seq=1461 Ack=2 Win=128000 Len=0
    111 14.459128   192.168.1.95          212.58.244.66         TCP      70     [TCP Retransmission] [TCP segment of a reassembled PDU]
    112 14.459139   192.168.1.95          212.58.244.66         TCP      70     [TCP Out-Of-Order] [TCP segment of a reassembled PDU]
    113 14.504728   212.58.244.66         192.168.1.95          TCP      60     http > 11277 [RST] Seq=2 Win=0 Len=0
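
A small sanity check on the numbers in these listings, since it helps when reading the SLE/SRE fields: the frame lengths and the SACK range should add up. This is a minimal sketch, assuming plain Ethernet II framing and IPv4/TCP headers without options on the data segments. The constants and `tcp_payload` are illustrative names only.

```python
ETH_HEADER = 14  # Ethernet II header, no VLAN tag (assumption)
IP_HEADER = 20   # IPv4 header without options (assumption)
TCP_HEADER = 20  # TCP header without options (assumption)


def tcp_payload(frame_len):
    """TCP payload bytes carried by a frame of the given captured length."""
    return frame_len - ETH_HEADER - IP_HEADER - TCP_HEADER


# Frame lengths taken from the Length column of the two data segments.
first_len = tcp_payload(1514)   # the "TCP segment of a reassembled PDU" frame
second_len = tcp_payload(70)    # the "GET / HTTP/1.1" frame

# Relative sequence numbers: the first data byte is Seq=1.
second_start = 1 + first_len
second_end = second_start + second_len

print(first_len, second_len, second_start, second_end)
```

If the assumptions hold, the 1514-byte frame carries 1460 bytes (matching Len=1460 in the retransmissions) and the 70-byte frame carries 16, so the second segment covers relative sequence numbers 1461 to 1477. That is the same range the server reports as SLE=1461 SRE=1477 while still sending Ack=1, which would suggest the small segment arrived and the full-size one did not.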

Thanks, Michael