Question on linebreaks on output from tshark’s -z io,stat option

Question

Hello,

I've done up a script that reads a capture file with tshark's -z io,stat argument, where the goal is to be able to generate statistics on several different display filter search criteria with a single pass on the capture file itself (automatically-generated capture files of predictable name and timestamp, where the script users tshark to get the stats off of that time period and pushes it to a line in a .csv file). This was a slightly tedious effort due to every added display filter increasing the line count in the output of the tshark query, which also changes which line of output the statistics themselves are generated in.

Anyway my question is this:

Right now it looks like all outputs of this command will put the statistiics onto a single, very long line even when dozens of display filters are used in the query. Is it a safe assumption with current Wireshark/Tshark versions (in this example, 1.8.6) that the io,stats printout will put all stats on a single very-long line, or will it break the line at some upper limit and use a second line? If it does, what is that upper limit? The reason is I'm making that one-line assumption at the moment and don't want my scripts to break if they're calling a hundred different display filters.

Related question - any way we could lose the text art in that output and just pump out a nice clean delimited line of stats in the order requested?

Accepted Answer

Yes I tried to break it last night but it does seem to be always one line. I'm 'relatively' confident that I'm safe there

You can be safe. I've just checked the source code. There is no limit (besides available RAM), so you can rely on a single line.

See iostat_draw() in tap-iostat.c. The required space for the column data is requested via g_malloc() and the column data itself is printed in small pieces with printf, column by column, so there is not even a large string that needs to be handled internally.

There is also no limit from the OS, at least I cannot imagine one, because if there was a limit you would not be able to pipe large amounts of data via STDOUT/STDIN into another program, which is obviously not the case on any of the current OSes.

Regarding the fancy ASCII art. You can simply convert that to CSV with this one liner on Linux and similar OSes.

   tshark -nr input.pcap  -z io,stat,1 -q | grep '<>' | sed 's/ <> */;/' | sed 's/^| *//' | sed 's/ *| */;/g'

It might not be the best and fastest, nor the most elegant regexp, but it works ;-)

Regards
Kurt

Answer 2

I just tested this on OSX, I was able to create lines with a length of >50000 characters with the "io,stat" option. I assume (without looking at the source code) that there is no limit in tshark itself, but that there might be a limit imposed by the OS.

Regarding the "pretty printing", AFAIK this is hardcoded. You could file an enhancement request to add an option to csv'ify the output.