
A Perl script to extract data from FortNet Monthly Statistics Summary Files
FortNet's Monthly Statistics Summaries of web server activity are rather large HTML files. Files such as stats-Feb97.html include a section called HTTP Server Request Statistics, which lists:

Of course, you may be using page counters, but these do not provide monthly breakdowns and summaries unless you use multiple counters or reset the count data file. The summary file data may be suspect, but it is another source of information about visits to your site.

A Perl script can be used to extract specific data from the HTML summary file. You can run such scripts from the Unix prompt or obtain one of the "Alien Ports" to run them on your PC.

The script statck.pl was written to extract the numbers for all of WidowNet's pages. You can modify the file to pull data for other URLs without much effort.

To extract data from the February 1997 Monthly Summary, the command line (from the Unix prompt) would be:

statck.pl /www/htdocs/FortNet/stats/stats-Feb97.html >> mystats
(If you're running this on your PC, you will have to save the summary HTML file to your disk and change the command line accordingly.)

The output file lists the number of hits for each file within the target directory structure. It is a tab-delimited text file that can be imported into a database, spreadsheet, or whatever tool you would like to use for further analysis.
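For readers who want to write something similar, here is a minimal sketch of this kind of extraction. It is not statck.pl itself. It assumes each statistics line ends with a request count followed by the file's path, which may not match the real summary layout, and the /WidowNet/ prefix and the extract_hits name are hypothetical.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical: the directory prefix whose pages should be counted.
my $PREFIX = '/WidowNet/';

# Return [path, hits] pairs for lines whose last field is a path
# starting with $prefix and whose second-to-last field is an integer.
# ASSUMPTION: the summary lines end with "<requests> <path>".
sub extract_hits {
    my ($prefix, @lines) = @_;
    my @rows;
    for my $line (@lines) {
        $line =~ s/<[^>]*>//g;              # drop any HTML tags
        my @fields = split ' ', $line;      # split on runs of whitespace
        next unless @fields >= 2;
        my ($hits, $path) = @fields[-2, -1];
        next unless $hits =~ /^\d+$/;       # count must be a whole number
        next unless index($path, $prefix) == 0;
        push @rows, [$path, $hits];
    }
    return @rows;
}

# Only read a file when one is named on the command line.
if (@ARGV) {
    my $file = shift @ARGV;
    open my $in, '<', $file or die "Cannot open $file: $!";
    my @lines = <$in>;
    close $in;
    # One tab-delimited line per matching page: path, then hits.
    print join("\t", @$_), "\n" for extract_hits($PREFIX, @lines);
}
```

To pull data for a different set of URLs with a sketch like this, only the prefix would need to change.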

In my case, this information was loaded into a spreadsheet to produce a graph of monthly hits to all pages making up the site.
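If you would rather get a monthly total without a spreadsheet, a short script can add up the hits column. This is only a sketch. It assumes each line of the tab-delimited file holds a path followed by a hit count, and that each month's output was saved to its own file; the total_hits name and the file layout are assumptions, not something the original script guarantees.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Sum the second tab-separated column of the given lines.
# ASSUMPTION: each line looks like "<path>\t<hits>".
sub total_hits {
    my $total = 0;
    for my $line (@_) {
        chomp(my $copy = $line);
        my (undef, $hits) = split /\t/, $copy;
        $total += $hits if defined $hits && $hits =~ /^\d+$/;
    }
    return $total;
}

# Print one "file <TAB> total" line for each file named on the command line.
for my $file (@ARGV) {
    open my $in, '<', $file or die "Cannot open $file: $!";
    my @lines = <$in>;
    close $in;
    print "$file\t", total_hits(@lines), "\n";
}
```

Run with one file per month, this prints one total per month, ready to graph.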

Michael Goshorn - February 26, 1997
