hwlogstat 1.20


Name

hwlogstat - create statistics about WWW-access to HyperWave servers

Synopsis

hwlogstat parameters
For a short description of the parameters try hwlogstat -h. For more details see below.

But first of all, you might want to take a look at an example.


Note

With the release of a new HyperWave version in June 1996, the www logfile format has been changed. According to that, hwlogstat (and beyond) parses the new format only.
So, if you still want to analyze old logfiles, you should use hglogstat.


Description

Based on the logfiles written by the WWW-gateway, hwlogstat produces access-statistics by collecting various information (see Modes). Depending on the users' selection, the tool may present the results as an HTML-document, it may write detailed information about requested objects, searches and failed searches to a file, or, as a third possibility, the tool may produce overall statistics, based on information gathered during either of the previous modes. (The first two actions may be combined in a single run, the third one needs an extra run.)

Modes

hwlogstat may execute in three different modes, the first two of which can be combined into a single run.


Parameters

Most parameters may be abbreviated by the first four characters. Exceptions are -lastse[ven] and -lastmo[nth].
(<...> defines the type of data required)

-html
Mode 1. The script will produce detailed statistics, output an HTML document and optionally insert it into the specified collection.

-details
Mode 2. Output all requested objects, search requests and failed search requests in plain ASCII (for further use).

-overall
Mode 3. Produce overall statistics using results from previous runs.

-dir
Defines the directory the logfiles are stored in. Only logfiles in this directory will be examined.

-dir <directory>
Defines the directory the logfiles are stored in. Only the logfiles in this directory will be examined
(e.g. ~hgsystem/logs).

-file <filename>
Specifies the name of the current logfile (as defined in .db.contr.rc). The default name is wave.log, so this parameter may be omitted.
Old logfiles are supposed to be consist of the given filename followed by a timestamp (e.g. wwwlog.30703723). Optionally, these files may also be gzipped; in this case, the tool temporarily expands them (using gzip -c). So, giving 'wwwlog' as filename actually means all files matching wwwlog[.timestamp[.gz]].

-hghost <name>
Name of HyperWave host ...

-pname <coll>
... and name of collection to put the HTML document into.
If this parameter is missing, output goes to stdout.

-imgcoll <coll>
Name of collection to put images into (by default, equals the collection defined by -pname).

-hname <string>
Hostname that shall appear in the summary's title. This option may be used in Mode 1, when an alias name shall be used instead of the host's domain name within the report.
When the script is executed in Mode 2 only, there is no need to define -hghost, -pname and -imgcoll, since an ASCII-file is the only thing that will be output. So -hname may be used to still give the host a name.

-from <yy/mm/dd>
First day to analyze. Should be in the form yy/mm/dd.

-to <yy/mm/dd>
Last day to analyze. By default, yesterday's date is assumed. Format as above.

-lastseven
Analyzes the last seven days (may be used instead of -from and -to).

-lastmonth
Analyzes the last month (may be used instead of -from and -to).

-top <number>
Specifies the top n items to be listed (20 by default).

-regex
Tells the script to treat the entries in the rc-file as regular expressions (in perl-fashion);
without this option the entries are supposed to be object titles

-domains
Add the top domains to each requested object.

-maxdomain <number>
Specifies the number of domains to be displayed (5 by default).

-nographics
Do not produce any graphics for the time information.

-cmd <filename>
Take the parameters defined in the given file. These parameters can still be overridden by those given in the commandline.

-test
output current settings.

-v
Verbose mode.


The rc-file

It has been mentioned above that unwanted items may be excluded from the summaries by describing them in an rc-file. By default, the script looks for hwlogstat.rc in the current directory, but an alternative filename may be defined by the -rc parameter.
The list of items in this file may be divided into several categories, each headed by a line identifying the type of objects to follow. So far, requested objects and entry pages and requesting hosts may be skipped (the corresponding heading lines are _SKIP_OBJECTS_, _SKIP_ENTRIES_ and _SKIP_HOSTS_.
Lines starting with # are considered to be comments.

The objects may be described in one of two ways:

It shall be emphasized, however, that the items that appear in hglogstat.rc are excluded from the top-n lists only; they still count as requested objects or entry pages!


The command file

When a series of statistics should be generated with most of the parameters remaining constant, the user does not have to type the whole lot of command-line parameters again and again. Instead, the parameters may be put into a command file, and calling the script parameter -cmd <file> induces it to read the parameters from that file.
But still, additional parameters given in the command-line will override those in the file.

Format:

Example:
# what a command-file might look like
-dir /usr3/users/hgsystem/log
-hname "My own Server"


What is necessary to run hwlogstat?

If you have these installed on your system, too, nothing can keep you from working with hwlogstat.


History

Changes since 1.10

Changes since 1.00


Known Bugs

Of course, there are some minor bugs, but none of them is really serious.


Author

Alfons Schmid (aschmid@iicm.edu) - September 25, 1996