hglogstat 1.13


Name

hglogstat - create statistics about WWW-access to Hyper-G servers

Synopsis

hglogstat parameters
For a short description of the parameters try hglogstat -h. For more details see below.


Description

Based on the logfiles produced by the WWW-gateway, hglogstat produces access-statistics by collecting various information (see Modes). Depending on the users' selection the tool may put the results into an HTML-document (which may immediately be inserted into a defined Hyper-G collection, optionally along with some graphic representation), or the script writes detailed information about requested objects, searches and failed searches to a file, or, as a third possibility, the tool may produce overall statistics, based on information gathered during either of the previous modes. (The first two actions may be combined in a single run, the third one needs an extra run.)

Modes

hglogstat may execute in three different modes, the first two of which can be combined into a single run.


Parameters

Most parameters may be abbreviated by the first four characters. Exceptions are -lastse[ven] and -lastmo[nth].
(<...> defines the type of data required)

-html
Mode 1. The script will produce detailed statistics and output an HTML; the user may choose to have this document inserted into a specified collection.

-details
Mode 2. Output all requested objects, searches and failed searches in plain ASCII (for further use).

-overall
Mode 3. Produce overall statistics using results from previous runs.

-dir <directory>
Defines the directory the logfiles are stored in. Only the logfiles in this directory will be examined
(e.g. ~hgsystem/logs).

-file <filename>
Gives the name of the current logfile. The default name is wwwlog, so this parameter may be omitted.
Old logfiles are supposed to be consist of the given filename followed by a timestamp (e.g. wwwlog.30703723). Optionally, these files may also be gzipped; in this case, the tool temporarily expands them (using gzip -c). So, giving 'wwwlog' as filename actually means all files matching wwwlog[.timestamp[.gz]].

-hghost <name>
Name of Hyper-G host ...

-pname <coll>
... and name of collection to put the HTML document into.
If an HTML output is desired but this parameter is omitted, the HTML text goes to the standard output.

-imgcoll <coll>
Name of collection to put images into (by default, equals the collection defined by -pname).

-hname <string>
Hostname that shall appear in the summary's title. This option may be used in Mode 1, when an alias name shall be used instead of the host's domain name within the report.
When the script is executed in Mode 2 only, there is no need to define -hghost, -pname and -imgcoll, since an ASCII-file is the only thing that will be output. So -hname may be used to still give the host a name.

-from <yy/mm/dd>
First day to analyze. Should be in the form yy/mm/dd.

-to <yy/mm/dd>
Last day to analyze. By default, yesterday's date is assumed. Format as above.

-lastseven
Analyzes the last seven days (may be used instead of -from and -to).

-lastmonth
Analyzes the last month (may be used instead of -from and -to).

-top <number>
Specifies the top n items to be listed (20 by default).

-regex
Tells the script to treat the entries in th rc-file as regular expressions (in perl-fashion);
without this option the entries are supposed to be object titles

-cmd <filename>
Take the parameters defined in the given file. These parameters can still be overridden by those given in the commandline.

-test
output current settings.

-v
Verbose mode.


The rc-file

It has been mentioned above that unwanted items may be excluded from the summaries by describing them in an rc-file. By default, the script looks for hglogstat.rc in the current directory, but an alternative filename may be defined by the -rc parameter.
The list of items in this file may be divided into several categories, each headed by a line identifying the type of objects to follow. So far, requested objects and entry pages may be skipped, the corresponding heading lines are _SKIP_OBJECTS_, _SKIP_ENTRIES_ and _SKIP_HOSTS_.
Lines starting with # are considered to be comments.

The objects may be described in one of two ways:

It shall be emphasized, however, that the items that appear in the rc-file are excluded from the top-n lists only; they still count as requested objects or entry pages!


What is necessary to run hglogstat?

hglogstat is a perl script and takes advantage of the features new in perl 5. So, the first prerequisite is perl 5 to be installed on your system.
The graphics are produced by Gnuplot, which is called by the script. So, this has to be installed, too. Since Gnuplot does not produce gif outputs (at least my version 3.5 (pre 3.6) does not), ppmtogif is called to do the translation. So, this, too, should be installed.
Finally, insertion of the HTML document into the database is done by hginstext. If you have this installed on your system, too, nothing can keep you from working with hglogstat.


History

Changes since hglogstat 1.12

Changes since hglogstat 1.11


Known Bugs

Of course, there are some minor bugs, but none of them is really serious.


Author

Alfons Schmid (aschmid@iicm.edu) - September 25, 1996