Webserver Log Analysis

From Schmid.wiki
Jump to: navigation, search

Contents

Choice

I have chosen Webalizer over Analog, because I had trouble getting Analog to work as I wanted... (not very much trouble, but I'm extremely lazy!)

Installation

# emerge webalizer   <- (or perhaps # emerge app-admin/webalizer)

Create a cron script for automatic analysis. When my logs grow in size, perhaps it should only be run once a day (e.g. use cron.daily)

# vim /etc/cron.hourly/analyze_apache_logs.sh
   #!/bin/sh
   webalizer

Configuration

# mkdir /var/www/path/to/webalizer
# chown schmid:users /var/www/path/to/webalizer
# vim /etc/webalizer.conf
LogFile        /var/log/apache2/access_log
OutputDir      /var/www/path/to/webalizer
/etc/webalizer.conf:
LogFile        /var/log/apache2/access_log
OutputDir      /var/www/schmid.dk/public/private/analysis
HostName       schmid.dk
PageType        htm*
PageType        php
# these are necessary for DNS names to appear in the analysis
DNSCache        /tmp/dns_cache.db
DNSChildren     5
CountryGraph    yes
DailyGraph      yes
DailyStats      no
HourlyGraph     yes
HourlyStats     no
GraphLegend     yes
TopKSites       0
TopURLs         30
TopKURLs        0
TopCountries    10
TopEntry        0
TopExit         0
HideSite        localhost
HideReferrer    *.schmid.dk*
HideURL         *createpic.php*
GroupSite       inktomisearch.com
HideSite        inktomisearch.com
GroupSite       googlebot.com
HideSite        googlebot.com

References

Personal tools