url.list(5)
NAME
url.list - websec url monitoring configuration
DESCRIPTION
The URL list consists of one or more sections separated by newlines.
You can have sections without a URL, they will update the defaults for
all the subsequenet blocks. The Name and Prefix parameters are required
as well as one of Email, EmailLink and Program. The rest are completely
optional. The configuration is expected to be of the form "keyword =
value".
- The following parameters (case-sensitive) are recognized in each section:
- URL - URL of web page to monitor
- Name - Name of web site. Pages delivered to you will have the
following format: "Name - Date (Day)" eg. "PC Magazine - 4 Sep 98 (Fri)"
- Prefix - Prefix of filenames for archive files of web pages created
- by Web Secretary.
- Email - Comma-delimited list of email addresses to send highlighted
- pages to.
- EmailLink - Comma-delimited list of email addresses to send URL of
- changed pages to.
- Program - application to call with the diff-file, special cases:
- "mozilla", pages are opened in new tabs,
"konqueror", pages are opened using "kfmclient openURL" - Auth - Authentication information in "username:password" format.
- Put "none" if no authentication needed.
- Diff - Put "none" if you want Web Secretary to always mail this
- page to you instead of checking for and highlighting
changes in the page. Put "webdiff" if you want Web
Secretary to check for changes.
Put "htmldiff" to use a different implementation of HTML diff output. Note that you need to install the perl-modules Algorithm::Diff and HTML::Diff which are availabe on
http://www.cpan.org/ for this to work. - Hicolor - Color used to highlight new or changed content. Currently,
- four colors are defined. They are: blue, pink, yellow and grey. You can also supply your own HTML color tag in the form "#rrggbb".
- Ignore - Comma-delimited List of section names containing ignore
- keywords. There must be NO SPACES between delimiters and section names. The ignore sections and keywords are stored in a file called "ignore.list".
- IgnoreURL - Comma-delimited List of section names containing ignore
- URLs. There must be NO SPACES between delimiters and
section names. The ignore sections and keywords are stored in a file called "ignore.list". - AsciiMarker - If set to 1 it will add ascii markers around the changes so
- that highlighting is noticeable in text mode too. Useful for
text MUAs users.
- Tmin - Every token containing <= Tmin words will not be highlighted
- for differences.
- Tmax - Every token containing >= Tmax words will not be checked for
- ignore keywords.
- Proxy - Specify proxy "http://your.proxy.here:portnum" if you are
- using one. (Alternatively, you can make use of the
"http_proxy" environment variable) - ProxyAuth - Specify proxy authentication in "username:password" format.
- The code for this feature was contributed by Volker Stampa.
- MailFrom - The E-Mail address to send mail from, this can be left empty
- and the user used to run websec will be used.
- ProgramDigest - If specified "true", websec does not open all changed pages
- separately with the application specified in "Program", but opens a summary page that contains links to all changed pages.
- Digest - true|false or yes|no. This works only if EmailLink is
- specified. It consolidates all the changed URLs and sends them in one email.
- UserAgent - The User-Agent that will be sent by the web client. This can
- be used to bypass servers that prevent access based on the user agent.
- DateFMT - Date format to use in e-mail messages, can be empty for no date.
- Set it to " - %Y-%m-%d" for ISO dates. This is perl format for dates.
- RandomWait - Websec waits for a random number of seconds between retries up
- to the value specified by the RandomWait keyword. This is to prevent websec from being blocked by websites that perform log analysis to find time similarities between requests.
- Any line which begins with a '#' is treated as comment and ignored.
- If a section does not contain a URL entry, the values provided will be treated as the default for the following sections.
- For example,
# Defaults
Auth = none
Diff = webdiff
Hicolor = blue
Ignore = General,Date_Time
IgnoreURL = Adverts
Tmin = 1
Tmax = 10
Proxy = http://proxy.nus.edu.sg:8080
Email = vchew@post1.com- # Web page to monitor which does not require authentication
URL = http://browserwatch.iworld.com/news.html
Name = Browser Watch
Prefix = browsewatch - # New defaults with authentication information
Auth = user:password - # More web pages to monitor which requires authentication
URL = http://www.infoworld.com
Name = Infoworld
Prefix = infoworld - URL = http://developer.javasoft.com/
Name = Java Developer Central
Prefix = jdc
SEE ALSO
AUTHOR
- Baruch Even <websec@ev-en.org> is maintaining this program.