Wed, 4 Feb 1998 06:23:19 -0500
|
I own and moderate the Advanced HTML list at UA1VM. I now have about 2
years of archive logs on UA1VM, and I am interested in:
1. Moving a copy of the old logs to a new server (netsquirrel.com)
although the list will continue to operate on UA1VM.
2. Modifying the logs on the new server so that it only includes
those e-mail letters with certain "topic" keywords in the
subject lines (in other words, I want the new archive to
include e-mail letters whose subject lines begin with
SUMMARY, INFO, or COMMENT but exclude those e-mail letters that
begin with QUESTON or ADMIN ... oh, and _all_ ADV-HTML posts
have one of those five words at the beginning of their subject
lines).
3. Slapping a high-quality search engine on the logs that will
be smart enough to parse the logs and return individual posts
(in other words, if I search for "Dynamic HTML" I don't want
the return to be the entire log but instead the individual
email(s) in that log that best match my search term).
4. "Subscribing" the new server to ADV-HTML so that the remote
server's archive will automatically update as each new letter
is sent to the list.
I can handle the first and fourth steps pretty easily (I just need to FTP
the logs to netsquirrel.com, and "subscribing" the archive server is just a
matter of an ADD). The second and third steps are stumping me, though.
So, I guess my questions are:
1) How do I parse out the "QUESTION" and "ADMIN" posts from
the logs that I am going to put on the new server?
2) Which search engine should I use (I've used MHonArc in the
past, but I don't think it had search capabilities then)?
.~~~. ))
(\__/) .' ) )) Patrick Douglas Crispen
/o o \/ .~
{o_, \ { **NEW** [log in to unmask] **NEW**
/ , , ) \ http://www.netsquirrel.com/
`~ '-' \ } ))
_( ( )_.' Warning: squirrels.
'---..{____}
|
|
|