>
>...and can be processed quite nicely as well. A Perl script to go and pull
>out the blocks of headers from this king of archive should be under 20
>lines long (look for \nFrom:, save until you get to \n\n, repeat).
^
Space, not colon, after that From. The "From:" (with colon) may appear
anywhere in the headers (and almost always appears *after* the Received:
headers needed to chase spam). But since \nFrom may appear in messages,
using formail (part of the free procmail package) to get the headers
will likely do the best job.
(In non-UNIX-mbox-format mailboxes, you may need to replace the method
of finding start of message headers, whether using perl or not. However,
many non-UNIX systems/clients also adopt this format.)
Cheers,
Stan
|
|