我正在使用下面的程序对电子邮件进行排序并最终打印出来。一些消息可能包含附件或HTML代码,这对打印不利。有没有一种简单的方法来剥离附件和去除HTML而不是HTML中从消息格式化的文本?如何从电子邮件中去除HTML和附件?
#!/usr/bin/perl
use warnings;
use strict;
use Mail::Box::Manager;
open (MYFILE, '>>data.txt');
binmode(MYFILE, ':encoding(UTF-8)');
my $file = shift || $ENV{MAIL};
my $mgr = Mail::Box::Manager->new(
access => 'r',
);
my $folder = $mgr->open(folder => $file)
or die "$file: Unable to open: $!\n";
for my $msg (sort { $a->timestamp <=> $b->timestamp } $folder->messages)
{
my $to = join(', ', map { $_->format } $msg->to);
my $from = join(', ', map { $_->format } $msg->from);
my $date = localtime($msg->timestamp);
my $subject = $msg->subject;
my $body = $msg->decoded->string;
# Strip all quoted text
$body =~ s/^>.*$//msg;
print MYFILE <<"";
From: $from
To: $to
Date: $date
Subject: $subject
\n
$body
}
你能修的链接linuxquestions.org? – innaM 2008-12-16 12:49:45