6 messages in net.sourceforge.lists.courier-maildropRe: [maildropl] Spam recipe
FromSent OnAttachments
Eric d'AlibutAug 7, 2007 12:55 pm 
Todd LyonsAug 7, 2007 1:19 pm 
Eric d'AlibutAug 7, 2007 3:17 pm 
Todd LyonsAug 7, 2007 3:59 pm 
Eric d'AlibutAug 7, 2007 7:11 pm 
Tony EarnshawAug 11, 2007 7:10 am 
Actions with this message:
Paste this link in email or IM:
Paste this link in email or IM:
Atom feed for this thread
Paste this URL into your reader:
Subject:Re: [maildropl] Spam recipeActions...
From:Todd Lyons (tly@ivenue.com)
Date:Aug 7, 2007 3:59:51 pm
List:net.sourceforge.lists.courier-maildrop

-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1

On Tue, Aug 07, 2007 at 06:18:18PM -0400, Eric d'Alibut wrote:

On 8/7/07, Todd Lyons <tly@ivenue.com> wrote:?

If you're running spamassassin, you can use the CVS version of FuzzyOcr which extracts and detects these spams.

Thanks for this steer. A quick glance at the package gives me the impression that the focus is on image files, gif's, jpeg's, etc. Am I right in thinking that these images are the payloads carried by all those spam pdf attachments? So that FuzzyOcr works with pdfs?

The spam pdf payload is the pdf itself. The original FuzzyOcr incarnation ran an ocr against the gif/jpeg/etc to extract text from it, then detection algorithms score the extracted text. The pdf spams simply get pdftotxt run against them and then the text is searched with the same detection algorithms. - -- Regards... Todd Exponential problems need logarithmic solutions. --Eddy Dreger Linux kernel 2.6.17-6mdv 1 user, load average: 0.80, 0.74, 0.57 -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.7 (GNU/Linux)

iD8DBQFGuPl+Y2VBGxIDMLwRAoHwAJ4qyHtZD8t/O7SU5LBJSkTleSSSngCdH7Qb 4jbCXJYA1uUm/nVVwmCkziA= =L6+8 -----END PGP SIGNATURE-----