[SpamCop.net - protecting the internet through technology]

[SC-Help] Re: false multipart header, Spamcop misses URL in body

Mike Easter MikeE at ster.invalid
Tue Sep 28 10:45:07 EDT 2004


Joe Blow wrote:
> "Mike Easter"
>> The parser uses the content type line in the headers and the content
>> type line in the body when it is separated by boundary delimitors to
>> tell it the 'content type'

> Text is text....and that is all that is being parsed.  The parser
> doesn't have to deal with the content type because it doesn't
> matter...the message is just a blob of text.

No, you are mistaken.  You don't understand how the parser uses content
type in its body parsing.

> All the parser needs to
> do is look for hyperlinks within that text.   It seems to me that
> you're implying the task is more difficult than that....why?

Because it is.  [You hate answers like that, but you present your
arguments in ways that demand those answers.]  You are going to have to
look at more good and bad body parses of various spamtypes, correctly and
incorrectly constructed, to get the full picture.  I cannot write a
treatise on that here.

The body parse does *not* simply look for the occurrence of
http://domain.tld - but instead must find it properly 'textually' [or
'contextually' textually] constructed correctly for its appropriate
content type.

>> Content type is a legitimate MIME descriptor, and the accurate
>> determination of the notify requires that the /actual/ spamvertised
>> sites get notified accurately - which requires being 'right' about
>> the content type.

Read that above again.

-- 
Mike Easter
kibitzer, not SC admin



More information about the SpamCop-Help mailing list