[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Message body size limits? (Bigger Problem)



On Sun, Dec 4, 2011 at 12:16 AM, Tom Hutchison <tom@xxxxxxxxx> wrote:
> Is it possible some malformed email could be causing a parsing error? What I
> am getting at. If I have 250 emails in a folder, how is it the run on the
> folder is writing 260. The extra ten being date and subject blank,
> sometimes, and sometimes, with or without content.

Your problem is likely related to the following FAQ entry:

http://www.mhonarc.org/MHonArc/doc/faq/archives.html#split

> When the parser reads them, is it possible Mhonarc is picking up on
> malformed reply quotes and thinks they are new emails within the actual
> email? So instead of 4 emails in the above example, it thinks there are 6.
> Garbage in, garbage out comes to mind.
>
> I did solve the broken HTML, not very efficently with Outlook 2010 as it
> does allow for a striping of all HTML code by setting the open email to
> “edit” then choosing “plain text”  after you edit anything in the body of

IIRC, Outlook allows a text/plain alternative to be generated along
with the HTML part.  You can use the MIMEALTPREFS resource, as noted
in the FAQ, to give higher precedence to text/plain over text/html.

> the email. Even if it is just a carriage return or a space. Close the email
> and save on exit and the whole email is rewritten, stripping out all HTML
> and resetting the header information to show “plain/text” and whatever you
> have the encoding set to. Stripping out all HTML from the emails was the
> only way I could think of to solve the unclosed <table> attribute in quite a
> few emails which was causing problems with the msgxxx.html pages.
>
> It’s long past time for standardized header and html format for email. If
> anything it might secure them more...

text/enriched was created a long time ago to provide enhanced formatting
of email messages, but it faded away when the Web grew and HTML became
a defacto markup format for "enriched" text.

IMO, it is inexcusable for major software/services organizations to generate
such malformed HTML.  Dealing with malicious HTML is one thing, but when
non-malicious-generated HTML is so badly formatted (when it should not
be) it makes the lives of consumers of such content much more difficult.

--ewh



[Index of Archives]     [Bugtraq]     [Yosemite News]     [Mhonarc Home]