Well, if he uses point-and-click software that creates HTML atrocities
like this ("etc." means appropriate ( or inappropriate) markup I'm too
lazy to think of appropriate (or inappropriate) fake values for and type
in; '$' indicates long line gone past the edge of the screen as displayed
by PICO):
<body>
<table etc.>
<tr etc.>
<td etc.>
<table etc.>
<tr etc.>
<td etc.>
<font color=""000000">
<img src="http:$
<br>
Welcome to Foo $
</font>
</td>
<td etc.>
<font color=""000000">
<img src="http:$
</font>
<td etc.>
</tr>
</table>
</td>
</tr>
</table>
.... and so forth, and so forth, then stripping out leading spaces could
be very helpful. One >1000KB spam I got from Argentina was reduced to
about 50KB merely by stripping the leading spaces from all of the lines.
There were some lines almost 200 characters long with a single "</td>" or
"</tr>" or "<br>" at the end of the line being the only non-space
characters in the line.
And, yes, I have seen images wrapped in <font>...</font> tags too.
Now, if the page author creates the HTML by hand, and uses CSS properly
instead of the redundant styles for paragraphs followed by paragraphs
with style attributes that specify the same thing again and again with
a span tag nested inside of that specifyies more stuff again and again
as generated by Word that may reduce the overhead to begin with. An
example from spam (the faint of heart may wish to avert their eyes even
though I have snipped most of it):
: <style>
: <!--
: /* Style Definitions */
: p.MsoNormal, li.MsoNormal, div.MsoNormal
: {mso-style-parent:"";
: margin:0in;
: margin-bottom:.0001pt;
: mso-pagination:widow-orphan;
: font-size:12.0pt;
: font-family:"Times New Roman";
: mso-fareast-font-family:"Times New Roman";}
: p
: {margin-right:0in;
: mso-margin-top-alt:auto;
: mso-margin-bottom-alt:auto;
: margin-left:0in;
: mso-pagination:widow-orphan;
: font-size:12.0pt;
: font-family:"Arial Unicode MS";}
: -->
: </style>
: </head>
:
: <body lang=EN-US link="#003399" vlink=purple style='tab-interval:.5in'>
:
: <div class=Section1>
:
: <p align=center style='margin-top:12.0pt;margin-right:0in;margin-bottom:0in;
: margin-left:0in;margin-bottom:.0001pt;text-align:center;mso-line-height-alt:
: 14.4pt'><span style='font-size:16.0pt;mso-bidi-font-size:10.0pt;font-family:
: Arial;color:#333333'>Travel M0NEY - Freedom <o:p></o:p></span></p>
:
: <p align=center style='margin-top:12.0pt;margin-right:0in;margin-bottom:0in;
: margin-left:0in;margin-bottom:.0001pt;text-align:center;mso-line-height-alt:
: 14.4pt'><span style='font-size:16.0pt;mso-bidi-font-size:10.0pt;font-family:
: Arial;color:#333333'><![if !supportEmptyParas]> <![endif]><o:p></o:p></span></p>
:
: <p align=center style='margin:0in;margin-bottom:.0001pt;text-align:center;
: mso-line-height-alt:14.4pt'><span style='font-size:16.0pt;mso-bidi-font-size:
: 10.0pt;font-family:Arial;color:#333333'>The Perfect Home Business<o:p></o:p></span></p>
:
: <p class=MsoNormal><![if !supportEmptyParas]> <![endif]><o:p></o:p></p>
:
: <p class=MsoNormal><![if !supportEmptyParas]> <![endif]><o:p></o:p></p>
:
: </div>
While I'm at it, does anyone know what possible use there is for the
"<o:p></o:p>" tags?
--
Can you Change: MINDWORKS to MINDWORKS (* == Book)
*HALIFAX HALIFAX*
in 76 moves? Try
http://www.chebucto.ns.ca/~af380/MHPuzzle.html(Requires a browser supporting the W3C DOM such as Firefox or IE ver 6)