miriam_e: from my drawing MoonGirl (Default)
[personal profile] miriam_e
I mentioned a little while ago how I realised that there was an extremely easy way to stop pretty-much all spam. I found it hard to believe that I was the only one to come up with such an idea so I started looking into it. I'm extremely surprised to find that we should really be getting almost no spam at all. The major reason spam continues to swamp the network is that most mail servers don't actually implement the existing standards and everybody basically says "oh it's too hard" when really, it isn't.

I have first-hand experience at giving in to this. For many years I used Eudora for my email. A few years ago I found a lot of emails suddenly weren't getting through to me. At first I thought it was a bug in the latest version of Eudora, but eventually I realised Eudora was doing the right thing. It didn't recognise broken emails. In almost all cases those broken, invisible emails came from people using Microsoft Office to send emails. Amazing!

I tried and tried to get people to use a safe and standards-compliant email program, but I might as well have been talking to trees. In the end I had to drop Eudora because I couldn't afford to lose work. Mozilla is more lax, and I now get all those emails, but that isn't necessarily a good thing. I am seriously considering going back to Eudora. If people want to use broken shit then maybe I just shouldn't see it. The trouble is, of course, most people don't realise how seriously flawed Microsoft products often are. They are encouraged to think that they're some kind of standard.

I still get people sending me Microsoft Word and Microsoft PowerPoint documents as if they were standards. Aaaarrrgh! I and many other people don't own or want Microsoft Word or PowerPoint. They are costly, dangerous, bloated, anti-standards programs.
[end rant]

RFCs

Incidentally, in my reading about email standards I've been referring to the original RFC documents (Request For Comments) that are the discussion papers that most of the standards grow out of. I noticed that the very early ones are now up there too. http://www.faqs.org/rfcs/rfc-index.html For some time many of the early ones were missing.

The very first one rfc1 is there. It talks about packets (called messages) with a header containing a 5-bit destination. That means it could be sent to one of up to 32 possible destinations. hehehehehe :) Somehow I suspect the awsome future of the internet had not hit them yet.

Date: 2006-10-18 03:06 am (UTC)
From: [identity profile] greylock.livejournal.com
I've used Eudora for years.
It's not the most sturdy email client, but I've lost very few emails - I get a lot of spam that is corrupt and I have the option of downloading it as text or deleting it from the server...

I also do not use Word, but get 20-30 documents a day in Word format. OpenOffice has very few issues with it.

Date: 2006-10-18 05:48 am (UTC)
From: [identity profile] miriam-e.livejournal.com
I don't particularly like Open Office, but feel compelled to use it because of all the Word docs that get sent around. It just takes Microsoft to change the format again (as they have done so many times) for OpenOffice to stop reading them.

I much prefer HTML. You can do almost anything far more efficiently in HTML. I'm editing a book for a lady who sent it to me in Word format. On converting it to proper HTML (not Word's bastardised form of HTML or OpenOffice's bloated HTML) it was around half the size -- 1MB from 2.5MB.

Date: 2006-10-18 05:51 am (UTC)
From: [identity profile] greylock.livejournal.com
I would, personally, smack anyone in the head who sent me a document in html.
If I read it.

HMTL, in my world, is a format for websites, nothing else.

I'd prefer pdf.

Date: 2006-10-19 02:20 am (UTC)
From: [identity profile] miriam-e.livejournal.com
Eeek! No. PDF is a terrible, proprietary format suitable only for printing onto paper. HTML is one of the few truly electronic formats available. Not only does it adapt itself dynamically to the kind of display device you have, but it is totally open and easy to create and modify. Also, pretty-much everybody who has a computer has an HTML viewer.

PDF viewers, while they are free, are large, slow, and difficult to search. If the text is in image form (and I've seen that in far too many PDF documents) then you can NOT search it. The older versions of the PDF format are almost open, in that Adobe have published details of the format, but they still own it and they constantly change it.

HTML is an extremely efficient format. I've on a few occasions had to convert PDF documents to other formats -- a nightmare, because the so-called portable document format is anything but portable. But the result, if converted to HTML, is almost invariably a tiny fraction the filesize with no loss of information or style. PDF is bloated and unwieldy.

One of the worst things about PDF is that it is an extremely unforgiving format. If you download a webpage and you got all but the last byte, no problem. The browser will display as much as it got. If you download a multi-megabyte PDF document and the final byte is damaged, the PDF viewer will refuse to show it for you. What is more, because caching of documents is common now on the net, you can't force downloading from the original source. If you download it a second time you will just get the cached damaged version again. At least with HTML you can hit shift-reload and it will force the request to bypass the caches.

Here are a couple of links on why PDF is a really bad idea for anything other than printing on paper:
http://www.useit.com/alertbox/20030714.html
http://www.useit.com/alertbox/20010610.html
and a report on the problems for search engines when presented with PDF documents
http://www.searchtools.com/info/pdf.html

Date: 2006-10-19 02:22 am (UTC)
From: [identity profile] greylock.livejournal.com
Hmmm.
I may have to rethink my anti-HTML stance.

Profile

miriam_e: from my drawing MoonGirl (Default)
miriam_e

December 2025

S M T W T F S
 123456
7 8 910 111213
1415 1617181920
21222324252627
28293031   

Style Credit

Expand Cut Tags

No cut tags
Page generated Dec. 26th, 2025 01:58 pm
Powered by Dreamwidth Studios