21 messages in org.w3.www-htmlTag Soup vs Generalized Markup (was: ...
FromSent OnAttachments
Larry MasinterSep 23, 1999 12:15 pm 
Jukk...@hut.fiSep 24, 1999 2:44 am 
Walter Ian KayeSep 24, 1999 10:39 am 
Arjun RaySep 27, 1999 6:31 pm 
Harald Tveit AlvestrandSep 28, 1999 10:51 am 
Arjun RaySep 28, 1999 5:01 pm 
Harald Tveit AlvestrandSep 30, 1999 11:41 am 
Arjun RaySep 30, 1999 11:55 pm 
Larry MasinterOct 4, 1999 1:06 pm 
Arjun RayOct 4, 1999 9:56 pm 
Larry MasinterOct 5, 1999 6:55 am 
Arjun RayOct 6, 1999 12:04 am 
Arjun RayOct 6, 1999 3:29 am 
Rick JelliffeOct 6, 1999 6:37 am 
Arjun RayOct 6, 1999 6:39 am 
Russell Steven Shawn O'ConnorOct 6, 1999 6:42 am 
Arjun RayOct 6, 1999 10:17 am 
Arjun RayOct 7, 1999 10:00 pm 
Larry MasinterOct 8, 1999 2:55 am 
Jukk...@hut.fiOct 8, 1999 3:27 am 
Arjun RayOct 8, 1999 3:49 pm 
Actions with this message:
Paste this link in email or IM:
Paste this link in email or IM:
Atom feed for this thread
Paste this URL into your reader:
Subject:Tag Soup vs Generalized Markup (was: I-D ACTION..)Actions...
From:Arjun Ray (ar@q2.net)
Date:Oct 6, 1999 6:39:35 am
List:org.w3.www-html

On Tue, 5 Oct 1999, Larry Masinter wrote:

[This is the third time I'm quoting the same passage, and only now am I really addressing the meaning of the last clause! Sorry about that.]

I don't think this reduces the value of specifying what 'text/html' *should* be, although I agree it makes implementation hard.

The hard part is reconciliation of two different paradigms. Tag soup processors are not difficult to write. Nor for that matter, are (S)GM(L) processors. (A *validating* SGML parser, OTOH, is indeed not a task for mere mortals:)) Mixing the two, however, is a nightmare, because the paradigms actually reflect a classic tradeoff - the simple contextless versus the sophisticated contextual, and thus the stolidly robust versus the delicately powerful. Just as the tagsoup processor is too *dumb* to get into trouble (so it doesn't matter what kind of dog's breakfast you feed it) the GM processor demands correspondingly greater coherence in its input for its smarts.

Now, we'd all love to have those smarts working for us, except that Mosaic and its spawn popularized the dumbs. Moreover, there's no mystery to the popularity. The freedom to toss any random mishmash of tags into a wowser set a very low bar; this has turned out to be extraordinarily empowering. People are not going to give it up easily. ("But it works in Netploder, and that's good enough for me.")

HTML is Humpty Dumpty toppled a long time ago. There really isn't a cause or even a need for a "should". Because no one with two braincells to rub together is ever going to bother to write a "conforming HTML processor" in relation to the spec as it stands today. The non-compliance is massive to total.

The I-D should point to a Tag Soup spec, and a separate SGML-based spec should probably be written up as a W3C Note. (Because there may be value to modularized HTML as a family of architectures.)

Arjun