Welcome to WebHeadStart.org

Web Technologies

Sponsored By

WebHeadStart.org is currently in beta.
Please pardon our appearance as we work to provide you with the most comprehensive reference on today's web technologies.

Interested in advertising on WebHeadStart? Become an advertising partner today!

[WWW-HTML Mailing List Archive Home] [Messages By Thread] [Messages By Date]

Re: XHTML2: Proposal for total separation of semantics from structure

From: Sjoerd Visscher <sjoerd@w3future.com>
Date: Fri, 26 Aug 2005 00:22:05 +0200
Message-ID: <430E448D.5050105@w3future.com>
To: www-html@w3.org

Junk Account wrote:
> As far as "structure" goes, this means simply an enumeration of the
> parts, in such a way as to form e hierarchy. It does not say anything
> about what each of those parts might actually mean.
> It does not say if it is a paragraph, a piece of code, a lyric, the
> abstract of a scientific paper, a movie, a picture, or whatever else.
> It is simply a hierarchy. A tree.
> "What each part means" is the semantics.

I've always considered titles, paragraphs, tables and lists to be 
structure. They give the content no meaning. Structure *is* semantics, 
but only up to a certain level. Structure is all the meaning which is 
usefull without actually knowing what the text means. F.e. I can create 
a ToC of a properly structured Japanese document, without knowing any 
Japanese.

> The per-capita income of Kenya for 1988 is probably there, somewhere in the net.
> Can you get it in just one search?

Actually, almost. I first tried "The per-capita income of Kenya for 
1988", but that got me only documents with recent information, so I 
added "history". The third page of that query tells me that it is 370 
current US Dollars.

> Even if you got more or less decent results....could you (or even a
> search engine, programatically) extract just that microcontent? Or
> would I need to load and read whole pages looking for the relevant two
> lines in each?

Apparently Google can, again almost. If your search for "The per-capita 
income of Kenya", Google responds with "Kenya ? GDP - Per Capita: $ 1,100".

> If microcontent is not programatically extractable, furthermore, then
> I'd have to do some screen scrapping to be able to reuse that content.

Which is probably what Google does. They are doing something special 
with the CIA World Factbook.

> Shouldn't we be providing ways to hook extensible meaning at all
> levels (including the elemental one), in order to facilitate such a
> thing?
> 
> Are we providing extensible means to mark up microcontent semantically?
>
> To add semantic functionality to the web probably requires whole
> lenguages, with concepts as "is-part-of", inheritance, and even more
> complex relationships. And being such a vast thing, likely requires
> extensbility inherently.
> If you ask me, I'd separate. In advance.

I think XHTML 2.0 already does a very good job, with the Metainformation 
and Role Modules. It just needs to take the extra step, and remove the 
old semantics-only elements (address, code, quote, kdb, etc)

-- 
Sjoerd Visscher
http://w3future.com/weblog/ 
Received on Thursday, 25 August 2005 22:22:22 GMT
Valid XHTML 1.0! Valid CSS! Site Map | Privacy Policy | Terms of Use | WebHeadStart.org © 2005 All Rights Reserved.