h a l f b a k e r y

"My only concern is that it wouldn't work, which I see as a problem."

computer:web: searching

<start here>

Eliminate navigation garbage for better searching.

(+5, -2)

Many pages on the web, this one included, have a lot of navigation information as well as actual content. Authors should have a tag (or other means) available to differentiate between the two.

The benefit of this is that search engines, data mining tools, etc, need only read the relevant part of a page.

CLARIFICATION: I didn't actually think <start here> would be the tag. As annotated, <content>...</content> would do the job admirably.

—	sadie, Aug 07 2002

<start here> wouldn't work, as 'here' would be interpreted as an attribute.
<croissant></croissant>

—	NickTheGreat, Aug 07 2002

Why not just put the <start here> information on the main page and skip the extra step?

—	Mr Burns, Aug 07 2002

As I remember it, the interspersing of navigation garbage with actual content was the whole benefit of HTML, which otherwise does rather poorly at layout. As an example, take the links on contributors' IDs on this page, which lead to their account pages.

If anything, people should be encouraged to more closely integrate navigation and content, not to strip it out. So, feeshbone.

—	DrCurry, Aug 07 2002

// search engines, data mining tools, etc, need only read the relevant part of a page.

It depends on the indexing software, but they already do (e.g. <head>, <meta>, <a>).

Fewer and fewer web pages are 'hand written' these days; a lot are published from content management software, which makes it impossible to say <start here> anywhere on a page fragment. You have no prior knowledge of where the fragment will appear in relation to anything else.

—	namaste, Aug 07 2002

I see site designers using the presence of such a browser enhancement to redirect any offsite traffic to a gateway page.

—	reensure, Aug 08 2002

// <head>, <meta> <a>

That gets rid of the 'official' metadata, but not the garbage that appears on the page itself.

I don't see how it would be hard, [namaste]. On most auto-generated pages, it's quite clear which parts are real content and which are meta-content.

Applying this to HB, it's quite easy. The title, slugline, original idea and annotations would be marked as content; the logo, sidebar and related ideas list at the top wouldn't. Is that so hard?

[reensure]... huh?

—	sadie, Aug 09 2002

<content>So, something like an optional <keyword>content tag</keyword> pair that you put around that which is actual content rather than navigational stuff.</content>Return to top. <content>The browser wouldn't recognise the tag and so would ignore it. The <keyword alt="searchbot, find, webcrawler, crawler">search</keyword> 'bot, upon finding the tags, would strip out all other tags between them and be left with a list of words, names and numbers that appear on the web page.

If I understand you right then this looks to be a simple move. Not much benefit, not much cost either.

You could even extend the concept by putting <keyword><keyword>keyword</keyword> <keyword>tags</keyword></keyword> around relevant words to increase the chance of them getting indexed. </content>

—	st3f, Aug 09 2002
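
The stripping step st3f describes could be sketched roughly as follows, in Python. This is only an illustration under assumptions: the `<content>` and `<keyword>` tags are the hypothetical ones proposed in this thread (not part of any HTML standard), and the names `ContentExtractor` and `extract_content` are invented for the example. It keeps the text between `<content>` tags, drops everything else, and collects keyword hints from `<keyword>` text and its `alt` attribute.

```python
from html.parser import HTMLParser


class ContentExtractor(HTMLParser):
    """Keep text inside <content>...</content>; note <keyword> hints.

    Both tags are the hypothetical ones proposed in the thread.
    """

    def __init__(self):
        super().__init__()
        self.content_depth = 0   # >0 while inside a <content> element
        self.keyword_depth = 0   # >0 while inside a <keyword> element
        self.text = []           # text fragments found inside <content>
        self.keywords = []       # keyword hints, in document order

    def handle_starttag(self, tag, attrs):
        if tag == "content":
            self.content_depth += 1
        elif tag == "keyword" and self.content_depth:
            self.keyword_depth += 1
            # Treat a comma-separated alt attribute as extra keyword hints.
            alt = dict(attrs).get("alt")
            if alt:
                self.keywords.extend(w.strip() for w in alt.split(","))
        # Any other tag is simply ignored, i.e. stripped out.

    def handle_endtag(self, tag):
        if tag == "content" and self.content_depth:
            self.content_depth -= 1
        elif tag == "keyword" and self.keyword_depth:
            self.keyword_depth -= 1

    def handle_data(self, data):
        if not self.content_depth:
            return  # navigation or other non-content text
        self.text.append(data)
        if self.keyword_depth and data.strip():
            self.keywords.append(data.strip())


def extract_content(html):
    """Return (content_text, keywords) for one page of markup."""
    parser = ContentExtractor()
    parser.feed(html)
    parser.close()
    # Collapse runs of whitespace left behind by the removed tags.
    text = " ".join(" ".join(parser.text).split())
    return text, parser.keywords


page = (
    '<a href="/">home</a> '
    '<content>Real <b>idea</b> text with a '
    '<keyword alt="crawler, bot">search</keyword> term.</content> '
    '<a href="#top">Return to top</a>'
)
text, keywords = extract_content(page)
print(text)      # Real idea text with a search term.
print(keywords)  # ['crawler', 'bot', 'search']
```

A browser that does not know the tags would still show the page normally, as the annotation says; only a tool that chooses to look for them would behave differently.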

sadie, do you want a tag to indicate to your browser it is (or is not) showing the start page for the browsed site? If so, you could design a custom toolbar button <-Start Back> to browse the site back to the start of the author's content. I'm deliberating if this type of searching is more powerful, more respectful, or less fun.

—	reensure, Aug 09 2002

I don't think so, reensure. What I think sadie is after is a pair of tags that you use to indicate content on a per page basis.

—	st3f, Aug 10 2002

Oh, okay. I suspected it was more for the benefit of search engines after reading a few annos. At first the idea seemed to be a call to revise the entry point of a browser to a point in a url where its author had tagged data as content, regardless of where a search bot may have cached some keyword from the body of the content.

—	reensure, Aug 10 2002

You're along the right lines, but not just search engines. There are an increasing number of data mining tools, translators, summary tools, things that read pages for blind people, etc. A content tag would help all of them.

—	sadie, Aug 14 2002

Search engines should just automatically mark down the relevance of content if that content is copied between the current page and pages linked to/from it. This should focus in more on specific pages, instead of listing every page on a site whose main menu matches a keyword you enter.

—	ironfroggy, Jan 03 2003
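
One way to picture ironfroggy's suggestion is the small Python sketch below. It is a toy under stated assumptions, not how any real search engine works: each page is given as a list of text blocks, `links` maps a page to the pages it links to, and the function name `score_pages` and the `shared_weight` of 0.1 are made up for illustration. A block that also appears verbatim on a linked page (in either direction) counts for much less than a block unique to the page.

```python
def score_pages(pages, links, keyword, shared_weight=0.1):
    """Score each page for a keyword, marking down copied blocks.

    pages: dict of page name -> list of text blocks on that page
    links: dict of page name -> set of page names it links to
    shared_weight: hypothetical weight for blocks repeated on linked pages
    """
    keyword = keyword.lower()

    # Treat links as two-way: "linked to/from" in the annotation.
    neighbours = {name: set() for name in pages}
    for src, targets in links.items():
        for dst in targets:
            if src in pages and dst in pages and src != dst:
                neighbours[src].add(dst)
                neighbours[dst].add(src)

    scores = {}
    for name, blocks in pages.items():
        # Every block that appears on some neighbouring page.
        shared = set()
        for other in neighbours[name]:
            shared.update(pages[other])

        total = 0.0
        for block in blocks:
            if keyword in block.lower():
                total += shared_weight if block in shared else 1.0
        scores[name] = total
    return scores


pages = {
    "index": ["Home | Search | About", "Welcome to the site."],
    "search": ["Home | Search | About", "How our search index works."],
    "about": ["Home | Search | About", "Who we are."],
}
links = {"index": {"search", "about"}, "search": {"index"}, "about": {"index"}}

scores = score_pages(pages, links, "search")
best = max(scores, key=scores.get)
print(best)  # search
```

Here every page's menu contains the word "search", but only the page whose own text mentions it ranks clearly above the rest, which is the effect the annotation asks for.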
