h a l f b a k e r yBusiness Failure Incubator
add, search, annotate, link, view, overview, recent, by name, random
news, help, about, links, report a problem
browse anonymously,
or get an account
and write.
register,
|
|
|
The crawler bot for this one is looking specifically for numbers in the text content of web pages; especially floating point numbers, in a variety of formats, including scientific notation, numbers with commas (both American and foreign versions) and your "vulgar fractions." For our purposes it would
probably be best to index "6.000" as -both- "six (six point 0 0 0)" and "six thousand." Now that I mention it, I guess spelled out numbers in a few major languages should be implementable without too many lines of code.
The web-facing page will take an input that represents a quantity, and will also ideally be fairly eclectic in interpreting numeric, although it is not as mission critical here, but we seem to have the pattern matching software already in place, so why not? The output should be the indexed numbers that are equal to the search term, if any, accompanied by links to the pages where they were found. These zero or more exact matches should be followed by near matches in increasing order by absolute difference (which should produce the same sort order as the square of the difference).
An additional feature could be a second (optional) input for a function of one real value that returns a real result. In this case the search result on search term x would give pages containing numbers y such that x≊f(y) (or perhaps as an added option, f(x)≊f(y)). This part of the system would probably interface with some Maxima-type program to solve for "multiple inverses."
Please log in.
If you're not logged in,
you can see what this page
looks like, but you will
not be able to add anything.
Annotation:
|
|
This could be quite useful ... |
|
|
Quick search for 3.14 gets me ads for Dominos. Great success |
|
|
3.14159 returns "Cosine Secant Tangent Sine"... |
|
|
3.2 returns pi, but only in Indiana. |
|
|
My favorite constant: 0.739085133215 |
|
|
The problem: If I run a search on 0.739, and there exists a page somewhere on the web that contains the character sequence 0.7391, will existing search engines reliably match it to that search? |
|
| |