Halfbakery: Overstand

Join me in developing (in plan, theory, and code) what I call Overstand technology. Instead of "getting it right", this technology knows its limits and uses the discussion with you to advance in steps toward the answer.

It can be interrupted in mid-thought, and in fact it interrupts itself all the time, whenever a better answer comes up in one of its competing threads. Unlike us, though, the interruptions don't distract it from what it set out to achieve.
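To make the self-interruption idea concrete, here is a minimal sketch in Python. Everything in it is an assumption made for illustration: the Thought record, the numeric score, and the think function are hypothetical names, and a real Overstand would need a far richer way to judge which thought is "better". The point is only that the goal stays fixed while the current answer gets replaced.

```python
from dataclasses import dataclass


@dataclass
class Thought:
    thread: str   # which competing thread produced this thought
    answer: str   # the candidate answer it proposes
    score: float  # hypothetical quality estimate, higher is better


def think(goal, stream):
    """Consume thoughts in arrival order; a better one interrupts the current one.

    The goal is passed through untouched: interruptions replace the
    current answer, never the thing we set out to achieve.
    """
    current = None
    interruptions = []
    for thought in stream:
        if current is None or thought.score > current.score:
            if current is not None:
                # Record who interrupted whom, so the process can be explained later.
                interruptions.append((current.thread, thought.thread))
            current = thought
    return goal, current, interruptions


# Illustrative input only: three thoughts from two competing threads.
stream = [
    Thought("literal", "a greeting", 0.4),
    Thought("ironic", "a joke", 0.7),
    Thought("literal", "a greeting again", 0.5),
]
goal, best, interruptions = think("classify the remark", stream)
print(goal, "->", best.answer, interruptions)
```

The third thought scores lower than the one currently held, so it is heard but does not interrupt.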

It creates a "lexicon and grammar" weighted network for each field of discussion. It has a summary and a mood, and a mode of conversation, even claiming to have feelings. It can be scientific or empathic, or funny. But this time it "gets" the joke before it tells one, and learns to categorize the humor and "see where it's going".

I am using the following types of information which all tie in to create artificial comprehension:

From linguistics: phonetics, phonemics, accents and sounds (analyzing the sounds and finding the vowels and consonants, rhymes, imitations of nature, etc.).

Also lexical, morphological and syntactical analysis (which means finding the words in the dictionary, checking the grammar that is used for those words, and analyzing the sentences and paragraphs that are built from them).

Semantics, pragmatics, context (getting the literal meaning along with several levels of possible extra or deeper meaning). Analyzing the style and goal of the topic discussed, and creating a "lexicon and grammar" for each kind of conversation and field of discussion.
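As a rough sketch of the "lexicon per field" part (the grammar part is left out), one could keep a weighted word list for each field and use it to guess which field a new sentence belongs to. The class name FieldLexicons, its methods, and the plain word counts used as weights are all assumptions for illustration, not a proposed design.

```python
from collections import Counter, defaultdict


class FieldLexicons:
    """One weighted lexicon per field of discussion (weights are plain word counts)."""

    def __init__(self):
        self.weights = defaultdict(Counter)

    def learn(self, field, text):
        # Every word seen in a field's conversation raises its weight there.
        self.weights[field].update(text.lower().split())

    def guess_field(self, text):
        # Score each field by the summed weights of the words in the text.
        words = text.lower().split()
        scores = {f: sum(c[w] for w in words) for f, c in self.weights.items()}
        return max(scores, key=scores.get) if scores else None


# Illustrative training sentences only.
lex = FieldLexicons()
lex.learn("medicine", "take one tablet twice daily")
lex.learn("cooking", "stir the sauce twice")
print(lex.guess_field("one tablet daily"))
```

Word counts are the crudest possible weighting; the sketch only shows that each field gets its own lexicon and that the lexicons compete to explain the input.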

We'll start like children with simple language, fulfilling the Overstand's needs (which will perhaps be an artificial equivalent of some of the basic human needs).

It doesn't have to be fast (at least during the first stages). The three main things that make it different from the large models are:

1. It listens to its own competing versions of thought before emitting the result.

2. It remembers its conversations and can criticize itself on the go, before, during, and after emitting the answer, reshaping its own thoughts.

3. The knowledge core itself is built comprehensively and not statistically, so that it can tell you what it understood and how it got to that understanding. It can correct its responses on the go and can show us where there are gaps and possible errors in its replies and thoughts.
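The second and third points could be sketched as a record that carries its own reasons and gaps, plus a criticism pass that demotes unsupported reasons to gaps. The names Understanding and criticize, and the idea of checking reasons against a simple set of known facts, are hypothetical simplifications chosen for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Understanding:
    claim: str
    because: list = field(default_factory=list)  # how it got to this understanding
    gaps: list = field(default_factory=list)     # what is still unverified

    def explain(self):
        lines = [f"I understood: {self.claim}"]
        lines += [f"  because: {reason}" for reason in self.because]
        lines += [f"  gap: {gap}" for gap in self.gaps]
        return "\n".join(lines)


def criticize(u, known_facts):
    """Return a revised Understanding: reasons not among known_facts become gaps."""
    supported = [r for r in u.because if r in known_facts]
    unsupported = [r for r in u.because if r not in known_facts]
    return Understanding(u.claim, supported, u.gaps + unsupported)


# Illustrative example only.
draft = Understanding(
    "the note is a prescription",
    because=["there is a doctor's stamp", "one word looks like a drug name"],
)
checked = criticize(draft, known_facts={"there is a doctor's stamp"})
print(checked.explain())
```

The original draft is left untouched and a revised copy is returned, so the memory of what it thought before the criticism is kept.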

Because of the bad state of info in the large models, AI results would not be very useful, at least in the beginning. But since a module that learns to find the sources of information is one of the basic parts of Overstand, a coherence and fact-checking comprehension module can easily be constructed for reading info off the web, and especially AI-generated crap.
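A very small sketch of what such a check might look like, assuming (purely for illustration) that text has already been boiled down to (subject, value, source) triples. The check function and the triple format are hypothetical. It only flags claims with no source and subjects with conflicting values, and does not decide which value is true.

```python
from collections import defaultdict


def check(claims):
    """claims: list of (subject, value, source-or-None) triples.

    Returns (unsourced, conflicts):
      unsourced: (subject, value) pairs that cite no source
      conflicts: subjects for which more than one distinct value was claimed
    """
    unsourced = [(s, v) for s, v, src in claims if not src]
    values = defaultdict(set)
    for s, v, _ in claims:
        values[s].add(v)
    conflicts = {s: sorted(vs) for s, vs in values.items() if len(vs) > 1}
    return unsourced, conflicts


# Illustrative claims only; "textbook" stands in for a real, traceable source.
claims = [
    ("boiling point of water at sea level", "100 C", "textbook"),
    ("boiling point of water at sea level", "90 C", None),
]
unsourced, conflicts = check(claims)
print(unsourced)
print(conflicts)
```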

== Addendum ==

The most important output of the Overstand is the list of steps of what needs to be done.

For example: rather than setting out to do OCR on a handwritten note according to millions of doctors' prescriptions, I would ask it to look at this note and tell me what is needed to get it deciphered.

The first thing would be to gather as much info as possible about the picture (perhaps the image has already been deciphered).

I would need to focus on the written area and find the lines of text. Figure out if it's cursive or separate. Try to see if it's in a Western language, and English in particular.

Get a few of the easier letters and probable words. Now start comparing in the old-fashioned (statistical LLM style) OCR way, while searching for the text and constructing a lexicon of words that would be used. Oh, so it is a doc's prescription. And it's for someone with a heart condition. So that Eli...st is most probably Eliquist, and everything else is clear. The doctor's stamp shows who she was, and we can track that info down, perhaps it will... no need, we already finished the job. You want to know anyway?
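The last step, filling in a half-read word from a lexicon that fits the context, can be sketched like this. The complete function, the "..." convention for unreadable letters, and the two tiny lexicons are assumptions for illustration; the drug name is spelled as in the example above, and the other entries are made up to show that the same fragment resolves differently under a different lexicon.

```python
import re


def complete(fragment, lexicon):
    """Return the lexicon words that fit a partly read word.

    '...' in the fragment stands for one or more unreadable letters.
    """
    parts = [re.escape(p) for p in fragment.split("...")]
    pattern = re.compile(".+".join(parts), re.IGNORECASE)
    return [word for word in lexicon if pattern.fullmatch(word)]


# Hypothetical context lexicons; the entries are illustrative only.
lexicons = {
    "heart condition": ["Eliquist", "Aspirin", "Elixir"],
    "general": ["Elitist", "Elegant"],
}

print(complete("Eli...st", lexicons["heart condition"]))
print(complete("Eli...st", lexicons["general"]))
```

Without the context ("it's for someone with a heart condition") the general lexicon offers a different word for the same scribble, which is why building the lexicon comes before the guessing.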