Opslag

How to create super intelligence

With the advent of thinking models, or test time compute as it is called, AI has reach a milestone. It could probably be called AGI, and continuing refining this path can give very strong models. But it is still not very close to the theoretical maximum of how good these models can get. To get a little close here are six ingredients for achieving super intelligence. The underlying reason why this will probably work can be found in this paper: https://arxiv.org/html/2305.17026v3 "How Powerful are Decoder-Only Transformer Neural Models?" If the transformer architecture is Turing complete (meaning it can run like a computer program, and thus solve any problem), then we can probably also assume that we can have the architecture have Turing like properties, but not run as a program, but as reasoning.  In other words, when the number of layers and parameters rises, we can probably simulate something that is more complex than are circuit (and we already know we can simulate a circui...

Strawberry I-don't-know, and an agent implementation

Reinforcement with 'i-don'know'  OpenAI has just released strawberry/o1, and we are now very close to AGI. It is pretty much using the same technique that I have already outlined in a previous blogpost. Create synthetic data with trains of thought, where each thought is a Q/A. This turns the problem into a reinforcement problem, where the evaluation of each answer can be scored. One crucial thing they unfortunately didn't do, was to mark answers that are wrong as i-don't-know. Then the algorithm would be to train with a small nerual net the first round, find all answers that are wrong, and then train again with a bigger neural net. Scoring wise a wrong answer is worse than saying i-don't-know, which is then worse the correct answer. Since final neural net is bigger than the one that was original trained, it should have a high chance of answering i-don't-know, when it needs to. Agent Having a strong LLM is just one part of AGI. The AGI also needs to be able t...