@BioMan

BioMan@awful.systems · edit-2 1 month ago

I have a vague hypothesis that I am utterly unprepared to make rigorous that the more of what you take into your mind is the result of another human mind, rather than the result of a nonhuman process operating on its own terms, the more likely you are to have mental issues.

On the low end this would include the documented protective effect of natural environments against psychotic episodes compared to urban environments (where EVERYTHING was put there by someone’s idea). But computers… they are amplifiers of things put out by human minds, with very short feedback loops. Everything is ultimately in one way or another defined by a person who put it there, even it is then allowed to act according to the rules you laid down.

And then an LLM is the ultimate distillation of the short feedback loop, feeding back whatever you shovel into it straight back at you. Even just mathematically - the whole ‘transformer’ architecture is just a way to take imputed semantic meanings of tokens early in the stream and jiggling them around to ‘transform’ that information into the later tokens of the stream, no new information is really entering it it is just moving around what you put into it and feeding it back at you in a different form.

EDIT: I also sometimes wonder if this has a mechanistic relation to mode collapse when you train one generative model on output from another, even though nervous systems and ML systems learn in fundamentally different ways (with ML resembling evolution much more than it resembles learning)

BioMan@awful.systems · 1 month ago

Shortly after he goes on the Better Offline podcast

BioMan@awful.systems · 1 month ago

You know what they say. Never interrupt your enemy when they are making a mistake, as this can allow them to continue their errors and lead to their own downfall

BioMan@awful.systems · edit-2 1 month ago

I mean I dunno if any internal numbers are meaningful at all as anything but accounting fictions. But the cost of the falcon 9 to external customers is believable, even if they are potentially subsidized by funding rounds, and impressive. Near as I can tell it comes from accepting trade-offs: they accept low specific impulse and thus declining performance at high velocity for cheap engines, they accept an overpowered oversized upper stage to have only one engine assembly line and to shift some of the burden to the upper stage that optimally would be on the first without reuse, they accept that entering at 2 km/s is way easier than entering at 8 km/s and don’t try to recover the second stage, they accept the steep payload penalty of recovering the first stage. Starship on the other hand tries to brute-force through every trade-off - meaning theyre trying to push their engines through all sanity, the second stage is heavy and bulky and comically oversized, and theyre trying to have a big empty fuel tank be a heat shield which not even the shuttle ever tried.

BioMan@awful.systems · 1 month ago

I mean Starship is a VERY questionable financial decision the way they are running it. The falcon program is another matter. It’s actually remarkable how the two of them are almost diametrically opposed in how they are run.

BioMan@awful.systems · 1 month ago

So.

How much of this is folding all the meme stocks into the one thing that actually produces a product that people all over the world demonstrably want to pay for over the competition to keep their stupid plates spinning?

And how much is religious psychosis, advancing along the singulatarian eschatology from AI to dyson sphere to rebuilding the universe in His Image?

I can’t tell.

BioMan@awful.systems · 1 month ago

Does (deservedly) mercilessly bullying Slopya Nadella actually work?

BioMan@awful.systems · edit-2 3 months ago

The Great Leader himself, on how he avoids going insane during the onging End of the World because among other things that’s not what an intelligent character would do in a story, but you might not be capable of that.

BioMan@awful.systems · 3 months ago

Is it better for these people to be collected in one place under the singularity cult, or dispersed into all the other religions, cults, and conspiracy theories that they would ordinarily be pulled into?

BioMan@awful.systems · 4 months ago

deleted by creator

BioMan@awful.systems · 4 months ago

The long expected collapse of the rationalists out of their flagging cult into ordinary religion and conspiracy theory continues apace.

BioMan@awful.systems · 4 months ago

I look forward to the cultists continuing to update these graphs convinced they are seeing the future of the cosmos as fewer and fewer people pay attention to the fever dreams

BioMan@awful.systems · 4 months ago

Of course. People who have money and don’t need to make money will use it.

BioMan@awful.systems · 4 months ago

I wonder what will happen to all the data-center-specialized hardware when the demand falls through the floor. SOMEONE will buy it, the question is what will people figure out how to use it for despite it not being like ordinary consumer hardware.

BioMan@awful.systems · 4 months ago

Doing a LOT of python. Here’s hoping.

For fun, take a look at this older work from someone else

https://www.nature.com/articles/s41467-021-26568-2

BioMan@awful.systems · 4 months ago

Try “neuroevolution”

BioMan@awful.systems · edit-2 4 months ago

I would say it’s more that the relationship between a text prediction model’s output and real text is precisely mathematically the relationship between a leaf bug and a leaf, down to being made by very different processes, optimized by different forces over their origin, and doing very different things inside.

Trying to force an LLM to produce true statements is like trying to get a leaf bug to photosynthesize. What they do is unrelated to that, they just happen to have been optimized over time to resemble something that does do that as seen by a certain mode of inspection.

BioMan@awful.systems · 4 months ago

Edited to note that I am referring to the trajectory the system takes as it changes during training/learning/evolving.

BioMan@awful.systems · edit-2 4 months ago

There’s some really cool work with running evolution-type algorithms versus gradient descent showing that training a network through gradient descent creates a training ‘trajectory’ (how it changes over time during the training process, in a very high dimensional space) that is basically the ‘average’ central tendency trajectory in the middle of the ‘cloud’ of trajectories that individual replicates of an evolutionary processes create. Of course, something like code is discrete chunks rather than real numbers you can calculate a gradient of, and kind of necessitates such an evolutionary process.

Sorry if I just get super nerdy technical here, I am in the middle of a project at work about the relationship between evolutionary processes and machine learning processes that’s resulting in a lot of very interesting math about the nature of both and the kinds of things that they can learn.

BioMan@awful.systems · edit-2 4 months ago

That’s an indication that the problem is a problem that is not well-served by a neural network. They are useful for approximating highly nonlinear functions with lots of inputs (and will not work well outside the range of inputs that you approximate within), not simple linear systems. The goal of recent ML has been to reduce as many problems to high dimensional highly nonlinear curve fitting as possible, with some great successes (machine translation, image recognition) and some not so great (shhhhh don’t tell the investors!)