Skip to content

Commit 6695f92

Browse files
committed
autosave: 2026-03-11 06:15:32
1 parent 4825c39 commit 6695f92

1 file changed

Lines changed: 76 additions & 38 deletions

File tree

posts/2025-09-13-recursive-self-improvement-explosion.qmd renamed to posts/2025-09-13-recursive-self-improvement-explosion-optimization.qmd

Lines changed: 76 additions & 38 deletions
Original file line numberDiff line numberDiff line change
@@ -117,6 +117,49 @@ We could plot data from a variety of different experiments on this plot, & ask s
117117

118118

119119

120+
# Apple Tree Model / Low Hanging
121+
122+
Apple tree model of ideas.
123+
:
124+
- Some problems, all the fruit are low-hanging.
125+
- You have an orchard, you send in an army of midgets and they.
126+
- You can list out all the insights, some are low, some are
127+
128+
Model.
129+
: - There's a distribution of apples, each has a height.
130+
- Each apple is equally valuable.
131+
- An agent can pick apples up to their height.
132+
- Can represent state with a scalar: the line below which apples are picked.
133+
- There are diminishing returns to effort in picking apples below that line, so not all apples are picked.
134+
135+
Implication: you want tests of *reach* for LLMs.
136+
:
137+
- You don't want to just see whether LLMs can pick apples, but you want to test for their reach.
138+
- Thus you find problems which
139+
140+
Implication: whether they can stand on each others' shoulders.
141+
: -
142+
143+
--------------------------------------------
144+
145+
Apple Tree Optimization
146+
147+
1. LLMs are suddenly able to optimize algorithms pretty well.
148+
149+
2. AI R&D is an algorithm optimization problem -- model training is a big stack of algorithms which we've been optimizing at around 10X/year.
150+
151+
3. It seems like.
152+
153+
--------------------------------------------
154+
155+
LLMs are suddenly able to optimize algorithms pretty well, & so maybe recursive-self-improvement has kicked off.
156+
157+
I think the critical thing is to distinguish between shallow & deep optimizations.
158+
159+
Most models of RSI, as far as I can tell,
160+
161+
--------------------------------------------
162+
120163
# Toy model of AI stack
121164

122165
See also `posts/2026-02-10-model-of-labs.qmd`
@@ -252,12 +295,6 @@ Josh You / Epoch (Oct 2025) ["Most of OpenAI’s 2024 compute went to experiment
252295

253296
## estimates of efficiency growth
254297

255-
| | |
256-
|---|---|
257-
| | |
258-
| | |
259-
260-
261298
Anson Ho (Feb 2026): 10X/year.
262299
:
263300
https://epoch.ai/gradient-updates/the-least-understood-driver-of-ai-progress
@@ -321,7 +358,7 @@ Thompson et al. (2022) ["The Importance of (Exponentially More) Computing Power"
321358

322359

323360

324-
# Models
361+
# Other Peoples' Models
325362

326363
## charts
327364

@@ -488,37 +525,6 @@ Natural processes often have explosive *regions*.
488525
:
489526
You
490527

491-
## metaphors for RSI
492-
493-
Drawing balls from an urn.
494-
:
495-
There's some distribution of values, then you get a very nice expression. The expected value of $N$ draws just depends on the extreme value distribution of $f(v)$. This is exactly Kortum (1997).
496-
497-
A tool factory makes better tools.
498-
:
499-
I think this is Jones' metaphor. *Distinct* from Kortum, because the returns to search now depends on the stock of ideas.
500-
501-
Machine tools and regular tools
502-
:
503-
1. The Dutch make machine tools.
504-
2. The machine tools make regular tools.
505-
3. The regular tools make products.
506-
507-
At some point the machine tools become good enough to make themselves, but it's a discrete jump.
508-
509-
Lego - combining ideas.
510-
:
511-
Weitzmann's recombinant search is like this: you combine ideas to make new ideas, now you have a larger stock of ideas to combine.
512-
513-
Recipes.
514-
:
515-
You try out recipes, which are combinations of prior recipes. @jones2023recipes -- something like a reconciliation of Weitzman & Kortum.
516-
517-
Blacksmith.
518-
:
519-
You have a hammer and you spend time making horseshoes, or working on a new hammer.
520-
521-
A harder hammer both (1) makes horseshoes faster, or (2) makes your hammer still harder.
522528

523529
# References
524530

@@ -603,6 +609,38 @@ Eric Drexler (Aug 2025) ["The Reality of Recursive Self-Improvement"](https://ai
603609

604610
# Overflow / Offcuts
605611

612+
## metaphors for RSI
613+
614+
Drawing balls from an urn.
615+
:
616+
There's some distribution of values, then you get a very nice expression. The expected value of $N$ draws just depends on the extreme value distribution of $f(v)$. This is exactly Kortum (1997).
617+
618+
A tool factory makes better tools.
619+
:
620+
I think this is Jones' metaphor. *Distinct* from Kortum, because the returns to search now depends on the stock of ideas.
621+
622+
Machine tools and regular tools
623+
:
624+
1. The Dutch make machine tools.
625+
2. The machine tools make regular tools.
626+
3. The regular tools make products.
627+
628+
At some point the machine tools become good enough to make themselves, but it's a discrete jump.
629+
630+
Lego - combining ideas.
631+
:
632+
Weitzmann's recombinant search is like this: you combine ideas to make new ideas, now you have a larger stock of ideas to combine.
633+
634+
Recipes.
635+
:
636+
You try out recipes, which are combinations of prior recipes. @jones2023recipes -- something like a reconciliation of Weitzman & Kortum.
637+
638+
Blacksmith.
639+
:
640+
You have a hammer and you spend time making horseshoes, or working on a new hammer.
641+
642+
A harder hammer both (1) makes horseshoes faster, or (2) makes your hammer still harder.
643+
606644

607645

608646
## We need more bottom-up modelling of AI's economic effect [UNFINISHED]

0 commit comments

Comments
 (0)