WEBVTT

1
00:00:01.199 --> 00:00:06.200
<v Speaker 1>Welcome to the Sentient Code, where intelligence is engineered, autonomy

2
00:00:06.280 --> 00:00:10.439
<v Speaker 1>is emerging, and a line between human and machine grows thinner.

3
00:00:10.800 --> 00:00:15.359
<v Speaker 1>Each episode, we decode the algorithms, explore the robotics, and

4
00:00:15.439 --> 00:00:19.000
<v Speaker 1>examine the ideas shaping the future of artificial minds.

5
00:00:23.839 --> 00:00:27.239
<v Speaker 2>Hello, and welcome back. We are looking at something today

6
00:00:27.559 --> 00:00:30.559
<v Speaker 2>that on the surface feels a little bit like a

7
00:00:30.600 --> 00:00:34.320
<v Speaker 2>classic David and Goliath's story. But the more I look

8
00:00:34.359 --> 00:00:38.560
<v Speaker 2>at the research, the more I start to think maybe

9
00:00:38.880 --> 00:00:42.719
<v Speaker 2>David is actually being secretly funded by another Goliath.

10
00:00:42.960 --> 00:00:47.280
<v Speaker 3>That is a surprisingly, surprisingly accurate way to put it.

11
00:00:47.280 --> 00:00:51.119
<v Speaker 3>It's definitely not the fairy tale version people might imagine, right.

12
00:00:51.119 --> 00:00:54.399
<v Speaker 2>It's way more complicated. Yeah, we're talking about what is

13
00:00:54.439 --> 00:00:57.200
<v Speaker 2>and I don't think this is an exaggeration arguably the

14
00:00:57.280 --> 00:01:00.960
<v Speaker 2>defining technological rivalry of our era. It is absolutely talking

15
00:01:00.960 --> 00:01:03.640
<v Speaker 2>about the battle between open source AI and big tech.

16
00:01:03.840 --> 00:01:07.760
<v Speaker 3>It's huge, and honestly, the landscape is shifting so fast

17
00:01:07.799 --> 00:01:09.519
<v Speaker 3>that by the time you're listening to this, the battle

18
00:01:09.519 --> 00:01:10.920
<v Speaker 3>lines might have already moved again.

19
00:01:11.200 --> 00:01:13.359
<v Speaker 2>That's how fast this whole thing is going. And I

20
00:01:13.359 --> 00:01:15.519
<v Speaker 2>think for a lot of people, you know, the narrative

21
00:01:15.560 --> 00:01:18.879
<v Speaker 2>feels pretty simple It's the scrappy underdogs, right, the hackers,

22
00:01:18.879 --> 00:01:22.560
<v Speaker 2>the independent researchers, the global open source community.

23
00:01:22.200 --> 00:01:25.920
<v Speaker 3>Right, the rebels, exactly, the rebels versus the big bad

24
00:01:26.079 --> 00:01:29.439
<v Speaker 3>corporate giants. But looking at the material we have today,

25
00:01:29.799 --> 00:01:32.760
<v Speaker 3>the reality is, well, it's a lot messier.

26
00:01:32.959 --> 00:01:36.319
<v Speaker 2>It's so much messier and frankly, so much more interesting

27
00:01:36.920 --> 00:01:39.959
<v Speaker 2>because this isn't just about who sells the most software

28
00:01:40.120 --> 00:01:43.000
<v Speaker 2>or you know, who has the highest stock price. This

29
00:01:43.079 --> 00:01:46.040
<v Speaker 2>is really about the fundamental structure of power. It's about

30
00:01:46.120 --> 00:01:50.000
<v Speaker 2>who holds the knowledge, who owns the infrastructure, and ultimately

31
00:01:50.120 --> 00:01:51.879
<v Speaker 2>who gets to shape the twenty first century.

32
00:01:51.959 --> 00:01:54.920
<v Speaker 3>That is a heavy way to open the structure of power.

33
00:01:55.000 --> 00:01:56.799
<v Speaker 2>I think it needs to be, because we have to

34
00:01:56.799 --> 00:02:00.480
<v Speaker 2>move past this simple binary idea of open versus closed.

35
00:02:00.719 --> 00:02:03.840
<v Speaker 2>It's not that simple. Both sides are winning in different arenas,

36
00:02:03.840 --> 00:02:06.439
<v Speaker 2>and you know, both sides are facing these huge, almost

37
00:02:06.480 --> 00:02:08.719
<v Speaker 2>existential threats they might not even fully see yet.

38
00:02:08.759 --> 00:02:10.560
<v Speaker 3>Okay, so let's set the board. Then. If we're looking

39
00:02:10.560 --> 00:02:12.759
<v Speaker 3>at this as a kind of conflict or maybe a game,

40
00:02:13.000 --> 00:02:16.719
<v Speaker 3>who are the main players. Let's start with the heavy hitters.

41
00:02:16.680 --> 00:02:19.039
<v Speaker 2>The big tech camp For sure, these are the names

42
00:02:19.080 --> 00:02:21.800
<v Speaker 2>everyone knows, you have open Ai, which is of course

43
00:02:22.159 --> 00:02:25.280
<v Speaker 2>heavily heavily backed by Microsoft the tens of billions of

44
00:02:25.319 --> 00:02:26.759
<v Speaker 2>dollars camp that's a good name for.

45
00:02:26.719 --> 00:02:30.400
<v Speaker 3>Them, exactly. Then you have Google deep Mind, which is

46
00:02:30.639 --> 00:02:34.280
<v Speaker 3>the force behind the Gemini models. You've got Anthropic, which

47
00:02:34.360 --> 00:02:36.840
<v Speaker 3>is a really interesting one because it was founded by

48
00:02:36.919 --> 00:02:41.080
<v Speaker 3>ex OpenAI people, but now it's backed by both Amazon and.

49
00:02:40.960 --> 00:02:42.319
<v Speaker 2>Google at a bit of a tangled web.

50
00:02:42.360 --> 00:02:44.800
<v Speaker 3>There, a very tangled web. And then of course you

51
00:02:44.919 --> 00:02:49.280
<v Speaker 3>have the quiet giant Apple doing its own thing, mostly

52
00:02:49.319 --> 00:02:51.439
<v Speaker 3>focused on on device intelligence.

53
00:02:51.479 --> 00:02:54.560
<v Speaker 2>Okay, so that's the fortress, that's the establishment. And on

54
00:02:54.599 --> 00:02:56.960
<v Speaker 2>the other side, who are the rebels.

55
00:02:56.680 --> 00:02:59.840
<v Speaker 3>The open source ecosystem? And this is where it gets much.

56
00:03:00.000 --> 00:03:04.439
<v Speaker 3>It's more diffuse. It's not one company. It's a global network.

57
00:03:04.759 --> 00:03:07.680
<v Speaker 3>You have incredible startups like Mistral and Friends. You have

58
00:03:07.719 --> 00:03:10.879
<v Speaker 3>academic researchers and universities all over the world. You have hobbyists,

59
00:03:11.080 --> 00:03:14.560
<v Speaker 3>and then you have this massive community coalescing on platforms

60
00:03:14.639 --> 00:03:15.800
<v Speaker 3>like hugging face.

61
00:03:15.960 --> 00:03:18.560
<v Speaker 2>Hugging Face I always love that name. It just sounds

62
00:03:18.599 --> 00:03:21.719
<v Speaker 2>so friendly and innocent for something that is essentially an

63
00:03:21.800 --> 00:03:23.639
<v Speaker 2>arms depot for AI models.

64
00:03:23.800 --> 00:03:26.159
<v Speaker 3>It is a charming name. Isn't it. But the scale

65
00:03:26.719 --> 00:03:30.719
<v Speaker 3>is anything but charming. It's deadly serious. They host hundreds

66
00:03:30.759 --> 00:03:35.120
<v Speaker 3>of thousands of models. We're talking billions of downloads. It is,

67
00:03:35.479 --> 00:03:38.919
<v Speaker 3>for all intents and purposes, the GitHub of the AI revolution.

68
00:03:39.199 --> 00:03:41.639
<v Speaker 2>So we have the giants and we have the swarm,

69
00:03:42.560 --> 00:03:45.719
<v Speaker 2>and the mission for us today is to figure out, well,

70
00:03:45.719 --> 00:03:49.879
<v Speaker 2>who's actually winning, because looking through all the source material,

71
00:03:49.919 --> 00:03:53.960
<v Speaker 2>the answer seems to depend entirely on how you define winning.

72
00:03:54.240 --> 00:03:57.479
<v Speaker 3>Precisely, Winning the performance benchmark is one thing. Winning the

73
00:03:57.599 --> 00:04:01.280
<v Speaker 3>enterprise market is another. Winning the hearts minds of developers

74
00:04:01.280 --> 00:04:03.039
<v Speaker 3>that's a third. They're all different battles.

75
00:04:03.080 --> 00:04:05.479
<v Speaker 2>So let's start with the giants. Let's talk about why

76
00:04:05.560 --> 00:04:08.439
<v Speaker 2>big tech is so dominant. What is this fortress they've built?

77
00:04:08.759 --> 00:04:09.879
<v Speaker 2>What are the walls made of?

78
00:04:10.039 --> 00:04:12.919
<v Speaker 3>The first, and by far the thickest wall is what's

79
00:04:12.960 --> 00:04:14.000
<v Speaker 3>called the compute mote.

80
00:04:14.039 --> 00:04:16.399
<v Speaker 2>The compute mote. Okay, I want to pause on this

81
00:04:16.439 --> 00:04:18.959
<v Speaker 2>because I think when people hear expensive, they think, okay,

82
00:04:19.040 --> 00:04:21.639
<v Speaker 2>like a really nice car expensive, or maybe a house expensive.

83
00:04:22.000 --> 00:04:24.800
<v Speaker 2>But looking at the numbers here, we are talking about

84
00:04:24.879 --> 00:04:26.879
<v Speaker 2>nation state level spending, aren't we.

85
00:04:26.639 --> 00:04:31.079
<v Speaker 3>We really really are to even begin to visualize what

86
00:04:31.120 --> 00:04:35.240
<v Speaker 3>it takes to train a frontier model, the absolute bleeding

87
00:04:35.399 --> 00:04:39.399
<v Speaker 3>edge systems like GPT four, Gemini Ultra. You have to

88
00:04:39.439 --> 00:04:42.000
<v Speaker 3>start with the hardware, right, we aren't talking about the

89
00:04:42.000 --> 00:04:45.720
<v Speaker 3>graphics card in your gaming PC. We're talking about the

90
00:04:45.759 --> 00:04:49.000
<v Speaker 3>top of the line in Vidia H one hundreds, the

91
00:04:49.160 --> 00:04:50.879
<v Speaker 3>AI chips.

92
00:04:50.519 --> 00:04:52.920
<v Speaker 2>The chips that there's a global shortage of the ones

93
00:04:52.959 --> 00:04:54.120
<v Speaker 2>everyone is fighting.

94
00:04:53.800 --> 00:04:57.000
<v Speaker 3>Over the very same These chips are, you know, almost

95
00:04:57.000 --> 00:05:00.000
<v Speaker 3>a strategic national resource at this point. A single one

96
00:05:00.000 --> 00:05:01.879
<v Speaker 3>one of these costs as much as a luxury car.

97
00:05:02.000 --> 00:05:02.759
<v Speaker 2>A single chip.

98
00:05:02.879 --> 00:05:05.240
<v Speaker 3>Yes, now imagine you need to buy it, not one,

99
00:05:05.480 --> 00:05:08.600
<v Speaker 3>but maybe twenty five thousand of them, twenty five thousand,

100
00:05:08.720 --> 00:05:11.199
<v Speaker 3>and you have to tie them all together with specialized

101
00:05:11.240 --> 00:05:14.720
<v Speaker 3>high speed networking cabling that costs more than most people's houses.

102
00:05:15.199 --> 00:05:17.639
<v Speaker 3>You stick all that in a massive, custom built data

103
00:05:17.639 --> 00:05:21.000
<v Speaker 3>center and you run it at one hundred percent capacity

104
00:05:21.079 --> 00:05:22.040
<v Speaker 3>for ninety.

105
00:05:21.759 --> 00:05:23.920
<v Speaker 2>Days straight, ninety days, NonStop.

106
00:05:23.519 --> 00:05:25.639
<v Speaker 3>NonStop, twenty four to seven. And this is where the

107
00:05:25.680 --> 00:05:28.240
<v Speaker 3>moat gets very, very physical. It's not just about buying

108
00:05:28.240 --> 00:05:30.120
<v Speaker 3>the chips. It's about the physics, the heat.

109
00:05:30.360 --> 00:05:32.240
<v Speaker 2>I saw a note here about thermal densities.

110
00:05:32.240 --> 00:05:34.040
<v Speaker 3>That what you mean, that's exactly.

111
00:05:34.079 --> 00:05:34.199
<v Speaker 2>It.

112
00:05:34.240 --> 00:05:36.319
<v Speaker 3>If you put that much compute in one room, the

113
00:05:36.360 --> 00:05:39.160
<v Speaker 3>air doesn't just get hot, it turns into a blast furnace.

114
00:05:39.480 --> 00:05:43.199
<v Speaker 3>The racks would literally melt. So you need these incredible

115
00:05:43.600 --> 00:05:48.399
<v Speaker 3>industrial scale liquid cooling systems, pipes of chilled water running

116
00:05:48.680 --> 00:05:50.759
<v Speaker 3>to every single server rack.

117
00:05:51.040 --> 00:05:52.879
<v Speaker 2>So it's not just a server room. It's a power

118
00:05:52.920 --> 00:05:55.720
<v Speaker 2>plant and a plumbing project it is, and you.

119
00:05:55.759 --> 00:05:58.600
<v Speaker 3>Need access to a power grid that can handle the

120
00:05:58.680 --> 00:06:02.439
<v Speaker 3>load of a small city. We're talking hundreds of megawatts

121
00:06:02.480 --> 00:06:06.639
<v Speaker 3>of sustained power. The electricity bill alone for a single

122
00:06:06.720 --> 00:06:09.879
<v Speaker 3>training run, just the electricity can be tens of millions

123
00:06:09.920 --> 00:06:10.399
<v Speaker 3>of dollars.

124
00:06:10.439 --> 00:06:12.120
<v Speaker 2>Get to wrap my head around that. So when we

125
00:06:12.160 --> 00:06:14.199
<v Speaker 2>say the kid in the garage is locked out of

126
00:06:14.199 --> 00:06:16.800
<v Speaker 2>this game, it's not because the GID isn't smart enough.

127
00:06:16.959 --> 00:06:19.879
<v Speaker 2>It's because the garage would literally melt and the entire

128
00:06:19.920 --> 00:06:21.879
<v Speaker 2>neighborhood grid would blow a transformer.

129
00:06:21.920 --> 00:06:24.600
<v Speaker 3>The garage would melt, the transformer would blow, and they'd

130
00:06:24.600 --> 00:06:28.079
<v Speaker 3>get a bill for ten million dollars. So, yes, this

131
00:06:28.240 --> 00:06:30.959
<v Speaker 3>is why the list of companies that can actually train

132
00:06:31.040 --> 00:06:34.759
<v Speaker 3>a Frontier model from scratch is so incredibly short. It's

133
00:06:34.800 --> 00:06:40.079
<v Speaker 3>basically Microsoft, Google, Meta, and Amazon. The barrier to entry

134
00:06:40.199 --> 00:06:43.160
<v Speaker 3>is a capital expenditure of billions of dollars before you've

135
00:06:43.199 --> 00:06:45.279
<v Speaker 3>written a single line of useful code.

136
00:06:45.439 --> 00:06:48.959
<v Speaker 2>You use the analogy of a really talented carpenter in

137
00:06:49.000 --> 00:06:52.759
<v Speaker 2>their backyard workshop versus a massive industrial factory that spans

138
00:06:52.800 --> 00:06:54.920
<v Speaker 2>ten city blocks. It feels like that.

139
00:06:54.720 --> 00:06:56.639
<v Speaker 3>That's a perfect analogy, and it doesn't even stop at

140
00:06:56.639 --> 00:06:59.720
<v Speaker 3>the hardware. It's also the talent the people. Yes, the

141
00:06:59.759 --> 00:07:03.160
<v Speaker 3>huge expertise required to orchestrate one of these massive training

142
00:07:03.199 --> 00:07:06.920
<v Speaker 3>runs is incredibly rare. It's a kind of dark art.

143
00:07:07.160 --> 00:07:08.199
<v Speaker 2>What do you mean by that.

144
00:07:08.360 --> 00:07:10.920
<v Speaker 3>Well, when you have twenty thousand GPUs all trying to

145
00:07:10.959 --> 00:07:14.600
<v Speaker 3>work in perfect parallel, things break all the time. Mysterious

146
00:07:14.639 --> 00:07:15.959
<v Speaker 3>failures just happen.

147
00:07:15.920 --> 00:07:18.920
<v Speaker 2>Like a cable comes loose or a network switch fails.

148
00:07:19.199 --> 00:07:21.560
<v Speaker 3>It can be that simple, or it can be much weirder.

149
00:07:22.240 --> 00:07:25.839
<v Speaker 3>A few GPUs might just start overheating and produce infinitesimally

150
00:07:25.879 --> 00:07:29.800
<v Speaker 3>small errors in their calculations bad math. Now, in a

151
00:07:29.839 --> 00:07:33.319
<v Speaker 3>normal computer program, you might get an error message it crashes.

152
00:07:34.079 --> 00:07:37.360
<v Speaker 3>In a training run like this, those tiny errors can

153
00:07:37.399 --> 00:07:40.639
<v Speaker 3>act like a poison. They can slowly destabilize the entire

154
00:07:40.639 --> 00:07:45.000
<v Speaker 3>mathematical process. You might lose days of progress, or even worse,

155
00:07:45.319 --> 00:07:48.639
<v Speaker 3>the model might end up subtly brain damaged in a

156
00:07:48.639 --> 00:07:51.040
<v Speaker 3>way you don't even detect until weeks later when it

157
00:07:51.040 --> 00:07:52.639
<v Speaker 3>starts giving bizarre answers.

158
00:07:52.720 --> 00:07:56.319
<v Speaker 2>So the engineers running these things, yeah, they aren't just coders.

159
00:07:56.319 --> 00:08:01.639
<v Speaker 2>They're more like nuclear reactor operators staring at dials trying

160
00:08:01.680 --> 00:08:02.720
<v Speaker 2>to prevent a meltdown.

161
00:08:02.920 --> 00:08:05.920
<v Speaker 3>They are absolutely high stakes problem solvers. They are staring

162
00:08:05.959 --> 00:08:09.360
<v Speaker 3>at hundreds of graphs looking for tiny, almost imperceptible anomalies

163
00:08:09.360 --> 00:08:12.439
<v Speaker 3>that suggest that AI's brain is getting sick. That level

164
00:08:12.439 --> 00:08:15.240
<v Speaker 3>of operational expertise is a huge part of the moat.

165
00:08:15.560 --> 00:08:17.319
<v Speaker 3>You can't just buy the chips and plug them in.

166
00:08:17.600 --> 00:08:19.639
<v Speaker 3>You need the priesthood of engineers who know how to

167
00:08:19.720 --> 00:08:20.519
<v Speaker 3>keep the beasts alive.

168
00:08:20.639 --> 00:08:23.560
<v Speaker 2>And I'm guessing those people are very, very expensive.

169
00:08:23.319 --> 00:08:26.120
<v Speaker 3>Extremely and they almost all work for the giants.

170
00:08:26.399 --> 00:08:28.360
<v Speaker 2>Okay, so that's the hardware in the talent. The first

171
00:08:28.399 --> 00:08:31.319
<v Speaker 2>wall of the fortress is thick. But there's another piece, right,

172
00:08:32.200 --> 00:08:33.240
<v Speaker 2>the data.

173
00:08:32.840 --> 00:08:35.399
<v Speaker 3>The data advantage. And this is where it gets a

174
00:08:35.399 --> 00:08:37.919
<v Speaker 3>lot more subtle, but maybe even more.

175
00:08:37.759 --> 00:08:40.600
<v Speaker 2>Powerful, because I think most people's first thought is, well,

176
00:08:40.600 --> 00:08:43.720
<v Speaker 2>the Internet is open, right, can anyway, just scrape the

177
00:08:43.720 --> 00:08:45.600
<v Speaker 2>web and get the same data you can.

178
00:08:46.320 --> 00:08:49.799
<v Speaker 3>And the open source models do rely heavily on public

179
00:08:49.919 --> 00:08:53.080
<v Speaker 3>data sets like the common crawl, but big tech has

180
00:08:53.159 --> 00:08:57.720
<v Speaker 3>access to vast oceans of proprietary data that is completely

181
00:08:57.799 --> 00:09:00.720
<v Speaker 3>invisible to the public web, like what for exams. Okay,

182
00:09:00.759 --> 00:09:03.600
<v Speaker 3>just think about Google. They have over two decades of

183
00:09:03.720 --> 00:09:06.200
<v Speaker 3>indexed search queries. They don't just have the text of

184
00:09:06.240 --> 00:09:08.960
<v Speaker 3>the Internet. They know what billions of people are looking for,

185
00:09:09.080 --> 00:09:12.799
<v Speaker 3>how they ferze their questions, and critically, what results they

186
00:09:12.799 --> 00:09:13.480
<v Speaker 3>actually click on.

187
00:09:13.759 --> 00:09:16.159
<v Speaker 2>Right, they don't just see the words, they see the

188
00:09:16.240 --> 00:09:18.000
<v Speaker 2>intent behind the words.

189
00:09:18.080 --> 00:09:22.039
<v Speaker 3>Precisely the intent. Think about Microsoft. They have deep integration

190
00:09:22.120 --> 00:09:25.799
<v Speaker 3>into enterprise workflows. They'll see how businesses write emails and outlook,

191
00:09:25.840 --> 00:09:29.039
<v Speaker 3>how they structure documents and word how they build presentations

192
00:09:29.080 --> 00:09:33.120
<v Speaker 3>in PowerPoint. That's a huge unique data set about professional

193
00:09:33.159 --> 00:09:38.279
<v Speaker 3>communication and meta the social graph decades of behavioral data,

194
00:09:38.320 --> 00:09:40.679
<v Speaker 3>how people interact, what they like, what they share, the

195
00:09:40.759 --> 00:09:43.879
<v Speaker 3>nuances of casual human to human conversation. And then you

196
00:09:43.879 --> 00:09:48.039
<v Speaker 3>have Apple with behavioral patterns from millions of devices, understanding

197
00:09:48.080 --> 00:09:49.279
<v Speaker 3>how people use apps.

198
00:09:49.399 --> 00:09:51.840
<v Speaker 2>It seems like the difference between having a library of

199
00:09:51.919 --> 00:09:54.639
<v Speaker 2>every book ever written, which is the public web, and

200
00:09:54.720 --> 00:09:57.879
<v Speaker 2>having that same library plus a secret video recording of

201
00:09:57.960 --> 00:10:00.519
<v Speaker 2>every single person who ever read a book, yeah, showing

202
00:10:00.840 --> 00:10:03.759
<v Speaker 2>what pages they lingered on, what they highlighted, what made

203
00:10:03.759 --> 00:10:04.200
<v Speaker 2>them laugh.

204
00:10:04.360 --> 00:10:07.399
<v Speaker 3>That is a profound and perfect distinction. It's the quality

205
00:10:07.480 --> 00:10:10.679
<v Speaker 3>versus quantity aspect. But there's an even bigger data advantage

206
00:10:10.679 --> 00:10:12.679
<v Speaker 3>that builds on top of that, the feedback loop.

207
00:10:12.840 --> 00:10:14.919
<v Speaker 2>Explain that, what do you mean by feedback loop?

208
00:10:15.039 --> 00:10:18.039
<v Speaker 3>Big tech sees how users interact with their finished models

209
00:10:18.080 --> 00:10:20.759
<v Speaker 3>in real time. Every time you use chat GPT and

210
00:10:20.799 --> 00:10:24.080
<v Speaker 3>you don't like an answer, so you hit regenerate response.

211
00:10:24.600 --> 00:10:26.200
<v Speaker 2>Open ai sees that they log that.

212
00:10:26.480 --> 00:10:29.039
<v Speaker 3>They log it as a failure case every time you

213
00:10:29.080 --> 00:10:32.639
<v Speaker 3>give a thumbs down to a response from Gemini. Google's

214
00:10:32.639 --> 00:10:36.360
<v Speaker 3>reinforcement learning systems take note. They have this incredibly tight

215
00:10:36.399 --> 00:10:40.159
<v Speaker 3>feedback loop with hundreds of millions of users constantly telling

216
00:10:40.159 --> 00:10:43.879
<v Speaker 3>them what works and what doesn't. They are perpetually refining

217
00:10:43.919 --> 00:10:47.759
<v Speaker 3>the model based on massive real world.

218
00:10:47.480 --> 00:10:50.039
<v Speaker 2>Usage, and the open source community doesn't have that same

219
00:10:50.120 --> 00:10:51.240
<v Speaker 2>direct line, not.

220
00:10:51.240 --> 00:10:53.320
<v Speaker 3>In the same way. Now, when you download a model

221
00:10:53.320 --> 00:10:55.440
<v Speaker 3>from hugging face and run it on your own laptop,

222
00:10:55.519 --> 00:10:57.799
<v Speaker 3>the developers of that model have no idea what you're doing.

223
00:10:57.799 --> 00:10:59.600
<v Speaker 3>They don't get that data back, because that's the whole.

224
00:10:59.480 --> 00:11:01.759
<v Speaker 2>Point, right privacy. You run it locally so they can't

225
00:11:01.759 --> 00:11:03.000
<v Speaker 2>see your data exactly.

226
00:11:03.039 --> 00:11:05.360
<v Speaker 3>It's a feature, not a bug. But the cost of

227
00:11:05.360 --> 00:11:09.879
<v Speaker 3>that privacy is that the model creators lose that invaluable telemetry.

228
00:11:10.360 --> 00:11:13.840
<v Speaker 3>They're flying blind compared to Google or open ai, who

229
00:11:13.879 --> 00:11:16.000
<v Speaker 3>have a god's eye view of how their creations are

230
00:11:16.000 --> 00:11:17.519
<v Speaker 3>performing in the wild, so.

231
00:11:17.960 --> 00:11:22.440
<v Speaker 2>Hearing all of that, the billions in compute, the rare talent,

232
00:11:22.720 --> 00:11:27.000
<v Speaker 2>the proprietary data, the real time feedback loops, it honestly

233
00:11:27.039 --> 00:11:30.799
<v Speaker 2>feels like game over. How can open source possibly compete?

234
00:11:31.279 --> 00:11:33.679
<v Speaker 2>It feels like you're bringing a well crafted knife to

235
00:11:33.720 --> 00:11:34.879
<v Speaker 2>a thermonuclear war.

236
00:11:35.120 --> 00:11:38.559
<v Speaker 3>It absolutely looks that way on paper. The structural disadvantages

237
00:11:38.600 --> 00:11:41.440
<v Speaker 3>are immense. But this is where the rebellion gets really

238
00:11:41.480 --> 00:11:44.799
<v Speaker 3>really interesting, because despite all of that, open source is

239
00:11:44.840 --> 00:11:48.240
<v Speaker 3>not only surviving in some very important areas, it's actually thriving.

240
00:11:48.399 --> 00:11:50.240
<v Speaker 2>Oh how is that even possible? Is it just the

241
00:11:50.279 --> 00:11:51.679
<v Speaker 2>sheer number of people working on it.

242
00:11:51.720 --> 00:11:54.200
<v Speaker 3>That's a big part of it. But the main technical driver,

243
00:11:54.440 --> 00:11:57.679
<v Speaker 3>the core reason they can compete, is a relentless focus

244
00:11:57.759 --> 00:12:01.720
<v Speaker 3>on efficiency efficiency. The open source community, precisely because they

245
00:12:01.720 --> 00:12:05.200
<v Speaker 3>don't have unlimited resources, has become world class at doing

246
00:12:05.240 --> 00:12:06.679
<v Speaker 3>more with less necessity.

247
00:12:06.720 --> 00:12:08.759
<v Speaker 2>Is the mother of invention, as they say.

248
00:12:08.519 --> 00:12:11.679
<v Speaker 3>It's the perfect embodiment of that saying, when you don't

249
00:12:11.679 --> 00:12:14.240
<v Speaker 3>have a billion dollars for your next training run, you

250
00:12:14.360 --> 00:12:17.840
<v Speaker 3>have to get incredibly clever. And they've developed these amazing

251
00:12:17.879 --> 00:12:19.320
<v Speaker 3>techniques like quantization.

252
00:12:19.759 --> 00:12:22.799
<v Speaker 2>Okay, I keep seeing this word in the research quantization.

253
00:12:23.720 --> 00:12:25.919
<v Speaker 2>It sounds like something out of a sci fi movie,

254
00:12:26.360 --> 00:12:30.480
<v Speaker 2>but it seems absolutely critical to the open source survival guide.

255
00:12:30.799 --> 00:12:32.600
<v Speaker 2>What is actually happening there.

256
00:12:32.559 --> 00:12:34.840
<v Speaker 3>It is absolutely critical. The best way to think of

257
00:12:34.879 --> 00:12:37.240
<v Speaker 3>it as like image compression. You know how a raw

258
00:12:37.279 --> 00:12:40.399
<v Speaker 3>photo file from a professional camera can be huge, maybe

259
00:12:40.440 --> 00:12:41.399
<v Speaker 3>fifty megabytes.

260
00:12:41.440 --> 00:12:43.720
<v Speaker 2>Sure, it has all the raw sensor data, every bit of.

261
00:12:43.679 --> 00:12:46.600
<v Speaker 3>Information exactly, But if you convert that photo to a

262
00:12:46.679 --> 00:12:50.559
<v Speaker 3>JPIG it shrinks down to maybe two or three megabytes.

263
00:12:51.000 --> 00:12:53.799
<v Speaker 3>You lose a tiny, tiny bit of detail. Maybe if

264
00:12:53.840 --> 00:12:56.240
<v Speaker 3>you zoom in one thousand percent, the shadows aren't quite

265
00:12:56.240 --> 00:12:59.200
<v Speaker 3>as perfect, But to the human eye the picture looks

266
00:12:59.279 --> 00:13:00.879
<v Speaker 3>basically identicle and it.

267
00:13:00.799 --> 00:13:03.120
<v Speaker 2>Loads way faster. In a website and takes up a

268
00:13:03.159 --> 00:13:04.519
<v Speaker 2>fraction of the space on your hard drive.

269
00:13:04.559 --> 00:13:08.200
<v Speaker 3>Precisely, quantization is basically doing that same trick to the

270
00:13:08.200 --> 00:13:12.559
<v Speaker 3>AI's brain. These huge models are usually stored with incredibly

271
00:13:12.679 --> 00:13:15.759
<v Speaker 3>high precision numbers. Think of every connection in the neural

272
00:13:15.799 --> 00:13:20.080
<v Speaker 3>network having a weight that's like sixteen decimal places of accuracy,

273
00:13:20.159 --> 00:13:20.799
<v Speaker 3>which takes.

274
00:13:20.679 --> 00:13:24.519
<v Speaker 2>Up a ton of computer memory and processing power massive amounts.

275
00:13:25.320 --> 00:13:28.799
<v Speaker 3>Quantization is a process that basically says, hey, do we

276
00:13:28.919 --> 00:13:32.519
<v Speaker 3>really need sixteen decimal places of precision? What if we

277
00:13:32.600 --> 00:13:34.679
<v Speaker 3>just round it down to four or even two?

278
00:13:35.159 --> 00:13:37.519
<v Speaker 2>And the model doesn't just get stupid when you do that.

279
00:13:37.519 --> 00:13:40.879
<v Speaker 3>That's the magic. It turns out for many of these models,

280
00:13:40.879 --> 00:13:44.080
<v Speaker 3>you can round down those numbers very aggressively and the

281
00:13:44.120 --> 00:13:47.639
<v Speaker 3>model only loses a tiny fraction of its overall intelligence.

282
00:13:48.159 --> 00:13:51.919
<v Speaker 3>But suddenly a model that required a forty thousand dollars

283
00:13:51.919 --> 00:13:54.440
<v Speaker 3>server with a ton of RAM can now run on

284
00:13:54.480 --> 00:13:57.159
<v Speaker 3>a two thousand dollars gaming laptop.

285
00:13:57.600 --> 00:14:00.279
<v Speaker 2>So they are literally shrinking the giant's brain so it

286
00:14:00.320 --> 00:14:01.919
<v Speaker 2>fits inside a regular person's head.

287
00:14:02.039 --> 00:14:04.240
<v Speaker 3>That is exactly what they're doing, and that's not the

288
00:14:04.240 --> 00:14:07.120
<v Speaker 3>only trick. Then there's something called distillation, which is think

289
00:14:07.120 --> 00:14:10.440
<v Speaker 3>of it like a teacher student relationship. You take a huge, smart,

290
00:14:10.480 --> 00:14:13.240
<v Speaker 3>expensive model like GPT four and you use it to

291
00:14:13.279 --> 00:14:15.159
<v Speaker 3>teach much smaller, cheaper model.

292
00:14:15.519 --> 00:14:17.440
<v Speaker 2>How does that actually work in practice?

293
00:14:17.519 --> 00:14:20.039
<v Speaker 3>You can, for instance, ask the big teacher model to

294
00:14:20.159 --> 00:14:23.480
<v Speaker 3>generate thousands of perfect answers to questions on a specific

295
00:14:23.559 --> 00:14:27.159
<v Speaker 3>topics a customer service. Then you take those thousands of

296
00:14:27.159 --> 00:14:29.919
<v Speaker 3>perfect question answer pairs and you use them as the

297
00:14:30.000 --> 00:14:32.480
<v Speaker 3>training data for the small student model.

298
00:14:32.639 --> 00:14:36.000
<v Speaker 2>Ah, so the student learns from the master's work exactly.

299
00:14:36.559 --> 00:14:39.279
<v Speaker 3>The student model might not know everything the professor knows

300
00:14:39.320 --> 00:14:42.639
<v Speaker 3>about physics and poetry, but for that one specific task

301
00:14:42.679 --> 00:14:45.440
<v Speaker 3>of customer service, it can get ninety percent of the

302
00:14:45.480 --> 00:14:47.879
<v Speaker 3>way there for one percent of the cost to run.

303
00:14:47.960 --> 00:14:50.279
<v Speaker 2>That's a huge leverage point. It's like you're borrowing the

304
00:14:50.320 --> 00:14:52.840
<v Speaker 2>intelligence of the giant to train your own litle army.

305
00:14:53.200 --> 00:14:56.799
<v Speaker 3>It is and because of techniques like this, the performance

306
00:14:56.840 --> 00:15:00.399
<v Speaker 3>gap between the massive closed models and the nimble open

307
00:15:00.440 --> 00:15:03.919
<v Speaker 3>models has narrowed again and again. The community is just

308
00:15:04.000 --> 00:15:05.600
<v Speaker 3>relentless about optimization.

309
00:15:06.080 --> 00:15:07.919
<v Speaker 2>And this all leads to what one of our sources

310
00:15:07.919 --> 00:15:11.840
<v Speaker 2>calls the deployment victory. I found this concept fascinating. The

311
00:15:11.879 --> 00:15:14.799
<v Speaker 2>idea that winning isn't just about having the single smartest

312
00:15:14.799 --> 00:15:17.720
<v Speaker 2>brain in a lab somewhere, but about being the one

313
00:15:17.720 --> 00:15:19.879
<v Speaker 2>that actually gets used out in the real world.

314
00:15:20.320 --> 00:15:24.039
<v Speaker 3>This is such a critical distinction. There's a massive difference

315
00:15:24.080 --> 00:15:28.559
<v Speaker 3>between the frontier, the absolute smartest, most capable model possible,

316
00:15:29.120 --> 00:15:33.919
<v Speaker 3>and deployment what a company actually feels safe and comfortable

317
00:15:33.919 --> 00:15:36.960
<v Speaker 3>putting into their production systems. Let's say you're a healthcare

318
00:15:37.000 --> 00:15:41.200
<v Speaker 3>company you're handling sensitive patient records, or you're a bank

319
00:15:41.639 --> 00:15:44.799
<v Speaker 3>and you're handling financial data. Are you really going to

320
00:15:44.840 --> 00:15:48.080
<v Speaker 3>send all of that incredibly sensitive private data over the

321
00:15:48.120 --> 00:15:51.720
<v Speaker 3>Internet to a third party API owned by big tech?

322
00:15:52.159 --> 00:15:54.759
<v Speaker 2>Probably not, if you can avoid it. Your compliance department

323
00:15:54.759 --> 00:15:56.559
<v Speaker 2>would have a heart attack. You want to keep that

324
00:15:56.559 --> 00:15:59.000
<v Speaker 2>stuff locked down on your own servers. You definitely don't

325
00:15:59.039 --> 00:16:03.159
<v Speaker 2>want opening ISIS and IS potentially reading your patient files exactly.

326
00:16:03.200 --> 00:16:05.399
<v Speaker 3>You want to own the an entire stack. You need privacy,

327
00:16:05.440 --> 00:16:07.360
<v Speaker 3>You demand security and control.

328
00:16:07.360 --> 00:16:09.360
<v Speaker 2>I would imagine total control.

329
00:16:09.120 --> 00:16:11.919
<v Speaker 3>Massive control. If you build your entire product on top

330
00:16:11.960 --> 00:16:15.279
<v Speaker 3>of a proprietary model from a single company, you are

331
00:16:15.320 --> 00:16:18.879
<v Speaker 3>taking on an enormous strategic risk. What happens if they

332
00:16:18.919 --> 00:16:20.720
<v Speaker 3>decide to double the pricing next year?

333
00:16:20.840 --> 00:16:21.360
<v Speaker 2>You're stuck.

334
00:16:21.559 --> 00:16:24.480
<v Speaker 3>What if they change their roadmap and deprecate the version

335
00:16:24.519 --> 00:16:26.519
<v Speaker 3>of the model you rely on. What if they go

336
00:16:26.600 --> 00:16:28.200
<v Speaker 3>out of business or get acquired.

337
00:16:28.440 --> 00:16:32.320
<v Speaker 2>You're building your entire house on land that you're just renting.

338
00:16:32.399 --> 00:16:33.840
<v Speaker 2>They could evict you at any time.

339
00:16:34.159 --> 00:16:38.879
<v Speaker 3>Precisely, open source offers stability and sovereignty. You download the model,

340
00:16:38.879 --> 00:16:41.919
<v Speaker 3>weights there are yours. You run it on your own infrastructure,

341
00:16:41.960 --> 00:16:44.559
<v Speaker 3>behind your own firewall. No one can ever take it

342
00:16:44.600 --> 00:16:45.120
<v Speaker 3>away from you.

343
00:16:45.600 --> 00:16:49.519
<v Speaker 2>So for the enterprise, you know, the big serious companies

344
00:16:49.559 --> 00:16:52.519
<v Speaker 2>that are actually paying to implement this stuff in the

345
00:16:52.559 --> 00:16:56.120
<v Speaker 2>real world, open source is often the better strategic bet.

346
00:16:56.200 --> 00:16:59.480
<v Speaker 3>For many regulated industries. It's often the only acceptable option.

347
00:17:00.159 --> 00:17:02.519
<v Speaker 2>There's another factor here too, which is just the sheer

348
00:17:02.559 --> 00:17:06.160
<v Speaker 2>speed of innovation, the swarm intelligence. I want to dig

349
00:17:06.200 --> 00:17:07.799
<v Speaker 2>a little deeper into that. Can you give me a

350
00:17:07.839 --> 00:17:10.680
<v Speaker 2>concrete example of a time the open source community just

351
00:17:11.160 --> 00:17:12.880
<v Speaker 2>completely outran the giants? Oh?

352
00:17:12.920 --> 00:17:15.920
<v Speaker 3>Absolutely. The perfect example is the rise of mixture of

353
00:17:15.960 --> 00:17:17.920
<v Speaker 3>experts architecture or MOE.

354
00:17:18.559 --> 00:17:20.640
<v Speaker 2>Sounds fancy, but what is it? In simple terms?

355
00:17:20.839 --> 00:17:23.400
<v Speaker 3>Okay, So, a traditional model is what we call dense.

356
00:17:23.880 --> 00:17:27.839
<v Speaker 3>It's one single giant brain. Every time you ask it

357
00:17:27.880 --> 00:17:30.680
<v Speaker 3>a question, the entire brain has to light up and think.

358
00:17:30.519 --> 00:17:32.799
<v Speaker 2>About it, which is computationally expensive.

359
00:17:33.079 --> 00:17:36.960
<v Speaker 3>Very and MOE model is different. Instead of one giant brain,

360
00:17:37.359 --> 00:17:41.119
<v Speaker 3>the model is made up of many smaller expert submodels.

361
00:17:41.200 --> 00:17:44.039
<v Speaker 3>Maybe you have eight different experts, and when you ask

362
00:17:44.079 --> 00:17:46.960
<v Speaker 3>a question, there's a tiny router network at the front

363
00:17:46.960 --> 00:17:49.720
<v Speaker 3>that decides which one or two experts are best suited

364
00:17:49.759 --> 00:17:50.599
<v Speaker 3>to answer it.

365
00:17:50.519 --> 00:17:53.000
<v Speaker 2>Like a receptionist at a big hospital directing calls.

366
00:17:53.119 --> 00:17:55.160
<v Speaker 3>Exactly like that. Oh you have a coding question, I'll

367
00:17:55.160 --> 00:17:56.759
<v Speaker 3>send you to the python expert. You have a question

368
00:17:56.799 --> 00:17:59.039
<v Speaker 3>about ancient Rome, let's talk to the history expert.

369
00:17:59.119 --> 00:17:59.960
<v Speaker 2>And why is that better?

370
00:18:00.480 --> 00:18:04.200
<v Speaker 3>It's massively more efficient for any given thought. You're only

371
00:18:04.240 --> 00:18:06.880
<v Speaker 3>activating a small fraction of the total brain, so it's

372
00:18:06.960 --> 00:18:09.680
<v Speaker 3>much faster and much much cheaper to run, but you

373
00:18:09.720 --> 00:18:12.119
<v Speaker 3>still get the benefit of the combined knowledge of all

374
00:18:12.160 --> 00:18:12.759
<v Speaker 3>the experts.

375
00:18:12.799 --> 00:18:17.440
<v Speaker 2>Okay, I get it. So how did the swarm outrun

376
00:18:17.480 --> 00:18:18.319
<v Speaker 2>the giants on this?

377
00:18:18.720 --> 00:18:21.519
<v Speaker 3>Well? Big tech has been using this technique internally for

378
00:18:21.559 --> 00:18:24.799
<v Speaker 3>a while, but when the open source company Mistral released

379
00:18:24.839 --> 00:18:29.200
<v Speaker 3>their first MOEE model, the community just exploded. They dissected it,

380
00:18:29.240 --> 00:18:31.960
<v Speaker 3>they understood it, and they started building their own improved

381
00:18:32.039 --> 00:18:36.240
<v Speaker 3>variants instantly. Within weeks, we had people figuring out clever

382
00:18:36.440 --> 00:18:41.960
<v Speaker 3>hacks to run these incredibly complex architectures on consumer grade hardware, on.

383
00:18:42.000 --> 00:18:48.160
<v Speaker 2>MacBooks, a laptop, a model with multiple expert brains. How

384
00:18:48.240 --> 00:18:49.279
<v Speaker 2>is that even possible.

385
00:18:49.400 --> 00:18:53.079
<v Speaker 3>It's possible because the community became obsessed with optimization. They

386
00:18:53.119 --> 00:18:56.319
<v Speaker 3>found these brilliant ways to load the different experts in

387
00:18:56.359 --> 00:18:58.759
<v Speaker 3>and out of your computer's memory so fast that you

388
00:18:58.799 --> 00:19:01.319
<v Speaker 3>could run a model that on paper should never be

389
00:19:01.359 --> 00:19:02.599
<v Speaker 3>able to fit on your machine.

390
00:19:02.599 --> 00:19:04.559
<v Speaker 2>As that classic hacker spirit I love.

391
00:19:04.759 --> 00:19:05.000
<v Speaker 3>Yeah.

392
00:19:05.000 --> 00:19:07.599
<v Speaker 2>Big text answer is buy a bigger, more expensive server.

393
00:19:07.720 --> 00:19:10.079
<v Speaker 2>The open source answer is no, let's rewrite the code

394
00:19:10.119 --> 00:19:11.559
<v Speaker 2>to make it fit on the server we already have.

395
00:19:11.839 --> 00:19:14.200
<v Speaker 3>That is the fundamental cultural difference right there.

396
00:19:14.240 --> 00:19:17.000
<v Speaker 2>And look at the timeline in a big tech company.

397
00:19:17.079 --> 00:19:19.480
<v Speaker 2>If a researcher has a brilliant new idea like.

398
00:19:19.440 --> 00:19:21.759
<v Speaker 3>That, it has to be written up in a proposal,

399
00:19:21.920 --> 00:19:24.519
<v Speaker 3>It has to go through a committee review, it has

400
00:19:24.559 --> 00:19:28.640
<v Speaker 3>to get prioritized and put into the next quarters product roadmap.

401
00:19:29.200 --> 00:19:32.039
<v Speaker 3>It can take months or even years to see the

402
00:19:32.119 --> 00:19:32.680
<v Speaker 3>light of day.

403
00:19:32.839 --> 00:19:34.119
<v Speaker 2>And the open source world.

404
00:19:34.799 --> 00:19:37.880
<v Speaker 3>Someone reads a new academic paper on a Tuesday morning.

405
00:19:38.599 --> 00:19:42.519
<v Speaker 3>By Tuesday night, some brilliant person has a working implementation

406
00:19:42.599 --> 00:19:45.920
<v Speaker 3>of it on GitHub. By Wednesday, someone else has forked

407
00:19:45.920 --> 00:19:49.079
<v Speaker 3>it and made it ten percent faster. By Friday, it's

408
00:19:49.119 --> 00:19:52.599
<v Speaker 3>been integrated into the main community tools and is available

409
00:19:52.640 --> 00:19:55.119
<v Speaker 3>for everyone in the world to download and use.

410
00:19:55.240 --> 00:19:58.480
<v Speaker 2>It's like evolution on fast forward, a hyper revolution it is.

411
00:19:58.640 --> 00:20:03.839
<v Speaker 3>It's a decentralized that iterates and improves immediately in parallel.

412
00:20:04.319 --> 00:20:07.799
<v Speaker 3>Big tech has deep pockets and amazing researchers, but they

413
00:20:07.839 --> 00:20:12.880
<v Speaker 3>simply cannot match that sheer, chaotic velocity of global experimentation.

414
00:20:13.119 --> 00:20:15.400
<v Speaker 2>And then there's the geopolitical angle, which was something I

415
00:20:15.440 --> 00:20:17.440
<v Speaker 2>hadn't fully considered before digging into this.

416
00:20:17.519 --> 00:20:19.839
<v Speaker 3>It's huge. It's easy to forget that not every researcher

417
00:20:19.880 --> 00:20:22.759
<v Speaker 3>or developer is sitting in Siliton Valley or London. If

418
00:20:22.759 --> 00:20:25.559
<v Speaker 3>you're a brilliant AI researcher in a country that doesn't

419
00:20:25.599 --> 00:20:29.920
<v Speaker 3>have easy access to US proprietary tech, maybe because of

420
00:20:29.960 --> 00:20:33.279
<v Speaker 3>sanctions or just because of economics, you can't use the

421
00:20:33.279 --> 00:20:37.160
<v Speaker 3>closed APIs. OpenAI might be blocked in your country or

422
00:20:37.279 --> 00:20:38.359
<v Speaker 3>just be too expensive.

423
00:20:38.519 --> 00:20:41.279
<v Speaker 2>So for a huge portion of the world, open source

424
00:20:41.319 --> 00:20:42.480
<v Speaker 2>is the only game in town.

425
00:20:42.880 --> 00:20:45.960
<v Speaker 3>It's a force for democratizing access. It means a student

426
00:20:46.000 --> 00:20:49.400
<v Speaker 3>in India, a researcher in Brazil, a startup in Nigeria.

427
00:20:49.519 --> 00:20:52.000
<v Speaker 3>They all get access to the same cutting edge tools

428
00:20:52.039 --> 00:20:55.279
<v Speaker 3>that were until recently locked away inside a handful of

429
00:20:55.319 --> 00:20:56.559
<v Speaker 3>American corporations.

430
00:20:56.599 --> 00:20:58.720
<v Speaker 2>So we have this picture of the big tech fortress

431
00:20:58.839 --> 00:21:00.799
<v Speaker 2>with all the money and data, and we have the

432
00:21:00.839 --> 00:21:03.880
<v Speaker 2>open source rebellion with all the speed, the privacy in

433
00:21:03.920 --> 00:21:06.599
<v Speaker 2>the global reach. But then then we have to talk

434
00:21:06.599 --> 00:21:07.680
<v Speaker 2>about the meta paradox.

435
00:21:07.799 --> 00:21:12.000
<v Speaker 3>Ah, Yes, the agent of chaos, the spanner in the works.

436
00:21:12.200 --> 00:21:14.759
<v Speaker 2>This brings us to the weirdest, most confusing part of

437
00:21:14.759 --> 00:21:18.240
<v Speaker 2>the whole map. You have meta Mark Zuckerbert's empire, which

438
00:21:18.279 --> 00:21:21.559
<v Speaker 2>sits squarely, undeniably in the big tech camp. But then

439
00:21:21.599 --> 00:21:23.480
<v Speaker 2>you look at what they're doing with their Lama models,

440
00:21:23.599 --> 00:21:25.039
<v Speaker 2>and it looks for all the world like they're the

441
00:21:25.079 --> 00:21:26.759
<v Speaker 2>primary arms dealer for the rebels.

442
00:21:27.000 --> 00:21:29.759
<v Speaker 3>It's the single biggest plot twist of the last decade

443
00:21:29.799 --> 00:21:32.960
<v Speaker 3>in tech. They are spending those billions and billions of

444
00:21:33.000 --> 00:21:36.480
<v Speaker 3>dollars we just talked about building the data centers, burning

445
00:21:36.480 --> 00:21:39.200
<v Speaker 3>the megawatts of electricity to build these massive state of

446
00:21:39.240 --> 00:21:41.720
<v Speaker 3>the art brains and then they just put the weights

447
00:21:41.759 --> 00:21:43.400
<v Speaker 3>on the internet for free.

448
00:21:43.839 --> 00:21:47.119
<v Speaker 2>It feels like charity, which I'm going to go out

449
00:21:47.119 --> 00:21:49.400
<v Speaker 2>on a limb here and assume Meta is not a charity.

450
00:21:49.480 --> 00:21:52.480
<v Speaker 3>You would be correct. This is not altruism. This is

451
00:21:52.480 --> 00:21:55.640
<v Speaker 3>one of the most ruthless and brilliant strategic moves in

452
00:21:55.680 --> 00:21:58.559
<v Speaker 3>the history of the tech industry. It's a concept that

453
00:21:58.680 --> 00:22:01.880
<v Speaker 3>economists call commoditizing the compliment.

454
00:22:02.000 --> 00:22:04.640
<v Speaker 2>Commoditizing the compliment. Okay, break that down for us in

455
00:22:04.680 --> 00:22:06.559
<v Speaker 2>plain English. What does that mean?

456
00:22:06.640 --> 00:22:09.759
<v Speaker 3>Okay, think about it this way. If your core business

457
00:22:09.799 --> 00:22:12.319
<v Speaker 3>is selling hot dogs, what do you want to be

458
00:22:12.400 --> 00:22:13.839
<v Speaker 3>true about hot dog buns?

459
00:22:14.519 --> 00:22:16.640
<v Speaker 2>I want them to be as cheap and as widely

460
00:22:16.680 --> 00:22:19.920
<v Speaker 2>available as humanly possible free if I can manage it.

461
00:22:20.160 --> 00:22:23.680
<v Speaker 2>Why Because the cheaper and easier it is to get buns,

462
00:22:24.279 --> 00:22:26.920
<v Speaker 2>the more of my hot dogs people will buy. The

463
00:22:26.960 --> 00:22:29.920
<v Speaker 2>bun is a necessary part of the equation. But it's

464
00:22:29.960 --> 00:22:31.240
<v Speaker 2>not where I make my money.

465
00:22:31.559 --> 00:22:35.799
<v Speaker 3>Right, The buns are the compliment to your main product. Now,

466
00:22:35.839 --> 00:22:37.920
<v Speaker 3>look at Meta. What is their core business? What do

467
00:22:37.960 --> 00:22:38.759
<v Speaker 3>they actually sell?

468
00:22:38.880 --> 00:22:42.119
<v Speaker 2>They sell us, They sell our attention. They sell advertisements

469
00:22:42.160 --> 00:22:44.079
<v Speaker 2>on Instagram and Facebook and WhatsApp.

470
00:22:44.200 --> 00:22:47.640
<v Speaker 3>Correct, They do not sell cloud computing services. They don't

471
00:22:47.640 --> 00:22:51.319
<v Speaker 3>sell subscriptions to AI models. For Google and Microsoft, the

472
00:22:51.359 --> 00:22:54.640
<v Speaker 3>powerful AI model is the hot dog, it's the product.

473
00:22:55.319 --> 00:22:58.240
<v Speaker 3>They need that model to be expensive and exclusive so

474
00:22:58.279 --> 00:23:00.000
<v Speaker 3>they can charge you twenty dollars a month for hy

475
00:23:00.079 --> 00:23:00.680
<v Speaker 3>access to it.

476
00:23:01.000 --> 00:23:03.559
<v Speaker 2>But for Meta, the AI is just the bun.

477
00:23:03.680 --> 00:23:06.559
<v Speaker 3>The AI is the bun. It's the complementary good, it's

478
00:23:06.599 --> 00:23:09.680
<v Speaker 3>the infrastructure. By giving away their state of the art

479
00:23:09.759 --> 00:23:12.759
<v Speaker 3>LAMA models for free, Mark Zuckerberg is trying to drive

480
00:23:12.799 --> 00:23:15.640
<v Speaker 3>the market price of raw intelligence down to zero.

481
00:23:15.519 --> 00:23:19.240
<v Speaker 2>Which completely scorches the earth. For his primary competitors, Google

482
00:23:19.279 --> 00:23:20.720
<v Speaker 2>and Microsoft.

483
00:23:20.400 --> 00:23:24.079
<v Speaker 3>It annihilates their business model. If any startup in the

484
00:23:24.079 --> 00:23:26.599
<v Speaker 3>world can grab Lama three for free and build a

485
00:23:26.599 --> 00:23:29.839
<v Speaker 3>fantastic business on top of it, why would they ever

486
00:23:29.880 --> 00:23:34.039
<v Speaker 3>pay open Ai or Google huge licensing fees. Meta is

487
00:23:34.079 --> 00:23:36.640
<v Speaker 3>strategically trying to make sure that no one else can

488
00:23:36.680 --> 00:23:40.079
<v Speaker 3>build a gatekeeper position in the AI layer of the

489
00:23:40.079 --> 00:23:43.599
<v Speaker 3>Internet that could one day threaten their advertising empire.

490
00:23:43.799 --> 00:23:48.920
<v Speaker 2>That is absolutely rusiless. Okay. Another analogy, imagine if you

491
00:23:49.000 --> 00:23:52.240
<v Speaker 2>were a company that made all its money selling shaving cream,

492
00:23:53.000 --> 00:23:55.880
<v Speaker 2>and your main competitor made all their money selling expensive

493
00:23:55.920 --> 00:23:59.680
<v Speaker 2>proprietary razors. You might start giving away really good razor

494
00:23:59.680 --> 00:24:02.759
<v Speaker 2>handle for free. Yes, perfect, because if everyone has a

495
00:24:02.759 --> 00:24:05.359
<v Speaker 2>free razor handle that works great, nobody needs to buy

496
00:24:05.400 --> 00:24:07.960
<v Speaker 2>the competitors' expensive ones. But everybody still needs to buy

497
00:24:08.000 --> 00:24:08.640
<v Speaker 2>your shaving crank.

498
00:24:08.680 --> 00:24:10.960
<v Speaker 3>That is the perfect analogy. Meta is giving away the

499
00:24:11.039 --> 00:24:14.200
<v Speaker 3>razor handles to protect it shaving cream business. They are

500
00:24:14.279 --> 00:24:17.440
<v Speaker 3>scorching the earth so that their rivals can't build castles

501
00:24:17.480 --> 00:24:19.680
<v Speaker 3>that might one day charge Meta a toll.

502
00:24:19.759 --> 00:24:22.119
<v Speaker 2>And I'm guessing there are other benefits to Meta too, right,

503
00:24:22.400 --> 00:24:24.359
<v Speaker 2>beyond just kneecapping their competitors.

504
00:24:24.799 --> 00:24:28.000
<v Speaker 3>Oh, massive benefits. Remember that swarm intelligence we were just.

505
00:24:27.960 --> 00:24:29.759
<v Speaker 2>Talking about, Yeah, the hyper evolution.

506
00:24:30.079 --> 00:24:33.720
<v Speaker 3>The second Meta releases a new Lama model, thousands of

507
00:24:33.720 --> 00:24:37.079
<v Speaker 3>the world's smartest developers immediately start hacking on it. They

508
00:24:37.119 --> 00:24:40.000
<v Speaker 3>find bugs and they fix them. They figure out new

509
00:24:40.039 --> 00:24:42.720
<v Speaker 3>ways to make it faster, they discover how to run

510
00:24:42.759 --> 00:24:44.279
<v Speaker 3>it on cheaper hardware.

511
00:24:43.920 --> 00:24:46.880
<v Speaker 2>And Meta gets all of that research and development for.

512
00:24:46.599 --> 00:24:51.119
<v Speaker 3>Free, completely free R and D. They are effectively outsourcing

513
00:24:51.160 --> 00:24:55.200
<v Speaker 3>a huge chunk of their innovation pipeline to the entire world.

514
00:24:55.519 --> 00:24:58.799
<v Speaker 3>Plus they build up this incredible reservoir of goodwill with

515
00:24:58.880 --> 00:25:02.799
<v Speaker 3>the developer community. Suddenly Meta isn't the bad guy from

516
00:25:02.799 --> 00:25:05.440
<v Speaker 3>the privacy scandals. They're the good guy who is enabling

517
00:25:05.480 --> 00:25:08.720
<v Speaker 3>open science and empowering the little guy. It's a masterful

518
00:25:08.799 --> 00:25:12.119
<v Speaker 3>repositioning of their brand. They become the neutral Switzerland like

519
00:25:12.160 --> 00:25:13.559
<v Speaker 3>party in the AI wars.

520
00:25:13.680 --> 00:25:16.279
<v Speaker 2>That is some serious forty chess right there.

521
00:25:16.359 --> 00:25:19.279
<v Speaker 3>It really is, and it completely complicates the whole narrative.

522
00:25:19.319 --> 00:25:23.400
<v Speaker 3>It's no longer just corporations versus the people. It's corporation

523
00:25:23.640 --> 00:25:27.960
<v Speaker 3>A versus corporation B, with corporation A using the people

524
00:25:28.039 --> 00:25:30.519
<v Speaker 3>as a strategic labor against corporation B.

525
00:25:30.720 --> 00:25:34.279
<v Speaker 2>So with all these complex pieces on the board, the fortress,

526
00:25:34.440 --> 00:25:38.839
<v Speaker 2>the swarm, the chaotic agent in the middle, what is

527
00:25:38.880 --> 00:25:42.559
<v Speaker 2>the current score If we were to freeze the game

528
00:25:42.680 --> 00:25:45.759
<v Speaker 2>right now today, who is actually winning?

529
00:25:45.960 --> 00:25:47.720
<v Speaker 3>Well, we have to be realistic about it. If we're

530
00:25:47.799 --> 00:25:51.480
<v Speaker 3>judging purely on the frontier, the absolute cutting edge of

531
00:25:51.599 --> 00:25:54.880
<v Speaker 3>raw capability, big tech still has the lead.

532
00:25:55.200 --> 00:25:57.519
<v Speaker 2>The big proprietary models are still smarter.

533
00:25:57.480 --> 00:25:59.960
<v Speaker 3>On the whole. Yes, if you look at the leaderboard

534
00:26:00.160 --> 00:26:04.480
<v Speaker 3>for complex reasoning, for advanced coding challenges, for multimodal tasks

535
00:26:04.480 --> 00:26:08.920
<v Speaker 3>that involve understanding both images and text. The flagship models

536
00:26:08.920 --> 00:26:12.480
<v Speaker 3>from open Ai, Google and mpropic still consistently beat the

537
00:26:12.480 --> 00:26:13.440
<v Speaker 3>best open models.

538
00:26:13.519 --> 00:26:15.279
<v Speaker 2>So the gap is narrowing, but it's still there.

539
00:26:15.400 --> 00:26:17.720
<v Speaker 3>It hasn't closed yet. And for users who need the

540
00:26:17.799 --> 00:26:21.039
<v Speaker 3>absolute best in class performance, think of a doctor using

541
00:26:21.079 --> 00:26:24.359
<v Speaker 3>AI to help diagnose a complex disease or a lawyer

542
00:26:24.480 --> 00:26:28.240
<v Speaker 3>using it for intricate legal analysis, that small performance gap

543
00:26:28.279 --> 00:26:29.079
<v Speaker 3>can really matter.

544
00:26:29.160 --> 00:26:31.759
<v Speaker 2>And what about integration just getting it into the hands

545
00:26:31.799 --> 00:26:32.519
<v Speaker 2>of normal people.

546
00:26:32.599 --> 00:26:35.039
<v Speaker 3>Big tech wins there hands down. It's what we call

547
00:26:35.079 --> 00:26:36.400
<v Speaker 3>the distribution.

548
00:26:35.880 --> 00:26:37.960
<v Speaker 2>Advantage, which means what exactly it means.

549
00:26:38.039 --> 00:26:41.279
<v Speaker 3>The AI is showing up almost invisibly inside the tools

550
00:26:41.319 --> 00:26:44.599
<v Speaker 3>you already use every single day. It's Copilot appearing in

551
00:26:44.680 --> 00:26:47.880
<v Speaker 3>Microsoft word, it's Gemini helping you write an email in

552
00:26:47.920 --> 00:26:52.079
<v Speaker 3>Google Docs. It's AI photo editing build directly into your phone.

553
00:26:52.200 --> 00:26:54.000
<v Speaker 2>Right. I don't have to go find a special website

554
00:26:54.079 --> 00:26:57.200
<v Speaker 2>or installing it. It's just there. The friction is zero,

555
00:26:57.519 --> 00:26:58.119
<v Speaker 2>zero friction.

556
00:26:58.599 --> 00:27:01.680
<v Speaker 3>The open source alternatives might be fantastic, but you still

557
00:27:01.680 --> 00:27:04.039
<v Speaker 3>have to make a conscious choice to go find them,

558
00:27:04.160 --> 00:27:08.200
<v Speaker 3>download them, maybe configure them. Most people, most of the time,

559
00:27:08.279 --> 00:27:11.480
<v Speaker 3>will take the path of least resistance. And big tech

560
00:27:11.839 --> 00:27:12.799
<v Speaker 3>owns those paths.

561
00:27:13.279 --> 00:27:14.799
<v Speaker 2>And this is just the simple brand factor.

562
00:27:14.880 --> 00:27:17.119
<v Speaker 3>Right heask your neighbor, ask your parents to name a AI.

563
00:27:17.279 --> 00:27:19.960
<v Speaker 3>They will say chat GPT. They won't say mistroll or

564
00:27:20.039 --> 00:27:22.039
<v Speaker 3>Lama or mixt roll eight by seven B.

565
00:27:22.359 --> 00:27:25.359
<v Speaker 2>That brand recognition drives usage, which drives.

566
00:27:25.079 --> 00:27:28.400
<v Speaker 3>Revenue, and that revenue funds the next billion dollar training run.

567
00:27:28.480 --> 00:27:30.839
<v Speaker 3>It's a powerful self reinforcing cycle.

568
00:27:31.119 --> 00:27:34.279
<v Speaker 2>But there's some big X factors here, wild cards that

569
00:27:34.319 --> 00:27:37.119
<v Speaker 2>could still tip the scales, and one of the biggest

570
00:27:37.160 --> 00:27:39.039
<v Speaker 2>is the government regulation.

571
00:27:39.200 --> 00:27:42.759
<v Speaker 3>The regulatory dimension. This is where the battle moves from

572
00:27:42.799 --> 00:27:45.640
<v Speaker 3>the server room to the halls of Congress and Brussels.

573
00:27:45.759 --> 00:27:49.359
<v Speaker 2>How does regulation play into this? My first assumption would

574
00:27:49.359 --> 00:27:51.880
<v Speaker 2>be that everyone hates regulation, that it just gets in

575
00:27:51.880 --> 00:27:52.200
<v Speaker 2>the way.

576
00:27:52.319 --> 00:27:55.559
<v Speaker 3>Actually, no, and this is one of the most counterintuitive

577
00:27:55.559 --> 00:27:59.880
<v Speaker 3>parts of the whole story. Big tech often quietly welcomes

578
00:28:00.000 --> 00:28:01.200
<v Speaker 3>certain kinds of regulation.

579
00:28:01.559 --> 00:28:04.200
<v Speaker 2>Really, why on earth would they want more rules?

580
00:28:04.359 --> 00:28:06.359
<v Speaker 3>Because compliance is expensive?

581
00:28:06.559 --> 00:28:06.839
<v Speaker 2>Wow.

582
00:28:07.000 --> 00:28:09.839
<v Speaker 3>Of course, if the government passes a law that says,

583
00:28:10.279 --> 00:28:13.519
<v Speaker 3>to legally release a powerful AI model, you must first

584
00:28:13.559 --> 00:28:17.160
<v Speaker 3>perform ten million dollars worth of safety testing, formal auditing,

585
00:28:17.240 --> 00:28:18.000
<v Speaker 3>and red teaming.

586
00:28:18.480 --> 00:28:22.359
<v Speaker 2>Who can afford to do that, Microsoft, Google, the big guys.

587
00:28:22.119 --> 00:28:23.839
<v Speaker 3>Exactly, and who can afford.

588
00:28:23.559 --> 00:28:25.960
<v Speaker 2>To do that the two college students in a garage

589
00:28:26.559 --> 00:28:29.440
<v Speaker 2>or a small, bootstrapped open source startup.

590
00:28:29.519 --> 00:28:33.440
<v Speaker 3>Precisely expensive. Regulation, while sounding good on the surface, often

591
00:28:33.519 --> 00:28:36.920
<v Speaker 3>ends up creating a massive barrier to entry. It protects

592
00:28:36.960 --> 00:28:39.519
<v Speaker 3>the incumbents from new competition. It's a way for them

593
00:28:39.559 --> 00:28:40.920
<v Speaker 3>to pull up the ladder behind them.

594
00:28:41.200 --> 00:28:44.839
<v Speaker 2>So big tech might publicly say, oh, yes, please regulate us,

595
00:28:44.960 --> 00:28:49.559
<v Speaker 2>this technology is so dangerous, while privately thinking this is fantastic.

596
00:28:49.599 --> 00:28:52.079
<v Speaker 2>This will crush all the small fry competitors.

597
00:28:52.200 --> 00:28:55.079
<v Speaker 3>It's a very common dynamic in mature industries. It's a

598
00:28:55.119 --> 00:28:57.279
<v Speaker 3>form of what's called regulatory capture.

599
00:28:57.440 --> 00:28:59.519
<v Speaker 2>And what's the argument from the open source side?

600
00:28:59.759 --> 00:29:03.640
<v Speaker 3>They argue that you simply cannot regulate software in that way.

601
00:29:03.799 --> 00:29:08.480
<v Speaker 3>Their core argument is essentially math is speech. You can't

602
00:29:08.480 --> 00:29:10.640
<v Speaker 3>put the genie back in the bottle. You can't make

603
00:29:10.680 --> 00:29:13.240
<v Speaker 3>it illegal to share a file of numbers.

604
00:29:13.319 --> 00:29:14.559
<v Speaker 2>If I download it, I can run it.

605
00:29:14.839 --> 00:29:18.000
<v Speaker 3>And if you regulate the US and European developers too heavily,

606
00:29:18.400 --> 00:29:20.799
<v Speaker 3>the top talent and the innovation will just move to

607
00:29:20.839 --> 00:29:23.400
<v Speaker 3>a country with looser rules. You don't stop it, you

608
00:29:23.519 --> 00:29:24.359
<v Speaker 3>just offshore it.

609
00:29:24.559 --> 00:29:27.759
<v Speaker 2>So there's a real fear that heavy handed regulation could

610
00:29:27.839 --> 00:29:31.640
<v Speaker 2>actually hurt the good guys in the open community. Yeah,

611
00:29:31.759 --> 00:29:33.839
<v Speaker 2>far more than it hurts the giants they're trying to

612
00:29:33.920 --> 00:29:34.240
<v Speaker 2>rain in.

613
00:29:34.440 --> 00:29:37.359
<v Speaker 3>That is the central fear. The final shape of regulation

614
00:29:37.519 --> 00:29:40.079
<v Speaker 3>could end up deciding the winner of this war more

615
00:29:40.119 --> 00:29:42.400
<v Speaker 3>than any single technological breakthrough.

616
00:29:42.599 --> 00:29:44.200
<v Speaker 2>There's one more X factor I want to touch on,

617
00:29:44.240 --> 00:29:47.119
<v Speaker 2>which is synthetic data. We talked earlier about how big

618
00:29:47.160 --> 00:29:49.839
<v Speaker 2>tech has this huge data mode, but isn't there a

619
00:29:49.880 --> 00:29:52.559
<v Speaker 2>movement now to just create your own data.

620
00:29:52.680 --> 00:29:56.200
<v Speaker 3>Yes, this is a major frontier for the open source

621
00:29:56.240 --> 00:29:58.480
<v Speaker 3>world and a potential moat killer.

622
00:29:58.640 --> 00:29:59.680
<v Speaker 2>What is it exactly?

623
00:30:00.119 --> 00:30:02.000
<v Speaker 3>Well, the problem is that you can run out of

624
00:30:02.079 --> 00:30:05.039
<v Speaker 3>high quality text on the public internet to train your next.

625
00:30:04.839 --> 00:30:07.079
<v Speaker 2>Model, on which I hear we are actually getting close

626
00:30:07.119 --> 00:30:07.359
<v Speaker 2>to doing.

627
00:30:07.440 --> 00:30:10.680
<v Speaker 3>We are getting surprisingly close. The really high quality, well

628
00:30:10.680 --> 00:30:15.079
<v Speaker 3>written stuff is finite. So the idea is this, what

629
00:30:15.240 --> 00:30:19.039
<v Speaker 3>if you use a really smart existing model like GBT

630
00:30:19.200 --> 00:30:22.400
<v Speaker 3>four or claude to write new training data. You ask

631
00:30:22.440 --> 00:30:25.960
<v Speaker 3>it to generate textbooks or to create thousands of high

632
00:30:26.039 --> 00:30:29.400
<v Speaker 3>quality examples of Socratic dialogues, and then you use that

633
00:30:29.480 --> 00:30:33.799
<v Speaker 3>perfectly clean AI generated data to train your next new model.

634
00:30:34.119 --> 00:30:36.799
<v Speaker 2>So the AI is teaching the next generation of AI.

635
00:30:37.000 --> 00:30:38.480
<v Speaker 3>Yes, exactly, isn't that.

636
00:30:38.799 --> 00:30:41.240
<v Speaker 2>A bit incestuous? Don't you risk degrading the quality? It

637
00:30:41.319 --> 00:30:44.240
<v Speaker 2>feels like making a photocopy of a photocopy of a photocopy.

638
00:30:44.480 --> 00:30:46.359
<v Speaker 2>Eventually it just turns into a blurry mess.

639
00:30:46.519 --> 00:30:49.119
<v Speaker 3>That is the number one risk. It's a very real

640
00:30:49.119 --> 00:30:53.799
<v Speaker 3>phenomenon called model collapse or habsburg AI. If you aren't

641
00:30:53.799 --> 00:30:57.039
<v Speaker 3>incredibly careful, the models do start learning their own mistakes,

642
00:30:57.079 --> 00:31:01.039
<v Speaker 3>They start hallucinating more, their knowledge becomes weird and distorted.

643
00:31:01.160 --> 00:31:02.680
<v Speaker 2>So how do you prevent that from happening?

644
00:31:02.839 --> 00:31:06.200
<v Speaker 3>Meticulous curation and this is another area where the open

645
00:31:06.240 --> 00:31:10.640
<v Speaker 3>source community swarm intelligence shines. You have organizations like a

646
00:31:10.680 --> 00:31:15.319
<v Speaker 3>Luther AI and literally thousands of volunteers who treat building

647
00:31:15.400 --> 00:31:18.839
<v Speaker 3>these data sets like a massive science project. They are

648
00:31:18.960 --> 00:31:22.680
<v Speaker 3>manually cleaning data, filtering out the bad stuff, and rating

649
00:31:22.720 --> 00:31:24.559
<v Speaker 3>the quality of the synthetic examples.

650
00:31:24.680 --> 00:31:28.039
<v Speaker 2>So it's human verified synthetic data. A hybrid approach.

651
00:31:28.200 --> 00:31:31.119
<v Speaker 3>Exactly, It's a hybrid and this is a very powerful

652
00:31:31.160 --> 00:31:33.559
<v Speaker 3>technique for helping to close that data mode. If you

653
00:31:33.599 --> 00:31:36.759
<v Speaker 3>can generate your own high quality training data. You don't

654
00:31:36.799 --> 00:31:39.839
<v Speaker 3>need access to Google's private search logs nearly as much

655
00:31:39.839 --> 00:31:40.559
<v Speaker 3>as you thought you did.

656
00:31:40.640 --> 00:31:43.799
<v Speaker 2>That is fascinating. It really challenges that old idea that

657
00:31:44.160 --> 00:31:46.920
<v Speaker 2>data is the new oil and only the giant oil

658
00:31:46.920 --> 00:31:48.839
<v Speaker 2>barons have access to it. It turns out you can

659
00:31:48.880 --> 00:31:51.480
<v Speaker 2>kind of synthesize your own oil if you're smart enough, if.

660
00:31:51.359 --> 00:31:53.960
<v Speaker 3>You are smart enough and extremely careful.

661
00:31:54.240 --> 00:31:57.200
<v Speaker 2>So let's try to bring this all together. Let's try

662
00:31:57.200 --> 00:31:59.640
<v Speaker 2>to reach a verdict. We've looked at the advantages of

663
00:31:59.720 --> 00:32:03.960
<v Speaker 2>stre the paradox as the wild cards. Who wins? Is

664
00:32:03.960 --> 00:32:06.880
<v Speaker 2>there a clear winner in this battle for the future?

665
00:32:07.200 --> 00:32:09.359
<v Speaker 3>You know. The honest answer, and it might be a

666
00:32:09.400 --> 00:32:12.759
<v Speaker 3>little unsatisfying, is that it's not binary. It's not going

667
00:32:12.839 --> 00:32:15.680
<v Speaker 3>to be one side standing victorious on the ashes of

668
00:32:15.720 --> 00:32:16.000
<v Speaker 3>the other.

669
00:32:16.200 --> 00:32:17.359
<v Speaker 2>So it's a split decision.

670
00:32:17.400 --> 00:32:19.759
<v Speaker 3>I think it's a permanent coexistence. They're going to win

671
00:32:19.880 --> 00:32:20.880
<v Speaker 3>different layers.

672
00:32:20.559 --> 00:32:22.720
<v Speaker 2>Of the stack. Okay, break that down for me. Who

673
00:32:22.799 --> 00:32:23.279
<v Speaker 2>wins what.

674
00:32:23.759 --> 00:32:27.759
<v Speaker 3>Big tech wins the frontier for the foreseeable future, the

675
00:32:27.920 --> 00:32:33.400
<v Speaker 3>absolute smartest, most powerful, most capable systems are going to

676
00:32:33.440 --> 00:32:36.039
<v Speaker 3>come from the groups with the billions of dollars to

677
00:32:36.079 --> 00:32:39.279
<v Speaker 3>spend on compute and the unique data feedback loops.

678
00:32:39.359 --> 00:32:41.200
<v Speaker 2>Okay, so they own the top of the mountain.

679
00:32:41.319 --> 00:32:44.599
<v Speaker 3>They will likely also win the broad consumer layer, the

680
00:32:44.640 --> 00:32:47.279
<v Speaker 3>apps on your phone, the voice assistant in your car,

681
00:32:47.680 --> 00:32:50.079
<v Speaker 3>the AI built into your search engine. That will be

682
00:32:50.119 --> 00:32:50.599
<v Speaker 3>a big.

683
00:32:50.440 --> 00:32:52.839
<v Speaker 2>Tech because of that convenience and distribution advantage.

684
00:32:52.880 --> 00:32:55.400
<v Speaker 3>We talked about it, right, But open source is going

685
00:32:55.440 --> 00:32:56.839
<v Speaker 3>to win the deployment.

686
00:32:56.400 --> 00:32:59.279
<v Speaker 2>Layer, the infrastructure, the plumbing.

687
00:32:59.039 --> 00:33:03.160
<v Speaker 3>The plumbing of the world. Yes, enterprise back ends, specialized

688
00:33:03.160 --> 00:33:07.319
<v Speaker 3>tools for science and medicine, the infrastructure for finance. Most

689
00:33:07.319 --> 00:33:09.680
<v Speaker 3>of that will eventually run on open models because of

690
00:33:09.720 --> 00:33:14.240
<v Speaker 3>the critical need for cost effectiveness, control, privacy, and stability.

691
00:33:14.680 --> 00:33:18.680
<v Speaker 2>So, if I'm hearing you right, big tech pushes the

692
00:33:18.720 --> 00:33:21.759
<v Speaker 2>absolute ceiling of what's possible higher.

693
00:33:21.440 --> 00:33:25.160
<v Speaker 3>And higher, and open source raises the floor for everyone

694
00:33:25.279 --> 00:33:27.640
<v Speaker 3>and spreads that technology everywhere.

695
00:33:28.079 --> 00:33:33.000
<v Speaker 2>That's a really interesting, almost symbiotic dynamic. They need each

696
00:33:33.039 --> 00:33:33.480
<v Speaker 2>other in.

697
00:33:33.400 --> 00:33:35.680
<v Speaker 3>A weird way, they really do. And frankly, I think

698
00:33:35.680 --> 00:33:38.000
<v Speaker 3>the most important victory for open source might not be

699
00:33:38.119 --> 00:33:39.079
<v Speaker 3>market share at all.

700
00:33:39.279 --> 00:33:39.680
<v Speaker 2>What is it?

701
00:33:39.720 --> 00:33:42.079
<v Speaker 3>Then, It's the establishment of a principle.

702
00:33:42.200 --> 00:33:44.079
<v Speaker 2>What principles say, The principle.

703
00:33:43.680 --> 00:33:46.319
<v Speaker 3>That AI capability doesn't have to be concentrated in a

704
00:33:46.319 --> 00:33:48.799
<v Speaker 3>handful of corporations, that we don't have to live in

705
00:33:48.839 --> 00:33:52.039
<v Speaker 3>a world where three companies in California hold all the

706
00:33:52.119 --> 00:33:53.000
<v Speaker 3>keys to the future.

707
00:33:53.079 --> 00:33:54.640
<v Speaker 2>It's an insurance policy for the world.

708
00:33:55.000 --> 00:33:59.519
<v Speaker 3>It's exactly that open source ensures that the knowledge, the tools,

709
00:33:59.559 --> 00:34:03.039
<v Speaker 3>and the town are distributed globally. It means that no

710
00:34:03.119 --> 00:34:07.119
<v Speaker 3>matter what happens with corporate roadmaps or government regulations, the

711
00:34:07.200 --> 00:34:12.199
<v Speaker 3>fundamental capability to build and understand these powerful systems belongs

712
00:34:12.199 --> 00:34:14.840
<v Speaker 3>to everyone, not just to a few boardrooms.

713
00:34:15.039 --> 00:34:17.840
<v Speaker 2>That's a really powerful thought. It shifts the focus from

714
00:34:18.119 --> 00:34:20.880
<v Speaker 2>who makes the most money to who gets to participate

715
00:34:20.880 --> 00:34:21.800
<v Speaker 2>in building the future.

716
00:34:21.920 --> 00:34:25.800
<v Speaker 3>Exactly, it's about preserving human agency in the age of AI.

717
00:34:26.199 --> 00:34:28.480
<v Speaker 2>You know, we've talked about winning in these broad strokes,

718
00:34:28.760 --> 00:34:30.679
<v Speaker 2>but I want to ask you personally, as someone who

719
00:34:31.039 --> 00:34:33.320
<v Speaker 2>analyzes this space every day, where do you put your

720
00:34:33.400 --> 00:34:36.000
<v Speaker 2>chips if you had to bet on what the AI

721
00:34:36.159 --> 00:34:38.719
<v Speaker 2>landscape looks like and say five years.

722
00:34:38.519 --> 00:34:41.079
<v Speaker 3>Five years is an absolute eternity in AI time.

723
00:34:41.199 --> 00:34:42.719
<v Speaker 2>Humor me, what's your gut feeling.

724
00:34:43.000 --> 00:34:44.679
<v Speaker 3>I think we're going to see what I'd call a

725
00:34:44.760 --> 00:34:48.039
<v Speaker 3>barbell distribution of intelligence. A barbell What do you mean

726
00:34:48.199 --> 00:34:52.519
<v Speaker 3>On one end of the barbell, you'll have these massive, centralized,

727
00:34:52.760 --> 00:34:56.679
<v Speaker 3>super intelligent cloud models owned by maybe two or three companies.

728
00:34:57.320 --> 00:34:59.079
<v Speaker 3>This is what you'll use when you needed to solve

729
00:34:59.119 --> 00:35:02.480
<v Speaker 3>cancer or design a fusion reactor, or plan a global

730
00:35:02.519 --> 00:35:03.599
<v Speaker 3>logistics network.

731
00:35:03.800 --> 00:35:05.119
<v Speaker 2>The really heavy lifting.

732
00:35:04.880 --> 00:35:08.239
<v Speaker 3>The planetary scale heavy lifting. On the other far end

733
00:35:08.239 --> 00:35:12.679
<v Speaker 3>of the barbell, you will have billions of tiny, highly efficient,

734
00:35:12.840 --> 00:35:17.039
<v Speaker 3>highly personalized open source models running locally on our phones,

735
00:35:17.400 --> 00:35:21.239
<v Speaker 3>our glasses, our laptops, maybe even our home appliances.

736
00:35:21.400 --> 00:35:24.039
<v Speaker 2>The edge, the personal intelligence.

737
00:35:23.559 --> 00:35:25.519
<v Speaker 3>Layer, the edge, yes, and the middle. I think the

738
00:35:25.519 --> 00:35:28.960
<v Speaker 3>middle gets squeezed out. The medium sized general purpose proprietary

739
00:35:28.960 --> 00:35:31.519
<v Speaker 3>models might just disappear because they won't be as smart

740
00:35:31.559 --> 00:35:34.000
<v Speaker 3>as the giants or as cheap in private as the

741
00:35:34.039 --> 00:35:34.800
<v Speaker 3>local models.

742
00:35:34.920 --> 00:35:36.960
<v Speaker 2>So you're either a god in the cloud or you're

743
00:35:37.000 --> 00:35:39.199
<v Speaker 2>a trusted personal assistant running in my pocket.

744
00:35:39.400 --> 00:35:41.840
<v Speaker 3>That is where the economics and the user needs seem

745
00:35:41.880 --> 00:35:42.480
<v Speaker 3>to be pointing.

746
00:35:42.679 --> 00:35:46.559
<v Speaker 2>And open source in your view, owns that personal assistant layer.

747
00:35:46.800 --> 00:35:49.039
<v Speaker 3>I think it has to, because I do not want

748
00:35:49.119 --> 00:35:52.840
<v Speaker 3>Google or any single corporation owning the AI that runs

749
00:35:52.880 --> 00:35:55.199
<v Speaker 3>on my glasses and sees everything I see, And here's

750
00:35:55.239 --> 00:35:58.920
<v Speaker 3>everything I say I want that to be mine, verifiably

751
00:35:58.960 --> 00:36:02.119
<v Speaker 3>private and under my contry, and open source is the

752
00:36:02.159 --> 00:36:04.800
<v Speaker 3>only technological path to guarantee that that.

753
00:36:04.719 --> 00:36:07.039
<v Speaker 2>Makes a ton of sense. It's the privacy and control

754
00:36:07.119 --> 00:36:09.079
<v Speaker 2>argument again. It always seems to come back to that

755
00:36:09.199 --> 00:36:09.519
<v Speaker 2>in the end.

756
00:36:09.519 --> 00:36:10.480
<v Speaker 3>It almost always does.

757
00:36:11.199 --> 00:36:14.519
<v Speaker 2>So as we wrap this up, what's the one thing

758
00:36:14.719 --> 00:36:16.760
<v Speaker 2>our listeners should take away from all this when they

759
00:36:16.800 --> 00:36:19.320
<v Speaker 2>see the next big headline about open AI or Google

760
00:36:19.480 --> 00:36:23.039
<v Speaker 2>or nu Lama model, what lens should they view it through?

761
00:36:23.239 --> 00:36:27.320
<v Speaker 3>I would say, try to look past the simple versus narrative.

762
00:36:27.440 --> 00:36:29.239
<v Speaker 3>Don't just ask who has the higher school on the

763
00:36:29.280 --> 00:36:32.320
<v Speaker 3>latest benchmark today, ask a different set of questions. Where

764
00:36:32.360 --> 00:36:36.239
<v Speaker 3>is this technology actually being deployed and who controls that deployment?

765
00:36:36.519 --> 00:36:39.800
<v Speaker 2>Follow the power and remember the meta paradox. Nothing is

766
00:36:39.840 --> 00:36:42.079
<v Speaker 2>ever as simple as good guys versus bad guys.

767
00:36:42.400 --> 00:36:46.280
<v Speaker 3>Never The chaotic strategic middle is where the real story

768
00:36:46.360 --> 00:36:48.079
<v Speaker 3>is almost always hiding, and.

769
00:36:48.000 --> 00:36:50.360
<v Speaker 2>I think that's a perfect place to leave it. We

770
00:36:50.400 --> 00:36:53.079
<v Speaker 2>often think about technology as this linear race to a

771
00:36:53.119 --> 00:36:55.880
<v Speaker 2>finish line, but maybe it's more about the kind of

772
00:36:55.960 --> 00:36:58.639
<v Speaker 2>landscape we're building as we go. Who gets to build

773
00:36:58.639 --> 00:37:01.880
<v Speaker 2>the roads, who owns the land and Are we allowed

774
00:37:01.920 --> 00:37:03.599
<v Speaker 2>to build our own houses if we want to?

775
00:37:03.840 --> 00:37:06.480
<v Speaker 3>That is the fundamental question. What kind of world do

776
00:37:06.559 --> 00:37:08.960
<v Speaker 3>we want this AI to help us build? And who

777
00:37:09.039 --> 00:37:10.760
<v Speaker 3>gets a real say in the blueprints?

778
00:37:10.880 --> 00:37:14.039
<v Speaker 2>Something to mull over. Thank you for listening, Stay curious,

779
00:37:14.079 --> 00:37:16.119
<v Speaker 2>stay skeptical, and we will see you next time.
