WEBVTT

1
00:00:01.199 --> 00:00:06.200
<v Speaker 1>Welcome to the Sentient Code, where intelligence is engineered, autonomy

2
00:00:06.280 --> 00:00:10.439
<v Speaker 1>is emerging, and the line between human and machine grows thinner.

3
00:00:10.800 --> 00:00:15.359
<v Speaker 1>Each episode, we decode the algorithms, explore the robotics, and

4
00:00:15.439 --> 00:00:21.839
<v Speaker 1>examine the ideas shaping the future of artificial minds.

5
00:00:23.839 --> 00:00:25.480
<v Speaker 2>So I want you to just take a second and

6
00:00:25.600 --> 00:00:27.920
<v Speaker 2>look around the room you're in right now, or you

7
00:00:27.960 --> 00:00:29.879
<v Speaker 2>know wherever you happen to be listening to this today. Yeah,

8
00:00:29.920 --> 00:00:32.119
<v Speaker 2>just do a quick scan of your environment, right, and

9
00:00:32.159 --> 00:00:35.000
<v Speaker 2>I want you to try and find a standard, maybe

10
00:00:35.039 --> 00:00:37.280
<v Speaker 2>somewhat dim, incandescent.

11
00:00:36.679 --> 00:00:40.359
<v Speaker 3>Light bulb, like the kind in an old desk lamp, maybe.

12
00:00:40.000 --> 00:00:43.159
<v Speaker 2>Exactly, or even just the little bulb inside your refrigerator.

13
00:00:44.119 --> 00:00:47.640
<v Speaker 2>That specific bulb, it takes about twenty watts of power

14
00:00:47.960 --> 00:00:49.359
<v Speaker 2>just to maintain its glow.

15
00:00:49.280 --> 00:00:51.560
<v Speaker 3>Which is I mean, it's barely enough light to read.

16
00:00:51.439 --> 00:00:54.000
<v Speaker 2>A book by, right, it's practically nothing. So I want

17
00:00:54.000 --> 00:00:57.439
<v Speaker 2>you to hold on to that image, that tiny twenty

18
00:00:57.439 --> 00:00:58.920
<v Speaker 2>watt energy budget.

19
00:00:58.719 --> 00:01:01.200
<v Speaker 3>Because we're going to use that as the baseline

20
00:01:01.280 --> 00:01:02.640
<v Speaker 3>for something pretty.

21
00:01:02.320 --> 00:01:05.920
<v Speaker 2>Wild. Completely mind bending, actually. Yeah, because that exact same

22
00:01:05.920 --> 00:01:08.959
<v Speaker 2>amount of energy, those twenty watts, is what your biological

23
00:01:09.040 --> 00:01:12.760
<v Speaker 2>brain is using right this very second to do, well,

24
00:01:13.000 --> 00:01:13.760
<v Speaker 2>literally everything.

25
00:01:13.920 --> 00:01:17.079
<v Speaker 3>It really forces a complete recalibration of how we view

26
00:01:17.120 --> 00:01:18.599
<v Speaker 3>our own biology.

27
00:01:18.040 --> 00:01:20.519
<v Speaker 2>Doesn't it? It totally does. We just walk

28
00:01:20.400 --> 00:01:24.280
<v Speaker 3>Around completely oblivious to the sheer, I mean, the thermodynamic

29
00:01:24.359 --> 00:01:26.519
<v Speaker 3>miracle happening inside our skulls every day.

30
00:01:26.799 --> 00:01:29.680
<v Speaker 2>Yeah, think about what you, as the listener, are doing

31
00:01:29.719 --> 00:01:33.280
<v Speaker 2>on just twenty watts. Right now, you are processing the

32
00:01:33.319 --> 00:01:35.120
<v Speaker 2>auditory signals of my voice.

33
00:01:35.239 --> 00:01:38.920
<v Speaker 3>You're converting acoustic waves into electrical impulses.

34
00:01:38.319 --> 00:01:41.560
<v Speaker 2>Exactly, and then you're parsing individual words out of that

35
00:01:41.719 --> 00:01:43.319
<v Speaker 2>continuous stream of sound.

36
00:01:43.519 --> 00:01:45.879
<v Speaker 3>And if you happen to be, say, walking down the

37
00:01:45.920 --> 00:01:47.879
<v Speaker 3>street while listening, it's even crazier.

38
00:01:47.959 --> 00:01:52.239
<v Speaker 2>Oh yeah, you're instantly recognizing familiar faces. You're navigating this

39
00:01:52.359 --> 00:01:55.719
<v Speaker 2>complex three D environment, keeping your balance against.

40
00:01:55.359 --> 00:01:58.040
<v Speaker 3>Gravity, regulating your core body temperature.

41
00:01:58.480 --> 00:02:01.400
<v Speaker 2>Right, and you can learn a completely new abstract

42
00:02:01.439 --> 00:02:05.040
<v Speaker 2>concept from a single example and instantly integrate it with

43
00:02:05.319 --> 00:02:06.640
<v Speaker 2>decades of stored memories.

44
00:02:06.760 --> 00:02:10.360
<v Speaker 3>All of that high level computational work is happening simultaneously.

45
00:02:10.319 --> 00:02:13.520
<v Speaker 2>And while you're doing all of that, that identical twenty watt

46
00:02:13.520 --> 00:02:20.120
<v Speaker 2>budget is sustaining the profound, completely unresolved mystery of human consciousness itself.

47
00:02:20.159 --> 00:02:22.319
<v Speaker 2>It's just staggering. All for the energy it takes to

48
00:02:22.360 --> 00:02:23.439
<v Speaker 2>dimly light a closet.

49
00:02:23.639 --> 00:02:26.360
<v Speaker 3>And you know the contrast becomes almost absurd when we

50
00:02:26.400 --> 00:02:28.439
<v Speaker 3>look at our current technological landscape.

51
00:02:28.479 --> 00:02:29.919
<v Speaker 2>Oh absolutely, let's talk about that.

52
00:02:30.240 --> 00:02:33.919
<v Speaker 3>Like, consider the state of the art artificial intelligence models today.

53
00:02:34.639 --> 00:02:39.000
<v Speaker 3>The systems that are performing complex pattern recognition or generating

54
00:02:39.039 --> 00:02:41.039
<v Speaker 3>photorealistic images.

55
00:02:40.800 --> 00:02:42.800
<v Speaker 2>Or simulating human language, right.

56
00:02:42.639 --> 00:02:46.840
<v Speaker 3>To achieve even a tiny fraction of human cognitive capabilities,

57
00:02:47.319 --> 00:02:52.919
<v Speaker 3>and usually only within incredibly narrow domains. Current AI requires

58
00:02:53.039 --> 00:02:55.639
<v Speaker 3>physical infrastructure on a planetary scale.

59
00:02:55.680 --> 00:02:58.280
<v Speaker 2>We are definitely not talking about twenty watts anymore.

60
00:02:58.520 --> 00:03:02.439
<v Speaker 3>No, not at all. We're talking about sprawling, warehouse sized

61
00:03:02.520 --> 00:03:06.759
<v Speaker 3>data centers, just packed floor to ceiling with specialized hardware.

62
00:03:06.840 --> 00:03:10.520
<v Speaker 2>I mean, these facilities need dedicated industrial cooling towers just

63
00:03:10.599 --> 00:03:12.680
<v Speaker 2>to prevent the silicon from literally melting.

64
00:03:12.840 --> 00:03:15.560
<v Speaker 3>Exactly. They consume millions of watts.

65
00:03:15.319 --> 00:03:18.039
<v Speaker 2>Millions of watts for a single model. It's basically the

66
00:03:18.159 --> 00:03:20.759
<v Speaker 2>energy equivalent of a small dedicated power plant.

67
00:03:20.919 --> 00:03:24.000
<v Speaker 3>Just to run a specialized software program that can write

68
00:03:24.000 --> 00:03:26.919
<v Speaker 3>a passable email or identify a stop sign in a photograph.

69
00:03:27.120 --> 00:03:30.479
<v Speaker 2>So the ratio of computational capability to energy consumption in

70
00:03:30.520 --> 00:03:33.159
<v Speaker 2>the human brain isn't just like a little bit better

71
00:03:33.159 --> 00:03:33.479
<v Speaker 2>than our.

72
00:03:33.439 --> 00:03:36.759
<v Speaker 3>AI. No, it is superior by multiple orders of magnitude.
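
NOTE Editor's sketch, not part of the spoken audio: a rough check of the power gap described here.
It uses the 20 watt brain figure quoted in the episode and, as an assumption, 1,000,000 watts
as the low end of "millions of watts". All names are illustrative. It compares raw power draw only,
not capability per watt, so it is a floor on the gap the speakers describe rather than a measurement.
```python
import math
BRAIN_WATTS = 20.0  # figure quoted in the episode
DATACENTER_WATTS = 1_000_000.0  # assumption: low end of "millions of watts"
def orders_of_magnitude(larger, smaller):
    """Return how many powers of ten separate two positive quantities."""
    return math.log10(larger / smaller)
ratio = DATACENTER_WATTS / BRAIN_WATTS
gap = orders_of_magnitude(DATACENTER_WATTS, BRAIN_WATTS)
print(f"power ratio: {ratio:,.0f}x, about {gap:.1f} orders of magnitude")
```
Raising DATACENTER_WATTS to model a larger facility widens the gap by one order per factor of ten.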

73
00:03:36.919 --> 00:03:39.759
<v Speaker 2>And the most critical realization here is that this isn't

74
00:03:39.800 --> 00:03:42.319
<v Speaker 2>some temporary engineering hurdle, is it?

75
00:03:42.400 --> 00:03:45.039
<v Speaker 3>Not at all. This isn't a problem where we can simply

76
00:03:45.319 --> 00:03:47.919
<v Speaker 3>wait for the next generation of silicon microchips and just

77
00:03:47.960 --> 00:03:50.719
<v Speaker 3>assume the efficiency gap will organically close.

78
00:03:50.560 --> 00:03:54.039
<v Speaker 2>Because the foundational architecture of how we build computers is

79
00:03:54.360 --> 00:03:58.599
<v Speaker 2>fundamentally at odds with how biological brains operate. Physically at odds. Yeah,

80
00:03:58.759 --> 00:04:02.599
<v Speaker 2>so we have this massive multi order of magnitude disparity

81
00:04:02.599 --> 00:04:07.080
<v Speaker 2>in efficiency, and the traditional roadmap of computer science is

82
00:04:07.199 --> 00:04:09.400
<v Speaker 2>essentially a dead end for solving it.

83
00:04:09.520 --> 00:04:12.039
<v Speaker 3>Which brings us to the terrain we're exploring today.

84
00:04:12.120 --> 00:04:16.399
<v Speaker 2>Yes, we are looking at a radical, incredibly ambitious field

85
00:04:16.399 --> 00:04:18.839
<v Speaker 2>of engineering called neuromorphic computing.

86
00:04:19.480 --> 00:04:22.519
<v Speaker 3>The name itself derives from the Greek roots for nerve

87
00:04:22.680 --> 00:04:23.319
<v Speaker 3>and form.

88
00:04:23.600 --> 00:04:25.480
<v Speaker 2>I love that. And just to be clear, this is

89
00:04:25.519 --> 00:04:28.079
<v Speaker 2>not about writing clever software code to run on the

90
00:04:28.120 --> 00:04:30.639
<v Speaker 2>exact same hardware we've been using since the nineteen eighties.

91
00:04:30.720 --> 00:04:34.920
<v Speaker 3>No, this is the engineering quest to build actual physical hardware,

92
00:04:35.000 --> 00:04:39.279
<v Speaker 3>new types of silicon chips that fundamentally think, physically operate,

93
00:04:39.639 --> 00:04:43.079
<v Speaker 3>and are structurally organized exactly like a biological brain.

94
00:04:43.439 --> 00:04:47.079
<v Speaker 2>It requires a complete paradigm shift. It really does, because

95
00:04:47.120 --> 00:04:51.480
<v Speaker 2>the vast majority of artificial intelligence today operates by simulating

96
00:04:51.519 --> 00:04:52.199
<v Speaker 2>a neural.

97
00:04:51.959 --> 00:04:57.319
<v Speaker 3>Network in software, but then executing that simulation on conventional,

98
00:04:57.839 --> 00:04:58.959
<v Speaker 3>non neural hardware.

99
00:04:59.079 --> 00:05:01.079
<v Speaker 2>Right, it's just a simulation running on a normal

100
00:05:00.839 --> 00:05:06.439
<v Speaker 3>computer. Exactly, but neuromorphic engineering abandons the simulation. The goal

101
00:05:06.600 --> 00:05:10.519
<v Speaker 3>is to instantiate the physical operating principles of biological neural

102
00:05:10.560 --> 00:05:14.399
<v Speaker 3>computation directly into the atomic structure of the machine itself.

103
00:05:14.560 --> 00:05:17.439
<v Speaker 2>Okay, So to understand why we need to completely reinvent

104
00:05:17.519 --> 00:05:20.439
<v Speaker 2>the physical computer from the ground up, we first need

105
00:05:20.439 --> 00:05:23.240
<v Speaker 2>to talk about the fatal flaw baked into the computers you.

106
00:05:23.199 --> 00:05:25.920
<v Speaker 3>And I are already using. Right, the von Neumann architecture.

107
00:05:26.000 --> 00:05:28.160
<v Speaker 2>Yeah, if you're listening to this on a smartphone, that

108
00:05:28.279 --> 00:05:31.839
<v Speaker 2>incredibly advanced device is constrained by a design choice made

109
00:05:31.839 --> 00:05:33.000
<v Speaker 2>back in the nineteen forties.

110
00:05:33.079 --> 00:05:34.879
<v Speaker 3>We have to go back to the mathematician John von

111
00:05:34.959 --> 00:05:35.879
<v Speaker 3>Neumann.

112
00:05:35.639 --> 00:05:37.839
<v Speaker 2>Set the stage for us. What was happening in the forties?

113
00:05:37.920 --> 00:05:41.959
<v Speaker 3>Well, during the era of those massive room sized vacuum

114
00:05:42.000 --> 00:05:46.519
<v Speaker 3>tube computers like the ENIAC, von Neumann formalized a blueprint

115
00:05:46.560 --> 00:05:48.439
<v Speaker 3>for computer architecture.

116
00:05:48.160 --> 00:05:52.120
<v Speaker 2>And that blueprint became the standard for virtually every single

117
00:05:52.120 --> 00:05:54.040
<v Speaker 2>digital device we manufacture today.

118
00:05:53.920 --> 00:05:57.360
<v Speaker 3>Right, exactly, and the defining characteristic of this von Neumann

119
00:05:57.480 --> 00:06:00.959
<v Speaker 3>architecture is a strict physical separation of duties.

120
00:06:01.040 --> 00:06:02.120
<v Speaker 2>Okay, break that down for me.

121
00:06:02.199 --> 00:06:06.040
<v Speaker 3>You have the processor which performs the actual mathematical computation

122
00:06:06.120 --> 00:06:09.480
<v Speaker 3>and logic, and then physically separated from it, you have

123
00:06:09.560 --> 00:06:12.959
<v Speaker 3>the memory unit which stores the data and the instructions.

124
00:06:13.079 --> 00:06:16.079
<v Speaker 2>So they basically live in two completely different neighborhoods on the

125
00:06:16.079 --> 00:06:19.839
<v Speaker 3>motherboard. They do, and they're connected by a communication pathway

126
00:06:19.879 --> 00:06:20.720
<v Speaker 3>known as a bus.

127
00:06:20.800 --> 00:06:22.040
<v Speaker 2>A bus, got it.

128
00:06:21.920 --> 00:06:26.040
<v Speaker 3>Because they're physically separated, every single time the processor needs

129
00:06:26.040 --> 00:06:29.639
<v Speaker 3>to execute a task, whether that's adding two numbers or

130
00:06:29.720 --> 00:06:31.920
<v Speaker 3>changing the color of a pixel on your screen, it

131
00:06:32.000 --> 00:06:32.879
<v Speaker 3>can't do it alone.

132
00:06:33.040 --> 00:06:34.680
<v Speaker 2>It has to ask for the data, right.

133
00:06:34.800 --> 00:06:37.600
<v Speaker 3>It must send an electrical request across the bus to

134
00:06:37.639 --> 00:06:39.839
<v Speaker 3>the memory. It has to wait for the data to

135
00:06:39.879 --> 00:06:43.079
<v Speaker 3>be retrieved. The data is pushed back across the bus

136
00:06:43.079 --> 00:06:46.720
<v Speaker 3>to the processor, the operation is performed, and then the

137
00:06:46.800 --> 00:06:48.959
<v Speaker 3>result usually has to be sent back across the bus

138
00:06:48.959 --> 00:06:50.199
<v Speaker 3>to be stored in memory again.

139
00:06:50.480 --> 00:06:55.879
<v Speaker 2>Wow, if you really visualize that, it's just endless shuttling back

140
00:06:55.680 --> 00:06:57.199
<v Speaker 3>and forth. Constant shuttling.

141
00:06:57.360 --> 00:07:01.519
<v Speaker 2>Let me try an analogy here. Imagine a chef working

142
00:07:01.560 --> 00:07:05.120
<v Speaker 2>in a high end restaurant kitchen. But there's a massive

143
00:07:05.120 --> 00:07:09.120
<v Speaker 2>design flaw. The pantry isn't in the kitchen.

144
00:07:09.279 --> 00:07:10.199
<v Speaker 3>Okay, where is it.

145
00:07:10.439 --> 00:07:12.279
<v Speaker 2>The pantry is three blocks down the street.

146
00:07:12.480 --> 00:07:14.279
<v Speaker 3>Oh, that sounds awful, right.

147
00:07:14.560 --> 00:07:17.360
<v Speaker 2>So the chef is standing at the stove, water is boiling,

148
00:07:17.959 --> 00:07:20.240
<v Speaker 2>and they realize they need a pinch of salt, so

149
00:07:20.279 --> 00:07:22.079
<v Speaker 2>they have to leave the kitchen. They have to stop

150
00:07:22.079 --> 00:07:24.839
<v Speaker 2>what they're doing, run out the door, sprint three blocks

151
00:07:24.879 --> 00:07:27.600
<v Speaker 2>down the street to the pantry, grab the pinch of salt,

152
00:07:27.759 --> 00:07:30.120
<v Speaker 2>run all the way back to the kitchen and sprinkle

153
00:07:30.160 --> 00:07:30.759
<v Speaker 2>it in the pot.

154
00:07:31.160 --> 00:07:32.920
<v Speaker 3>And then they probably need something else.

155
00:07:32.800 --> 00:07:35.160
<v Speaker 2>Exactly. Then they realize they need a chopped onion.

156
00:07:35.720 --> 00:07:38.399
<v Speaker 2>Boom, out the door again, three blocks down the street,

157
00:07:38.800 --> 00:07:43.399
<v Speaker 2>grab the onion, sprint back over and over millions of

158
00:07:43.439 --> 00:07:45.240
<v Speaker 2>times a day, so by the end of the night.

159
00:07:45.439 --> 00:07:47.639
<v Speaker 2>By the end of the night, the chef isn't tired

160
00:07:47.639 --> 00:07:50.079
<v Speaker 2>from the actual act of cooking. The chef is completely

161
00:07:50.120 --> 00:07:51.240
<v Speaker 2>exhausted from commuting.

162
00:07:51.399 --> 00:07:53.360
<v Speaker 3>That is a perfect way to picture it, and the

163
00:07:53.399 --> 00:07:56.879
<v Speaker 3>physical consequence of that commute is the crux of the problem.

164
00:07:57.079 --> 00:07:58.879
<v Speaker 2>The von Neumann bottleneck. Exactly.

165
00:07:59.399 --> 00:08:02.600
<v Speaker 3>Historically, in computer science it was viewed as a speed limit.

166
00:08:03.120 --> 00:08:06.560
<v Speaker 3>The processor is incredibly fast, but it spends most of

167
00:08:06.560 --> 00:08:09.720
<v Speaker 3>its time sitting idle waiting for data to travel back

168
00:08:09.800 --> 00:08:10.800
<v Speaker 3>and forth over the bus.

169
00:08:10.959 --> 00:08:13.560
<v Speaker 2>But nowadays it's not just about speed, is it? No.

170
00:08:14.000 --> 00:08:17.639
<v Speaker 3>In the context of modern artificial intelligence, the bottleneck is

171
00:08:17.680 --> 00:08:19.920
<v Speaker 3>a massive energy crisis.

172
00:08:19.560 --> 00:08:22.959
<v Speaker 2>Because pushing electrical signals back and forth across a physical

173
00:08:23.000 --> 00:08:27.959
<v Speaker 2>copper wire billions of times a second that generates heat.

174
00:08:27.920 --> 00:08:30.800
<v Speaker 3>Massive amounts of heat, due to the physical capacitance of the

175
00:08:30.759 --> 00:08:34.240
<v Speaker 2>wire. It actually takes physical energy to push electrons down

176
00:08:34.279 --> 00:08:34.679
<v Speaker 2>a wire.

177
00:08:34.919 --> 00:08:38.360
<v Speaker 3>It takes enormous energy. When we run complex AI neural

178
00:08:38.399 --> 00:08:42.519
<v Speaker 3>networks on conventional hardware, we're shuffling massive matrices of data

179
00:08:42.639 --> 00:08:45.320
<v Speaker 3>millions of parameters back and forth continuously.

180
00:08:45.519 --> 00:08:48.440
<v Speaker 2>So the commute is basically bankrupting our energy budget.

181
00:08:48.480 --> 00:08:52.279
<v Speaker 3>Precisely, the sheer thermodynamic cost of moving the data is

182
00:08:52.399 --> 00:08:55.679
<v Speaker 3>astronomically higher than the energy cost of the actual mathematical

183
00:08:55.720 --> 00:08:57.039
<v Speaker 3>operations being performed.
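
NOTE Editor's sketch, not part of the spoken audio: a toy model of the fetch, compute, store cycle
described above. The Machine class and the two cost constants are hypothetical; the costs are in
arbitrary units chosen only to show the shape of the argument, not measured values for any real chip.
```python
from dataclasses import dataclass
MOVE_COST = 100.0  # hypothetical energy per bus transfer, arbitrary units
OP_COST = 1.0  # hypothetical energy per arithmetic operation, arbitrary units
@dataclass
class Machine:
    """Processor that can only reach its data by crossing the bus."""
    bus_transfers: int = 0
    alu_ops: int = 0
    def load(self, memory, addr):
        self.bus_transfers += 1  # request goes out, data comes back
        return memory[addr]
    def store(self, memory, addr, value):
        self.bus_transfers += 1  # result travels back to memory
        memory[addr] = value
    def add(self, memory, a, b, out):
        x = self.load(memory, a)
        y = self.load(memory, b)
        self.alu_ops += 1  # the only step that is actual computation
        self.store(memory, out, x + y)
memory = {"a": 2, "b": 3, "sum": 0}
machine = Machine()
machine.add(memory, "a", "b", "sum")
move_energy = machine.bus_transfers * MOVE_COST
op_energy = machine.alu_ops * OP_COST
print(f"transfers={machine.bus_transfers} ops={machine.alu_ops}")
print(f"energy spent moving={move_energy} computing={op_energy}")
```
Under these assumed costs, one addition needs three trips across the bus, so the commute dominates.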

184
00:08:57.200 --> 00:08:59.720
<v Speaker 2>We are burning our energy budget on the commute, not

185
00:08:59.759 --> 00:09:01.080
<v Speaker 2>the computation.

186
00:09:00.840 --> 00:09:03.159
<v Speaker 3>Which naturally leads us to ask a pretty important question.

187
00:09:03.600 --> 00:09:06.759
<v Speaker 2>Right, If that's how our current devices arrange their kitchen,

188
00:09:07.320 --> 00:09:11.080
<v Speaker 2>and it's clearly a disaster for energy efficiency, how does

189
00:09:11.080 --> 00:09:12.879
<v Speaker 2>the human brain arrange its kitchen?

190
00:09:13.000 --> 00:09:15.120
<v Speaker 3>How is your brain managing to do all of this

191
00:09:15.320 --> 00:09:19.559
<v Speaker 3>high level processing on just twenty watts without completely melting down?

192
00:09:19.639 --> 00:09:21.080
<v Speaker 2>Yeah? What's the biological solution?

193
00:09:21.440 --> 00:09:24.759
<v Speaker 3>The biological solution entirely eliminates the commute.

194
00:09:24.879 --> 00:09:25.919
<v Speaker 2>Wait, entirely?

195
00:09:26.080 --> 00:09:29.720
<v Speaker 3>Yes. In the human brain, computation and memory are colocated.

196
00:09:29.919 --> 00:09:32.320
<v Speaker 3>They occupy the exact same physical space.

197
00:09:32.440 --> 00:09:35.000
<v Speaker 2>Okay, I need to understand how that actually works physically.

198
00:09:35.240 --> 00:09:38.639
<v Speaker 3>Well, the brain contains roughly eighty six billion neurons, and

199
00:09:38.679 --> 00:09:43.120
<v Speaker 3>they communicate with each other through microscopic junctions called synapses.

200
00:09:42.799 --> 00:09:45.039
<v Speaker 2>And we have what trillions of those.

201
00:09:44.919 --> 00:09:48.200
<v Speaker 3>Trillions of synaptic connections. Yeah. In the biological paradigm, the

202
00:09:48.200 --> 00:09:51.360
<v Speaker 3>synapse operates simultaneously as the processor and the hard drive.

203
00:09:51.559 --> 00:09:53.840
<v Speaker 2>So the wiring itself is the memory.

204
00:09:53.799 --> 00:09:57.120
<v Speaker 3>The connection itself is the physical medium through which the

205
00:09:57.159 --> 00:10:00.840
<v Speaker 3>computation occurs, and it is also the physical substrate where

206
00:10:00.919 --> 00:10:05.120
<v Speaker 3>learned information is stored. That is wild. When you learn

207
00:10:05.200 --> 00:10:08.799
<v Speaker 3>something new, say you're practicing a new language or memorizing

208
00:10:08.840 --> 00:10:12.240
<v Speaker 3>the layout of a new neighborhood. The physical structure of

209
00:10:12.279 --> 00:10:14.440
<v Speaker 3>your brain actually changes.

210
00:10:14.440 --> 00:10:16.240
<v Speaker 2>Like, it physically alters its shape.

211
00:10:16.519 --> 00:10:19.720
<v Speaker 3>The synaptic weight, which determines the strength of the electrical

212
00:10:19.799 --> 00:10:23.320
<v Speaker 3>connection between specific neurons, physically alters.

213
00:10:23.559 --> 00:10:26.559
<v Speaker 2>See if you open up a traditional computer, the memory

214
00:10:26.679 --> 00:10:30.120
<v Speaker 2>is just a file saved in a discrete sector of

215
00:10:30.120 --> 00:10:33.720
<v Speaker 2>a silicon chip. But in the brain, the memory is

216
00:10:33.759 --> 00:10:38.240
<v Speaker 2>the physical shape, density, and chemical strength of the literal wiring.

217
00:10:38.480 --> 00:10:42.600
<v Speaker 3>Computation and memory change together. They are intrinsically linked in

218
00:10:42.600 --> 00:10:43.720
<v Speaker 3>the same physical space.

219
00:10:43.840 --> 00:10:45.840
<v Speaker 2>So when I see something, what happens?

220
00:10:45.759 --> 00:10:48.720
<v Speaker 3>When an electrical signal passes through a network of neurons

221
00:10:48.759 --> 00:10:51.799
<v Speaker 3>in your visual cortex, it doesn't have to pause and

222
00:10:51.840 --> 00:10:54.759
<v Speaker 3>fetch the memory of how to process a straight line

223
00:10:54.799 --> 00:10:56.200
<v Speaker 3>from a different lobe of the brain.

224
00:10:56.440 --> 00:10:58.200
<v Speaker 2>The instructions are just there.

225
00:10:58.360 --> 00:11:01.879
<v Speaker 3>The processing instructions are built directly into the physical pathway

226
00:11:01.919 --> 00:11:04.840
<v Speaker 3>the signal is currently traveling through. There is no shuttling

227
00:11:04.879 --> 00:11:05.240
<v Speaker 3>of data.

228
00:11:05.320 --> 00:11:07.000
<v Speaker 2>There's no bus, no bus at all.

229
00:11:07.279 --> 00:11:09.639
<v Speaker 3>To use your analogy, the kitchen and the pantry are

230
00:11:09.639 --> 00:11:12.240
<v Speaker 3>perfectly integrated at a microscopic level.

231
00:11:12.159 --> 00:11:17.000
<v Speaker 2>And replicating this exact physical colocation in silicon hardware is

232
00:11:17.039 --> 00:11:20.360
<v Speaker 2>the foundational premise of neuromorphic engineering. Exactly.

233
00:11:20.519 --> 00:11:21.399
<v Speaker 3>That's the core of it.

234
00:11:21.600 --> 00:11:23.919
<v Speaker 2>But you know, the physical layout of the von Neumann

235
00:11:23.919 --> 00:11:26.639
<v Speaker 2>bottleneck is really only half the problem, isn't it.

236
00:11:27.039 --> 00:11:27.679
<v Speaker 3>That's true.

237
00:11:27.799 --> 00:11:30.480
<v Speaker 2>It's not just that traditional computers separate the kitchen and

238
00:11:30.519 --> 00:11:34.559
<v Speaker 2>the pantry, it's the very language they use to communicate internally,

239
00:11:35.200 --> 00:11:38.559
<v Speaker 2>the fundamental way we treat the materials we build computers

240
00:11:38.559 --> 00:11:38.840
<v Speaker 2>out of.

241
00:11:39.000 --> 00:11:40.960
<v Speaker 3>Right, we need to look back at the origins of

242
00:11:41.120 --> 00:11:44.080
<v Speaker 3>neuromorphic hardware to really understand this, which

243
00:11:43.960 --> 00:11:46.480
<v Speaker 2>takes us to Carver Mead, right, at Caltech.

244
00:11:46.679 --> 00:11:50.519
<v Speaker 3>Yes, Carver Mead at the California Institute of Technology. In

245
00:11:50.559 --> 00:11:54.240
<v Speaker 3>a landmark nineteen ninety paper, Mead, who was already a

246
00:11:54.320 --> 00:11:59.279
<v Speaker 3>highly respected pioneer in integrated circuit design, actually coined the

247
00:11:59.360 --> 00:12:00.320
<v Speaker 3>term neuromorphic.

248
00:12:00.559 --> 00:12:02.279
<v Speaker 2>Okay, what was his big breakthrough?

249
00:12:02.519 --> 00:12:06.399
<v Speaker 3>His most profound realization wasn't just about moving memory closer

250
00:12:06.440 --> 00:12:09.080
<v Speaker 3>to the processor. It was about the fundamental physics of

251
00:12:09.120 --> 00:12:10.480
<v Speaker 3>how we utilize silicon.

252
00:12:10.799 --> 00:12:14.120
<v Speaker 2>Because right now the global tech industry uses silicon to

253
00:12:14.159 --> 00:12:17.919
<v Speaker 2>mass produce transistors, and we treat those transistors in a

254
00:12:18.039 --> 00:12:20.080
<v Speaker 2>highly rigid binary way.

255
00:12:20.279 --> 00:12:22.720
<v Speaker 3>Yes, we treat them like tiny light switches.

256
00:12:22.840 --> 00:12:26.200
<v Speaker 2>They're forced to be either entirely on, representing a one,

257
00:12:26.519 --> 00:12:28.440
<v Speaker 2>or entirely off, representing a zero.

258
00:12:28.720 --> 00:12:33.799
<v Speaker 3>Conventional computing is entirely digital and binary. It operates using discrete,

259
00:12:34.159 --> 00:12:36.679
<v Speaker 3>strictly controlled, clocked operations.

260
00:12:36.799 --> 00:12:39.799
<v Speaker 2>Everything is forced into that binary state of one or zero.

261
00:12:40.120 --> 00:12:44.440
<v Speaker 3>Furthermore, millions or billions of times a second. A rigid

262
00:12:44.480 --> 00:12:49.440
<v Speaker 3>internal clock forces the entire system to step forward synchronously.

263
00:12:48.840 --> 00:12:50.360
<v Speaker 2>Just marching to a beat. Exactly.

264
00:12:50.639 --> 00:12:54.480
<v Speaker 3>But Mead, looking closely at biology, recognized that biological brains

265
00:12:54.480 --> 00:12:56.600
<v Speaker 3>do not operate like binary calculators.

266
00:12:56.639 --> 00:12:57.960
<v Speaker 2>They don't use ones and zeros.

267
00:12:58.200 --> 00:13:01.639
<v Speaker 3>No, the brain operates in a fundamentally analog, continuous

268
00:13:01.679 --> 00:13:02.399
<v Speaker 3>time mode.

269
00:13:02.519 --> 00:13:05.279
<v Speaker 2>So Carver Mead looked at these silicon transistors, which we're

270
00:13:05.360 --> 00:13:08.159
<v Speaker 2>using to build these rigid digital calculators, and said, what

271
00:13:08.240 --> 00:13:10.279
<v Speaker 2>if we stop forcing them to be strict on and

272
00:13:10.279 --> 00:13:10.879
<v Speaker 2>off switches.

273
00:13:11.200 --> 00:13:14.519
<v Speaker 3>He proposed operating the transistors in what physicists call the

274
00:13:14.639 --> 00:13:15.799
<v Speaker 3>sub threshold regime.

275
00:13:16.000 --> 00:13:19.360
<v Speaker 2>The sub threshold regime. What does that mean in plain English?

276
00:13:19.639 --> 00:13:22.320
<v Speaker 3>In standard digital logic, a transistor is hit with a

277
00:13:22.399 --> 00:13:26.159
<v Speaker 3>relatively high voltage to snap it cleanly on, allowing current

278
00:13:26.200 --> 00:13:29.720
<v Speaker 3>to flow freely, or the voltage is dropped to snap

279
00:13:29.720 --> 00:13:30.519
<v Speaker 3>it cleanly off.

280
00:13:30.559 --> 00:13:32.240
<v Speaker 2>It's a brute force approach.

281
00:13:32.039 --> 00:13:34.759
<v Speaker 3>It is. But if you apply a very low voltage

282
00:13:34.799 --> 00:13:39.159
<v Speaker 3>below that clean switching threshold, the transistor doesn't just go dead.

283
00:13:39.320 --> 00:13:40.840
<v Speaker 3>Its behavior changes entirely.

284
00:13:41.159 --> 00:13:42.240
<v Speaker 2>It does? What happens?

285
00:13:42.480 --> 00:13:46.799
<v Speaker 3>It responds in a graded, continuous analog way. The tiny

286
00:13:46.879 --> 00:13:50.279
<v Speaker 3>trickle of leakage current that digital engineers usually try to

287
00:13:50.320 --> 00:13:52.600
<v Speaker 3>eliminate, the stuff they see as a bug. Exactly, the

288
00:13:52.639 --> 00:13:56.000
<v Speaker 3>bug becomes the core feature. Mead realized that the analog

289
00:13:56.120 --> 00:14:00.679
<v Speaker 3>physics of this sub threshold silicon, how electrical charge slowly accumulates,

290
00:14:00.759 --> 00:14:03.120
<v Speaker 3>how capacitance builds, how ions

291
00:14:02.759 --> 00:14:04.200
<v Speaker 2>diffuse. It acts like biology.

292
00:14:04.399 --> 00:14:08.159
<v Speaker 3>It could naturally mimic the continuous physical dynamics of biological

293
00:14:08.200 --> 00:14:10.159
<v Speaker 3>cell membranes and ion channels.
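
NOTE Editor's sketch, not part of the spoken audio: a common first-order model of the sub threshold
regime, in which channel current rises exponentially with gate voltage instead of snapping on or off.
The leakage scale I0 and the slope factor are hypothetical illustrative values, and the model ignores
drain voltage effects; only the thermal voltage (about 25.85 mV near 300 K) is a physical quantity.
```python
import math
THERMAL_VOLTAGE = 0.02585  # kT/q in volts near 300 K
SLOPE_FACTOR = 1.5  # hypothetical device-dependent slope factor
I0 = 1e-12  # hypothetical leakage scale in amperes
def subthreshold_current(v_gs):
    """First-order model: current grows exponentially with gate voltage."""
    return I0 * math.exp(v_gs / (SLOPE_FACTOR * THERMAL_VOLTAGE))
# Gate-voltage step that multiplies the current by ten under this model.
decade_step = SLOPE_FACTOR * THERMAL_VOLTAGE * math.log(10)
for v_gs in (0.10, 0.15, 0.20):
    print(f"v_gs={v_gs:.2f} V, current={subthreshold_current(v_gs):.3e} A")
print(f"volts per decade of current: {decade_step:.4f}")
```
The graded response is the point: small changes in voltage give smooth, continuous changes in current.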

294
00:14:10.240 --> 00:14:11.879
<v Speaker 2>So he wanted to use the raw physics of the

295
00:14:11.919 --> 00:14:15.320
<v Speaker 2>material to just do the math organically, rather than forcing

296
00:14:15.320 --> 00:14:18.679
<v Speaker 2>the material to act like a rigid mathematical abacus.

297
00:14:18.759 --> 00:14:21.159
<v Speaker 3>That's a great way to put it. By utilizing the

298
00:14:21.320 --> 00:14:26.120
<v Speaker 3>natural analog physics of the substrate, you bypass the massive

299
00:14:26.159 --> 00:14:30.559
<v Speaker 3>thermodynamic overhead of encoding every single piece of sensory

300
00:14:30.639 --> 00:14:35.600
<v Speaker 3>information into long, complex strings of binary digits.

301
00:14:35.240 --> 00:14:37.200
<v Speaker 2>And shoving them through that serial bottleneck we

302
00:14:37.159 --> 00:14:41.159
<v Speaker 3>talked about. Right, but this analog shift fundamentally changes how

303
00:14:41.279 --> 00:14:44.120
<v Speaker 3>information is transmitted across the network.

304
00:14:44.399 --> 00:14:45.240
<v Speaker 2>How so?

305
00:14:45.360 --> 00:14:50.480
<v Speaker 3>Well, modern artificial neural networks in software exchange precise, continuous numerical

306
00:14:50.559 --> 00:14:53.960
<v Speaker 3>values, floating point numbers like point seven four two three.

307
00:14:54.080 --> 00:14:57.600
<v Speaker 3>Spreadsheets of numbers, basically. Yeah, but biological neurons do

308
00:14:57.679 --> 00:14:59.519
<v Speaker 3>not exchange spreadsheets of numbers.

309
00:14:59.559 --> 00:15:00.480
<v Speaker 2>They speak in spikes.

310
00:15:00.679 --> 00:15:04.159
<v Speaker 3>They communicate through action potentials, commonly referred to as spikes.

311
00:15:04.279 --> 00:15:06.120
<v Speaker 2>What exactly is a spike? Physically?

312
00:15:06.159 --> 00:15:10.559
<v Speaker 3>These are brief, discrete, essentially identical electrical pulses that travel

313
00:15:10.600 --> 00:15:13.039
<v Speaker 3>down the axon of a neuron and trigger a chemical

314
00:15:13.120 --> 00:15:14.919
<v Speaker 3>or electrical response at the synapse.

315
00:15:15.000 --> 00:15:16.960
<v Speaker 2>Okay, I want to make sure we truly grasp this

316
00:15:17.039 --> 00:15:19.480
<v Speaker 2>because it's a massive departure from how we normally think

317
00:15:19.519 --> 00:15:20.080
<v Speaker 2>about data.

318
00:15:20.200 --> 00:15:21.600
<v Speaker 3>It is a huge mental leap.

319
00:15:21.840 --> 00:15:25.279
<v Speaker 2>If the biological brain isn't sending specific numerical values to

320
00:15:25.320 --> 00:15:30.360
<v Speaker 2>convey information, how does a literal zap of electricity actually

321
00:15:30.440 --> 00:15:33.480
<v Speaker 2>carry any complex meaning? I mean, a spike is just

322
00:15:33.519 --> 00:15:35.159
<v Speaker 2>a spike. It's a great question.

323
00:15:35.679 --> 00:15:39.039
<v Speaker 3>In biological neural systems, the information is not encoded in

324
00:15:39.080 --> 00:15:42.600
<v Speaker 3>the size, the amplitude, or the shape of the signal.

325
00:15:42.559 --> 00:15:45.000
<v Speaker 2>Because every spike is basically identical, right?

326
00:15:45.480 --> 00:15:48.879
<v Speaker 3>Instead, the information is entirely encoded in the timing and

327
00:15:48.919 --> 00:15:50.080
<v Speaker 3>the rate of those spikes.

328
00:15:50.159 --> 00:15:50.960
<v Speaker 2>The timing and the rate.

329
00:15:51.080 --> 00:15:54.159
<v Speaker 3>Okay, if a sensory neuron in your eye fires rapidly,

330
00:15:54.360 --> 00:15:58.240
<v Speaker 3>producing a dense, high frequency cluster of spikes, it might

331
00:15:58.279 --> 00:16:01.799
<v Speaker 3>be conveying a stronger stimulus, say a very bright light,

332
00:16:01.919 --> 00:16:03.960
<v Speaker 3>compared to a neuron that fires rarely.

333
00:16:04.200 --> 00:16:06.120
<v Speaker 2>That makes sense, more spikes brighter light.

334
00:16:06.200 --> 00:16:08.879
<v Speaker 3>That is called rate coding. But more importantly, there is

335
00:16:08.960 --> 00:16:09.679
<v Speaker 3>temporal coding.

336
00:16:09.759 --> 00:16:12.120
<v Speaker 2>Temporal coding like the exact microsecond.

337
00:16:12.200 --> 00:16:16.360
<v Speaker 3>Yes, the precise microsecond timing of a single spike relative

338
00:16:16.399 --> 00:16:18.879
<v Speaker 3>to the spikes of other surrounding neurons in the network

339
00:16:18.919 --> 00:16:22.360
<v Speaker 3>can carry incredibly complex, high dimensional information.
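
NOTE Editor's sketch, not part of the spoken audio. A minimal Python illustration of the two codes just described: rate coding (more spikes for a stronger stimulus) and one simple timing-based scheme, latency coding (a stronger stimulus fires earlier). The function names, the 100 ms window, the 200 Hz ceiling, and the evenly spaced spikes are all assumptions chosen for illustration, not values from the episode or from biology.
```python
# Illustrative sketch only: names and constants are assumptions, not from the episode.
def rate_code(intensity, window_ms=100, max_rate_hz=200):
    """Evenly spaced spike times (ms) whose count grows with intensity in [0, 1]."""
    n_spikes = int(round(intensity * max_rate_hz * window_ms / 1000))
    if n_spikes == 0:
        return []
    step = window_ms / n_spikes
    return [round(i * step, 3) for i in range(n_spikes)]
#
def latency_code(intensity, max_delay_ms=20.0):
    """A single spike whose timing carries the value: stronger input fires earlier."""
    return round((1.0 - intensity) * max_delay_ms, 3)
#
dim, bright = 0.1, 0.9
print(len(rate_code(dim)), len(rate_code(bright)))   # spike counts in the window
print(latency_code(dim), latency_code(bright))       # first-spike delays in ms
```
In both schemes every spike is identical; only the count or the timing differs, which is the point the speakers are making.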

340
00:16:22.720 --> 00:16:26.279
<v Speaker 2>Let's use an analogy to really visualize why this timing

341
00:16:26.360 --> 00:16:31.960
<v Speaker 2>based spiking communication is such a monumental advantage for saving power.

342
00:16:32.159 --> 00:16:35.720
<v Speaker 2>I'm all ears. Think about a traditional computer processor like

343
00:16:35.759 --> 00:16:39.799
<v Speaker 2>a massive symphony orchestra, but the conductor is an absolute

344
00:16:39.960 --> 00:16:42.240
<v Speaker 2>tyrant holding a metronome.

345
00:16:41.840 --> 00:16:44.120
<v Speaker 3>A very rigid conductor, very rigid.

346
00:16:44.399 --> 00:16:48.320
<v Speaker 2>The metronome is the computer's internal clock, ticking away billions

347
00:16:48.360 --> 00:16:51.320
<v Speaker 2>of times a second. Every single musician in the orchestra

348
00:16:51.399 --> 00:16:54.240
<v Speaker 2>is forced to play strictly to that beat. Right. Even

349
00:16:54.240 --> 00:16:56.679
<v Speaker 2>if a musician, let's say the triangle player in the back,

350
00:16:57.000 --> 00:16:59.440
<v Speaker 2>has absolutely no notes to play for a full twenty

351
00:16:59.440 --> 00:17:01.919
<v Speaker 2>minutes of this symphony, they cannot relax.

352
00:17:02.000 --> 00:17:03.080
<v Speaker 3>They have to stay engaged.

353
00:17:03.200 --> 00:17:06.960
<v Speaker 2>The conductor forces them to stand at attention, physically tapping

354
00:17:06.960 --> 00:17:10.680
<v Speaker 2>their foot rigidly keeping time on every single beat, exhausting

355
00:17:10.759 --> 00:17:14.160
<v Speaker 2>themselves just to remain perfectly synchronized with the global metronome.

356
00:17:14.200 --> 00:17:17.200
<v Speaker 3>They are burning massive amounts of energy just to actively

357
00:17:17.240 --> 00:17:17.960
<v Speaker 3>do nothing.

358
00:17:18.160 --> 00:17:22.640
<v Speaker 2>Exactly, and the physical reality of conventional digital chips mirrors

359
00:17:22.640 --> 00:17:24.920
<v Speaker 2>that orchestra exactly, doesn't it?

360
00:17:24.559 --> 00:17:28.960
<v Speaker 3>It absolutely does. The global clock cycle forces electrical activity

361
00:17:29.000 --> 00:17:32.480
<v Speaker 3>and power consumption across the entire chip, even

362
00:17:32.279 --> 00:17:35.640
<v Speaker 2>in sectors of the processor that are momentarily idle. Even

363
00:17:35.400 --> 00:17:38.200
<v Speaker 3>when they have no new data to process. The wires

364
00:17:38.200 --> 00:17:41.720
<v Speaker 3>are constantly charging and discharging just to maintain the rhythm.

365
00:17:41.799 --> 00:17:46.319
<v Speaker 2>But a neuromorphic system, a spiking neural network, is fundamentally different.

366
00:17:46.400 --> 00:17:47.839
<v Speaker 3>How would you describe it?

367
00:17:47.839 --> 00:17:51.680
<v Speaker 2>It's like a cool late night jazz ensemble playing completely

368
00:17:51.759 --> 00:17:54.480
<v Speaker 2>without a set tempo. There is no metronome.

369
00:17:54.599 --> 00:17:55.119
<v Speaker 3>I like that.

370
00:17:55.759 --> 00:17:59.960
<v Speaker 2>A musician in this ensemble will sit in total relaxed silence.

371
00:18:00.680 --> 00:18:04.319
<v Speaker 2>They expend absolutely zero energy. They aren't tapping their foot,

372
00:18:04.319 --> 00:18:06.640
<v Speaker 2>they aren't actively keeping time. They just wait. They just

373
00:18:06.759 --> 00:18:10.559
<v Speaker 2>wait at rest until exactly the moment they were organically

374
00:18:10.599 --> 00:18:13.559
<v Speaker 2>inspired to play a single note, a spike. They play

375
00:18:13.559 --> 00:18:16.039
<v Speaker 2>their note, and then they immediately return to resting in

376
00:18:16.119 --> 00:18:16.960
<v Speaker 2>total silence.

377
00:18:17.079 --> 00:18:19.720
<v Speaker 3>And the energy savings of that jazz ensemble approach are

378
00:18:19.759 --> 00:18:23.000
<v Speaker 3>staggering. I bet. Because when those artificial neurons are silent,

379
00:18:23.119 --> 00:18:26.960
<v Speaker 3>the power draw of the chip drops precipitously. This is

380
00:18:27.000 --> 00:18:29.480
<v Speaker 3>the essence of an event driven architecture.

381
00:18:29.759 --> 00:18:32.920
<v Speaker 2>Event driven, so it only reacts when something happens.

382
00:18:33.480 --> 00:18:38.200
<v Speaker 3>Spiking neurons are purely event driven. A neuromorphic chip consumes

383
00:18:38.240 --> 00:18:41.319
<v Speaker 3>practically no power when there is no new data changing

384
00:18:41.319 --> 00:18:42.160
<v Speaker 3>in its environment.

385
00:18:42.319 --> 00:18:43.079
<v Speaker 2>It just sits there.

386
00:18:43.160 --> 00:18:46.519
<v Speaker 3>It sits in a quiescent state. Energy is expended only

387
00:18:46.599 --> 00:18:49.559
<v Speaker 3>when and where information is actually flowing through the network.

388
00:18:49.680 --> 00:18:52.279
<v Speaker 2>This means the overall energy cost to the system scales

389
00:18:52.440 --> 00:18:56.319
<v Speaker 2>entirely with the activity of the network, right? Not the physical

390
00:18:55.960 --> 00:18:58.920
<v Speaker 3>size. Exactly, not the sheer physical size of the network.

391
00:18:59.319 --> 00:19:02.799
<v Speaker 3>You could build a massive chip containing millions of artificial neurons,

392
00:19:03.079 --> 00:19:05.720
<v Speaker 3>but if only one percent of them are actively spiking

393
00:19:05.720 --> 00:19:10.400
<v Speaker 3>at any given microsecond, your power draw remains astonishingly low.
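
NOTE Editor's sketch, not part of the spoken audio. A toy cost model for the claim that energy scales with activity rather than with network size. The two functions and the unit energy costs are hypothetical placeholders; real chips also have static leakage and routing overhead that this ignores.
```python
# Toy model; the per-event energy figures are made-up placeholders, not measured chip data.
def clocked_energy(n_units, n_ticks, energy_per_tick=1.0):
    """Every unit pays on every clock tick, busy or idle."""
    return n_units * n_ticks * energy_per_tick
#
def event_driven_energy(n_units, n_ticks, active_fraction, energy_per_spike=1.0):
    """Only the units that actually spike pay anything."""
    return n_units * n_ticks * active_fraction * energy_per_spike
#
n, ticks = 1_000_000, 1_000
dense = clocked_energy(n, ticks)
sparse = event_driven_energy(n, ticks, active_fraction=0.01)
print(f"event-driven cost is {sparse / dense:.0%} of the clocked cost")
```
With one percent of units active, the event-driven total is one percent of the clocked total under these assumptions, whatever the value of n.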

394
00:19:10.559 --> 00:19:13.440
<v Speaker 2>Which perfectly explains how you get a human brain running

395
00:19:13.480 --> 00:19:14.599
<v Speaker 2>on twenty watts.

396
00:19:14.920 --> 00:19:16.480
<v Speaker 3>It makes perfect sense, now, doesn't it.

397
00:19:16.480 --> 00:19:20.160
<v Speaker 2>It's a massive, dense network of eighty six billion neurons,

398
00:19:20.680 --> 00:19:23.119
<v Speaker 2>but at any given moment, the vast majority of it

399
00:19:23.200 --> 00:19:24.200
<v Speaker 2>is resting.

400
00:19:24.200 --> 00:19:27.920
<v Speaker 3>Just whispering to itself in perfectly timed, sparse spikes.

401
00:19:28.359 --> 00:19:31.599
<v Speaker 2>So the theory is beautiful and biology proves it works.

402
00:19:32.000 --> 00:19:34.960
<v Speaker 2>But I want to shift gears from theory to cold

403
00:19:35.039 --> 00:19:41.640
<v Speaker 2>hard silicon. Are we actually forging these jazz playing analog

404
00:19:41.759 --> 00:19:42.880
<v Speaker 2>brains in the real world?

405
00:19:43.119 --> 00:19:45.880
<v Speaker 3>We are. The engineering community has taken this challenge head on,

406
00:19:46.039 --> 00:19:49.839
<v Speaker 3>and there are several major projects taking distinctly different architectural

407
00:19:49.839 --> 00:19:53.079
<v Speaker 3>approaches to forging spiking neural networks in hardware.

408
00:19:53.400 --> 00:19:55.640
<v Speaker 2>Okay, let's hear about them. Who's leading the charge.

409
00:19:55.720 --> 00:19:58.400
<v Speaker 3>Let's start with one of the most prominent research platforms

410
00:19:58.440 --> 00:20:00.680
<v Speaker 3>developed by one of the largest chip manufacturers on

411
00:20:00.720 --> 00:20:03.440
<v Speaker 3>the planet, Intel's Loihi architecture.

412
00:20:03.599 --> 00:20:07.720
<v Speaker 2>Intel. Really they are the undisputed giant of traditional von

413
00:20:07.799 --> 00:20:10.640
<v Speaker 2>Neumann computing. How did they approach building a brain?

414
00:20:11.119 --> 00:20:13.440
<v Speaker 3>Well, Intel released the first Loihi chip back in

415
00:20:13.480 --> 00:20:16.079
<v Speaker 3>twenty seventeen and followed it up with a more advanced

416
00:20:16.160 --> 00:20:18.440
<v Speaker 3>architecture, Loihi two, in twenty twenty one.

417
00:20:18.559 --> 00:20:19.920
<v Speaker 2>Okay. And the sheer

418
00:20:19.680 --> 00:20:22.680
<v Speaker 3>scale of what they engineered is breathtaking. A single Loihi

419
00:20:22.720 --> 00:20:26.400
<v Speaker 3>two chip contains over a million programmable artificial spiking

420
00:20:26.400 --> 00:20:28.920
<v Speaker 3>neurons and over one hundred million synapses.

421
00:20:29.039 --> 00:20:31.000
<v Speaker 2>A million neurons on one chip.

422
00:20:31.079 --> 00:20:34.200
<v Speaker 3>But the true innovation is how it is structured. It

423
00:20:34.240 --> 00:20:39.000
<v Speaker 3>is organized into a dense mesh of neuromorphic cores, and crucially,

424
00:20:39.359 --> 00:20:43.200
<v Speaker 3>there is no global metronome, no clock. No clock. These

425
00:20:43.240 --> 00:20:47.720
<v Speaker 3>cores communicate entirely through an asynchronous on chip spike routing network.

426
00:20:47.880 --> 00:20:49.160
<v Speaker 2>How does that work without a clock?

427
00:20:49.440 --> 00:20:52.960
<v Speaker 3>When a neuron spikes, it essentially packages that spike like

428
00:20:53.000 --> 00:20:55.640
<v Speaker 3>a piece of mail and routes it through the mesh

429
00:20:55.680 --> 00:20:58.799
<v Speaker 3>to its destination, independent of any global clock.

430
00:20:59.079 --> 00:21:02.240
<v Speaker 2>A million neurons on a single chip, communicating like a

431
00:21:02.240 --> 00:21:05.359
<v Speaker 2>postal system. What does the power consumption actually look like

432
00:21:05.400 --> 00:21:06.039
<v Speaker 2>when you run it?

433
00:21:06.200 --> 00:21:09.839
<v Speaker 3>This is where the event driven philosophy really proves its worth.

434
00:21:10.319 --> 00:21:14.279
<v Speaker 3>When the Loihi chip is actively computing, processing complex inputs,

435
00:21:14.400 --> 00:21:16.839
<v Speaker 3>it consumes power in the milliwatt

436
00:21:16.440 --> 00:21:19.480
<v Speaker 2>range. Milliwatts. And for context, a normal

437
00:21:19.319 --> 00:21:22.079
<v Speaker 3>chip? To put that in perspective, a standard GPU might

438
00:21:22.119 --> 00:21:25.240
<v Speaker 3>consume hundreds of watts. Wow. But it gets better. When

439
00:21:25.279 --> 00:21:27.960
<v Speaker 3>Loihi is at rest, waiting for a sensor input, its

440
00:21:28.039 --> 00:21:30.640
<v Speaker 3>power draw drops down to the microwatt

441
00:21:30.279 --> 00:21:33.559
<v Speaker 2>range. Microwatts, literally a millionth of a watt. You could

442
00:21:33.599 --> 00:21:35.400
<v Speaker 2>run that off a watch battery for years.

443
00:21:35.720 --> 00:21:39.559
<v Speaker 3>It represents a reduction in energy consumption of several orders

444
00:21:39.559 --> 00:21:44.000
<v Speaker 3>of magnitude compared to running the exact same computational workload

445
00:21:44.400 --> 00:21:46.680
<v Speaker 3>on a conventional CPU or GPU.

446
00:21:47.000 --> 00:21:49.680
<v Speaker 2>That's incredible. So what are people actually doing with it?

447
00:21:50.079 --> 00:21:53.720
<v Speaker 3>Intel provides this platform to a global community of researchers,

448
00:21:54.000 --> 00:21:57.880
<v Speaker 3>and they're using it to demonstrate incredible edge applications. Like

449
00:21:57.920 --> 00:22:03.000
<v Speaker 3>what? Things like real time gesture recognition, complex robotic control systems,

450
00:22:03.240 --> 00:22:07.559
<v Speaker 3>and even artificial olfactory sensing, essentially giving machines the ability

451
00:22:07.599 --> 00:22:08.440
<v Speaker 3>to process

452
00:22:08.160 --> 00:22:10.079
<v Speaker 2>chemical smells. The smelling computer.

453
00:22:10.279 --> 00:22:13.960
<v Speaker 3>Yeah, because the architecture relies on spiking networks, it naturally

454
00:22:13.960 --> 00:22:16.880
<v Speaker 3>excels at adaptive learning tasks where the system needs to

455
00:22:16.960 --> 00:22:19.319
<v Speaker 3>learn and adjust to new patterns on the fly, in

456
00:22:19.359 --> 00:22:20.079
<v Speaker 3>the real world.

457
00:22:20.319 --> 00:22:23.559
<v Speaker 2>That's Intel's approach, but IBM has been a major player

458
00:22:23.599 --> 00:22:24.759
<v Speaker 2>in this space as well, right?

459
00:22:24.960 --> 00:22:28.480
<v Speaker 3>Yes, IBM introduced their TrueNorth chip back in twenty fourteen,

460
00:22:28.799 --> 00:22:31.720
<v Speaker 3>and it represents a slightly different philosophy within the neuromorphic

461
00:22:31.759 --> 00:22:36.319
<v Speaker 3>design space. TrueNorth also featured a million artificial neurons,

462
00:22:36.559 --> 00:22:41.000
<v Speaker 3>but paired them with two hundred and fifty six million synapses. However,

463
00:22:41.519 --> 00:22:45.279
<v Speaker 3>TrueNorth was engineered with a highly regular, incredibly rigid,

464
00:22:45.400 --> 00:22:48.000
<v Speaker 3>but extremely low power architecture.

465
00:22:48.119 --> 00:22:49.119
<v Speaker 2>Rigid in what way?

466
00:22:49.200 --> 00:22:54.640
<v Speaker 3>It was specifically optimized for deep spiking neural network inference. Inference?

467
00:22:54.680 --> 00:22:57.119
<v Speaker 2>Okay, when you say inference, you mean it was designed

468
00:22:57.160 --> 00:22:59.200
<v Speaker 2>to take a neural network that has already been fully

469
00:22:59.200 --> 00:23:02.279
<v Speaker 2>trained somewhere else, load it onto the chip and just run

470
00:23:02.319 --> 00:23:04.400
<v Speaker 2>it as efficiently as physically possible.

471
00:23:04.519 --> 00:23:07.960
<v Speaker 3>Exactly. The focus was on execution rather than on chip learning,

472
00:23:08.119 --> 00:23:10.920
<v Speaker 3>Got it. TrueNorth demonstrated that you could perform real

473
00:23:11.039 --> 00:23:16.200
<v Speaker 3>time complex pattern recognition tasks like classifying multiple moving objects

474
00:23:16.200 --> 00:23:19.640
<v Speaker 3>in a live video feed or processing spoken audio streams

475
00:23:19.839 --> 00:23:22.759
<v Speaker 3>at power levels measured in just tens of milliwatts.

476
00:23:22.839 --> 00:23:24.880
<v Speaker 2>Tens of milliwatts for video processing.

477
00:23:25.000 --> 00:23:28.880
<v Speaker 3>Yes, if you attempted to perform that exact same real time,

478
00:23:29.119 --> 00:23:32.799
<v Speaker 3>high frame rate video processing on conventional hardware, you would

479
00:23:32.799 --> 00:23:36.079
<v Speaker 3>easily be burning through hundreds of watts or even kilowatts

480
00:23:36.119 --> 00:23:37.279
<v Speaker 3>for a dense enough network.

481
00:23:37.319 --> 00:23:38.519
<v Speaker 2>That's a massive difference.

482
00:23:38.759 --> 00:23:43.240
<v Speaker 3>TrueNorth proved definitively over a decade ago that neuromorphic

483
00:23:43.319 --> 00:23:47.720
<v Speaker 3>hardware could handle practically useful, commercial grade tasks at a

484
00:23:47.799 --> 00:23:49.599
<v Speaker 3>dramatically lower energy cost.

485
00:23:49.880 --> 00:23:52.200
<v Speaker 2>See, I hear that, but I am struggling to see

486
00:23:52.200 --> 00:23:55.039
<v Speaker 2>how this is financially practical. What do you mean? You

487
00:23:55.079 --> 00:23:57.880
<v Speaker 2>are telling me we need to build entirely new analog

488
00:23:58.319 --> 00:24:02.920
<v Speaker 2>or highly specialized asynchronous chips from scratch. Yes, but the

489
00:24:02.960 --> 00:24:08.759
<v Speaker 2>manufacturing pipelines for standard digital silicon transistors are worth trillions

490
00:24:08.759 --> 00:24:11.559
<v Speaker 2>of dollars globally. Yes, we have perfected the art of

491
00:24:11.599 --> 00:24:14.960
<v Speaker 2>making digital processors. You can't just throw that entire global

492
00:24:14.960 --> 00:24:15.839
<v Speaker 2>infrastructure away.

493
00:24:15.920 --> 00:24:17.160
<v Speaker 3>It's a very valid point.

494
00:24:17.240 --> 00:24:18.920
<v Speaker 2>There has to be a middle ground, right? Yeah. Is

495
00:24:18.920 --> 00:24:21.519
<v Speaker 2>there a way to use the conventional digital chips we already

496
00:24:21.559 --> 00:24:24.599
<v Speaker 2>mass produce, but somehow force them to act like a brain?

497
00:24:24.839 --> 00:24:28.319
<v Speaker 3>That is the exact pragmatic dilemma that birthed a completely

498
00:24:28.319 --> 00:24:33.359
<v Speaker 3>different approach to neuromorphic engineering, the SpiNNaker project. SpiNNaker? Yes,

499
00:24:33.400 --> 00:24:35.920
<v Speaker 3>it was developed at the University of Manchester, led by

500
00:24:35.960 --> 00:24:37.559
<v Speaker 3>a researcher named Steve Furber.

501
00:24:37.839 --> 00:24:41.480
<v Speaker 2>Steve Furber. Why is that name legendary in computer science?

502
00:24:41.880 --> 00:24:44.880
<v Speaker 3>Furber is one of the original principal designers of the

503
00:24:45.160 --> 00:24:47.160
<v Speaker 3>ARM processor architecture.

504
00:24:46.680 --> 00:24:47.640
<v Speaker 2>The chips in our phones.

505
00:24:47.799 --> 00:24:51.559
<v Speaker 3>Exactly, if you are listening to this on a mobile phone, tablet,

506
00:24:51.680 --> 00:24:54.799
<v Speaker 3>or even many modern laptops, you are almost certainly using

507
00:24:54.839 --> 00:24:56.519
<v Speaker 3>an ARM based processor.

508
00:24:56.839 --> 00:24:58.480
<v Speaker 2>Wow. Okay, so what did he do?

509
00:24:58.880 --> 00:25:02.519
<v Speaker 3>Furber and his team took a fascinating hybrid route. Instead

510
00:25:02.559 --> 00:25:07.000
<v Speaker 3>of designing highly experimental, custom analog neuron circuits from scratch,

511
00:25:07.319 --> 00:25:12.240
<v Speaker 3>they took an array of small, standard conventional digital.

512
00:25:12.119 --> 00:25:14.920
<v Speaker 2>ARM processors. Just off the shelf processors? Yes.

513
00:25:15.279 --> 00:25:19.759
<v Speaker 3>But they fundamentally altered how they interact. They interconnected these

514
00:25:19.799 --> 00:25:25.640
<v Speaker 3>standard processors using a completely custom, highly specialized asynchronous spike

515
00:25:25.720 --> 00:25:26.559
<v Speaker 3>routing network.

516
00:25:26.839 --> 00:25:30.160
<v Speaker 2>So the individual brain cells are just traditional digital processors,

517
00:25:30.599 --> 00:25:33.160
<v Speaker 2>but the wiring connecting them forces them to speak the

518
00:25:33.200 --> 00:25:34.480
<v Speaker 2>brain's language of spikes.

519
00:25:34.599 --> 00:25:38.160
<v Speaker 3>You've got it. The individual processors run software models simulating

520
00:25:38.160 --> 00:25:42.160
<v Speaker 3>the biological behavior of neurons, but the network architecture physically

521
00:25:42.160 --> 00:25:44.480
<v Speaker 3>forces them to communicate via discrete spikes.

522
00:25:44.559 --> 00:25:46.440
<v Speaker 2>That's a brilliant compromise. It is.

523
00:25:47.119 --> 00:25:51.640
<v Speaker 3>This hybrid approach provides immense flexible programmability. Because they are

524
00:25:51.799 --> 00:25:55.519
<v Speaker 3>standard processors, you can easily rewrite the software models of

525
00:25:55.559 --> 00:25:59.680
<v Speaker 3>the neurons while still capturing the system level efficiency of

526
00:25:59.720 --> 00:26:01.799
<v Speaker 3>an event driven spiking network.

527
00:26:02.039 --> 00:26:03.160
<v Speaker 2>How big did they scale this?

528
00:26:03.680 --> 00:26:08.039
<v Speaker 3>The scale is astounding. The second generation, SpiNNaker2, which

529
00:26:08.079 --> 00:26:11.759
<v Speaker 3>is currently deployed at the Technical University of Dresden, scales

530
00:26:11.799 --> 00:26:16.960
<v Speaker 3>this concept up to over a million interconnected ARM processor cores.

531
00:26:17.039 --> 00:26:20.079
<v Speaker 2>A supercomputer made of a million cores just to route spikes.

532
00:26:20.160 --> 00:26:23.680
<v Speaker 3>A network capable of routing billions of individual spike events

533
00:26:23.720 --> 00:26:26.359
<v Speaker 3>per second in real time without traffic jams.

534
00:26:26.480 --> 00:26:27.240
<v Speaker 2>That's incredible.

535
00:26:27.400 --> 00:26:30.680
<v Speaker 3>SpiNNaker serves as a vital bridge between computer engineering and

536
00:26:30.759 --> 00:26:35.400
<v Speaker 3>computational neuroscience. It possesses the sheer computational power to run

537
00:26:35.480 --> 00:26:39.400
<v Speaker 3>practical neuromorphic applications, but its primary design goal is to

538
00:26:39.480 --> 00:26:41.759
<v Speaker 3>model massive biological neural

539
00:26:41.519 --> 00:26:44.440
<v Speaker 2>systems. So scientists use it to study the brain? Exactly.

540
00:26:44.720 --> 00:26:48.799
<v Speaker 3>It allows neuroscientists to simulate biologically realistic neural circuits at

541
00:26:48.799 --> 00:26:51.599
<v Speaker 3>a scale that actually approaches the complexity of small regions

542
00:26:51.640 --> 00:26:53.079
<v Speaker 3>of an actual mammalian brain.

543
00:26:53.400 --> 00:26:57.319
<v Speaker 2>That is wild. It's using millions of digital brains to

544
00:26:57.440 --> 00:27:02.039
<v Speaker 2>simulate an analog brain by enforcing a spiking language.

545
00:27:02.079 --> 00:27:03.079
<v Speaker 3>That's one way to do it.

546
00:27:03.240 --> 00:27:06.279
<v Speaker 2>But what if we want to abandon digital completely. What

547
00:27:06.359 --> 00:27:09.440
<v Speaker 2>if a research team wants to go full analog physics,

548
00:27:09.920 --> 00:27:13.160
<v Speaker 2>no digital processors, no software models at all.

549
00:27:13.359 --> 00:27:16.279
<v Speaker 3>For that pure analog vision, we look to the massive

550
00:27:16.359 --> 00:27:20.279
<v Speaker 3>Human Brain Project in Europe, specifically a system called

551
00:27:20.359 --> 00:27:22.759
<v Speaker 3>BrainScaleS, developed at Heidelberg University.

552
00:27:22.839 --> 00:27:24.319
<v Speaker 2>BrainScaleS? BrainScaleS

553
00:27:24.400 --> 00:27:28.799
<v Speaker 3>is a pure analog, mixed signal neuromorphic platform. It does

554
00:27:28.799 --> 00:27:32.240
<v Speaker 3>not use digital code to simulate biological neuron models.

555
00:27:32.319 --> 00:27:33.160
<v Speaker 2>So how does it work?

556
00:27:33.359 --> 00:27:37.160
<v Speaker 3>Instead, it physically implements the differential equations of biological cell

557
00:27:37.200 --> 00:27:41.079
<v Speaker 3>membranes directly into the electrical physics of its custom silicon circuits.
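
NOTE Editor's sketch, not part of the spoken audio. To make "the differential equations of biological cell membranes" concrete, here is a software version of one of the simplest such models, the leaky integrate-and-fire neuron, integrated with Euler steps. This is exactly the kind of numerical simulation the analog hardware avoids by letting circuit physics do the integration. The choice of this particular model, the function name, and every parameter value are assumptions for illustration; the episode does not say which neuron model the chip implements.
```python
# Software illustration of a leaky integrate-and-fire membrane; all parameter values are assumed.
def simulate_lif(current, steps=1000, dt=0.1, tau=10.0, v_rest=-65.0,
                 v_thresh=-50.0, v_reset=-65.0, resistance=10.0):
    """Euler-integrate tau * dV/dt = -(V - v_rest) + R * I and return spike times in ms."""
    v = v_rest
    spike_times = []
    for step in range(steps):
        dv = (-(v - v_rest) + resistance * current) / tau
        v += dv * dt
        if v >= v_thresh:
            # Threshold crossed: record a spike and reset the membrane voltage.
            spike_times.append(round(step * dt, 1))
            v = v_reset
    return spike_times
#
weak = simulate_lif(current=1.0)     # settles below threshold, so it stays silent
strong = simulate_lif(current=3.0)   # crosses threshold repeatedly
print(len(weak), len(strong))
```
The capacitance and resistance mentioned in the dialogue play the role of tau and resistance here; in an analog circuit they are physical component values rather than numbers in a loop.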

558
00:27:41.079 --> 00:27:43.799
<v Speaker 2>So the physical electrical currents flowing through the chip are

559
00:27:43.799 --> 00:27:47.039
<v Speaker 2>literally acting out the biology. The silicon is physically behaving

560
00:27:47.039 --> 00:27:47.720
<v Speaker 2>like a cell wall.

561
00:27:47.880 --> 00:27:50.720
<v Speaker 3>The capacitance and resistance of the physical circuits are tuned

562
00:27:50.759 --> 00:27:53.880
<v Speaker 3>to perfectly mirror the ion channels of a neuron. And

563
00:27:54.079 --> 00:27:57.480
<v Speaker 3>because it relies on pure analog physics happening at the

564
00:27:57.480 --> 00:28:01.160
<v Speaker 3>speed of electronics rather than waiting for software to calculate

565
00:28:01.200 --> 00:28:05.319
<v Speaker 3>mathematical equations, it operates at an astonishing speed. Fast? It

566
00:28:05.359 --> 00:28:07.960
<v Speaker 3>does not run at biological real time. It runs neural

567
00:28:08.039 --> 00:28:11.880
<v Speaker 3>dynamics at up to ten thousand times the speed of biological

568
00:28:11.319 --> 00:28:15.279
<v Speaker 2>Real time, ten thousand times faster than an actual living brain. Yes,

569
00:28:15.640 --> 00:28:17.519
<v Speaker 2>if you're a listener, just pause and think about the

570
00:28:17.559 --> 00:28:20.440
<v Speaker 2>implications of that for a second. If you're a scientist

571
00:28:20.640 --> 00:28:25.240
<v Speaker 2>studying how a specific brain network learns, adapts, and evolves

572
00:28:25.359 --> 00:28:28.680
<v Speaker 2>over say a full year of biological life, you don't

573
00:28:28.680 --> 00:28:30.000
<v Speaker 2>have to sit in a lab and wait a year.

574
00:28:30.079 --> 00:28:30.559
<v Speaker 3>No, you don't.

575
00:28:30.599 --> 00:28:33.559
<v Speaker 2>You can run that entire year of complex physical brain

576
00:28:33.599 --> 00:28:36.720
<v Speaker 2>evolution on the BrainScaleS platform in about an hour.

577
00:28:37.480 --> 00:28:40.720
<v Speaker 2>You're essentially putting physical brain dynamics on fast forward.
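
NOTE Editor's sketch, not part of the spoken audio. A quick arithmetic check of the "a year in about an hour" claim, using the ten-thousand-fold acceleration quoted in the dialogue. The helper name is hypothetical and a 365-day year is assumed.
```python
# Back-of-the-envelope check; the ten-thousand-fold factor is the figure quoted in the episode.
HOURS_PER_YEAR = 365 * 24
#
def wall_clock_hours(biological_hours, speedup):
    """Wall-clock time needed to emulate a span of biological time."""
    return biological_hours / speedup
#
hours = wall_clock_hours(HOURS_PER_YEAR, speedup=10_000)
print(f"{hours:.3f} hours, or about {hours * 60:.0f} minutes")
```
At the full acceleration factor the result is a little under an hour, so the spoken estimate holds; at lower factors the run takes proportionally longer.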

578
00:28:41.079 --> 00:28:43.640
<v Speaker 3>The value proposition for basic science is immense.

579
00:28:43.839 --> 00:28:44.440
<v Speaker 2>It has to be.

580
00:28:44.599 --> 00:28:47.119
<v Speaker 3>It allows researchers to study the long term evolution of

581
00:28:47.160 --> 00:28:51.480
<v Speaker 3>neural network behavior, to test complex plasticity rules over massive

582
00:28:51.519 --> 00:28:54.880
<v Speaker 3>biological time scales, and to observe phenomena that would be

583
00:28:54.960 --> 00:28:58.200
<v Speaker 3>impossible to track in a living organism, all in mere

584
00:28:58.319 --> 00:29:00.200
<v Speaker 3>seconds or minutes of real time.

585
00:29:00.279 --> 00:29:02.880
<v Speaker 2>But you know, all of these chips, whether it's Intel's

586
00:29:02.920 --> 00:29:06.359
<v Speaker 2>digital mesh, or Furber's million ARM cores, or Heidelberg's

587
00:29:06.400 --> 00:29:09.759
<v Speaker 2>accelerated analog physics, they face a massive wall.

588
00:29:09.960 --> 00:29:10.359
<v Speaker 3>They do.

589
00:29:10.559 --> 00:29:13.599
<v Speaker 2>You can build a billion artificial neurons, but a brain

590
00:29:13.680 --> 00:29:15.920
<v Speaker 2>isn't a brain unless it can learn. It has to

591
00:29:16.000 --> 00:29:19.400
<v Speaker 2>adapt to new information. How do you actually program or

592
00:29:19.440 --> 00:29:23.799
<v Speaker 2>teach a billion silent spiking nodes when you don't have

593
00:29:23.880 --> 00:29:25.519
<v Speaker 2>standard code to write.

594
00:29:25.200 --> 00:29:28.960
<v Speaker 3>To fully grasp how a neuromorphic chip actually learns, we

595
00:29:29.039 --> 00:29:32.480
<v Speaker 3>first have to understand why the dominant learning mechanism used

596
00:29:32.480 --> 00:29:36.960
<v Speaker 3>by almost all conventional artificial intelligence today is completely fundamentally

597
00:29:37.000 --> 00:29:38.720
<v Speaker 3>incompatible with biological brains.

598
00:29:38.720 --> 00:29:39.759
<v Speaker 2>Okay, let's unpack that.

599
00:29:39.920 --> 00:29:43.119
<v Speaker 3>Conventional deep learning, the technology behind large language models and

600
00:29:43.160 --> 00:29:47.039
<v Speaker 3>image generators, trains its neural networks using a mathematical algorithm

601
00:29:47.079 --> 00:29:48.160
<v Speaker 3>called backpropagation.

602
00:29:48.640 --> 00:29:53.200
<v Speaker 2>Backpropagation. It is the undisputed engine of the modern AI boom.

603
00:29:53.240 --> 00:29:56.119
<v Speaker 2>It is, how does it actually work? And why couldn't

604
00:29:56.119 --> 00:29:57.759
<v Speaker 2>a biological brain just use it?

605
00:29:58.440 --> 00:30:03.519
<v Speaker 3>Backpropagation is mathematically elegant, but biologically impossible. Let's walk through it.

606
00:30:04.160 --> 00:30:07.000
<v Speaker 3>In a conventional AI network, data is fed forward through

607
00:30:07.079 --> 00:30:10.400
<v Speaker 3>multiple layers of artificial neurons to produce an output. Let's

608
00:30:10.400 --> 00:30:12.480
<v Speaker 3>say it's looking at an image and trying to guess

609
00:30:12.559 --> 00:30:15.720
<v Speaker 3>if it's a handwritten number seven. It makes its guess.

610
00:30:16.000 --> 00:30:19.680
<v Speaker 3>If the guess is wrong, the system calculates the exact

611
00:30:19.720 --> 00:30:23.279
<v Speaker 3>mathematical error of that guess. Right. Backpropagation then takes that

612
00:30:23.480 --> 00:30:28.279
<v Speaker 3>error value and mathematically propagates it backwards through every single

613
00:30:28.359 --> 00:30:32.119
<v Speaker 3>layer of the entire network. It uses calculus to calculate

614
00:30:32.160 --> 00:30:35.839
<v Speaker 3>an exact gradient, a specific adjustment for every single weight

615
00:30:35.920 --> 00:30:38.359
<v Speaker 3>in the network, so that the network will be slightly

616
00:30:38.400 --> 00:30:40.400
<v Speaker 3>more accurate the next time it sees that image.
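
NOTE Editor's sketch, not part of the spoken audio. The forward pass, error, and backward pass just described, reduced to a network with two scalar weights so the chain rule is visible. The function name, learning rate, starting weights, and target are assumptions for illustration. Note that the backward pass reuses h, a value stored during the forward pass, and that both weights are updated together from one global error.
```python
# Minimal illustration with two scalar weights; real networks apply the same chain rule at scale.
def train_step(w1, w2, x, target, lr=0.1):
    # Forward pass: keep the intermediate value h, because the backward pass needs it.
    h = w1 * x
    y = w2 * h
    error = y - target
    loss = 0.5 * error ** 2
    # Backward pass: push the error back through each layer with the chain rule.
    grad_w2 = error * h
    grad_h = error * w2
    grad_w1 = grad_h * x
    # Both weights are updated at once, using the same globally computed error.
    return w1 - lr * grad_w1, w2 - lr * grad_w2, loss
#
w1, w2 = 0.5, 0.5
losses = []
for _ in range(50):
    w1, w2, loss = train_step(w1, w2, x=1.0, target=2.0)
    losses.append(loss)
print(f"first loss {losses[0]:.4f}, last loss {losses[-1]:.6f}")
```
The loss shrinks over the loop because each step nudges every weight against its own share of the error.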

617
00:30:40.440 --> 00:30:42.359
<v Speaker 2>I want to visualize this. It sounds like an incredibly

618
00:30:42.400 --> 00:30:45.160
<v Speaker 2>overbearing micromanager in a massive corporation.

619
00:30:45.400 --> 00:30:46.880
<v Speaker 3>That's a very apt comparison.

620
00:30:47.279 --> 00:30:50.200
<v Speaker 2>Like the company makes a mistake on a product, the

621
00:30:50.200 --> 00:30:53.920
<v Speaker 2>micromanager immediately hits the pause button on the entire factory.

622
00:30:54.440 --> 00:30:57.119
<v Speaker 2>They look at every single one of the million employees.

623
00:30:56.720 --> 00:30:59.759
<v Speaker 3>Simultaneously, and they calculate exactly how much of the blame

624
00:30:59.759 --> 00:31:02.240
<v Speaker 3>belongs to each individual person.

625
00:31:02.079 --> 00:31:06.559
<v Speaker 2>Right, they synchronously update every employee's instructions, and only then

626
00:31:07.000 --> 00:31:09.240
<v Speaker 2>do they unpause the factory to try again.

627
00:31:09.799 --> 00:31:13.440
<v Speaker 3>And the biological brain simply cannot function that way. The

628
00:31:13.480 --> 00:31:16.319
<v Speaker 3>physical requirements of backpropagation are immense.

629
00:31:16.480 --> 00:31:17.559
<v Speaker 2>Why can't the brain do it?

630
00:31:18.039 --> 00:31:22.079
<v Speaker 3>First, it requires a global error signal, your micromanager that

631
00:31:22.160 --> 00:31:24.920
<v Speaker 3>has an omniscient view of the entire network at once,

632
00:31:25.000 --> 00:31:28.359
<v Speaker 3>which we don't have. Right. Second, it requires the system

633
00:31:28.559 --> 00:31:32.000
<v Speaker 3>to temporarily store all the electrical states from the forward

634
00:31:32.039 --> 00:31:35.319
<v Speaker 3>pass in a massive memory bank, so the backward pass

635
00:31:35.359 --> 00:31:36.759
<v Speaker 3>can use them to calculate the.

636
00:31:36.680 --> 00:31:39.480
<v Speaker 2>blame. And we don't have that separate memory bank. Exactly.

637
00:31:39.759 --> 00:31:43.799
<v Speaker 3>Third, it requires perfectly synchronous updates across the entire system.

638
00:31:43.920 --> 00:31:47.359
<v Speaker 3>A biological brain simply does not possess the anatomical hardware

639
00:31:47.359 --> 00:31:47.559
<v Speaker 3>for this.

640
00:31:48.000 --> 00:31:51.160
<v Speaker 2>There's no biological manager neuron that can pause your brain,

641
00:31:51.599 --> 00:31:55.200
<v Speaker 2>calculate a global calculus error, and physically adjust eighty six

642
00:31:55.240 --> 00:31:56.960
<v Speaker 2>billion synapses simultaneously.

643
00:31:57.200 --> 00:31:59.079
<v Speaker 3>No, that would be impossible.

644
00:31:59.119 --> 00:32:02.279
<v Speaker 2>So if the brain can't do global backpropagation, how do

645
00:32:02.319 --> 00:32:04.640
<v Speaker 2>we get smarter? How do we wire new memories?

646
00:32:04.720 --> 00:32:07.599
<v Speaker 3>The brain relies entirely on local learning rules. It learns

647
00:32:07.599 --> 00:32:09.839
<v Speaker 3>from the ground up, not from the top down. Local

648
00:32:09.920 --> 00:32:13.400
<v Speaker 3>learning rules. And the most heavily studied biological mechanism, which

649
00:32:13.440 --> 00:32:16.839
<v Speaker 3>is now being directly implemented in neuromorphic hardware, is called

650
00:32:16.960 --> 00:32:20.480
<v Speaker 3>spike-timing-dependent plasticity, or STDP.

651
00:32:20.880 --> 00:32:23.880
<v Speaker 2>Spike-timing-dependent plasticity. That's a mouthful. Let's break down

652
00:32:23.920 --> 00:32:24.960
<v Speaker 2>the mechanics of it.

653
00:32:24.960 --> 00:32:29.720
<v Speaker 3>It is entirely dependent on highly localized physical information, specifically

654
00:32:29.759 --> 00:32:34.200
<v Speaker 3>the precise timing of spikes between two directly connected neurons.

655
00:32:34.319 --> 00:32:36.400
<v Speaker 2>Okay, let's isolate just two neurons.

656
00:32:36.440 --> 00:32:38.559
<v Speaker 3>We'll call the first one the presynaptic neuron, the one

657
00:32:38.599 --> 00:32:41.039
<v Speaker 3>sending the signal. The second is the postsynaptic neuron,

658
00:32:41.079 --> 00:32:42.240
<v Speaker 3>the one receiving the signal.

659
00:32:42.359 --> 00:32:43.920
<v Speaker 2>Got it, pre and post.

660
00:32:43.839 --> 00:32:47.119
<v Speaker 3>STDP dictates that the physical strength of the synapse connecting

661
00:32:47.160 --> 00:32:50.880
<v Speaker 3>them changes based entirely on the exact relative timing of

662
00:32:50.960 --> 00:32:51.960
<v Speaker 3>when they both spike.

663
00:32:52.440 --> 00:32:53.759
<v Speaker 2>Just the timing, just the timing.

664
00:32:54.079 --> 00:32:57.680
<v Speaker 3>If the presynaptic neuron fires a spike just before the postsynaptic

665
00:32:57.680 --> 00:33:02.799
<v Speaker 3>neuron fires, the physical connection between them strengthens. The

666
00:33:02.839 --> 00:33:06.640
<v Speaker 3>biological logic is that the first neuron likely played a

667
00:33:06.720 --> 00:33:08.400
<v Speaker 3>role in causing the second one to fire.

668
00:33:08.559 --> 00:33:10.720
<v Speaker 2>It's recognizing cause and effect. Exactly.

669
00:33:10.960 --> 00:33:14.960
<v Speaker 3>However, if the timing is reversed, if the postsynaptic

670
00:33:15.000 --> 00:33:17.720
<v Speaker 3>neuron fires first and then the presynaptic neuron fires

671
00:33:17.839 --> 00:33:20.839
<v Speaker 3>slightly later, the connection physically weakens.

672
00:33:21.119 --> 00:33:22.119
<v Speaker 2>Why weaken it?

673
00:33:22.000 --> 00:33:25.200
<v Speaker 3>The logic being that the first neuron clearly didn't contribute to

674
00:33:25.240 --> 00:33:28.599
<v Speaker 3>the second one firing, so their association is meaningless and

675
00:33:28.599 --> 00:33:30.240
<v Speaker 3>should be diminished to save energy.

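NOTE
Editor's sketch, not from the episode: the timing rule just described can be written as a small pair-based STDP update.
The constants A_PLUS, A_MINUS and TAU_MS, and the exponential falloff, are assumptions chosen for illustration, not values the speakers give.
```python
import math
# Hypothetical constants chosen for illustration only.
A_PLUS = 0.05    # largest strengthening for one spike pair
A_MINUS = 0.05   # largest weakening for one spike pair
TAU_MS = 20.0    # how fast the effect fades as the spikes move apart
def stdp_delta(t_pre_ms, t_post_ms):
    """Weight change for one pre/post spike pair."""
    dt = t_post_ms - t_pre_ms
    if dt > 0:   # pre fired first: it may have helped cause post, so strengthen
        return A_PLUS * math.exp(-dt / TAU_MS)
    if dt < 0:   # post fired first: pre did not contribute, so weaken
        return -A_MINUS * math.exp(dt / TAU_MS)
    return 0.0   # simultaneous spikes: no change in this toy rule
def update_weight(weight, t_pre_ms, t_post_ms):
    """Apply the local rule and keep the weight inside [0, 1]."""
    return min(1.0, max(0.0, weight + stdp_delta(t_pre_ms, t_post_ms)))
```
The update uses only the two spike times at one synapse, which is what makes the rule local.
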
676
00:33:30.720 --> 00:33:32.480
<v Speaker 2>I want to frame this as a strict rule of

677
00:33:32.519 --> 00:33:35.319
<v Speaker 2>cause and effect. For you listening, think about the relationship

678
00:33:35.319 --> 00:33:38.240
<v Speaker 2>between lightning and thunder. That's a great real world example.

679
00:33:38.440 --> 00:33:40.799
<v Speaker 3>If you're standing outside and see a brilliant flash of

680
00:33:40.880 --> 00:33:43.759
<v Speaker 3>lightning and it is consistently followed right after by a

681
00:33:43.759 --> 00:33:47.680
<v Speaker 3>massive crash of thunder, your brain notes that specific timing:

682
00:33:48.200 --> 00:33:50.400
<v Speaker 3>lightning first, then thunder. Right.

683
00:33:50.839 --> 00:33:53.799
<v Speaker 2>Because the timing is consistent and strictly ordered, your brain

684
00:33:53.839 --> 00:33:58.400
<v Speaker 2>physically wires those two concepts together. The synapse strengthens. You

685
00:33:58.559 --> 00:34:02.640
<v Speaker 2>learn the rule: lightning causes thunder. Perfect. But imagine you hear

686
00:34:02.680 --> 00:34:05.720
<v Speaker 2>a random crash of thunder, and then ten seconds later

687
00:34:05.839 --> 00:34:10.800
<v Speaker 2>someone flashes a flashlight in your eyes. Your brain completely

688
00:34:10.880 --> 00:34:12.159
<v Speaker 2>ignores the association.

689
00:34:12.440 --> 00:34:14.519
<v Speaker 3>The timing is wrong, the order is reversed.

690
00:34:14.800 --> 00:34:19.000
<v Speaker 2>STDP means the neuromorphic hardware is teaching itself based solely

691
00:34:19.039 --> 00:34:22.280
<v Speaker 2>on the timing of highly localized events. It doesn't need

692
00:34:22.320 --> 00:34:25.199
<v Speaker 2>a massive global manager telling the whole network it made

693
00:34:25.199 --> 00:34:25.679
<v Speaker 2>a mistake.

694
00:34:25.840 --> 00:34:29.400
<v Speaker 3>It learns autonomously, synapse by synapse, based purely on what

695
00:34:29.440 --> 00:34:30.159
<v Speaker 3>it experiences.

696
00:34:30.320 --> 00:34:31.360
<v Speaker 2>That is just so elegant.

697
00:34:31.719 --> 00:34:34.280
<v Speaker 3>This is the physical realization of Hebbian learning, a

698
00:34:34.360 --> 00:34:37.880
<v Speaker 3>concept proposed back in nineteen forty nine, often summarized by

699
00:34:37.920 --> 00:34:41.960
<v Speaker 3>the famous neuroscience phrase neurons that fire together, wire together.

700
00:34:42.039 --> 00:34:42.880
<v Speaker 2>I've heard that phrase.

701
00:34:43.000 --> 00:34:47.000
<v Speaker 3>Modern neuromorphic chips like Intel's Loihi actually have highly configurable

702
00:34:47.079 --> 00:34:51.079
<v Speaker 3>learning rules built directly into the silicon that replicate these forms

703
00:34:50.760 --> 00:34:53.079
<v Speaker 2>of STDP. So they can learn locally.

704
00:34:53.400 --> 00:34:57.199
<v Speaker 3>Yes, this enables the chip to learn continuously on the fly,

705
00:34:57.920 --> 00:35:02.199
<v Speaker 3>directly from streaming sensory data in the real world, adapting

706
00:35:02.199 --> 00:35:04.760
<v Speaker 3>to new patterns without ever needing to connect to a

707
00:35:04.800 --> 00:35:08.519
<v Speaker 3>massive cloud server to run a backpropagation algorithm.

708
00:35:08.559 --> 00:35:12.199
<v Speaker 2>But wait, if we follow this logic to its physical conclusion,

709
00:35:12.559 --> 00:35:16.360
<v Speaker 2>we hit a roadblock. What's the roadblock? If STDP requires

710
00:35:16.440 --> 00:35:19.719
<v Speaker 2>the physical synapse to remember its own history, to know

711
00:35:19.760 --> 00:35:22.480
<v Speaker 2>if it fired before or after its neighbor, how do

712
00:35:22.519 --> 00:35:25.960
<v Speaker 2>you actually do that in silicon? A standard digital transistor

713
00:35:26.000 --> 00:35:28.599
<v Speaker 2>forgets everything the exact second the power drops.

714
00:35:28.679 --> 00:35:29.840
<v Speaker 3>That is the core issue.

715
00:35:29.960 --> 00:35:32.760
<v Speaker 2>Yes, you can't build a brain out of amnesiacs. Did

716
00:35:32.800 --> 00:35:35.199
<v Speaker 2>Carver Mead have an answer for that? Or did engineers

717
00:35:35.199 --> 00:35:37.199
<v Speaker 2>have to invent a completely new material?

718
00:35:37.400 --> 00:35:40.840
<v Speaker 3>That is the exact problem that pushes neuromorphic engineering into

719
00:35:40.880 --> 00:35:44.559
<v Speaker 3>the realm of exotic materials science. To truly replicate a

720
00:35:44.599 --> 00:35:48.320
<v Speaker 3>biological synapse, we need a physical component that naturally remembers

721
00:35:48.360 --> 00:35:49.679
<v Speaker 3>its own electrical history.

722
00:35:49.880 --> 00:35:50.519
<v Speaker 2>And what is that?

723
00:35:50.840 --> 00:35:53.119
<v Speaker 3>And this brings us to what is widely considered the

724
00:35:53.119 --> 00:35:56.360
<v Speaker 3>holy grail of neuromorphic computing, the memristor.

725
00:35:56.679 --> 00:35:59.559
<v Speaker 2>Memristor. It sounds like a piece of alien technology. What

726
00:35:59.599 --> 00:36:00.440
<v Speaker 2>exactly is it?

727
00:36:00.719 --> 00:36:04.360
<v Speaker 3>The word itself is a portmanteau of memory and resistor.

728
00:36:05.280 --> 00:36:08.519
<v Speaker 3>In standard electronics, a resistor is a basic component that

729
00:36:08.599 --> 00:36:12.000
<v Speaker 3>simply resists the flow of electrical current by a fixed

730
00:36:12.159 --> 00:36:13.079
<v Speaker 3>unchanging amount.

731
00:36:13.199 --> 00:36:15.639
<v Speaker 2>Okay, like a bottleneck in a pipe, right.

732
00:36:16.079 --> 00:36:19.880
<v Speaker 3>But a memristor is a two terminal electronic device whose

733
00:36:19.920 --> 00:36:23.280
<v Speaker 3>physical resistance fundamentally changes based on the history of the

734
00:36:23.320 --> 00:36:25.159
<v Speaker 3>current that has flowed through it in the past.

735
00:36:25.400 --> 00:36:29.000
<v Speaker 2>The physical properties of the material permanently change based on

736
00:36:29.039 --> 00:36:30.039
<v Speaker 2>what it has experienced.

737
00:36:30.159 --> 00:36:33.039
<v Speaker 3>Yes, as current flows through it in one direction, its

738
00:36:33.079 --> 00:36:36.360
<v Speaker 3>resistance might drop. If current flows in the opposite direction,

739
00:36:36.559 --> 00:36:38.079
<v Speaker 3>its resistance might increase.

740
00:36:38.199 --> 00:36:38.760
<v Speaker 2>Wow.

741
00:36:39.000 --> 00:36:42.239
<v Speaker 3>And crucially, when you turn the power completely off, the

742
00:36:42.320 --> 00:36:46.440
<v Speaker 3>memristor remembers its last state of resistance indefinitely.

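NOTE
Editor's sketch, not from the episode: a toy model of a device whose resistance depends on the current that has passed through it.
The class name ToyMemristor, the resistance limits, and the linear state update are all assumptions made for illustration; real devices behave in more complicated ways.
```python
class ToyMemristor:
    """Toy two-terminal device whose resistance depends on past current."""
    def __init__(self, r_on=100.0, r_off=16000.0, state=0.5, mobility=1000.0):
        self.r_on = r_on          # lowest resistance in ohms (assumed)
        self.r_off = r_off        # highest resistance in ohms (assumed)
        self.state = state        # 0.0 is fully off, 1.0 is fully on
        self.mobility = mobility  # state change per coulomb of charge (assumed)
    def resistance(self):
        """Blend the two limits according to the stored state."""
        return self.r_on * self.state + self.r_off * (1.0 - self.state)
    def apply_voltage(self, volts, seconds):
        """Pass current for a while; the sign of the voltage sets the direction."""
        current = volts / self.resistance()
        self.state += self.mobility * current * seconds
        self.state = min(1.0, max(0.0, self.state))
        return current
```
With no voltage applied, nothing touches the stored state, so the resistance simply stays where the last pulse left it.
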
743
00:36:46.800 --> 00:36:48.039
<v Speaker 2>That is deeply profound.

744
00:36:48.280 --> 00:36:52.039
<v Speaker 3>It is the literal physical embodiment of a biological synapse.

745
00:36:52.519 --> 00:36:54.880
<v Speaker 3>In a living brain, the weight of a synapse, how

746
00:36:54.920 --> 00:36:58.400
<v Speaker 3>strongly it connects two neurons, changes based on the history

747
00:36:58.440 --> 00:36:59.559
<v Speaker 3>of ions passing through it.

748
00:37:00.000 --> 00:37:01.599
<v Speaker 2>The memristor does the exact same

749
00:37:01.400 --> 00:37:06.000
<v Speaker 3>thing electrically. In a neuromorphic chip utilizing memristors, the electrical

750
00:37:06.039 --> 00:37:09.559
<v Speaker 3>resistance of the device directly encodes the strength of the connection.

751
00:37:10.280 --> 00:37:15.000
<v Speaker 3>That resistance physically alters when current flows, effectively implementing a

752
00:37:15.000 --> 00:37:18.000
<v Speaker 3>local learning rule directly in the atomic physics of the

753
00:37:18.039 --> 00:37:18.880
<v Speaker 3>device itself.

754
00:37:19.119 --> 00:37:21.880
<v Speaker 2>So we don't need a separate memory chip down the

755
00:37:21.880 --> 00:37:24.800
<v Speaker 2>street to remember the synaptic weight, and we don't need

756
00:37:24.840 --> 00:37:27.000
<v Speaker 2>a processor to calculate how it should change.

757
00:37:27.079 --> 00:37:27.800
<v Speaker 3>No, you don't.

758
00:37:27.920 --> 00:37:31.920
<v Speaker 2>The memristor just is the memory and the processor simultaneously.

759
00:37:31.400 --> 00:37:35.639
<v Speaker 3>And the engineering implications of that collocation are staggering, particularly

760
00:37:35.679 --> 00:37:39.119
<v Speaker 3>when it comes to the core mathematics of artificial intelligence.

761
00:37:39.280 --> 00:37:41.920
<v Speaker 2>The core mathematics, we're talking about matrix multiplication, right.

762
00:37:42.079 --> 00:37:45.920
<v Speaker 3>Yes. To appreciate this, we need to understand the massive

763
00:37:46.039 --> 00:37:48.000
<v Speaker 3>mathematical burden of AI.

764
00:37:48.280 --> 00:37:51.400
<v Speaker 2>Okay, I need to understand this matrix math thing. Why

765
00:37:51.480 --> 00:37:55.159
<v Speaker 2>is multiplying grids of numbers such a massive burden for

766
00:37:55.239 --> 00:37:56.119
<v Speaker 2>normal computers?

767
00:37:56.480 --> 00:38:00.519
<v Speaker 3>Think about how an artificial intelligence actually sees an image.

768
00:38:00.840 --> 00:38:03.239
<v Speaker 3>Let's say we have a tiny image just one hundred

769
00:38:03.239 --> 00:38:04.719
<v Speaker 3>pixels by one hundred pixels.

770
00:38:04.920 --> 00:38:08.119
<v Speaker 2>That's ten thousand numbers representing brightness. Exactly.

771
00:38:08.639 --> 00:38:11.480
<v Speaker 3>Now, feed that into a neural network layer with ten

772
00:38:11.599 --> 00:38:15.599
<v Speaker 3>thousand artificial neurons. Every single neuron needs to connect to

773
00:38:15.639 --> 00:38:19.039
<v Speaker 3>every single pixel. Oh wow, that creates a matrix of

774
00:38:19.159 --> 00:38:23.480
<v Speaker 3>one hundred million distinct weights. To calculate the output, the

775
00:38:23.519 --> 00:38:26.639
<v Speaker 3>computer must multiply the ten thousand pixel values by those

776
00:38:26.679 --> 00:38:28.039
<v Speaker 3>one hundred million weights.

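NOTE
Editor's sketch, not from the episode: the arithmetic behind the one hundred million weights, plus the one-at-a-time multiply and add loop a conventional processor runs.
The function name dense_layer and the tiny test values are assumptions for illustration.
```python
width, height = 100, 100      # image size from the example
pixels = width * height       # 10,000 brightness values
neurons = 10_000              # size of the fully connected layer
weights = pixels * neurons    # one weight per (neuron, pixel) pair
def dense_layer(inputs, weight_rows):
    """One multiply and one add per weight, done one at a time."""
    outputs = []
    for row in weight_rows:
        total = 0.0
        for x, w in zip(inputs, row):
            total += x * w    # fetch a weight, multiply, accumulate
        outputs.append(total)
    return outputs
```
For the example layer, the inner statement would run once per weight, so one hundred million times for a single image.
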
777
00:38:28.119 --> 00:38:31.159
<v Speaker 2>So the processor's basically doing hundreds of millions of tiny

778
00:38:31.199 --> 00:38:33.360
<v Speaker 2>math homework problems one by one.

779
00:38:33.119 --> 00:38:36.320
<v Speaker 3>And because of the von Neumann bottleneck, it is fetching

780
00:38:36.440 --> 00:38:38.960
<v Speaker 3>the textbook from the library down the street for every

781
00:38:39.000 --> 00:38:40.280
<v Speaker 3>single one of those problems.

782
00:38:40.320 --> 00:38:42.239
<v Speaker 2>Just endlessly fetching and returning.

783
00:38:42.360 --> 00:38:45.480
<v Speaker 3>It is fetching weights from memory, doing a digital multiplication,

784
00:38:45.960 --> 00:38:48.679
<v Speaker 3>adding the result to a running total, and storing it back.

785
00:38:49.239 --> 00:38:52.159
<v Speaker 3>Trillions of operations for a single frame of video.

786
00:38:52.239 --> 00:38:53.679
<v Speaker 2>It's exhausting just thinking about it.

787
00:38:53.760 --> 00:38:56.960
<v Speaker 3>Now, replace that conventional architecture with a massive grid of

788
00:38:57.000 --> 00:39:01.400
<v Speaker 3>memristors arranged in what we call a crossbar. Picture a

789
00:39:01.440 --> 00:39:03.320
<v Speaker 3>microscopic tic-tac-toe board.

790
00:39:03.440 --> 00:39:05.800
<v Speaker 2>Okay, a dense grid of intersecting wires.

791
00:39:06.199 --> 00:39:09.800
<v Speaker 3>At every single intersection where a vertical wire crosses a

792
00:39:09.800 --> 00:39:13.519
<v Speaker 3>horizontal wire, there is a memristor connecting them. In this

793
00:39:13.599 --> 00:39:16.280
<v Speaker 3>physical setup, we don't do digital math at all.

794
00:39:16.440 --> 00:39:17.119
<v Speaker 2>Wait, no math.

795
00:39:17.400 --> 00:39:20.320
<v Speaker 3>We map the input data, the pixels of our image,

796
00:39:20.599 --> 00:39:24.440
<v Speaker 3>as electrical voltages sent down the vertical wires. The stored

797
00:39:24.480 --> 00:39:27.320
<v Speaker 3>weights of our neural network are physically represented by the

798
00:39:27.360 --> 00:39:29.519
<v Speaker 3>conductance of the memristors at the

799
00:39:29.440 --> 00:39:33.239
<v Speaker 2>intersections. Conductance being the inverse of resistance, how easily they

800
00:39:33.280 --> 00:39:35.000
<v Speaker 2>let the electricity through. Exactly.

801
00:39:35.039 --> 00:39:38.280
<v Speaker 3>Now, basic high school physics takes over. Ohm's law states

802
00:39:38.280 --> 00:39:41.280
<v Speaker 3>that current equals voltage multiplied by conductance.

803
00:39:41.440 --> 00:39:43.159
<v Speaker 2>Okay, I remember that. So as the

804
00:39:43.199 --> 00:39:46.719
<v Speaker 3>voltage flows into the memristor, the resulting electrical current coming

805
00:39:46.719 --> 00:39:51.039
<v Speaker 3>out the other side is the exact literal mathematical product

806
00:39:51.159 --> 00:39:52.119
<v Speaker 3>of the input and

807
00:39:52.079 --> 00:39:54.000
<v Speaker 2>the weight. It multiplies it physically.

808
00:39:54.159 --> 00:39:57.559
<v Speaker 3>Yes, and then Kirchhoff's current law states that currents flowing

809
00:39:57.599 --> 00:40:01.119
<v Speaker 3>into a single node naturally add together. So the currents

810
00:40:01.119 --> 00:40:03.800
<v Speaker 3>from all the memristors along a row naturally sum up

811
00:40:03.840 --> 00:40:05.519
<v Speaker 3>as they flow into the horizontal wire.

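NOTE
Editor's sketch, not from the episode: a software stand-in for what the crossbar does physically, with inputs as voltages and weights as conductances.
The function name crossbar_currents and the indexing convention are assumptions; in a real array the loops below are not executed step by step, the currents simply flow.
```python
def crossbar_currents(voltages, conductances):
    """conductances[i][j] is the device joining input wire i to output wire j."""
    n_outputs = len(conductances[0])
    currents = [0.0] * n_outputs
    for i, v in enumerate(voltages):
        for j in range(n_outputs):
            # Ohm's law: current through one device is voltage times conductance.
            device_current = v * conductances[i][j]
            # Kirchhoff's current law: currents meeting on one wire add up.
            currents[j] += device_current
    return currents
```
Each entry of the result is one output wire's total current, which is one element of the matrix-vector product.
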
812
00:40:05.639 --> 00:40:09.480
<v Speaker 2>Wait, the physics engine of the universe calculates the multiplication

813
00:40:09.679 --> 00:40:13.159
<v Speaker 2>and the addition for us instantly, just by letting electricity

814
00:40:13.199 --> 00:40:13.960
<v Speaker 2>flow through the grid.

815
00:40:14.239 --> 00:40:18.880
<v Speaker 3>The entire matrix vector multiplication, the grueling operation that requires

816
00:40:18.920 --> 00:40:22.440
<v Speaker 3>billions of digital steps and massive energy on a standard GPU,

817
00:40:23.000 --> 00:40:26.920
<v Speaker 3>happens in a single instantaneous analog step.

818
00:40:26.719 --> 00:40:28.199
<v Speaker 2>That is unbelievable.

819
00:40:28.239 --> 00:40:31.519
<v Speaker 3>There is zero fetching from memory. The math happens naturally

820
00:40:31.559 --> 00:40:35.000
<v Speaker 3>as the electricity flows through the material. The promise here

821
00:40:35.119 --> 00:40:38.639
<v Speaker 3>is an improvement in energy efficiency for AI inference tasks,

822
00:40:38.960 --> 00:40:41.199
<v Speaker 3>not by a factor of ten or twenty, but by

823
00:40:41.320 --> 00:40:42.920
<v Speaker 3>several orders of magnitude.

824
00:40:42.960 --> 00:40:44.840
<v Speaker 2>I have to play the skeptic here, though, Okay, go

825
00:40:44.920 --> 00:40:48.039
<v Speaker 2>for it, because what you are describing sounds like pure magic.

826
00:40:48.519 --> 00:40:51.159
<v Speaker 2>If we have a physical component that acts exactly like

827
00:40:51.159 --> 00:40:55.239
<v Speaker 2>a biological synapse, eliminates the von Neumann bottleneck entirely,

828
00:40:55.559 --> 00:40:58.920
<v Speaker 2>uses zero digital computation, and saves massive amounts of power,

829
00:41:00.119 --> 00:41:03.079
<v Speaker 3>why isn't it in my smartphone? That's a fair question. Why

830
00:41:03.159 --> 00:41:06.840
<v Speaker 3>isn't every tech giant completely abandoning traditional silicon and churning

831
00:41:06.840 --> 00:41:10.679
<v Speaker 3>out memristor chips? Because the engineering reality of working with

832
00:41:10.719 --> 00:41:15.039
<v Speaker 3>these novel materials is brutal. Bridging the gap from a

833
00:41:15.079 --> 00:41:20.840
<v Speaker 3>beautiful theoretical physics concept to a mass produced, highly reliable

834
00:41:20.880 --> 00:41:25.679
<v Speaker 3>commercial chip is an incredibly difficult materials science challenge.

835
00:41:25.719 --> 00:41:26.760
<v Speaker 2>It's just hard to build them.

836
00:41:27.159 --> 00:41:31.599
<v Speaker 3>Memristive devices suffer from several severe practical physical hurdles that

837
00:41:31.639 --> 00:41:34.039
<v Speaker 3>have slowed their widespread commercial deployment.

838
00:41:34.360 --> 00:41:35.840
<v Speaker 2>What kind of hurdles are we talking about?

839
00:41:35.880 --> 00:41:40.960
<v Speaker 3>The first major obstacle is device variability. Variability? Yes. When

840
00:41:41.000 --> 00:41:45.639
<v Speaker 3>a foundry manufactures a standard digital silicon transistor, the process

841
00:41:45.719 --> 00:41:49.480
<v Speaker 3>is so perfected that the transistors are incredibly uniform. But

842
00:41:49.559 --> 00:41:52.480
<v Speaker 3>when you attempt to create millions or billions of nanoscale

843
00:41:52.559 --> 00:41:55.800
<v Speaker 3>memristors in a dense array, individual devices tend to have

844
00:41:55.880 --> 00:41:56.800
<v Speaker 3>slightly different

845
00:41:56.519 --> 00:41:58.400
<v Speaker 2>physical characteristics. So they aren't identical.

846
00:41:58.480 --> 00:42:01.880
<v Speaker 3>Their atomic structures aren't perfectly identical. They don't all react

847
00:42:01.880 --> 00:42:04.920
<v Speaker 3>to electrical current in exactly the same way. In a

848
00:42:05.000 --> 00:42:08.679
<v Speaker 3>digital system, minor variations don't matter because a strong signal

849
00:42:08.760 --> 00:42:12.360
<v Speaker 3>is still read as a one, But in an analog system,

850
00:42:12.519 --> 00:42:14.119
<v Speaker 3>the exact value matters.

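NOTE
Editor's sketch, not from the episode: why small device differences are harmless for a digital read but show up directly in an analog sum.
The 5 percent tolerance, the 0.5 threshold, and the function names are assumptions picked for illustration.
```python
def read_bit(level, threshold=0.5):
    """Digital read: anything at or above the threshold counts as a one."""
    return 1 if level >= threshold else 0
def output_current(voltages, conductances):
    """Analog read: the exact sum of voltage times conductance."""
    return sum(v * g for v, g in zip(voltages, conductances))
def skewed_current(voltages, conductances, tolerance):
    """Every device is off by the same fraction in the same direction."""
    return output_current(voltages, [g * (1.0 + tolerance) for g in conductances])
voltages = [0.2, 0.4, 0.1]
ideal = [1.0, 2.0, 4.0]                          # intended weights, arbitrary units
exact = output_current(voltages, ideal)
skewed = skewed_current(voltages, ideal, 0.05)   # devices 5 percent too conductive
relative_error = (skewed - exact) / exact        # carried straight into the result
```
A signal of 0.9 and one of 0.945 both read as a one digitally, while the same 5 percent shift moves the analog output by 5 percent.
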
851
00:42:14.480 --> 00:42:17.760
<v Speaker 2>It's like having a world class orchestra where every single

852
00:42:17.880 --> 00:42:21.760
<v Speaker 2>violin is tuned just slightly differently. It might still loosely

853
00:42:21.800 --> 00:42:25.360
<v Speaker 2>sound like a symphony, but the high level precision required

854
00:42:25.360 --> 00:42:27.599
<v Speaker 2>for complex tasks is totally

855
00:42:27.159 --> 00:42:31.719
<v Speaker 3>lost. Exactly. The analog noise overwhelms the signal. The second

856
00:42:31.760 --> 00:42:33.760
<v Speaker 3>major physical challenge is drift.

857
00:42:34.000 --> 00:42:34.360
<v Speaker 2>Drift.

858
00:42:34.440 --> 00:42:37.000
<v Speaker 3>What's that? Even when a memristor is not being

859
00:42:37.039 --> 00:42:40.559
<v Speaker 3>deliberately programmed or changed, when it's just sitting there holding

860
00:42:40.599 --> 00:42:44.639
<v Speaker 3>a memory, its internal resistance, the stored synaptic weight, can

861
00:42:44.679 --> 00:42:48.719
<v Speaker 3>slowly shift or degrade on its own over time. On its own?

862
00:42:49.119 --> 00:42:52.159
<v Speaker 3>This is due to thermal fluctuations or the natural movement

863
00:42:52.199 --> 00:42:55.599
<v Speaker 3>of atoms within the crystal lattice of the material. Imagine

864
00:42:55.599 --> 00:42:59.440
<v Speaker 3>your computer's hard drive slowly randomly changing the text inside

865
00:42:59.440 --> 00:43:02.199
<v Speaker 3>its own files while the power is turned off. It

866
00:43:02.280 --> 00:43:05.599
<v Speaker 3>progressively degrades the accuracy of the neural network over time.

867
00:43:05.760 --> 00:43:08.719
<v Speaker 2>That sounds like an absolute nightmare for reliability. If you

868
00:43:08.800 --> 00:43:11.199
<v Speaker 2>train a self driving car on a memristor chip, you

869
00:43:11.199 --> 00:43:13.559
<v Speaker 2>don't want it slowly forgetting what a pedestrian looks like

870
00:43:13.599 --> 00:43:14.440
<v Speaker 2>over the course of a year.

871
00:43:14.559 --> 00:43:17.079
<v Speaker 3>No, you definitely do not. And the third major hurdle

872
00:43:17.199 --> 00:43:22.599
<v Speaker 3>is endurance. Endurance? Biological synapses are remarkably resilient. They can

873
00:43:22.679 --> 00:43:27.000
<v Speaker 3>dynamically strengthen and weaken continuously for an entire human lifespan,

874
00:43:27.159 --> 00:43:31.719
<v Speaker 3>a century or more. Memristors, however, rely on physically moving

875
00:43:31.760 --> 00:43:35.719
<v Speaker 3>atoms or altering crystalline structures at the nanoscale every time

876
00:43:35.719 --> 00:43:38.599
<v Speaker 3>they are reprogrammed. So they wear out? They eventually suffer

877
00:43:38.639 --> 00:43:41.360
<v Speaker 3>from physical wear and tear. The number of times a

878
00:43:41.400 --> 00:43:45.639
<v Speaker 3>memristor can be reliably programmed before its properties degrade irreversibly

879
00:43:45.719 --> 00:43:49.199
<v Speaker 3>is limited. This endurance limit is a severe bottleneck for

880
00:43:49.239 --> 00:43:51.679
<v Speaker 3>systems that are meant to learn continuously on the fly.

881
00:43:51.960 --> 00:43:54.159
<v Speaker 2>So it's brilliant in theory and it works in the lab,

882
00:43:54.639 --> 00:43:58.119
<v Speaker 2>but the physical materials keep breaking down, drifting, or forgetting

883
00:43:58.119 --> 00:43:59.599
<v Speaker 2>things when we try to scale it up.

884
00:43:59.679 --> 00:44:03.519
<v Speaker 3>It is an active, intensely competitive battleground in materials science

885
00:44:03.599 --> 00:44:08.840
<v Speaker 3>right now. Researchers worldwide are investigating numerous exotic candidate technologies

886
00:44:08.920 --> 00:44:10.960
<v Speaker 3>fighting for supremacy to solve these

887
00:44:10.840 --> 00:44:12.920
<v Speaker 2>exact issues. Like what kind of technologies?

888
00:44:13.119 --> 00:44:16.039
<v Speaker 3>We are looking at phase change memory, which uses heat

889
00:44:16.159 --> 00:44:20.360
<v Speaker 3>to change the material from amorphous to crystalline. We are

890
00:44:20.440 --> 00:44:24.639
<v Speaker 3>looking at resistive RAM, conductive bridging RAM, which physically builds

891
00:44:24.639 --> 00:44:27.800
<v Speaker 3>and breaks tiny metallic wires at the atomic level. We

892
00:44:27.880 --> 00:44:30.000
<v Speaker 3>are looking at ferroelectric devices.

893
00:44:30.079 --> 00:44:31.679
<v Speaker 2>That sounds like a lot of options.

894
00:44:31.840 --> 00:44:35.079
<v Speaker 3>Each of these exotic materials offers a different trade off.

895
00:44:35.880 --> 00:44:40.360
<v Speaker 3>One might have incredible endurance but terrible variability. Another might

896
00:44:40.400 --> 00:44:43.360
<v Speaker 3>be highly stable but require too much energy to program.

897
00:44:44.039 --> 00:44:47.880
<v Speaker 3>Progress is steady, but we haven't found the perfect Goldilocks

898
00:44:47.960 --> 00:44:51.360
<v Speaker 3>material yet that can be seamlessly integrated into standard commercial

899
00:44:51.400 --> 00:44:52.400
<v Speaker 3>silicon foundries.

900
00:44:52.519 --> 00:44:55.199
<v Speaker 2>Okay, so the memristor revolution is still fighting its way

901
00:44:55.239 --> 00:44:59.239
<v Speaker 2>out of the materials science labs. But despite these massive

902
00:44:59.320 --> 00:45:03.599
<v Speaker 2>physical hurdles, neuromorphic chips, even the ones relying on standard

903
00:45:03.599 --> 00:45:06.960
<v Speaker 2>silicon like Intel's Loihi, are already leaving the lab

904
00:45:06.960 --> 00:45:09.239
<v Speaker 2>and entering the wild. They definitely are, and they are

905
00:45:09.280 --> 00:45:13.800
<v Speaker 2>thriving in very specific environments, environments where traditional power hungry

906
00:45:13.800 --> 00:45:15.519
<v Speaker 2>AI simply cannot survive.

907
00:45:15.719 --> 00:45:18.119
<v Speaker 3>If you want to find the perfect natural habitat for

908
00:45:18.199 --> 00:45:21.639
<v Speaker 3>neuromorphic technology today, you look to a field called edge

909
00:45:21.679 --> 00:45:23.159
<v Speaker 3>computing. The edge.

910
00:45:23.320 --> 00:45:26.920
<v Speaker 2>Edge computing refers to environments that are physically far removed

911
00:45:26.960 --> 00:45:30.480
<v Speaker 2>from the cloud and those massive warehouse sized data centers

912
00:45:30.480 --> 00:45:31.480
<v Speaker 2>we discussed earlier.

913
00:45:31.519 --> 00:45:32.519
<v Speaker 3>So out in the real world.

914
00:45:32.679 --> 00:45:35.760
<v Speaker 2>We are talking about devices operating out in the remote,

915
00:45:35.840 --> 00:45:40.840
<v Speaker 2>chaotic physical world. These devices are constrained by strict physical limits.

916
00:45:41.079 --> 00:45:44.039
<v Speaker 2>They're usually running on limited batteries, they cannot be plugged

917
00:45:44.079 --> 00:45:48.519
<v Speaker 2>into a wall. Furthermore, they require instant, zero latency reactions

918
00:45:48.519 --> 00:45:51.679
<v Speaker 2>to their environment. They simply cannot afford the time or

919
00:45:51.719 --> 00:45:54.719
<v Speaker 2>the energy to beam sensory data back to a server farm,

920
00:45:55.039 --> 00:45:57.880
<v Speaker 2>wait for a massive GPU to process it, and wait

921
00:45:57.880 --> 00:45:59.639
<v Speaker 2>for the instructions to be beamed back.

922
00:46:00.079 --> 00:46:03.280
<v Speaker 3>And a major part of operating effectively at the edge

923
00:46:03.760 --> 00:46:07.280
<v Speaker 3>is how the machine actually perceives the physical world. We

924
00:46:07.320 --> 00:46:10.000
<v Speaker 3>talked earlier about how traditional processors are forced to march

925
00:46:10.039 --> 00:46:13.079
<v Speaker 3>to a rigid metronome clock cycle. Yes, well, it turns

926
00:46:13.119 --> 00:46:16.360
<v Speaker 3>out our traditional sensors, the eyes of our machines, do

927
00:46:16.480 --> 00:46:19.519
<v Speaker 3>the exact same thing, which brings us to the concept

928
00:46:19.639 --> 00:46:24.400
<v Speaker 3>of event based cameras. Right. Traditional cameras operate on a

929
00:46:24.599 --> 00:46:29.639
<v Speaker 3>rigid frame based paradigm. A standard digital camera takes a

930
00:46:29.719 --> 00:46:34.519
<v Speaker 3>full complete picture, let's say, sixty complete frames every single

931
00:46:34.280 --> 00:46:36.559
<v Speaker 2>second. Regardless of what's happening. Exactly.

932
00:46:36.599 --> 00:46:40.159
<v Speaker 3>It captures the light value of every single pixel, millions

933
00:46:40.159 --> 00:46:43.559
<v Speaker 3>of pixels, over and over, completely regardless of what is

934
00:46:43.599 --> 00:46:45.239
<v Speaker 3>actually happening in the scene in front of it.

935
00:46:45.400 --> 00:46:48.199
<v Speaker 2>I love the analogy for this. A traditional frame based

936
00:46:48.239 --> 00:46:51.960
<v Speaker 2>camera is like a terribly inefficient security guard sitting at

937
00:46:51.960 --> 00:46:54.079
<v Speaker 2>a desk watching an empty hallway.

938
00:46:53.800 --> 00:46:55.440
<v Speaker 3>A very bored security guard.

939
00:46:55.559 --> 00:46:58.280
<v Speaker 2>The guard is forced by protocol to pick up the

940
00:46:58.360 --> 00:47:00.800
<v Speaker 2>radio and call headquarters sixty times a second to

941
00:47:00.800 --> 00:47:03.639
<v Speaker 2>give a full report. The hallway is still empty. The

942
00:47:03.679 --> 00:47:05.920
<v Speaker 2>hallway is still empty. The hallway is still empty.

943
00:47:06.000 --> 00:47:06.960
<v Speaker 3>That sounds exhausting.

944
00:47:07.119 --> 00:47:10.760
<v Speaker 2>The camera is generating massive amounts of entirely redundant data,

945
00:47:11.119 --> 00:47:14.400
<v Speaker 2>and the processor downstream has to expend massive amounts of

946
00:47:14.480 --> 00:47:18.239
<v Speaker 2>energy to process all those identical pictures just to mathematically

947
00:47:18.280 --> 00:47:19.880
<v Speaker 2>confirm that nothing changed.

948
00:47:20.039 --> 00:47:22.639
<v Speaker 3>And the consequence of that setup is a massive waste

949
00:47:22.719 --> 00:47:25.119
<v Speaker 3>of power and computational bandwidth.

950
00:47:25.159 --> 00:47:26.360
<v Speaker 2>So what's the alternative.

951
00:47:26.400 --> 00:47:30.159
<v Speaker 3>An event based sensor, often called a dynamic vision sensor,

952
00:47:30.920 --> 00:47:35.320
<v Speaker 3>is intrinsically tied to the neuromorphic architecture, and it fundamentally

953
00:47:35.400 --> 00:47:39.000
<v Speaker 3>changes this paradigm. Instead of capturing full frames on a

954
00:47:39.079 --> 00:47:43.559
<v Speaker 3>rigid clock, an event based camera only outputs a signal,

955
00:47:43.719 --> 00:47:47.719
<v Speaker 3>a spike, when a specific individual pixel detects a meaningful

956
00:47:47.800 --> 00:47:48.840
<v Speaker 3>change in brightness.

957
00:47:49.360 --> 00:47:50.880
<v Speaker 2>Every pixel acts independently.

958
00:47:51.000 --> 00:47:53.480
<v Speaker 3>Every single pixel acts independently.

959
00:47:52.960 --> 00:47:55.199
<v Speaker 2>So our security guard completely goes to sleep. They use

960
00:47:55.239 --> 00:47:58.079
<v Speaker 2>absolutely zero power. They only pick up the radio and

961
00:47:58.079 --> 00:48:00.400
<v Speaker 2>call headquarters when someone actually opens the door at the

962
00:48:00.480 --> 00:48:01.079
<v Speaker 2>end of the hallway.

963
00:48:01.280 --> 00:48:06.159
<v Speaker 3>Exactly. The camera produces a sparse, asynchronous stream of spikes

964
00:48:06.400 --> 00:48:09.480
<v Speaker 3>corresponding only to movement or change in the visual field.

965
00:48:09.960 --> 00:48:13.199
<v Speaker 3>If the scene is static, the camera outputs absolutely nothing

966
00:48:13.360 --> 00:48:15.079
<v Speaker 3>and consumes practically no power.

967
00:48:15.280 --> 00:48:16.840
<v Speaker 2>But if something moves?

968
00:48:16.800 --> 00:48:20.159
<v Speaker 3>But if an object moves rapidly, the individual pixels capture that

969
00:48:20.199 --> 00:48:23.760
<v Speaker 3>movement with microsecond precision, far faster than a standard sixty

970
00:48:23.760 --> 00:48:26.760
<v Speaker 3>frame per second camera could ever hope to catch. Furthermore,

971
00:48:26.840 --> 00:48:30.159
<v Speaker 3>because each pixel manages its own exposure based only on

972
00:48:30.239 --> 00:48:34.280
<v Speaker 3>local change, event cameras don't get blinded by looking directly

973
00:48:34.360 --> 00:48:36.920
<v Speaker 3>at the sun the way normal cameras do. They have

974
00:48:37.000 --> 00:48:39.119
<v Speaker 3>immense dynamic range.

975
00:48:38.719 --> 00:48:41.079
<v Speaker 2>And because this sensory data is already in the form

976
00:48:41.119 --> 00:48:45.519
<v Speaker 2>of discrete spikes, it forms a perfectly matched, seamless pipeline

977
00:48:45.760 --> 00:48:47.639
<v Speaker 2>directly into a neuromorphic chip.

978
00:48:47.760 --> 00:48:49.480
<v Speaker 3>It speaks the same language.

979
00:48:49.519 --> 00:48:52.280
<v Speaker 2>The chip doesn't have to translate a massive JPEG image

980
00:48:52.320 --> 00:48:55.000
<v Speaker 2>into a neural network. It just receives the spikes and

981
00:48:55.039 --> 00:48:58.519
<v Speaker 2>reacts instantly. Where's this hardware actually being deployed today?

982
00:48:58.760 --> 00:49:02.360
<v Speaker 3>The most compelling and rapidly advancing application domain by

983
00:49:02.480 --> 00:49:03.639
<v Speaker 3>far is robotics.

984
00:49:03.840 --> 00:49:05.280
<v Speaker 2>Robotics. That makes sense.

985
00:49:05.440 --> 00:49:09.719
<v Speaker 3>Autonomous robots face the exact same fundamental survival problems that

986
00:49:09.800 --> 00:49:13.159
<v Speaker 3>biological animals face in the wild. They have to process

987
00:49:13.159 --> 00:49:17.400
<v Speaker 3>incredibly rich, noisy sensory data from a chaotic real world.

988
00:49:17.639 --> 00:49:21.800
<v Speaker 2>They must adapt their physical behavior to unpredictable changing environments.

989
00:49:22.000 --> 00:49:24.760
<v Speaker 3>They must make rapid life or death decisions without the

990
00:49:24.840 --> 00:49:27.360
<v Speaker 3>latency of calling a cloud server. And they have to

991
00:49:27.360 --> 00:49:29.920
<v Speaker 3>do it all while carrying their own limited energy source,

992
00:49:30.119 --> 00:49:31.039
<v Speaker 3>a heavy battery.

993
00:49:31.320 --> 00:49:34.920
<v Speaker 2>Right. Picture an autonomous drone trying to navigate through a

994
00:49:35.000 --> 00:49:37.000
<v Speaker 2>dense forest at forty miles per hour.

995
00:49:37.199 --> 00:49:38.400
<v Speaker 3>That's a perfect example.

996
00:49:38.639 --> 00:49:41.639
<v Speaker 2>It can't pause mid air for two seconds to upload

997
00:49:41.679 --> 00:49:44.599
<v Speaker 2>a video frame to a server, wait for a neural

998
00:49:44.639 --> 00:49:47.599
<v Speaker 2>network to calculate the depth of the forest and wait

999
00:49:47.599 --> 00:49:50.559
<v Speaker 2>for instructions on how to dodge a branch. It will

1000
00:49:50.559 --> 00:49:53.480
<v Speaker 2>crash into a tree before the data even reaches the cloud.

1001
00:49:54.119 --> 00:49:57.960
<v Speaker 3>When researchers equip these robots with event based sensors directly

1002
00:49:58.000 --> 00:50:02.000
<v Speaker 3>wired into neuromorphic processors, they achieve extraordinary results.

1003
00:50:02.039 --> 00:50:03.280
<v Speaker 2>Oh like, what kind of results?

1004
00:50:03.360 --> 00:50:07.519
<v Speaker 3>These robotic systems can perform adaptive locomotion like a multi

1005
00:50:07.559 --> 00:50:11.079
<v Speaker 3>legged robot learning to walk over uneven shifting terrain on

1006
00:50:11.079 --> 00:50:14.400
<v Speaker 3>the fly, and high speed obstacle avoidance at power budgets

1007
00:50:14.599 --> 00:50:17.360
<v Speaker 3>that would absolutely crush a conventional computer.

1008
00:50:17.480 --> 00:50:20.239
<v Speaker 2>Because conventional computers are just too heavy and power hungry.

1009
00:50:20.360 --> 00:50:22.559
<v Speaker 3>If you tried to run a traditional state of the

1010
00:50:22.639 --> 00:50:25.679
<v Speaker 3>art computer vision model on a small flying drone, the

1011
00:50:25.760 --> 00:50:29.119
<v Speaker 3>battery would be entirely consumed just running the processor, leaving

1012
00:50:29.239 --> 00:50:31.599
<v Speaker 3>zero energy for the actual rotors to keep it in

1013
00:50:31.639 --> 00:50:35.159
<v Speaker 3>the air. Neuromorphic chips solve the fundamental power and weight

1014
00:50:35.280 --> 00:50:37.719
<v Speaker 3>constraint problems inherent in mobile robotics.

1015
00:50:38.039 --> 00:50:42.800
<v Speaker 2>It's essentially giving machines the survival instincts of an insect, fast,

1016
00:50:43.199 --> 00:50:46.079
<v Speaker 2>incredibly energy efficient, and fully self contained.

1017
00:50:46.280 --> 00:50:50.880
<v Speaker 3>But the applications are not limited merely to building better, faster robots.

1018
00:50:51.239 --> 00:50:55.639
<v Speaker 3>There is a profound bidirectional relationship here. Building these physical

1019
00:50:55.639 --> 00:50:59.360
<v Speaker 3>neuromorphic chips is proving to be an invaluable tangible tool

1020
00:50:59.440 --> 00:51:01.760
<v Speaker 3>for the field of computational neuroscience itself.

1021
00:51:01.880 --> 00:51:05.599
<v Speaker 2>Wait, how so? Does building a silicon brain actually help

1022
00:51:05.679 --> 00:51:09.079
<v Speaker 2>us understand the wet biological brain better?

1023
00:51:09.639 --> 00:51:13.119
<v Speaker 3>Immensely. In theoretical neuroscience, researchers can sometimes get away with vague

1024
00:51:13.119 --> 00:51:16.519
<v Speaker 3>mathematical models or high level assumptions about how large populations

1025
00:51:16.519 --> 00:51:17.440
<v Speaker 3>of neurons interact.

1026
00:51:17.079 --> 00:51:18.599
<v Speaker 2>Because it's just theory.

1027
00:51:18.800 --> 00:51:21.039
<v Speaker 3>But when you are forced to build a physical piece

1028
00:51:21.079 --> 00:51:26.760
<v Speaker 3>of hardware in silicon, you cannot be vague. Hardware requires explicit, rigorous,

1029
00:51:26.880 --> 00:51:28.039
<v Speaker 3>quantitative commitment.

1030
00:51:28.159 --> 00:51:29.599
<v Speaker 2>Right. You have to actually build the circuit.

1031
00:51:29.679 --> 00:51:33.280
<v Speaker 3>You must specify exactly how the voltage operates, exactly how

1032
00:51:33.280 --> 00:51:36.559
<v Speaker 3>the timing windows work, exactly how the local learning rules

1033
00:51:36.599 --> 00:51:40.880
<v Speaker 3>are applied. The physical implementation of these theories forces neuroscientists

1034
00:51:40.920 --> 00:51:42.440
<v Speaker 3>to be absolutely rigorous.

1035
00:51:42.800 --> 00:51:46.280
<v Speaker 2>It acts as a strict physical reality check for their theories.

1036
00:51:46.440 --> 00:51:50.400
<v Speaker 3>Exactly. When they run their biological models on physical platforms

1037
00:51:50.440 --> 00:51:53.599
<v Speaker 3>like the Heidelberg BrainScaleS system, the behavior of

1038
00:51:53.639 --> 00:51:56.519
<v Speaker 3>the hardware provides a tangible test of their theories about

1039
00:51:56.559 --> 00:52:00.440
<v Speaker 3>working memory, temporal coding, and sensory adaptation.

1040
00:52:00.760 --> 00:52:02.719
<v Speaker 2>And what if the hardware doesn't act like the brain?

1041
00:52:03.280 --> 00:52:05.559
<v Speaker 3>If the silicon network does not behave the way the

1042
00:52:05.559 --> 00:52:08.960
<v Speaker 3>living animal does, it indicates that the biological theory is

1043
00:52:09.039 --> 00:52:14.079
<v Speaker 3>fundamentally flawed or incomplete. It creates a beautiful reciprocal feedback

1044
00:52:14.119 --> 00:52:19.559
<v Speaker 3>loop where engineering constraints inform neuroscience and biological discoveries inspire

1045
00:52:19.599 --> 00:52:20.480
<v Speaker 3>better engineering.

1046
00:52:20.679 --> 00:52:23.239
<v Speaker 2>We've spent a lot of time celebrating the hardware. The

1047
00:52:23.280 --> 00:52:27.039
<v Speaker 2>analog architecture makes complete sense. The power savings are literally

1048
00:52:27.159 --> 00:52:30.679
<v Speaker 2>orders of magnitude better, the event based sensors are brilliant,

1049
00:52:30.960 --> 00:52:34.960
<v Speaker 2>and the edge robotics applications are undeniably the future. Yes,

1050
00:52:35.199 --> 00:52:37.039
<v Speaker 2>so I have to address the elephant in the room.

1051
00:52:37.079 --> 00:52:38.480
<v Speaker 3>The elephant in the room. Let's hear it.

1052
00:52:38.760 --> 00:52:42.599
<v Speaker 2>If this hardware is so perfectly designed, so biologically inspired,

1053
00:52:42.639 --> 00:52:46.800
<v Speaker 2>and so vastly efficient, why isn't the entire tech industry pivoting?

1054
00:52:47.480 --> 00:52:52.000
<v Speaker 2>Why is the artificial intelligence landscape still almost entirely dominated

1055
00:52:52.039 --> 00:52:57.480
<v Speaker 2>by massive power hungry GPUs churning through conventional deep learning?

1056
00:52:57.360 --> 00:53:00.519
<v Speaker 3>Because of the software. The software. We have hit

1057
00:53:00.599 --> 00:53:04.480
<v Speaker 3>what the industry calls the software wall. The harsh, pragmatic

1058
00:53:04.559 --> 00:53:08.280
<v Speaker 3>truth of neuromorphic computing today is that programming a spiking

1059
00:53:08.320 --> 00:53:12.159
<v Speaker 3>neuromorphic chip to do complex tasks is incredibly difficult.

1060
00:53:12.199 --> 00:53:15.000
<v Speaker 2>Why is it so hard? We have millions of brilliant

1061
00:53:15.000 --> 00:53:18.400
<v Speaker 2>software developers in the world. We build massive operating systems

1062
00:53:18.400 --> 00:53:19.599
<v Speaker 2>and complex video games.

1063
00:53:19.719 --> 00:53:23.360
<v Speaker 3>We do. But the entire global ecosystem of software development,

1064
00:53:23.599 --> 00:53:28.000
<v Speaker 3>the massive, highly polished libraries, the intuitive training frameworks like

1065
00:53:28.039 --> 00:53:32.280
<v Speaker 3>PyTorch or TensorFlow, the decades of accumulated mathematical knowledge is

1066
00:53:32.440 --> 00:53:35.519
<v Speaker 3>entirely built around conventional continuous digital math.

1067
00:53:35.639 --> 00:53:38.000
<v Speaker 2>Oh right, because we're used to the von Neumann system.

1068
00:53:38.119 --> 00:53:43.119
<v Speaker 3>The software ecosystem for neuromorphic computing is, by comparison, very immature.

1069
00:53:43.880 --> 00:53:46.679
<v Speaker 3>But the problem isn't just a lack of polished tools

1070
00:53:46.800 --> 00:53:51.679
<v Speaker 3>or developer familiarity. It's a fundamental mathematical mismatch at the

1071
00:53:51.719 --> 00:53:52.199
<v Speaker 3>core of the technology.

1072
00:53:52.159 --> 00:53:54.199
<v Speaker 2>Because of the spikes?

1073
00:53:54.280 --> 00:53:57.400
<v Speaker 3>Because of the spikes. We talked earlier about backpropagation, the

1074
00:53:57.559 --> 00:54:01.159
<v Speaker 3>calculus based algorithm used to train almost all modern

1075
00:54:01.199 --> 00:54:02.599
<v Speaker 3>AI to high levels of accuracy.

1076
00:54:02.519 --> 00:54:04.039
<v Speaker 2>The micromanager.

1077
00:54:04.079 --> 00:54:10.719
<v Speaker 3>The micromanager. Backpropagation relies on calculating gradients: smooth, continuous mathematical

1078
00:54:10.760 --> 00:54:14.199
<v Speaker 3>slopes that tell the network exactly which direction to adjust

1079
00:54:14.199 --> 00:54:17.639
<v Speaker 3>its weights to reduce errors. Calculus. You can only use

1080
00:54:17.639 --> 00:54:21.000
<v Speaker 3>this calculus if the mathematical function you are analyzing is

1081
00:54:21.079 --> 00:54:25.800
<v Speaker 3>continuous and smooth. It must be what mathematicians call differentiable.

1082
00:54:25.239 --> 00:54:28.039
<v Speaker 2>Meaning you can calculate a precise slope at any given

1083
00:54:28.039 --> 00:54:28.880
<v Speaker 2>point on the curve.

1084
00:54:29.039 --> 00:54:32.039
<v Speaker 3>But spikes are not smooth or continuous. A spike is

1085
00:54:32.079 --> 00:54:35.239
<v Speaker 3>a discrete, binary event in time. It either happens or

1086
00:54:35.239 --> 00:54:38.679
<v Speaker 3>it doesn't. Mathematically, a spike is not a gentle hill.

1087
00:54:39.239 --> 00:54:43.679
<v Speaker 3>It is a sudden vertical sheer cliff face. You cannot

1088
00:54:43.679 --> 00:54:47.400
<v Speaker 3>calculate a smooth slope on a sheer cliff face. Therefore,

1089
00:54:47.960 --> 00:54:51.440
<v Speaker 3>spiking neural networks are fundamentally non differentiable.

1090
00:54:51.920 --> 00:54:54.880
<v Speaker 2>I think we need a grounded analogy for this. Imagine

1091
00:54:54.880 --> 00:54:58.280
<v Speaker 2>trying to train a dog. Okay, a dog. With traditional AI,

1092
00:54:58.599 --> 00:55:01.360
<v Speaker 2>where you have continuous math and calculus gradients, it's like

1093
00:55:01.400 --> 00:55:04.079
<v Speaker 2>training a dog using a leash. You have continuous control.

1094
00:55:04.440 --> 00:55:06.719
<v Speaker 2>You can gently pull the dog to the left or

1095
00:55:06.840 --> 00:55:10.239
<v Speaker 2>smoothly guide it to the right. It's a smooth, continuous correction.

1096
00:55:10.400 --> 00:55:12.519
<v Speaker 2>I like that. But a spiking network, it's like a

1097
00:55:12.559 --> 00:55:14.840
<v Speaker 2>light switch. It's either on or off. You can't gently

1098
00:55:14.840 --> 00:55:17.320
<v Speaker 2>flip a switch. You can't give a nuanced correction, you

1099
00:55:17.360 --> 00:55:19.800
<v Speaker 2>can't use the leash of calculus. How do you train

1100
00:55:19.840 --> 00:55:22.519
<v Speaker 2>the network? You can't directly use the leash. That is

1101
00:55:22.599 --> 00:55:25.440
<v Speaker 2>the central algorithmic challenge of the field. How do you

1102
00:55:25.480 --> 00:55:28.840
<v Speaker 2>train a network that speaks exclusively in discrete, non differentiable

1103
00:55:28.960 --> 00:55:32.800
<v Speaker 2>events to perform highly complex, nuanced tasks at the exact

1104
00:55:32.800 --> 00:55:35.719
<v Speaker 2>same level of accuracy as a conventional network trained with

1105
00:55:35.760 --> 00:55:39.880
<v Speaker 2>advanced calculus? So what is the compromise? How are researchers

1106
00:55:39.920 --> 00:55:42.320
<v Speaker 2>currently trying to bridge this mathematical gap.

1107
00:55:42.400 --> 00:55:47.440
<v Speaker 3>The current dominant approach relies heavily on techniques called surrogate

1108
00:55:47.519 --> 00:55:53.880
<v Speaker 3>gradient methods. Surrogate gradients. Essentially, scientists mathematically cheat during the

1109
00:55:53.920 --> 00:55:57.480
<v Speaker 3>training phase in their computer simulations. They take the sheer,

1110
00:55:57.800 --> 00:56:01.519
<v Speaker 3>undifferentiable cliff face of a spike and replace it with

1111
00:56:01.599 --> 00:56:05.599
<v Speaker 3>a smooth, continuous curve, a surrogate, just for the purposes

1112
00:56:05.639 --> 00:56:06.679
<v Speaker 3>of doing the calculus.

1113
00:56:06.960 --> 00:56:09.119
<v Speaker 2>To use my analogy, they pretend the light switch is

1114
00:56:09.159 --> 00:56:12.519
<v Speaker 2>actually a smooth volume dial, just long enough to figure

1115
00:56:12.559 --> 00:56:14.039
<v Speaker 2>out which direction they need to turn it.

1116
00:56:14.159 --> 00:56:16.320
<v Speaker 3>That's a great way to think of it. They calculate

1117
00:56:16.360 --> 00:56:20.280
<v Speaker 3>the approximate gradients using backpropagation, update the weights, and then

1118
00:56:20.440 --> 00:56:24.480
<v Speaker 3>map those newly trained weights back onto the discrete spiking hardware.

1119
00:56:24.559 --> 00:56:27.800
<v Speaker 2>It's an approximation. You are forcing the analog hardware to

1120
00:56:27.920 --> 00:56:29.920
<v Speaker 2>learn using a digital math translation.

1121
00:56:30.079 --> 00:56:33.440
<v Speaker 3>It is an approximation. And while surrogate gradients have driven

1122
00:56:33.480 --> 00:56:37.159
<v Speaker 3>significant progress and allowed spiking networks to tackle much harder problems,

1123
00:56:37.440 --> 00:56:39.440
<v Speaker 3>it remains a mathematical compromise.

1124
00:56:38.800 --> 00:56:41.119
<v Speaker 2>So it's not perfect?

1125
00:56:42.000 --> 00:56:46.000
<v Speaker 3>No. Currently, spiking neural networks trained using these surrogate methods still

1126
00:56:46.079 --> 00:56:49.639
<v Speaker 3>generally lag behind conventional deep neural networks when tested on

1127
00:56:49.880 --> 00:56:54.800
<v Speaker 3>highly complex industry standard benchmarks like massive natural language processing

1128
00:56:54.880 --> 00:56:59.639
<v Speaker 3>or hyper detailed image generation. The raw accuracy isn't quite

1129
00:56:59.639 --> 00:57:00.199
<v Speaker 3>there yet.

1130
00:57:00.360 --> 00:57:03.239
<v Speaker 2>So we have built a physical hardware architecture that is

1131
00:57:03.320 --> 00:57:07.159
<v Speaker 2>theoretically perfect for energy efficiency, but we haven't cracked the

1132
00:57:07.199 --> 00:57:10.960
<v Speaker 2>code on how to teach it complex tasks as effectively

1133
00:57:11.039 --> 00:57:14.239
<v Speaker 2>as our flawed brute force power hungry systems.

1134
00:57:14.400 --> 00:57:17.679
<v Speaker 3>Closing the software performance gap, figuring out how to achieve

1135
00:57:17.719 --> 00:57:20.920
<v Speaker 3>state of the art accuracy natively on spiking hardware while

1136
00:57:20.960 --> 00:57:24.960
<v Speaker 3>maintaining that insane energy efficiency is arguably the defining engineering

1137
00:57:25.000 --> 00:57:27.360
<v Speaker 3>challenge of the next decade in this field.

1138
00:57:27.320 --> 00:57:29.400
<v Speaker 2>Let's step back and look at the massive journey we've

1139
00:57:29.440 --> 00:57:32.119
<v Speaker 2>just taken. We started by looking at the crippling heat

1140
00:57:32.159 --> 00:57:35.840
<v Speaker 2>and fundamental inefficiency of the von Neumann bottleneck, the processor

1141
00:57:36.079 --> 00:57:39.400
<v Speaker 2>and memory forever commuting back and forth burning power. We

1142
00:57:39.519 --> 00:57:42.639
<v Speaker 2>covered a lot of ground. We really did. We explored

1143
00:57:42.679 --> 00:57:46.639
<v Speaker 2>Carver Mead's radical vision of using analog physics to mirror biology,

1144
00:57:47.039 --> 00:57:50.519
<v Speaker 2>ditching the rigid digital metronome for this silent, event driven

1145
00:57:50.599 --> 00:57:52.480
<v Speaker 2>jazz ensemble of spiking networks.

1146
00:57:52.119 --> 00:57:55.440
<v Speaker 3>And we looked at the incredible silicon brains actually

1147
00:57:55.440 --> 00:57:56.480
<v Speaker 3>being forged today,

1148
00:57:56.159 --> 00:58:00.480
<v Speaker 2>from Intel's Loihi to the massive simulations

1149
00:58:00.480 --> 00:58:03.639
<v Speaker 2>of SpiNNaker and the accelerated physics of BrainScaleS.

1150
00:58:04.400 --> 00:58:08.360
<v Speaker 2>We dove into the holy grail of memristors, physical devices

1151
00:58:08.360 --> 00:58:11.679
<v Speaker 2>that literally remember their own electrical history like a living synapse,

1152
00:58:12.039 --> 00:58:14.559
<v Speaker 2>allowing the laws of physics to do the math for us.

1153
00:58:14.559 --> 00:58:17.400
<v Speaker 3>And we saw how this technology is finding its true

1154
00:58:17.400 --> 00:58:20.519
<v Speaker 3>calling at the edge of the network, powering the survival

1155
00:58:20.559 --> 00:58:22.559
<v Speaker 3>instincts of next generation robotics.

1156
00:58:22.639 --> 00:58:25.840
<v Speaker 2>It is a vast, incredibly complex landscape.

1157
00:58:25.320 --> 00:58:28.920
<v Speaker 3>It is, but the overarching trajectory is clear, and I

1158
00:58:29.000 --> 00:58:33.440
<v Speaker 3>believe inevitable. Biological brains have already provided an existence proof.

1159
00:58:33.480 --> 00:58:34.639
<v Speaker 2>They've proven it's possible.

1160
00:58:34.719 --> 00:58:39.159
<v Speaker 3>They have proven definitively that general, highly flexible, adaptable intelligence

1161
00:58:39.159 --> 00:58:42.039
<v Speaker 3>can run on just twenty watts of power. Replicating that

1162
00:58:42.079 --> 00:58:45.360
<v Speaker 3>precise physical operating system in silicon isn't just a quirky

1163
00:58:45.400 --> 00:58:47.039
<v Speaker 3>alternative to mainstream computing.

1164
00:58:47.119 --> 00:58:47.679
<v Speaker 2>It's essential.

1165
00:58:47.840 --> 00:58:51.639
<v Speaker 3>It is perhaps the most ambitious, fundamental and consequential project

1166
00:58:51.639 --> 00:58:55.119
<v Speaker 3>in the history of human engineering. The biological brain has

1167
00:58:55.119 --> 00:58:59.079
<v Speaker 3>solved the thermodynamic problem of intelligent computation in a way

1168
00:58:59.119 --> 00:59:00.960
<v Speaker 3>that our current machines simply have not.

1169
00:59:01.239 --> 00:59:03.920
<v Speaker 2>I want to leave you, the listener, with a completely

1170
00:59:03.960 --> 00:59:07.400
<v Speaker 2>new thought to mull over, something that builds on everything

1171
00:59:07.440 --> 00:59:10.800
<v Speaker 2>we've discussed about physically mirroring the brain in hardware. What's

1172
00:59:10.840 --> 00:59:14.760
<v Speaker 2>the thought? If we truly succeed in making this hardware,

1173
00:59:15.360 --> 00:59:19.559
<v Speaker 2>if we overcome the massive materials science hurdles with memristors,

1174
00:59:19.760 --> 00:59:22.800
<v Speaker 2>if we finally crack the software wall, if our machines

1175
00:59:22.840 --> 00:59:26.840
<v Speaker 2>start learning locally, adapting dynamically, and physically rewriting their own

1176
00:59:26.880 --> 00:59:31.199
<v Speaker 2>electrical pathways exactly like our biological minds do, will these

1177
00:59:31.239 --> 00:59:35.079
<v Speaker 2>neuromorphic computers eventually inherit our biological flaws?

1178
00:59:35.119 --> 00:59:37.480
<v Speaker 3>Wow, that's a fascinating question.

1179
00:59:37.599 --> 00:59:40.880
<v Speaker 2>Think about it. We know human brains are highly susceptible

1180
00:59:40.920 --> 00:59:44.440
<v Speaker 2>to optical illusions, specifically because of how our neural pathways

1181
00:59:44.480 --> 00:59:46.760
<v Speaker 2>take rapid shortcuts to save energy.

1182
00:59:46.960 --> 00:59:51.360
<v Speaker 3>We suffer from cognitive fatigue when our synaptic neurotransmitters are depleted.

1183
00:59:51.800 --> 00:59:56.639
<v Speaker 2>Exactly, we are prone to irrationality and emotional biases because

1184
00:59:56.639 --> 01:00:01.079
<v Speaker 2>our physical wiring connects logic centers directly to deep survival instincts.

1185
01:00:01.079 --> 01:00:02.119
<v Speaker 3>That makes perfect sense.

1186
01:00:02.239 --> 01:00:05.800
<v Speaker 2>If we build an artificial brain that physically thinks exactly

1187
01:00:05.960 --> 01:00:08.719
<v Speaker 2>like us just to save power, what else does it

1188
01:00:08.760 --> 01:00:11.760
<v Speaker 2>start doing like us? Does a neuromorphic AI start seeing

1189
01:00:11.800 --> 01:00:12.719
<v Speaker 2>phantom patterns?

1190
01:00:12.920 --> 01:00:16.800
<v Speaker 3>Does an autonomous robot experience the silicon equivalent of fatigue

1191
01:00:17.039 --> 01:00:19.960
<v Speaker 3>or a loss of attention after processing too much sensory

1192
01:00:20.079 --> 01:00:21.280
<v Speaker 3>data without a reset?

1193
01:00:21.519 --> 01:00:24.239
<v Speaker 2>If we build the machine in our exact physical image,

1194
01:00:24.360 --> 01:00:27.840
<v Speaker 2>do we inevitably build our biological vulnerabilities into it as well?

1195
01:00:28.079 --> 01:00:28.920
<v Speaker 2>Something to ponder.
