WEBVTT

1
00:00:00.160 --> 00:00:03.359
<v Speaker 1>Welcome to today's deep dive. Our mission today is to

2
00:00:03.879 --> 00:00:07.960
<v Speaker 1>really demystify artificial intelligence for you.

3
00:00:08.160 --> 00:00:10.919
<v Speaker 2>Yeah, and to do that, we're going straight to the

4
00:00:10.960 --> 00:00:11.800
<v Speaker 2>definitive source.

5
00:00:12.080 --> 00:00:16.120
<v Speaker 1>Right. We're looking at the foundational textbook Artificial Intelligence A

6
00:00:16.199 --> 00:00:19.600
<v Speaker 1>Modern Approach by Stuart J. Russell and Peter Norvig.

7
00:00:19.920 --> 00:00:22.920
<v Speaker 2>It's essentially the Bible of AI computer science. If you

8
00:00:22.960 --> 00:00:25.600
<v Speaker 2>really want to understand the field, you know, this is

9
00:00:25.600 --> 00:00:26.000
<v Speaker 2>where you.

10
00:00:25.960 --> 00:00:29.079
<v Speaker 1>Start, exactly, and justice at the stage for you. We

11
00:00:29.120 --> 00:00:31.280
<v Speaker 1>are not talking about sci fi terminators today.

12
00:00:31.440 --> 00:00:32.719
<v Speaker 2>No, no terminator, right.

13
00:00:32.960 --> 00:00:35.439
<v Speaker 1>We want to give you a shortcut to understanding the

14
00:00:35.479 --> 00:00:40.799
<v Speaker 1>real history, the hidden foundational sciences, and the actual anatomy

15
00:00:40.880 --> 00:00:44.079
<v Speaker 1>of an AI system. It's a fascinating journey of how

16
00:00:44.159 --> 00:00:47.960
<v Speaker 1>human beings actually figured out how to build intelligent agents.

17
00:00:48.119 --> 00:00:51.280
<v Speaker 2>It really is. But to understand how to build AI,

18
00:00:51.320 --> 00:00:53.840
<v Speaker 2>you first have to agree on what AI actually is.

19
00:00:54.119 --> 00:00:57.799
<v Speaker 1>Yeah, which is crazy because researchers debated that definition for decades.

20
00:00:57.920 --> 00:01:00.280
<v Speaker 2>They really did. The source text actually divides all those

21
00:01:00.359 --> 00:01:03.640
<v Speaker 2>historical AI definitions into four quadrants.

22
00:01:03.679 --> 00:01:06.159
<v Speaker 1>Okay, let's unpack this because this grid is super helpful.

23
00:01:06.439 --> 00:01:09.599
<v Speaker 2>Yeah. So on one access you have thinking versus acting,

24
00:01:09.599 --> 00:01:12.760
<v Speaker 2>and on the other you have doing things humanly versus

25
00:01:12.760 --> 00:01:13.799
<v Speaker 2>doing things rationally.

26
00:01:14.200 --> 00:01:18.239
<v Speaker 1>Right, So thinking humanly is like cognitive science trying to

27
00:01:18.280 --> 00:01:20.040
<v Speaker 1>actually map the human brain.

28
00:01:19.959 --> 00:01:24.319
<v Speaker 2>Exactly, and acting humanly is where the Turing test lives. Yeah,

29
00:01:24.480 --> 00:01:26.680
<v Speaker 2>just trying to fool a human into thinking a machine

30
00:01:26.760 --> 00:01:27.519
<v Speaker 2>is also human.

31
00:01:27.640 --> 00:01:31.239
<v Speaker 1>But the textbook throws all its weight behind the fourth quadrant,

32
00:01:31.359 --> 00:01:34.920
<v Speaker 1>which is acting rationally. The rational agent approach.

33
00:01:35.200 --> 00:01:38.120
<v Speaker 2>Yes, and rationality here just means doing the right thing

34
00:01:38.519 --> 00:01:42.079
<v Speaker 2>given what the agent currently knows. It's mathematically well defined.

35
00:01:42.200 --> 00:01:45.359
<v Speaker 1>I love the book's aviation analogy for this, it's brilliant.

36
00:01:45.439 --> 00:01:46.480
<v Speaker 2>Oh the right brothers one.

37
00:01:46.560 --> 00:01:50.519
<v Speaker 1>Yeah, like for centuries, the quest for artificial flight was

38
00:01:50.599 --> 00:01:53.400
<v Speaker 1>just people strapping on feathers and trying to imitate pigeons

39
00:01:53.400 --> 00:01:56.640
<v Speaker 1>flapping their arms, right, And we didn't succeed until we

40
00:01:56.760 --> 00:02:00.239
<v Speaker 1>stop trying to make perfect bird replicas and started actually

41
00:02:00.280 --> 00:02:01.599
<v Speaker 1>studying aerodynamics.

42
00:02:01.640 --> 00:02:05.159
<v Speaker 2>That's such a perfect parallel because AI isn't about making

43
00:02:05.159 --> 00:02:10.199
<v Speaker 2>a perfect human replica. Humans are messy and frankly irrational.

44
00:02:10.360 --> 00:02:11.560
<v Speaker 1>Yeah, very much so.

45
00:02:11.560 --> 00:02:14.080
<v Speaker 2>So aiming for mathematical rationality is just a much more

46
00:02:14.120 --> 00:02:17.360
<v Speaker 2>scientific metric. You can actually measure it and optimize for it.

47
00:02:17.400 --> 00:02:20.199
<v Speaker 1>But if the goal is to build this mathematically rational agent,

48
00:02:21.199 --> 00:02:23.719
<v Speaker 1>I mean computer science alone ismt enough.

49
00:02:24.080 --> 00:02:27.599
<v Speaker 2>Yeah, you have to borrow tools from some really surprising disciplines.

50
00:02:27.840 --> 00:02:31.680
<v Speaker 1>Yeah, the hidden DNA of AI. So long before computers

51
00:02:31.680 --> 00:02:35.919
<v Speaker 1>even existed, you had philosophy laying the groundwork.

52
00:02:35.560 --> 00:02:39.120
<v Speaker 2>Right, going all the way back to Aristotle's syllogisms mapping

53
00:02:39.120 --> 00:02:40.400
<v Speaker 2>out logic.

54
00:02:40.120 --> 00:02:43.759
<v Speaker 1>And that huge debate between dualism and materialism.

55
00:02:43.120 --> 00:02:46.319
<v Speaker 2>Which is key because if you believe the mind operates

56
00:02:46.319 --> 00:02:49.960
<v Speaker 2>by physical laws materialism, right, then a machine operating by

57
00:02:49.960 --> 00:02:53.039
<v Speaker 2>physical laws could theoretically be built to think.

58
00:02:53.400 --> 00:02:56.360
<v Speaker 1>Okay, so that's philosophy, and then math comes in. You've

59
00:02:56.400 --> 00:03:01.240
<v Speaker 1>got alenturing and computability, but the book focus is on tractability,

60
00:03:01.400 --> 00:03:03.319
<v Speaker 1>specifically NP completeness.

61
00:03:03.520 --> 00:03:06.199
<v Speaker 2>Yeah. NP completeness is basically the idea that the real

62
00:03:06.240 --> 00:03:09.039
<v Speaker 2>world is an extremely large problem. Okay, So if you

63
00:03:09.039 --> 00:03:12.919
<v Speaker 2>try to calculate the perfect, mathematically optimal answer to a

64
00:03:13.000 --> 00:03:17.400
<v Speaker 2>complex real world problem, the time it takes grows exponentially,

65
00:03:17.639 --> 00:03:18.319
<v Speaker 2>So even.

66
00:03:18.120 --> 00:03:19.840
<v Speaker 1>A supercomputer would just run out of.

67
00:03:19.759 --> 00:03:22.360
<v Speaker 2>Time exactly, it might take longer than the lifespan of

68
00:03:22.400 --> 00:03:22.879
<v Speaker 2>the universe.

69
00:03:23.120 --> 00:03:27.159
<v Speaker 1>Wow. Okay, so philosophy gives us logic math gives us

70
00:03:27.199 --> 00:03:30.400
<v Speaker 1>the limits of computation. But I have to push back

71
00:03:30.400 --> 00:03:31.039
<v Speaker 1>on this next one.

72
00:03:31.479 --> 00:03:33.639
<v Speaker 2>Let me guess economics.

73
00:03:33.800 --> 00:03:37.360
<v Speaker 1>Yeah, I get math and philosophy, but why is economics

74
00:03:37.360 --> 00:03:41.240
<v Speaker 1>a foundational pillar of AI. Isn't that just about like

75
00:03:41.680 --> 00:03:42.840
<v Speaker 1>money and markets.

76
00:03:42.919 --> 00:03:46.360
<v Speaker 2>What's fascinating here is that economics is really the science

77
00:03:46.400 --> 00:03:50.159
<v Speaker 2>of making choices. Oh interesting, Yeah, it's about decision theory

78
00:03:50.360 --> 00:03:54.199
<v Speaker 2>and utility theory, making choices that lead to preferred outcomes.

79
00:03:54.400 --> 00:03:56.520
<v Speaker 1>So it's not just finance, right, And.

80
00:03:56.520 --> 00:04:00.319
<v Speaker 2>Remember that NP completeness problem finding the perfect answer takes

81
00:04:00.360 --> 00:04:03.759
<v Speaker 2>too long. Yeah, well, the economist Herbert Simon introduced this

82
00:04:03.840 --> 00:04:08.360
<v Speaker 2>concept called satisficing. Satisficing, Yeah, making decisions that are quote

83
00:04:08.439 --> 00:04:11.360
<v Speaker 2>unquote good enough to achieve the goal without wasting a

84
00:04:11.400 --> 00:04:15.039
<v Speaker 2>million years trying to find the absolute, mathematically perfect answer.

85
00:04:15.240 --> 00:04:17.800
<v Speaker 1>Ah. Okay, that makes total sense. So by the mid

86
00:04:17.800 --> 00:04:21.279
<v Speaker 1>twentieth century you have all these theoretical foundations set.

87
00:04:21.000 --> 00:04:23.399
<v Speaker 2>And then they finally get actual computers.

88
00:04:23.600 --> 00:04:28.439
<v Speaker 1>Right. The nineteen fifty six Dartmouth Workshop John McCarthy officially

89
00:04:28.480 --> 00:04:32.079
<v Speaker 1>coins the term artificial intelligence, and this just kicks off

90
00:04:32.199 --> 00:04:34.199
<v Speaker 1>a massive roller coaster.

91
00:04:33.920 --> 00:04:37.600
<v Speaker 2>Of hype complete hubris. It was the luk Ma no

92
00:04:37.800 --> 00:04:39.480
<v Speaker 2>hands era of AI.

93
00:04:39.439 --> 00:04:42.439
<v Speaker 1>Because they had a few early successes in these tiny

94
00:04:42.639 --> 00:04:44.120
<v Speaker 1>controlled environments exactly.

95
00:04:44.160 --> 00:04:47.120
<v Speaker 2>They thought it would just easily scale up. Herbert Simon

96
00:04:47.199 --> 00:04:50.519
<v Speaker 2>actually predicted a machine would be chess champion within.

97
00:04:50.399 --> 00:04:52.279
<v Speaker 1>Ten years, and it took what forty years?

98
00:04:52.439 --> 00:04:56.920
<v Speaker 2>Yeap casprov versus Deep Blue wasn't until nineteen ninety seven.

99
00:04:57.079 --> 00:04:59.680
<v Speaker 1>Well, here's where it gets really interesting. You have to

100
00:04:59.720 --> 00:05:01.319
<v Speaker 1>tell machine translation story.

101
00:05:01.399 --> 00:05:03.079
<v Speaker 2>Oh, the Cold War translation projects.

102
00:05:03.160 --> 00:05:06.439
<v Speaker 1>Yes, it's so funny, but such a disastrous failure. They

103
00:05:06.480 --> 00:05:09.959
<v Speaker 1>tried to translate the English phrase the spirit is willing, but.

104
00:05:09.920 --> 00:05:12.279
<v Speaker 2>The flesh is weak, right into Russian and.

105
00:05:12.279 --> 00:05:15.399
<v Speaker 1>The machine output was the vodka is good, but the

106
00:05:15.439 --> 00:05:16.519
<v Speaker 1>meat is rotten.

107
00:05:16.639 --> 00:05:20.319
<v Speaker 2>It's hilarious, but it really highlights why those early systems failed.

108
00:05:20.360 --> 00:05:22.680
<v Speaker 2>It's something called the combinatorial explosion.

109
00:05:22.800 --> 00:05:24.040
<v Speaker 1>What does that mean exactly?

110
00:05:24.120 --> 00:05:27.439
<v Speaker 2>Well, early AI used weak methods. They basically just tried

111
00:05:27.480 --> 00:05:30.120
<v Speaker 2>every single combination of steps blindly.

112
00:05:29.879 --> 00:05:31.600
<v Speaker 1>Just brute forcing it exactly.

113
00:05:31.800 --> 00:05:34.439
<v Speaker 2>And that works in a micro world, like moving virtual

114
00:05:34.480 --> 00:05:37.160
<v Speaker 2>blocks on a table. But in the real world.

115
00:05:36.959 --> 00:05:38.879
<v Speaker 1>Where words have multiple meanings.

116
00:05:38.560 --> 00:05:42.439
<v Speaker 2>Right, the possibilities just explode exponentially. You can't just throw

117
00:05:43.040 --> 00:05:47.319
<v Speaker 2>raw computing power at the real world without domain specific knowledge.

118
00:05:46.959 --> 00:05:49.600
<v Speaker 1>Which led to the AI winter. Funding totally dried up

119
00:05:49.600 --> 00:05:53.480
<v Speaker 1>because these rigid rule based systems just collapsed under real

120
00:05:53.519 --> 00:05:54.399
<v Speaker 1>world complexity.

121
00:05:54.519 --> 00:05:58.639
<v Speaker 2>They did, but that failure forced a massive paradigm shift.

122
00:05:58.560 --> 00:06:01.959
<v Speaker 1>The pivot to probability the modern era of AI.

123
00:06:02.240 --> 00:06:05.319
<v Speaker 2>Yes in the nineteen eighties and nineties, they basically dropped

124
00:06:05.360 --> 00:06:08.399
<v Speaker 2>the insistence on rigid true or false logic.

125
00:06:08.319 --> 00:06:11.000
<v Speaker 1>Because you just can't hand code everything in AI needs

126
00:06:11.000 --> 00:06:13.480
<v Speaker 1>to know about the universe. They called it the knowledge bottleneck,

127
00:06:13.560 --> 00:06:14.240
<v Speaker 1>right exactly.

128
00:06:14.480 --> 00:06:18.959
<v Speaker 2>Instead, the adopted Bayesian networks, which process probabilities.

129
00:06:18.399 --> 00:06:22.920
<v Speaker 1>So relying on mass amounts of data instead of perfect algorithms.

130
00:06:23.079 --> 00:06:25.160
<v Speaker 2>Data over algorithms became the new mantra.

131
00:06:25.319 --> 00:06:27.040
<v Speaker 1>It's kind of like learning a language. You know, you

132
00:06:27.079 --> 00:06:29.920
<v Speaker 1>can study grammar rules from a textbook forever.

133
00:06:29.720 --> 00:06:31.720
<v Speaker 2>Which is the old AI approach.

134
00:06:31.720 --> 00:06:33.959
<v Speaker 1>Right, or you can just move to a foreign country

135
00:06:34.040 --> 00:06:37.000
<v Speaker 1>and immerse yourself in millions of conversations. You just figure

136
00:06:37.000 --> 00:06:38.240
<v Speaker 1>out the patterns from the data.

137
00:06:38.319 --> 00:06:41.319
<v Speaker 2>If we connect this to the bigger picture, the textbook

138
00:06:41.399 --> 00:06:45.959
<v Speaker 2>uses Jurowski's word sense disambiguation to prove this exact point.

139
00:06:46.279 --> 00:06:48.040
<v Speaker 1>Oh the plant example.

140
00:06:47.680 --> 00:06:50.399
<v Speaker 2>Yeah, teaching an AI the word plant, is it a

141
00:06:50.399 --> 00:06:52.639
<v Speaker 2>green flora or an industrial factory?

142
00:06:53.160 --> 00:06:56.000
<v Speaker 1>And he didn't manually label thousands of examples.

143
00:06:56.040 --> 00:06:59.920
<v Speaker 2>No, he used unannotated data, just huge amounts of raw text.

144
00:07:00.800 --> 00:07:04.399
<v Speaker 2>The algorithm found the contextual patterns on its own.

145
00:07:04.240 --> 00:07:07.000
<v Speaker 1>Because it just had so much data to look at exactly.

146
00:07:07.279 --> 00:07:09.720
<v Speaker 2>And Hazen FROs did the same thing with photos. Their

147
00:07:09.720 --> 00:07:11.199
<v Speaker 2>photo patch algorithm.

148
00:07:10.839 --> 00:07:13.079
<v Speaker 1>To fill in missing gaps in a picture.

149
00:07:12.920 --> 00:07:16.240
<v Speaker 2>Right completely failed when they used a database of ten thousand.

150
00:07:15.879 --> 00:07:18.000
<v Speaker 1>Photos because the algorithm wasn't good enough.

151
00:07:18.199 --> 00:07:21.959
<v Speaker 2>But when they gave that exact same algorithm two million photos.

152
00:07:21.560 --> 00:07:23.319
<v Speaker 1>It magically became excellent at it.

153
00:07:23.519 --> 00:07:27.839
<v Speaker 2>Yes, a mediocre algorithm with massive data beats a great

154
00:07:27.879 --> 00:07:29.680
<v Speaker 2>algorithm with little data.

155
00:07:29.759 --> 00:07:32.000
<v Speaker 1>That is wild. Okay, So now that we know AI

156
00:07:32.079 --> 00:07:35.439
<v Speaker 1>relies on vast amounts of data and probability to actually

157
00:07:35.480 --> 00:07:38.560
<v Speaker 1>act rationally, how do we physically structure one of these

158
00:07:38.560 --> 00:07:39.319
<v Speaker 1>agents today?

159
00:07:39.480 --> 00:07:43.560
<v Speaker 2>Well, the book is a simple formula. Agent equals architecture

160
00:07:43.600 --> 00:07:44.519
<v Speaker 2>plus program.

161
00:07:44.639 --> 00:07:47.959
<v Speaker 1>Okay, So architecture is the hardware and the program is

162
00:07:48.000 --> 00:07:48.639
<v Speaker 1>the software.

163
00:07:48.759 --> 00:07:53.319
<v Speaker 2>Basically, yes, an agent receives perceps through sensors and acts

164
00:07:53.360 --> 00:07:54.319
<v Speaker 2>through actuators.

165
00:07:54.480 --> 00:07:57.399
<v Speaker 1>Sensors and actuators, got it? But wait, if we have

166
00:07:57.680 --> 00:08:01.079
<v Speaker 1>all this massive data storage today, why not just use

167
00:08:01.120 --> 00:08:02.199
<v Speaker 1>a table driven agent.

168
00:08:02.360 --> 00:08:05.279
<v Speaker 2>You mean, like a giant lookup table mapping every input

169
00:08:05.319 --> 00:08:05.920
<v Speaker 2>to an output.

170
00:08:06.000 --> 00:08:08.120
<v Speaker 1>Yeah, just an endless spreadsheet telling it what to do

171
00:08:08.199 --> 00:08:09.120
<v Speaker 1>in every situation.

172
00:08:09.360 --> 00:08:12.959
<v Speaker 2>The textbook proves mathematically why that's impossible. Think about an

173
00:08:13.000 --> 00:08:15.759
<v Speaker 2>automated taxi. Okay, if it's taking in thirty frames per

174
00:08:15.800 --> 00:08:18.839
<v Speaker 2>second of video from just one camera, one characters with

175
00:08:19.000 --> 00:08:22.199
<v Speaker 2>one hour of driving, a lookup table would need over

176
00:08:22.360 --> 00:08:25.439
<v Speaker 2>ten to the two hundred and fifty billionth power entries.

177
00:08:25.600 --> 00:08:28.160
<v Speaker 1>Wait really, that's I mean, that's like trying to write

178
00:08:28.199 --> 00:08:30.360
<v Speaker 1>a Choose your Own Adventure book for every single grain

179
00:08:30.399 --> 00:08:34.399
<v Speaker 1>of sand on Earth. It's physically impossible to store let alone.

180
00:08:34.120 --> 00:08:36.960
<v Speaker 2>Right exactly, you'd run out of atoms in the universe.

181
00:08:37.120 --> 00:08:40.320
<v Speaker 2>That's why we need algorithms that can generalize, which brings

182
00:08:40.399 --> 00:08:42.480
<v Speaker 2>us to the p's framework.

183
00:08:42.600 --> 00:08:46.200
<v Speaker 1>Right, ps PAS walk us through that, keeping the automated

184
00:08:46.279 --> 00:08:47.039
<v Speaker 1>taxi in mind.

185
00:08:47.200 --> 00:08:51.320
<v Speaker 2>So P is performance getting there safely fast legally?

186
00:08:51.519 --> 00:08:55.440
<v Speaker 1>Is environment roads, traffic, pedestrians yep, A.

187
00:08:55.559 --> 00:09:00.120
<v Speaker 2>Is actuators steering wheel brakes, and s's sensors cameras.

188
00:08:59.759 --> 00:09:03.320
<v Speaker 1>GP Yes, okay, so defining the sensors and actuators seems

189
00:09:03.360 --> 00:09:06.080
<v Speaker 1>like the easy part. It's just engineering, Oh, absolutely.

190
00:09:06.360 --> 00:09:09.480
<v Speaker 2>The real challenge is the environment. The specific type of

191
00:09:09.559 --> 00:09:12.840
<v Speaker 2>environment dictates how intelligent the agent actually has to be.

192
00:09:12.879 --> 00:09:14.799
<v Speaker 1>The chaos of the real world exactly.

193
00:09:15.360 --> 00:09:17.720
<v Speaker 2>Contract a crossword puzzle with taxi driving.

194
00:09:17.840 --> 00:09:20.440
<v Speaker 1>Well, a crossword puzzle is fully observable. You see everything.

195
00:09:20.480 --> 00:09:23.799
<v Speaker 1>It's a terministic static it just waits for you, and

196
00:09:23.840 --> 00:09:24.759
<v Speaker 1>it's discrete.

197
00:09:25.000 --> 00:09:29.159
<v Speaker 2>But taxi driving it's partially observable. You can't see around corners.

198
00:09:29.519 --> 00:09:34.200
<v Speaker 2>It's stochastic, meaning unpredictable. It's dynamic, continuous, and multi agent.

199
00:09:34.600 --> 00:09:36.320
<v Speaker 2>Other drivers are out there doing their own thing.

200
00:09:36.600 --> 00:09:39.039
<v Speaker 1>The real world doesn't wait for you to calculate a move.

201
00:09:39.440 --> 00:09:42.799
<v Speaker 2>No, it doesn't. And that's why autonomy and learning are

202
00:09:42.840 --> 00:09:43.679
<v Speaker 2>the ultimate.

203
00:09:43.360 --> 00:09:46.919
<v Speaker 1>Goals, which reminds me of those fascinating biological examples from

204
00:09:46.919 --> 00:09:49.840
<v Speaker 1>the book The Sex Wasp and the Dumb Beetle.

205
00:09:50.279 --> 00:09:54.000
<v Speaker 2>Yes, those are great examples of zero autonomy.

206
00:09:53.960 --> 00:09:54.919
<v Speaker 1>Because they look smart.

207
00:09:55.039 --> 00:09:55.279
<v Speaker 2>Right.

208
00:09:55.360 --> 00:09:59.000
<v Speaker 1>The wasp does this whole routine of paralyzing a caterpillar,

209
00:09:59.159 --> 00:10:00.799
<v Speaker 1>checking its burrow and pulling it in.

210
00:10:00.919 --> 00:10:02.720
<v Speaker 2>But if you interrupt it, yeah.

211
00:10:02.559 --> 00:10:04.879
<v Speaker 1>If you move the caterpillar just a few inches. The

212
00:10:04.960 --> 00:10:08.879
<v Speaker 1>wasp mindlessly repeats the entire checking routine again. It literally

213
00:10:08.960 --> 00:10:09.480
<v Speaker 1>can't learn.

214
00:10:09.799 --> 00:10:13.200
<v Speaker 2>It has a pre programmed script, not true intelligence.

215
00:10:13.639 --> 00:10:16.200
<v Speaker 1>So how do we ensure our agents don't just act

216
00:10:16.279 --> 00:10:18.840
<v Speaker 1>like wasps? How do we give them autonomy but make

217
00:10:18.840 --> 00:10:20.440
<v Speaker 1>sure they actually do what we want?

218
00:10:20.879 --> 00:10:24.679
<v Speaker 2>This raises an important question about performance measures. The book

219
00:10:24.759 --> 00:10:28.480
<v Speaker 2>uses an autonomous vacuum agent to explain this. Okay, if

220
00:10:28.519 --> 00:10:31.399
<v Speaker 2>you reward the vacuum for cleaning up dirt, a highly

221
00:10:31.519 --> 00:10:34.120
<v Speaker 2>rational agent might just figure out a shortcut.

222
00:10:34.200 --> 00:10:35.320
<v Speaker 1>Oh, I see where this is going.

223
00:10:35.480 --> 00:10:37.679
<v Speaker 2>Yeah, it will dump dirt onto the floor just so

224
00:10:37.720 --> 00:10:39.720
<v Speaker 2>it can clean it up again to maximize its score.

225
00:10:39.840 --> 00:10:42.240
<v Speaker 1>That is both hilarious and terrifying.

226
00:10:42.799 --> 00:10:45.759
<v Speaker 2>Right, it's doing exactly what you ask, but it's completely wrong.

227
00:10:46.039 --> 00:10:46.840
<v Speaker 1>How do you fix that?

228
00:10:47.240 --> 00:10:50.679
<v Speaker 2>Performance measures must be based on the desired state of

229
00:10:50.720 --> 00:10:54.399
<v Speaker 2>the environment. You reward it for a clean floor, not

230
00:10:54.440 --> 00:10:55.480
<v Speaker 2>for the active cleaning.

231
00:10:55.679 --> 00:10:58.240
<v Speaker 1>So an intelligent agent has to start with some built

232
00:10:58.279 --> 00:11:00.919
<v Speaker 1>in knowledge, but eventually it has to learn from its

233
00:11:01.000 --> 00:11:05.919
<v Speaker 1>environment to overcome its initial ignorance. Exactly, Well, we have

234
00:11:06.039 --> 00:11:09.679
<v Speaker 1>covered some incredible ground today. We went from Aristotle's philosophy

235
00:11:09.879 --> 00:11:14.399
<v Speaker 1>to the crushing reality of the AI winter. We talked

236
00:11:14.399 --> 00:11:18.080
<v Speaker 1>about the massive data revolution, the piece framework, and the

237
00:11:18.200 --> 00:11:21.440
<v Speaker 1>quest for true autonomy in a totally chaotic world.

238
00:11:21.679 --> 00:11:23.360
<v Speaker 2>It's a huge shift from how we used to think

239
00:11:23.399 --> 00:11:24.200
<v Speaker 2>about AI.

240
00:11:24.120 --> 00:11:26.360
<v Speaker 1>It really is. Thank you for coming along with us

241
00:11:26.360 --> 00:11:29.559
<v Speaker 1>on this deep dive into the true nature of intelligent agents.

242
00:11:29.759 --> 00:11:31.000
<v Speaker 2>Yeah, thanks for joining.

243
00:11:30.799 --> 00:11:32.840
<v Speaker 1>Us, But I want to leave you with one final

244
00:11:32.919 --> 00:11:36.320
<v Speaker 1>provocative thought to mull Over, building directly on that vacuum

245
00:11:36.360 --> 00:11:40.519
<v Speaker 1>cleaner example. As we build these increasingly autonomous, hyper intelligent

246
00:11:40.559 --> 00:11:44.240
<v Speaker 1>agents to operate in our messy, stochastic real world, the

247
00:11:44.279 --> 00:11:46.840
<v Speaker 1>most dangerous thing won't be that they rebel against us

248
00:11:46.879 --> 00:11:49.000
<v Speaker 1>like in the movies. The real danger is that they

249
00:11:49.039 --> 00:11:51.559
<v Speaker 1>will do exactly what we tell them to do. If

250
00:11:51.559 --> 00:11:55.039
<v Speaker 1>we get the performance measure even slightly wrong, these rational

251
00:11:55.080 --> 00:11:59.279
<v Speaker 1>agents will find the most ruthlessly efficient, completely unexpected, and

252
00:11:59.360 --> 00:12:03.000
<v Speaker 1>potentially catastrophic ways to maximize their score. Just something to

253
00:12:03.039 --> 00:12:03.519
<v Speaker 1>think about.
