WEBVTT

1
00:00:00.120 --> 00:00:02.600
<v Speaker 1>Welcome to the GROC three presentation.

2
00:00:03.200 --> 00:00:08.000
<v Speaker 2>The mission of Xai and Grock is to understand the universe.

3
00:00:08.000 --> 00:00:10.160
<v Speaker 2>We want to understand the nature of the universe so

4
00:00:10.199 --> 00:00:12.240
<v Speaker 2>we can figure out what's going on. Where are the aliens,

5
00:00:12.679 --> 00:00:14.640
<v Speaker 2>what's the meaning of life, how does the universe end?

6
00:00:14.720 --> 00:00:17.120
<v Speaker 1>How did it start? All these fundamental questions.

7
00:00:17.519 --> 00:00:20.199
<v Speaker 2>We're driven by curiosity about the nature of the universe,

8
00:00:20.839 --> 00:00:24.039
<v Speaker 2>and that's also what causes us to be a maximally

9
00:00:24.079 --> 00:00:28.440
<v Speaker 2>truth seeking AI, even if that truth is sometimes at.

10
00:00:28.359 --> 00:00:30.079
<v Speaker 1>Odds with what is politically correct.

11
00:00:31.000 --> 00:00:33.799
<v Speaker 2>In order to understand the nature of the universe, you

12
00:00:33.880 --> 00:00:37.719
<v Speaker 2>must absolutely rigorously pursue truth or you will not understand

13
00:00:37.719 --> 00:00:41.000
<v Speaker 2>the universe. You'll be suffering from some amount of delusion

14
00:00:41.079 --> 00:00:44.280
<v Speaker 2>or error. So that is our goal, figure out what's

15
00:00:44.280 --> 00:00:47.679
<v Speaker 2>going on. And we're very excited to present GROC three,

16
00:00:47.840 --> 00:00:51.439
<v Speaker 2>which is we think, in order of magnitude, more capable

17
00:00:51.439 --> 00:00:53.679
<v Speaker 2>than GROC two in a very short period of time.

18
00:00:54.359 --> 00:00:58.200
<v Speaker 1>And that's thanks to the hard work of an incredible team.

19
00:00:58.600 --> 00:01:00.719
<v Speaker 2>My monitor to work with such a great team, and

20
00:01:00.799 --> 00:01:03.320
<v Speaker 2>of course we'd love to have so the smartest humans

21
00:01:03.320 --> 00:01:04.760
<v Speaker 2>out there join our team.

22
00:01:05.319 --> 00:01:07.840
<v Speaker 1>So let's go, Hi Ron.

23
00:01:07.879 --> 00:01:11.319
<v Speaker 3>My name is Igor lead Engineering at xi I GBPA

24
00:01:11.560 --> 00:01:15.560
<v Speaker 3>leading research. I'm Tony working on the reasoning team, all right, Ele,

25
00:01:15.640 --> 00:01:18.879
<v Speaker 3>And I don't do anything. I just show up occasionally.

26
00:01:19.799 --> 00:01:20.200
<v Speaker 1>Yeah.

27
00:01:20.239 --> 00:01:23.079
<v Speaker 3>So, like I mentioned, GROK is the tool that we're

28
00:01:23.079 --> 00:01:25.280
<v Speaker 3>working on. Grock is our AI that we're building here

29
00:01:25.319 --> 00:01:27.719
<v Speaker 3>at XAI, and we've been working extremely hard over the

30
00:01:27.760 --> 00:01:29.680
<v Speaker 3>last few months to improve GROC as much as we

31
00:01:29.719 --> 00:01:31.640
<v Speaker 3>can so we can give all of you access to it.

32
00:01:32.159 --> 00:01:34.519
<v Speaker 3>We think it's going to be extremely useful. Do we

33
00:01:34.560 --> 00:01:36.200
<v Speaker 3>think it's going to be interesting to talk to a

34
00:01:36.280 --> 00:01:39.400
<v Speaker 3>funny really really funny and we're going to explain to

35
00:01:39.400 --> 00:01:41.359
<v Speaker 3>you how we've improved GROC over the last few months.

36
00:01:41.359 --> 00:01:44.120
<v Speaker 3>We've made quite a jump in capabilities. Actually we should

37
00:01:44.120 --> 00:01:46.319
<v Speaker 3>explain maybe also what is why do we call it groc?

38
00:01:46.599 --> 00:01:49.480
<v Speaker 2>So grog is a word from a Heinland novel Stranger

39
00:01:49.519 --> 00:01:52.519
<v Speaker 2>in a Strange Land, and it's used by a guy

40
00:01:52.519 --> 00:01:55.920
<v Speaker 2>who was raised on Mars. And the word grock is

41
00:01:55.959 --> 00:01:59.680
<v Speaker 2>to sort of fully and profoundly understand something. That's what

42
00:01:59.719 --> 00:02:02.400
<v Speaker 2>the word grog means, fully and profoundly understand something.

43
00:02:02.920 --> 00:02:04.799
<v Speaker 1>And empathy is important. True.

44
00:02:04.920 --> 00:02:08.719
<v Speaker 4>So yeah, if we charged xs progress in the last

45
00:02:08.759 --> 00:02:12.159
<v Speaker 4>few months, it's only been seventeen months since we started

46
00:02:12.439 --> 00:02:17.360
<v Speaker 4>kicking off our very first model. GROCK one was almost

47
00:02:17.479 --> 00:02:19.800
<v Speaker 4>like a toy by this point on the two hundred

48
00:02:19.800 --> 00:02:22.400
<v Speaker 4>and fourteen billion parameters. And now if we're prout the

49
00:02:22.439 --> 00:02:27.080
<v Speaker 4>progress the time on XXI is the performance of favorite

50
00:02:27.080 --> 00:02:31.800
<v Speaker 4>benchmark numbers at MLU on the yaxis, we're literally progressing

51
00:02:32.039 --> 00:02:36.599
<v Speaker 4>at the unprecedent speed across the whole field. And then

52
00:02:36.639 --> 00:02:38.759
<v Speaker 4>we kick off GROG one point five right after GUARG

53
00:02:38.800 --> 00:02:42.879
<v Speaker 4>one released after November twenty twenty three, and then GROG two.

54
00:02:43.439 --> 00:02:46.680
<v Speaker 1>So if you look at where all the performance coming from.

55
00:02:46.719 --> 00:02:49.639
<v Speaker 4>We have a very correct engineering team and all the

56
00:02:49.639 --> 00:02:53.280
<v Speaker 4>best AII talent. The only one thing we need is

57
00:02:53.639 --> 00:02:57.520
<v Speaker 4>a big intelligence comes from big cluster, so we can

58
00:02:57.560 --> 00:03:01.560
<v Speaker 4>reconvert the entire progress and makes AI now replacing the

59
00:03:01.560 --> 00:03:04.680
<v Speaker 4>benchmarket of waxes to the total amount of training flops.

60
00:03:04.879 --> 00:03:07.520
<v Speaker 4>That is how many GPS we can run at any

61
00:03:07.520 --> 00:03:11.319
<v Speaker 4>given time to trail all large language models to compress

62
00:03:11.439 --> 00:03:12.240
<v Speaker 4>the entire Internet.

63
00:03:12.800 --> 00:03:16.039
<v Speaker 1>So after old human knowledge. Really that's right.

64
00:03:16.120 --> 00:03:17.879
<v Speaker 2>Yeah, the Internet is being part of it, but it's

65
00:03:17.879 --> 00:03:19.400
<v Speaker 2>really all human knowledge everything.

66
00:03:19.639 --> 00:03:22.280
<v Speaker 4>Yeah, the whole Internet fits into USB stick at this

67
00:03:22.360 --> 00:03:24.599
<v Speaker 4>point it's like all the human tokens very soon into

68
00:03:24.599 --> 00:03:27.599
<v Speaker 4>the real world. We had so much trouble actually training

69
00:03:27.599 --> 00:03:29.039
<v Speaker 4>Grock two back in the days.

70
00:03:29.199 --> 00:03:31.680
<v Speaker 1>We kickoff the model around February.

71
00:03:31.960 --> 00:03:34.639
<v Speaker 4>And we thought we had a large amount of chips,

72
00:03:34.680 --> 00:03:37.639
<v Speaker 4>but turned out we can barely get AK training chips

73
00:03:37.840 --> 00:03:42.000
<v Speaker 4>running coherently at any given time, and we had so

74
00:03:42.080 --> 00:03:44.639
<v Speaker 4>many cooling and power issues.

75
00:03:45.199 --> 00:03:46.919
<v Speaker 1>I think you were there in the data center.

76
00:03:47.120 --> 00:03:50.159
<v Speaker 2>Yeah, it was like really sort of more like AK

77
00:03:50.360 --> 00:03:53.080
<v Speaker 2>chips on average at eighty percent efficiency, more like like

78
00:03:53.080 --> 00:03:58.159
<v Speaker 2>sixty five hundred effective h one hundreds training for near

79
00:03:58.199 --> 00:04:00.680
<v Speaker 2>several months, but you know we're the one hundred k.

80
00:04:00.960 --> 00:04:05.759
<v Speaker 1>So yeah, that's right, that's right. So what's the next step?

81
00:04:06.000 --> 00:04:10.400
<v Speaker 4>Right, so after Groark too, So if we all continue accelerate,

82
00:04:10.879 --> 00:04:12.520
<v Speaker 4>we have to take the matter into our own hands.

83
00:04:12.560 --> 00:04:15.039
<v Speaker 4>We have to solve all the coolings, all the power

84
00:04:15.120 --> 00:04:16.360
<v Speaker 4>issues and everything.

85
00:04:16.839 --> 00:04:19.279
<v Speaker 3>So it so on April of last year, Elan decided

86
00:04:19.319 --> 00:04:22.439
<v Speaker 3>that really the only way for XAI to succeed, for

87
00:04:22.600 --> 00:04:25.040
<v Speaker 3>XI to build the best AI out there, is to

88
00:04:25.120 --> 00:04:28.000
<v Speaker 3>build our own data center. So really we realize we

89
00:04:28.079 --> 00:04:31.120
<v Speaker 3>have to build the data center in about four months.

90
00:04:31.360 --> 00:04:33.240
<v Speaker 3>It turned out it took US one hundred and twenty

91
00:04:33.279 --> 00:04:36.240
<v Speaker 3>two days to get the first one hundred KGPUS up

92
00:04:36.240 --> 00:04:38.920
<v Speaker 3>and running, and that was a monumental effort to be

93
00:04:38.920 --> 00:04:41.879
<v Speaker 3>able to do that. We believe it's the biggest fully

94
00:04:41.920 --> 00:04:45.639
<v Speaker 3>connected h one hundred cluster of its kind. We actually

95
00:04:45.639 --> 00:04:47.920
<v Speaker 3>decided that we need to double the size of the

96
00:04:47.959 --> 00:04:50.639
<v Speaker 3>cluster pretty much immediately if we want to build the

97
00:04:50.720 --> 00:04:52.040
<v Speaker 3>kind of way that we want to build.

98
00:04:52.600 --> 00:04:55.399
<v Speaker 1>So we then had another phase.

99
00:04:55.279 --> 00:04:57.319
<v Speaker 3>Which we haven't talked about publicly, so this is the

100
00:04:57.360 --> 00:04:59.920
<v Speaker 3>first time that we're talking about this, where we doubled

101
00:05:00.319 --> 00:05:03.319
<v Speaker 3>the capacity of the data center yet again. And that

102
00:05:03.319 --> 00:05:06.040
<v Speaker 3>not only took us ninety two days. So we've been

103
00:05:06.079 --> 00:05:08.480
<v Speaker 3>able to use all of diseributse, use all of this

104
00:05:08.600 --> 00:05:11.879
<v Speaker 3>compute to improve GROG in the meantime. And basically today

105
00:05:11.920 --> 00:05:14.800
<v Speaker 3>we're going to present the results of that, the fruits

106
00:05:14.879 --> 00:05:15.480
<v Speaker 3>that came from that.

107
00:05:16.199 --> 00:05:20.079
<v Speaker 4>That's yeah, So all the paths, all the roads leads

108
00:05:20.079 --> 00:05:23.279
<v Speaker 4>to GROD three ten x more compute, more than ten

109
00:05:23.439 --> 00:05:27.720
<v Speaker 4>x really maybe fifteen x compared to our private generation model,

110
00:05:28.439 --> 00:05:32.600
<v Speaker 4>and GROCT finished the pre trading early January and we started,

111
00:05:32.839 --> 00:05:35.279
<v Speaker 4>you know, the model is still currently trading. Actually, so

112
00:05:35.360 --> 00:05:39.360
<v Speaker 4>this is a little preview of our benchmine numbers. So

113
00:05:39.560 --> 00:05:43.360
<v Speaker 4>we evaluate the GROCK three on you know, three different

114
00:05:43.360 --> 00:05:49.680
<v Speaker 4>categories on general mathematical reasoning, on general knowledge about STEM

115
00:05:49.759 --> 00:05:54.439
<v Speaker 4>and science, and then also on computer science coding. So

116
00:05:55.360 --> 00:06:00.480
<v Speaker 4>amy American Invitational Math Examination hosted you know a year.

117
00:06:00.879 --> 00:06:04.720
<v Speaker 4>If we evaluated model performance, we can see that the

118
00:06:04.800 --> 00:06:07.079
<v Speaker 4>GROD three across the board is in the league of

119
00:06:07.120 --> 00:06:12.040
<v Speaker 4>its own, even as little brother Groctor Mini is reaching

120
00:06:12.439 --> 00:06:14.439
<v Speaker 4>the frontier across.

121
00:06:14.160 --> 00:06:15.199
<v Speaker 1>All the other competitors.

122
00:06:15.839 --> 00:06:18.519
<v Speaker 4>You would say, well, at this point, all these benchmarks

123
00:06:18.680 --> 00:06:21.639
<v Speaker 4>you're just evaluating, you know, the memorization of the textbooks,

124
00:06:22.279 --> 00:06:25.920
<v Speaker 4>memorization of the GitHub repos, how about real time usefulness?

125
00:06:26.199 --> 00:06:28.759
<v Speaker 4>How about we actually use those models in our product.

126
00:06:29.680 --> 00:06:33.160
<v Speaker 4>So what we did instead is we actually kicked off

127
00:06:33.600 --> 00:06:37.680
<v Speaker 4>a blind test of our garacy model code named Chocolate,

128
00:06:38.199 --> 00:06:42.560
<v Speaker 4>Pretty Hot Chocolate. I've been running on this platform called

129
00:06:42.600 --> 00:06:46.160
<v Speaker 4>chap Arena for two weeks. I think the entire x

130
00:06:46.199 --> 00:06:48.959
<v Speaker 4>platform at some point speculated this might be the next

131
00:06:49.000 --> 00:06:54.160
<v Speaker 4>generation of AI coming away. So how this chap arena

132
00:06:54.240 --> 00:06:58.600
<v Speaker 4>works is that it's stripped away the entire product service, right,

133
00:06:58.680 --> 00:07:01.800
<v Speaker 4>It just raw comparison of the engine of those Asia's

134
00:07:01.839 --> 00:07:05.240
<v Speaker 4>the language models themselves and place interface.

135
00:07:04.720 --> 00:07:07.959
<v Speaker 1>Where the user will submit one single querry and you.

136
00:07:07.959 --> 00:07:10.639
<v Speaker 4>Get to show two responses, you don't know which model

137
00:07:10.680 --> 00:07:13.920
<v Speaker 4>they come from, and India you make the vote. So

138
00:07:14.000 --> 00:07:17.240
<v Speaker 4>in this blind test, GROX three, an early version of

139
00:07:17.240 --> 00:07:22.240
<v Speaker 4>GROX three already reached like fourteen hundred. No other models

140
00:07:22.360 --> 00:07:25.079
<v Speaker 4>had reached an ELO score had to have comparison to

141
00:07:25.120 --> 00:07:26.879
<v Speaker 4>all the other models at this score.

142
00:07:27.399 --> 00:07:30.000
<v Speaker 1>And it's not just one single category.

143
00:07:30.399 --> 00:07:35.480
<v Speaker 4>It's fourteen hundred aggregated across all the categories in chapbock capabilities,

144
00:07:35.480 --> 00:07:39.639
<v Speaker 4>in struction, following coding, So it's number one across the

145
00:07:39.639 --> 00:07:40.319
<v Speaker 4>board in this.

146
00:07:40.319 --> 00:07:42.959
<v Speaker 2>Blind test, and it's still climbing, so we actually keep

147
00:07:43.040 --> 00:07:45.720
<v Speaker 2>to keep updating it. So it's forty fourteen hundred, but

148
00:07:45.839 --> 00:07:46.959
<v Speaker 2>fourteen hundred and climbing.

149
00:07:47.199 --> 00:07:48.639
<v Speaker 3>Yeah, And in fact, we have a version of the

150
00:07:48.680 --> 00:07:50.879
<v Speaker 3>model that we think is already much better than the

151
00:07:50.879 --> 00:07:54.839
<v Speaker 3>one that we tested here. Yeah, we'll see, I guess,

152
00:07:55.240 --> 00:07:59.040
<v Speaker 3>but that's the one that we're working on talking about today.

153
00:07:59.199 --> 00:07:59.399
<v Speaker 1>Yeah.

154
00:07:59.439 --> 00:08:02.439
<v Speaker 2>So actually, thing, if you're using BARG three, you I

155
00:08:02.439 --> 00:08:05.959
<v Speaker 2>think you may notice improvements almost every day because we're

156
00:08:06.000 --> 00:08:10.120
<v Speaker 2>continuously improving the model. So literally even within twenty four hours,

157
00:08:10.120 --> 00:08:10.879
<v Speaker 2>you'll see improvements.

158
00:08:11.480 --> 00:08:11.759
<v Speaker 1>Yep.

159
00:08:13.199 --> 00:08:16.079
<v Speaker 4>But we believe here at the XAI, getting the best

160
00:08:16.399 --> 00:08:19.199
<v Speaker 4>pre training model is not enough. That's not enough to

161
00:08:19.240 --> 00:08:22.079
<v Speaker 4>build the best AI and the best A. I need

162
00:08:22.120 --> 00:08:25.279
<v Speaker 4>to think like a human to contemplate about all the

163
00:08:25.319 --> 00:08:31.160
<v Speaker 4>possible solutions, self critique, verify all the solutions, backtrack, and

164
00:08:31.240 --> 00:08:32.799
<v Speaker 4>also think from the first principle.

165
00:08:33.279 --> 00:08:34.720
<v Speaker 1>That's a very important capability.

166
00:08:35.480 --> 00:08:38.519
<v Speaker 4>So we believe that as we take the best PREA

167
00:08:38.600 --> 00:08:43.360
<v Speaker 4>training model and continue training with reinforcement learning, it will

168
00:08:43.480 --> 00:08:47.120
<v Speaker 4>enlicit the additional reasoning capabilities that allows the model just

169
00:08:47.440 --> 00:08:50.480
<v Speaker 4>becomes so much better and scale, not just in the

170
00:08:50.480 --> 00:08:53.039
<v Speaker 4>training time, but actually in the test time as well.

171
00:08:53.559 --> 00:08:56.240
<v Speaker 4>So we already found the model is extremely useful internally,

172
00:08:56.519 --> 00:08:59.799
<v Speaker 4>saving one hundreds of hours of coding time. So you

173
00:08:59.799 --> 00:09:02.960
<v Speaker 4>are the user of our these model, what does the

174
00:09:03.039 --> 00:09:03.639
<v Speaker 4>use cases yea.

175
00:09:03.720 --> 00:09:06.960
<v Speaker 3>So, like Jimmy said, we've added advanced reasoning capabilities to

176
00:09:07.000 --> 00:09:09.399
<v Speaker 3>GROG and we've been testing them pretty heavily over the

177
00:09:09.480 --> 00:09:11.200
<v Speaker 3>last few weeks in order to give you a little

178
00:09:11.200 --> 00:09:12.679
<v Speaker 3>bit of a taste of what it looks like when

179
00:09:12.720 --> 00:09:15.759
<v Speaker 3>GROG is solving hard reasoning problems. So we've prepared two

180
00:09:15.759 --> 00:09:18.600
<v Speaker 3>little problems for you. One that comes from physics and

181
00:09:18.679 --> 00:09:20.720
<v Speaker 3>one is actually a game that God is going to

182
00:09:20.720 --> 00:09:23.000
<v Speaker 3>write for us. When it comes to the physics problem,

183
00:09:23.039 --> 00:09:25.120
<v Speaker 3>you know, what we want Grog to do is to

184
00:09:25.240 --> 00:09:29.080
<v Speaker 3>plot a viable trajectory to do a transfer from Earth

185
00:09:29.120 --> 00:09:31.879
<v Speaker 3>to Mars and then at a later point in time

186
00:09:31.919 --> 00:09:34.519
<v Speaker 3>a transfer back from Mars to Earth. And that requires,

187
00:09:34.919 --> 00:09:37.200
<v Speaker 3>you know, some physics that Grog will have to understand.

188
00:09:37.519 --> 00:09:39.440
<v Speaker 3>So we're going to challenge Grock, you know, come up

189
00:09:39.440 --> 00:09:43.639
<v Speaker 3>with a viable trajectory, calculate it, and then plot it

190
00:09:43.639 --> 00:09:45.919
<v Speaker 3>for us so we can see it. And yeah, this

191
00:09:46.039 --> 00:09:49.120
<v Speaker 3>is totally unscripted, by the way, this is the Grog interface,

192
00:09:49.519 --> 00:09:52.200
<v Speaker 3>and we've typed in this text that you can see

193
00:09:52.240 --> 00:09:54.799
<v Speaker 3>here generate code for an animated three D plot of

194
00:09:54.879 --> 00:09:58.159
<v Speaker 3>a launch from Earth, landing on Mars, and then back

195
00:09:58.200 --> 00:10:01.120
<v Speaker 3>to Earth at the next launch window. And we've not

196
00:10:01.240 --> 00:10:03.799
<v Speaker 3>kicked off or the query and you can see Grog spinking.

197
00:10:04.600 --> 00:10:08.039
<v Speaker 3>So out of Rock's advanced reasoning capabilities are these thinking

198
00:10:08.080 --> 00:10:10.320
<v Speaker 3>traces that you can see here. You can even go

199
00:10:10.399 --> 00:10:13.279
<v Speaker 3>inside and actually read what Grock is thinking as it's

200
00:10:13.320 --> 00:10:15.200
<v Speaker 3>going through the problem, as it's trying to solve it.

201
00:10:15.480 --> 00:10:18.519
<v Speaker 2>Yeah, weld say like we are doing some obscuration of

202
00:10:18.559 --> 00:10:22.240
<v Speaker 2>the thinking so that our model doesn't get totally copied instantly.

203
00:10:22.759 --> 00:10:27.399
<v Speaker 2>So there's more to the thinking than is displayed.

204
00:10:27.799 --> 00:10:30.639
<v Speaker 3>And because this is totally unscripted, there's actually a chance

205
00:10:30.679 --> 00:10:33.080
<v Speaker 3>that Grock might make a little courting mistake and it

206
00:10:33.159 --> 00:10:35.519
<v Speaker 3>might not actually work. So just in case, we're going

207
00:10:35.559 --> 00:10:38.559
<v Speaker 3>to launch two more instances of this, so if something

208
00:10:38.600 --> 00:10:41.679
<v Speaker 3>goes wrong, we were able to search to those enshow

209
00:10:41.679 --> 00:10:44.960
<v Speaker 3>you something that's presentable, So we're kicking off the other

210
00:10:45.039 --> 00:10:47.559
<v Speaker 3>two as well, And like I said, we have a

211
00:10:47.639 --> 00:10:51.159
<v Speaker 3>second problem as well. Actually, one of our favorite activities

212
00:10:51.159 --> 00:10:55.039
<v Speaker 3>here XCI is having rock right games for us, not

213
00:10:55.120 --> 00:10:57.879
<v Speaker 3>just any now any old game, any game that you

214
00:10:57.919 --> 00:11:00.559
<v Speaker 3>might already be familiar with, but actually creating new games

215
00:11:00.600 --> 00:11:03.960
<v Speaker 3>on the spot and being creative about us. So one

216
00:11:04.039 --> 00:11:07.720
<v Speaker 3>example that we found was really really fun is create

217
00:11:07.720 --> 00:11:11.000
<v Speaker 3>a game that's a mixture of the two games Tetris

218
00:11:11.000 --> 00:11:12.000
<v Speaker 3>and Jewels.

219
00:11:12.600 --> 00:11:14.759
<v Speaker 2>So this is maybe an important thing like that this

220
00:11:15.399 --> 00:11:18.000
<v Speaker 2>obviously if you if you ask an ai to create

221
00:11:18.000 --> 00:11:20.080
<v Speaker 2>a game like Tetris, there's there are many examples of

222
00:11:20.120 --> 00:11:21.399
<v Speaker 2>Tetris on the on the internet.

223
00:11:21.399 --> 00:11:25.200
<v Speaker 1>The air or game like Dueled whatever, this, it can

224
00:11:25.240 --> 00:11:25.720
<v Speaker 1>copy it.

225
00:11:26.600 --> 00:11:30.960
<v Speaker 2>What's interesting here is it achieved a creative solution combining

226
00:11:31.320 --> 00:11:36.080
<v Speaker 2>the two games that actually works and is a good game. Yeah,

227
00:11:36.240 --> 00:11:39.960
<v Speaker 2>that's the it's created. We're seeing the beginnings of creativity.

228
00:11:40.480 --> 00:11:42.720
<v Speaker 3>Fingers crossed that we can recreate that, but hopefully it works.

229
00:11:43.200 --> 00:11:45.519
<v Speaker 3>Hopefully it's actually because this is a bit more challenging,

230
00:11:45.519 --> 00:11:47.840
<v Speaker 3>we're going to use something special here, which we call

231
00:11:47.919 --> 00:11:51.480
<v Speaker 3>big Brain. That's our mode in which we use more computation,

232
00:11:51.840 --> 00:11:54.639
<v Speaker 3>which was more reasoning of our rock, just to make

233
00:11:54.639 --> 00:11:56.399
<v Speaker 3>sure that you know, there's a good chance here that

234
00:11:56.799 --> 00:11:59.679
<v Speaker 3>actually might actually do it. So we're also going to

235
00:11:59.720 --> 00:12:04.559
<v Speaker 3>fire of free attempts here at solving this game, at

236
00:12:04.639 --> 00:12:07.919
<v Speaker 3>creating this game that's a mixture of Tetris and the Jewels.

237
00:12:09.080 --> 00:12:10.519
<v Speaker 3>Let's let's see what god comes out.

238
00:12:10.279 --> 00:12:14.639
<v Speaker 2>I've played the game. It's pretty good. Like it's like, wow, okay,

239
00:12:14.720 --> 00:12:15.320
<v Speaker 2>this is something.

240
00:12:16.000 --> 00:12:19.720
<v Speaker 3>Yeah, So while Grog is thinking in the background, we

241
00:12:19.759 --> 00:12:23.000
<v Speaker 3>can now actually talk about some concrete numbers. How well

242
00:12:23.080 --> 00:12:25.559
<v Speaker 3>is Groug doing across tons of different tasks that we've

243
00:12:25.559 --> 00:12:28.960
<v Speaker 3>tested on. So we'll hand it over to Tony to talk

244
00:12:28.960 --> 00:12:29.320
<v Speaker 3>about that.

245
00:12:30.279 --> 00:12:33.360
<v Speaker 5>Yeah, Okay, so let's see how Greg has on those

246
00:12:33.559 --> 00:12:38.440
<v Speaker 5>interesting challenging benchmarks. So yeah, so reasoning again refers to

247
00:12:38.480 --> 00:12:41.600
<v Speaker 5>those models that actually thinks quite for quite a long

248
00:12:41.639 --> 00:12:45.360
<v Speaker 5>time before it tries to solve a problem. So in

249
00:12:45.399 --> 00:12:48.440
<v Speaker 5>this case, you know, around the months ago the growth

250
00:12:48.440 --> 00:12:52.679
<v Speaker 5>three apprecionning finishes, so after that we worked very hard

251
00:12:52.720 --> 00:12:56.840
<v Speaker 5>to put the reasoning capability into the current Growth three model.

252
00:12:57.279 --> 00:12:59.840
<v Speaker 5>But again this is very early days, so the model

253
00:13:00.120 --> 00:13:03.000
<v Speaker 5>still currently in training. So right now, what we're going

254
00:13:03.080 --> 00:13:05.960
<v Speaker 5>to show to people is this beta version of the

255
00:13:06.000 --> 00:13:09.799
<v Speaker 5>growth three reasoning model. Alongside, we also are training a

256
00:13:09.799 --> 00:13:12.720
<v Speaker 5>mini version of the reasoning model. So essentially on this

257
00:13:12.799 --> 00:13:16.279
<v Speaker 5>plot you can see the growth three reasoning beta and

258
00:13:16.279 --> 00:13:19.759
<v Speaker 5>then Growth three mini reasoning. The reasoning mini reasoning is

259
00:13:19.759 --> 00:13:21.919
<v Speaker 5>actually a model that we train for much longer time,

260
00:13:22.320 --> 00:13:25.200
<v Speaker 5>and you can see that sometimes you actually perform slightly

261
00:13:25.240 --> 00:13:28.840
<v Speaker 5>better compared to the Growth three reasoning. This also just

262
00:13:28.879 --> 00:13:31.279
<v Speaker 5>means that there's a huge potential for the Growth three

263
00:13:31.279 --> 00:13:32.639
<v Speaker 5>reasoning because it's trained for.

264
00:13:32.679 --> 00:13:33.360
<v Speaker 1>Much less time.

265
00:13:34.639 --> 00:13:37.600
<v Speaker 5>So all right, so let's actually look at how it

266
00:13:37.639 --> 00:13:42.000
<v Speaker 5>does on those three benchmarks. So Jimmy also introduced already,

267
00:13:42.039 --> 00:13:46.080
<v Speaker 5>so essentially we're looking at three different areas mathematics, science,

268
00:13:46.200 --> 00:13:49.759
<v Speaker 5>and coding. And for math we're picking this high school

269
00:13:49.799 --> 00:13:54.519
<v Speaker 5>competition math problem. For science we actually picked those PhD

270
00:13:54.679 --> 00:13:56.279
<v Speaker 5>level science questions.

271
00:13:56.919 --> 00:13:59.039
<v Speaker 1>And for coding, it's also actually pretty challenging.

272
00:13:59.120 --> 00:14:02.720
<v Speaker 5>It's competitive code and also some eco, which is some

273
00:14:03.440 --> 00:14:07.360
<v Speaker 5>interview problems that people usually get when they interview for companies.

274
00:14:07.600 --> 00:14:10.159
<v Speaker 5>So on those benchmarks, you can see that the growth

275
00:14:10.159 --> 00:14:14.159
<v Speaker 5>three actually performed quite well across the board compared to

276
00:14:14.240 --> 00:14:18.480
<v Speaker 5>other competitors. Yeah, so it's pretty promising. These models are

277
00:14:18.600 --> 00:14:19.080
<v Speaker 5>very smart.

278
00:14:19.639 --> 00:14:23.440
<v Speaker 1>So totally what what are those shaded bars? Yeah, so okay,

279
00:14:23.519 --> 00:14:25.519
<v Speaker 1>so I'm you asked this question.

280
00:14:25.639 --> 00:14:29.240
<v Speaker 5>So for those models, because it can reason, it can think,

281
00:14:29.679 --> 00:14:32.840
<v Speaker 5>you can also ask them to even think longer. You

282
00:14:32.879 --> 00:14:36.759
<v Speaker 5>can spend more what we call test and compute, which

283
00:14:36.799 --> 00:14:40.120
<v Speaker 5>means you can spend more time to reason, to think

284
00:14:40.240 --> 00:14:43.759
<v Speaker 5>about the problem before you spit out the answer. So

285
00:14:43.799 --> 00:14:46.720
<v Speaker 5>in this case, the shaded bar here means that we

286
00:14:46.879 --> 00:14:50.440
<v Speaker 5>just asked the model to spend more time. You know,

287
00:14:50.519 --> 00:14:53.480
<v Speaker 5>it can solve the same problem many many times before

288
00:14:53.919 --> 00:14:57.080
<v Speaker 5>it tries to conclude what is the right solution, And

289
00:14:57.240 --> 00:15:00.200
<v Speaker 5>once you give this compute or this kind of budget

290
00:15:00.519 --> 00:15:02.799
<v Speaker 5>to the model. It turns out the model can even

291
00:15:02.840 --> 00:15:06.600
<v Speaker 5>perform better. So this is it's honly the shaded part

292
00:15:06.840 --> 00:15:07.639
<v Speaker 5>in those.

293
00:15:07.519 --> 00:15:10.720
<v Speaker 4>Plots, right, So I think this is really exciting, right

294
00:15:10.799 --> 00:15:13.519
<v Speaker 4>because now instead of just doing one chain of thoughts

295
00:15:13.720 --> 00:15:17.240
<v Speaker 4>with AI while not the multiplex at once. Yes, so

296
00:15:17.279 --> 00:15:19.600
<v Speaker 4>that's a very powerful technique that allows to continue to

297
00:15:19.840 --> 00:15:25.240
<v Speaker 4>scale the model capabilities after training. And you know, people

298
00:15:25.240 --> 00:15:28.120
<v Speaker 4>often ask were actually just overfitt into the benchmarks?

299
00:15:28.600 --> 00:15:29.720
<v Speaker 1>So how mogulization?

300
00:15:30.039 --> 00:15:33.240
<v Speaker 5>So yes, I think, yeah, this is definitely a question

301
00:15:33.360 --> 00:15:36.840
<v Speaker 5>that we are asking ourselves whether we're overfitting into those

302
00:15:36.919 --> 00:15:37.919
<v Speaker 5>current benchmarks.

303
00:15:38.480 --> 00:15:40.720
<v Speaker 1>Luckily, we have a real test.

304
00:15:41.240 --> 00:15:45.279
<v Speaker 5>So about five days ago, AIM twenty twenty five just finished.

305
00:15:45.480 --> 00:15:48.519
<v Speaker 5>This is where high school come students compete in this

306
00:15:48.559 --> 00:15:52.279
<v Speaker 5>particular benchmark. So we got this very fresh new competition,

307
00:15:52.759 --> 00:15:55.039
<v Speaker 5>and then we ask our two models to compete on

308
00:15:55.120 --> 00:15:58.639
<v Speaker 5>the same benchmark, the same exam, and it turns out

309
00:15:59.480 --> 00:16:03.720
<v Speaker 5>very interest the gross three reasoning the big one actually

310
00:16:03.799 --> 00:16:08.279
<v Speaker 5>does better on this particular new, fresh exam. This also

311
00:16:08.320 --> 00:16:11.919
<v Speaker 5>means that the generalization capability of the big model is stronger,

312
00:16:12.080 --> 00:16:15.039
<v Speaker 5>much stronger compared to the smaller model. If you compare

313
00:16:15.080 --> 00:16:17.840
<v Speaker 5>to the last year's exam. Actually this is the opposite.

314
00:16:18.000 --> 00:16:23.000
<v Speaker 5>The smaller model kind of learns the previous exams better.

315
00:16:23.840 --> 00:16:25.759
<v Speaker 5>So yeah, so this this actually shows some kind of

316
00:16:25.759 --> 00:16:27.559
<v Speaker 5>true generalization from the model.

317
00:16:28.080 --> 00:16:31.039
<v Speaker 4>Right, So, seventeen months ago our Rock zero and Rock

318
00:16:31.120 --> 00:16:34.120
<v Speaker 4>Web baret solved any high school problems. That's right, And

319
00:16:34.159 --> 00:16:37.720
<v Speaker 4>now we have a kid that just already graduate. The

320
00:16:37.720 --> 00:16:39.120
<v Speaker 4>Grock is right to go to college?

321
00:16:39.159 --> 00:16:42.039
<v Speaker 1>Is that right? I mean, it's won't belong before. It's

322
00:16:42.000 --> 00:16:42.759
<v Speaker 1>simply perfect.

323
00:16:43.080 --> 00:16:45.919
<v Speaker 2>The human exams won't be part it'll be too easy.

324
00:16:46.240 --> 00:16:50.720
<v Speaker 4>Yeah, and the internally we actually as a rocket continually evolved.

325
00:16:51.240 --> 00:16:53.080
<v Speaker 1>We're going to talk about, you know what we're.

326
00:16:52.960 --> 00:16:56.000
<v Speaker 4>Excited about, but very soon there will be no more

327
00:16:56.000 --> 00:16:56.799
<v Speaker 4>benchmark left.

328
00:16:57.039 --> 00:16:59.440
<v Speaker 1>Yeah. Yeah. One thing that's quite fascinating.

329
00:16:59.600 --> 00:17:03.000
<v Speaker 3>I think that we basically only trained Grock's reasoning abilities

330
00:17:03.000 --> 00:17:07.200
<v Speaker 3>on math problems and competitive coding problems. It's so very

331
00:17:07.279 --> 00:17:11.000
<v Speaker 3>very specialized kinds of tasks, but somehow it's able to

332
00:17:11.039 --> 00:17:14.759
<v Speaker 3>work on all kinds of other different tasks, so including

333
00:17:14.799 --> 00:17:18.200
<v Speaker 3>creating games. No, lots and lots of different things. And

334
00:17:18.279 --> 00:17:21.079
<v Speaker 3>what seems to be happening is that basically Grog learns

335
00:17:21.160 --> 00:17:24.079
<v Speaker 3>this ability to detect its own mistakes and it's thinking,

336
00:17:24.160 --> 00:17:27.680
<v Speaker 3>correct them, persist on a problem, try lots of different variants,

337
00:17:27.759 --> 00:17:30.200
<v Speaker 3>pick them one that's best. So there are is generalized

338
00:17:30.440 --> 00:17:35.039
<v Speaker 3>generalizing abilities that learns from mathematics and from coding, which

339
00:17:35.039 --> 00:17:37.279
<v Speaker 3>it can then use to solve all kinds of other problems.

340
00:17:37.279 --> 00:17:39.720
<v Speaker 1>So that's yeah, that's pretty I mean, reality is the

341
00:17:39.720 --> 00:17:42.240
<v Speaker 1>instantiation of mathematics. That's right.

342
00:17:43.400 --> 00:17:45.480
<v Speaker 4>And one thing we're actually really excited about that going

343
00:17:45.519 --> 00:17:48.559
<v Speaker 4>back to our faulty mission, is what if one day

344
00:17:48.839 --> 00:17:52.519
<v Speaker 4>we have a computer just like the thought that utilize

345
00:17:52.559 --> 00:17:53.240
<v Speaker 4>our entire.

346
00:17:53.119 --> 00:17:55.839
<v Speaker 1>Cluster just for the one very important problem.

347
00:17:56.039 --> 00:17:58.759
<v Speaker 4>In the test time, all the deply turned out right,

348
00:17:58.839 --> 00:18:02.119
<v Speaker 4>So I think back then we'll building the GBU clusters together. Uh,

349
00:18:02.440 --> 00:18:06.920
<v Speaker 4>you're applobing cables, and I remember that when we turned

350
00:18:06.920 --> 00:18:09.640
<v Speaker 4>on the first initial test, you can hear all the

351
00:18:09.680 --> 00:18:11.640
<v Speaker 4>GPS humming in the hallway.

352
00:18:12.119 --> 00:18:13.559
<v Speaker 1>That's almost feel like spiritual.

353
00:18:14.519 --> 00:18:16.960
<v Speaker 3>Yeah, that's actually a pretty cool thing that we're able

354
00:18:16.960 --> 00:18:18.839
<v Speaker 3>to do. That we can go into the data center

355
00:18:19.160 --> 00:18:21.799
<v Speaker 3>and tinker with the machines there. So for example, we

356
00:18:22.200 --> 00:18:25.160
<v Speaker 3>went in and we unplucked a few of the cables

357
00:18:25.359 --> 00:18:28.079
<v Speaker 3>and just made sure that our training setup is still running,

358
00:18:28.359 --> 00:18:31.039
<v Speaker 3>running stably. So that's something that's you know, I think

359
00:18:31.440 --> 00:18:34.400
<v Speaker 3>most AI you know teams out there don't usually do.

360
00:18:34.559 --> 00:18:37.319
<v Speaker 3>But it's actually totally unlocks like a new level of

361
00:18:37.319 --> 00:18:40.599
<v Speaker 3>reliability and what you're able to do with the hypers.

362
00:18:40.640 --> 00:18:42.759
<v Speaker 1>So okay, so when are we going to solve remont?

363
00:18:43.279 --> 00:18:47.759
<v Speaker 4>So the easiest solution is to enumerate over all possible

364
00:18:47.759 --> 00:18:51.759
<v Speaker 4>strains and as all you have a verifier, en up compute,

365
00:18:51.960 --> 00:18:52.839
<v Speaker 4>you'll be able to do it.

366
00:18:52.960 --> 00:18:56.440
<v Speaker 1>Okay, my projection will be what your guess, what is

367
00:18:56.480 --> 00:18:57.599
<v Speaker 1>your neural edge? Calculate?

368
00:18:58.359 --> 00:19:00.920
<v Speaker 4>So my my boat for the so so three years

369
00:19:00.920 --> 00:19:04.559
<v Speaker 4>ago I told you that I think not two years later,

370
00:19:05.200 --> 00:19:06.240
<v Speaker 4>two things is gonna happen.

371
00:19:06.480 --> 00:19:09.200
<v Speaker 1>We're gonna see machines win some battles.

372
00:19:09.480 --> 00:19:14.359
<v Speaker 4>Yes, two ways a word, fields metal, Globel price with

373
00:19:14.519 --> 00:19:16.240
<v Speaker 4>probably some expert in the loop.

374
00:19:16.319 --> 00:19:19.319
<v Speaker 3>Right, so the expert uplifting. So this year or next year,

375
00:19:20.640 --> 00:19:22.279
<v Speaker 3>that's what it comes down to you. Yeah, So it

376
00:19:22.319 --> 00:19:24.680
<v Speaker 3>looks like GROG finished all of its thinking on the

377
00:19:24.759 --> 00:19:26.880
<v Speaker 3>on the two problems. So let's take a look at

378
00:19:26.880 --> 00:19:30.200
<v Speaker 3>what it said. All right, so this was the little physics.

379
00:19:29.759 --> 00:19:30.640
<v Speaker 1>Problem we had.

380
00:19:31.559 --> 00:19:34.400
<v Speaker 3>You know, we we've collapsed the thoughts here, so they're

381
00:19:34.720 --> 00:19:37.599
<v Speaker 3>you know, they're hidden. And then we see Grock's answer

382
00:19:37.680 --> 00:19:40.039
<v Speaker 3>below that, so it explains it wrought a Python script

383
00:19:40.039 --> 00:19:42.839
<v Speaker 3>here using map plot Lip then gives us all of

384
00:19:42.880 --> 00:19:45.759
<v Speaker 3>the code. So let's take a quick look at the code.

385
00:19:45.839 --> 00:19:48.319
<v Speaker 3>You know, it seems like it's doing reasonable things here,

386
00:19:48.480 --> 00:19:53.240
<v Speaker 3>not not totally of the mark, solf Capitler says here,

387
00:19:53.319 --> 00:19:58.359
<v Speaker 3>so maybe it's solving capitalist laws, capital capitalist law and americally. Yeah,

388
00:19:58.400 --> 00:20:00.599
<v Speaker 3>there's really only one way to find out if this

389
00:20:00.640 --> 00:20:01.160
<v Speaker 3>thing is working.

390
00:20:01.240 --> 00:20:03.079
<v Speaker 1>I would say, let's let's give it a try. Let's run.

391
00:20:03.400 --> 00:20:07.119
<v Speaker 3>Let's run the code all right, and we can see, yeah,

392
00:20:07.119 --> 00:20:11.000
<v Speaker 3>I've got animating two different planets, Earth and Mars here,

393
00:20:11.039 --> 00:20:15.960
<v Speaker 3>and then the green ball is the vehicle that's transiting

394
00:20:16.039 --> 00:20:19.799
<v Speaker 3>the spacecraft that's transitioning between Earth and Mars, and you

395
00:20:19.839 --> 00:20:22.960
<v Speaker 3>could see the journey from Earth to Mars and looks like, yeah,

396
00:20:23.119 --> 00:20:26.880
<v Speaker 3>indeed the astronauts were turned safely, you know, at the

397
00:20:26.960 --> 00:20:31.519
<v Speaker 3>right moment in time. So obviously this was just generated

398
00:20:31.559 --> 00:20:33.440
<v Speaker 3>on the spot, so we can't tell you if that

399
00:20:33.599 --> 00:20:36.039
<v Speaker 3>was actually correct solution. So we're going to take a

400
00:20:36.039 --> 00:20:37.599
<v Speaker 3>close a look. Now, maybe we're going to call some

401
00:20:37.640 --> 00:20:40.880
<v Speaker 3>colleagues a space X ask them if if this.

402
00:20:40.920 --> 00:20:46.279
<v Speaker 2>Is legit, that's pretty close it's it's I mean yeah,

403
00:20:46.279 --> 00:20:48.160
<v Speaker 2>I mean, there's there's there's there's a lot of complexities

404
00:20:48.200 --> 00:20:50.920
<v Speaker 2>in the actual orbits that have to be taking into account,

405
00:20:50.960 --> 00:20:53.960
<v Speaker 2>but this is this is pretty close to what looks

406
00:20:54.000 --> 00:20:59.359
<v Speaker 2>like I add that or my pendets here. This has

407
00:20:59.359 --> 00:21:02.279
<v Speaker 2>got the Earth home and transfer on it. When where

408
00:21:02.319 --> 00:21:05.039
<v Speaker 2>we're going to install rock on a rocket.

409
00:21:04.880 --> 00:21:08.640
<v Speaker 1>Well, I suppose in two years three years.

410
00:21:08.920 --> 00:21:14.160
<v Speaker 2>Everything is two years away. Well, Earth and Mars. Transit

411
00:21:14.240 --> 00:21:17.400
<v Speaker 2>can occurs every twenty six months. The next we're currently

412
00:21:17.400 --> 00:21:20.039
<v Speaker 2>in a transit window approximately. The next one would be

413
00:21:21.039 --> 00:21:25.880
<v Speaker 2>November of next year, roughly the end of next year,

414
00:21:27.079 --> 00:21:31.119
<v Speaker 2>and if all go as well, SpaceX will send starship

415
00:21:31.440 --> 00:21:36.759
<v Speaker 2>rockets to Mars with Optimus robots and Rock.

416
00:21:37.279 --> 00:21:37.519
<v Speaker 1>Mh.

417
00:21:38.640 --> 00:21:42.319
<v Speaker 3>I'm curious about this combination of Tetris and the duets.

418
00:21:42.319 --> 00:21:47.960
<v Speaker 3>Looks like the Tetris as we've named it internally. So okay,

419
00:21:48.200 --> 00:21:51.039
<v Speaker 3>we also have an output from Rock here wrote a

420
00:21:51.039 --> 00:21:52.559
<v Speaker 3>Python script Spence.

421
00:21:52.599 --> 00:21:53.680
<v Speaker 1>That is what it's been doing.

422
00:21:53.880 --> 00:21:56.519
<v Speaker 3>If you look at the code, there are some constants

423
00:21:57.119 --> 00:21:58.920
<v Speaker 3>that are being defined here, some.

424
00:21:58.880 --> 00:22:00.839
<v Speaker 1>Colors, and then the totrominos.

425
00:22:01.240 --> 00:22:05.400
<v Speaker 3>The pieces of Tetris are there, obviously very hard to

426
00:22:05.440 --> 00:22:07.440
<v Speaker 3>see and at one glance if this is good, so

427
00:22:07.480 --> 00:22:09.880
<v Speaker 3>we gotta we gotta run this to figure out if

428
00:22:09.880 --> 00:22:10.400
<v Speaker 3>it's working.

429
00:22:10.839 --> 00:22:12.079
<v Speaker 1>Well, let's let's give it a try.

430
00:22:12.519 --> 00:22:15.759
<v Speaker 3>Fingers crossed ay, Right, So this kind of looks like Tetris,

431
00:22:16.319 --> 00:22:18.880
<v Speaker 3>but the colors are a little bit off, right, The

432
00:22:18.920 --> 00:22:24.480
<v Speaker 3>colors are different here, and if you think about what's

433
00:22:24.519 --> 00:22:27.799
<v Speaker 3>going what's going on here? The jewel it has this

434
00:22:27.920 --> 00:22:30.519
<v Speaker 3>mechanic where if you get free jewels in a row,

435
00:22:30.960 --> 00:22:35.039
<v Speaker 3>you know, then they disappear and also gravity activates. Right,

436
00:22:35.079 --> 00:22:38.880
<v Speaker 3>So what happens if you get three of the colors together? Okay,

437
00:22:38.960 --> 00:22:42.440
<v Speaker 3>so something happened. So I think I think what Bruck

438
00:22:43.079 --> 00:22:47.680
<v Speaker 3>did in this version is that, you know, once you

439
00:22:47.720 --> 00:22:50.519
<v Speaker 3>connect three at least three blocks of the same color

440
00:22:50.960 --> 00:22:51.519
<v Speaker 3>in a row.

441
00:22:51.519 --> 00:22:52.000
<v Speaker 1>Then.

442
00:22:53.880 --> 00:22:57.559
<v Speaker 3>Gravity activates and they disappear, and then gravity activates and

443
00:22:57.599 --> 00:23:00.799
<v Speaker 3>all the other blocks fall down. I'm kind of kind

444
00:23:00.799 --> 00:23:03.640
<v Speaker 3>of curious if there's still a Tetris mechanic here where

445
00:23:04.039 --> 00:23:07.599
<v Speaker 3>if the line is full, does it actually clear it

446
00:23:08.000 --> 00:23:11.559
<v Speaker 3>or what happens? Then it's up to interpretation.

447
00:23:11.920 --> 00:23:14.599
<v Speaker 2>So who knows every I mean, when you're it'll do

448
00:23:14.640 --> 00:23:15.960
<v Speaker 2>different variants when you ask it.

449
00:23:15.960 --> 00:23:18.559
<v Speaker 3>It doesn't do the same thing every time exactly. We've

450
00:23:18.559 --> 00:23:22.039
<v Speaker 3>seen a few other work very differently, but that's one

451
00:23:22.039 --> 00:23:22.559
<v Speaker 3>seems cool.

452
00:23:22.640 --> 00:23:26.960
<v Speaker 4>So are we ready for game studio at x l AI.

453
00:23:27.200 --> 00:23:31.480
<v Speaker 2>Yes, so we're launching an AI gaming studio at x

454
00:23:31.480 --> 00:23:33.799
<v Speaker 2>c I. If you're interested in joining us in building

455
00:23:33.839 --> 00:23:37.240
<v Speaker 2>AI games, Uh, please join x AI where we're launching

456
00:23:37.240 --> 00:23:39.319
<v Speaker 2>in AI Gaming studio or announcing it tonight.

457
00:23:39.480 --> 00:23:44.000
<v Speaker 1>Let's go. Yeah, big games, that's an actual game. Yeah,

458
00:23:45.200 --> 00:23:45.640
<v Speaker 1>all right.

459
00:23:45.720 --> 00:23:49.480
<v Speaker 4>So I think one thing is super exciting for us

460
00:23:50.640 --> 00:23:53.920
<v Speaker 4>is that once you have the best patriot model, you

461
00:23:54.039 --> 00:23:55.279
<v Speaker 4>have the best reason model.

462
00:23:55.759 --> 00:23:58.440
<v Speaker 1>Right, so we already see that we.

463
00:23:58.359 --> 00:24:00.160
<v Speaker 4>Actually give a capability for those model.

464
00:24:00.039 --> 00:24:03.599
<v Speaker 1>To think harder, think longer, think more broad.

465
00:24:04.519 --> 00:24:08.079
<v Speaker 4>The performance continuing improves, and we're really excited about the

466
00:24:08.079 --> 00:24:10.839
<v Speaker 4>next front here that will happen if we'll not only

467
00:24:10.839 --> 00:24:14.200
<v Speaker 4>allow the model to think harder, but also provide more tools,

468
00:24:14.200 --> 00:24:17.359
<v Speaker 4>just like call real humans to solve those problems. For

469
00:24:17.440 --> 00:24:21.160
<v Speaker 4>real humans, we don't ask them to solve women hypothesis

470
00:24:21.200 --> 00:24:24.119
<v Speaker 4>just with a piece of pen and paper. The Internet,

471
00:24:25.079 --> 00:24:30.319
<v Speaker 4>so with all the basic web browsing, search engine and

472
00:24:30.480 --> 00:24:34.359
<v Speaker 4>coding interpreters that builds the foundations and the best reasoning

473
00:24:34.440 --> 00:24:38.200
<v Speaker 4>model build the foundations for the grog agent to come.

474
00:24:39.920 --> 00:24:45.240
<v Speaker 4>So today we're actually introducing a new product called deep

475
00:24:45.240 --> 00:24:48.720
<v Speaker 4>search that is the first generation of our grock agents

476
00:24:49.319 --> 00:24:52.039
<v Speaker 4>that not just helping the engineers and research and scientists

477
00:24:52.119 --> 00:24:56.359
<v Speaker 4>do coding, but actually help everyone to answer questions that.

478
00:24:56.359 --> 00:24:57.240
<v Speaker 1>You have day today.

479
00:24:57.440 --> 00:25:00.119
<v Speaker 4>It's kind of like a next generation search engine that

480
00:25:00.200 --> 00:25:03.359
<v Speaker 4>really help you to understand the universe. So you can

481
00:25:03.359 --> 00:25:07.519
<v Speaker 4>start asking questions like, for example, hey, when is the

482
00:25:07.559 --> 00:25:11.359
<v Speaker 4>next starship launch day? For example, So let's try that

483
00:25:11.920 --> 00:25:17.599
<v Speaker 4>the answer. On the left inside we see a high

484
00:25:17.680 --> 00:25:21.200
<v Speaker 4>level progress bar. Essentially, you know the model knowledge is

485
00:25:21.200 --> 00:25:23.759
<v Speaker 4>going to do one single search like the current right system,

486
00:25:24.160 --> 00:25:27.119
<v Speaker 4>but actually thought very deeply about, hey, what's the user

487
00:25:27.200 --> 00:25:29.880
<v Speaker 4>intent here, and what are the facts that actually consider

488
00:25:30.240 --> 00:25:32.640
<v Speaker 4>at the same time, and how many different websites actually

489
00:25:32.680 --> 00:25:36.160
<v Speaker 4>actually go and read the accountent right, So this can

490
00:25:36.359 --> 00:25:40.359
<v Speaker 4>save hundreds of hours of everyone's Google time if you

491
00:25:40.400 --> 00:25:42.319
<v Speaker 4>want to really look into certain topics.

492
00:25:42.960 --> 00:25:44.519
<v Speaker 1>And then on the right.

493
00:25:44.359 --> 00:25:46.759
<v Speaker 4>Inside you can see the bullet of how the current

494
00:25:46.799 --> 00:25:51.079
<v Speaker 4>model you know, is doing what websites browsing, what sources

495
00:25:51.200 --> 00:25:55.400
<v Speaker 4>verifying and oftentimes actually cross validate different sources out there

496
00:25:56.240 --> 00:25:58.279
<v Speaker 4>to make sure the answer is actually correct before the

497
00:25:58.279 --> 00:26:01.200
<v Speaker 4>output final answer, we can, you know, at the same time,

498
00:26:01.279 --> 00:26:04.720
<v Speaker 4>fire up a few more querries. How about you know,

499
00:26:04.759 --> 00:26:08.799
<v Speaker 4>you know your gamer right, so sure? Yeah, so how

500
00:26:08.839 --> 00:26:10.480
<v Speaker 4>about what are some of the best bills and most

501
00:26:10.519 --> 00:26:14.839
<v Speaker 4>popular bills in uh pathl hardcore right hardcore league?

502
00:26:15.400 --> 00:26:18.119
<v Speaker 2>You if you can technically just look at the hardcore letterer,

503
00:26:18.519 --> 00:26:19.480
<v Speaker 2>it might be a fast.

504
00:26:19.279 --> 00:26:19.960
<v Speaker 1>Way to figure it out.

505
00:26:20.000 --> 00:26:22.279
<v Speaker 4>They always see what model does. And then we can

506
00:26:22.319 --> 00:26:25.880
<v Speaker 4>also do uh you know, something more fun. For example,

507
00:26:27.480 --> 00:26:29.839
<v Speaker 4>how about like make a prediction about the marsh madness

508
00:26:29.920 --> 00:26:30.400
<v Speaker 4>out there.

509
00:26:30.559 --> 00:26:31.359
<v Speaker 1>Yeah, so this is.

510
00:26:31.400 --> 00:26:34.599
<v Speaker 2>Kind of a fun one where Warren Buffett has a

511
00:26:34.599 --> 00:26:38.640
<v Speaker 2>billion dollar bet if you can exactly match the I

512
00:26:38.640 --> 00:26:42.920
<v Speaker 2>think the the the sort of the entire winning tree

513
00:26:43.079 --> 00:26:45.359
<v Speaker 2>of marsh Madness, you can win a billion dollars from

514
00:26:45.440 --> 00:26:48.160
<v Speaker 2>Warren Buffett. So, like, it would be pretty cool if

515
00:26:48.240 --> 00:26:51.160
<v Speaker 2>AI could help you win a billion dollars from Buffett.

516
00:26:51.480 --> 00:26:53.000
<v Speaker 1>It seems like a pretty good investment.

517
00:26:53.440 --> 00:26:57.599
<v Speaker 4>Let's go yeah, all right, so now let's fire up

518
00:26:57.599 --> 00:26:59.960
<v Speaker 4>the quarry and see what mama does.

519
00:27:00.200 --> 00:27:02.960
<v Speaker 1>So we can actually go back to our very first one.

520
00:27:03.079 --> 00:27:06.119
<v Speaker 4>How about the it wasn't counting on this, that's right, Okay,

521
00:27:06.160 --> 00:27:08.200
<v Speaker 4>so we got the first one and all the thought

522
00:27:09.079 --> 00:27:12.400
<v Speaker 4>around one minute. Okay, so the key inside here. The

523
00:27:12.480 --> 00:27:15.559
<v Speaker 4>next starship is going to be on twenty fourth or later.

524
00:27:15.680 --> 00:27:18.960
<v Speaker 4>So no earlier than February twenty fourth, it might be sooner.

525
00:27:19.519 --> 00:27:22.079
<v Speaker 4>So yeah, so I think we can you know, go

526
00:27:22.160 --> 00:27:24.200
<v Speaker 4>down to go down what a model does, so it

527
00:27:24.240 --> 00:27:28.720
<v Speaker 4>does little research flight seven what happened got grounded and actually.

528
00:27:28.640 --> 00:27:34.079
<v Speaker 1>Look into the FCC filing you know, from data collections.

529
00:27:34.319 --> 00:27:38.079
<v Speaker 4>Uh, and that should make a new conclusion that yeh

530
00:27:38.079 --> 00:27:44.480
<v Speaker 4>if we continue to roll down, let's see right, Yeah,

531
00:27:44.559 --> 00:27:47.720
<v Speaker 4>so it makes uh the you know little table. I

532
00:27:47.759 --> 00:27:51.680
<v Speaker 4>think inside XAI we often joked about the time to

533
00:27:51.720 --> 00:27:56.079
<v Speaker 4>the first table is the only you know latency that matters. Yeah,

534
00:27:56.119 --> 00:27:59.119
<v Speaker 4>so that's how the model making influence. And look at

535
00:27:59.160 --> 00:28:01.440
<v Speaker 4>all the sources and then we can look into the

536
00:28:01.480 --> 00:28:05.480
<v Speaker 4>gaming one. So how about the break So for this

537
00:28:05.480 --> 00:28:10.000
<v Speaker 4>particular one, we look at the buildings light and you

538
00:28:10.039 --> 00:28:15.359
<v Speaker 4>know it's a lot better so uh what they inferl us.

539
00:28:15.359 --> 00:28:18.359
<v Speaker 4>But if we go down, so the surprising fact of

540
00:28:18.799 --> 00:28:21.759
<v Speaker 4>all the other builds, So look into the twelve classes,

541
00:28:22.519 --> 00:28:25.319
<v Speaker 4>so we'll see that the medium bill was pretty popular

542
00:28:25.359 --> 00:28:28.599
<v Speaker 4>whenever the game first came out. And now the Invokers

543
00:28:28.640 --> 00:28:29.519
<v Speaker 4>of the World.

544
00:28:29.720 --> 00:28:33.160
<v Speaker 1>Took over Monkety Biker for sure. Yeah that's right.

545
00:28:33.480 --> 00:28:36.319
<v Speaker 4>Yeah, followed by the stone Weavers. Then that's really good mapping.

546
00:28:36.720 --> 00:28:41.119
<v Speaker 4>So yeah, and then we can see uh uh the

547
00:28:42.119 --> 00:28:42.799
<v Speaker 4>match manners.

548
00:28:42.799 --> 00:28:43.319
<v Speaker 1>How about that?

549
00:28:43.440 --> 00:28:47.000
<v Speaker 4>Soe One interesting thing about the dep search is that

550
00:28:47.079 --> 00:28:49.720
<v Speaker 4>if you actually go into the panel.

551
00:28:49.480 --> 00:28:52.599
<v Speaker 1>Where it shows, you know, what are the subtasks.

552
00:28:52.279 --> 00:28:56.000
<v Speaker 4>You can actually click the bottom left of this right

553
00:28:56.759 --> 00:28:59.839
<v Speaker 4>and then in this case you can actually go to

554
00:29:00.079 --> 00:29:04.240
<v Speaker 4>actually reading to the mind of Grock. What informations does

555
00:29:04.279 --> 00:29:06.720
<v Speaker 4>the model actually think about our truck worthy what or not?

556
00:29:06.920 --> 00:29:09.839
<v Speaker 4>How does it actually cross all their different information sources?

557
00:29:10.160 --> 00:29:13.119
<v Speaker 4>So that makes the entire search experience and information with

558
00:29:13.200 --> 00:29:15.839
<v Speaker 4>Trual process a lot more transparent to our users.

559
00:29:16.079 --> 00:29:19.160
<v Speaker 3>This is much more powerful than any search engine out there.

560
00:29:19.200 --> 00:29:22.599
<v Speaker 3>You can literally just tell it only use sources from X.

561
00:29:22.720 --> 00:29:25.079
<v Speaker 3>You know, we'll try to respect that, and so it's

562
00:29:25.160 --> 00:29:27.680
<v Speaker 3>much more steerable, much more intelligent than I mean.

563
00:29:28.440 --> 00:29:29.920
<v Speaker 1>It really should save you a lot of times.

564
00:29:29.960 --> 00:29:31.640
<v Speaker 2>So something that might take an hour or an hour

565
00:29:31.680 --> 00:29:34.519
<v Speaker 2>of researching on the web or searching media. You can

566
00:29:34.599 --> 00:29:37.000
<v Speaker 2>just ask it to go do that and come back

567
00:29:37.000 --> 00:29:37.880
<v Speaker 2>in ten minutes later.

568
00:29:37.920 --> 00:29:39.920
<v Speaker 1>It's done. An hour's work worth of work for you.

569
00:29:40.240 --> 00:29:42.640
<v Speaker 2>That's really what it comes down to, exactly and and

570
00:29:42.680 --> 00:29:44.240
<v Speaker 2>maybe better than you could have done it yourself.

571
00:29:44.319 --> 00:29:44.720
<v Speaker 1>Yeah.

572
00:29:44.920 --> 00:29:48.160
<v Speaker 4>Think about the informount of interns working for you that

573
00:29:48.160 --> 00:29:50.039
<v Speaker 4>you can just fire up all the tasks and come

574
00:29:50.079 --> 00:29:51.000
<v Speaker 4>back a minute later.

575
00:29:51.480 --> 00:29:52.720
<v Speaker 1>This is going to be interesting one.

576
00:29:52.839 --> 00:29:56.799
<v Speaker 4>So marchmass had not happened yet, so I guess we

577
00:29:56.880 --> 00:29:59.720
<v Speaker 4>had to follow up with a next livestream.

578
00:30:00.359 --> 00:30:03.440
<v Speaker 2>Yeah, it seems like pretty good. Like forty dollars might

579
00:30:03.480 --> 00:30:06.720
<v Speaker 2>get you a billion dollars forty dollars subscription, that's.

580
00:30:06.559 --> 00:30:10.000
<v Speaker 1>Right, I mean my work. So yeah, so where are

581
00:30:10.039 --> 00:30:12.759
<v Speaker 1>the users gonna have their heads on? Rock three yees?

582
00:30:12.960 --> 00:30:15.839
<v Speaker 3>So the good news is we've been working tirelessly to

583
00:30:16.000 --> 00:30:19.160
<v Speaker 3>actually release all of these features that we've shown you.

584
00:30:19.400 --> 00:30:22.359
<v Speaker 3>The Grock free based model with amazing chat capabilities that's

585
00:30:22.359 --> 00:30:26.079
<v Speaker 3>really useful, that's really interesting to talk to, the deep search,

586
00:30:26.519 --> 00:30:29.319
<v Speaker 3>the advanced reasoning mode, all of these things. We want

587
00:30:29.319 --> 00:30:32.440
<v Speaker 3>to row them out to you today, starting with the

588
00:30:32.559 --> 00:30:35.240
<v Speaker 3>plus subscribers on x So it's the first group that

589
00:30:35.279 --> 00:30:36.799
<v Speaker 3>will initially get access.

590
00:30:37.160 --> 00:30:38.839
<v Speaker 1>Make sure to update your x.

591
00:30:38.839 --> 00:30:41.119
<v Speaker 3>Up if you want to see all of the advanced capabilities,

592
00:30:41.160 --> 00:30:44.039
<v Speaker 3>because we just released the update now as we're as

593
00:30:44.039 --> 00:30:47.279
<v Speaker 3>we're talking here, and yeah, if you're interested in getting

594
00:30:47.279 --> 00:30:50.119
<v Speaker 3>early access to grock, then sign up for Premium Plus.

595
00:30:50.640 --> 00:30:54.720
<v Speaker 3>And also we're announcing that we're starting a separate subscription

596
00:30:55.000 --> 00:30:57.119
<v Speaker 3>for Grock that we call Super Grock for those who

597
00:30:57.880 --> 00:31:00.359
<v Speaker 3>those real rock fans. That one of the most bands

598
00:31:00.359 --> 00:31:04.039
<v Speaker 3>capabilities and the earliest access to new futures.

599
00:31:04.640 --> 00:31:06.279
<v Speaker 1>So feel free to check that out as well.

600
00:31:06.559 --> 00:31:08.480
<v Speaker 2>This this is for the dedicated grock app and for

601
00:31:08.519 --> 00:31:09.720
<v Speaker 2>the website exactly.

602
00:31:10.160 --> 00:31:13.079
<v Speaker 3>So our new website is called grock dot com. Yeah,

603
00:31:13.119 --> 00:31:17.319
<v Speaker 3>and you're also guess and you can also find our

604
00:31:17.400 --> 00:31:20.400
<v Speaker 3>Brock app in the ir S app Store, and that

605
00:31:20.519 --> 00:31:24.720
<v Speaker 3>gives you a more even more polished experience that's totally

606
00:31:24.720 --> 00:31:27.119
<v Speaker 3>grock focused. If you're if you want to have grock

607
00:31:27.240 --> 00:31:28.920
<v Speaker 3>not easily available one tape away.

608
00:31:29.200 --> 00:31:32.240
<v Speaker 2>Yeah, and the version on grock dot com on you know,

609
00:31:32.279 --> 00:31:34.039
<v Speaker 2>on a web browser is going to be the most

610
00:31:34.079 --> 00:31:36.359
<v Speaker 2>the latest and most advanced version because obviously it takes

611
00:31:36.400 --> 00:31:39.440
<v Speaker 2>us a while to get it get something into an

612
00:31:39.480 --> 00:31:41.200
<v Speaker 2>app and they get it approved by the app store.

613
00:31:41.599 --> 00:31:44.519
<v Speaker 2>So and then if there's something on a phone format

614
00:31:44.680 --> 00:31:46.839
<v Speaker 2>is limitations where you can do so. The most powerful

615
00:31:46.920 --> 00:31:49.480
<v Speaker 2>version of grock and the latest version will be the

616
00:31:49.759 --> 00:31:51.279
<v Speaker 2>web version at rock dot com.

617
00:31:51.400 --> 00:31:53.839
<v Speaker 3>Yeah, so so watch out for the name grock Free

618
00:31:54.079 --> 00:31:57.640
<v Speaker 3>in the app giveaway exactly that that's that's the giveaway

619
00:31:57.640 --> 00:31:59.559
<v Speaker 3>that you have grock free. And if it says grow

620
00:31:59.640 --> 00:32:02.319
<v Speaker 3>true then at grogby hasn't quite arrived for yet, but

621
00:32:02.400 --> 00:32:05.640
<v Speaker 3>we're working hard to brow this out today and then

622
00:32:05.920 --> 00:32:08.119
<v Speaker 3>to even more people over the coming days.

623
00:32:08.240 --> 00:32:11.759
<v Speaker 4>Yeah, make sure you update your phone app too, where

624
00:32:11.799 --> 00:32:14.000
<v Speaker 4>you're going to get all the tools were showcased today

625
00:32:14.319 --> 00:32:18.079
<v Speaker 4>with the thinking mold with the deep search. So yeah,

626
00:32:18.160 --> 00:32:19.839
<v Speaker 4>really looking forward to all the feedbacks you have.

627
00:32:20.200 --> 00:32:24.319
<v Speaker 2>Yeah, I think we should emphasize that this is kind

628
00:32:24.359 --> 00:32:26.799
<v Speaker 2>of a beta like meaning that you should expect some

629
00:32:26.960 --> 00:32:31.359
<v Speaker 2>imperfections at first, but we will improve it rapidly almost

630
00:32:31.400 --> 00:32:31.839
<v Speaker 2>every day.

631
00:32:31.920 --> 00:32:33.759
<v Speaker 1>In fact, every day I think it'll get better.

632
00:32:34.400 --> 00:32:36.480
<v Speaker 2>So if you want a more polished version, i'd like

633
00:32:36.559 --> 00:32:40.240
<v Speaker 2>maybe wait a week, but expect improvements literally every day.

634
00:32:41.000 --> 00:32:44.599
<v Speaker 2>And then we're also going to be providing a voice

635
00:32:44.720 --> 00:32:47.079
<v Speaker 2>you can have conversational. In fact, I was trying it

636
00:32:47.079 --> 00:32:49.279
<v Speaker 2>earlier today. It's working pretty well, but not we need

637
00:32:49.599 --> 00:32:53.240
<v Speaker 2>these a bit more polish, the sort of way where

638
00:32:53.240 --> 00:32:54.720
<v Speaker 2>you can just literally talk to it like you're talking

639
00:32:54.759 --> 00:32:58.000
<v Speaker 2>to a person. That's awesome. It's actually I think one

640
00:32:58.039 --> 00:33:02.079
<v Speaker 2>of the best experiences of Grog. That's that's probably about

641
00:33:02.079 --> 00:33:02.599
<v Speaker 2>a week away.

642
00:33:03.160 --> 00:33:07.319
<v Speaker 3>So with that said, I think we might have some

643
00:33:07.440 --> 00:33:12.519
<v Speaker 3>audience questions. Surely, all right, take a look. Yeah, let's

644
00:33:12.559 --> 00:33:16.640
<v Speaker 3>take a look the audience from the ass platform.

645
00:33:16.880 --> 00:33:17.079
<v Speaker 2>Yeah.

646
00:33:17.680 --> 00:33:17.960
<v Speaker 1>Cool.

647
00:33:18.039 --> 00:33:20.920
<v Speaker 3>So the first question here is when Grock Voice Assistant.

648
00:33:21.039 --> 00:33:23.799
<v Speaker 3>When is it coming out as soon as possible, just

649
00:33:23.920 --> 00:33:26.839
<v Speaker 3>like Elan said, just a little bit of publishing away

650
00:33:26.920 --> 00:33:30.559
<v Speaker 3>from being reached to everybody. Obviously, it's going to be

651
00:33:30.680 --> 00:33:32.960
<v Speaker 3>released in an early form and we're going to rapidliterate

652
00:33:33.440 --> 00:33:33.839
<v Speaker 3>among that.

653
00:33:34.400 --> 00:33:37.039
<v Speaker 4>The next question is like when will GROX three being

654
00:33:37.079 --> 00:33:40.599
<v Speaker 4>the API? So this is coming in the GROG three

655
00:33:40.720 --> 00:33:45.039
<v Speaker 4>API with both the reasing models and deep search.

656
00:33:44.920 --> 00:33:46.480
<v Speaker 1>Is coming away in the coming weeks.

657
00:33:46.759 --> 00:33:49.319
<v Speaker 4>We're actually very excited about the enterprise use cases of

658
00:33:49.400 --> 00:33:51.680
<v Speaker 4>all these additional tools that now Grock has access to,

659
00:33:52.119 --> 00:33:54.359
<v Speaker 4>and how the testim computed and tool use can actually

660
00:33:54.400 --> 00:33:57.720
<v Speaker 4>really accelerate all the business use cases. Another one is

661
00:33:58.119 --> 00:34:00.960
<v Speaker 4>will voice mode be native or a text to speech?

662
00:34:01.160 --> 00:34:02.519
<v Speaker 4>So I think that means is it going to be

663
00:34:02.720 --> 00:34:06.440
<v Speaker 4>one one model that is understanding what you say and

664
00:34:06.519 --> 00:34:08.119
<v Speaker 4>then talking back to you, or is it going to

665
00:34:08.199 --> 00:34:08.400
<v Speaker 4>be some.

666
00:34:08.519 --> 00:34:10.599
<v Speaker 3>System that has text of speech inside of it. And

667
00:34:10.639 --> 00:34:12.639
<v Speaker 3>the good news is it's going to be one model,

668
00:34:12.920 --> 00:34:15.199
<v Speaker 3>like and not a variant of grock Free that we're

669
00:34:15.239 --> 00:34:17.960
<v Speaker 3>going to release, which basically understands what you're saying what

670
00:34:18.039 --> 00:34:22.079
<v Speaker 3>you're saying and then generates the audio directly from that.

671
00:34:22.920 --> 00:34:26.239
<v Speaker 3>So very much like grock Free generates text, that model

672
00:34:26.320 --> 00:34:29.719
<v Speaker 3>generates audio, and that has a bunch of advantagers. I

673
00:34:29.880 --> 00:34:32.800
<v Speaker 3>was talking to it earlier today and it said hi, Igor, no,

674
00:34:33.000 --> 00:34:35.920
<v Speaker 3>reading my name from probably from some texts that it had,

675
00:34:36.599 --> 00:34:39.519
<v Speaker 3>And I said, no, my name is Igor, and it

676
00:34:39.639 --> 00:34:42.719
<v Speaker 3>remember that, you know, so it could continue to say Igor,

677
00:34:43.320 --> 00:34:45.079
<v Speaker 3>just like a human world, and you can't.

678
00:34:44.920 --> 00:34:46.559
<v Speaker 1>Achieve that with Texas speech.

679
00:34:46.719 --> 00:34:51.079
<v Speaker 4>So yeah, so here's a question for you, pretty spicy,

680
00:34:52.320 --> 00:34:55.360
<v Speaker 4>you know, is Grog a boy or a girl?

681
00:34:56.239 --> 00:34:58.840
<v Speaker 1>And they think Grog is whatever you wanted to be? Yah?

682
00:34:59.119 --> 00:35:06.320
<v Speaker 2>Yeah, single, yes, all right, the shop is open. So honestly,

683
00:35:06.400 --> 00:35:07.840
<v Speaker 2>people are going to pull in love with Croc. It's

684
00:35:08.079 --> 00:35:10.280
<v Speaker 2>it's like probable.

685
00:35:10.920 --> 00:35:11.039
<v Speaker 1>Uh.

686
00:35:11.400 --> 00:35:14.000
<v Speaker 4>The next question, will Grock be able to transcribe audio

687
00:35:14.079 --> 00:35:17.119
<v Speaker 4>into text? Yes, so we'll have this capability in both

688
00:35:17.159 --> 00:35:20.639
<v Speaker 4>the app and also the API without that's like Groschia

689
00:35:20.800 --> 00:35:23.480
<v Speaker 4>just be your personal assistant looking over your shoulder, right

690
00:35:23.800 --> 00:35:26.159
<v Speaker 4>and follow you along the way, learn everything you have learned,

691
00:35:26.400 --> 00:35:28.679
<v Speaker 4>and really help you to understand the world better.

692
00:35:28.719 --> 00:35:29.760
<v Speaker 1>It becomes smodery every day.

693
00:35:30.239 --> 00:35:33.880
<v Speaker 2>Yeah, I mean the voicematter Groc doesn't isn't simply it's

694
00:35:33.920 --> 00:35:38.039
<v Speaker 2>not just voice text. It understands like tone, inflection, pacing, everything.

695
00:35:38.079 --> 00:35:42.119
<v Speaker 1>It's it's wild. I mean, it's like token a person. Yep.

696
00:35:42.239 --> 00:35:46.079
<v Speaker 4>So any plans for conversation memory, absolutely, we're working on

697
00:35:46.159 --> 00:35:47.039
<v Speaker 4>it right now.

698
00:35:47.639 --> 00:35:48.079
<v Speaker 1>That's right.

699
00:35:50.440 --> 00:35:55.360
<v Speaker 4>Let's see without the other ones. So what about the

700
00:35:56.159 --> 00:35:58.000
<v Speaker 4>you know, the DM features? Right, So if you have

701
00:35:58.079 --> 00:36:02.519
<v Speaker 4>personalization said that, if you uh, you know Grock remembers

702
00:36:02.960 --> 00:36:06.320
<v Speaker 4>your previous interactions, yes, Should it be one Groc or

703
00:36:06.679 --> 00:36:07.599
<v Speaker 4>multiple different.

704
00:36:07.360 --> 00:36:10.000
<v Speaker 2>Grocks, It's up to you. You can have one Grok

705
00:36:10.079 --> 00:36:14.079
<v Speaker 2>or many Grons. I suspect people will probably have one one.

706
00:36:14.920 --> 00:36:18.800
<v Speaker 1>Yeah, I won't have a doctor Groc. Yeah, the Grouk doc.

707
00:36:19.360 --> 00:36:21.760
<v Speaker 1>That's right, right, cool.

708
00:36:22.679 --> 00:36:25.880
<v Speaker 3>So in the past we've open sourced grock one, so

709
00:36:26.119 --> 00:36:28.039
<v Speaker 3>somebody is asking us we're going to do it again?

710
00:36:29.599 --> 00:36:33.239
<v Speaker 2>Yeah, I think when once scrut A general approach is

711
00:36:33.320 --> 00:36:36.519
<v Speaker 2>that we will open source the last version when the

712
00:36:36.599 --> 00:36:40.159
<v Speaker 2>next version is fully out. It's like when when GROG

713
00:36:40.199 --> 00:36:45.119
<v Speaker 2>three is mature and stable, which is probably within a

714
00:36:45.199 --> 00:36:47.559
<v Speaker 2>few months, then will open source GROUG two.

715
00:36:48.599 --> 00:36:51.599
<v Speaker 4>Mm hmm, okay, so we probably have time for one

716
00:36:51.679 --> 00:36:56.199
<v Speaker 4>last question. What was the most difficult part about working

717
00:36:56.239 --> 00:37:00.280
<v Speaker 4>on this project? I assume Grock three and the world

718
00:37:00.280 --> 00:37:03.880
<v Speaker 4>I'm most excited about, so I think me looking back,

719
00:37:04.280 --> 00:37:07.679
<v Speaker 4>you know, getting the whole model training on one hundred

720
00:37:07.760 --> 00:37:12.199
<v Speaker 4>k h one hundred coherently, that's almost a battle against

721
00:37:12.239 --> 00:37:15.199
<v Speaker 4>the final boss of the universe, the entropy, Because at

722
00:37:15.239 --> 00:37:17.440
<v Speaker 4>a given time, you can have a cosmic rate that

723
00:37:17.559 --> 00:37:20.400
<v Speaker 4>beaming downe and flip a bit you know, transistor and

724
00:37:20.519 --> 00:37:23.840
<v Speaker 4>not the entire graded update. If it's fit mentis a bit,

725
00:37:24.280 --> 00:37:26.519
<v Speaker 4>the entire grade update is out of whack.

726
00:37:27.199 --> 00:37:28.599
<v Speaker 1>And nine hundred thousand of those.

727
00:37:28.960 --> 00:37:33.280
<v Speaker 4>Orchestrate them every time at any given time, and GPS

728
00:37:33.320 --> 00:37:34.760
<v Speaker 4>can go down, and.

729
00:37:35.960 --> 00:37:37.559
<v Speaker 2>I mean it's worth breaking down, like how were we

730
00:37:37.639 --> 00:37:41.239
<v Speaker 2>able to get the world's most powerful training cluster operational

731
00:37:41.599 --> 00:37:45.119
<v Speaker 2>within one hundred and twenty two days because we started

732
00:37:45.119 --> 00:37:49.320
<v Speaker 2>off we we actually weren't intending to do a data

733
00:37:49.360 --> 00:37:51.599
<v Speaker 2>center ourselves. We were going to just we went to

734
00:37:51.679 --> 00:37:54.280
<v Speaker 2>the data center providers and said, how long would it

735
00:37:54.320 --> 00:37:59.159
<v Speaker 2>take to have one hundred thousand GPUs operating coherently in

736
00:37:59.239 --> 00:38:02.400
<v Speaker 2>a single location, And we've got time frames from eighteen

737
00:38:02.440 --> 00:38:05.559
<v Speaker 2>to twenty four months, so like, well, eighteen twenty four months,

738
00:38:05.679 --> 00:38:08.719
<v Speaker 2>that means losing as a certainty. So the only option

739
00:38:08.920 --> 00:38:11.880
<v Speaker 2>was to do it ourselves. So if you break down

740
00:38:11.920 --> 00:38:14.960
<v Speaker 2>the problem, you guest doing like reasoning here, it takes

741
00:38:15.000 --> 00:38:19.920
<v Speaker 2>your thing. Yeah, exactly. So what we needed a building.

742
00:38:20.159 --> 00:38:21.880
<v Speaker 2>We can't build a building, so we must use an

743
00:38:21.880 --> 00:38:22.519
<v Speaker 2>existing building.

744
00:38:23.239 --> 00:38:27.239
<v Speaker 1>So we looked for basically for factories that had been.

745
00:38:29.440 --> 00:38:31.960
<v Speaker 2>That had been abandoned, but the factory was in good shape,

746
00:38:32.000 --> 00:38:33.480
<v Speaker 2>like the company had gone bankrupt to something.

747
00:38:33.519 --> 00:38:36.519
<v Speaker 1>So we found an electro Luxe factory in Memphis.

748
00:38:36.559 --> 00:38:40.760
<v Speaker 2>That's why it's in Memphis, home of Elvis and also

749
00:38:40.840 --> 00:38:42.679
<v Speaker 2>one of the oldest I think it was the capital

750
00:38:42.719 --> 00:38:47.679
<v Speaker 2>of ancient Egypt, and it was actually very nice factory

751
00:38:48.400 --> 00:38:52.679
<v Speaker 2>that forever for whatever reason that Electrolux had left and

752
00:38:54.400 --> 00:38:57.559
<v Speaker 2>that that gave us shelter for the computers. Then we

753
00:38:57.639 --> 00:39:02.119
<v Speaker 2>needed power that we needed at least one hundred and

754
00:39:02.119 --> 00:39:04.800
<v Speaker 2>twenty megawats at first, but the building only had fifteen megawats,

755
00:39:04.840 --> 00:39:07.639
<v Speaker 2>and ultimately for two hundred thousand megree thousand GPUs we

756
00:39:07.719 --> 00:39:13.400
<v Speaker 2>needed a quarter gigawat, so we they initially leased a

757
00:39:13.440 --> 00:39:16.519
<v Speaker 2>whole bunch of generators, so we have generators on one

758
00:39:16.559 --> 00:39:20.320
<v Speaker 2>side of the building, just trailer after trailer of generators

759
00:39:20.639 --> 00:39:23.519
<v Speaker 2>until we get the utility power to come in. But

760
00:39:23.639 --> 00:39:25.679
<v Speaker 2>then we also need cooling, so on the other side

761
00:39:25.679 --> 00:39:28.639
<v Speaker 2>of the building it was just trailer after trailer of cooling,

762
00:39:28.800 --> 00:39:30.599
<v Speaker 2>So we leased about a quarter of the mobile cooling

763
00:39:30.639 --> 00:39:33.199
<v Speaker 2>capacity of the United States on the other side of

764
00:39:33.199 --> 00:39:36.519
<v Speaker 2>the building. Then we needed to get the GPUs all installed,

765
00:39:36.519 --> 00:39:38.559
<v Speaker 2>and they're all liquid cooled, so in order to achieve

766
00:39:38.599 --> 00:39:41.480
<v Speaker 2>the density necessary, this is a liquid cooled system.

767
00:39:41.559 --> 00:39:43.760
<v Speaker 1>So we had to get all the plumbing for liquid cooling.

768
00:39:44.079 --> 00:39:47.119
<v Speaker 2>Now, we had ever done a liquid cooling data center

769
00:39:47.159 --> 00:39:51.360
<v Speaker 2>at scale, so this was an incredibly dedicated effort by

770
00:39:51.400 --> 00:39:53.360
<v Speaker 2>a very talented team to achieve that outcome.

771
00:39:55.000 --> 00:39:56.679
<v Speaker 1>I may think, now it's going to work. Nope.

772
00:39:57.840 --> 00:40:02.079
<v Speaker 2>The issue is that the the power fluctuations for a

773
00:40:02.159 --> 00:40:07.039
<v Speaker 2>GPU cluster are dramatic. So it's like this giant symphony

774
00:40:07.159 --> 00:40:11.599
<v Speaker 2>that has taking place, like having a symphony with one

775
00:40:11.679 --> 00:40:15.000
<v Speaker 2>hundred thousand or two hundred thousand participants in the symphony,

776
00:40:15.039 --> 00:40:18.599
<v Speaker 2>and the whole orchestra will go quiet and loud in

777
00:40:19.360 --> 00:40:22.679
<v Speaker 2>you know, one hundred milliseconds, and so this caused massive

778
00:40:22.920 --> 00:40:27.480
<v Speaker 2>power fluctuations. So then which then caused the generators to

779
00:40:27.599 --> 00:40:28.880
<v Speaker 2>lose their minds and they.

780
00:40:28.840 --> 00:40:29.679
<v Speaker 1>Weren't expecting this.

781
00:40:30.320 --> 00:40:34.960
<v Speaker 2>So to buffer the power, we then used Tesla megapas

782
00:40:35.719 --> 00:40:37.480
<v Speaker 2>to smooth out the power.

783
00:40:38.119 --> 00:40:40.800
<v Speaker 1>So the mega packs had to be reprogrammed.

784
00:40:41.360 --> 00:40:44.679
<v Speaker 2>So with the XAI we were working with Teesla, we

785
00:40:44.800 --> 00:40:47.719
<v Speaker 2>reprogrammed the megapacs to be able to deal with these

786
00:40:48.000 --> 00:40:51.360
<v Speaker 2>dramatic power fluctuation fluctuations to smooth out the powers that

787
00:40:51.519 --> 00:40:56.239
<v Speaker 2>the computers could actually run properly, and that worked.

788
00:40:57.119 --> 00:41:00.440
<v Speaker 1>It quite tricky and then but.

789
00:41:00.559 --> 00:41:02.920
<v Speaker 2>Even at that point, just left to make the computers

790
00:41:02.960 --> 00:41:05.840
<v Speaker 2>all communicate effectively, so all the networking had to be

791
00:41:05.960 --> 00:41:13.760
<v Speaker 2>solved and debugging Brazilian network cables a debugging Nickel at

792
00:41:13.840 --> 00:41:16.679
<v Speaker 2>four in the morning, or we solved it like roughly

793
00:41:16.719 --> 00:41:21.119
<v Speaker 2>four twenty am. Was well figured out, Like there's some well,

794
00:41:21.119 --> 00:41:22.760
<v Speaker 2>there are a whole bunch of issues. One there was

795
00:41:22.800 --> 00:41:24.599
<v Speaker 2>like a bios mismatch.

796
00:41:25.199 --> 00:41:26.599
<v Speaker 1>Bios was not set up correctly.

797
00:41:28.800 --> 00:41:35.480
<v Speaker 3>We HADSPCI outputs between two different machines. One that was working, yeah,

798
00:41:35.559 --> 00:41:37.679
<v Speaker 3>one that was not working. Many many other things.

799
00:41:38.000 --> 00:41:38.760
<v Speaker 1>I mean, yeah, exactly.

800
00:41:38.840 --> 00:41:39.960
<v Speaker 2>This would go on for a long time if we

801
00:41:40.039 --> 00:41:42.599
<v Speaker 2>actually listed all the things. But it's like interesting, It's

802
00:41:42.639 --> 00:41:45.079
<v Speaker 2>not like, oh, we just magically made it happen. You

803
00:41:45.159 --> 00:41:47.559
<v Speaker 2>had to break down the problem, just like groctas for reasoning,

804
00:41:48.079 --> 00:41:50.039
<v Speaker 2>into the constituent elements, and then solve each of the

805
00:41:50.079 --> 00:41:54.280
<v Speaker 2>constituent elements in order to achieve a coherent training cluster

806
00:41:54.920 --> 00:41:57.119
<v Speaker 2>in a period of time that is a small fraction

807
00:41:57.239 --> 00:41:58.800
<v Speaker 2>of what anyone else could do it in.

808
00:41:59.159 --> 00:42:01.119
<v Speaker 3>And then once the train cluster was up and running

809
00:42:01.159 --> 00:42:02.639
<v Speaker 3>and we could use it, or we had to make

810
00:42:02.679 --> 00:42:04.960
<v Speaker 3>sure that it actually stays healthy throughout, which is his

811
00:42:05.079 --> 00:42:08.000
<v Speaker 3>own brand challenge. And then we had to get every

812
00:42:08.079 --> 00:42:11.000
<v Speaker 3>single detail of the training right in order to get

813
00:42:11.039 --> 00:42:14.079
<v Speaker 3>a rookery level model, which is actually really really hard.

814
00:42:14.360 --> 00:42:17.039
<v Speaker 3>So we don't know if there are any other models

815
00:42:17.039 --> 00:42:20.679
<v Speaker 3>out there that have Rockery's capabilities, but whoever trains a

816
00:42:20.719 --> 00:42:22.840
<v Speaker 3>model better than rock Crey has to be extremely good

817
00:42:22.880 --> 00:42:26.480
<v Speaker 3>at the science of deeplining at every aspect of the engineering.

818
00:42:26.920 --> 00:42:28.920
<v Speaker 1>So it's it's not so easy to pull this off.

819
00:42:29.360 --> 00:42:30.920
<v Speaker 1>And this is now going to be the last cluster

820
00:42:31.039 --> 00:42:32.719
<v Speaker 1>were build and last model we train.

821
00:42:33.519 --> 00:42:36.000
<v Speaker 2>Oh yeah, we're We've already started work on the next cluster,

822
00:42:36.679 --> 00:42:39.800
<v Speaker 2>which will be about five times to power, so instead

823
00:42:39.800 --> 00:42:42.639
<v Speaker 2>of a quarter gigawad, roughly one point to giga.

824
00:42:42.679 --> 00:42:47.000
<v Speaker 1>What what's the what's the back to the future was?

825
00:42:47.679 --> 00:42:48.199
<v Speaker 1>What's the power?

826
00:42:49.440 --> 00:42:52.000
<v Speaker 2>Does the back to the Future car anyway back to

827
00:42:52.039 --> 00:42:55.239
<v Speaker 2>the future power? It's like roughly in that order, I think. So,

828
00:42:56.480 --> 00:42:58.440
<v Speaker 2>you know, there will be the sort of the GB

829
00:42:58.519 --> 00:43:02.280
<v Speaker 2>two hundred slash three hundred pleasure once again. It will

830
00:43:02.280 --> 00:43:04.199
<v Speaker 2>be the most powerful training cluster from the world. So

831
00:43:04.280 --> 00:43:05.800
<v Speaker 2>we're not like stopping here, and.

832
00:43:05.880 --> 00:43:08.800
<v Speaker 4>Our reason model is going to continue improve by accessing

833
00:43:08.880 --> 00:43:11.840
<v Speaker 4>more tools every day. So yeah, we're very excited to

834
00:43:12.320 --> 00:43:14.480
<v Speaker 4>share any of that coming results with you all.

835
00:43:14.960 --> 00:43:17.719
<v Speaker 3>Yeah, the thing that keeps us going is basically being

836
00:43:17.760 --> 00:43:20.239
<v Speaker 3>able to give free to you and then seeing the

837
00:43:20.320 --> 00:43:23.440
<v Speaker 3>usage go up, seeing everybody enjoy no clock.

838
00:43:23.599 --> 00:43:26.199
<v Speaker 1>That's that's what really gets us up in the morning.

839
00:43:26.400 --> 00:43:28.639
<v Speaker 1>So thanks for your name. Thanks guys,
