WEBVTT

1
00:00:00.080 --> 00:00:02.640
<v Speaker 1>You know, usually when we talk about a medical diagnosis,

2
00:00:02.640 --> 00:00:06.200
<v Speaker 1>there's this expectation of like pure precision.

3
00:00:06.280 --> 00:00:08.400
<v Speaker 2>Oh totally, it feels like engineering.

4
00:00:07.960 --> 00:00:10.839
<v Speaker 1>Right exactly, like you break your arm, the X ray

5
00:00:10.960 --> 00:00:13.720
<v Speaker 1>shows that jagged white line and the doctor just points

6
00:00:13.720 --> 00:00:14.800
<v Speaker 1>and says, well, there it is.

7
00:00:15.160 --> 00:00:19.480
<v Speaker 2>It's binary broken or not broken. It's clean, and honestly,

8
00:00:19.559 --> 00:00:21.160
<v Speaker 2>it's very comforting for patients.

9
00:00:21.160 --> 00:00:23.600
<v Speaker 1>We like things to be visible, easily categorized.

10
00:00:23.640 --> 00:00:24.359
<v Speaker 2>Yeah, we really do.

11
00:00:24.839 --> 00:00:29.039
<v Speaker 1>But then you step into the world of analyzing the

12
00:00:29.039 --> 00:00:32.119
<v Speaker 1>the microscopic landscape of our own blood, or you try

13
00:00:32.200 --> 00:00:37.920
<v Speaker 1>to extract a single meaningful diagnostic fact from thousands of

14
00:00:37.960 --> 00:00:39.840
<v Speaker 1>pages of dense medical texts.

15
00:00:39.880 --> 00:00:41.640
<v Speaker 2>Oh yeah, that's a nightmare.

16
00:00:41.439 --> 00:00:44.439
<v Speaker 1>And suddenly that clean X ray machine is useless. We're

17
00:00:44.439 --> 00:00:47.560
<v Speaker 1>looking at a data landscape that is murky. It's chaotic.

18
00:00:47.719 --> 00:00:50.960
<v Speaker 2>It is the absolute definition of diagnostic muddy waters.

19
00:00:51.000 --> 00:00:53.520
<v Speaker 1>Okay, let's unpack this, because if you were trying to

20
00:00:53.560 --> 00:00:57.039
<v Speaker 1>make sense of this modern avalanche of information, you need

21
00:00:57.079 --> 00:01:00.560
<v Speaker 1>to understand how the tools are evolving today. Our grounding

22
00:01:00.600 --> 00:01:04.000
<v Speaker 1>material is a really fascinating collection of research from the

23
00:01:04.000 --> 00:01:09.400
<v Speaker 1>proceedings of the International Conference on Data Science, Computation and.

24
00:01:09.319 --> 00:01:13.719
<v Speaker 2>Security, which is a really long title, Yeah.

25
00:01:13.480 --> 00:01:16.959
<v Speaker 1>It is. Specifically, we're looking at the IDSCS twenty twenty one.

26
00:01:16.920 --> 00:01:20.159
<v Speaker 2>Papers, right, and this collection is really a treasure trove

27
00:01:20.400 --> 00:01:25.519
<v Speaker 2>of how modern computational techniques are being applied to incredibly messy,

28
00:01:25.799 --> 00:01:27.760
<v Speaker 2>real world problems exactly.

29
00:01:28.079 --> 00:01:30.159
<v Speaker 1>And our mission today on this deep dive is to

30
00:01:30.200 --> 00:01:34.439
<v Speaker 1>explore how cutting edge data science is solving three massive,

31
00:01:34.519 --> 00:01:35.560
<v Speaker 1>interconnected challenges.

32
00:01:35.719 --> 00:01:37.439
<v Speaker 2>Yeah, three really big ones.

33
00:01:37.519 --> 00:01:39.840
<v Speaker 1>We are going to look at how algorithms are finally

34
00:01:39.920 --> 00:01:43.560
<v Speaker 1>learning to digest human knowledge without losing the meaning crucial step,

35
00:01:44.000 --> 00:01:47.319
<v Speaker 1>how they are identifying microscopic threats hiding in our bloodstreams,

36
00:01:47.359 --> 00:01:49.200
<v Speaker 1>and crucially, how they are figuring out how to do

37
00:01:49.239 --> 00:01:53.000
<v Speaker 1>all of this without demanding access to our private raw data.

38
00:01:53.040 --> 00:01:55.519
<v Speaker 2>Because at the core of all these papers is a single,

39
00:01:55.640 --> 00:01:59.640
<v Speaker 2>unifying mathematical challenge, which is extracting the vital signal from

40
00:01:59.680 --> 00:02:01.000
<v Speaker 2>an over welming amount of noise.

41
00:02:01.319 --> 00:02:04.439
<v Speaker 1>Well, let's start with the loudest noise of all, which

42
00:02:04.480 --> 00:02:07.480
<v Speaker 1>is a problem I know you the listener deal with

43
00:02:07.879 --> 00:02:08.840
<v Speaker 1>every single day.

44
00:02:09.080 --> 00:02:10.680
<v Speaker 2>Oh, information overload.

45
00:02:11.000 --> 00:02:16.919
<v Speaker 1>Yes, we are drowning in text emails, one hundred page reports, articles,

46
00:02:17.039 --> 00:02:18.000
<v Speaker 1>research papers.

47
00:02:18.199 --> 00:02:18.919
<v Speaker 2>It's endless.

48
00:02:19.439 --> 00:02:22.400
<v Speaker 1>The traditional AI summary tools we've been using for the

49
00:02:22.479 --> 00:02:26.560
<v Speaker 1>last few years often feel frankly kind of dumb.

50
00:02:26.919 --> 00:02:31.400
<v Speaker 2>Why is that, Well, because traditional tech summarization algorithms often

51
00:02:31.479 --> 00:02:34.719
<v Speaker 2>just skim the surface. Okay, The core problem is their methodology.

52
00:02:34.800 --> 00:02:38.680
<v Speaker 2>They essentially look for frequently repeated words and just grab

53
00:02:38.719 --> 00:02:40.280
<v Speaker 2>the sentences containing them.

54
00:02:40.159 --> 00:02:41.960
<v Speaker 1>So they're just like scanning.

55
00:02:41.520 --> 00:02:44.159
<v Speaker 2>For matches, right, But by doing that, they lose the

56
00:02:44.159 --> 00:02:47.319
<v Speaker 2>crucial context. They miss the nuances I see, and they

57
00:02:47.360 --> 00:02:52.039
<v Speaker 2>frequently drop highly useful specialized entities that maybe only appear

58
00:02:52.120 --> 00:02:54.479
<v Speaker 2>once or twice, but they change the entire meaning of

59
00:02:54.479 --> 00:02:57.800
<v Speaker 2>a paragraph. They don't actually understand what they are summarizing.

60
00:02:57.800 --> 00:02:58.840
<v Speaker 2>They're literally discounting.

61
00:02:59.000 --> 00:03:02.919
<v Speaker 1>So instead of just reading textbook and blindly highlighting repeated words,

62
00:03:03.400 --> 00:03:07.240
<v Speaker 1>it's like a stressed college student highlighting the word mitochondria

63
00:03:07.599 --> 00:03:09.639
<v Speaker 1>fifty times without knowing what it does.

64
00:03:10.319 --> 00:03:13.960
<v Speaker 2>That is a very accurate, albeit depressing analogy.

65
00:03:14.280 --> 00:03:16.319
<v Speaker 1>Yeah. So the first paper we are looking at today,

66
00:03:16.680 --> 00:03:21.120
<v Speaker 1>by Sithan Seeing in Drundiepak proposes a knowledge centric semantic approach.

67
00:03:21.280 --> 00:03:22.120
<v Speaker 2>Yeah they do.

68
00:03:22.280 --> 00:03:24.439
<v Speaker 1>How does this actually fix the counting problem?

69
00:03:24.560 --> 00:03:27.479
<v Speaker 2>By building what they call a term based ontology model.

70
00:03:27.639 --> 00:03:29.319
<v Speaker 1>Okay, term based ontology.

71
00:03:29.400 --> 00:03:33.199
<v Speaker 2>Yeah, let's break down how this algorithm actually reads. After

72
00:03:33.240 --> 00:03:33.960
<v Speaker 2>it cleans up the.

73
00:03:33.919 --> 00:03:37.479
<v Speaker 1>Text, like removing basic stop words and punctuations.

74
00:03:36.759 --> 00:03:40.199
<v Speaker 2>Exactly after that, it applies something called TFIDF.

75
00:03:40.360 --> 00:03:44.199
<v Speaker 1>Okay, what is tf IDF because that sounds like, I

76
00:03:44.199 --> 00:03:46.400
<v Speaker 1>don't know, heavy military jargon.

77
00:03:46.560 --> 00:03:50.479
<v Speaker 2>It does. It stands for term frequency inverse document frequency.

78
00:03:50.599 --> 00:03:51.520
<v Speaker 1>It is a mouthful.

79
00:03:51.719 --> 00:03:53.680
<v Speaker 2>It is. Think of it as a way to weigh

80
00:03:53.680 --> 00:03:55.919
<v Speaker 2>the importance of a word. It looks at the most

81
00:03:55.960 --> 00:03:59.520
<v Speaker 2>frequent words in a specific document. Okay, but it compares

82
00:03:59.560 --> 00:04:02.719
<v Speaker 2>that to how rare those words are across a massive

83
00:04:02.919 --> 00:04:04.520
<v Speaker 2>general corpus of text.

84
00:04:04.680 --> 00:04:05.919
<v Speaker 1>Wait, can you give me an example.

85
00:04:06.240 --> 00:04:09.319
<v Speaker 2>Sure, if the word blood appears fifty times in a

86
00:04:09.319 --> 00:04:14.039
<v Speaker 2>medical paper but rarely in general English, TFIDF flags it

87
00:04:14.080 --> 00:04:17.639
<v Speaker 2>as a highly unique identifier for that specific text.

88
00:04:17.839 --> 00:04:21.240
<v Speaker 1>Okay, So it finds the unique keywords, but how does

89
00:04:21.240 --> 00:04:22.000
<v Speaker 1>it know what they mean?

90
00:04:22.360 --> 00:04:25.040
<v Speaker 2>Well, this is where it gets semantic. It takes these

91
00:04:25.079 --> 00:04:29.560
<v Speaker 2>extracted features and cross references them with external knowledge sources

92
00:04:29.879 --> 00:04:31.839
<v Speaker 2>like what specifically wikidata?

93
00:04:32.199 --> 00:04:34.120
<v Speaker 1>Oh wow, Yeah, it.

94
00:04:34.160 --> 00:04:37.199
<v Speaker 2>Is actively looking up the concepts it finds to build

95
00:04:37.279 --> 00:04:38.879
<v Speaker 2>a domain based ontology.

96
00:04:39.079 --> 00:04:42.319
<v Speaker 1>So like a literal mathematical map of how all these

97
00:04:42.439 --> 00:04:45.079
<v Speaker 1>terms relate to each other in the real world exactly,

98
00:04:45.120 --> 00:04:47.279
<v Speaker 1>So it's not just looking at the document in a vacuum.

99
00:04:47.480 --> 00:04:50.759
<v Speaker 1>It's using wikidata to build a web of meaning, linking

100
00:04:50.839 --> 00:04:52.800
<v Speaker 1>terms together before it even tries to summarize.

101
00:04:52.839 --> 00:04:55.480
<v Speaker 2>You've got it. But then comes the hard part, which

102
00:04:55.519 --> 00:04:58.319
<v Speaker 2>is deciding which sentences to actually keep and which to

103
00:04:58.360 --> 00:05:00.920
<v Speaker 2>throw away. Right, this is where they bring in heavy

104
00:05:00.959 --> 00:05:03.720
<v Speaker 2>duty statistical tools, starting with cross entropy.

105
00:05:03.759 --> 00:05:07.279
<v Speaker 1>All right, slow down? What is cross entropy in plain English?

106
00:05:07.360 --> 00:05:07.560
<v Speaker 2>Right?

107
00:05:07.600 --> 00:05:07.959
<v Speaker 1>Sorry?

108
00:05:08.160 --> 00:05:12.279
<v Speaker 2>In information theory, cross entropy essentially measures the difference between

109
00:05:12.399 --> 00:05:14.279
<v Speaker 2>two probability distributions.

110
00:05:14.560 --> 00:05:16.199
<v Speaker 1>Still a bit technical, okay.

111
00:05:16.279 --> 00:05:19.600
<v Speaker 2>In the context of reading text, the algorithm is calculating

112
00:05:19.600 --> 00:05:21.879
<v Speaker 2>the surprise factor of a new sentence.

113
00:05:22.160 --> 00:05:24.199
<v Speaker 1>The surprise factor, Yeah.

114
00:05:23.800 --> 00:05:27.240
<v Speaker 2>It's mathematically asking, based on the web of meaning I've

115
00:05:27.279 --> 00:05:30.800
<v Speaker 2>already built from the previous sentences, how much genuinely new

116
00:05:30.959 --> 00:05:34.720
<v Speaker 2>surprising information does this next sentence actually give me?

117
00:05:34.959 --> 00:05:37.519
<v Speaker 1>Well, that is brilliant, it really is. So instead of

118
00:05:37.560 --> 00:05:40.720
<v Speaker 1>just reading blindly, this algorithm acts like a genius friend

119
00:05:40.720 --> 00:05:43.720
<v Speaker 1>who actually understands the meaning of the words. If the

120
00:05:43.759 --> 00:05:46.959
<v Speaker 1>cross entropy is low, it means the algorithm isn't surprised

121
00:05:47.000 --> 00:05:47.279
<v Speaker 1>at all.

122
00:05:47.360 --> 00:05:51.120
<v Speaker 2>Precisely, it already knows this information. So the sentence is redundant.

123
00:05:50.759 --> 00:05:53.199
<v Speaker 1>And it mathematically proves which sentences are redundant.

124
00:05:53.360 --> 00:05:56.639
<v Speaker 2>Yes, And to further eliminate redundancy, it pairs this with

125
00:05:56.959 --> 00:06:02.120
<v Speaker 2>NPMI or normalize point wise mutual information alongside ENOVA.

126
00:06:02.279 --> 00:06:03.879
<v Speaker 1>Okay, NPMI, what does that do?

127
00:06:04.120 --> 00:06:07.839
<v Speaker 2>NPMI looks at cooccurrence. If two concepts, say interest rates

128
00:06:07.839 --> 00:06:10.920
<v Speaker 2>and inflation almost always show up together in the text,

129
00:06:11.319 --> 00:06:13.079
<v Speaker 2>NPMI flags that strong.

130
00:06:12.839 --> 00:06:15.240
<v Speaker 1>Relationship makes sense and ANOVA.

131
00:06:15.319 --> 00:06:19.879
<v Speaker 2>The algorithm then uses an analysis of variance or ANOVA

132
00:06:20.319 --> 00:06:24.399
<v Speaker 2>to generate statistical P values for these term relationships.

133
00:06:24.519 --> 00:06:27.560
<v Speaker 1>So it's assigning a strict mathematical grade to every single

134
00:06:27.600 --> 00:06:28.680
<v Speaker 1>word relationship.

135
00:06:28.800 --> 00:06:30.720
<v Speaker 2>Yes, and the grading is ruthless.

136
00:06:31.000 --> 00:06:32.480
<v Speaker 1>We really, Oh yeah.

137
00:06:32.680 --> 00:06:37.000
<v Speaker 2>The system group's sentences based on these P values. It

138
00:06:37.120 --> 00:06:41.120
<v Speaker 2>uses a strict threshold, like a cutoff point. Exactly, if

139
00:06:41.160 --> 00:06:43.920
<v Speaker 2>the calculated value of cross entropy and the intersection of

140
00:06:43.959 --> 00:06:47.639
<v Speaker 2>those NPMI scores is less than point five, that sentence

141
00:06:47.680 --> 00:06:49.040
<v Speaker 2>is entirely eliminated.

142
00:06:49.240 --> 00:06:50.399
<v Speaker 1>Just gone, gone.

143
00:06:50.519 --> 00:06:53.839
<v Speaker 2>It is mathematically proving that the sentence adds no new

144
00:06:53.959 --> 00:06:55.040
<v Speaker 2>value to the summary.

145
00:06:55.439 --> 00:06:58.000
<v Speaker 1>But wait, if you mathematically chop up a fifty page

146
00:06:58.040 --> 00:07:01.519
<v Speaker 1>document based on variance and entropy. The resulting summary might

147
00:07:01.560 --> 00:07:03.839
<v Speaker 1>contain the right facts, but it's going to sound like

148
00:07:03.879 --> 00:07:06.079
<v Speaker 1>a glitching robot trying to speak English.

149
00:07:05.839 --> 00:07:07.079
<v Speaker 2>Right, it would be super disjointed.

150
00:07:07.040 --> 00:07:09.639
<v Speaker 1>Sentences will just abruptly smash into each other, which is

151
00:07:09.680 --> 00:07:12.839
<v Speaker 1>exactly why the authors included a final polishing step using

152
00:07:12.879 --> 00:07:15.040
<v Speaker 1>two distinct agents. Oh they fixed the flow.

153
00:07:15.279 --> 00:07:18.360
<v Speaker 2>Yes, First, a lexical agent using word net two point

154
00:07:18.480 --> 00:07:21.279
<v Speaker 2>zero steps in what's word It acts like a massive

155
00:07:21.319 --> 00:07:26.160
<v Speaker 2>conceptual dictionary to ensure the vocabulary transitions naturally and captures

156
00:07:26.199 --> 00:07:27.120
<v Speaker 2>the right lexims.

157
00:07:27.279 --> 00:07:28.879
<v Speaker 1>Okay, that helps the vocabulary.

158
00:07:28.920 --> 00:07:32.240
<v Speaker 2>And then a grammatical agent restructures the phrasing to fix

159
00:07:32.279 --> 00:07:35.879
<v Speaker 2>the grammatical errors that inevitably happen when you stitch disparate

160
00:07:35.920 --> 00:07:36.800
<v Speaker 2>sentences together.

161
00:07:37.360 --> 00:07:40.560
<v Speaker 1>So what were the actual results of this highly semantic,

162
00:07:40.639 --> 00:07:42.360
<v Speaker 1>mathematically ruthless approach.

163
00:07:42.480 --> 00:07:45.439
<v Speaker 2>Well, they tested this on the DUC two thousand and

164
00:07:45.480 --> 00:07:46.319
<v Speaker 2>seven data.

165
00:07:46.120 --> 00:07:47.759
<v Speaker 1>Set, which is what like a benchmark.

166
00:07:47.920 --> 00:07:51.600
<v Speaker 2>Yeah, it's a standard academic benchmark containing hundreds of documents

167
00:07:51.600 --> 00:07:55.839
<v Speaker 2>with manually created, human written summaries to test algorithms against. Okay,

168
00:07:55.959 --> 00:07:59.040
<v Speaker 2>kind of, The sing and Deepak model achieved an F

169
00:07:59.120 --> 00:08:01.600
<v Speaker 2>measure of eighty eight point twenty percent.

170
00:08:01.759 --> 00:08:03.480
<v Speaker 1>Wow, that's high it is, And.

171
00:08:03.439 --> 00:08:06.279
<v Speaker 2>Perhaps more importantly, a false negative rate of just zero

172
00:08:06.279 --> 00:08:07.160
<v Speaker 2>point one four.

173
00:08:07.360 --> 00:08:10.680
<v Speaker 1>Wait point one four, that's tiny. Let's translate that false

174
00:08:10.720 --> 00:08:13.279
<v Speaker 1>negative rate into the real world. Okay, A false negative

175
00:08:13.279 --> 00:08:15.800
<v Speaker 1>means the algorithm looked at a crucial piece of information

176
00:08:15.920 --> 00:08:19.199
<v Speaker 1>and mistakenly decided to delete it. A rate of point

177
00:08:19.279 --> 00:08:23.560
<v Speaker 1>one four means it is almost never deleting vital information exactly.

178
00:08:24.000 --> 00:08:27.480
<v Speaker 2>For context, they compared it to baseline models like IKTWA,

179
00:08:27.920 --> 00:08:30.680
<v Speaker 2>which only hit a seventy eight point three six percent

180
00:08:30.959 --> 00:08:31.480
<v Speaker 2>F measure.

181
00:08:31.560 --> 00:08:34.159
<v Speaker 1>If I'm relying on an AI to summarize a massive

182
00:08:34.240 --> 00:08:37.480
<v Speaker 1>legal contract or a dense medical history, I need to

183
00:08:37.559 --> 00:08:41.200
<v Speaker 1>know it didn't accidentally delete the hidden fee clause or

184
00:08:41.240 --> 00:08:42.559
<v Speaker 1>the patient's drug allergy.

185
00:08:42.720 --> 00:08:43.320
<v Speaker 2>It's vital.

186
00:08:43.440 --> 00:08:47.200
<v Speaker 1>This math actually provides that confidence. It's so efficient. This

187
00:08:47.320 --> 00:08:49.960
<v Speaker 1>directly benefits anyone trying to learn faster.

188
00:08:50.159 --> 00:08:52.919
<v Speaker 2>It is a massive leap forward. It shows that by

189
00:08:52.919 --> 00:08:56.759
<v Speaker 2>mapping the ontology of words, computers can finally move past

190
00:08:56.799 --> 00:08:59.639
<v Speaker 2>just counting text to actually distilling human thought.

191
00:09:00.000 --> 00:09:03.519
<v Speaker 1>If we can use statistical thresholds to filter out useless sentences,

192
00:09:03.879 --> 00:09:06.399
<v Speaker 1>can we use that exact same logic to filter out

193
00:09:06.480 --> 00:09:07.919
<v Speaker 1>useless noise in a medical scan.

194
00:09:08.120 --> 00:09:09.720
<v Speaker 2>That is exactly what we're looking at.

195
00:09:09.759 --> 00:09:13.080
<v Speaker 1>Next, we're moving from processing human language to processing human

196
00:09:13.080 --> 00:09:17.279
<v Speaker 1>biology because data science isn't just about reading text faster, right,

197
00:09:17.639 --> 00:09:19.600
<v Speaker 1>It's about seeing what the human eye misses.

198
00:09:19.799 --> 00:09:23.840
<v Speaker 2>It is the transition from semantic analysis to geometric analysis

199
00:09:24.279 --> 00:09:26.519
<v Speaker 2>using the exact same underlying.

200
00:09:26.039 --> 00:09:30.480
<v Speaker 1>Principle extracting vital features from a sea of noise. This

201
00:09:30.559 --> 00:09:33.720
<v Speaker 1>brings us to the second paper by Ali Siddam hasim

202
00:09:33.799 --> 00:09:37.840
<v Speaker 1>geda Way and Gamela Judah. They're detecting abnormal red blood

203
00:09:37.840 --> 00:09:43.039
<v Speaker 1>cells or RBCs using morphology and rotation. Now, why is

204
00:09:43.080 --> 00:09:45.399
<v Speaker 1>this a problem that needs a data science solution?

205
00:09:45.960 --> 00:09:49.840
<v Speaker 2>Because the steaks are incredibly high for conditions like hemolytic anemia,

206
00:09:50.320 --> 00:09:52.559
<v Speaker 2>with sickle cell anemia being a prime example.

207
00:09:52.639 --> 00:09:53.639
<v Speaker 1>Okay, hemoltic anemia.

208
00:09:53.799 --> 00:09:56.919
<v Speaker 2>Yeah, in a healthy person, red blood cells are perfectly

209
00:09:56.960 --> 00:10:01.120
<v Speaker 2>circular and flexible, but genetic abnormality can cause these cells

210
00:10:01.120 --> 00:10:05.639
<v Speaker 2>to become deformed. They get all misshapen, right, They turn elliptical, rectangular,

211
00:10:05.720 --> 00:10:07.480
<v Speaker 2>or sickle shaped like a crescent moon.

212
00:10:07.639 --> 00:10:11.360
<v Speaker 1>And because of that elongated jagged shape, they become rigid.

213
00:10:11.399 --> 00:10:13.759
<v Speaker 1>They get stuck in blood vessels, and they rupture easily

214
00:10:13.799 --> 00:10:15.320
<v Speaker 1>as they pass through our capillaries.

215
00:10:15.440 --> 00:10:18.399
<v Speaker 2>Right now, The traditional way to diagnose this involves a

216
00:10:18.440 --> 00:10:21.240
<v Speaker 2>highly trained hematology technicians sitting at a.

217
00:10:21.159 --> 00:10:23.360
<v Speaker 1>Microscope, staring through the lens all day.

218
00:10:23.559 --> 00:10:27.480
<v Speaker 2>Manually examining a glass slide smeared with blood and looking

219
00:10:27.480 --> 00:10:31.120
<v Speaker 2>for these deformed cells among hundreds or thousands of normal ones.

220
00:10:31.200 --> 00:10:32.159
<v Speaker 1>That sounds exhausting.

221
00:10:32.480 --> 00:10:36.159
<v Speaker 2>It is tedious, it is painstakingly slow, and it is

222
00:10:36.279 --> 00:10:39.799
<v Speaker 2>highly prone to fatigue. You are relying entirely on a

223
00:10:39.840 --> 00:10:41.000
<v Speaker 2>tired human eye.

224
00:10:41.080 --> 00:10:44.600
<v Speaker 1>So Sudiam and his team built an automated solution. Yes

225
00:10:44.639 --> 00:10:47.120
<v Speaker 1>they did, And what fascinated me is how they prep

226
00:10:47.200 --> 00:10:50.720
<v Speaker 1>the image before the computer even looks for the cells.

227
00:10:51.279 --> 00:10:52.679
<v Speaker 2>The preprocessing is key.

228
00:10:52.759 --> 00:10:55.600
<v Speaker 1>They have to find the region of interest or ROI.

229
00:10:56.080 --> 00:10:59.759
<v Speaker 1>They take the standard grayscale microscope image and essentially peel

230
00:10:59.759 --> 00:11:02.240
<v Speaker 1>it up heart by converting it into pure black and

231
00:11:02.279 --> 00:11:03.480
<v Speaker 1>white binary images.

232
00:11:03.559 --> 00:11:05.200
<v Speaker 2>But they don't just do it once.

233
00:11:05.159 --> 00:11:10.039
<v Speaker 1>Right, They process it at very specific intensity thresholds like

234
00:11:10.440 --> 00:11:15.240
<v Speaker 1>sixty seventy eighty ninety one hundred correct. Why those specific numbers?

235
00:11:15.320 --> 00:11:17.240
<v Speaker 1>Why not just make the dark stuff black and the

236
00:11:17.320 --> 00:11:18.080
<v Speaker 1>light stuff white?

237
00:11:18.320 --> 00:11:21.159
<v Speaker 2>Well, because a blood smear is messy. Lighting under a

238
00:11:21.200 --> 00:11:24.799
<v Speaker 2>microscope isn't perfectly even some cells overlap, some are faded.

239
00:11:24.919 --> 00:11:27.080
<v Speaker 1>Oh, so it's not a uniform image, not at all.

240
00:11:27.120 --> 00:11:30.399
<v Speaker 2>By running multiple thresholds, the algorithm is essentially adjusting the

241
00:11:30.440 --> 00:11:31.759
<v Speaker 2>exposure step by step.

242
00:11:31.840 --> 00:11:34.000
<v Speaker 1>Oh like changing the settings on a camera exactly.

243
00:11:34.039 --> 00:11:36.840
<v Speaker 2>It's finding the optimal contrast where the true edge of

244
00:11:36.879 --> 00:11:38.879
<v Speaker 2>the cell separates from the background fluid.

245
00:11:39.120 --> 00:11:42.240
<v Speaker 1>And as it creates these binary images, it runs a

246
00:11:42.279 --> 00:11:45.799
<v Speaker 1>cleaning protocol. Yes, anything that shows up as a smooth

247
00:11:45.879 --> 00:11:49.600
<v Speaker 1>region smaller than one hundred pixels is instantly deleted.

248
00:11:49.679 --> 00:11:51.879
<v Speaker 2>It mathematically decides this is too small to be a

249
00:11:51.919 --> 00:11:53.679
<v Speaker 2>red blood cell. It must be a speck of dust

250
00:11:53.759 --> 00:11:55.000
<v Speaker 2>or an artifact on the glass.

251
00:11:55.440 --> 00:11:59.320
<v Speaker 1>This leaves the algorithm with a clean map of distinct objects. Right,

252
00:11:59.399 --> 00:12:02.159
<v Speaker 1>but an object, it's just a blob of pixels. Yeah,

253
00:12:02.200 --> 00:12:04.559
<v Speaker 1>How does the computer know if it's a normal circle

254
00:12:04.759 --> 00:12:06.320
<v Speaker 1>or an abnormal sickle cell.

255
00:12:06.480 --> 00:12:07.559
<v Speaker 2>That's the real challenge.

256
00:12:07.639 --> 00:12:09.399
<v Speaker 1>Here's where it gets really interesting. I was looking at

257
00:12:09.399 --> 00:12:13.440
<v Speaker 1>the paper and it explains that once the algorithm isolates

258
00:12:13.440 --> 00:12:16.919
<v Speaker 1>a cell, it actually rotates the image of that cell

259
00:12:16.960 --> 00:12:20.879
<v Speaker 1>by ten twenty thirty and forty degrees counterclockwise.

260
00:12:21.000 --> 00:12:22.360
<v Speaker 2>Yes, it spins the image.

261
00:12:22.399 --> 00:12:25.360
<v Speaker 1>And my first thought was, wait, why does the algorithm

262
00:12:25.399 --> 00:12:27.279
<v Speaker 1>then rotate the images? Why not just look at the

263
00:12:27.320 --> 00:12:30.000
<v Speaker 1>cell as it is. If a sickle cell is shaped

264
00:12:30.039 --> 00:12:33.159
<v Speaker 1>like a crescent moon, rotating it on a slide doesn't

265
00:12:33.200 --> 00:12:36.320
<v Speaker 1>magically turn it into a circle. Why does the computer

266
00:12:36.440 --> 00:12:37.720
<v Speaker 1>care what angle it's sitting at.

267
00:12:37.759 --> 00:12:40.399
<v Speaker 2>It's a great question. It's because computers don't see shapes

268
00:12:40.440 --> 00:12:42.720
<v Speaker 2>the way human eyes do. What do you mean They

269
00:12:42.720 --> 00:12:45.919
<v Speaker 2>don't look at a cluster of pixels and instantly recognize

270
00:12:45.919 --> 00:12:49.519
<v Speaker 2>a crescent. They understand geometry through bounding boxes.

271
00:12:49.600 --> 00:12:50.519
<v Speaker 1>It's owning boxes.

272
00:12:50.639 --> 00:12:53.600
<v Speaker 2>Yeah. Think of a bounding box as drawing a strict

273
00:12:53.639 --> 00:12:57.519
<v Speaker 2>square or rectangle around the absolute furthest edges of the object.

274
00:12:57.559 --> 00:12:59.799
<v Speaker 1>Okay, I'm visualizing drawing a tight box.

275
00:12:59.600 --> 00:13:02.799
<v Speaker 2>Around a sus and that box is aligned perfectly with

276
00:13:02.879 --> 00:13:06.799
<v Speaker 2>a horizontal x axis and a vertical y axis. Right now,

277
00:13:06.879 --> 00:13:10.440
<v Speaker 2>think about how blood is smeared onto a slide. The

278
00:13:10.559 --> 00:13:13.360
<v Speaker 2>cells land completely randomly.

279
00:13:13.120 --> 00:13:14.879
<v Speaker 1>Just splattered everywhere right.

280
00:13:14.759 --> 00:13:19.240
<v Speaker 2>They are oriented at all possible chaotic angles. A normal,

281
00:13:19.360 --> 00:13:20.879
<v Speaker 2>healthy red blood cell is.

282
00:13:20.879 --> 00:13:23.759
<v Speaker 1>Circular, so it's the same in every direction exactly.

283
00:13:23.960 --> 00:13:26.519
<v Speaker 2>Its height and its width are roughly equaled no matter

284
00:13:26.559 --> 00:13:28.519
<v Speaker 2>how you spin it inside that bounding box.

285
00:13:28.720 --> 00:13:31.480
<v Speaker 1>But a sickle cell has a distinct long axis in

286
00:13:31.519 --> 00:13:34.720
<v Speaker 1>a short axis. Ah So, if an elongated sickle cell

287
00:13:34.759 --> 00:13:37.720
<v Speaker 1>happens to land diagonally on the slide and the computer

288
00:13:37.840 --> 00:13:40.240
<v Speaker 1>draws a straight up and down bounding box around it,

289
00:13:40.759 --> 00:13:43.799
<v Speaker 1>the box has to stretch out horizontally and vertically to

290
00:13:43.879 --> 00:13:45.879
<v Speaker 1>capture the diagonal corners exactly.

291
00:13:45.960 --> 00:13:48.639
<v Speaker 2>If it lands diagonally, the bounding box might actually look

292
00:13:48.639 --> 00:13:49.519
<v Speaker 2>perfectly square.

293
00:13:49.879 --> 00:13:51.960
<v Speaker 1>Oh well, I wouldn't have thought of that.

294
00:13:52.159 --> 00:13:55.559
<v Speaker 2>The computer won't capture the cell's true maximum length versus

295
00:13:55.559 --> 00:13:58.759
<v Speaker 2>its true minimum width. It'll just see a big square box.

296
00:13:59.360 --> 00:14:02.240
<v Speaker 1>So by rote hitting the object by ten, twenty thirty

297
00:14:02.279 --> 00:14:05.679
<v Speaker 1>and forty degrees, the algorithm forces the cell into alignment

298
00:14:06.080 --> 00:14:07.519
<v Speaker 1>with the x and y axis.

299
00:14:07.840 --> 00:14:11.600
<v Speaker 2>Yes, it tests different angles until it finds the orientation

300
00:14:11.720 --> 00:14:15.240
<v Speaker 2>where the bounding box is stretched to its absolute maximum limit.

301
00:14:15.840 --> 00:14:19.600
<v Speaker 1>That is so incredibly clever. It's essentially testing the geometry

302
00:14:19.600 --> 00:14:23.480
<v Speaker 1>at every angle to find the cell's true stretched out shape,

303
00:14:23.679 --> 00:14:24.440
<v Speaker 1>and the math.

304
00:14:24.240 --> 00:14:27.639
<v Speaker 2>They use to flag it as diseased is so beautifully simple.

305
00:14:27.759 --> 00:14:29.639
<v Speaker 1>It really is just basic subtraction. Right.

306
00:14:29.720 --> 00:14:32.720
<v Speaker 2>Yeah, they calculate the difference between the height and width

307
00:14:32.759 --> 00:14:33.879
<v Speaker 2>of that bounding box.

308
00:14:33.919 --> 00:14:34.960
<v Speaker 1>They call it delta, right.

309
00:14:35.039 --> 00:14:38.559
<v Speaker 2>Delta equals the absolute value of height minus width. If

310
00:14:38.600 --> 00:14:41.279
<v Speaker 2>the minimum difference they find during all those rotations is

311
00:14:41.320 --> 00:14:42.919
<v Speaker 2>greater than seven pixels, and.

312
00:14:42.960 --> 00:14:46.000
<v Speaker 1>The cell's total area falls within the biological norm of

313
00:14:46.039 --> 00:14:48.559
<v Speaker 1>four hundred and fifty to one thousand pixels.

314
00:14:48.159 --> 00:14:50.919
<v Speaker 2>Then the algorithm officially flags that cell is abnormal.

315
00:14:51.039 --> 00:14:53.720
<v Speaker 1>That's it. That's simple, yep. If the height and width

316
00:14:53.759 --> 00:14:56.639
<v Speaker 1>remain relatively equal, so a delta of less than seven,

317
00:14:57.320 --> 00:14:59.840
<v Speaker 1>it's classified as a normal, healthy cell.

318
00:15:00.120 --> 00:15:03.399
<v Speaker 2>And the results from this geometric approach, they tested it

319
00:15:03.440 --> 00:15:07.279
<v Speaker 2>on a data set of forty real blood smear images

320
00:15:07.759 --> 00:15:09.240
<v Speaker 2>from the erythrocytes IDB.

321
00:15:09.440 --> 00:15:11.759
<v Speaker 1>This is a very solid test set. Yeah.

322
00:15:11.799 --> 00:15:14.519
<v Speaker 2>It achieved an eighty six percent detection rate with only

323
00:15:14.559 --> 00:15:16.720
<v Speaker 2>a fourteen percent false alarm rate.

324
00:15:16.919 --> 00:15:18.639
<v Speaker 1>That's incredibly promising.

325
00:15:18.759 --> 00:15:22.200
<v Speaker 2>We are talking about taking a diagnostic process that usually

326
00:15:22.240 --> 00:15:26.279
<v Speaker 2>requires a highly trained hematologist in a specialized lab and

327
00:15:26.480 --> 00:15:29.600
<v Speaker 2>codifying it into an automated algorithm that could run on

328
00:15:29.639 --> 00:15:31.559
<v Speaker 2>a basic computer in a remote clinic.

329
00:15:31.679 --> 00:15:34.399
<v Speaker 1>It's the democratization of diagnostics. It really is.

330
00:15:34.519 --> 00:15:36.639
<v Speaker 2>Okay, so we have an AI that can read and

331
00:15:36.639 --> 00:15:40.000
<v Speaker 2>summarize complex documents like a genius, and another AI that

332
00:15:40.039 --> 00:15:43.679
<v Speaker 2>can tirelessly diagnose our blood based on rotating bounding boxes.

333
00:15:43.759 --> 00:15:45.360
<v Speaker 1>Two massive leaps forward.

334
00:15:45.519 --> 00:15:49.080
<v Speaker 2>But both of these incredible tools an algorithm that digests

335
00:15:49.080 --> 00:15:51.879
<v Speaker 2>our personal documents in a system that diagnoses our blood

336
00:15:51.919 --> 00:15:53.480
<v Speaker 2>share a massive vulnerability.

337
00:15:53.600 --> 00:15:54.440
<v Speaker 1>They absolutely do.

338
00:15:54.600 --> 00:15:57.480
<v Speaker 2>They are completely reliant on digesting massive amounts of data,

339
00:15:57.600 --> 00:15:59.759
<v Speaker 2>which brings up the elephant in the room data hunger.

340
00:15:59.799 --> 00:16:01.600
<v Speaker 1>Who who owns this data and how do we keep

341
00:16:01.600 --> 00:16:04.799
<v Speaker 1>it safe? Ginchus throw all of our most intimate data

342
00:16:04.840 --> 00:16:08.320
<v Speaker 1>onto a giant, centralized public server so an algorithm can

343
00:16:08.360 --> 00:16:11.039
<v Speaker 1>practice on it. No, we really can't, which brings us

344
00:16:11.080 --> 00:16:14.519
<v Speaker 1>to the third paper, a systematic review by Kapelle Tawari,

345
00:16:14.960 --> 00:16:19.600
<v Speaker 1>Semi Shashukla, and JOSSP. George on privacy preserving machine learning

346
00:16:20.000 --> 00:16:20.960
<v Speaker 1>or PPML.

347
00:16:21.279 --> 00:16:24.720
<v Speaker 2>What's fascinating here is that this addresses the central paradox

348
00:16:24.879 --> 00:16:26.639
<v Speaker 2>of modern artificial intelligence.

349
00:16:26.720 --> 00:16:28.120
<v Speaker 1>The paradox yeah.

350
00:16:28.320 --> 00:16:32.519
<v Speaker 2>ML's effectiveness relies entirely on the amount, distribution, and variety

351
00:16:32.559 --> 00:16:33.480
<v Speaker 2>of training data.

352
00:16:33.639 --> 00:16:35.559
<v Speaker 1>Right, it needs to see a lot of examples to.

353
00:16:35.559 --> 00:16:38.960
<v Speaker 2>Learn exactly, and AI trained only on data from one

354
00:16:39.000 --> 00:16:42.440
<v Speaker 2>hospital in London won't be very accurate at diagnosing patients

355
00:16:42.440 --> 00:16:45.480
<v Speaker 2>in a rural clinic in India. It needs diverse data

356
00:16:45.519 --> 00:16:46.360
<v Speaker 2>to avoid bias.

357
00:16:46.519 --> 00:16:49.440
<v Speaker 1>But getting that data from multiple diverse sources is a

358
00:16:49.519 --> 00:16:54.879
<v Speaker 1>nightmare because of privacy concerns, security threats, data sovereignty laws.

359
00:16:54.759 --> 00:16:58.440
<v Speaker 2>IPI in the US, GDPR in Europe. A hospital in

360
00:16:58.440 --> 00:17:01.519
<v Speaker 2>London legally cannot and over its patient files to a

361
00:17:01.559 --> 00:17:03.399
<v Speaker 2>tech company in Silicon Valley, and.

362
00:17:03.320 --> 00:17:05.640
<v Speaker 1>Companies want to protect their competitive advantages too.

363
00:17:06.000 --> 00:17:08.440
<v Speaker 2>Exactly, even if it were legal, they wouldn't want to share.

364
00:17:08.960 --> 00:17:13.160
<v Speaker 2>So we face a roadblock. We have these algorithms, but

365
00:17:13.400 --> 00:17:16.480
<v Speaker 2>the data is locked away in disconnected silos.

366
00:17:16.599 --> 00:17:19.240
<v Speaker 1>So what does this all mean. We're basically stuck between

367
00:17:20.000 --> 00:17:24.400
<v Speaker 1>wanting these life saving, time saving AI tools and not

368
00:17:24.519 --> 00:17:27.720
<v Speaker 1>wanting to hand over our private medical records or personal

369
00:17:27.759 --> 00:17:29.480
<v Speaker 1>notes to a giant central server.

370
00:17:29.720 --> 00:17:32.920
<v Speaker 2>Yeah, we are stuck. But PPML is the crucial bridge here. Okay,

371
00:17:33.000 --> 00:17:36.039
<v Speaker 2>it's an emerging suite of techniques designed to aggregate data,

372
00:17:36.359 --> 00:17:40.000
<v Speaker 2>train models, and serve inferences without ever actually exposing the

373
00:17:40.079 --> 00:17:41.799
<v Speaker 2>underlying raw data.

374
00:17:41.880 --> 00:17:45.160
<v Speaker 1>So it's the silent security guard making the tech summarization

375
00:17:45.240 --> 00:17:48.200
<v Speaker 1>and the medical imaging possible in the real world, But

376
00:17:48.240 --> 00:17:50.960
<v Speaker 1>how does it actually work? A security guard just stops

377
00:17:50.960 --> 00:17:51.720
<v Speaker 1>people at the door.

378
00:17:51.799 --> 00:17:55.440
<v Speaker 2>Well, PPML isn't just one algorithm, it's an entire suite

379
00:17:55.480 --> 00:17:59.200
<v Speaker 2>of cryptographic and statistical techniques. Likewise, one of the most

380
00:17:59.279 --> 00:18:02.839
<v Speaker 2>powerful mechanisms they review in this paper is called federated learning.

381
00:18:03.079 --> 00:18:04.880
<v Speaker 1>Okay, walk me through federated learning.

382
00:18:05.079 --> 00:18:07.279
<v Speaker 2>In traditional AI, you take all the data from all

383
00:18:07.279 --> 00:18:09.799
<v Speaker 2>over the world, move it to one giant central server,

384
00:18:10.039 --> 00:18:13.480
<v Speaker 2>and train the model there. Federated learning completely reverses that

385
00:18:13.599 --> 00:18:16.880
<v Speaker 2>architecture reverses it. Yeah, instead of moving the data to

386
00:18:16.960 --> 00:18:18.960
<v Speaker 2>the model, we move the model to the data.

387
00:18:19.119 --> 00:18:21.799
<v Speaker 1>Oh wow, It's like asking a thousand hospitals to bake

388
00:18:21.839 --> 00:18:25.319
<v Speaker 1>a cake together. You want the ultimate perfect recipe, but

389
00:18:25.359 --> 00:18:29.039
<v Speaker 1>you aren't allowed to know whose kitchen provided the eggs,

390
00:18:29.240 --> 00:18:31.960
<v Speaker 1>or who sifted the flour, or what their kitchens even

391
00:18:32.000 --> 00:18:32.400
<v Speaker 1>look like.

392
00:18:32.559 --> 00:18:36.440
<v Speaker 2>That is a brilliant way to conceptualize it. In federated learning,

393
00:18:36.480 --> 00:18:39.799
<v Speaker 2>a central server sends a blank, untrained copy of the

394
00:18:39.839 --> 00:18:45.119
<v Speaker 2>AI model out to thousands of local hospitals or say smartphones. Okay,

395
00:18:45.440 --> 00:18:48.519
<v Speaker 2>the model trains locally on that private data behind the

396
00:18:48.559 --> 00:18:53.000
<v Speaker 2>hospital's own firewalls. The raw data never ever leaves the building.

397
00:18:53.160 --> 00:18:55.680
<v Speaker 1>Wait, if the data never leaves, how does the central

398
00:18:55.720 --> 00:18:56.680
<v Speaker 1>AI get any smarter?

399
00:18:57.000 --> 00:18:59.599
<v Speaker 2>Because the local model doesn't send back the medical records,

400
00:18:59.640 --> 00:19:02.119
<v Speaker 2>It only sends back the math. The math, Yeah, it

401
00:19:02.200 --> 00:19:05.640
<v Speaker 2>sends back the updated weights and biases the microscopic adjustments

402
00:19:05.640 --> 00:19:08.279
<v Speaker 2>it made to its own internal logic. Oh I say

403
00:19:08.319 --> 00:19:12.279
<v Speaker 2>the central server collects these mathematical tweaks from thousands of hospitals,

404
00:19:12.759 --> 00:19:16.440
<v Speaker 2>averages them out, and creates a master model that has

405
00:19:16.559 --> 00:19:19.359
<v Speaker 2>learned the patterns of the disease without ever seeing a

406
00:19:19.400 --> 00:19:21.359
<v Speaker 2>single patient's name or scan.

407
00:19:21.680 --> 00:19:25.680
<v Speaker 1>That is wild. It learns the lesson, but immediately forgets

408
00:19:25.720 --> 00:19:29.200
<v Speaker 1>the teacher exactly. But what if a hacker intercepts those

409
00:19:29.240 --> 00:19:32.599
<v Speaker 1>mathematical tweaks. Couldn't they reverse engineer them to figure out

410
00:19:32.599 --> 00:19:33.920
<v Speaker 1>the original patient data.

411
00:19:34.319 --> 00:19:37.279
<v Speaker 2>That is a real risk, which is why federated learning

412
00:19:37.359 --> 00:19:40.200
<v Speaker 2>is often paired with another PPML technique reviewed in the

413
00:19:40.240 --> 00:19:42.000
<v Speaker 2>paper differential privacy.

414
00:19:42.119 --> 00:19:43.720
<v Speaker 1>How does differential privacy work?

415
00:19:44.039 --> 00:19:47.839
<v Speaker 2>It works by intentionally injecting mathematical noise into the data

416
00:19:48.160 --> 00:19:49.519
<v Speaker 2>before it's even analyzed.

417
00:19:49.559 --> 00:19:52.240
<v Speaker 1>Hold on, my brain is stuck again. If you inject

418
00:19:52.279 --> 00:19:56.279
<v Speaker 1>noise into a medical diagnostic tool, don't you ruin the accuracy?

419
00:19:56.319 --> 00:19:58.960
<v Speaker 1>Why would you purposefully make the data blurrier.

420
00:19:58.599 --> 00:20:03.119
<v Speaker 2>Because it's a very specific, carefully calculated statistical.

421
00:20:02.400 --> 00:20:03.880
<v Speaker 1>Noise, like static on a TV.

422
00:20:04.200 --> 00:20:06.559
<v Speaker 2>Sort of. Think of it like taking a photograph of

423
00:20:06.599 --> 00:20:09.440
<v Speaker 2>a massive crowd in a stadium. If you apply a

424
00:20:09.519 --> 00:20:13.799
<v Speaker 2>specific blurring filter, you can completely obscure the individual faces

425
00:20:13.839 --> 00:20:15.480
<v Speaker 2>of every single person.

426
00:20:15.200 --> 00:20:15.720
<v Speaker 1>In the crowd.

427
00:20:16.200 --> 00:20:19.359
<v Speaker 2>No one can be identified. Right, there's blobs, but you

428
00:20:19.400 --> 00:20:22.200
<v Speaker 2>can still easily tell what color jerseys the crowd is wearing,

429
00:20:22.480 --> 00:20:24.640
<v Speaker 2>or which section of the stadium is the most crowded.

430
00:20:25.119 --> 00:20:29.359
<v Speaker 1>Oh okay, you destroy the micro details to protect the individual,

431
00:20:29.720 --> 00:20:32.640
<v Speaker 1>but you preserve the macro trends so the AI can

432
00:20:32.680 --> 00:20:34.559
<v Speaker 1>still learn the big picture precisely.

433
00:20:35.119 --> 00:20:38.640
<v Speaker 2>Differential privacy ensures that the presence or absence of any

434
00:20:38.680 --> 00:20:42.519
<v Speaker 2>single individual's data in the training set does not significantly

435
00:20:42.519 --> 00:20:44.400
<v Speaker 2>affect the final output of the model.

436
00:20:44.519 --> 00:20:49.880
<v Speaker 1>Wow. By combining federated learning and differential privacy, these researchers

437
00:20:49.880 --> 00:20:52.359
<v Speaker 1>are building a world where an algorithm can master human

438
00:20:52.440 --> 00:20:56.759
<v Speaker 1>language and diagnose human biology without causing a privacy apocalypse.

439
00:20:56.799 --> 00:20:58.240
<v Speaker 2>It's the only way forward. Really.

440
00:20:58.480 --> 00:21:01.200
<v Speaker 1>Okay, we've covered some serious intellectual ground today on this

441
00:21:01.319 --> 00:21:04.240
<v Speaker 1>deep dive. Let's recap this journey for you, the learner.

442
00:21:04.640 --> 00:21:08.880
<v Speaker 1>We started by exploring how data science is conquering information overload.

443
00:21:09.519 --> 00:21:12.799
<v Speaker 1>Algorithms are moving past blindly counting words right.

444
00:21:12.799 --> 00:21:16.279
<v Speaker 2>Using wiki data to build semantic maps exactly.

445
00:21:16.119 --> 00:21:20.920
<v Speaker 1>And by applying stripped statistical thresholds like cross entropy and anova,

446
00:21:21.359 --> 00:21:25.160
<v Speaker 1>they can mathematically prove which sentences carry the most meaning,

447
00:21:25.519 --> 00:21:29.119
<v Speaker 1>summarizing dense text with over eighty eight percent accuracy.

448
00:21:29.519 --> 00:21:33.039
<v Speaker 2>And then we saw those same principles of feature extraction

449
00:21:33.240 --> 00:21:35.240
<v Speaker 2>applied to the microscopic.

450
00:21:34.640 --> 00:21:36.079
<v Speaker 1>World, right the blood cells.

451
00:21:36.240 --> 00:21:39.480
<v Speaker 2>We learned how algorithms use binary contrast thresholds and the

452
00:21:39.559 --> 00:21:44.279
<v Speaker 2>rotating geometry of bounding boxes to catch the elongated, dangerous

453
00:21:44.319 --> 00:21:47.000
<v Speaker 2>shapes of sickle cells that a tired human eye might

454
00:21:47.079 --> 00:21:47.880
<v Speaker 2>easily miss.

455
00:21:48.359 --> 00:21:51.799
<v Speaker 1>And finally, we address the necessary shield for all this innovation.

456
00:21:52.599 --> 00:21:56.000
<v Speaker 1>We explored how privacy preserving techniques like federated learning and

457
00:21:56.039 --> 00:21:59.599
<v Speaker 1>differential privacy allow these algorithms to travel to the data, they.

458
00:21:59.559 --> 00:22:02.559
<v Speaker 2>Learn the mapp madical lessons and blur out the faces.

459
00:22:02.240 --> 00:22:04.720
<v Speaker 1>Ensuring they can learn from our most intimate documents and

460
00:22:04.799 --> 00:22:07.400
<v Speaker 1>our biology without actually spying on us.

461
00:22:07.680 --> 00:22:10.319
<v Speaker 2>It all comes back to extracting the signal from the noise,

462
00:22:10.359 --> 00:22:14.079
<v Speaker 2>whether that noise is redundant text, background, cellular artifacts, or

463
00:22:14.119 --> 00:22:16.400
<v Speaker 2>the logistical nightmare of locked data silos.

464
00:22:16.480 --> 00:22:18.400
<v Speaker 1>It really does. It's all incredibly connected.

465
00:22:18.640 --> 00:22:20.680
<v Speaker 2>And if I can leave you with one final thought,

466
00:22:21.480 --> 00:22:25.279
<v Speaker 2>think about where these three streams of technology inevitably converge.

467
00:22:25.720 --> 00:22:29.200
<v Speaker 2>If data science can perfectly summarize the complexities of human

468
00:22:29.240 --> 00:22:33.039
<v Speaker 2>thought and diagnose our diseases just by measuring the geometry

469
00:22:33.039 --> 00:22:37.799
<v Speaker 2>of ourselves, what happens when privacy preserving machine learning allows

470
00:22:37.839 --> 00:22:41.000
<v Speaker 2>these systems to securely cross reference both on a global scale.

471
00:22:41.079 --> 00:22:43.839
<v Speaker 2>Oh wow, Well, the ultimate a of the future know

472
00:22:43.920 --> 00:22:46.400
<v Speaker 2>the intricacies of our minds and our bodies better than

473
00:22:46.440 --> 00:22:49.680
<v Speaker 2>we know ourselves, all while never actually knowing our names.

474
00:22:49.839 --> 00:22:52.880
<v Speaker 1>Wow. That is a massive thought to chew on. You

475
00:22:52.960 --> 00:22:55.279
<v Speaker 1>came here today to connect the dots and get well informed,

476
00:22:55.279 --> 00:22:57.200
<v Speaker 1>and I think we definitely hit the mark. Thank you

477
00:22:57.279 --> 00:22:59.480
<v Speaker 1>so much for joining us on this deep dive. Until

478
00:22:59.480 --> 00:23:01.960
<v Speaker 1>next time, keep questioning, keep learning, and we'll see you

479
00:23:01.960 --> 00:23:02.519
<v Speaker 1>on the next one.
