WEBVTT 1 00:00:00.080 --> 00:00:02.200 Welcome back to the deep dive, where we take a 2 00:00:02.240 --> 00:00:06.200 stack of information and transform it into instant expertise tailored 3 00:00:06.280 --> 00:00:09.599 just for you. Today, we're plunging into well, the very 4 00:00:09.599 --> 00:00:13.519 bedrock of computer science, data structures and algorithms. 5 00:00:13.599 --> 00:00:16.359 Yeah, it's a foundational topic. We're drawing our insights from 6 00:00:16.519 --> 00:00:19.879 Ruby data structures and Algorithms, which is a really comprehensive resource. 7 00:00:20.079 --> 00:00:22.480 It helps us understand not just what these concepts are, 8 00:00:22.519 --> 00:00:25.480 but really why they're so critical to everything you interact 9 00:00:25.519 --> 00:00:29.559 with in the digital world. So our mission today is 10 00:00:29.600 --> 00:00:33.320 to kind of distill the core principles, maybe reveal some 11 00:00:33.359 --> 00:00:37.079 surprising implications of different design choices, and also highlight how 12 00:00:37.159 --> 00:00:41.840 Ruby's unique philosophy shapes these essential building blocks of code. 13 00:00:42.039 --> 00:00:44.960 Think of this as pulling back the curtain on the powerful, 14 00:00:45.000 --> 00:00:48.159 invisible machinery that makes your favorite apps tick. You'll hopefully 15 00:00:48.280 --> 00:00:52.399 uncover insights that might change how you think about efficiency, flexibility, 16 00:00:52.799 --> 00:00:56.880 and even the subtle art of crafting robust software. Okay, 17 00:00:56.920 --> 00:00:59.840 let's start at the absolute beginning foundational ideas. We hear 18 00:00:59.840 --> 00:01:03.280 a lot about abstract data types or ADTs. What's the 19 00:01:03.920 --> 00:01:05.319 core concept behind an ADT? 20 00:01:05.799 --> 00:01:09.400 Right? Well, and ADT is essentially the pure abstract idea 21 00:01:09.439 --> 00:01:12.120 of what a data structure should do. It's completely separate 22 00:01:12.120 --> 00:01:13.719 from how it's implemented on a computer. 23 00:01:14.239 --> 00:01:14.400 Right. 24 00:01:14.680 --> 00:01:16.879 It defines a set of values could be all the 25 00:01:16.920 --> 00:01:20.319 integers or maybe just true and false, and the operations 26 00:01:20.319 --> 00:01:23.480 you can perform on them, so I think addition, subtraction, 27 00:01:23.840 --> 00:01:25.959 or logical negation, stuff like that. 28 00:01:26.079 --> 00:01:26.640 Oh okay. 29 00:01:26.760 --> 00:01:30.319 The real insight here, I think is this principle you 30 00:01:30.400 --> 00:01:33.400 define the behavior first, focusing on what you need to 31 00:01:33.480 --> 00:01:37.000 achieve before you even consider the specific coding details. 32 00:01:37.359 --> 00:01:40.439 So it's almost like setting up a conceptual contract for 33 00:01:40.519 --> 00:01:42.359 what your data will be and how it will behave 34 00:01:42.680 --> 00:01:45.079 even before you write a single line of code precisely. 35 00:01:45.159 --> 00:01:47.760 Yeah, that's a great analogy. And then once that conceptual 36 00:01:47.799 --> 00:01:50.879 ADT is actually built in a programming language, well that's 37 00:01:50.879 --> 00:01:52.120 when we call it a data type. 38 00:01:52.159 --> 00:01:52.719 Gotcha. 39 00:01:52.799 --> 00:01:56.159 So, for example, the abstract integer ADT becomes the concrete 40 00:01:56.480 --> 00:01:59.280 type in Java or maybe the integer class in Ruby. 41 00:01:59.840 --> 00:02:02.719 And the specific way we arrange that data in memory 42 00:02:03.519 --> 00:02:06.200 to represent those values, that's our data structure. 43 00:02:06.319 --> 00:02:08.400 And the algorithms where do they fit into this picture? 44 00:02:08.599 --> 00:02:11.879 Ah, algorithms, they're the step by step instructions kind of 45 00:02:11.879 --> 00:02:15.000 like the recipes that bring an ADT's operations to life. 46 00:02:15.639 --> 00:02:18.360 They use the data structures you've chosen to perform those 47 00:02:18.360 --> 00:02:22.039 defined actions. So you see, data structures and algorithms are 48 00:02:22.080 --> 00:02:24.199 really two sides of the same coin. You can't really 49 00:02:24.199 --> 00:02:26.599 have one without the other, and together they let us 50 00:02:26.639 --> 00:02:30.319 implement these powerful abstract concepts in our programs. 51 00:02:30.599 --> 00:02:33.759 Okay, that makes sense. Now, as we build these systems, 52 00:02:33.919 --> 00:02:39.120 making sure they're correct is well. Paramount Our source emphasizes 53 00:02:39.199 --> 00:02:42.520 something called assertions. What role do they play? Right? 54 00:02:42.599 --> 00:02:46.560 Assertions? An assertion is simply a statement that must be 55 00:02:46.599 --> 00:02:49.960 true at a specific point in a program's execution. They 56 00:02:50.000 --> 00:02:53.719 serve two main purposes, one clearly documenting constraints for human 57 00:02:53.759 --> 00:02:57.800 developers reading the code, and two crucially checking those constraints 58 00:02:57.879 --> 00:03:01.319 during execution to catch faults, you know as early as possible. 59 00:03:01.560 --> 00:03:04.400 What are the most vital types of assertions? Developers tend 60 00:03:04.439 --> 00:03:05.000 to rely. 61 00:03:04.840 --> 00:03:08.800 On, good question. The most common and i'd say crucial 62 00:03:08.879 --> 00:03:13.479 types are preconditions and post conditions. A precondition make sure 63 00:03:13.479 --> 00:03:16.400 your inputs are valid before an operation even starts. 64 00:03:16.520 --> 00:03:19.319 Like making sure a square root function only gets a 65 00:03:19.360 --> 00:03:21.960 non negative number. You can't square root, a negative right, 66 00:03:22.280 --> 00:03:27.000 and a post condition verifies the outcome after the operation 67 00:03:27.120 --> 00:03:30.159 is done, confirming the result is what you actually expect it. 68 00:03:30.280 --> 00:03:30.599 Okay. 69 00:03:31.159 --> 00:03:34.360 There are also class invariants which maintain the integrity of 70 00:03:34.400 --> 00:03:38.520 an object between method calls, and some other specialized types too. 71 00:03:38.599 --> 00:03:42.680 But those core assertions, preconditions and post conditions, that's where 72 00:03:42.680 --> 00:03:44.439 a lot of your bug catching power lies. 73 00:03:44.639 --> 00:03:47.919 That sounds incredibly helpful for catching bugs early. But how 74 00:03:47.919 --> 00:03:50.479 does Ruby handle these if at all? Is it built in? 75 00:03:50.759 --> 00:03:53.400 Well, what's kind of fascinating about Ruby here is its approach. 76 00:03:54.000 --> 00:03:57.560 It provides no direct, built in support for assertions in 77 00:03:57.639 --> 00:03:58.599 the language itself. 78 00:03:59.039 --> 00:03:59.360 None. 79 00:03:59.599 --> 00:04:03.680 Nope. And what's more, Ruby's dynamic weekly type nature means 80 00:04:03.919 --> 00:04:07.039 it doesn't even enforce basic type checking for method parameters 81 00:04:07.120 --> 00:04:10.159 or return values at the language level. So those crucial 82 00:04:10.280 --> 00:04:14.560 type related pre and post conditions, they aren't automatically checked. Wow. 83 00:04:15.039 --> 00:04:18.199 So if you're a Ruby programmer, it sounds like the 84 00:04:18.360 --> 00:04:23.160 entire burden of checking assertions, of enforcing these constraints falls 85 00:04:23.199 --> 00:04:26.800 squarely on your shoulders. That freedom could certainly make certain 86 00:04:26.839 --> 00:04:29.600 types of bugs well much harder to find, couldn't it. 87 00:04:29.720 --> 00:04:33.160 Absolutely, it's a trade off for example, many unintended or 88 00:04:33.959 --> 00:04:37.680 erroneous array references might be considered legal by the Ruby interpreter. 89 00:04:38.120 --> 00:04:41.000 They might just return nil instead of raising an error 90 00:04:41.040 --> 00:04:43.839 like you'd see in a more strongly typed language. So yeah, 91 00:04:43.839 --> 00:04:47.360 it truly pushes the responsibility for robust error checking onto 92 00:04:47.360 --> 00:04:51.560 the developer using Ruby. Now, when we categorize data types 93 00:04:51.600 --> 00:04:54.160 and programming, they typically fall into two buckets. 94 00:04:54.240 --> 00:04:54.360 Right. 95 00:04:54.399 --> 00:04:58.480 You've got simple types like individual integers or booleans, things 96 00:04:58.519 --> 00:05:00.920 that can't really be broken down for there, and then 97 00:05:01.000 --> 00:05:03.879 structured types, which are composed of multiple parts, things like 98 00:05:04.000 --> 00:05:05.040 arrays or hashes. 99 00:05:05.480 --> 00:05:08.959 And Ruby has some really interesting nuances here, even with 100 00:05:09.000 --> 00:05:12.800 things that seem like simple types, especially it's string and 101 00:05:12.839 --> 00:05:14.360 symbol classes. 102 00:05:13.879 --> 00:05:17.800 It really does. Ruby's string class creates mutable sequences of 103 00:05:17.920 --> 00:05:21.279 Unicode characters. Mutable means you can change them after you 104 00:05:21.360 --> 00:05:26.519 create them. Okay, But the symbol class creates immutable character sequences. 105 00:05:27.399 --> 00:05:32.000 And here's the surprising implication. Symbols are stored uniquely in 106 00:05:32.120 --> 00:05:33.959 Ruby's interpreter's symbol table. 107 00:05:34.079 --> 00:05:35.160 What does that mean exactly? 108 00:05:35.439 --> 00:05:38.879 It guarantees only a single instance exists for any given 109 00:05:38.959 --> 00:05:42.519 character sequence like dot my symbol. So while you might 110 00:05:42.560 --> 00:05:45.319 just think of them as different kinds of text. Ruby 111 00:05:45.360 --> 00:05:48.839 effectively offers two distinct implementations of a string eighty T. 112 00:05:49.639 --> 00:05:53.120 Each has different performance and memory characteristics, making them suited 113 00:05:53.160 --> 00:05:54.279 for different use cases. 114 00:05:54.360 --> 00:05:55.959 It's quite clever interesting. 115 00:05:56.040 --> 00:05:59.120 Another crucial structured type in Ruby is, of course, the array. 116 00:05:59.800 --> 00:06:02.759 Now Unlike the fixed sized static arrays you might find 117 00:06:02.759 --> 00:06:06.360 in some older languages, Ruby arrays are dynamic. 118 00:06:06.000 --> 00:06:08.040 Arrays, meaning they can grow exactly. 119 00:06:08.199 --> 00:06:10.639 Their size can change at runtime. When an array needs 120 00:06:10.639 --> 00:06:12.879 more memory to grow, the system has to allocate a 121 00:06:12.920 --> 00:06:16.519 whole new, larger chunk of memory right, copy all the 122 00:06:16.560 --> 00:06:19.600 existing elements over, and then free up the old memory space. 123 00:06:20.639 --> 00:06:25.959 It is that reallocation is computationally expensive, so to minimize 124 00:06:26.000 --> 00:06:30.399 how often this happens, Ruby often doubles the array's allocated size, 125 00:06:30.480 --> 00:06:33.920 even if you only ask for say one more slot. 126 00:06:34.399 --> 00:06:36.720 It's trying to anticipate future growth. 127 00:06:36.680 --> 00:06:40.000 That proactiicizing makes a lot of sense avoiding those frequent, 128 00:06:40.120 --> 00:06:44.399 expensive copy operations. But Ruby arrays have some other truly 129 00:06:44.480 --> 00:06:47.480 unique characteristics too, don't they Things that really set them 130 00:06:47.519 --> 00:06:49.079 apart from arrays in other languages. 131 00:06:49.199 --> 00:06:53.279 I absolutely do. For one, Ruby arrays automatically expand if 132 00:06:53.279 --> 00:06:55.600 you try to store a value beyond their current end. 133 00:06:55.920 --> 00:06:59.000 To the programmer, they kind of seem unbounded, okay, And 134 00:06:59.040 --> 00:07:00.879 if a location within when the array doesn't have an 135 00:07:00.920 --> 00:07:03.839 assigned value, it just defaults to nil, right nil. Ruby 136 00:07:03.879 --> 00:07:07.160 also allows negative indices, so a one directly references the 137 00:07:07.240 --> 00:07:09.480 very last element, A two to the second to last, 138 00:07:09.519 --> 00:07:12.000 and so on. Bandy, and maybe most strikingly, if you 139 00:07:12.040 --> 00:07:14.600 try to access an index that's completely out of bounds, 140 00:07:14.680 --> 00:07:17.160 it just returns nil instead of causing an error. 141 00:07:18.319 --> 00:07:22.920 That unbounded nil defaulting behavior seems incredibly flexible, almost convenient 142 00:07:22.959 --> 00:07:25.560 on the surface. Yeah, but when does that flexibility become 143 00:07:25.800 --> 00:07:27.120 maybe a hidden trap. 144 00:07:26.920 --> 00:07:31.199 For developers exactly. That's the crux of it. While it's convenient, 145 00:07:31.360 --> 00:07:36.240 this freedom can inadvertently hide bugs. Think about it. In 146 00:07:36.279 --> 00:07:39.240 a language like Java, if you mistakenly try to assign 147 00:07:39.240 --> 00:07:42.240 a string value to an array meant for integers, the 148 00:07:42.279 --> 00:07:45.360 compiler would immediately flag it as an error boom cauterly 149 00:07:45.560 --> 00:07:48.879 makes sense. In Ruby, the interpreter won't complain at that point. 150 00:07:48.959 --> 00:07:51.279 It might just happily put a string where you expected 151 00:07:51.319 --> 00:07:55.079 an integer, or return nil from an invalid array access. 152 00:07:55.600 --> 00:07:58.120 This makes those types of books potentially much harder to 153 00:07:58.160 --> 00:08:01.639 diagnose until much later, maybe when something unexpected happens further 154 00:08:01.720 --> 00:08:02.560 down the line. 155 00:08:02.759 --> 00:08:05.600 So the core takeaway here seems to be that Ruby rays, 156 00:08:06.319 --> 00:08:10.360 because they can store arbitrary values and grow automatically, behave 157 00:08:10.439 --> 00:08:14.519 more like highly flexible lists than the strict homogeneous arrayse 158 00:08:14.560 --> 00:08:16.000 you find in many other languages. 159 00:08:16.079 --> 00:08:18.959 Precisely, and they also come with an incredibly rich set 160 00:08:19.000 --> 00:08:22.839 of built in methods. They effectively bundle in set operations 161 00:08:22.920 --> 00:08:26.680 like membership testing or unions, string like operations, and even 162 00:08:26.720 --> 00:08:30.600 built in stack operations push pop and Q operations shift 163 00:08:30.879 --> 00:08:34.279 a pen. They're very versatile. So moving beyond those simple types, 164 00:08:34.480 --> 00:08:38.679 containers are a really crucial category of complex abstract data types. 165 00:08:39.080 --> 00:08:42.679 A container is simply an entity that holds a finite 166 00:08:42.759 --> 00:08:45.120 number of other entities. Think of it like a box 167 00:08:45.679 --> 00:08:48.080 or a bag designed to organize things. 168 00:08:47.919 --> 00:08:51.039 That box or bag. Analogy is helpful, it helps visualize it. 169 00:08:51.399 --> 00:08:54.039 But if I'm a developer trying to choose the right container, 170 00:08:54.320 --> 00:08:56.600 what are the critical distinctions I need to keep in mind? 171 00:08:56.720 --> 00:08:58.840 How do I pick the right box for my data? 172 00:08:58.879 --> 00:09:01.799 That's a great question. Key distinctions really boil down to 173 00:09:01.960 --> 00:09:06.840 three main properties. First, their structure. Do they hold elements 174 00:09:06.840 --> 00:09:09.360 in a specific order like a list does, or are 175 00:09:09.360 --> 00:09:10.440 they unordered like a set? 176 00:09:10.519 --> 00:09:11.519 Okay, order matters. 177 00:09:11.639 --> 00:09:15.279 Second, access restrictions. Can you add, remove, or look at 178 00:09:15.279 --> 00:09:18.559 elements anywhere in the container or only at specific points 179 00:09:18.600 --> 00:09:20.399 like just the top of a stack, right like a 180 00:09:20.440 --> 00:09:24.320 stack or queue exactly? And finally, keyed access. Can you 181 00:09:24.360 --> 00:09:27.559 retrieve elements using some unique identifier like how a map 182 00:09:27.639 --> 00:09:30.320 uses a key to find its associated value. 183 00:09:30.120 --> 00:09:32.559 Structure access keyed access? Got it? 184 00:09:32.960 --> 00:09:35.120 Yeah, And we can sort of visualize this with a 185 00:09:35.159 --> 00:09:38.480 conceptual hierarchy like a family tree. At the root, you 186 00:09:38.559 --> 00:09:41.440 might have a very general container. All it knows is 187 00:09:41.480 --> 00:09:44.200 its size, whether it's empty, and how to clear itself 188 00:09:44.240 --> 00:09:47.480 out basic stuff. Right, Then you branch out. You might 189 00:09:47.519 --> 00:09:50.600 have collection, which is a traversible container, meaning you can 190 00:09:50.639 --> 00:09:54.320 get to all its elements. This includes things like lists, sets, 191 00:09:54.360 --> 00:09:57.759 and maps, And separately, you might have dispenser, which is 192 00:09:57.799 --> 00:10:00.960 a non troversible container with those restricts access points we 193 00:10:01.000 --> 00:10:02.799 talked about, like stacks and queues. 194 00:10:03.360 --> 00:10:06.320 These sound a lot like interfaces, which is a common 195 00:10:06.600 --> 00:10:11.519 concept in object oriented programming. But how does Ruby handle 196 00:10:11.639 --> 00:10:16.000 these structural ideas given its famously flexible duck typing approach. 197 00:10:16.679 --> 00:10:18.960 It doesn't really do interfaces in the same way, does it. 198 00:10:19.080 --> 00:10:21.120 You're absolutely right to point that out. Ruby doesn't have 199 00:10:21.159 --> 00:10:24.360 explicit interfaces like Java or c sharp do, where a 200 00:10:24.399 --> 00:10:28.240 class formally declares it implements an interface. Instead, it relies 201 00:10:28.279 --> 00:10:31.840 heavily on duck typing. If it walks like a duck, exactly, 202 00:10:31.960 --> 00:10:34.159 if it walks like a duck and quacks like a duck, 203 00:10:34.360 --> 00:10:37.159 Ruby treats it as a duck. When an object calls 204 00:10:37.200 --> 00:10:40.399 a method on another object, Ruby simply checks at run 205 00:10:40.480 --> 00:10:44.360 time if the receiving object actually implements that method. Does 206 00:10:44.399 --> 00:10:45.360 it respond to that. 207 00:10:45.279 --> 00:10:47.559 Message ah, run time checking yes. 208 00:10:48.120 --> 00:10:51.360 It achieves pretty much the same goal as formal interfaces, 209 00:10:51.399 --> 00:10:54.679 making sure method calls are valid, but without requiring classes 210 00:10:54.759 --> 00:10:58.919 to declare upfront which contracts or interfaces they fulfill. It's 211 00:10:58.960 --> 00:11:03.080 a very dynamic, flexible way to handle type patibility. 212 00:11:03.159 --> 00:11:06.320 Okay, let's dive into some specific linear containers then, starting 213 00:11:06.320 --> 00:11:09.519 with stacks. I'm picturing that stack of plates or maybe 214 00:11:09.559 --> 00:11:11.639 a pile of shirts. The last thing you put on 215 00:11:11.759 --> 00:11:13.720 is the first thing you take off the LEFO. 216 00:11:14.080 --> 00:11:17.279 That's precisely the model. A stack is an ordered container. 217 00:11:17.480 --> 00:11:19.840 And the key thing is that access is restricted to 218 00:11:19.960 --> 00:11:23.000 just one end, which we call the top. The core 219 00:11:23.080 --> 00:11:25.759 operations are push for adding an element to the top, 220 00:11:25.799 --> 00:11:28.799 putting a plate on right, pop for removing the top element. 221 00:11:28.600 --> 00:11:30.039 Taking the top plate off yep. 222 00:11:29.840 --> 00:11:31.840 And top for just peaking at the top element without 223 00:11:31.840 --> 00:11:32.679 actually removing it. 224 00:11:32.919 --> 00:11:38.799 Okay, where does this last in first out logic really shine? 225 00:11:38.840 --> 00:11:40.320 Where is it used in practice? 226 00:11:40.519 --> 00:11:43.919 Oh? All over the place. A classic example is managing 227 00:11:43.960 --> 00:11:47.879 function calls in a program. When function A calls function B, 228 00:11:47.879 --> 00:11:51.120 b's context goes on the stack. If B call C, 229 00:11:51.120 --> 00:11:54.159 C goes on top. When C finishes it pops off. 230 00:11:54.200 --> 00:11:56.759 Then B then a last one called is the first 231 00:11:56.759 --> 00:12:00.000 to finish, right the call stack exactly. Another good example 232 00:12:00.200 --> 00:12:03.440 is maybe reversing something like if you read characters of 233 00:12:03.480 --> 00:12:05.759 a word one by one and push them onto a stack, 234 00:12:05.840 --> 00:12:07.039 then pop them off, you'll get. 235 00:12:06.919 --> 00:12:08.759 The word in reverse simple but effective. 236 00:12:09.120 --> 00:12:12.120 Very or that print school or example we mentioned earlier, 237 00:12:12.159 --> 00:12:15.120 pushing pages on popping them off to print in reverse order. 238 00:12:15.200 --> 00:12:18.240 Gotcha. Now, when it comes to actually implementing a stack, 239 00:12:18.399 --> 00:12:19.840 what are the main ways to do it? 240 00:12:20.039 --> 00:12:23.600 There are basically two primary approaches. You can use contiguous 241 00:12:23.639 --> 00:12:27.080 memory locations, which usually means using an array, or you 242 00:12:27.120 --> 00:12:30.320 can use linked structures. In Ruby, It's built in array 243 00:12:30.360 --> 00:12:34.240 class is particularly well suited for a contiguous stack implementation. 244 00:12:34.879 --> 00:12:38.879 Why because it already has push and pop methods built 245 00:12:38.919 --> 00:12:42.279 right in and they automatically handle the memory expansion and 246 00:12:42.320 --> 00:12:43.080 shrinking for you. 247 00:12:43.200 --> 00:12:45.720 It's very convenient, nice and the linked way. 248 00:12:46.200 --> 00:12:49.240 For a linked implementation, you typically use what's called a 249 00:12:49.279 --> 00:12:52.799 singly linked list. You have nodes each pointing to the next, 250 00:12:52.960 --> 00:12:54.919 and you just keep a reference to the headnode, which 251 00:12:54.919 --> 00:12:57.559 acts as your top node. The advantage here is that 252 00:12:57.600 --> 00:13:00.519 it never really becomes full unless you complete run out 253 00:13:00.519 --> 00:13:03.080 of memory, and it can sometimes be more efficient with 254 00:13:03.159 --> 00:13:05.320 space only using what it needs. 255 00:13:05.919 --> 00:13:09.879 Interesting trade offs. Okay, so then we have queues. Unlike stacks, 256 00:13:09.919 --> 00:13:14.159 these are first in, first out FIFO, like a line 257 00:13:14.200 --> 00:13:16.080 at the bank or for free lunch. 258 00:13:16.240 --> 00:13:18.799 Exactly like a line, a queue is a dispenser where 259 00:13:18.879 --> 00:13:20.600 elements are inserted at one end. 260 00:13:20.600 --> 00:13:23.320 Called the rear, joining the back of the line right, and. 261 00:13:23.279 --> 00:13:25.840 They're removed or access from the other end the front, 262 00:13:25.919 --> 00:13:30.000 getting served at the front precisely. Key operations include enter 263 00:13:30.240 --> 00:13:32.960 or on queue to add to the rear, leave or 264 00:13:33.000 --> 00:13:35.279 to queue to remove from the front and front to 265 00:13:35.399 --> 00:13:37.679 just peek at the element at the front without removing it. 266 00:13:37.799 --> 00:13:40.559 And there's a catch with leave in front, isn't there? 267 00:13:40.720 --> 00:13:44.679 Yes, both leave and front share an important precondition. The 268 00:13:44.799 --> 00:13:48.000 queue must not be empty. You can't serve someone if 269 00:13:48.000 --> 00:13:48.840 the line is empty. 270 00:13:49.399 --> 00:13:53.639 Makes sense. So how would our print spooler use a 271 00:13:53.759 --> 00:13:56.120 queue instead of the stack we talked about earlier? 272 00:13:56.240 --> 00:13:59.080 Well, a queue is perfect for managing print jobs fairly. 273 00:13:59.480 --> 00:14:02.559 As new print jobs arrive from different users, they enter 274 00:14:02.600 --> 00:14:05.240 the queue at the rear. When the printer becomes free, 275 00:14:05.440 --> 00:14:07.240 it takes the job at the front, the one that's 276 00:14:07.279 --> 00:14:08.799 been waiting the longest, by having it. 277 00:14:08.799 --> 00:14:11.519 Leave the queue first come, first served exactly. 278 00:14:12.080 --> 00:14:15.480 This guarantees no job gets held up indefinitely. Everyone gets 279 00:14:15.519 --> 00:14:18.320 their turn in the order they arrived. It ensures fairness. 280 00:14:18.759 --> 00:14:22.039 Okay, So when we think about implementing queues, what's the 281 00:14:22.080 --> 00:14:25.080 core challenge, especially if we try to use a standard array, 282 00:14:25.679 --> 00:14:27.360 and how do we usually solve it? 283 00:14:27.759 --> 00:14:31.639 Right with a contiguous array implementation, The challenge is that 284 00:14:31.799 --> 00:14:35.519 removing an element from the front would normally require shifting 285 00:14:35.639 --> 00:14:38.919 every single other element down one spot to fill the gap, 286 00:14:39.120 --> 00:14:39.399 and it. 287 00:14:39.399 --> 00:14:42.320 Sounds terribly inefficient, especially for long queues. 288 00:14:42.639 --> 00:14:45.120 It is it would make the leave operation very slow. 289 00:14:45.639 --> 00:14:48.159 So the clever solution is to use what's called a 290 00:14:48.240 --> 00:14:52.399 circular array or ring buffer. Imagine the array is bent 291 00:14:52.440 --> 00:14:55.639 into a circle. The data can float within the array, 292 00:14:55.960 --> 00:14:58.639 and the front and rear printers can wrap around from 293 00:14:58.639 --> 00:14:59.600 the end back. 294 00:14:59.399 --> 00:15:01.519 To the start. Ah, so you don't have to shift 295 00:15:01.559 --> 00:15:02.679 everything exactly. 296 00:15:03.120 --> 00:15:07.519 This dramatically improves efficiency by avoiding those constant, costly data 297 00:15:07.600 --> 00:15:09.360 shifts when elements leave the front. 298 00:15:09.759 --> 00:15:11.519 Clever, but is that the preferred way? 299 00:15:11.679 --> 00:15:14.879 It works? But a linked implementation is often generally preferred 300 00:15:14.879 --> 00:15:18.000 For cues. You typically use a singly linked list, but 301 00:15:18.080 --> 00:15:20.600 this time you need pointers to both the front ptr 302 00:15:20.759 --> 00:15:23.720 the head and the rear ptr the tail to allow 303 00:15:23.720 --> 00:15:26.120 efficient additions at the back and removals from the front. 304 00:15:26.240 --> 00:15:27.200 Why is that often better? 305 00:15:27.399 --> 00:15:31.600 Well, it's truly unbounded, again, limited only by memory. It's 306 00:15:31.679 --> 00:15:35.840 usually very efficient in space usage, and both entering and 307 00:15:36.039 --> 00:15:40.480 leaving are very fast constant time operations because you don't 308 00:15:40.480 --> 00:15:42.559 need to traverse the whole list, just update a couple 309 00:15:42.600 --> 00:15:43.159 of pointers. 310 00:15:43.240 --> 00:15:47.080 Got it. Lots of implementation choices with different trade offs. 311 00:15:47.120 --> 00:15:51.080 Absolutely. Now let's shift gears slightly and turn our attention 312 00:15:51.200 --> 00:15:55.960 back to collections. Remember those they're traversible containers. 313 00:15:55.519 --> 00:15:56.960 Right, meaning you can access all their. 314 00:15:56.879 --> 00:16:00.000 Elements exactly, and that process of accessing all the other 315 00:16:00.000 --> 00:16:01.279 elements is called iteration. 316 00:16:01.639 --> 00:16:04.279 Okay, iteration, But there are different ways to actually do 317 00:16:04.399 --> 00:16:05.440 the iterating, aren't there? 318 00:16:05.600 --> 00:16:09.440 Yes, there are a few main styles. Iteration can be external, 319 00:16:09.840 --> 00:16:13.720 where a separate entity an iterator object, controls the traversal. 320 00:16:14.159 --> 00:16:17.000 It asks the collection for the next element, then the next, 321 00:16:17.120 --> 00:16:20.440 giving you fine grained control over the process. 322 00:16:20.120 --> 00:16:22.240 Like having a separate remote control for the collection. 323 00:16:22.440 --> 00:16:25.480 Kind of yeah. The well known iterator design pattern is 324 00:16:25.480 --> 00:16:29.320 a formal model for these external iterators. Or iteration can 325 00:16:29.360 --> 00:16:33.080 be internal. Here, the collection itself provides a method maybe 326 00:16:33.120 --> 00:16:35.360 called each or four each, that accepts a block of 327 00:16:35.440 --> 00:16:38.960 code like a function or lambda, and applies that code 328 00:16:38.960 --> 00:16:42.279 to each element within the collection. The collection manages the 329 00:16:42.279 --> 00:16:43.279 traversal internally. 330 00:16:43.399 --> 00:16:45.759 Okay, external control versus internal application. 331 00:16:46.000 --> 00:16:49.000 Right, And given its reputation for flexibility, how does Ruby 332 00:16:49.039 --> 00:16:51.120 fit into this picture? Does it prefer one way? 333 00:16:51.559 --> 00:16:56.399 Ruby is remarkably versatile here. It actually supports most of 334 00:16:56.440 --> 00:16:59.679 the common iteration alternatives you find in different languages. 335 00:16:59.320 --> 00:17:02.320 But it's preferred mechanism. The most idiomatic way to traverse 336 00:17:02.320 --> 00:17:06.079 collections in Ruby is internal iteration. This is primarily done 337 00:17:06.119 --> 00:17:09.160 through the innumerable mix and module. If a class includes 338 00:17:09.160 --> 00:17:12.920 innumerable and defines in each method, it automatically gets access 339 00:17:12.960 --> 00:17:17.000 to over twenty powerful internal iteration and collection processing methods 340 00:17:17.039 --> 00:17:19.000 like map, select, reduce, and so on. 341 00:17:19.160 --> 00:17:22.319 Wow, innumerable is powerful, then hugely powerful. 342 00:17:22.680 --> 00:17:27.160 But Ruby also supports enumerator objects. These are interesting because 343 00:17:27.400 --> 00:17:30.279 they're kind of like iterators that can perform both internal 344 00:17:30.359 --> 00:17:34.680 and external iteration. They can even handle things like exceptions 345 00:17:34.720 --> 00:17:37.200 if you need to stop the iteration process early. So 346 00:17:37.319 --> 00:17:41.480 Ruby gives developers a really nice blend of options for iteration. 347 00:17:41.359 --> 00:17:45.400 Very flexible. Let's talk about a fundamental collection type lists. 348 00:17:45.559 --> 00:17:48.720 Right lists are ordered linear collections. They're absolutely fundamental for 349 00:17:48.759 --> 00:17:52.599 countless applications. Common operations you'd expect on a list include 350 00:17:52.720 --> 00:17:56.200 inserting an element at a specific index, deleting an element 351 00:17:56.240 --> 00:17:59.960 at a specific index, delete heat, accessing elements by the 352 00:18:00.160 --> 00:18:04.519 index using the square brackets, replacing elements by index, finding 353 00:18:04.559 --> 00:18:07.720 the index of a particular element, and maybe creating slices 354 00:18:07.799 --> 00:18:09.160 which are like sub lists. 355 00:18:09.240 --> 00:18:11.720 Yeah, that example you gave earlier. A calendar programs to do. 356 00:18:11.839 --> 00:18:14.720 List seems like a perfect fit. Items have an order, 357 00:18:14.720 --> 00:18:17.559 maybe based on precedence, and need to add, remove, or 358 00:18:17.640 --> 00:18:20.359 maybe move them around freely. A list seems ideal. 359 00:18:20.480 --> 00:18:23.279 It is a list ADT is the perfect container for 360 00:18:23.319 --> 00:18:25.880 that kind of dynamic ordered data. 361 00:18:25.960 --> 00:18:28.799 So when we talk about implementing lists, what are the 362 00:18:28.799 --> 00:18:31.200 primary trade offs? We touched on this a bit with 363 00:18:31.279 --> 00:18:33.839 stacks and queues using arrays versus link. 364 00:18:33.720 --> 00:18:37.599 Structures exactly the same core trade off. Supply lists are 365 00:18:37.640 --> 00:18:42.039 actually very straightforward to implement with a contiguous implementation, Elements 366 00:18:42.039 --> 00:18:45.319 are stored sequentially right next to each other in an array. 367 00:18:45.960 --> 00:18:48.960 Static arrays have a fixed size, which can be limiting. 368 00:18:49.400 --> 00:18:52.680 Dynamic arrays like the ones Ruby provides, grow as needed. 369 00:18:52.839 --> 00:18:54.839 Remember the doubling strategy. 370 00:18:54.440 --> 00:18:56.200 Right to avoid frequent reallocations. 371 00:18:56.319 --> 00:18:59.559 Yeah. Now, the big trade off with contiguous lists is 372 00:18:59.640 --> 00:19:03.119 insert and deletion in the middle. If you insert or 373 00:19:03.160 --> 00:19:05.640 delete an element somewhere in the middle of a long list, 374 00:19:05.720 --> 00:19:08.559 you might have to shift many many other elements up 375 00:19:08.640 --> 00:19:11.359 or down to make space or close the gap. This 376 00:19:11.519 --> 00:19:16.400 makes those operations potentially slow on in the worst case, proportional. 377 00:19:15.759 --> 00:19:17.799 To the list length, But accessing is fast. 378 00:19:17.920 --> 00:19:22.200 But accessing an element directly by its index number, that's instantaneous. 379 00:19:22.359 --> 00:19:24.799 Oh one, because you know exactly where it is in 380 00:19:24.839 --> 00:19:26.079 memory based on the index. 381 00:19:26.160 --> 00:19:28.960 Okay, and what about linked implementations for lists? 382 00:19:29.160 --> 00:19:32.920 For a linked implementation, you use that series of nodes 383 00:19:32.960 --> 00:19:36.759 we talked about, connected by references or pointers. You can 384 00:19:36.880 --> 00:19:39.839 use singly linked lists each node points to the next, 385 00:19:39.960 --> 00:19:42.720 or doubly linked lists each node points to the next 386 00:19:42.799 --> 00:19:43.119 and the. 387 00:19:43.079 --> 00:19:45.759 Previous, and the trade off here is flipped pretty much. 388 00:19:46.279 --> 00:19:49.799 The advantage is that insertion and deletion are very fast. 389 00:19:49.920 --> 00:19:51.880 Once you've found the node where you want to insert 390 00:19:51.960 --> 00:19:54.480 or delete, you just need to update a few pointers. 391 00:19:54.519 --> 00:19:56.680 It doesn't matter how long the list is, right, But 392 00:19:56.799 --> 00:20:00.799 the significant drawback is accessing an element by its index. 393 00:20:01.359 --> 00:20:04.920 Since the nodes aren't stored contiguously, to find the element 394 00:20:05.000 --> 00:20:08.079 at index, say one thousand, you have to start at 395 00:20:08.119 --> 00:20:11.480 the beginning ahead and follow the links one thousand times. 396 00:20:11.559 --> 00:20:14.119 Ah, So accessing by index become slow. 397 00:20:14.240 --> 00:20:15.359 Oh m, exactly. 398 00:20:15.559 --> 00:20:15.799 Yeah. 399 00:20:15.839 --> 00:20:18.480 So this raises that important question again, when would you 400 00:20:18.599 --> 00:20:19.720 choose one over the other. 401 00:20:20.119 --> 00:20:21.680