Dynamic Programming Primer
We’re about to start our final stretch of Haskell/Rust LeetCode comparisons (for now). In this group, we’ll do a quick study of some dynamic programming problems, which are a common source of headaches in programming interviews. We’ll do a couple of single-dimension problems, and then show DP in multiple dimensions. Haskell has a couple of interesting quirks to work out with dynamic programming, so we’ll try to understand them by comparison to Rust.
Dynamic programming is one of a few different algorithms you’ll learn about in Module 3 of Solve.hs, our Haskell problem solving course. Check it out today!
The Problems
Today’s problem is called House Robber. Normally we wouldn’t want to encourage crime, but when people have such a convoluted security setup as this problem suggests, perhaps they can’t complain.
The idea is that we receive a list of integers, representing the value that we can gain from “robbing” each house on a street. The security system is set up so that the police will be alerted if and only if two adjacent houses are robbed. So we could rob every other house, and no police will come.
We are trying to determine the maximum value we can get from robbing these houses without setting off the alarm (by robbing adjacent houses).
Dynamic Programming Introduction
As mentioned in the introduction, we’ll solve this problem using dynamic programming. This term can mean a couple different things, but by and large the idea is that we use answers on smaller portions of the input (going down as far as base cases) to build up to the final answer on the full input.
This can be done from the top down, generally by means of recursion. If we do this, we’ll often want to “cache” answers (Memoization) to certain parts of the problem so we don’t do redundant calculations.
We can also build answers from the bottom up (tabulation). This often takes the form of creating an array and storing answers to smaller queries in this array. Then we loop from the start of this array to the end, which should give us our answer. Our solutions in this series will largely rest on this tabulation idea. However, as we’ll see, we don’t always need an array of all prior answers to do this!
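As a quick illustration of these two styles (separate from today’s problem), here is a minimal Haskell sketch for Fibonacci numbers; the function names here are just for demonstration:
-- Top-down: recursion, with a lazy list standing in for a memoization cache.
fibMemo :: Int -> Integer
fibMemo n = table !! n
  where
    table = map fib [0 ..]
    fib 0 = 0
    fib 1 = 1
    fib i = table !! (i - 1) + table !! (i - 2)

-- Bottom-up (tabulation): start from the base cases and build forward,
-- keeping only the last two values, much like we'll do below.
fibTab :: Int -> Integer
fibTab n = go n 0 1
  where
    go 0 a _ = a
    go k a b = go (k - 1) b (a + b)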
The key in dynamic programming is to define, whether for an array index or a recursive call, exactly what a “partial” solution means. This will help us use partial solutions to build our complete solution.
The Algorithm
Now let’s figure out how we’ll use dynamic programming for our house robbing problem. The broad idea is that we could define two arrays, the “robbed” array and the “unrobbed” array. Each of these should be equal in size to the number of houses on the street. Let’s carefully define what each array means.
Index i of the “robbed” array should reflect the maximum value we can get from the houses [0..i] such that we have “robbed” house i. Then the “unrobbed” array, at index i, contains the maximum total value we can get from the houses [0..i] such that we have not robbed house i.
When it comes to populating these arrays we need to think first about the base cases. Then we need to consider how to build a new case from existing cases we have. With a recursive solution we have the same pattern: base case and recursive case.
The first two indices for each can be trivially calculated; they are our base cases:
robbed[0] = input[0] // Rob house 0
robbed[1] = input[1] // Rob house 1
unrobbed[0] = 0 // Can’t rob any houses
unrobbed[1] = input[0] // Rob house 0
Now we need to build a generic case i, assuming that we have already calculated all the values from 0 to i - 1. To calculate robbed[i], we assume we are robbing house i, so we add input[i]. If we are robbing house i we must not have robbed house i - 1, so we add input[i] to unrobbed[i - 1].

To calculate unrobbed[i], we have the option of whether or not we robbed house i - 1. It may be advantageous to skip two houses in a row! Consider an example like [100, 1, 1, 100]. So we take the maximum of unrobbed[i - 1] and robbed[i - 1].

This gives us our general case, and so at the end we simply select the maximum of robbed[n - 1] and unrobbed[n - 1].
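To make the recurrence concrete, here is how the two arrays evolve for the example input [100, 1, 1, 100]:
i : input robbed unrobbed
0 : 100   100    0
1 : 1     1      100
2 : 1     101    100
3 : 100   200    101
The answer is max(200, 101) = 200, which corresponds to robbing houses 0 and 3.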
We’ve been speaking in terms of arrays, but we can observe that we only need the i - 1 value from each array to construct the i values. This means we don’t actually have to store a complete array, which would take O(n) memory. Instead we can store the last “robbed” number and the last “unrobbed” number. This makes our solution O(1) memory.
Haskell Solution
Now let’s write some code, starting with Haskell! LeetCode guarantees that our input is non-empty, but we still need to handle the size-1 case specially:
robHouse :: V.Vector Int -> Int
robHouse nums = if n == 1 then nums V.! 0
else ...
where
n = V.length nums
...
Now let’s write a recursive loop function that will take our prior two values (robbed and unrobbed) as well as the index. These are the “stateful” values of our loop. We’ll use these to either return the final value, or make a recursive call with new “robbed” and “unrobbed” values.
robHouse :: V.Vector Int -> Int
robHouse nums = if n == 1 then nums V.! 0
else ...
where
n = V.length nums
loop :: (Int, Int) -> Int -> Int
loop (lastRobbed, lastUnrobbed) i = ...
For the “final” case, we see if we have reached the end of our array (i = n), in which case we return the max of the two values:
robHouse :: V.Vector Int -> Int
robHouse nums = if n == 1 then nums V.! 0
else ...
where
n = V.length nums
loop :: (Int, Int) -> Int -> Int
loop (lastRobbed, lastUnrobbed) i = if i == n then max lastRobbed lastUnrobbed
else ...
Now we fill in our recursive case, using the logic discussed in our algorithm:
robHouse :: V.Vector Int -> Int
robHouse nums = if n == 1 then nums V.! 0
else ...
where
n = V.length nums
loop :: (Int, Int) -> Int -> Int
loop (lastRobbed, lastUnrobbed) i = if i == n then max lastRobbed lastUnrobbed
else
let newRobbed = nums V.! i + lastUnrobbed
newUnrobbed = max lastRobbed lastUnrobbed
in loop (newRobbed, newUnrobbed) (i + 1)
Finally, we make the initial call to loop to get our answer! This completes our Haskell solution:
robHouse :: V.Vector Int -> Int
robHouse nums = if n == 1 then nums V.! 0
else loop (nums V.! 1, nums V.! 0) 2
where
n = V.length nums
loop :: (Int, Int) -> Int -> Int
loop (lastRobbed, lastUnrobbed) i = if i == n then max lastRobbed lastUnrobbed
else
let newRobbed = nums V.! i + lastUnrobbed
newUnrobbed = max lastRobbed lastUnrobbed
in loop (newRobbed, newUnrobbed) (i + 1)
Even when tabulating from the ground up in Haskell, we can still use recursion!
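As a quick sanity check (assuming an import of qualified Data.Vector as V), we can try a couple of inputs in GHCi:
>>> robHouse (V.fromList [100, 1, 1, 100])
200
>>> robHouse (V.fromList [2, 7, 9, 3, 1])
12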
Rust Solution
Our Rust solution is similar, just using a loop instead of a recursive function. We start by handling our edge case and coming up with the initial values for “last robbed” and “last unrobbed”.
pub fn rob(nums: Vec<i32>) -> i32 {
let n = nums.len();
if n == 1 {
return nums[0];
}
let mut lastRobbed = nums[1];
let mut lastUnrobbed = nums[0];
...
}
Now we just apply our algorithmic logic in a loop from 2 to n, resetting lastRobbed and lastUnrobbed each time.
pub fn rob(nums: Vec<i32>) -> i32 {
let n = nums.len();
if n == 1 {
return nums[0];
}
let mut lastRobbed = nums[1];
let mut lastUnrobbed = nums[0];
for i in 2..n {
let newRobbed = nums[i] + lastUnrobbed;
let newUnrobbed = std::cmp::max(lastUnrobbed, lastRobbed);
lastRobbed = newRobbed;
lastUnrobbed = newUnrobbed;
}
return std::cmp::max(lastRobbed, lastUnrobbed);
}
And now we’re done with Rust!
Conclusion
Next week we’ll do a problem that actually requires us to store a full array of prior solutions. To learn the different stages in building up an understanding of dynamic programming, you should take our problem solving course, Solve.hs. Module 3 focuses on algorithms, including dynamic programming!
Apply the Trie: Word Search
Today will be a nice culmination of some of the work we’ve been doing with data structures and algorithms. In the past few weeks we’ve covered graph algorithms, particularly Depth First Search. And last week, we implemented the Trie data structure from scratch. Today we’ll solve a “Hard” problem (according to LeetCode) that pulls these pieces together!
For a comprehensive study of data structures and algorithms in Haskell, you should take a look at our course, Solve.hs. You’ll spend a full module on data structures in Haskell, and then another module learning about algorithms, especially graph algorithms!
The Problem
Today’s problem is Word Search II. In the first version of this problem, LeetCode asks you to determine if we can find a single word in a grid. In this second version, we receive an entire list of words, and we have to return the subset of those words that can be found in the grid.
Now the “search” mechanism on this grid is not simply a straight line search. We’ll use a limited Boggle Search. From each letter in a word, we can make any movement to the next letter, as long as it is horizontal or vertical (Boggle also allows diagonal, but we won’t).
So here’s an example grid:
CAND
XBIY
TENQ
Words like “CAN” and “TEN” are obviously allowed. But we can also use “CAB”, even though it doesn’t form a single straight line. Even better, we can use “CABINET”, snaking through all 3 rows. However, a word like “DIN” is disallowed since it would require a diagonal move. Also, we cannot “re-use” letters in the same word. So “TENET” would not be allowed, as it requires backtracking over the E and T.
We need to find the most efficient way to determine which of our input words can be found in the grid.
The Algorithm
This problem combines two elements we’ve recently worked with. First, we will use DFS ideas to actually search through the grid from a particular starting location. Second, we will use a Trie to store all of our input words. This will help us determine if we can stop searching. Once we find a string that is not a prefix in our Trie, we can discontinue the search branch.
Here’s a run-down of the solution:
- Make a Trie from the Input Words
- Search from each starting location in the grid, trying to add good words to a growing set of results.
- At each location, add the character to our string. See if the resulting string is still a valid prefix of our Trie.
- If it is, add the word to our results if the Trie indicates it is a valid word. Then search the neighboring locations, while keeping track of locations we’ve visited.
- Once we no longer have a valid Trie, we can stop the line of searching
This run-down obscures quite a few details, but given our last few weeks of work, those details shouldn’t be too difficult.
Updating Trie Functions
For both our solutions, we’ll assume we’re using the same Trie structure we built last week. We’ll need the general structure, as well as the insert function.

However, we won’t actually need the search or startsWith functions. Each time we call these, we traverse the full length of the string we’re querying. With the way our algorithm works here, that would get quite inefficient (quadratic time overall).

Instead we’re going to rely on directly accessing sub-Tries, so that as we build our search word longer, we hold a Trie that already reflects the prefix we’ve matched in the main Trie. This keeps each subsequent step fast.

To make this more convenient, we’ll just provide a function to get a Maybe Trie from the “sub-Tries” of the node we’re working with, based on the character. We’ll call this “popping” a Trie.
Here’s what it looks like in Rust:
impl Trie {
...
fn pop(&self, c: char) -> Option<&Trie> {
return self.nodes.get(&c);
}
}
And in Haskell:
popTrie :: Char -> Trie -> Maybe Trie
popTrie c (Trie _ subs) = M.lookup c subs
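For intuition, repeatedly popping characters recovers the old prefix-checking behavior. Here is a hypothetical helper (not part of the solution) showing that a chain of popTrie calls just walks down one path of the Trie:
walkPrefix :: String -> Trie -> Maybe Trie
walkPrefix [] t = Just t
walkPrefix (c : cs) t = popTrie c t >>= walkPrefix cs
The point of the search below is that we never need to walk a full string like this; we pop exactly one character per grid step.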
For completeness, here’s the entire Rust version of the Trie that we’ll need for this problem:
use std::collections::HashMap;
use std::str::Chars;
struct Trie {
endsWord: bool,
nodes: HashMap<char, Trie>
}
impl Trie {
fn new() -> Self {
Trie {
endsWord: false,
nodes: HashMap::new()
}
}
fn insert(&mut self, word: String) {
self.insertIt(word.chars());
}
fn insertIt(&mut self, mut iter: Chars) {
if let Some(c) = iter.next() {
if !self.nodes.contains_key(&c) {
self.nodes.insert(c, Trie::new());
}
if let Some(subTrie) = self.nodes.get_mut(&c) {
subTrie.insertIt(iter);
}
} else {
self.endsWord = true;
}
}
fn pop(&self, c: char) -> Option<&Trie> {
return self.nodes.get(&c);
}
}
And here’s the full Haskell version:
data Trie = Trie Bool (M.Map Char Trie)
insertTrie :: String -> Trie -> Trie
insertTrie [] (Trie _ subs) = Trie True subs
insertTrie (c : cs) (Trie ends subs) =
let sub = fromMaybe (Trie False M.empty) (M.lookup c subs)
newSub = insertTrie cs sub
in (Trie ends (M.insert c newSub subs))
popTrie :: Char -> Trie -> Maybe Trie
popTrie c (Trie _ subs) = M.lookup c subs
Rust Solution
Now let’s move on to our solution, starting with Rust. As always with a graph problem, we’ll benefit from having a neighbors function. This will be very similar to functions we’ve written in the past few weeks, so we won’t dwell on it. In this case though, we’ll incorporate the visited set directly into this function, and exclude neighbors we’ve already seen:
pub fn neighbors(
nr: usize,
nc: usize,
visitedLocs: &HashSet<(usize, usize)>,
loc: (usize, usize)) -> Vec<(usize, usize)> {
let r = loc.0;
let c = loc.1;
let mut results = Vec::new();
if (r > 0 && !visitedLocs.contains(&(r - 1, c))) {
results.push((r - 1, c));
}
if (c > 0 && !visitedLocs.contains(&(r, c - 1))) {
results.push((r, c - 1));
}
if (r + 1 < nr && !visitedLocs.contains(&(r + 1, c))) {
results.push((r + 1, c));
}
if (c + 1 < nc && !visitedLocs.contains(&(r, c + 1))) {
results.push((r, c + 1));
}
return results;
}
Now, thinking back to the Islands example, we want to write a search function similar to the visit function we had before. The job of the visit function was to populate the visited set with all reachable tiles from the start. Our search function will populate a set of “results” with every word reachable from a certain location.

It will require some immutable inputs, such as the board dimensions and the board itself, as well as several mutable, stateful items like the current Trie, the String we are building, and the current visited set. Here’s the signature we will use:
pub fn search(
nr: usize,
nc: usize,
board: &Vec<Vec<char>>,
trie: &Trie,
loc: (usize, usize),
visitedLocs: &mut HashSet<(usize, usize)>,
currentStr: &mut String,
seenWords: &mut HashSet<String>) {
...
}
This function has two tasks. First, assess the current location to see if the word we completed by arriving here should be added, or if it’s at least a prefix of a remaining word. If either is true, our second job is to find this location’s neighbors and recursively call them, continuing to grow our string and search for longer words.
Let’s write the code for the first part. Naturally, we need the character at this grid location. Then we need to query our Trie to “pop” the sub-Trie associated with this character. If this sub-Trie doesn’t exist, we immediately return. Otherwise, we consider this location “visited” (add it to the set) and we push the new character onto the string. If our new sub-Trie “ends a word”, then we add this word to our results set!
pub fn search(
nr: usize,
nc: usize,
board: &Vec<Vec<char>>,
trie: &Trie,
loc: (usize, usize),
visitedLocs: &mut HashSet<(usize, usize)>,
currentStr: &mut String,
seenWords: &mut HashSet<String>) {
let c = board[loc.0][loc.1];
if let Some(subTrie) = trie.pop(c) {
currentStr.push(c);
visitedLocs.insert(loc);
if subTrie.endsWord {
seenWords.insert(currentStr.clone());
}
...
}
}
Now we get our neighbors and recursively search them, passing updated mutable values. But there’s one extra thing to include! After we are done searching our neighbors, we should “undo” our mutable changes to the visited set and the string.
We haven’t had to make this kind of “backtracking” change before. But we don’t want to permanently keep this location in this visited set, nor keep the string modified. When we return to our caller, we want these mutable values to be the same as how we got them. Otherwise, subsequent calls may be disturbed, and we’ll get incorrect answers!
pub fn search(
nr: usize,
nc: usize,
board: &Vec<Vec<char>>,
trie: &Trie,
loc: (usize, usize),
visitedLocs: &mut HashSet<(usize, usize)>,
currentStr: &mut String,
seenWords: &mut HashSet<String>) {
let c = board[loc.0][loc.1];
if let Some(subTrie) = trie.pop(c) {
currentStr.push(c);
visitedLocs.insert(loc);
if subTrie.endsWord {
seenWords.insert(currentStr.clone());
}
let ns = neighbors(nr, nc, visitedLocs, loc);
for n in ns {
search(nr, nc, board, subTrie, n, visitedLocs, currentStr, seenWords);
}
// Backtrack! Remove this location and pop this character
visitedLocs.remove(&loc);
currentStr.pop();
}
}
This completes the search function. Now we just have to call it! We start our primary function by initializing our key values, especially our Trie. We need to insert all the starting words into a Trie that we create:
pub fn find_words(board: Vec<Vec<char>>, words: Vec<String>) -> Vec<String> {
let mut trie = Trie::new();
for word in &words {
trie.insert(word.to_string());
}
let mut results = HashSet::new();
let nr = board.len();
let nc = board[0].len();
...
}
And now we just loop through each location in the grid and search it as a starting location! For an extra optimization, we can stop our search early if we have found all of our words.
pub fn find_words(board: Vec<Vec<char>>, words: Vec<String>) -> Vec<String> {
let mut trie = Trie::new();
for word in &words {
trie.insert(word.to_string());
}
let mut results = HashSet::new();
let nr = board.len();
let nc = board[0].len();
for i in 0..nr {
for j in 0..nc {
if results.len() < words.len() {
let mut visited = HashSet::new();
let mut curr = String::new();
search(nr, nc, &board, &trie, (i, j), &mut visited, &mut curr, &mut results);
}
}
}
return results.into_iter().collect();
}
And we’re done! Here is our complete Rust solution!
pub fn neighbors(
nr: usize,
nc: usize,
visitedLocs: &HashSet<(usize, usize)>,
loc: (usize, usize)) -> Vec<(usize, usize)> {
let r = loc.0;
let c = loc.1;
let mut results = Vec::new();
if (r > 0 && !visitedLocs.contains(&(r - 1, c))) {
results.push((r - 1, c));
}
if (c > 0 && !visitedLocs.contains(&(r, c - 1))) {
results.push((r, c - 1));
}
if (r + 1 < nr && !visitedLocs.contains(&(r + 1, c))) {
results.push((r + 1, c));
}
if (c + 1 < nc && !visitedLocs.contains(&(r, c + 1))) {
results.push((r, c + 1));
}
return results;
}
pub fn search(
nr: usize,
nc: usize,
board: &Vec<Vec<char>>,
trie: &Trie,
loc: (usize, usize),
visitedLocs: &mut HashSet<(usize, usize)>,
currentStr: &mut String,
seenWords: &mut HashSet<String>) {
let c = board[loc.0][loc.1];
if let Some(subTrie) = trie.pop(c) {
currentStr.push(c);
visitedLocs.insert(loc);
if subTrie.endsWord {
seenWords.insert(currentStr.clone());
}
let ns = neighbors(nr, nc, visitedLocs, loc);
for n in ns {
search(nr, nc, board, subTrie, n, visitedLocs, currentStr, seenWords);
}
visitedLocs.remove(&loc);
currentStr.pop();
}
}
pub fn find_words(board: Vec<Vec<char>>, words: Vec<String>) -> Vec<String> {
let mut trie = Trie::new();
for word in &words {
trie.insert(word.to_string());
}
let mut results = HashSet::new();
let nr = board.len();
let nc = board[0].len();
for i in 0..nr {
for j in 0..nc {
if results.len() < words.len() {
let mut visited = HashSet::new();
let mut curr = String::new();
search(nr, nc, &board, &trie, (i, j), &mut visited, &mut curr, &mut results);
}
}
}
return results.into_iter().collect();
}
Haskell Solution
Our Haskell solution starts with some of the same beats. We’ll create our initial Trie through insertion and define a familiar neighbors function:
findWords :: A.Array (Int, Int) Char -> [String] -> [String]
findWords board allWords = ...
where
((minRow, minCol), (maxRow, maxCol)) = A.bounds board
trie = foldr insertTrie (Trie False M.empty) allWords
neighbors :: HS.HashSet (Int, Int) -> (Int, Int) -> [(Int, Int)]
neighbors visited (r, c) =
let up = if r > minRow && not (HS.member (r - 1, c) visited) then Just (r - 1, c) else Nothing
left = if c > minCol && not (HS.member (r, c - 1) visited) then Just (r, c - 1) else Nothing
down = if r < maxRow && not (HS.member (r + 1, c) visited) then Just (r + 1, c) else Nothing
right = if c < maxCol && not (HS.member (r, c + 1) visited) then Just (r, c + 1) else Nothing
in catMaybes [up, left, down, right]
...
Now let’s think about our search function. This function’s job is to update a set, given a particular location. We’re going to loop over this function with many locations. So we want the end of its signature to look like:
(Int, Int) -> HS.HashSet String -> HS.HashSet String
This will allow us to use it with foldr. But we still want to think about the “mutable” elements going into this function: the Trie, the visited set, and the accumulated string. These also change from call to call, but they should come earlier in the signature, since we can fix them for each pass of the fold. So here’s what our full type signature looks like:
search ::
Trie ->
HS.HashSet (Int, Int) ->
String ->
(Int, Int) ->
HS.HashSet String ->
HS.HashSet String
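The reason for this argument order deserves a tiny standalone example (the names here are hypothetical, not part of the solution): by partially applying the “fixed” arguments first, we’re left with exactly the two-argument shape that foldr expects.
-- foldr wants a function of type (element -> accumulator -> accumulator).
step :: Int -> String -> Int -> Int
step bonus name acc = acc + length name + bonus

total :: Int
total = foldr (step 10) 0 ["ab", "cde"]  -- (length "ab" + 10) + (length "cde" + 10) = 25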
The first order of business in this function is to “pop” the Trie based on the character at this location and see if the sub-Trie exists. If not, we simply return our original set:
findWords :: A.Array (Int, Int) Char -> [String] -> [String]
findWords board allWords = ...
where
...
search ::
Trie ->
HS.HashSet (Int, Int) ->
String ->
(Int, Int) ->
HS.HashSet String ->
HS.HashSet String
search trie' visited currentStr loc seenWords = case popTrie (board A.! loc) trie' of
Nothing -> seenWords
Just sub@(Trie ends _) -> ...
Now we update our current string and visited set, while adding the word to our results if the sub-Trie indicates we are at the end of a word:
findWords :: A.Array (Int, Int) Char -> [String] -> [String]
findWords board allWords = ...
where
...
search ::
Trie ->
HS.HashSet (Int, Int) ->
String ->
(Int, Int) ->
HS.HashSet String ->
HS.HashSet String
search trie' visited currentStr loc seenWords = case popTrie (board A.! loc) trie' of
Nothing -> seenWords
Just sub@(Trie ends _) ->
let currentStr' = board A.! loc : currentStr
visited' = HS.insert loc visited
seenWords' = if ends then HS.insert (reverse currentStr') seenWords else seenWords
...
And now we get our neighbors and loop through them with foldr. Observe how we define the function f that fixes the first three parameters with our new mutable values so we can cleanly call foldr.
findWords :: A.Array (Int, Int) Char -> [String] -> [String]
findWords board allWords = ...
where
...
search ::
Trie ->
HS.HashSet (Int, Int) ->
String ->
(Int, Int) ->
HS.HashSet String ->
HS.HashSet String
search trie' visited currentStr loc seenWords = case popTrie (board A.! loc) trie' of
Nothing -> seenWords
Just sub@(Trie ends _) ->
let currentStr' = board A.! loc : currentStr
visited' = HS.insert loc visited
seenWords' = if ends then HS.insert (reverse currentStr') seenWords else seenWords
ns = neighbors visited loc
f = search sub visited' currentStr'
in foldr f seenWords' ns
We’re almost done now! Having written the “inner” loop, we just have to write the “outer” loop that will loop through every location as a starting point.
findWords :: A.Array (Int, Int) Char -> [String] -> [String]
findWords board allWords = HS.toList result
where
...
trie = foldr insertTrie (Trie False M.empty) allWords
result = foldr (search trie HS.empty "") HS.empty (A.indices board)
Here is our complete Haskell solution!
findWords :: A.Array (Int, Int) Char -> [String] -> [String]
findWords board allWords = HS.toList result
where
((minRow, minCol), (maxRow, maxCol)) = A.bounds board
trie = foldr insertTrie (Trie False M.empty) allWords
neighbors :: HS.HashSet (Int, Int) -> (Int, Int) -> [(Int, Int)]
neighbors visited (r, c) =
let up = if r > minRow && not (HS.member (r - 1, c) visited) then Just (r - 1, c) else Nothing
left = if c > minCol && not (HS.member (r, c - 1) visited) then Just (r, c - 1) else Nothing
down = if r < maxRow && not (HS.member (r + 1, c) visited) then Just (r + 1, c) else Nothing
right = if c < maxCol && not (HS.member (r, c + 1) visited) then Just (r, c + 1) else Nothing
in catMaybes [up, left, down, right]
search ::
Trie ->
HS.HashSet (Int, Int) ->
String ->
(Int, Int) ->
HS.HashSet String ->
HS.HashSet String
search trie' visited currentStr loc seenWords = case popTrie (board A.! loc) trie' of
Nothing -> seenWords
Just sub@(Trie ends _) ->
let currentStr' = board A.! loc : currentStr
visited' = HS.insert loc visited
seenWords' = if ends then HS.insert (reverse currentStr') seenWords else seenWords
ns = neighbors visited loc
f = search sub visited' currentStr'
in foldr f seenWords' ns
result = foldr (search trie HS.empty "") HS.empty (A.indices board)
Conclusion
This problem brought together a lot of interesting solution components. We applied our Trie implementation from last week, and used several recurring ideas from graph search problems. Next week we’re going to switch gears a bit and start discussing dynamic programming.
To learn more about all of these problem concepts, you need to take a look at Solve.hs. It gives a fairly comprehensive look at problem solving concepts in Haskell. If you want to understand how to shape your functions to work with folds like we did in this article, you’ll learn about that in Module 1. If you want to implement and apply data structures like graphs and tries, Module 2 will teach you. And if you want practice writing and using key graph algorithms in Haskell, Module 3 will give you the experience you need!
Writing Our Own Structure: Tries in Haskell & Rust
In the last few weeks we’ve studied a few different graph problems. Graphs are interesting because they are a derived structure that we can represent in different ways to solve different problems. Today, we’ll solve a LeetCode problem that actually focuses on writing a data structure ourselves to satisfy certain requirements! Next week, we’ll use this structure to solve a problem.
If you want to improve your Haskell data structure skills, both with built-in types and in making your own types, you should check out Solve.hs, our problem solving course. Module 2 is heavily focused on data structures, so it will clear up a lot of blind spots you might have working with these in Haskell!
The Problem
Unlike our previous problems, we’re not trying to solve some peculiar question formulation with inputs and outputs. Our task today is to implement the basic functions for a Trie data structure. A Trie (pronounced “try”) is also known as a Prefix tree. They are most often used in the context of strings (though other stream-like types are also possible). We’ll make one that is effectively a container of strings that efficiently supports 3 operations:
- Insert - Add a new word into our set
- Search - Determine if we have previously inserted the given word into our tree
- Starts With - Determine if we have inserted any word that has the given input as a prefix
The first two operations are typical of tree sets and hash sets, but the third operation is distinctive for a Trie.
We’ll implement these three functions, as well as provide a means of constructing an empty Trie. We’ll work with the constraint that all our input strings consist only of lowercase English letters.
The Algorithm
If at first you pronounced “Trie” like “Tree”, you’re not really wrong. Our core implementation strategy will be to create a recursive tree structure. It’s easiest if we start by visualizing a trie. Here is a tree structure corresponding to a Trie containing the words “at”, “ate”, “an”, “bank” and “band”.
          _
         / \
        a   b
       / \   \
      t*  n*  a
      |       |
      e*      n
             / \
            k*  d*
The top node is a blank space _ representing the root node. All other nodes in our tree correspond to letters. When we trace a path from the root to any node, we get a valid prefix of a word in our Trie. A star (*) indicates that a node corresponds to a complete word. Note that interior nodes can be complete, as is the case with at.
This suggests a structure for each node in the Trie. A node should store a boolean value telling us if the node completes a word. Other than that, all it needs is a map keying from characters to other nodes further down the tree.
We can use this structure to write all our function implementations in relatively simple recursive terms.
Haskell
Recursive data structures and functions are very natural in Haskell, so we’ll start with that implementation. We’ll make our data type and provide it with the two fields: the boolean indicating the end of a word, and the map of characters to additional Trie nodes.
data Trie = Trie Bool (M.Map Char Trie)
Note that even though a node is visually represented by a particular character, we don’t actually need to store the character on the node. The fact that we arrive at a particular node by keying on a character from its parent is enough.
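To make this concrete, here is roughly what a Trie containing just “at” and “an” would look like with this representation, written out by hand (assuming Data.Map is imported qualified as M, as in the code below):
-- The root is not itself a word; neither is 'a'; 't' and 'n' both end words.
exampleTrie :: Trie
exampleTrie = Trie False (M.fromList
  [ ('a', Trie False (M.fromList
      [ ('t', Trie True M.empty)
      , ('n', Trie True M.empty)
      ]))
  ])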
Now let’s write our implementations, starting with insert. In Haskell, we don’t write functions “on” the type because we can’t mutate values in place. We write functions that take one instance of the type and return another. So our insertTrie function has this signature:
insertTrie :: String -> Trie -> Trie
We want to build a recursive structure, and we have an input (String) that breaks down recursively. This means we have two cases to deal with in this function. Either the string is empty, or it is not:
insertTrie :: String -> Trie -> Trie
insertTrie [] (Trie ends subs) = ...
insertTrie (c : cs) (Trie ends subs) = ...
If the string is empty, we don’t need to do much. We return a node with the same sub-tries, but its Bool field is True now! It’s good to remark on certain edge cases. For example, this tells us that we can insert the “empty” string into our Trie. We would just mark the “root” node as True!
insertTrie :: String -> Trie -> Trie
insertTrie [] (Trie _ subs) = Trie True subs
insertTrie (c : cs) (Trie ends subs) = ...
In the recursive case, we’ll be making a recursive call on a particular “sub” Trie. We want to “lookup” if a Trie for the character c already exists. If not, we’ll make a default one (with False and an empty sub-tries map).
insertTrie :: String -> Trie -> Trie
insertTrie [] (Trie _ subs) = Trie True subs
insertTrie (c : cs) (Trie ends subs) =
let sub = fromMaybe (Trie False M.empty) (M.lookup c subs)
...
Now we recursively insert the rest of the input into this sub-Trie:
insertTrie :: String -> Trie -> Trie
insertTrie [] (Trie _ subs) = Trie True subs
insertTrie (c : cs) (Trie ends subs) =
let sub = fromMaybe (Trie False M.empty) (M.lookup c subs)
newSub = insertTrie cs sub
...
Finally, we map this newSub into the original Trie, using c as the key for it:
insertTrie :: String -> Trie -> Trie
insertTrie [] (Trie _ subs) = Trie True subs
insertTrie (c : cs) (Trie ends subs) =
let sub = fromMaybe (Trie False M.empty) (M.lookup c subs)
newSub = insertTrie cs sub
in (Trie ends (M.insert c newSub subs))
The search and startsWith functions follow a similar pattern, pattern matching on the input string. With search, an empty string means we look at the Bool field of our Trie. If it’s True, then the word we are searching for was inserted into our Trie:
searchTrie :: String -> Trie -> Bool
searchTrie [] (Trie ends _) = ends
searchTrie (c : cs) (Trie _ subs) = ...
Otherwise, for a non-empty string, we check for the sub-Trie using the character c. If it doesn’t exist, then the word we’re looking for isn’t in our Trie. If it does, we recursively search for the “rest” of the string in that Trie:
searchTrie :: String -> Trie -> Bool
searchTrie [] (Trie ends _) = ends
searchTrie (c : cs) (Trie _ subs) = case M.lookup c subs of
Nothing -> False
Just sub -> searchTrie cs sub
Finally, startsWith is almost identical to search. The only difference is that if we reach the end of the input word, we always return True, as it only needs to be a prefix:
startsWithTrie :: String -> Trie -> Bool
startsWithTrie [] _ = True
startsWithTrie (c : cs) (Trie _ subs) = case M.lookup c subs of
Nothing -> False
Just sub -> startsWithTrie cs sub
There is one interesting case here. We always consider the empty string to be a valid prefix, even if we haven’t inserted anything into our Trie. Perhaps this doesn’t make sense to you, but LeetCode accepts this logic with our Rust solution. You could work around it, but doing so results in less clean code.
Here’s the full Haskell solution:
data Trie = Trie Bool (M.Map Char Trie)
insertTrie :: String -> Trie -> Trie
insertTrie [] (Trie _ subs) = Trie True subs
insertTrie (c : cs) (Trie ends subs) =
let sub = fromMaybe (Trie False M.empty) (M.lookup c subs)
newSub = insertTrie cs sub
in (Trie ends (M.insert c newSub subs))
searchTrie :: String -> Trie -> Bool
searchTrie [] (Trie ends _) = ends
searchTrie (c : cs) (Trie _ subs) = case M.lookup c subs of
Nothing -> False
Just sub -> searchTrie cs sub
startsWithTrie :: String -> Trie -> Bool
startsWithTrie [] _ = True
startsWithTrie (c : cs) (Trie _ subs) = case M.lookup c subs of
Nothing -> False
Just sub -> startsWithTrie cs sub
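As a quick sanity check (assuming imports of qualified Data.Map as M and fromMaybe from Data.Maybe), we might try something like this:
trieExample :: (Bool, Bool, Bool)
trieExample =
  let t = foldr insertTrie (Trie False M.empty) ["at", "ate", "bank"]
  in ( searchTrie "ate" t      -- True
     , searchTrie "ban" t      -- False ("ban" is only a prefix)
     , startsWithTrie "ban" t  -- True
     )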
Rust
The Rust solution follows the same algorithmic ideas, but the code looks quite a bit different. Rust allows mutable data structures, so building them feels a bit more like it would in a typical object-oriented language. But there are some interesting quirks! Here’s the frame LeetCode gives you to work with:
struct Trie {
...
}
impl Trie {
fn new() -> Self {
...
}
fn insert(&mut self, word: String) {
...
}
fn search(&self, word: String) -> bool {
...
}
fn starts_with(&self, prefix: String) -> bool {
...
}
}
With Rust we define the fields of the struct, and then create an impl for the type with all the relevant functions. Each “class” method takes a self parameter, somewhat like Python. This can be mutable or not. A Rust “constructor” is typically done with a new function, as you see.
Despite Rust’s trickiness with ownership and borrowing, there are no obstacles to making a recursive data type in the same manner as our Haskell implementation. Here’s our struct definition, as well as the constructor:
use std::collections::HashMap;
struct Trie {
endsWord: bool,
nodes: HashMap<char, Trie>
}
impl Trie {
fn new() -> Self {
Trie {
endsWord: false,
nodes: HashMap::new()
}
}
...
}
When it comes to the main functions though, we don’t want to make them directly recursive. Each takes a String input, and constructing a new String that pops the first character isn’t efficient. Rust is also peculiar in that you cannot index into strings, due to ambiguity arising from character encodings. So our solution will be to use the Chars iterator to efficiently “pop” characters while being able to examine the characters that come next.

So let’s start by making ...It versions of all our functions that take Chars iterators. We’ll be able to call these functions recursively. We invoke each one from the base function by calling .chars() on the input string.
use std::collections::HashMap;
use std::str::Chars;
struct Trie {
endsWord: bool,
nodes: HashMap<char, Trie>
}
impl Trie {
fn new() -> Self {
Trie {
endsWord: false,
nodes: HashMap::new()
}
}
fn insert(&mut self, word: String) {
self.insertIt(word.chars());
}
fn insertIt(&mut self, mut iter: Chars) {
...
}
fn search(&self, word: String) -> bool {
return self.searchIt(word.chars());
}
fn searchIt(&self, mut iter: Chars) -> bool {
...
}
fn starts_with(&self, prefix: String) -> bool {
return self.startsWithIt(prefix.chars());
}
fn startsWithIt(&self, mut iter: Chars) -> bool {
...
}
}
Now let’s zero in on these implementations, starting with insertIt. First we pattern match on the iterator. If it gives us None, we just set self.endsWord = true and we’re done.
impl Trie {
...
fn insertIt(&mut self, mut iter: Chars) {
if let Some(c) = iter.next() {
...
} else {
self.endsWord = true;
}
}
}
Now, just like in Haskell, if this node doesn’t have a sub-Trie for the character c yet, we insert a new Trie for c. Then we recursively call insertIt on this “subTrie”.
impl Trie {
fn insertIt(&mut self, mut iter: Chars) {
if let Some(c) = iter.next() {
if !self.nodes.contains_key(&c) {
self.nodes.insert(c, Trie::new());
}
if let Some(subTrie) = self.nodes.get_mut(&c) {
subTrie.insertIt(iter);
}
} else {
self.endsWord = true;
}
}
}
That’s it for insertion. Now for searching, we’ll follow the same pattern matching protocol. If the Chars iterator is empty, we just check whether this node has endsWord set or not:
impl Trie {
    fn searchIt(&self, mut iter: Chars) -> bool {
        if let Some(c) = iter.next() {
            ...
        } else {
            return self.endsWord;
        }
    }
}
We check again for a sub-Trie under the key c. If it doesn’t exist, we return false. If it does, we just make a recursive call!
impl Trie {
    fn searchIt(&self, mut iter: Chars) -> bool {
        if let Some(c) = iter.next() {
            if let Some(subTrie) = self.nodes.get(&c) {
                return subTrie.searchIt(iter);
            } else {
                return false;
            }
        } else {
            return self.endsWord;
        }
    }
}
And with startsWith, it’s the same pattern. We do exactly the same thing as search, except that the final else case is unambiguously true.
impl Trie {
fn startsWithIt(&self, mut iter: Chars) -> bool {
if let Some(c) = iter.next() {
if let Some(subTrie) = self.nodes.get(&c) {
return subTrie.startsWithIt(iter);
} else {
return false;
}
} else {
return true;
}
}
}
Here’s our final Rust implementation:
use std::collections::HashMap;
use std::str::Chars;
struct Trie {
endsWord: bool,
nodes: HashMap<char, Trie>
}
impl Trie {
fn new() -> Self {
Trie {
endsWord: false,
nodes: HashMap::new()
}
}
fn insert(&mut self, word: String) {
self.insertIt(word.chars());
}
fn insertIt(&mut self, mut iter: Chars) {
if let Some(c) = iter.next() {
if !self.nodes.contains_key(&c) {
self.nodes.insert(c, Trie::new());
}
if let Some(subTrie) = self.nodes.get_mut(&c) {
subTrie.insertIt(iter);
}
} else {
self.endsWord = true;
}
}
fn search(&self, word: String) -> bool {
return self.searchIt(word.chars());
}
fn searchIt(&self, mut iter: Chars) -> bool {
if let Some(c) = iter.next() {
if let Some(subTrie) = self.nodes.get(&c) {
return subTrie.searchIt(iter);
} else {
return false;
}
} else {
return self.endsWord;
}
}
fn starts_with(&self, prefix: String) -> bool {
return self.startsWithIt(prefix.chars());
}
fn startsWithIt(&self, mut iter: Chars) -> bool {
if let Some(c) = iter.next() {
if let Some(subTrie) = self.nodes.get(&c) {
return subTrie.startsWithIt(iter);
} else {
return false;
}
} else {
return true;
}
}
}
Conclusion
It’s always interesting to practice making recursive data structures in a new language. While Rust shares some things in common with Haskell, making data structures still feels more like other object-oriented languages than Haskell. Next week, we’ll put our Trie implementation to use by solving a problem that requires this data structure!
To learn more about writing your own data structures in Haskell, take our course, Solve.hs! Module 2 focuses heavily on data structures, and you’ll learn how to make some derived data structures that will improve your programs!
Topological Sort: Managing Mutable Structures in Haskell
Welcome back to our Rust vs. Haskell comparison series, featuring some of the most common LeetCode questions. We’ve done a couple graph problems the last two weeks, involving DFS and BFS.
Today we’ll do a graph problem involving a slightly more complicated algorithm. We’ll also use a couple data structures we haven’t seen in this series yet, and we’ll see how tricky it can get to have multiple mutable structures in a Haskell algorithm.
To learn all the details of managing your data structures in Haskell, check out Solve.hs, our problem solving course. You’ll learn all the key APIs, important algorithms, and you’ll get a lot of practice with LeetCode style questions!
The Problem
Today’s problem is called Course Schedule. We are given a number of courses, and a list of prerequisites among those courses. For a prerequisite pair (A,B), we cannot take Course A until we have taken Course B. Our job is to determine, in a sense, if the prerequisite list is well-defined. We want to see whether or not the list would actually allow us to take all the courses.
As an example, suppose we had these inputs:
Number Courses: 4
Prerequisites: [(2, 0), (1,0), (3,1), (3,2)]
This is a well-defined set of courses. In order to take courses 1 and 2, we must take course 0. Then in order to take course 3, we have to take courses 1 and 2. So if we follow the ordering 0->1->2->3, we can take all the courses, and we would return True.

However, if we were to add (1,3) to the list, we would not be able to take all the courses. We could take courses 0 and 2, but then we would be stuck because 1 and 3 have a mutual dependency. So we would return False with this list.
We are guaranteed that the course indices in the prerequisites list are in the range [0, numCourses - 1]. We are also guaranteed that all prerequisites are unique.
The Algorithm
For our algorithm, we will imagine these courses as living in a directed graph. If course A is a prerequisite of course B, there should be a directed edge from A to B. This problem essentially boils down to determining whether this graph has a cycle.
There are many ways to approach this, including relying on DFS or BFS as we discussed in the past two weeks! However, to introduce a new idea, we’ll solve this problem using the idea of topological sorting.
We can think of nodes as having “in degrees”. The “in degree” of a node is the number of directed edges coming into it. We are particularly concerned with nodes that have an in degree of 0. These are courses with no prerequisites, which we can take immediately.
Each time we “take” a course, we can increment a count of the courses we’ve taken, and then we can “remove” that node from the graph by decrementing the in degrees of all nodes that it is pointing to. If any of these nodes have their in degrees drop to 0 as a result of this, we can then add them to a queue of “0 degree nodes”.
If, once the queue is exhausted, we’ve taken every course, then we have proven that we can satisfy all the requirements! If not, then there must be a cycle preventing some nodes from ever having in-degree 0.
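Here is how that plays out on the earlier example (4 courses, prerequisites [(2,0), (1,0), (3,1), (3,2)]):
initial in-degrees: 0:0, 1:1, 2:1, 3:2      queue = [0]
take 0 -> decrement 1 and 2 (both reach 0)  queue = [1, 2]
take 1 -> decrement 3 (now 1)               queue = [2]
take 2 -> decrement 3 (now 0)               queue = [3]
take 3 -> nothing left to unlock            queue = []
We took all 4 courses, so the schedule is satisfiable and we return True.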
Rust Solution
We’ll start with a Rust solution. We need to manage a few different structures in this problem. The first two will be vectors giving us information about each course. We want to know the current “in degree” as well as having a list of the courses “unlocked” by each course.
Each “prerequisite” pair gives the unlocked course first, and then the prerequisite course. We’ll call these “post” and “pre”, respectively. We increase the in-degree of “post” and add “post” to the list of courses unlocked by “pre”:
pub fn can_finish(num_courses: i32, prerequisites: Vec<Vec<i32>>) -> bool {
// More convenient to use usize
let n = num_courses as usize;
let mut inDegrees = Vec::with_capacity(n);
inDegrees.resize(n, 0);
// Maps from “pre” course to “post” course
let mut unlocks: Vec<Vec<usize>> = Vec::with_capacity(n);
unlocks.resize(n, Vec::new());
for req in prerequisites {
let post = req[0] as usize;
let pre = req[1] as usize;
inDegrees[post] += 1;
unlocks[pre].push(post);
}
...
}
Now we need to make a queue of 0-degree nodes. This uses VecDeque from last time. We’ll go through the initial in-degrees list and add all the nodes that are already 0. Then we’ll set up our loop to pop the front element until empty:
pub fn can_finish(num_courses: i32, prerequisites: Vec<Vec<i32>>) -> bool {
let n = num_courses as usize;
...
// Make a queue of 0 degree
let mut queue: VecDeque<usize> = VecDeque::new();
for i in 0..(num_courses as usize) {
if inDegrees[i] == 0 {
queue.push_back(i);
}
}
let mut numSatisfied = 0;
while let Some(course) = queue.pop_front() {
...
}
return numSatisfied == num_courses;
}
All we have to do now is process the course at the front of the queue each time. We always increment the number of courses satisfied, since de-queuing a course indicates we are taking it. Then we loop through its unlocks and decrement each of their in-degrees. If reducing an in-degree takes it to 0, then we add this unlocked course to the back of the queue:
pub fn can_finish(num_courses: i32, prerequisites: Vec<Vec<i32>>) -> bool {
let n = num_courses as usize;
...
let mut numSatisfied = 0;
while let Some(course) = queue.pop_front() {
numSatisfied += 1;
for post in &unlocks[course] {
inDegrees[*post] -= 1;
if (inDegrees[*post] == 0) {
queue.push_back(*post);
}
}
}
return numSatisfied == num_courses;
}
This completes our solution! Here is the full Rust implementation:
pub fn can_finish(num_courses: i32, prerequisites: Vec<Vec<i32>>) -> bool {
let n = num_courses as usize;
// Make a vector with inDegree Count
let mut inDegrees = Vec::with_capacity(n);
inDegrees.resize(n, 0);
// Make a vector of "unlocks"
let mut unlocks: Vec<Vec<usize>> = Vec::with_capacity(n);
unlocks.resize(n, Vec::new());
for req in prerequisites {
let post = req[0] as usize;
let pre = req[1] as usize;
inDegrees[post] += 1;
unlocks[pre].push(post);
}
// Make a queue of 0 degree
let mut queue: VecDeque<usize> = VecDeque::new();
for i in 0..(num_courses as usize) {
if inDegrees[i] == 0 {
queue.push_back(i);
}
}
let mut numSatisfied = 0;
while let Some(course) = queue.pop_front() {
numSatisfied += 1;
for post in &unlocks[course] {
inDegrees[*post] -= 1;
if (inDegrees[*post] == 0) {
queue.push_back(*post);
}
}
}
return numSatisfied == num_courses;
}
Haskell Solution
In Haskell, we can follow this same approach. However, this is a somewhat challenging algorithm for Haskell beginners, because there are a lot of data structure “modifications” occurring, and expressions in Haskell are immutable! So we’ll organize our solution into three different parts:
- Initializing our structures
- Writing loop modifiers
- Writing the loop
This solution will introduce two data structures we haven’t used in this series so far: the IntMap and the Sequence (Seq), which we’ll use qualified like so:
import qualified Data.IntMap.Lazy as IM
import qualified Data.Sequence as Seq
The IntMap type works more or less exactly like a normal Map, with the same API. However, it assumes we have Int as our key type, which makes certain operations more efficient than a generic ordered map.

Then Seq is the best structure to use for a FIFO queue. We would have used it last week if we had implemented BFS from scratch.
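Here is a minimal sketch (assuming import qualified Data.Sequence as Seq) of the two Seq operations we’ll lean on: pushing onto the back with (Seq.|>) and popping the front with Seq.viewl.
queueDemo :: (Int, Seq.Seq Int)
queueDemo =
  let q = Seq.fromList [1, 2] Seq.|> 3  -- push 3 onto the back: 1, 2, 3
  in case Seq.viewl q of
       Seq.EmptyL -> error "empty queue"
       (x Seq.:< rest) -> (x, rest)     -- pops 1 from the front, leaving 2, 3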
We’ll also make a few type aliases, since we’ll be combining these structures and frequently using them in type signatures:
type DegCount = IM.IntMap Int
type CourseMaps = (DegCount, IM.IntMap [Int])
type CourseState = (Int, Seq.Seq Int, DegCount)
The setup to our problem is fairly simple. Our function takes the number of courses as an integer, and the prerequisites as a list of tuples. We’ll write a number of helper functions beneath this top level definition, but for additional clarity, we’ll show them independently as we write them.
canFinishCourses :: Int -> [(Int, Int)] -> Bool
canFinishCourses numCourses prereqs = ...
Initializing Our Structures
Recall that the first part of our Rust solution focused on populating 3 structures:
- The list of in-degrees (per node)
- The list of “unlocks” (per node)
- The initial queue of 0-degree nodes
We use IntMaps for the first two (and use the alias DegCount for the first). These are easier to modify than vectors in Haskell. The other noteworthy fact is that we want to create these together (this is why we have the CourseMaps alias combining them). We process each prerequisite pair, updating both of these maps. This means we want to write a folding function like so:
processPrereq :: (Int, Int) -> CourseMaps -> CourseMaps
For this function, we want to define two more helpers. One that will make it easier to increment the key of a degree value, and one that will make it easy to append a new unlock for the other mapping.
incKey :: Int -> DegCount -> DegCount
appendUnlock :: Int -> Int -> IM.IntMap [Int] -> IM.IntMap [Int]
These two helpers are straightforward to implement. In each case, we check for the key existing. If it doesn’t exist, we insert the default value (either 1 or a singleton list). If it exists, we either increment the value for the degree, or we append the new unlocked course to the existing list.
incKey :: Int -> DegCount -> DegCount
incKey k mp = case IM.lookup k mp of
Nothing -> IM.insert k 1 mp
Just x -> IM.insert k (x + 1) mp
appendUnlock :: Int -> Int -> IM.IntMap [Int] -> IM.IntMap [Int]
appendUnlock pre post mp = case IM.lookup pre mp of
Nothing -> IM.insert pre [post] mp
Just prev -> IM.insert pre (post : prev) mp
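As an aside, both helpers could be written more compactly with IM.insertWith, which applies its combining function to the new and old values when the key already exists. Here is an equivalent sketch:
incKey' :: Int -> DegCount -> DegCount
incKey' k = IM.insertWith (+) k 1

appendUnlock' :: Int -> Int -> IM.IntMap [Int] -> IM.IntMap [Int]
appendUnlock' pre post = IM.insertWith (++) pre [post]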
Now it’s very tidy to implement our folding function, and apply it to get these initial values:
processPrereq :: (Int, Int) -> CourseMaps -> CourseMaps
processPrereq (post, pre) (inDegrees', unlocks') =
(incKey post inDegrees', appendUnlock pre post unlocks')
Here’s where our function currently is then:
canFinishCourses :: Int -> [(Int, Int)] -> Bool
canFinishCourses numCourses prereqs = ...
where
(inDegrees, unlocks) = foldr processPrereq (IM.empty, IM.empty) prereqs
Now we want to build our initial queue as well. For this, we just want to loop through the possible course numbers, and add any that are not in the map for inDegrees (we never insert something with a value of 0).
canFinishCourses :: Int -> [(Int, Int)] -> Bool
canFinishCourses numCourses prereqs = ...
where
(inDegrees, unlocks) = foldr processPrereq (IM.empty, IM.empty) prereqs
queue = Seq.fromList
(filter (`IM.notMember` inDegrees) [0..numCourses-1])
Writing Loop Modifiers
Now we have to consider what structures are going to be part of our “loop” and how we’re going to modify them. The type alias CourseState already expresses our loop state. We want to track the number of courses satisfied so far, the queue of 0-degree nodes, and the remaining in-degree values.
The key modification is that we can reduce the in-degrees of remaining courses. When we do this, we want to know immediately if we reduced the in-degree to 0. So let’s write a function that decrements the value, except that it deletes the key entirely if it drops to 0. We’ll return a boolean indicating if the key no longer exists in the map after this process:
decKey :: Int -> DegCount -> (DegCount, Bool)
decKey key mp = case IM.lookup key mp of
Nothing -> (mp, True)
Just x -> if x <= 1
then (IM.delete key mp, True)
else (IM.insert key (x - 1) mp, False)
Now what’s the core function of the loop? When we “take” a course, we loop through its unlocks, reduce all their degrees, and track which ones are now 0. Since this is a loop that updates state (the remaining inDegrees), we want to write a folding function for it:
decDegree :: Int -> (DegCount, [Int]) -> (DegCount, [Int])
First we perform the decrement. Then if decKey returns True, we’ll add the course to our new0s list.
decDegree :: Int -> (DegCount, [Int]) -> (DegCount, [Int])
decDegree post (inDegrees', new0s) =
let (inDegrees'', removed) = decKey post inDegrees'
in (inDegrees'', if removed then (post : new0s) else new0s)
Writing the Loop
With all these helpers at our disposal, we can finally write our core loop. Recall the 3 parts of our loop state: the number of courses taken so far, the queue of 0-degree courses, and the in-degree values. This loop should just return the number of courses completed:
canFinishCourses :: Int -> [(Int, Int)] -> Bool
canFinishCourses numCourses prereqs = ...
where
(inDegrees, unlocks) = foldr processPrereq (IM.empty, IM.empty) prereqs
queue = Seq.fromList
(filter (`IM.notMember` inDegrees) [0..numCourses-1])
loop :: CourseState -> Int
loop (numSatisfied, queue', inDegrees') = ...
If the queue is empty, we just return our accumulated number. While we’re at it, the final action is to simply compare this loop result to the total number of courses to get our final result:
canFinishCourses :: Int -> [(Int, Int)] -> Bool
canFinishCourses numCourses prereqs = loop (0, queue, inDegrees) == numCourses
where
(inDegrees, unlocks) = foldr processPrereq (IM.empty, IM.empty) prereqs
queue = Seq.fromList
(filter (`IM.notMember` inDegrees) [0..numCourses-1])
loop :: CourseState -> Int
loop (numSatisfied, queue', inDegrees') = case Seq.viewl queue' of
Seq.EmptyL -> numSatisfied
(course Seq.:< rest) -> ...
When we “pop” the first course off of the queue, we first get the list of “post” courses that could now be unlocked by this course. Then we can apply our decDegree helper to get the final inDegrees'' map and the “new 0’s”.
canFinishCourses :: Int -> [(Int, Int)] -> Bool
canFinishCourses numCourses prereqs = loop (0, queue, inDegrees) == numCourses
where
(inDegrees, unlocks) = foldr processPrereq (IM.empty, IM.empty) prereqs
queue = Seq.fromList
(filter (`IM.notMember` inDegrees) [0..numCourses-1])
loop :: CourseState -> Int
loop (numSatisfied, queue', inDegrees') = case Seq.viewl queue' of
Seq.EmptyL -> numSatisfied
(course Seq.:< rest) ->
let posts = fromMaybe [] (IM.lookup course unlocks)
(inDegrees'', new0s) = foldr decDegree (inDegrees', []) posts
...
Finally, we append the new 0’s to the end of the queue, and we make our recursive call, completing the loop and the function!
canFinishCourses :: Int -> [(Int, Int)] -> Bool
canFinishCourses numCourses prereqs = loop (0, queue, inDegrees) == numCourses
where
(inDegrees, unlocks) = foldr processPrereq (IM.empty, IM.empty) prereqs
queue = Seq.fromList
(filter (`IM.notMember` inDegrees) [0..numCourses-1])
loop :: CourseState -> Int
loop (numSatisfied, queue', inDegrees') = case Seq.viewl queue' of
Seq.EmptyL -> numSatisfied
(course Seq.:< rest) ->
let posts = fromMaybe [] (IM.lookup course unlocks)
(inDegrees'', new0s) = foldr decDegree (inDegrees', []) posts
queue'' = foldl (Seq.|>) rest new0s
in loop (numSatisfied + 1, queue'', inDegrees'')
Here’s the full solution, from start to finish:
type DegCount = IM.IntMap Int
type CourseMaps = (DegCount, IM.IntMap [Int])
type CourseState = (Int, Seq.Seq Int, DegCount)
canFinishCourses :: Int -> [(Int, Int)] -> Bool
canFinishCourses numCourses prereqs = loop (0, queue, inDegrees) == numCourses
where
incKey :: Int -> DegCount -> DegCount
incKey k mp = case IM.lookup k mp of
Nothing -> IM.insert k 1 mp
Just x -> IM.insert k (x + 1) mp
appendUnlock :: Int -> Int -> IM.IntMap [Int] -> IM.IntMap [Int]
appendUnlock pre post mp = case IM.lookup pre mp of
Nothing -> IM.insert pre [post] mp
Just prev -> IM.insert pre (post : prev) mp
processPrereq :: (Int, Int) -> CourseMaps -> CourseMaps
processPrereq (post, pre) (inDegrees', unlocks') =
(incKey post inDegrees', appendUnlock pre post unlocks')
(inDegrees, unlocks) = foldr processPrereq (IM.empty, IM.empty) prereqs
queue = Seq.fromList
(filter (`IM.notMember` inDegrees) [0..numCourses-1])
decKey :: Int -> DegCount -> (DegCount, Bool)
decKey key mp = case IM.lookup key mp of
Nothing -> (mp, True)
Just x -> if x <= 1
then (IM.delete key mp, True)
else (IM.insert key (x - 1) mp, False)
decDegree :: Int -> (DegCount, [Int]) -> (DegCount, [Int])
decDegree post (inDegrees', new0s) =
let (inDegrees'', removed) = decKey post inDegrees'
in (inDegrees'', if removed then (post : new0s) else new0s)
loop :: CourseState -> Int
loop (numSatisfied, queue', inDegrees') = case Seq.viewl queue' of
Seq.EmptyL -> numSatisfied
(course Seq.:< rest) ->
let posts = fromMaybe [] (IM.lookup course unlocks)
(inDegrees'', new0s) = foldr decDegree (inDegrees', []) posts
queue'' = foldl (Seq.|>) rest new0s
in loop (numSatisfied + 1, queue'', inDegrees'')
Conclusion
This problem showed the challenge of working with multiple mutable types in Haskell loops. You have to be very diligent about tracking what pieces are mutable, and you often need to write a lot of helper functions to keep your code clean. In our course, Solve.hs, you’ll learn about writing compound data structures to help you solve problems more cleanly. A Graph is one example, and you’ll also learn about occurrence maps, which we could have used in this problem.
That’s all for graphs right now. In the next couple weeks, we’ll cover the Trie, a compound data structure that can help with some very specific problems.
Graph Algorithms in Board Games!
For last week’s problem we started learning about graph algorithms, focusing on depth-first-search. Today we’ll do a problem from an old board game that will require us to use breadth-first-search. We’ll also learn about a special library in Haskell that lets us solve these types of problems without needing to implement all the details of these algorithms.
To learn more about this library and graph algorithms in Haskell, you should check out our problem solving course, Solve.hs! Module 3 of the course focuses on algorithms, with a special emphasis on graph algorithms!
The Problem
Today’s problem comes from a kids’ board game called Snakes and Ladders, which will take a little bit to explain. First, imagine we have a square board in an N x N grid, where each cell is numbered 1 to N^2. The bottom left corner is always “1”, and numbers increase in a snake-like fashion. First they increase from left to right along the bottom row. Then they go from right to left in the next row, before reversing again. Here’s what the numbers look like for a 6x6 board:
36 35 34 33 32 31
25 26 27 28 29 30
24 23 22 21 20 19
13 14 15 16 17 18
12 11 10 9 8 7
1 2 3 4 5 6
The “goal” is to reach the highest numbered tile, which is either in the top left (for even grid sizes) or the top right (for odd grid sizes). One moves by rolling a 6-sided die. Given the number on the die, you are entitled to move that many spaces. The ordinary path of movement is following the increasing numbers.
As is, the game is a little boring. You just always want to roll the highest number you can. However, various cells on the grid are equipped with “snakes” or “ladders”, which can move you around the grid if your die roll would cause your turn to end where these items start. Ladders typically move you closer to the goal, while snakes typically move you away from it.
We can represent such a board by putting an integer on each cell. The integer -1
represents an ordinary cell, where you would simply proceed to the next cell in order. However, we can represent the start of each snake and ladder with a number corresponding to the cell number where you end up if your die roll lands you there. Here’s an example:
-1 -1 -1 -1 -1 -1
-1 -1 -1 -1 -1 -1
-1 -1 -1 -1 -1 -1
-1 35 -1 -1 13 -1
-1 -1 -1 -1 -1 -1
-1 15 -1 -1 -1 -1
This grid has two ladders. The first can take you from position 2 to position 15 (see the bottom left corner). The second can take you from position 14 to position 35. There is also a snake that will take you back from position 17 to position 13. Note that no matter the layout, you can only follow one snake or ladder on a turn. If you end your turn at the beginning of a ladder, which takes you to the beginning of another ladder, you do not take the second ladder on that turn.
Our objective is to find the smallest number of dice rolls possible to reach the goal cell (which will always have -1). In this case, the answer is 4. Various combinations of 3 rolls can land us on 14, which will take us to 35. Then rolling 1 would take us to the goal.
It is possible to contrive a board where it is impossible to reach the goal! We need to handle these cases. In these situations we must return -1. Here is such a grid, with many snakes, all leading back to the start!
1 1 -1
1 1 1
-1 1 1
The Algorithm
This is a graph search problem where each step we take carries the same weight (one turn), and we are trying to find the shortest path. This makes it a canonical example of a Breadth First Search problem (BFS).
We solve BFS by maintaining a queue of search states. In our case, the search state might consist simply of our location, though we may also want to track the number of steps we needed to reach that location as part of the state.
We’ll have a single primary loop, where we remove the first element in our queue. We’ll find all its “neighbors” (the states reachable from that node), and place these on the end of the queue. Then we’ll continue processing.
BFS works out so that states with a lower “cost” (i.e. number of turns) will all be processed before any states with higher cost. This means that the first time we dequeue a goal state from our queue, we can be sure we have found the shortest path to that goal state.
As with last week’s problem, we’ll spend a fair amount of effort on our “neighbors” function, which is often the core of a graph solution. Once we have that in place, the mechanics of the graph search generally become quite easy.
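Before we dive in, here’s a minimal sketch of this queue-based loop in Haskell (not the code we’ll end up writing below, just an illustration of the mechanics, assuming Data.Sequence for the queue and Data.Set for the visited set):
import qualified Data.Sequence as Seq
import qualified Data.Set as S

-- Generic BFS: the number of steps from the start to the first goal state, if any.
bfsSteps :: Ord s => (s -> [s]) -> (s -> Bool) -> s -> Maybe Int
bfsSteps neighbors isGoal start = go (Seq.singleton (start, 0)) (S.singleton start)
  where
    go queue visited = case Seq.viewl queue of
      Seq.EmptyL -> Nothing
      ((st, steps) Seq.:< rest)
        | isGoal st -> Just steps
        | otherwise ->
            let ns = filter (`S.notMember` visited) (neighbors st)
                visited' = foldr S.insert visited ns
                queue' = foldl (Seq.|>) rest [(n, steps + 1) | n <- ns]
            in go queue' visited'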
Rust Solution
Once again we’ll start with Rust, because we’ll use a special trick in Haskell. As stated, we want to start with our neighbors function. We’ll represent a single location just using the integer representing it on the board, not its grid coordinates. So at root, we’re taking one usize
and returning a vector of usize
values. But we’ll also take the board (a 2D vector of integers) so we can follow the snakes and ladders. Finally, we’ll pass the size of the board (just N, since our board is always square) and the “goal” location so that we don’t have to recalculate these every time:
pub fn neighbors(n: usize, goal: usize, board: &Vec<Vec<i32>>, loc: usize) -> Vec<usize> {
let mut results = Vec::new();
...
return results;
}
The basic idea of this function is that we’ll loop through the possible die rolls (1 to 6) and return the resulting location from each roll. If we find that the roll would take us past the goal, then we can safely break:
pub fn neighbors(n: usize, goal: usize, board: &Vec<Vec<i32>>, loc: usize) -> Vec<usize> {
let mut results = Vec::new();
for i in 1..=6 {
if loc + i > goal {
break;
}
...
}
return results;
}
How do we actually get the resulting location? We need to use the board, but in order to use the board, we have to convert the location into 2D coordinates. So let’s just write the frame for a function converting a location into coordinates. We’ll fill it in later:
pub fn convert(n: usize, loc: usize) -> (usize, usize) {
...
}
Assuming we have this function, the rest of our neighbors
logic is easy. We check the corresponding value for the location in board
. If it is -1
, we just use our prior location added to the die roll. Otherwise, we use the location given in the cell:
pub fn neighbors(n: usize, goal: usize, board: &Vec<Vec<i32>>, loc: usize) -> Vec<usize> {
let mut results = Vec::new();
for i in 1..=6 {
if loc + i > goal {
break;
}
let (row, col) = convert(n, loc + i);
let next = board[row][col];
if next == -1 {
results.push(loc + i);
} else {
results.push(next as usize);
}
}
return results;
}
So let’s fill in this conversion function. It’s tricky because of the snaking order of the board and because we start from the bottom (highest row index) and not the top. Nonetheless, we want to start by getting the quotient and remainder of our location with the side-length. (We subtract 1 since our locations are 1-indexed).
pub fn convert(n: usize, loc: usize) -> (usize, usize) {
let rowBase = (loc - 1) / n;
let colBase = (loc - 1) % n;
...
}
To get the final row, we simply take n - rowBase - 1
. The column is trickier. We need to consider if the row base is even or odd. If it is even, the row is going from left to right. Otherwise, it goes from right to left. In the first case, the modulo for the column gives us the right column. In the second case, we need to subtract from n
like we did with rows.
pub fn convert(n: usize, loc: usize) -> (usize, usize) {
let rowBase = (loc - 1) / n;
let colBase = (loc - 1) % n;
let row = n - rowBase - 1;
let col =
if rowBase % 2 == 0 {
colBase
} else {
n - colBase - 1
};
return (row, col);
}
But that’s all we need for conversion!
Now that our neighbors
function is closed up, we can finally write the core solution. For the Rust solution, we’ll define our “search state” as including the location and the number of steps we took to reach it, so a tuple (usize, usize)
. We’ll create a VecDeque
of these, which is Rust’s structure for a queue, and insert our initial state (location 1, count 0):
use std::collections::VecDeque;
pub fn snakes_and_ladders(board: Vec<Vec<i32>>) -> i32 {
let n = board.len();
let goal = board.len() * board[0].len();
let mut queue: VecDeque<(usize, usize)> = VecDeque::new();
queue.push_back((1,0));
...
}
We also want to track the locations we’ve already visited. This will be a hash set of the locations but not the counts. This is necessary to prevent infinite loops. Once we’ve visited a location there is no advantage to considering it again on a later branch (with this problem at least). We’ll also follow the practice of considering a cell “visited” once it is enqueued.
use std::collections::VecDeque;
use std::collections::HashSet;
pub fn snakes_and_ladders(board: Vec<Vec<i32>>) -> i32 {
let n = board.len();
let goal = board.len() * board[0].len();
let mut queue: VecDeque<(usize, usize)> = VecDeque::new();
queue.push_back((1,0));
let mut visited = HashSet::new();
visited.insert(1);
...
}
Now we’ll run a loop popping the front of the queue and finding the “neighboring” locations. If our queue is empty, this indicates no path was possible, so we return -1.
use std::collections::VecDeque;
use std::collections::HashSet;
pub fn snakes_and_ladders(board: Vec<Vec<i32>>) -> i32 {
let n = board.len();
let goal = board.len() * board[0].len();
let mut queue: VecDeque<(usize, usize)> = VecDeque::new();
queue.push_back((1,0));
let mut visited = HashSet::new();
visited.insert(1);
while let Some((idx, count)) = queue.pop_front() {
let ns = neighbors(n, goal, &board, idx);
...
}
return -1;
}
Now processing each neighbor is simple. First, if the neighbor is the goal, we’re done! Just return the dequeued count plus 1. Otherwise, check if we’ve visited the neighbor before. If not, push it to the back of the queue, along with an increased count:
pub fn snakes_and_ladders(board: Vec<Vec<i32>>) -> i32 {
let mut queue: VecDeque<(usize, usize)> = VecDeque::new();
queue.push_back((1,0));
let n = board.len();
let goal = board.len() * board[0].len();
let mut visited = HashSet::new();
visited.insert(1);
while let Some((idx, count)) = queue.pop_front() {
let ns = neighbors(n, goal, &board, idx);
for next in ns {
if next == goal {
return (count + 1) as i32;
}
if !visited.contains(&next) {
queue.push_back((next, count + 1));
visited.insert(next);
}
}
}
return -1;
}
This completes our BFS solution! Here is the complete code:
use std::collections::VecDeque;
use std::collections::HashSet;
pub fn convert(n: usize, loc: usize) -> (usize, usize) {
let rowBase = (loc - 1) / n;
let colBase = (loc - 1) % n;
let row = n - rowBase - 1;
let col =
if rowBase % 2 == 0 {
colBase
} else {
n - colBase - 1
};
return (row, col);
}
pub fn neighbors(n: usize, goal: usize, board: &Vec<Vec<i32>>, loc: usize) -> Vec<usize> {
let mut results = Vec::new();
for i in 1..=6 {
if loc + i > goal {
break;
}
let (row, col) = convert(n, loc + i);
let next = board[row][col];
if next == -1 {
results.push(loc + i);
} else {
results.push(next as usize);
}
}
return results;
}
pub fn snakes_and_ladders(board: Vec<Vec<i32>>) -> i32 {
let mut queue: VecDeque<(usize, usize)> = VecDeque::new();
queue.push_back((1,0));
let n = board.len();
let goal = board.len() * board[0].len();
let mut visited = HashSet::new();
visited.insert(1);
while let Some((idx, count)) = queue.pop_front() {
let ns = neighbors(n, goal, &board, idx);
for next in ns {
if next == goal {
return (count + 1) as i32;
}
if !visited.contains(&next) {
queue.push_back((next, count + 1));
visited.insert(next);
}
}
}
return -1;
}
Haskell Solution
For our Haskell solution, we’re going to use a special shortcut. We’ll make use of the Algorithm.Search library to handle the mechanics of the BFS for us. The function we’ll use has this type signature (slightly simplified):
bfs :: (state -> [state]) -> (state -> Bool) -> state -> Maybe [state]
We provide 3 inputs. First is the “neighbors” function, taking one state and returning its neighbors. Second is the “goal” function, telling us if a state is our final goal state. Finally we give it the initial state. If a goal is reachable, we receive a path to that goal. If not, we receive Nothing
. Since this library provides the full path for us automatically, we won’t track the number of steps in our state. Our “state” will simply be the location. So let’s begin by framing out our function:
snakesAndLadders :: A.Array (Int, Int) Int -> Int
snakesAndLadders board = ...
where
((minRow, minCol), (maxRow, _)) = A.bounds board
n = maxRow - minRow + 1
goal = n * n
convert :: Int -> (Int, Int)
neighbor :: Int -> Int
neighbors :: Int -> [Int]
Let’s start with convert
. This follows the same rules we used in our Rust solution, so there’s not much to say here. We just have to make sure we account for non-zero start indices in Haskell arrays by adding minRow
and minCol
.
snakesAndLadders :: A.Array (Int, Int) Int -> Int
snakesAndLadders board = ...
where
((minRow, minCol), (maxRow, _)) = A.bounds board
n = maxRow - minRow + 1
goal = n * n
convert :: Int -> (Int, Int)
convert loc =
let (rowBase, colBase) = (loc - 1) `quotRem` n
row = minRow + (n - rowBase - 1)
col = minCol + if even rowBase then colBase else n - colBase - 1
in (row, col)
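As a quick sanity check of this formula (just an illustration, assuming a 0-indexed array so that minRow and minCol are both 0), consider location 14 on the 6x6 board from earlier:
-- (14 - 1) `quotRem` 6 == (2, 1)
-- row = 0 + (6 - 2 - 1) = 3, and since rowBase 2 is even, col = 0 + 1 = 1.
-- Counting from the top left, cell (3, 1) is indeed where 14 sits in the 6x6 grid above.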
Now we’ll write a neighbor
helper that converts a single location. This just makes our neighbors
function a lot cleaner. We use the same logic of checking for -1
in the board, or else using the value we find there.
snakesAndLadders :: A.Array (Int, Int) Int -> Int
snakesAndLadders board = ...
where
((minRow, minCol), (maxRow, _)) = A.bounds board
n = maxRow - minRow + 1
goal = n * n
convert = ...
neighbor :: Int -> Int
neighbor loc =
let coord = convert loc
onBoard = board A.! coord
in if onBoard == -1 then loc else onBoard
Now we can write neighbors
with a simple list comprehension. We look through each roll of 1-6, add it to the current location, filter if this location is past the goal, and then calculate the neighbor.
snakesAndLadders :: A.Array (Int, Int) Int -> Int
snakesAndLadders board = ...
where
((minRow, minCol), (maxRow, _)) = A.bounds board
n = maxRow - minRow + 1
goal = n * n
convert = ...
neighbor = ...
neighbors :: Int -> [Int]
neighbors loc =
[neighbor (loc + i) | i <- [1..6], loc + i <= goal]
Now for the coup de grâce. We call bfs
with our neighbors
function. The “goal” function is just (== goal)
, and the starting state is just 1
. It will return our shortest path, and so we just return its length:
snakesAndLadders :: A.Array (Int, Int) Int -> Int
snakesAndLadders board = case bfs neighbors (== goal) 1 of
Nothing -> (-1)
Just path -> length path
where
((minRow, minCol), (maxRow, _)) = A.bounds board
n = maxRow - minRow + 1
goal = n * n
convert :: Int -> (Int, Int)
convert loc =
let (rowBase, colBase) = (loc - 1) `quotRem` n
row = minRow + (n - rowBase - 1)
col = minCol + if even rowBase then colBase else n - colBase - 1
in (row, col)
neighbor :: Int -> Int
neighbor loc =
let coord = convert loc
onBoard = board A.! coord
in if onBoard == -1 then loc else onBoard
neighbors :: Int -> [Int]
neighbors loc =
[neighbor (loc + i) | i <- [1..6], loc + i <= goal]
And that’s our complete Haskell solution!
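As a quick check (a hypothetical snippet, assuming Data.Array is imported as A and the array is 0-indexed), we can build the example board from earlier and confirm that we get 4:
exampleBoard :: A.Array (Int, Int) Int
exampleBoard = A.listArray ((0, 0), (5, 5)) $ concat
  [ [-1, -1, -1, -1, -1, -1]
  , [-1, -1, -1, -1, -1, -1]
  , [-1, -1, -1, -1, -1, -1]
  , [-1, 35, -1, -1, 13, -1]
  , [-1, -1, -1, -1, -1, -1]
  , [-1, 15, -1, -1, -1, -1]
  ]

-- snakesAndLadders exampleBoard should evaluate to 4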
Conclusion
If you take our Solve.hs course, Module 3 is your go-to for learning about graph algorithms! You’ll implement BFS from scratch in Haskell, and learn how to apply other helpers from Algorithm.Search. In next week’s article, we’ll do one more graph problem that goes beyond the basic ideas of DFS and BFS.
Starting out with Graph Algorithms: Basic DFS
For a few weeks now, we’ve been tackling problems related to data structures, with a sprinkling of algorithmic ideas in there. Last week, we covered sets and heaps. Prior to that, we considered Matrices and the binary search algorithm.
This week, we’ll cover our first graph problem! Graph problems often build on a lot of fundamental layers. You need to understand the algorithm itself. Then you need to use the right data structures to apply it. And you’ll also still need the core problem solving patterns at your disposal. These 3 areas correspond to the first 3 modules of Solve.hs, our Haskell problem solving course! Check out that course to level up your Haskell skills!
The Problem
Today’s problem is called Number of Islands. We are given a 2D array as input, where every cell is either “land” or “sea” (either represented as the characters 1
and 0
, or True
and False
). We want to find the number of distinct islands in this grid. Two “land” cells are part of the same island if we can draw a path from one cell to the other that only uses other land cells and only travels up, down, left, and right (but not diagonally).
Let’s suppose we have this example:
111000
100101
100111
110000
000110
This grid has 3 islands. The island in the top left corner comprises 7 connected cells. Then there’s a small island in the bottom right with only 2 cells. Finally, we have a third island in the middle right with 5 tiles. While it is diagonally adjacent to the first island, we do not count this as a connection.
The Algorithm
This is one of the most basic questions you’ll see that requires a graph search algorithm, like Depth-First-Search (DFS) or Breadth-First-Search (BFS). The basic principle is that we will select a starting coordinate for a search. We will use one of these algorithms to find all the land cells that are part of that cell’s island. We’ll then increment a counter for having found this island.
We need to track all the cells that are part of this island. We’ll then keep iterating for new start locations to find new islands, but we have to exclude any locations that have already been explored.
While BFS is certainly possible, the solutions we’ll write here will use DFS. Our solution will consist of 3 components:
- A “neighbors” function that finds all adjacent land tiles to a given tile.
- A “visit” function that will take a starting coordinate and populate a “visited” set with all of the cells on the same island as the starting coordinate.
- A core “loop” that will consider each coordinate as a possible starting value for an island.
This ordering represents more of a “bottom up” approach to solving the problem. Going “top down” also works, and may be easier if you’re unfamiliar with graph algorithms. But as you get more practice with them, you’ll get a feel for knowing the bottom layers you need right away.
Rust Solution
To write our solution, we’ll write the components in our bottom up order. We’ll start with our neighbors
function. This will take the island grid (LeetCode supplies us with a Vec<Vec<char>>
) and the current location. We’ll represent locations as tuples, (usize, usize)
. This function will return a vector of locations.
use std::collections::HashSet;
pub fn neighbors(
grid: &Vec<Vec<char>>,
loc: &(usize,usize)) -> Vec<(usize,usize)> {
...
}
We’ll start this function by defining a few values. We want to know the length and width of the grid, as well as defining r
and c
to quickly reference the current location.
use std::collections::HashSet;
pub fn neighbors(
grid: &Vec<Vec<char>>,
loc: &(usize,usize)) -> Vec<(usize,usize)> {
let m = grid.len();
let n = grid[0].len();
let r = loc.0;
let c = loc.1;
let mut result: Vec<(usize,usize)> = Vec::new();
...
}
Now we just have to look in each of the four directions. Each direction is included as long as it is a land tile and not out of bounds. We’ll do our “visited” checks elsewhere.
pub fn neighbors(
grid: &Vec<Vec<char>>,
loc: &(usize,usize)) -> Vec<(usize,usize)> {
let m = grid.len();
let n = grid[0].len();
let r = loc.0;
let c = loc.1;
let mut result: Vec<(usize,usize)> = Vec::new();
if (r > 0 && grid[r - 1][c] == '1') {
result.push((r - 1, c));
}
if (c > 0 && grid[r][c - 1] == '1') {
result.push((r, c - 1));
}
if (r + 1 < m && grid[r + 1][c] == '1') {
result.push((r + 1, c));
}
if (c + 1 < n && grid[r][c + 1] == '1') {
result.push((r, c + 1));
}
return result;
}
Now let’s write the visit
function. Remember, this function’s purpose is to populate the visited
set starting from a certain location. We’ll use a HashSet
of tuples for the visited set.
pub fn visit(
grid: &Vec<Vec<char>>,
visited: &mut HashSet<(usize,usize)>,
loc: &(usize,usize)) {
...
}
First, we’ll check if this location is already visited and return if so. Otherwise we’ll insert it.
pub fn visit(
grid: &Vec<Vec<char>>,
visited: &mut HashSet<(usize,usize)>,
loc: &(usize,usize)) {
if (visited.contains(loc)) {
return;
}
visited.insert(*loc);
...
}
All we have to do now is find the neighbors of this location, and recursively “visit” each one of them!
pub fn visit(
grid: &Vec<Vec<char>>,
visited: &mut HashSet<(usize,usize)>,
loc: &(usize,usize)) {
if (visited.contains(loc)) {
return;
}
visited.insert(*loc);
let ns = neighbors(grid, loc);
for n in ns {
visit(grid, visited, &n);
}
}
We’re not quite done, as we have to loop through our grid to call this function on each possible start. This isn’t so bad though. We start our function by defining key terms.
pub fn num_islands(grid: Vec<Vec<char>>) -> i32 {
let m = grid.len();
let n = grid[0].len();
let mut visited: HashSet<(usize,usize)> = HashSet::new();
let mut islandCount = 0;
...
// islandCount will be our final result
return islandCount;
}
Now we’ll “loop” through each possible starting location:
pub fn num_islands(grid: Vec<Vec<char>>) -> i32 {
let m = grid.len();
let n = grid[0].len();
let mut visited: HashSet<(usize,usize)> = HashSet::new();
let mut islandCount = 0;
for row in 0..m {
for col in 0..n {
...
}
}
return islandCount;
}
The last question is, what do we do for each location? If the location is land AND it is still unvisited, we treat it as the start of a new island. This means we increase the island count and then “visit” the location. When we consider other cells on this island, they’re already visited, so we won’t increase the island count when we find them!
pub fn num_islands(grid: Vec<Vec<char>>) -> i32 {
let m = grid.len();
let n = grid[0].len();
let mut visited: HashSet<(usize,usize)> = HashSet::new();
let mut islandCount = 0;
for row in 0..m {
for col in 0..n {
let loc: (usize,usize) = (row,col);
if grid[row][col] == '1' && !(visited.contains(&loc)) {
islandCount += 1;
visit(&grid, &mut visited, &loc);
}
}
}
return islandCount;
}
Here’s our complete solution:
use std::collections::HashSet;
pub fn neighbors(
grid: &Vec<Vec<char>>,
loc: &(usize,usize)) -> Vec<(usize,usize)> {
let m = grid.len();
let n = grid[0].len();
let r = loc.0;
let c = loc.1;
let mut result: Vec<(usize,usize)> = Vec::new();
if (r > 0 && grid[r - 1][c] == '1') {
result.push((r - 1, c));
}
if (c > 0 && grid[r][c - 1] == '1') {
result.push((r, c - 1));
}
if (r + 1 < m && grid[r + 1][c] == '1') {
result.push((r + 1, c));
}
if (c + 1 < n && grid[r][c + 1] == '1') {
result.push((r, c + 1));
}
return result;
}
pub fn visit(
grid: &Vec<Vec<char>>,
visited: &mut HashSet<(usize,usize)>,
loc: &(usize,usize)) {
if (visited.contains(loc)) {
return;
}
visited.insert(*loc);
let ns = neighbors(grid, loc);
for n in ns {
visit(grid, visited, &n);
}
}
pub fn num_islands(grid: Vec<Vec<char>>) -> i32 {
let m = grid.len();
let n = grid[0].len();
let mut visited: HashSet<(usize,usize)> = HashSet::new();
let mut islandCount = 0;
for row in 0..m {
for col in 0..n {
let loc: (usize,usize) = (row,col);
if grid[row][col] == '1' && !(visited.contains(&loc)) {
islandCount += 1;
visit(&grid, &mut visited, &loc);
}
}
}
return islandCount;
}
Haskell Solution
The structure of this solution translates well to Haskell, since it’s a recursive solution at its root. We’ll just use a couple of folds to handle the other loops. Let’s outline the solution:
numberOfIslands :: A.Array (Int, Int) Bool -> Int
numberOfIslands grid = ...
where
((minRow, minCol), (maxRow, maxCol)) = A.bounds grid
neighbors :: (Int, Int) -> [(Int, Int)]
visit :: (Int, Int) -> HS.HashSet (Int, Int) -> HS.HashSet (Int, Int)
loop :: (Int, Int) -> (Int, HS.HashSet (Int, Int)) -> (Int, HS.HashSet (Int, Int))
Since we’re writing our functions “in line”, we don’t need to pass the grid around like we did in our Rust solution (though inline functions are also possible there). What you should observe immediately is that visit
and loop
have a similar structure. They both fit into the a -> b -> b
pattern we want for foldr
! We’ll use this to great effect!
But first, let’s fill in neighbors
. Each of the 4 directions requires the same two conditions we used before. We make sure it’s not out of bounds, and that the next tile is “land”. Here’s how we check the “up” direction:
numberOfIslands :: A.Array (Int, Int) Bool -> Int
numberOfIslands grid = ...
where
((minRow, minCol), (maxRow, maxCol)) = A.bounds grid
neighbors :: (Int, Int) -> [(Int, Int)]
neighbors (row, col) =
let up = if row > minRow && grid A.! (row - 1, col) then Just (row - 1, col) else Nothing
...
We return Nothing
if it is not a valid neighbor. Then we just combine the four directional options with catMaybes
to complete this helper:
numberOfIslands :: A.Array (Int, Int) Bool -> Int
numberOfIslands grid = ...
where
((minRow, minCol), (maxRow, maxCol)) = A.bounds grid
neighbors :: (Int, Int) -> [(Int, Int)]
neighbors (row, col) =
let up = if row > minRow && grid A.! (row - 1, col) then Just (row - 1, col) else Nothing
left = if col > minCol && grid A.! (row, col - 1) then Just (row, col - 1) else Nothing
down = if row < maxRow && grid A.! (row + 1, col) then Just (row + 1, col) else Nothing
right = if col < maxCol && grid A.! (row, col + 1) then Just (row, col + 1) else Nothing
in catMaybes [up, left, down, right]
Now we start the visit
function by checking if we’ve already visited the location, and add it to the set if not:
numberOfIslands :: A.Array (Int, Int) Bool -> Int
numberOfIslands grid = ...
where
((minRow, minCol), (maxRow, maxCol)) = A.bounds grid
visit :: (Int, Int) -> HS.HashSet (Int, Int) -> HS.HashSet (Int, Int)
visit coord visited = if HS.member coord visited then visited else
let visited' = HS.insert coord visited
...
Now we have to get the neighbors and “loop” through the neighbors so that we keep the visited
set updated. This is where we’ll apply our first fold. We’ll recursively fold over visit
on each of the possible neighbors, which will give us the final visited set from this process. That’s all we need for this helper!
numberOfIslands :: A.Array (Int, Int) Bool -> Int
numberOfIslands grid = ...
where
((minRow, minCol), (maxRow, maxCol)) = A.bounds grid
visit :: (Int, Int) -> HS.HashSet (Int, Int) -> HS.HashSet (Int, Int)
visit coord visited = if HS.member coord visited then visited else
let visited' = HS.insert coord visited
ns = neighbors coord
in foldr visit visited' ns
Now our loop
function will consider only a single coordinate. We think of this as having two pieces of state. First, the number of accumulated islands (the Int
). Second, we have the visited set. So we check if the coordinate is unvisited land. If so, we increase the count, and get our “new” visited set by calling visit
on it. If not, we return the original inputs.
numberOfIslands :: A.Array (Int, Int) Bool -> Int
numberOfIslands grid = ...
where
loop :: (Int, Int) -> (Int, HS.HashSet (Int, Int)) -> (Int, HS.HashSet (Int, Int))
loop coord (count, visited) = if grid A.! coord && not (HS.member coord visited)
then (count + 1, visit coord visited)
else (count, visited)
Now for the final flourish. Our loop
also has the structure for foldr
. So we’ll loop
over all the indices of our array, which will give us the final number of islands and the visited set. Our final answer is just the fst
of these:
numberOfIslands :: A.Array (Int, Int) Bool -> Int
numberOfIslands grid = fst (foldr loop (0, HS.empty) (A.indices grid))
where
loop :: (Int, Int) -> (Int, HS.HashSet (Int, Int)) -> (Int, HS.HashSet (Int, Int))
Here’s our final solution:
numberOfIslands :: A.Array (Int, Int) Bool -> Int
numberOfIslands grid = fst (foldr loop (0, HS.empty) (A.indices grid))
where
((minRow, minCol), (maxRow, maxCol)) = A.bounds grid
neighbors :: (Int, Int) -> [(Int, Int)]
neighbors (row, col) =
let up = if row > minRow && grid A.! (row - 1, col) then Just (row - 1, col) else Nothing
left = if col > minCol && grid A.! (row, col - 1) then Just (row, col - 1) else Nothing
down = if row < maxRow && grid A.! (row + 1, col) then Just (row + 1, col) else Nothing
right = if col < maxCol && grid A.! (row, col + 1) then Just (row, col + 1) else Nothing
in catMaybes [up, left, down, right]
visit :: (Int, Int) -> HS.HashSet (Int, Int) -> HS.HashSet (Int, Int)
visit coord visited = if HS.member coord visited then visited else
let visited' = HS.insert coord visited
ns = neighbors coord
in foldr visit visited' ns
loop :: (Int, Int) -> (Int, HS.HashSet (Int, Int)) -> (Int, HS.HashSet (Int, Int))
loop coord (count, visited) = if grid A.! coord && not (HS.member coord visited)
then (count + 1, visit coord visited)
else (count, visited)
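To tie this back to the example grid from the start of the article, here’s a hypothetical snippet (assuming Data.Array is imported as A, with True marking land) that builds that grid and checks for 3 islands:
exampleGrid :: A.Array (Int, Int) Bool
exampleGrid = A.listArray ((0, 0), (4, 5)) $ map (== '1') $ concat
  [ "111000"
  , "100101"
  , "100111"
  , "110000"
  , "000110"
  ]

-- numberOfIslands exampleGrid should evaluate to 3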
A Note on the Graph Algorithm
It seems like we solved this without even really applying a “graph” algorithm! We just did a loop and a recursive call and everything worked out! There are a couple elements of this problem that make it one of the easiest graph problems out there.
Normally, we have to use some kind of structure to store a search state, telling us the next nodes in our graph to search. For BFS, this is a queue. For Dijkstra or A*, it is a heap. For DFS it is normally a stack. However, we are using the call stack to act as the stack for us!
When we make a recursive call to “visit” a location, we don’t need to keep track of which node we return to after we’re done. The function returns, and the prior node is already sitting there on the call stack.
The other simplifying factor is that we don’t need to do any backtracking or state restoration. Sometimes with a DFS, you need to “undo” some of the steps you took if you don’t find your goal. But this algorithm is just a space-filling algorithm. We are just trying to populate our “visited” set, and we never take nodes out of this set once we have visited them.
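For contrast, here’s a rough sketch of what visit could look like with an explicit stack of coordinates instead of recursion (placed in the same where clause so it can reuse the neighbors helper; this is purely illustrative, not something this problem needs):
visitIterative :: (Int, Int) -> HS.HashSet (Int, Int) -> HS.HashSet (Int, Int)
visitIterative start = go [start]
  where
    go [] visited = visited
    go (coord : stack) visited
      | HS.member coord visited = go stack visited
      | otherwise =
          -- Mark this coordinate, then push its neighbors onto the explicit stack.
          go (neighbors coord ++ stack) (HS.insert coord visited)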
Conclusion
We’ve got a couple more graph problems coming up next. If you want to learn more about applying graph algorithms in Haskell (including implementing them for yourself!) check out Solve.hs, where Module 3 will teach you about algorithms including DFS, BFS, A* and beyond!
Sets & Heaps in Haskell and Rust
In the last few articles, we’ve spent some time doing manual algorithms with binary trees. But most often, you won’t have to work with binary trees yourself, as they are built in to the operations of other data structures.
Today, we’ll solve a problem that relies on using the “set” data structure as well as the “heap” data structure. To learn more details about using these structures in Haskell, sign up for our Solve.hs course, where you’ll spend an entire module on data structures!
The Problem
Today’s problem is Find k pairs with smallest sums. Our problem input consists of two sorted arrays of integers as well as a number k
. Our job is to return the k
pairs of numbers that have the smallest sum, where a “pair” consists of one number from the first array, and one number from the second array.
Here’s an example input and output:
inputNums1 = [1,10,101,102]
inputNums2 = [7,8,9,10,11]
k = 11
output =
[ (1,7), (1,8), (1,9), (1,10), (1,11)
, (10,7), (10,8), (10,9), (10,10)
, (10,11), (101,7)
]
Observe that we are returning the numbers themselves in pairs, rather than the sums, and not the indices of the numbers.
While the (implicit) indices of each pair must be unique, the numbers do not need to be unique. For example, if we have the lists [1,1] and [5, 7]
, and k
is 2, we should return [(1,5), (1,5)]
, where both of the 1’s from the first list are paired with the 5 from the second list.
The Algorithm
At the center of our algorithm is a min-heap. We want to store elements that contain pairs of numbers. But we want those elements to be ordered by their sum. We also want each element to contain the corresponding index of each number in its array.
We’ll start by making a pair of the first element from each array, and inserting that into our heap, because this pair must be the smallest. Then we’ll do a loop where we extract the minimum from the heap, and then try adding its “neighbors” into the heap. A “neighbor” of the index pair (i1, i2)
comes from incrementing one of the two indices, so either (i1 + 1, i2)
or (i1, i2 + 1)
. We continue until we have extracted k
elements (or our heap is empty).
Each time we insert a pair of numbers into the heap, we’ll insert the pair of indices into a set. This will allow us to avoid double counting any pairs.
So using the first example in the section above, here are the first few steps of our process. Inside the heap is a 3-tuple with the sum, the indices (a 2-tuple), and the values (another 2-tuple).
Step 1:
Heap: [(8, (0,0), (1,7))]
Visited: [(0,0)]
Output: []
Step 2:
Heap: [(9, (0,1), (1,8)), (17, (1,0), (10,7))]
Visited: [(0,0), (0,1), (1,0)]
Output: [(1,7)]
Step 3:
Heap: [(10, (0,2), (1,9)), (17, (1,0), (10,7)), (18, (1,1), (10,8))]
Visited: [(0,0), (0,1), (0,2), (1,0), (1,1)]
Output: [(1,7), (1,8)]
And so we would continue this process until we gathered k
outputs.
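One detail worth noting before we write any code: Haskell tuples compare lexicographically, so putting the sum in the first slot of each heap element is exactly what makes the min-heap order elements by sum. A quick illustration (a hypothetical GHCi check, assuming Data.Heap is imported as H):
-- The element with the smaller sum comes out of the heap first.
-- ghci> let h = H.fromList [(17, (1,0), (10,7)), (9, (0,1), (1,8))] :: H.MinHeap (Int, (Int, Int), (Int, Int))
-- ghci> fmap fst (H.view h)
-- Just (9,(0,1),(1,8))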
Haskell Solution
The core structure of this problem isn’t hard. We define some initial terms, such as our data structures and outputs, and then we have a single loop. We’ll express this single loop as a recursive function. It will take the number of remaining values, the visited set, the heap, and the accumulated output, and ultimately return the output. Throughout this problem we’ll use the type alias I2
to refer to a tuple of two integers.
import qualified Data.Heap as H
import qualified Data.Set as S
import qualified Data.Vector as V
type I2 = (Int, Int)
findKPairs :: V.Vector Int -> V.Vector Int -> Int -> [I2]
findKPairs arr1 arr2 k = ...
where
n1 = V.length arr1
n2 = V.length arr2
f :: Int -> S.Set I2 -> H.MinHeap (Int, I2, I2) -> [I2] -> [I2]
f remaining visited heap acc = ...
Let’s fill in our loop. We can start with the edge cases. If the k
is 0, or if our heap is empty, we should return the results list (in reverse).
findKPairs :: V.Vector Int -> V.Vector Int -> Int -> [I2]
findKPairs arr1 arr2 k = ...
where
f :: Int -> S.Set I2 -> H.MinHeap (Int, I2, I2) -> [I2] -> [I2]
f 0 _ _ acc = reverse acc
f remaining visited heap acc = case H.view heap of
Nothing -> reverse acc
Just ((_, (i1, i2), (v1, v2)), restHeap) -> ...
Our primary case now results from extracting the min element of the heap. We don’t actually need its sum. We just put that in the first position of the tuple so that it is the primary sorting value for the heap. Let’s define the next possible coordinate pairs from adding to i1
and i2
, as well as the new sums we get from using those indices:
findKPairs :: V.Vector Int -> V.Vector Int -> Int -> [I2]
findKPairs arr1 arr2 k = ...
where
f :: Int -> S.Set I2 -> H.MinHeap (Int, I2, I2) -> [I2] -> [I2]
f 0 _ _ acc = reverse acc
f remaining visited heap acc = case H.view heap of
Nothing -> reverse acc
Just ((_, (i1, i2), (v1, v2)), restHeap) ->
let c1 = (i1 + 1, i2)
c2 = (i1, i2 + 1)
inc1 = arr1 V.! (i1 + 1) + v2
inc2 = v1 + arr2 V.! (i2 + 1)
in ...
Now we need to try adding these values to the remaining heap. If the index is too large, or if we’ve already visited the coordinate, we don’t add the new value, returning the old heap. Otherwise we insert
it into our heap. For the second value, we just use heap1
(from trying to add the first value) as the baseline.
findKPairs :: V.Vector Int -> V.Vector Int -> Int -> [I2]
findKPairs arr1 arr2 k = ...
where
f :: Int -> S.Set I2 -> H.MinHeap (Int, I2, I2) -> [I2] -> [I2]
f 0 _ _ acc = reverse acc
f remaining visited heap acc = case H.view heap of
Nothing -> reverse acc
Just ((_, (i1, i2), (v1, v2)), restHeap) ->
let c1 = (i1 + 1, i2)
c2 = (i1, i2 + 1)
inc1 = arr1 V.! (i1 + 1) + v2
inc2 = v1 + arr2 V.! (i2 + 1)
heap1 = if i1 + 1 < n1 && S.notMember c1 visited
then H.insert (inc1, c1, (arr1 V.! (i1 + 1), v2)) restHeap else restHeap
heap2 = if i2 + 1 < n2 && S.notMember c2 visited
then H.insert (inc2, c2, (v1, arr2 V.! (i2 + 1))) heap1 else heap1
in ...
Now we complete our recursive loop function, by adding these new indices to the visited
set and making a recursive call. We decrement the remaining number, and append the values to our accumulated list.
findKPairs :: V.Vector Int -> V.Vector Int -> Int -> [I2]
findKPairs arr1 arr2 k = ...
where
f :: Int -> S.Set I2 -> H.MinHeap (Int, I2, I2) -> [I2] -> [I2]
f 0 _ _ acc = reverse acc
f remaining visited heap acc = case H.view heap of
Nothing -> reverse acc
Just ((_, (i1, i2), (v1, v2)), restHeap) ->
let c1 = (i1 + 1, i2)
c2 = (i1, i2 + 1)
inc1 = arr1 V.! (i1 + 1) + v2
inc2 = v1 + arr2 V.! (i2 + 1)
heap1 = if i1 + 1 < n1 && S.notMember c1 visited
then H.insert (inc1, c1, (arr1 V.! (i1 + 1), v2)) restHeap else restHeap
heap2 = if i2 + 1 < n2 && S.notMember c2 visited
then H.insert (inc2, c2, (v1, arr2 V.! (i2 + 1))) heap1 else heap1
visited' = foldr S.insert visited ([c1, c2] :: [I2])
in f (remaining - 1) visited' heap2 ((v1,v2) : acc)
To complete the function, we define our initial heap and make the first call to our loop function:
findKPairs :: V.Vector Int -> V.Vector Int -> Int -> [I2]
findKPairs arr1 arr2 k = f k (S.singleton (0,0)) initialHeap []
where
val1 = arr1 V.! 0
val2 = arr2 V.! 0
initialHeap = H.singleton (val1 + val2, (0,0), (val1, val2))
f :: Int -> S.Set I2 -> H.MinHeap (Int, I2, I2) -> [I2] -> [I2]
f = ...
Now we’re done! Here’s our complete solution:
import qualified Data.Heap as H
import qualified Data.Set as S
import qualified Data.Vector as V
type I2 = (Int, Int)
findKPairs :: V.Vector Int -> V.Vector Int -> Int -> [I2]
findKPairs arr1 arr2 k = f k (S.singleton (0,0)) initialHeap []
where
val1 = arr1 V.! 0
val2 = arr2 V.! 0
initialHeap = H.singleton (val1 + val2, (0,0), (val1, val2))
n1 = V.length arr1
n2 = V.length arr2
f :: Int -> S.Set I2 -> H.MinHeap (Int, I2, I2) -> [I2] -> [I2]
f 0 _ _ acc = reverse acc
f remaining visited heap acc = case H.view heap of
Nothing -> reverse acc
Just ((_, (i1, i2), (v1, v2)), restHeap) ->
let c1 = (i1 + 1, i2)
c2 = (i1, i2 + 1)
inc1 = arr1 V.! (i1 + 1) + v2
inc2 = v1 + arr2 V.! (i2 + 1)
heap1 = if i1 + 1 < n1 && S.notMember c1 visited
then H.insert (inc1, c1, (arr1 V.! (i1 + 1), v2)) restHeap else restHeap
heap2 = if i2 + 1 < n2 && S.notMember c2 visited
then H.insert (inc2, c2, (v1, arr2 V.! (i2 + 1))) heap1 else heap1
visited' = foldr S.insert visited ([c1, c2] :: [I2])
in f (remaining - 1) visited' heap2 ((v1,v2) : acc)
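As a quick check (a hypothetical snippet, assuming Data.Vector is imported as V), running the example from the problem statement should reproduce the 11 pairs listed there:
examplePairs :: [I2]
examplePairs = findKPairs (V.fromList [1, 10, 101, 102]) (V.fromList [7, 8, 9, 10, 11]) 11
-- [(1,7),(1,8),(1,9),(1,10),(1,11),(10,7),(10,8),(10,9),(10,10),(10,11),(101,7)]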
Rust Solution
Now, on to our Rust solution. We’ll start by defining our terms. These follow the pattern laid out in our algorithm and the Haskell solution:
pub fn k_smallest_pairs(nums1: Vec<i32>, nums2: Vec<i32>, k: i32) -> Vec<Vec<i32>> {
let mut heap: BinaryHeap<Reverse<(i32, (usize,usize), (i32,i32))>> = BinaryHeap::new();
let val1 = nums1[0];
let val2 = nums2[0];
heap.push(Reverse((val1 + val2, (0,0), (val1, val2))));
let mut visited: HashSet<(usize, usize)> = HashSet::new();
visited.insert((0,0));
let mut results = Vec::new();
let mut remaining = k;
let n1 = nums1.len();
let n2 = nums2.len();
...
return results;
}
The most interesting of these is the heap
. We parameterize the type of the BinaryHeap
using the same kind of tuple we had in Haskell. But in order to make it a “Min” heap, we have to wrap our values in the Reverse
type.
Now let’s define the outline of our loop. We keep going until remaining
is 0. We will also break if we can’t pop
a value from the heap.
pub fn k_smallest_pairs(nums1: Vec<i32>, nums2: Vec<i32>, k: i32) -> Vec<Vec<i32>> {
...
while remaining > 0 {
if let Some(Reverse((sumNum, (i1, i2), (v1, v2)))) = heap.pop() {
...
} else {
break;
}
remaining -= 1;
}
return results;
}
Now we’ll define our new coordinates, and add tests for whether or not these can be added to the heap:
pub fn k_smallest_pairs(nums1: Vec<i32>, nums2: Vec<i32>, k: i32) -> Vec<Vec<i32>> {
...
while remaining > 0 {
if let Some(Reverse((sumNum, (i1, i2), (v1, v2)))) = heap.pop() {
let c1 = (i1 + 1, i2);
let c2 = (i1, i2 + 1);
if i1 + 1 < n1 && !visited.contains(&c1) {
let inc1 = nums1[i1 + 1] + v2;
...
}
if i2 + 1 < n2 && !visited.contains(&c2) {
let inc2 = v1 + nums2[i2 + 1];
...
}
} else {
break;
}
remaining -= 1;
}
return results;
}
In each case, we add the new coordinate to visited
, and push the new element on to the heap. We also push the values onto our results
array.
pub fn k_smallest_pairs(nums1: Vec<i32>, nums2: Vec<i32>, k: i32) -> Vec<Vec<i32>> {
...
while remaining > 0 {
if let Some(Reverse((sumNum, (i1, i2), (v1, v2)))) = heap.pop() {
let c1 = (i1 + 1, i2);
let c2 = (i1, i2 + 1);
if i1 + 1 < n1 && !visited.contains(&c1) {
let inc1 = nums1[i1 + 1] + v2;
visited.insert(c1);
heap.push(Reverse((inc1, c1, (nums1[i1 + 1], v2))));
}
if i2 + 1 < n2 && !visited.contains(&c2) {
let inc2 = v1 + nums2[i2 + 1];
visited.insert(c2);
heap.push(Reverse((inc2, c2, (v1, nums2[i2 + 1]))));
}
results.push(vec![v1,v2]);
} else {
break;
}
remaining -= 1;
}
return results;
}
And that’s all we need!
use std::collections::BinaryHeap;
use std::collections::HashSet;
use std::cmp::Reverse;
pub fn k_smallest_pairs(nums1: Vec<i32>, nums2: Vec<i32>, k: i32) -> Vec<Vec<i32>> {
let mut heap: BinaryHeap<Reverse<(i32, (usize,usize), (i32,i32))>> = BinaryHeap::new();
let val1 = nums1[0];
let val2 = nums2[0];
heap.push(Reverse((val1 + val2, (0,0), (val1, val2))));
let mut visited: HashSet<(usize, usize)> = HashSet::new();
visited.insert((0,0));
let mut results = Vec::new();
let mut remaining = k;
let n1 = nums1.len();
let n2 = nums2.len();
while remaining > 0 {
if let Some(Reverse((sumNum, (i1, i2), (v1, v2)))) = heap.pop() {
let c1 = (i1 + 1, i2);
let c2 = (i1, i2 + 1);
if i1 + 1 < n1 && !visited.contains(&c1) {
let inc1 = nums1[i1 + 1] + v2;
visited.insert(c1);
heap.push(Reverse((inc1, c1, (nums1[i1 + 1], v2))));
}
if i2 + 1 < n2 && !visited.contains(&c2) {
let inc2 = v1 + nums2[i2 + 1];
visited.insert(c2);
heap.push(Reverse((inc2, c2, (v1, nums2[i2 + 1]))));
}
results.push(vec![v1,v2]);
} else {
break;
}
remaining -= 1;
}
return results;
}
Conclusion
Next time, we’ll start exploring some graph problems, which also rely on data structures like we used here!
I didn’t explain too much in this article about the details of using these various data structures. If you want an in-depth exploration of how data structures work in Haskell, including the “common API” that helps you use almost all of them, you should sign up for our Solve.hs course! Module 2 is completely dedicated to teaching you about data structures, and you’ll get a lot of practice working with these structures in sample problems.
Binary Tree BFS: Zigzag Order
In our last article, we explored how to perform an in-order traversal of a binary search tree. Today we’ll do one final binary tree problem to solidify our understanding of some common tree patterns, as well as the tricky syntax for dealing with a binary tree in Rust.
If you want some interesting challenge problems using Haskell data structures, you should take our Solve.hs course. In particular, you’ll learn how to write a self-balancing binary tree to use for an ordered set!
The Problem
Today we will solve Zigzag Level Order Traversal. For any binary tree, we can think about it in terms of “levels” based on the number of steps from the root. So given this tree:
      45
     /  \
   32    50
  /  \     \
 5    40    100
     /  \
    37   43
We can visually see that there are 4 levels. So a normal level order traversal would return a list of 4 lists, where each list is a single level, ordered from left to right, visually speaking:
[45]
[32, 50]
[5, 40, 100]
[37, 43]
However, with a zigzag level order traversal, every other level is reversed. So we should get the following result for the input tree:
[45]
[50, 32]
[5, 40, 100]
[43, 37]
So we can imagine that we do the first level from left to right and then zigzag back to get the second level from right to left. Then we do left to right again for the third level, and so on.
The Algorithm
For our in-order traversal, we used a kind of depth-first search (DFS), and this approach is more common for tree-based problems. However, for a level-order problem, we want more of a breadth-first search (BFS). In a BFS, we explore states in order of their distance to the root. Since all nodes in a level have the same distance to the root, this makes sense.
Our general idea is that we’ll store a list of all the nodes from the prior level. Initially, this will just contain the root node. We’ll loop through this list, and create a new list of the values from the nodes in this list. This gets appended to our final result list.
While we’re doing this loop, we’ll also compose the list for the next level. The only trick is knowing whether to add each node’s left or right child to the next-level list first. This flips each iteration, so we’ll need a boolean tracking it that flips each time.
Once we encounter a level that produces no numbers (i.e. it only contains Nil
nodes), we can stop iterating and return our list of lists.
Rust Solution
Now that we’re a bit more familiar with manipulating Rc RefCells, we’ll start with the Rust solution, framing it according to the two-loop structure in our algorithm. We’ll define stack1
, which is the iteration stack, and stack2
, where we accumulate the new nodes for the next layer. We also define our final result vector, a list of lists.
pub fn zigzag_level_order(root: Option<Rc<RefCell<TreeNode>>>) -> Vec<Vec<i32>> {
let mut result: Vec<Vec<i32>> = Vec::new();
let mut stack1: Vec<Option<Rc<RefCell<TreeNode>>>> = Vec::new();
stack1.push(root.clone());
let mut stack2: Vec<Option<Rc<RefCell<TreeNode>>>> = Vec::new();
let mut leftToRight = true;
...
return result;
}
Our initial loop will continue until stack1
no longer contains any elements. So our basic condition is while (!stack1.is_empty()). However, there’s another important element here.
After we accumulate the new nodes in stack2
, we want to flip the meanings of our two stacks. We want our accumulated nodes referred to by stack1
, and stack2
to be an empty list to accumulate. We accomplish this in Rust by clearing stack1
at the end of our loop, and then using std::mem::swap
to flip their meanings:
pub fn zigzag_level_order(root: Option<Rc<RefCell<TreeNode>>>) -> Vec<Vec<i32>> {
let mut result: Vec<Vec<i32>> = Vec::new();
let mut stack1: Vec<Option<Rc<RefCell<TreeNode>>>> = Vec::new();
stack1.push(root.clone());
let mut stack2: Vec<Option<Rc<RefCell<TreeNode>>>> = Vec::new();
let mut leftToRight = true;
while (!stack1.is_empty()) {
let mut thisLayer = Vec::new(); // Values from this level
...
leftToRight = !leftToRight;
stack1.clear();
mem::swap(&mut stack1, &mut stack2);
}
return result;
}
In C++ we could accomplish something like this using std::move, but only because we want stack2 to be left in an empty state:
stack1 = std::move(stack2);
Also, observe that we flip our boolean flag at the end of the iteration.
Now let’s get to work on the inner loop. This will actually go through stack1
, add values to thisLayer
, and accumulate the next layer of nodes for stack2
. An interesting finding is that whether we’re going left to right or vice versa, we want to loop through stack1
in reverse. This means we’re treating it like a true stack instead of a vector, first accessing the last node to be added.
A left-to-right pass will add lefts and then rights. This means the right-most node in the next layer is on “top” of the stack, at the end of the vector. A right-to-left pass will first add the right child for a node before its left. This means the left-most node of the next layer is at the end of the vector.
Let’s frame up this loop, and also add the results of this layer to our final result vector.
pub fn zigzag_level_order(root: Option<Rc<RefCell<TreeNode>>>) -> Vec<Vec<i32>> {
...
while (!stack1.is_empty()) {
let mut thisLayer = Vec::new();
for node in stack1.iter().rev() {
...
}
if (!thisLayer.is_empty()) {
result.push(thisLayer);
}
leftToRight = !leftToRight;
stack1.clear();
mem::swap(&mut stack1, &mut stack2);
}
return result;
}
Note that we do not add the values array if it is empty. We allow ourselves to accumulate None
nodes in our stack. The final layer we encounter will actually consist of all None
nodes, and we don’t want this layer to add an empty list.
Now all we need to do is populate the inner loop. We only take action if the node from stack1
is Some
instead of None
. Then we follow a few simple steps:
- Borrow the TreeNode from this RefCell.
- Push its value onto thisLayer.
- Add its children (using clone) to stack2, in the right order.
Here’s the code:
pub fn zigzag_level_order(root: Option<Rc<RefCell<TreeNode>>>) -> Vec<Vec<i32>> {
...
while (!stack1.is_empty()) {
let mut thisLayer = Vec::new();
for node in stack1.iter().rev() {
if let Some(current) = node {
let currentTreeNode = current.borrow();
thisLayer.push(currentTreeNode.val);
if leftToRight {
stack2.push(currentTreeNode.left.clone());
stack2.push(currentTreeNode.right.clone());
} else {
stack2.push(currentTreeNode.right.clone());
stack2.push(currentTreeNode.left.clone());
}
}
}
...
}
return result;
}
And now we’re done! Here’s the full solution:
use std::rc::Rc;
use std::cell::RefCell;
use std::mem;
pub fn zigzag_level_order(root: Option<Rc<RefCell<TreeNode>>>) -> Vec<Vec<i32>> {
let mut result: Vec<Vec<i32>> = Vec::new();
let mut stack1: Vec<Option<Rc<RefCell<TreeNode>>>> = Vec::new();
stack1.push(root.clone());
let mut stack2: Vec<Option<Rc<RefCell<TreeNode>>>> = Vec::new();
let mut leftToRight = true;
while (!stack1.is_empty()) {
let mut thisLayer = Vec::new();
for node in stack1.iter().rev() {
if let Some(current) = node {
let currentTreeNode = current.borrow();
thisLayer.push(currentTreeNode.val);
if leftToRight {
stack2.push(currentTreeNode.left.clone());
stack2.push(currentTreeNode.right.clone());
} else {
stack2.push(currentTreeNode.right.clone());
stack2.push(currentTreeNode.left.clone());
}
}
}
if (!thisLayer.is_empty()) {
result.push(thisLayer);
}
leftToRight = !leftToRight;
stack1.clear();
mem::swap(&mut stack1, &mut stack2);
}
return result;
}
Haskell Solution
While our Rust solution was better described from the outside in, it’s easy to build the Haskell solution from the inside out. We have two loops, and we can start by defining the inner loop (we’ll call it the stack loop).
The goal of this loop is to take stack1
and turn it into stack2
(the next layer) and the numbers for this layer, while also tracking the direction of iteration. Both outputs are accumulated as lists, so we have inputs for them as well:
zigzagOrderTraversal :: TreeNode -> [[Int]]
zigzagOrderTraversal root = ...
where
stackLoop :: Bool -> [TreeNode] -> [TreeNode] -> [Int] -> ([TreeNode], [Int])
stackLoop isLeftToRight stack1 stack2 nums = ...
When stack1
is empty, we return our result from this loop. Because of list accumulation order, we reverse nums
when giving the result. However, we don’t reverse stack2
, because we want to iterate starting from the “top”. This seems like the opposite of what we did in Rust, because Rust uses a vector for its stack type, instead of a singly linked list!
zigzagOrderTraversal :: TreeNode -> [[Int]]
zigzagOrderTraversal root = ...
where
stackLoop :: Bool -> [TreeNode] -> [TreeNode] -> [Int] -> ([TreeNode], [Int])
stackLoop _ [] stack2 nums = (stack2, reverse nums)
stackLoop isLeftToRight (Nil : rest) stack2 numbers = stackLoop isLeftToRight rest stack2 numbers
stackLoop isLeftToRight (Node x left right : rest) stack2 nums = ...
Observe also a second edge case…for Nil
nodes in stack1
, we just recurse on the rest of the list. Now for the main case, we just define the new stack2
, which adds the child nodes in the correct order. Then we recurse while also adding x
to nums
.
zigzagOrderTraversal :: TreeNode -> [[Int]]
zigzagOrderTraversal root = ...
where
stackLoop :: Bool -> [TreeNode] -> [TreeNode] -> [Int] -> ([TreeNode], [Int])
stackLoop _ [] stack2 nums = (stack2, reverse nums)
stackLoop isLeftToRight (Nil : rest) stack2 numbers = stackLoop isLeftToRight rest stack2 numbers
stackLoop isLeftToRight (Node x left right : rest) stack2 nums =
let stack2' = if isLeftToRight then right : left : stack2 else left : right : stack2
in stackLoop isLeftToRight rest stack2' (x : nums)
...
Now we’ll define the outer loop, which we’ll call the layerLoop
. This takes the direction flag and stack1
, plus the accumulator list for the results. It also has a simple base case to reverse the results list once stack1
is empty.
zigzagOrderTraversal :: TreeNode -> [[Int]]
zigzagOrderTraversal root = layerLoop True [root] []
where
stackLoop :: Bool -> [TreeNode] -> [TreeNode] -> [Int] -> ([TreeNode], [Int])
stackLoop = ...
layerLoop :: Bool -> [TreeNode] -> [[Int]] -> [[Int]]
layerLoop _ [] allNums = reverse allNums
layerLoop isLeftToRight stack1 allNums = ...
Now in the recursive case, we call the stackLoop
to get our new numbers and the stack for the next layer (which we now think of as our new stack1
). We then recurse, flipping the boolean flags and adding these new numbers to our results, but only if the list is not empty.
zigzagOrderTraversal :: TreeNode -> [[Int]]
zigzagOrderTraversal root = layerLoop True [root] []
where
stackLoop :: Bool -> [TreeNode] -> [TreeNode] -> [Int] -> ([TreeNode], [Int])
stackLoop = ...
layerLoop :: Bool -> [TreeNode] -> [[Int]] -> [[Int]]
layerLoop _ [] allNums = reverse allNums
layerLoop isLeftToRight stack1 allNums =
let (stack1', newNums) = stackLoop isLeftToRight stack1 [] []
in layerLoop (not isLeftToRight) stack1' (if null newNums then allNums else newNums : allNums)
The last step, as you’ve seen, is calling layerLoop
from the start with root
. We’re done! Here’s our final implementation:
zigzagOrderTraversal :: TreeNode -> [[Int]]
zigzagOrderTraversal root = layerLoop True [root] []
where
stackLoop :: Bool -> [TreeNode] -> [TreeNode] -> [Int] -> ([TreeNode], [Int])
stackLoop _ [] stack2 nums = (stack2, reverse nums)
stackLoop isLeftToRight (Nil : rest) stack2 numbers = stackLoop isLeftToRight rest stack2 numbers
stackLoop isLeftToRight (Node x left right : rest) stack2 nums =
let stack2' = if isLeftToRight then right : left : stack2 else left : right : stack2
in stackLoop isLeftToRight rest stack2' (x : nums)
layerLoop :: Bool -> [TreeNode] -> [[Int]] -> [[Int]]
layerLoop _ [] allNums = reverse allNums
layerLoop isLeftToRight stack1 allNums =
let (stack1', newNums) = stackLoop isLeftToRight stack1 [] []
in layerLoop (not isLeftToRight) stack1' (if null newNums then allNums else newNums : allNums)
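To check this against the example tree from the start of the article, here’s a hypothetical snippet (using the TreeNode type with Nil and Node constructors from this series):
exampleTree :: TreeNode
exampleTree = Node 45
  (Node 32 (Node 5 Nil Nil) (Node 40 (Node 37 Nil Nil) (Node 43 Nil Nil)))
  (Node 50 Nil (Node 100 Nil Nil))

-- zigzagOrderTraversal exampleTree should evaluate to:
-- [[45],[50,32],[5,40,100],[43,37]]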
Conclusion
That’s all we’ll do for binary trees right now. In the coming articles we’ll continue to explore more data structures as well as some common algorithms. If you want to learn more about data structures and algorithms in Haskell, check out our course Solve.hs. Modules 2 & 3 are filled with this sort of content, including lots of practice problems.
In-Order Traversal in Haskell and Rust
Last time around, we started exploring binary trees. We began with a simple problem (inverting a tree), but encountered some of the difficulties implementing a recursive data structure in Rust.
Today we’ll do a slightly harder problem (LeetCode rates it as “Medium” instead of “Easy”). This problem also works specifically with a binary search tree instead of a simple binary tree. With a search tree, we have the property that the “values” on each node are orderable, and all the values to the “left” of any given node are no greater than that node’s value, and the values to the “right” are not smaller.
Binary search trees are the heart of any ordered Set
type. In our problem solving course Solve.hs, you’ll get the chance to build a self-balancing binary search tree from scratch, which involves some really cool algorithmic tricks!
The Problem
Though it’s harder than our previous problem, today’s problem is still straightforward. We are taking an ordered binary search tree and finding the k-th smallest element in that tree, where k
is the second input to our function.
So suppose our tree looks like this:
      45
     /  \
   32    50
  /  \     \
 5    40    100
     /  \
    37   43
Our input k
is 1-indexed. So if we get 1
as our input, we should return 5
, the smallest element in the tree. If we receive 4
, we should return 40
, the 4th smallest element after 5, 32 and 37. If we get 8
, we’ll return 100
, the largest element in the tree.
The Algorithm
Binary search trees are designed to give logarithmic time access, insertions, and deletions for elements. If our tree was annotated, so that each node stored the number of children it had, we’d be able to solve this problem in logarithmic time as well.
However, we want to assume a minimal tree design, where each node only holds its own value and pointers to its two children. With these constraints, our algorithm has to be linear in terms of the input k
.
We’re going to solve this with an in-order traversal. We’re just going to traverse the elements of the BST in order from smallest value to largest, and count until we’ve encountered k
elements. Then we’ll return the value at our “current” node.
An in-order traversal is conceptually simple. For a given node, we “visit” the left child, then visit the node itself, and then visit the right child. But the actual mechanics of doing this traversal can be a little tricky to think of on the spot if you haven’t practiced it before.
The main idea is that we’ll use a stack of nodes to track where we are in the tree. The stack traces a path from our “current” node back up its parents back to the root of the tree. Our algorithm looks like this:
First, we create a stack from the root, following all left child nodes until we reach a node without a left child. This node has the smallest element of the tree.
Now, we begin processing, always considering the top node in the stack, while tracking the number of elements remaining until we hit k
. If that number is down to 1
, then the value at the top node in the stack is our result.
If not, we’ll decrement the number of elements remaining, and then check the right child of this node. If the right child is Nil
, we just pop this node from the stack and process its parent. If the right child does exist, we’ll add all of its left children to our stack as well.
Here’s what the stack looks like with our example tree, if we’re looking for k=7
(which should be 50).
[5, 32, 45] -- k = 1, Initial left children of root
[32, 45] -- k = 2, popped 5, no right child
[37, 40, 45] -- k = 3, popped 32, right child was 40, which added left child 37
[40, 45] -- k = 4, popped 37, no right child
[43, 45] -- k = 5, popped 40, and 43 is right child
[45] -- k = 6, popped 43
[50] -- k = 7, popped 45 and added 50, the right child (no left children)
Since 50 is on top of the stack when k = 7, we can return 50.
Haskell Solution
Let’s code this up! We’ll start with Haskell, since Rust is, once again, somewhat tricky due to TreeNode
handling. To start, let’s remind ourselves of the recursive TreeNode
type:
data TreeNode = Nil | Node Int TreeNode TreeNode
deriving (Show, Eq)
Now when writing up the algorithm, we first want to define a helper function addLeftNodesToStack. We’ll use this at the beginning of the algorithm, and then again each time we encounter a right child.
This helper will take a TreeNode and the existing stack, and return the modified stack.
kthSmallest :: TreeNode -> Int -> Int
kthSmallest root' k' = ...
where
addLeftNodesToStack :: TreeNode -> [TreeNode] -> [TreeNode]
addLeftNodesToStack = ...
As far as recursive helpers go, this is a simple one! If our input node is Nil, we return the original stack. We want to maintain the invariant that we never include Nil values in our stack! But if we have a value node, we just add it to the stack and recurse on its left child.
kthSmallest :: TreeNode -> Int -> Int
kthSmallest root' k' = ...
where
addLeftNodesToStack :: TreeNode -> [TreeNode] -> [TreeNode]
addLeftNodesToStack Nil acc = acc
addLeftNodesToStack root@(Node _ left _) acc = addLeftNodesToStack left (root : acc)
Now it’s time to implement our algorithm for finding the k-th element. This will be a recursive function that takes the number of elements remaining, as well as the current stack. We’ll call this initially with k
and the stack we get from adding the left nodes of the root:
kthSmallest :: TreeNode -> Int -> Int
kthSmallest root' k' = findK k' (addLeftNodesToStack root' [])
where
addLeftNodesToStack = ...
findK :: Int -> [TreeNode] -> Int
findK = ...
This function has a couple error cases. We expect a non-empty stack (our input k is constrained within the size of the tree), and we expect the top to be non-Nil. After that, we have our base case where k = 1, and we return the value at this node.
Finally, we get our recursive case. We decrement the remaining count, pop the current node, and add its right child (along with that child’s left descendants) to the rest of the stack.
kthSmallest :: TreeNode -> Int -> Int
kthSmallest root' k' = findK k' (addLeftNodesToStack root' [])
where
addLeftNodesToStack = ...
findK :: Int -> [TreeNode] -> Int
findK k [] = error $ "Found empty list expecting k: " ++ show k
findK _ (Nil : _) = error "Added Nil to stack!"
findK 1 (Node x _ _ : _) = x
findK k (Node _ _ right : rest) = findK (k - 1) (addLeftNodesToStack right rest)
This completes our solution!
kthSmallest :: TreeNode -> Int -> Int
kthSmallest root' k' = findK k' (addLeftNodesToStack root' [])
where
addLeftNodesToStack :: TreeNode -> [TreeNode] -> [TreeNode]
addLeftNodesToStack Nil acc = acc
addLeftNodesToStack root@(Node _ left _) acc = addLeftNodesToStack left (root : acc)
findK :: Int -> [TreeNode] -> Int
findK k [] = error $ "Found empty list expecting k: " ++ show k
findK _ (Nil : _) = error "Added Nil to stack!"
findK 1 (Node x _ _ : _) = x
findK k (Node _ _ right : rest) = findK (k - 1) (addLeftNodesToStack right rest)
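As a quick sanity check, here’s the example tree from the problem statement encoded with our TreeNode type, along with the queries we walked through above (the expected values come straight from the earlier discussion):
-- The example tree, for testing kthSmallest by hand.
exampleTree :: TreeNode
exampleTree = Node 45
  (Node 32 (Node 5 Nil Nil) (Node 40 (Node 37 Nil Nil) (Node 43 Nil Nil)))
  (Node 50 Nil (Node 100 Nil Nil))

-- kthSmallest exampleTree 1 == 5
-- kthSmallest exampleTree 4 == 40
-- kthSmallest exampleTree 7 == 50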
Rust Solution
In our Rust solution, we’re once again working with this TreeNode
type, including the 3 wrapper layers:
#[derive(Debug, PartialEq, Eq)]
pub struct TreeNode {
pub val: i32,
pub left: Option<Rc<RefCell<TreeNode>>>,
pub right: Option<Rc<RefCell<TreeNode>>>,
}
Our first step will be to implement the helper function to add the “left” nodes. This function will take a “root” node as well as a mutable reference to the stack so we can add nodes to it.
fn add_left_nodes_to_stack(
node: Option<Rc<RefCell<TreeNode>>>,
stack: &mut Vec<Rc<RefCell<TreeNode>>>,
) {
...
}
You’ll notice that stack does not actually use the Option wrapper, only Rc and RefCell. Remember that in our Haskell solution we wanted to enforce the invariant that we never add Nil nodes to the stack. This Rust solution enforces that constraint at compile time, since the stack’s element type has no way to represent a null node.
To implement this function, we’ll use the same trick we did when inverting trees: pattern match on node and detect whether it is Some or None. If it is None, we don’t have to do anything.
fn add_left_nodes_to_stack(
node: Option<Rc<RefCell<TreeNode>>>,
stack: &mut Vec<Rc<RefCell<TreeNode>>>,
) {
if let Some(current) = node {
...
}
}
Since current is now unwrapped from Option, we can push it to the stack. As in our previous problem though, we have to clone it first! We need a clone of the reference (as wrapped by Rc), because the stack will now have to own this reference.
fn add_left_nodes_to_stack(
node: Option<Rc<RefCell<TreeNode>>>,
stack: &mut Vec<Rc<RefCell<TreeNode>>>,
) {
if let Some(current) = node {
stack.push(current.clone());
...
}
}
Now we’ll recurse on the left child of current. In order to unwrap the TreeNode from Rc/RefCell, we have to use borrow. Then we can grab the left value. But again, we have to clone it before we make the recursive call. Here’s the final implementation of this helper:
fn add_left_nodes_to_stack(
node: Option<Rc<RefCell<TreeNode>>>,
stack: &mut Vec<Rc<RefCell<TreeNode>>>,
) {
if let Some(current) = node {
stack.push(current.clone());
add_left_nodes_to_stack(current.borrow().left.clone(), stack);
}
}
We could have implemented the helper with a while loop instead of recursion. This would actually have used less memory in Rust! We would have to make some changes though, like introducing a mutable binding that starts at the root and walks down the left children.
Now we can move on to the core function. We’ll start this by defining key terms like our stack and the number of “remaining” values (initially k). We’ll also call our helper to get the initial stack.
pub fn kth_smallest(root: Option<Rc<RefCell<TreeNode>>>, k: i32) -> i32 {
let mut stack = Vec::new();
let mut remaining = k;
add_left_nodes_to_stack(root, &mut stack);
...
}
Now we want to pop the top element from the stack, pattern matching on the Some case. If there are no more values, we’ll actually panic, because the problem constraints should mean that our stack is never empty. Unlike our helper, we actually will use a while loop here instead of more recursion:
pub fn kth_smallest(root: Option<Rc<RefCell<TreeNode>>>, k: i32) -> i32 {
let mut stack = Vec::new();
let mut remaining = k;
add_left_nodes_to_stack(root, &mut stack);
while let Some(current) = stack.pop() {
...
}
panic!("k is larger than number of nodes");
}
Now the inside of the loop is simple, following what we’ve done in Haskell. If our remainder is 1, then we have found the correct node. We borrow the node from the RefCell and return its value. Otherwise we decrement the count and use our helper on the “right” child of the node we just popped. As usual, the RefCell wrapper means we need to borrow to get the right value from the TreeNode, and then we clone this child as we pass it to the helper.
pub fn kth_smallest(root: Option<Rc<RefCell<TreeNode>>>, k: i32) -> i32 {
let mut stack = Vec::new();
let mut remaining = k;
add_left_nodes_to_stack(root, &mut stack);
while let Some(current) = stack.pop() {
if remaining == 1 {
return current.borrow().val;
}
remaining -= 1;
add_left_nodes_to_stack(current.borrow().right.clone(), &mut stack);
}
panic!("k is larger than number of nodes");
}
And that’s it! Here’s the complete Rust solution:
fn add_left_nodes_to_stack(
node: Option<Rc<RefCell<TreeNode>>>,
stack: &mut Vec<Rc<RefCell<TreeNode>>>,
) {
if let Some(current) = node {
stack.push(current.clone());
add_left_nodes_to_stack(current.borrow().left.clone(), stack);
}
}
pub fn kth_smallest(root: Option<Rc<RefCell<TreeNode>>>, k: i32) -> i32 {
let mut stack = Vec::new();
let mut remaining = k;
add_left_nodes_to_stack(root, &mut stack);
while let Some(current) = stack.pop() {
if remaining == 1 {
return current.borrow().val;
}
remaining -= 1;
add_left_nodes_to_stack(current.borrow().right.clone(), &mut stack);
}
panic!("k is larger than number of nodes");
}
Conclusion
In-order traversal is a great pattern to commit to memory, as many different tree problems will require you to apply it. Hopefully the details with Rust’s RefCells
are getting more familiar. Next week we’ll do one more problem with binary trees.
If you want to do some deep work with Haskell and binary trees, take a look at our Solve.hs course, where you’ll learn about many different data structures in Haskell, and get the chance to write a balanced binary search tree from scratch!
An Easy Problem Made Hard: Rust & Binary Trees
In last week’s article, we completed our look at Matrix-based problems. Today, we’re going to start considering another data structure: binary trees.
Binary trees are an extremely important structure in programming. Most notably, they are the underlying structure for ordered sets, which allow logarithmic time lookups and insertions. A “tree” is represented by “nodes”, where a node can be “null”, or else hold a value. If it holds a value, it then has a “left” child and a “right” child.
If you take our Solve.hs course, you’ll actually learn to implement an auto-balancing ordered tree set from scratch!
But for these next few articles, we’re going to explore some simple problems that involve binary trees. Today we’ll start with a problem that is very simple (rated as “Easy” by LeetCode), but still helps us grasp the core problem solving techniques behind binary trees. We’ll also encounter some interesting curveballs that Rust can throw at us when it comes to building more complex data structures.
The Problem
Our problem today is Invert Binary Tree. Given a binary tree, we want to return a new tree that is the mirror image of the input tree. For example, if we get this tree as an input:
       45
      /  \
    32    50
   /  \     \
  5    40    100
      /  \
    37    43
We should output a tree that looks like this:
       45
      /  \
    50    32
   /     /  \
 100    40   5
       /  \
     43    37
We see that 45 remains the root element, but instead of having 32 on the left and 50 on the right, these two elements are reversed on the next level. Then on the 3rd level, 40 and 5 remain children of 32, but they are also reversed from their prior orientations! This pattern continues all the way down on both sides.
The Algorithm
Binary trees (and in fact, all tree structures) lend themselves very well to recursive algorithms. We’ll use a very simple recursive algorithm here.
If the input node to our function is a “null” node, we will simply return “null” as the output. If the node has a value and children, then we’ll keep that value as the value for our output. However, we will recursively invert both of the child nodes.
Then we’ll take the inverted “left” child node and install it as the “right” child of our result. The inverted “right” child of the original input becomes the “left” child of the resulting node.
Haskell Solution
Haskell is a natural fit for this problem, since it relies so heavily on recursion. We start by defining a recursive TreeNode
type. The canonical way to do this is with a Nil
constructor as well as a recursive “value” constructor that actually holds the node’s value and refers to the left and right child. For this problem, we’ll just assume our tree holds Int
values, so we won’t parameterize it.
data TreeNode = Nil | Node Int TreeNode TreeNode
deriving (Show, Eq)
Now solving our problem is easy! We pattern match on the input TreeNode. For our first case, we just return Nil for Nil.
invertTree :: TreeNode -> TreeNode
invertTree Nil = Nil
invertTree (Node x left right) = ...
For our second case, we use the same Int
value for the value of our result. Then we recursively call invertTree
on the right
child, but put this in the place of the left child for our new result node. Likewise, we recursively invert the left
child of our original and use this result for the right of our result.
invertTree :: TreeNode -> TreeNode
invertTree Nil = Nil
invertTree (Node x left right) = Node x (invertTree right) (invertTree left)
Very easy!
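For a quick check, here’s a small tree (a piece of our earlier example) and the result we’d expect from inverting it:
-- A small tree and its expected inversion.
smallTree :: TreeNode
smallTree = Node 45 (Node 32 Nil Nil) (Node 50 Nil (Node 100 Nil Nil))

-- invertTree smallTree
--   == Node 45 (Node 50 (Node 100 Nil Nil) Nil) (Node 32 Nil Nil)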
C++ Solution
In a non-functional language, it is still quite possible to solve this problem without recursion, but this is an occasion where we get very nice, clean code with recursion. As a rare treat, we’ll actually start with a C++ solution instead of jumping to Rust right away.
We would start by defining our TreeNode
with a struct
. Instead of relying on a separate Nil
constructor, we use raw pointers for all our tree nodes. This means they can all potentially be nullptr
.
struct TreeNode {
int val;
TreeNode* left;
TreeNode* right;
TreeNode(int v, TreeNode* l, TreeNode* r) : val(v), left(l), right(r) {};
};
TreeNode* invertTree(TreeNode* root) {
...
}
And our solution looks almost as easy as the Haskell solution:
TreeNode* invertTree(TreeNode* root) {
if (root == nullptr) {
return nullptr;
}
return new TreeNode(root->val, invertTree(root->right), invertTree(root->left));
}
Rust Solution
In Rust, it’s not quite as easy to work with recursive structures because of Rust’s memory system. In C++, we used raw pointers, which are fast but can cause significant problems if you aren’t careful (e.g. dereferencing null pointers, or memory leaks). Haskell uses garbage-collected memory, which adds runtime overhead but allows us to write simple code that won’t blow up in the weird ways C++ can.
Rust’s Memory System
Rust seeks to be fast like C++, while making it hard to do high-risk things like de-referencing a potentially null pointer, or leaking memory. It does this using the concept of “ownership”, and it’s a tricky concept to understand at first.
The ownership model makes it a bit harder for us to write a basic recursive data structure. To write a basic binary tree, you’d have to answer questions like:
- Who “owns” the child nodes?
- Can I write a function that accesses the child nodes without taking ownership of them? What if I have to modify them?
- Can I copy a reference to a child node without copying the entire sub-structure?
- Can I create a “new” tree that references part of another tree without copying?
Writing a TreeNode
Here’s the TreeNode
struct provided by LeetCode for solving this problem. We can see that references to the nodes themselves are held within 3(!) wrapper types:
#[derive(Debug, PartialEq, Eq)]
pub struct TreeNode {
pub val: i32,
pub left: Option<Rc<RefCell<TreeNode>>>,
pub right: Option<Rc<RefCell<TreeNode>>>,
}
impl TreeNode {
#[inline]
pub fn new(val: i32) -> Self {
TreeNode {
val,
left: None,
right: None
}
}
}
pub fn invert_tree(root: Option<Rc<RefCell<TreeNode>>>) -> Option<Rc<RefCell<TreeNode>>> {
}
From inside to outside, here’s what the three wrappers mean:
- RefCell is a mutable, shareable container for data.
- Rc is a reference counting container. It automatically tracks how many references there are to the RefCell. The cell is de-allocated once this count is 0.
- Option is Rust’s equivalent of Maybe. This lets us use None for an empty tree.
Rust normally only permits a single mutable reference, or multiple immutable references. RefCell provides a way to mutate data through shared references, with the borrow rules checked at runtime instead of compile time. Let’s see how we can use these to write our invert_tree function.
Solving the Problem
We start by “cloning” the root input reference. Normally, “clone” means a deep copy, but in our case, this doesn’t actually copy the entire tree! Because it is wrapped in Rc, we’re just getting a new reference to the data in the RefCell. We conditionally check if this is a Some wrapper. If it is None, we just return the root.
pub fn invert_tree(root: Option<Rc<RefCell<TreeNode>>>) -> Option<Rc<RefCell<TreeNode>>> {
if let Some(node) = root.clone() {
...
}
return root;
}
If we didn’t “clone” root, the compiler would complain that we are “moving” the value in the condition, which would invalidate the prior reference to root.
Next, we use borrow_mut to get a mutable reference to the TreeNode inside the RefCell. This node_ref finally gives us something of type TreeNode so that we can work with the individual fields.
pub fn invert_tree(root: Option<Rc<RefCell<TreeNode>>>) -> Option<Rc<RefCell<TreeNode>>> {
if let Some(node) = root.clone() {
let mut node_ref = node.borrow_mut();
...
}
return root;
}
Now for node_ref, both left and right have the full wrapper type Option<Rc<RefCell<TreeNode>>>. We want to recursively call invert_tree on these. Once again though, we have to call clone before passing these to the recursive function.
pub fn invert_tree(root: Option<Rc<RefCell<TreeNode>>>) -> Option<Rc<RefCell<TreeNode>>> {
if let Some(node) = root.clone() {
let mut node_ref = node.borrow_mut();
// Recursively invert left and right subtrees
let left = invert_tree(node_ref.left.clone());
let right = invert_tree(node_ref.right.clone());
...
}
return root;
}
Now because we have a mutable reference in node_ref, we can install these new results as its left and right subtrees!
pub fn invert_tree(root: Option<Rc<RefCell<TreeNode>>>) -> Option<Rc<RefCell<TreeNode>>> {
if let Some(node) = root.clone() {
let mut node_ref = node.borrow_mut();
// Recursively invert left and right subtrees
let left = invert_tree(node_ref.left.clone());
let right = invert_tree(node_ref.right.clone());
// Swap them
node_ref.left = right;
node_ref.right = left;
}
return root;
}
And now we’re done! We don’t need a separate return statement inside the if. We have modified node_ref, which is still a reference to the same data that root holds. So returning root returns our modified tree.
Conclusion
Even though this was a simple problem with a basic recursive algorithm, we saw how Rust presented some interesting difficulties in applying this algorithm. Languages all make different tradeoffs, so every language has some example where it is difficult to write code that is simple in other languages. For Rust, this is recursive data structures. For Haskell though, it’s things like mutable arrays.
If you want to get some serious practice with binary trees, you should sign up for our problem solving course, Solve.hs. In Module 2, you’ll actually get to implement a balanced tree set from scratch, which is a very interesting and challenging problem that will stretch your knowledge!
Spiral Matrix: Another Matrix Layer Problem
In last week’s article, we learned how to rotate a 2D Matrix in place using Haskell’s mutable array mechanics. This taught us how to think about a Matrix in terms of layers, starting from the outside and moving in towards the center.
Today, we’ll study one more 2D Matrix problem that uses this layer-by-layer paradigm. For more practice dealing with multi-dimensional arrays, check out our Solve.hs course! In Module 2, you’ll study all kinds of different data structures in Haskell, including 2D Matrices (both mutable and immutable).
The Problem
Today’s problem is Spiral Matrix. In this problem, we receive a 2D Matrix, and we would like to return the elements of that matrix in a 1D list in “spiral order”. This ordering consists of starting from the top left and going right. When we hit the top right corner, we move down to the bottom. Then we come back across the bottom row to the left, and then back up toward the top left. Then we continue this process on inner layers.
So, for example, let’s suppose we have this 4x4 matrix:
1 2 3 4
5 6 7 8
9 10 11 12
13 14 15 16
This should return the following list:
[1,2,3,4,8,12,16,15,14,13,9,5,6,7,11,10]
At first glance, it seems like a lot of our layer-by-layer mechanics from last week will work again. All the numbers in the “first” layer come first, followed by the “second” layer, and so on. The trick though is that for this problem, we have to handle non-square matrices. So we can also have this matrix:
1 2 3 4
5 6 7 8
9 10 11 12
This should yield the list [1,2,3,4,8,12,11,10,9,5,6,7]. This isn’t a huge challenge, but we need a slightly different approach.
The Algorithm
We still want to generally move through the Matrix using a layer-by-layer approach. But instead of tracking the 4 corner points, we’ll just keep track of 4 “barriers”, imaginary lines dictating the “end” of each dimension (up/down/left/right) for us to scan. These barriers will be inclusive, meaning that they refer to the last valid row or column in that direction. We would call these “min row”, “min column”, “max row” and “max column”.
Now the general process for going through a layer will consist of 4 steps. Each step starts in a corner location and proceeds in one direction until the next corner is reached. Then, we can start again with the next layer.
The trick is the end condition. Because we can have rectangular matrices, the final layer can have a shape like 1 x n or n x 1, and this is a problem, because we wouldn’t need 4 steps. Even a square n x n matrix with odd n would have a 1x1 as its final layer, and this is also a problem since it is unclear which “corner” this coordinate belongs to.
Thus we have to handle these edge cases. However, they are easy to both detect and resolve. We know we are in such a case when “min row” and “max row” are equal, or if “min column” and “max column” are equal. Then to resolve the case, we just do one pass instead of 4, including both endpoints.
Rust Solution
For our Rust solution, let’s start by defining important terms, like we always do. For our terms, we’ll mainly be dealing with these 4 “barrier” values, the min and max for the current row and column. These are inclusive, so they are initially 0 and (length - 1). We also make a new vector to hold our result values.
pub fn spiral_order(matrix: Vec<Vec<i32>>) -> Vec<i32> {
let mut result: Vec<i32> = Vec::new();
let mut minR: usize = 0;
let mut maxR: usize = matrix.len() - 1;
let mut minC: usize = 0;
let mut maxC: usize = matrix[0].len() - 1;
...
}
Now we want to write a while
loop where each iteration processes a single layer. We’ll know we are out of layers if either “minimum” exceeds its corresponding “maximum”. Then we can start penciling in the different cases and phases of the loop. The edge cases occur when a minimum is exactly equal to its maximum. And for the normal case, we’ll do our 4-directional scanning.
pub fn spiral_order(matrix: Vec<Vec<i32>>) -> Vec<i32> {
let mut result: Vec<i32> = Vec::new();
let mut minR: usize = 0;
let mut maxR: usize = matrix.len() - 1;
let mut minC: usize = 0;
let mut maxC: usize = matrix[0].len() - 1;
while (minR <= maxR && minC <= maxC) {
// Edge cases: single row or single column layers
if (minR == maxR) {
...
break;
} else if (minC == maxC) {
...
break;
}
// Scan TL->TR
...
// Scan TR->BR
...
// Scan BR->BL
...
// Scan BL->TL
...
minR += 1;
minC += 1;
maxR -= 1;
maxC -= 1;
}
return result;
}
Our “loop update” step comes at the end, when we increase both minimums, and decrease both maximums. This shows we are shrinking to the next layer.
Now we just have to fill in each case. All of these are scans through some portion of the matrix. The only trick is getting the ranges correct for each scan.
We’ll start with the edge cases. For a single row or column scan, we just need one loop. This loop should be inclusive across its dimension. Rust has a similar range syntax to Haskell, but it is less flexible. We can make a range inclusive by using =
before the end element.
pub fn spiral_order(matrix: Vec<Vec<i32>>) -> Vec<i32> {
...
while (minR <= maxR && minC <= maxC) {
// Edge cases: single row or single column layers
if (minR == maxR) {
for i in minC..=maxC {
result.push(matrix[minR][i]);
}
break;
} else if (minC == maxC) {
for i in minR..=maxR {
result.push(matrix[i][minC]);
}
break;
}
...
}
return result;
}
Now let’s fill in the other cases. Again, getting the right ranges is the most important factor. We also have to make sure we don’t mix up our dimensions or directions! We go right along minR, down along maxC, left along maxR, and then up along minC.
To represent a decreasing range, we have to make the corresponding incrementing range and then use .rev() to reverse it. This is a little inconvenient, giving us ranges that don’t look as nice, like for i in ((minC+1)..=maxC).rev(), because we want the decrementing range to include maxC but exclude minC.
pub fn spiral_order(matrix: Vec<Vec<i32>>) -> Vec<i32> {
...
while (minR <= maxR && minC <= maxC) {
...
// Scan TL->TR
for i in minC..maxC {
result.push(matrix[minR][i]);
}
// Scan TR->BR
for i in minR..maxR {
result.push(matrix[i][maxC]);
}
// Scan BR->BL
for i in ((minC+1)..=maxC).rev() {
result.push(matrix[maxR][i]);
}
// Scan BL->TL
for i in ((minR+1)..=maxR).rev() {
result.push(matrix[i][minC]);
}
minR += 1;
minC += 1;
maxR -= 1;
maxC -= 1;
}
return result;
}
But once these cases are filled in, we’re done! Here’s the full solution:
pub fn spiral_order(matrix: Vec<Vec<i32>>) -> Vec<i32> {
let mut result: Vec<i32> = Vec::new();
let mut minR: usize = 0;
let mut maxR: usize = matrix.len() - 1;
let mut minC: usize = 0;
let mut maxC: usize = matrix[0].len() - 1;
while (minR <= maxR && minC <= maxC) {
// Edge cases: single row or single column layers
if (minR == maxR) {
for i in minC..=maxC {
result.push(matrix[minR][i]);
}
break;
} else if (minC == maxC) {
for i in minR..=maxR {
result.push(matrix[i][minC]);
}
break;
}
// Scan TL->TR
for i in minC..maxC {
result.push(matrix[minR][i]);
}
// Scan TR->BR
for i in minR..maxR {
result.push(matrix[i][maxC]);
}
// Scan BR->BL
for i in ((minC+1)..=maxC).rev() {
result.push(matrix[maxR][i]);
}
// Scan BL->TL
for i in ((minR+1)..=maxR).rev() {
result.push(matrix[i][minC]);
}
minR += 1;
minC += 1;
maxR -= 1;
maxC -= 1;
}
return result;
}
Haskell Solution
Now let’s write our Haskell solution. We don’t need any fancy mutation tricks here. Our function will just take a 2D array, and return a list of numbers.
spiralMatrix :: A.Array (Int, Int) Int -> [Int]
spiralMatrix arr = ...
where
((minR', minC'), (maxR', maxC')) = A.bounds arr
Since we used a while loop in our Rust solution, it makes sense that we’ll want to use a raw recursive function that we’ll just call f. Our loop state was the 4 “barrier” values, one for each direction. We’ll also use an accumulator value for our result. Since our barriers are inclusive, we can simply use the bounds of our array for the initial values.
spiralMatrix :: A.Array (Int, Int) Int -> [Int]
spiralMatrix arr = f minR' minC' maxR' maxC' []
where
((minR', minC'), (maxR', maxC')) = A.bounds arr
f :: Int -> Int -> Int -> Int -> [Int] -> [Int]
f = undefined
This recursive function has 3 base cases. First, we have the “loop condition” we used in our Rust solution. If a min dimension value exceeds the max, we are done, and should return our accumulated result list.
The other two cases are our edge cases of having a single row or a single column for our final layer. In all these cases, we want to reverse the accumulated list. This means that when we put together our ranges, we want to be careful that they are in reverse order! So the edge cases should start at their max value and decrease to the min value (inclusive).
spiralMatrix :: A.Array (Int, Int) Int -> [Int]
spiralMatrix arr = f minR' minC' maxR' maxC' []
where
((minR', minC'), (maxR', maxC')) = A.bounds arr
f :: Int -> Int -> Int -> Int -> [Int] -> [Int]
f minR minC maxR maxC acc
| minR > maxR || minC > maxC = reverse acc
| minR == maxR = reverse $ [arr A.! (minR, c) | c <- [maxC,maxC - 1..minC]] <> acc
| minC == maxC = reverse $ [arr A.! (r, minC) | r <- [maxR,maxR - 1..minR]] <> acc
| otherwise = ...
Now to fill in the otherwise
case, we can do our 4 steps: going right from the top left, then going down from the top right, going left from the bottom right, and going up from the bottom left.
Like the edge cases, we make list comprehensions with ranges to pull the new numbers out of our input matrix. And again, we have to make sure we accumulate them in reverse order. Then we append all of them to the existing accumulation.
spiralMatrix :: A.Array (Int, Int) Int -> [Int]
spiralMatrix arr = f minR' minC' maxR' maxC' []
where
((minR', minC'), (maxR', maxC')) = A.bounds arr
f :: Int -> Int -> Int -> Int -> [Int] -> [Int]
f minR minC maxR maxC acc
...
| otherwise =
let goRights = [arr A.! (minR, c) | c <- [maxC - 1, maxC - 2..minC]]
goDowns = [arr A.! (r, maxC) | r <- [maxR - 1, maxR - 2..minR]]
goLefts = [arr A.! (maxR, c) | c <- [minC + 1..maxC]]
goUps = [arr A.! (r, minC) | r <- [minR+1..maxR]]
acc' = goUps <> goLefts <> goDowns <> goRights <> acc
in f (minR + 1) (minC + 1) (maxR - 1) (maxC - 1) acc'
We conclude by making our recursive call with the updated result list, and shifting the barriers to get to the next layer.
Here’s the full implementation:
spiralMatrix :: A.Array (Int, Int) Int -> [Int]
spiralMatrix arr = f minR' minC' maxR' maxC' []
where
((minR', minC'), (maxR', maxC')) = A.bounds arr
f :: Int -> Int -> Int -> Int -> [Int] -> [Int]
f minR minC maxR maxC acc
| minR > maxR || minC > maxC = reverse acc
| minR == maxR = reverse $ [arr A.! (minR, c) | c <- [maxC,maxC - 1..minC]] <> acc
| minC == maxC = reverse $ [arr A.! (r, minC) | r <- [maxR,maxR - 1..minR]] <> acc
| otherwise =
let goRights = [arr A.! (minR, c) | c <- [maxC - 1, maxC - 2..minC]]
goDowns = [arr A.! (r, maxC) | r <- [maxR - 1, maxR - 2..minR]]
goLefts = [arr A.! (maxR, c) | c <- [minC + 1..maxC]]
goUps = [arr A.! (r, minC) | r <- [minR+1..maxR]]
acc' = goUps <> goLefts <> goDowns <> goRights <> acc
in f (minR + 1) (minC + 1) (maxR - 1) (maxC - 1) acc'
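Here’s a quick usage sketch with the 3x4 example from earlier, assuming Data.Array is imported qualified as A (as in our other array solutions):
-- Build the 3x4 example matrix and spiral through it.
example :: [Int]
example = spiralMatrix (A.listArray ((0, 0), (2, 3)) [1..12])
-- Expected: [1,2,3,4,8,12,11,10,9,5,6,7]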
Conclusion
This is the last matrix-based problem we’ll study for now. Next time we’ll start considering some tree-based problems. If you sign up for our Solve.hs course, you’ll learn about both of these kinds of data structures in Module 2. You’ll implement a tree set from scratch, and you’ll get lots of practice working with these and many other structures. So enroll today!
Image Rotation: Mutable Arrays in Haskell
In last week’s article, we took our first step into working with multi-dimensional arrays. Today, we’ll be working with another Matrix problem that involves in-place mutation. The Haskell solution uses the MArray interface, which takes us out of our usual immutable style of code.
The MArray interface is a little tricky to work with. If you want a full overview of the API, you should sign up for our Solve.hs course, where we cover mutable arrays in module 2!
The Problem
Today’s problem is Rotate Image. We’re going to take a 2D Matrix of integer values as our input and rotate the matrix 90 degrees clockwise. We must accomplish this in place, modifying the input value without allocating a new Matrix. The input matrix is always “square” (n x n).
Here are a few examples to illustrate the idea. We can start with a 2x2 matrix:
1 2 | 3 1
3 4 | 4 2
The 4x4 rotation makes it more clear that we’re not just moving numbers one space over. Each corner element will go to a new corner. You can also see how the inside of the matrix is also rotating:
1 2 3 4 | 13 9 5 1
5 6 7 8 | 14 10 6 2
9 10 11 12 | 15 11 7 3
13 14 15 16 | 16 12 8 4
The 3x3 version shows how, with an odd number of rows and columns, the innermost number stands still.
1 2 3 | 7 4 1
4 5 6 | 8 5 2
7 8 9 | 9 6 3
The Algorithm
While this problem might be a little intimidating at first, we just have to break it into sufficiently small and repeatable pieces. The core step is that we swap four numbers into each other’s positions. It’s easy to see, for example, that the four corners always trade places with one another (1, 4, 13, 16 in the 4x4 example).
What’s important is seeing the other sets of 4. We move clockwise to get the next 4 values:
- The value to the right of the top left corner
- The value below the top right corner
- The value to the left of the bottom right corner
- The value above the bottom left corner.
So in the 4x4 example, these would be 2, 8, 15, 9. Then another group is 3, 12, 14, 5.
Those 3 groups are all the rotations we need for the “outer layer”. Then we move to the next layer, where we have a single group of 4: 6, 7, 10, 11.
This should tell us that we have a 3-step process:
- Loop through each layer of the matrix
- Identify all groups of 4 in this layer
- Rotate each group of 4
It helps to put a count on the size of each of these loops. For an n x n matrix, the number of layers to rotate is n / 2, rounded down, because the innermost layer needs no rotation in an odd-sized matrix.
Then for a layer spanning from column c1 to c2, the number of groups in that layer is just c2 - c1. So for the first layer in a 4x4, we span columns 0 to 3, and there are 3 groups of 4. In the inner layer, we span columns 1 to 2, so there is only 1 group of 4.
Rust Solution
As is typical, we’ll see more of a loop structure in our Rust code, and a recursive version of this solution in Haskell. We’ll also start by defining various terms we’ll use. There are multiple ways to approach the details of this problem, but we’ll take an approach that maximizes the clarity of our inner loops.
We’ll define each “layer” using the four corner coordinates of that layer. So for an n x n matrix, these are (0,0), (0, n - 1), (n - 1, n - 1), (n - 1, 0)
. After we finish looping through a layer, we can simply increment/decrement each of these values as appropriate to get the corner coordinates of the next layer ((1,1), (1, n - 2)
, etc.).
So let’s start our solution by defining the 8 mutable values for these 4 corners. Each corner (top left, top right, bottom right, bottom left) has a row R and a column C value.
pub fn rotate(matrix: &mut Vec<Vec<i32>>) {
let n = matrix.len();
let numLayers = n / 2;
let mut topLeftR = 0;
let mut topLeftC = 0;
let mut topRightR = 0;
let mut topRightC = n - 1;
let mut bottomRightR = n - 1;
let mut bottomRightC = n - 1;
let mut bottomLeftR = n - 1;
let mut bottomLeftC = 0;
...
}
It would be possible to solve the problem without these values, determining coordinates using the layer number. But I’ve found this to be somewhat more error prone, since we’re constantly adding and subtracting from different coordinates in different combinations. We get the number of layers from n / 2.
Now let’s frame the outer loop. We conclude the loop by modifying each coordinate point. Then at the beginning of the loop, we can determine the number of “groups” for the layer by taking the difference between the left and right column coordinates.
pub fn rotate(matrix: &mut Vec<Vec<i32>>) {
...
for i in 0..numLayers {
let numGroups = topRightC - topLeftC;
for j in 0..numGroups {
...
}
topLeftR += 1;
topLeftC += 1;
topRightR += 1;
topRightC -= 1;
bottomRightR -= 1;
bottomRightC -= 1;
bottomLeftR -= 1;
bottomLeftC += 1;
}
}
Now we just need the logic for rotating a single group of 4 points. This is a 5-step process:
- Save top left value as temp
- Move bottom left to top left
- Move bottom right to bottom left
- Move top right to bottom right
- Move temp (original top left) to top right
Unlike the layer number, we’ll use the group variable j
for arithmetic here. When you’re writing this yourself, it’s important to go slowly to make sure you’re using the right corner values and adding/subtracting j
from the correct dimension.
pub fn rotate(matrix: &mut Vec<Vec<i32>>) {
...
for i in 0..numLayers {
let numGroups = topRightC - topLeftC;
for j in 0..numGroups {
let temp = matrix[topLeftR][topLeftC + j];
matrix[topLeftR][topLeftC + j] = matrix[bottomLeftR - j][bottomLeftC];
matrix[bottomLeftR - j][bottomLeftC] = matrix[bottomRightR][bottomRightC - j];
matrix[bottomRightR][bottomRightC - j] = matrix[topRightR + j][topRightC];
matrix[topRightR + j][topRightC] = temp;
}
... // (update corners)
}
}
And then we’re done! We don’t actually need to return a value since we’re just modifying the input in place. Here’s the full solution:
pub fn rotate(matrix: &mut Vec<Vec<i32>>) {
let n = matrix.len();
let numLayers = n / 2;
let mut topLeftR = 0;
let mut topLeftC = 0;
let mut topRightR = 0;
let mut topRightC = n - 1;
let mut bottomRightR = n - 1;
let mut bottomRightC = n - 1;
let mut bottomLeftR = n - 1;
let mut bottomLeftC = 0;
for i in 0..numLayers {
let numGroups = topRightC - topLeftC;
for j in 0..numGroups {
let temp = matrix[topLeftR][topLeftC + j];
matrix[topLeftR][topLeftC + j] = matrix[bottomLeftR - j][bottomLeftC];
matrix[bottomLeftR - j][bottomLeftC] = matrix[bottomRightR][bottomRightC - j];
matrix[bottomRightR][bottomRightC - j] = matrix[topRightR + j][topRightC];
matrix[topRightR + j][topRightC] = temp;
}
topLeftR += 1;
topLeftC += 1;
topRightR += 1;
topRightC -= 1;
bottomRightR -= 1;
bottomRightC -= 1;
bottomLeftR -= 1;
bottomLeftC += 1;
}
}
Haskell Solution
This is an interesting problem to solve in Haskell because Haskell is a generally immutable language. Unlike Rust, we can’t make values mutable just by putting the keyword mut in front of them.
With arrays, though, we can modify them in place using the MArray type class. We won’t go through all the details of the interface in this article (you can learn about all that in Solve.hs Module 2). But we’ll start with the type signature:
rotateImage :: (MArray array Int m) => array (Int, Int) Int -> m ()
This tells us we are taking a mutable array, where the array type is polymorphic but tied to the monad m. For example, IOArray would work with the IO monad. We don’t return anything, because we’re modifying our input.
We still begin our function by defining terms, but now we need to use monadic actions to retrieve even the bounds of our array.
rotateImage :: (MArray array Int m) => array (Int, Int) Int -> m ()
rotateImage arr = do
((minR, minC), (maxR, maxC)) <- getBounds arr
let n = maxR - minR + 1
let numLayers = n `quot` 2
...
Our algorithm has two loop levels. The outer loop goes through the different layers of the matrix. The inner loop goes through each group of 4 within the layer. In Haskell, both of these loops are recursive, monadic functions. Our Rust loops treat the four corner points of the layer as stateful values, so these need to be inputs to our recursive functions. In addition, each function will take the layer/group number as an input.
rotateImage :: (MArray array Int m) => array (Int, Int) Int -> m ()
rotateImage arr = do
((minR, minC), (maxR, maxC)) <- getBounds arr
let n = maxR - minR + 1
let numLayers = n `quot` 2
...
where
rotateLayer tl@(tlR, tlC) tr@(trR, trC) br@(brR, brC) bl@(blR, blC) n = ...
rotateGroup (tlR, tlC) (trR, trC) (brR, brC) (blR, blC) j = ...
Now we just have to fill in these functions. For rotateLayer, we use the “layer number” parameter as a countdown. Once it reaches 0, we’ll be done. We just need to determine the number of groups in this layer using the column difference of left and right. Then we’ll call rotateGroup for each group.
We make the first call to rotateLayer with numLayers and the original corners, coming from our dimensions. When we recurse, we add/subtract 1 from the corner dimensions, and subtract 1 from the layer number.
rotateImage :: (MArray array Int m) => array (Int, Int) Int -> m ()
rotateImage arr = do
((minR, minC), (maxR, maxC)) <- getBounds arr
let n = maxR - minR + 1
let numLayers = n `quot` 2
rotateLayer (minR, minC) (minR, maxC) (maxR, maxC) (maxR, minC) numLayers
where
rotateLayer _ _ _ _ 0 = return ()
rotateLayer tl@(tlR, tlC) tr@(trR, trC) br@(brR, brC) bl@(blR, blC) n = do
let numGroups = ([0..(trC - tlC - 1)] :: [Int])
forM_ numGroups (rotateGroup tl tr br bl)
rotateLayer (tlR + 1, tlC + 1) (trR + 1, trC - 1) (brR - 1, brC - 1) (blR - 1, blC + 1) (n - 1)
rotateGroup (tlR, tlC) (trR, trC) (brR, brC) (blR, blC) j = ...
And how do we rotate a group? We use the same five steps we took in Rust. We save the top left as temp
and then move the values around. We use the monadic functions readArray
and writeArray
to perform these actions in place on our Matrix.
rotateImage :: (MArray array Int m) => array (Int, Int) Int -> m ()
rotateImage arr = do
...
where
...
rotateGroup (tlR, tlC) (trR, trC) (brR, brC) (blR, blC) j = do
temp <- readArray arr (tlR, tlC + j)
readArray arr (blR - j, blC) >>= writeArray arr (tlR, tlC + j)
readArray arr (brR, brC - j) >>= writeArray arr (blR - j, blC)
readArray arr (trR + j, trC) >>= writeArray arr (brR, brC - j)
writeArray arr (trR + j, trC) temp
Here’s the full implementation:
rotateImage :: (MArray array Int m) => array (Int, Int) Int -> m ()
rotateImage arr = do
((minR, minC), (maxR, maxC)) <- getBounds arr
let n = maxR - minR + 1
let numLayers = n `quot` 2
rotateLayer (minR, minC) (minR, maxC) (maxR, maxC) (maxR, minC) numLayers
where
rotateLayer _ _ _ _ 0 = return ()
rotateLayer tl@(tlR, tlC) tr@(trR, trC) br@(brR, brC) bl@(blR, blC) n = do
let numGroups = ([0..(trC - tlC - 1)] :: [Int])
forM_ numGroups (rotateGroup tl tr br bl)
rotateLayer (tlR + 1, tlC + 1) (trR + 1, trC - 1) (brR - 1, brC - 1) (blR - 1, blC + 1) (n - 1)
rotateGroup (tlR, tlC) (trR, trC) (brR, brC) (blR, blC) j = do
temp <- readArray arr (tlR, tlC + j)
readArray arr (blR - j, blC) >>= writeArray arr (tlR, tlC + j)
readArray arr (brR, brC - j) >>= writeArray arr (blR - j, blC)
readArray arr (trR + j, trC) >>= writeArray arr (brR, brC - j)
writeArray arr (trR + j, trC) temp
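Since rotateImage is polymorphic over the MArray class, one way to try it out is in the ST monad. Here’s a minimal sketch (the imports and the rotatedElems wrapper are just for this snippet): we thaw an immutable array into a mutable STUArray, rotate it in place, and read the elements back out.
import qualified Data.Array as A
import Data.Array.IArray (elems)
import Data.Array.ST (runSTUArray, thaw)

-- Rotate the 3x3 example in place and return its elements in row-major order.
rotatedElems :: [Int]
rotatedElems = elems $ runSTUArray $ do
  let original = A.listArray ((0, 0), (2, 2)) [1..9] :: A.Array (Int, Int) Int
  marr <- thaw original
  rotateImage marr
  return marr
-- Expected: [7,4,1,8,5,2,9,6,3]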
Conclusion
We’ve got one more Matrix problem to solve next time, and then we’ll move on to some other data structures. To learn more about using Data Structures and Algorithms in Haskell, you can take our Solve.hs course. You’ll get the chance to write a number of data structures from scratch, and you’ll get plenty of practice working with them and using them in algorithms!
Binary Search in a 2D Matrix
In our problem last week, we covered a complex problem that used a binary search. Today, we’ll apply binary search again to solidify our understanding of it. This time, instead of extra algorithmic complexity, we’ll start adding some data structure complexity. We’ll be working with a 2D Matrix instead of basic arrays.
To learn more about data structures and algorithms in Haskell, you should take a look at our Solve.hs course! In particular, you’ll cover multi-dimensional arrays in module 2, and you’ll learn how to write algorithms in Haskell in module 3!
The Problem
Today’s problem is Search a 2D Matrix, and the description is straightforward. We’re given a 2D m x n matrix, as well as a target number. We have to return a boolean for whether or not that number is in the Matrix.
This is trivial with a simple scan, but we have an additional constraint that lets us solve the problem faster. The matrix is essentially ordered. Each row is non-decreasing, and the first element of each successive row is no smaller than the last element of the preceding row.
This allows us to get a solution that is O(log(n + m)), a considerable improvement over a linear scan.
The Algorithm
The algorithm is simple as well. We’ll do two binary searches. First, we’ll search over the rows to identify the last row which could contain the element. Then we’ll do a binary search of that row to see if the element is present or not.
We’ll have a slightly different form to our searches compared to last time. In last week’s problem, we knew we had to find a valid index for our search. Now, we may find that no valid index exists.
So we’ll structure our search interval in a semi-open fashion. The first index in our search interval is inclusive, meaning that it could still be a valid index. The second index is exclusive, meaning it is the lowest index that we consider invalid.
In mathematical notation, we would represent such an interval with a square bracket on the left and a parenthesis on the right. So if that interval is [0, 4), then 0, 1, 2, 3 are valid values. The interval [2,2) would be considered empty, with no valid values. We’ll see how we apply this idea in practice.
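Here’s a tiny Haskell illustration of the convention, just for intuition (it isn’t part of the solution):
-- The semi-open interval [low, hi): low may still be valid, hi has been ruled out.
inInterval :: Int -> (Int, Int) -> Bool
inInterval i (low, hi) = low <= i && i < hi
-- inInterval 3 (0, 4) == True
-- inInterval 2 (2, 2) == False  (the interval [2,2) is empty)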
Rust Solution
We don’t have that many terms to define at the start of this solution. We’ll save the size of both dimensions, and then prepare ourselves for the first binary search by assigning low as 0 (the first potential “valid” answer), hi as m (the lowest “invalid” answer), and creating our output rowWithTarget value. For this, we also assign m, an invalid value. If we fail to re-assign rowWithTarget in our binary search, we want it assigned to an easily testable invalid value.
pub fn search_matrix(matrix: Vec<Vec<i32>>, target: i32) -> bool {
let m = matrix.len();
let n = matrix[0].len();
let mut low = 0;
let mut hi = m;
let mut rowWithTarget = m;
...
}
Now we write our first binary search, looking for a row that could contain our target value. We maintain the typical pattern of binary search, using the loop while (low < hi) and assigning mid = (low + hi) / 2.
pub fn search_matrix(matrix: Vec<Vec<i32>>, target: i32) -> bool {
...
while (low < hi) {
let mid: usize = (low + hi) / 2;
if (matrix[mid][0] > target) {
hi = mid;
} else if (matrix[mid][n - 1] < target) {
low = mid + 1;
} else {
rowWithTarget = mid;
break;
}
}
if (rowWithTarget >= m) {
return false;
}
...
}
If the first element of the row is too large, we know that mid is “invalid”, so we can assign it as hi and continue. If the last element is too small, then we reassign low as mid + 1, as we want low to still be a potentially valid value.
Otherwise, we have found a potential row, so we assign rowWithTarget and break. If, after this search, rowWithTarget has the “invalid” value of m, we can return false, as there are no valid values.
Now we just do the same thing over again, but within rowWithTarget! We reassign low and hi (as n this time) to reset the while loop. And now our comparisons will look at the specific value matrix[rowWithTarget][mid].
pub fn search_matrix(matrix: Vec<Vec<i32>>, target: i32) -> bool {
...
low = 0;
hi = n;
while (low < hi) {
let mid: usize = (low + hi) / 2;
if (matrix[rowWithTarget][mid] > target) {
hi = mid;
} else if (matrix[rowWithTarget][mid] < target) {
low = mid + 1;
} else {
return true;
}
}
return false;
}
Again, we follow the same pattern of re-assigning low and hi. If we don’t hit the return true case in the loop, we’ll end up with return false at the end, because we haven’t found the target.
Here’s the full solution:
pub fn search_matrix(matrix: Vec<Vec<i32>>, target: i32) -> bool {
let m = matrix.len();
let n = matrix[0].len();
let mut low = 0;
let mut hi = m;
let mut rowWithTarget = m;
while (low < hi) {
let mid: usize = (low + hi) / 2;
if (matrix[mid][0] > target) {
hi = mid;
} else if (matrix[mid][n - 1] < target) {
low = mid + 1;
} else {
rowWithTarget = mid;
break;
}
}
if (rowWithTarget >= m) {
return false;
}
low = 0;
hi = n;
while (low < hi) {
let mid: usize = (low + hi) / 2;
if (matrix[rowWithTarget][mid] > target) {
hi = mid;
} else if (matrix[rowWithTarget][mid] < target) {
low = mid + 1;
} else {
return true;
}
}
return false;
}
Haskell Solution
In our Haskell solution, the main difference of course will be using recursion for the binary search. However, we’ll also change up the data structure a bit. In the Rust framing of the problem, we had a vector of vectors of values. We could do this in Haskell, but we could also use Array (Int, Int) Int. This lets us map row/column pairs to numbers in a more intuitive way.
import qualified Data.Array as A
search2DMatrix :: A.Array (Int, Int) Int -> Int -> Bool
search2DMatrix matrix target = ...
where
((minR, minC), (maxR, maxC)) = A.bounds matrix
Another unique feature of arrays is that the bounds don’t have to start from 0. We can have totally custom bounding dimensions for our rows and columns. So instead of using m and n, we’ll need to use the min and max of the row and column dimensions.
So now let’s define our first binary search, looking for the valid row. As we did last week, the input to our function will be two Int values, for the low and hi. As in our Rust solution, we’ll access the first and last element of the row defined by the “middle” of low and hi, and compare them against the target. We make recursive calls to searchRow if the row isn’t valid.
search2DMatrix :: A.Array (Int, Int) Int -> Int -> Bool
search2DMatrix matrix target = result
where
((minR, minC), (maxR, maxC)) = A.bounds matrix
searchRow :: (Int, Int) -> Int
searchRow (low, hi) = if low >= hi then maxR + 1 else
let mid = (low + hi) `quot` 2
firstInRow = matrix A.! (mid, minC)
lastInRow = matrix A.! (mid, maxC)
in if firstInRow > target
then searchRow (low, mid)
else if lastInRow < target
then searchRow (mid + 1, hi)
else mid
rowWithTarget = searchRow (minR, maxR + 1)
result = rowWithTarget <= maxR && ...
Instead of m, we have maxR + 1, which we use as the initial hi value, as well as a return value in the base case where low meets hi. We can return a result of False if rowWithTarget does not come back with a value that is at most maxR.
Now for our second search, we follow the same pattern, but now we’re returning a boolean. The base case returns False, and we return True if we find the value in rowWithTarget at position mid. Here’s what that looks like:
search2DMatrix :: A.Array (Int, Int) Int -> Int -> Bool
search2DMatrix matrix target = result
where
...
rowWithTarget = searchRow (minR, maxR + 1)
searchCol :: (Int, Int) -> Bool
searchCol (low, hi) = low < hi &&
let mid = (low + hi) `quot` 2
val = matrix A.! (rowWithTarget, mid)
in if val > target
then searchCol (low, mid)
else if val < target
then searchCol (mid + 1, hi)
else True
result = rowWithTarget <= maxR && searchCol (minC, maxC + 1)
You’ll see we now use the outcome of searchCol for result. And this completes our solution! Here’s the full code:
search2DMatrix :: A.Array (Int, Int) Int -> Int -> Bool
search2DMatrix matrix target = result
where
((minR, minC), (maxR, maxC)) = A.bounds matrix
searchRow :: (Int, Int) -> Int
searchRow (low, hi) = if low >= hi then maxR + 1 else
let mid = (low + hi) `quot` 2
firstInRow = matrix A.! (mid, minC)
lastInRow = matrix A.! (mid, maxC)
in if firstInRow > target
then searchRow (low, mid)
else if lastInRow < target
then searchRow (mid + 1, hi)
else mid
rowWithTarget = searchRow (minR, maxR + 1)
searchCol :: (Int, Int) -> Bool
searchCol (low, hi) = low < hi &&
let mid = (low + hi) `quot` 2
val = matrix A.! (rowWithTarget, mid)
in if val > target
then searchCol (low, mid)
else if val < target
then searchCol (mid + 1, hi)
else True
result = rowWithTarget <= maxR && searchCol (minC, maxC + 1)
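As a small usage sketch (assuming the qualified Data.Array import from above), here’s a sorted 3x4 matrix with one hit and one miss:
-- 16 is present in the matrix, 13 is not.
exampleSearch :: (Bool, Bool)
exampleSearch = (search2DMatrix m 16, search2DMatrix m 13)
  where
    m = A.listArray ((0, 0), (2, 3)) [1, 3, 5, 7, 10, 11, 16, 20, 23, 30, 34, 60]
-- Expected: (True, False)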
Conclusion
Next week, we’ll stay on the subject of 2D matrices, but we’ll learn about array mutation. This is a very tricky subject in Haskell, so make sure to come back for that article!
To learn how these data structures work in Haskell, read about Solve.hs, our Haskell Data Structures & Algorithms course!
Binary Search in Haskell and Rust
This week we’ll be continuing our series of problem solving in Haskell and Rust. But now we’re going to start moving beyond the terrain of “basic” problem solving techniques with strings, lists and arrays, and start moving in the direction of more complicated data structures and algorithms. Today we’ll explore a problem that is still array-based, but uses a tricky algorithm that involves binary search!
You’ll learn more about Data Structures and Algorithms in our Solve.hs course! The last 7 weeks or so of blog articles have focused on the types of problems you’ll see in Module 1 of that course, but now we’re going to start encountering ideas from Modules 2 & 3, which look extensively at essential data structures and algorithms you need to know for problem solving.
The Problem
Today’s problem is median of two sorted arrays. In this problem, we receive two arrays of numbers as input, each of them in sorted order. The arrays are not necessarily of the same size. Our job is to find the median of the cumulative set of numbers.
Now there’s a conceptually easy approach to this. We could simply scan through the two arrays, keeping track of one index for each one. We would increase the index for whichever number is currently smaller, and stop once we have passed by half of the total numbers. This approach is essentially the “merge” part of merge sort, and it would take O(n) time, since we are scanning half of all the numbers.
However, there’s a faster approach! And if you are asked this question in an interview for anything other than a very junior position, your interviewer will expect you to find this faster approach. Because the arrays are sorted, we can leverage binary search to find the median in O(log n) time. The approach isn’t easy to see though! Let’s go over the algorithm before we get into any code.
The Algorithm
This algorithm is a little tricky to follow (this problem is rated as “hard” on LeetCode). So we’re going to treat this a bit like a mathematical proof, and begin by defining useful terms. Then it will be easy to describe the coding concepts behind the algorithm.
Defining our Terms
Our input consists of 2 arrays, arr1 and arr2, with potentially different sizes n and m, respectively. Without loss of generality, let arr1 be the “shorter” array, so that n <= m. We’ll also define t as the total number of elements, n + m.
It is worthwhile to note right off the bat that if t
is odd, then a single element from one of the two lists will be the median. If t
is even, then we will average two elements together. Even though we won’t actually create the final merged array, we can imagine that it consists of 3 parts:
- The “prior” portion - all numbers before the median element(s)
- The median element(s), either 1 or 2.
- The “latter” portion - all numbers after the median element(s)
The total number of elements in the “prior” portion will end up being (t - 1) / 2, bearing in mind how integer division works. For example, whether t is 15 or 16, we get 7 elements in the “prior” portion. We’ll use p for this number.
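In code, this is just integer division, and a one-liner makes the odd/even behavior concrete:
-- The size of the "prior" portion for a total of t elements.
priorCount :: Int -> Int
priorCount t = (t - 1) `quot` 2
-- priorCount 15 == 7
-- priorCount 16 == 7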
Finally, let’s imagine p1, the number of elements from arr1 that will end up in the prior portion. If we know p1, then p2, the number of elements from arr2 in the prior portion, is fixed, because p1 + p2 = p. We can then think of p1 as an index into arr1: the index of the first element that is not in the prior portion. The only trick is that this index could be n, indicating that all elements of arr1 are in the prior portion.
Getting the Final Answer from our Terms
If we have the “correct” values for p1 and p2, then finding the median is easy. If t is odd, then the lower number between arr1[p1] and arr2[p2] is the median. If t is even, then we average the two smallest numbers among (arr1[p1], arr2[p2], arr1[p1 + 1], arr2[p2 + 1]).
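As a hedged sketch of just this final step (using plain Haskell lists rather than the arrays and vectors in the actual solutions), the logic looks like this. Indices that run past the end of a list are simply dropped from the candidate set:
import Data.List (sort)
import Data.Maybe (catMaybes)

-- Given correct split points p1 and p2, compute the median.
medianFromSplit :: [Double] -> [Double] -> Int -> Int -> Double
medianFromSplit arr1 arr2 p1 p2 =
  let t = length arr1 + length arr2
      at xs i = if i < length xs then Just (xs !! i) else Nothing
      candidates = sort $ catMaybes
        [at arr1 p1, at arr2 p2, at arr1 (p1 + 1), at arr2 (p2 + 1)]
  in if odd t
       then head candidates                          -- the single middle element
       else (candidates !! 0 + candidates !! 1) / 2  -- average the two middle elements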
So we’ve reduced this problem to a matter of finding p1, since p2 can be easily derived from it. How do we know we have the “correct” value for p1, and how do we search for it efficiently?
Solving for p1
The answer is that we will conduct a binary search on arr1 in order to find the correct value of p1. For any particular choice of p1, we determine the corresponding value of p2. Then we make two comparisons:
- Compare arr1[p1 - 1] to arr2[p2]
- Compare arr2[p2 - 1] to arr1[p1]
If both comparisons are less-than-or-equal, then our two p values are correct! The slices arr1[0..p1-1] and arr2[0..p2-1] always constitute a total of p values, and if these values are no greater than arr1[p1] and arr2[p2], then they constitute the entire “prior” set.
If, on the other hand, the first comparison yields “greater than”, then we have too many values from arr1 in our prior set. This means we need to recursively do the binary search on the left side of arr1, since p1 should be smaller.
Then if the second comparison yields “greater than”, we have too few values from arr1
in the “prior” set. We should increase p1
by searching the right half of our array.
This provides a complete algorithm for us to follow!
Rust Implementation
Our algorithm description was quite long, but the advantage of having so many details is that the code starts to write itself! We’ll start with our Rust implementation. Stage 1 is to define all of the terms using our input values. We want to define our sizes and array references generically so that arr1
is the shorter array:
pub fn find_median_sorted_arrays(nums1: Vec<i32>, nums2: Vec<i32>) -> f64 {
let mut n = nums1.len();
let mut m = nums2.len();
let mut arr1: &Vec<i32> = &nums1;
let mut arr2: &Vec<i32> = &nums2;
if (m < n) {
n = nums2.len();
m = nums1.len();
arr1 = &nums2;
arr2 = &nums1;
}
let t = n + m;
let p: usize = (t - 1) / 2;
...
}
Anatomy of a Binary Search
The next stage is the binary search, so we can find p1 and p2. Now a binary search is a particular kind of loop pattern. Like many of the loop patterns we worked with in the previous weeks, we can express it recursively, or with a loop construct like for or while. We’ll start with a while loop solution for Rust, and then show the recursive solution with Haskell.
All loops maintain some kind of state. For a binary search, the primary state is the two endpoints representing our “interval of interest”. This starts out as the entire interval, and shrinks by half each time until we’ve narrowed it to a single element (or no elements). We’ll represent these interval endpoints with low and hi. Our loop concludes once low is as large as hi.
let mut low = 0;
// Use the shorter array size!
let mut hi = n;
while (low < hi) {
...
}
In our particular case, we are also trying to determine the values for p1
and p2
. Each time we specify an interval, we’ll see if the midpoint of that interval (between low
and hi
) is the correct value of p1
:
...
let mut low = 0;
let mut hi = n;
let mut p1 = 0;
let mut p2 = 0;
while (low < hi) {
p1 = (low + hi) / 2;
p2 = p - p1;
...
}
Now we evaluate this p1
value using the two conditions we specified in our algorithm. These are self-explanatory, except we do need to cover some edge cases where one of our values is at the edge of the array bounds.
For example, if p1
is 0, the first condition is always “true”. If this condition failed, it would mean we want fewer elements from arr1
, but this is impossible if p1
is 0.
...
let mut low = 0;
let mut hi = n;
let mut p1 = 0;
let mut p2 = 0;
while (low < hi) {
p1 = (low + hi) / 2;
p2 = p - p1;
let cond1 = p1 == 0 || arr1[p1 - 1] <= arr2[p2];
let cond2 = p1 == n || p2 == 0 || arr2[p2 - 1] <= arr1[p1];
if (cond1 && cond2) {
break;
} else if (!cond1) {
p1 -= 1;
hi = p1;
} else {
p1 += 1;
low = p1;
}
}
p2 = p - p1;
...
If both conditions are met, you’ll see we break
, because we’ve found the right value for p1
! Otherwise, we know p1
is invalid. This means we want to exclude the existing p1
value from further consideration by changing either low
or hi
to remove it from the interval of interest.
So if cond1
is false, hi
becomes p1 - 1
, and if cond2
is false, low becomes p1 + 1
. In both cases, we also modify p1
itself first so that our loop does not conclude with p1
in an invalid location.
Getting the Final Answer
Now that we have p1
and p2
, we have to do a couple final tricks to get the final answer. We want to take the smaller value between arr1[p1]
and arr2[p2]
. But we have to handle the edge case where p1
might be n
, and we also want to increment the index for whichever array we take from. Note that p2
cannot be out of bounds right now!
let mut median = arr2[p2];
if (p1 < n && arr1[p1] < arr2[p2]) {
median = arr1[p1];
p1 += 1;
} else {
p2 += 1;
}
If the total number of elements is odd, we can simply return this number (converting to a float). However, in the even case we need one more number to take an average. So we’ll compare the values at the indices again, but now accounting for the fact that either (but not both) could be out of bounds.
let mut median = arr2[p2];
if (p1 < n && arr1[p1] < arr2[p2]) {
median = arr1[p1];
p1 += 1;
} else {
p2 += 1;
}
if (t % 2 == 0) {
if (p1 >= n) {
median += arr2[p2];
} else if (p2 >= m) {
median += arr1[p1];
} else {
median += std::cmp::min(arr1[p1], arr2[p2]);
}
let medianF: f64 = median.into();
return medianF / 2.0;
} else {
return median.into();
}
Here’s the complete implementation:
pub fn find_median_sorted_arrays(nums1: Vec<i32>, nums2: Vec<i32>) -> f64 {
let mut n = nums1.len();
let mut m = nums2.len();
let mut arr1: &Vec<i32> = &nums1;
let mut arr2: &Vec<i32> = &nums2;
if (m < n) {
n = nums2.len();
m = nums1.len();
arr1 = &nums2;
arr2 = &nums1;
}
let t = n + m;
let p: usize = (t - 1) / 2;
let mut low = 0;
let mut hi = n;
let mut p1 = 0;
let mut p2 = 0;
while (low < hi) {
p1 = (low + hi) / 2;
p2 = p - p1;
let cond1 = p1 == 0 || arr1[p1 - 1] <= arr2[p2];
let cond2 = p1 == n || p2 == 0 || arr2[p2 - 1] <= arr1[p1];
if (cond1 && cond2) {
break;
} else if (!cond1) {
p1 -= 1;
hi = p1;
} else {
p1 += 1;
low = p1;
}
}
p2 = p - p1;
let mut median = arr2[p2];
if (p1 < n && arr1[p1] < arr2[p2]) {
median = arr1[p1];
p1 += 1;
} else {
p2 += 1;
}
if (t % 2 == 0) {
if (p1 >= n) {
median += arr2[p2];
} else if (p2 >= m) {
median += arr1[p1];
} else {
median += std::cmp::min(arr1[p1], arr2[p2]);
}
let medianF: f64 = median.into();
return medianF / 2.0;
} else {
return median.into();
}
}
Haskell Implementation
Now let’s examine the Haskell implementation. Unlike the LeetCode version, we’ll just assume our inputs are Double
already instead of doing a conversion. Once again, we start by defining the terms:
medianSortedArrays :: V.Vector Double -> V.Vector Double -> Double
medianSortedArrays input1 input2 = ...
where
n' = V.length input1
m' = V.length input2
t = n' + m'
p = (t - 1) `quot` 2
(n, m, arr1, arr2) = if V.length input1 <= V.length input2
then (n', m', input1, input2) else (m', n', input2, input1)
...
Now we’ll implement the binary search, this time doing a recursive function. We’ll do this in two parts, starting with a helper function. This helper function will simply tell us if a particular index is correct for p1
. The trick though is that we’ll return an Ordering
instead of just a Bool
:
-- data Ordering = LT | EQ | GT
f :: Int -> Ordering
This lets us signal 3 possibilities. If we return EQ
, this means the index is valid. If we return LT
, this will mean we want fewer values from arr1
. And then GT
means we want more values from arr1
.
With this framing it’s easy to see the implementation of this helper now. We determine the appropriate p2
, figure out our two conditions, and return the value for each condition:
medianSortedArrays :: V.Vector Double -> V.Vector Double -> Double
medianSortedArrays input1 input2 = ...
where
...
f :: Int -> Ordering
f pi1 =
let pi2 = p - pi1
cond1 = pi1 == 0 || arr1 V.! (pi1 - 1) <= arr2 V.! pi2
cond2 = pi1 == n || pi2 == 0 || (arr2 V.! (pi2 - 1) <= arr1 V.! pi1)
in if cond1 && cond2 then EQ else if (not cond1) then LT else GT
Now we can use this helper in a recursive binary search. The binary search tracks two pieces of state for our interval ((Int, Int)
), and it will return the correct value for p1
. The implementation applies the base case (return low
if low >= hi
), determines the midpoint, calls our helper, and then recurses appropriately based on the helper result.
medianSortedArrays :: V.Vector Double -> V.Vector Double -> Double
medianSortedArrays input1 input2 = ...
where
...
f :: Int -> Ordering
f pi1 = ...
search :: (Int, Int) -> Int
search (low, hi) = if low >= hi then low else
let mid = (low + hi) `quot` 2
in case f mid of
EQ -> mid
LT -> search (low, mid - 1)
GT -> search (mid + 1, hi)
p1 = search (0, n)
p2 = p - p1
...
For the final part of the problem, we’ll define a helper. Given p1
and p2
, it will emit the “lower” value between the two indices in the array (accounting for edge cases) as well as the two new indices (since one will increment).
This is a matter of lazily defining the “next” value for each array, the “end” condition of each array, and the “result” if that array’s value is chosen:
medianSortedArrays :: V.Vector Double -> V.Vector Double -> Double
medianSortedArrays input1 input2 = ...
where
...
findNext pi1 pi2 =
let next1 = arr1 V.! pi1
next2 = arr2 V.! pi2
end1 = pi1 >= n
end2 = pi2 >= m
res1 = (next1, pi1 + 1, pi2)
res2 = (next2, pi1, pi2 + 1)
in if end1 then res2
else if end2 then res1
else if next1 <= next2 then res1 else res2
Now we just apply this either once or twice to get our result!
medianSortedArrays :: V.Vector Double -> V.Vector Double -> Double
medianSortedArrays input1 input2 = result
where
...
tIsEven = even t
(median1, nextP1, nextP2) = findNext p1 p2
(median2, _, _) = findNext nextP1 nextP2
result = if tIsEven
then (median1 + median2) / 2.0
else median1
Here’s the complete implementation:
medianSortedArrays :: V.Vector Double -> V.Vector Double -> Double
medianSortedArrays input1 input2 = result
where
n' = V.length input1
m' = V.length input2
t = n' + m'
p = (t - 1) `quot` 2
(n, m, arr1, arr2) = if V.length input1 <= V.length input2
then (n', m', input1, input2) else (m', n', input2, input1)
-- Evaluate the index in arr1
-- If it indicates the index can be part of a median, return EQ
-- If it indicates we need to move left in arr1, return LT
-- If it indicates we need to move right in arr1, return GT
-- Precondition: p1 <= n
f :: Int -> Ordering
f pi1 =
let pi2 = p - pi1
cond1 = pi1 == 0 || arr1 V.! (pi1 - 1) <= arr2 V.! pi2
cond2 = pi1 == n || pi2 == 0 || (arr2 V.! (pi2 - 1) <= arr1 V.! pi1)
in if cond1 && cond2 then EQ else if (not cond1) then LT else GT
search :: (Int, Int) -> Int
search (low, hi) = if low >= hi then low else
let mid = (low + hi) `quot` 2
in case f mid of
EQ -> mid
LT -> search (low, mid - 1)
GT -> search (mid + 1, hi)
findNext pi1 pi2 =
let next1 = arr1 V.! pi1
next2 = arr2 V.! pi2
end1 = pi1 >= n
end2 = pi2 >= m
res1 = (next1, pi1 + 1, pi2)
res2 = (next2, pi1, pi2 + 1)
in if end1 then res2
else if end2 then res1
else if next1 <= next2 then res1 else res2
p1 = search (0, n)
p2 = p - p1
tIsEven = even t
(median1, nextP1, nextP2) = findNext p1 p2
(median2, _, _) = findNext nextP1 nextP2
result = if tIsEven
then (median1 + median2) / 2.0
else median1
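As a quick sanity check (assuming the definition above is in scope, with Data.Vector imported qualified as V), we can try a couple of small inputs:
-- medianSortedArrays (V.fromList [1, 3]) (V.fromList [2, 4, 5, 6]) == 3.5
-- medianSortedArrays (V.fromList [1, 2]) (V.fromList [3])          == 2.0
main :: IO ()
main = do
  print (medianSortedArrays (V.fromList [1, 3]) (V.fromList [2, 4, 5, 6]))
  print (medianSortedArrays (V.fromList [1, 2]) (V.fromList [3]))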
Conclusion
If you want to learn more about these kinds of problem solving techniques, you should take our course Solve.hs! In the coming weeks, we’ll see more problems related to data structures and algorithms, which are covered extensively in Modules 2 and 3 of that course!
Buffer & Save with a Challenging Example
Welcome back to our series comparing LeetCode problems in Haskell and Rust. Today we’ll learn a new paradigm that I call “Buffer and Save”. This will also be the hardest problem we’ve done so far! The core loop structure isn’t that hard, but there are a couple layers of tricks to massage our data to get the final answer.
This will be the last problem we do that focuses strictly on string and list manipulation. The next set of problems we do will all rely on more advanced data structures or algorithmic ideas.
For more complete practice on problem solving in Haskell, check out Solve.hs, our newest course. This course will teach you everything you need to know about problem solving, data structures, and algorithms in Haskell. You’ll get loads of practice building structures and algorithms from scratch, which is very important for understanding and remembering how they work.
The Problem
Today’s problem is Text Justification. The idea here is that we are taking a list of words and a “maximum width” and printing out the words grouped into equal-width lines that are evenly spaced. Here’s an example input and output:
Example Input (list of 9 strings):
[“Study”, “Haskell”, “with”, “us”, “every”, “Monday”, “Morning”, “for”, “fun”]
Max Width: 16
Output (list of 4 strings):
“Study    Haskell”
“with   us  every”
“Monday   Morning”
“for fun         ”
There are a few notable rules, constraints, and edge cases. Here’s a list to summarize them:
- There is at least one word
- No word is larger than the max width
- All output strings must have max width as their length (including spaces)
- The first word of every line is set to the left
- The last line always has 1 space between words, and then enough spaces after the last word to reach the max width.
- All other lines with multiple words will align the final word all the way to the right
- The spaces in non-final lines are distributed as evenly as possible, but extra spaces go between words to the left.
The final point is potentially the trickiest to understand. Consider the second line above, with us every
. The max width is 16, and we have 3 words with a total of 11 characters. This leaves us 5 spaces. Having 3 words means 2 blanks, so the “left” blank gets 3 spaces and the “right” blank gets 2 spaces.
If you had a line with 5 words, a max width of 30, and 16 characters, you would place 4 spaces in the left two blanks, and 3 spaces in the right two blanks. The relative length of the words does not matter.
Words in Line: [“A”, “good”, “day”, “to”, “endure”]
Output Line:
“A    good    day   to   endure”
The Algorithm
As mentioned above, our main algorithmic idea could be called “buffer and save”. We’ve been defining all of our loops based on the state we must maintain between iterations of the loop. The buffer and save approach highlights two pieces of state for us:
- The strings we’ve accumulated for our answer so far (the “result”)
- A buffer of the strings in the “current” line we’re building.
So we’ll loop through the input words one at a time. We’ll consider if the next word can be added to the “current” line. If it would cause our current line to exceed the maximum width, we’ll “save” our current line and write it out to the “result” list, adding the required spaces.
To help our calculations, we’ll also include two other pieces of state in our loop:
- The number of characters in our “current” line
- The number of words in our “current” line
Finally, there’s the question of how to construct each output line. Combining the math with list-mechanics is a little tricky. But the central idea consists of 4 simple steps:
- Find the number of spaces (subtract number of characters from max width)
- Divide the number of spaces by the number of “blanks” (number of words - 1)
- The quotient is the “base” number of spaces per blank
- The remainder is the number of blanks (starting from the left) that get an extra space
The exact implementation of this idea differs between Haskell and Rust. Again this rests a lot on the “reverse” differences between Rust vectors and Haskell lists.
The final line has a slightly different (but easier) process. And we should note that the final line will still be in our buffer when we exit the loop! So we shouldn’t forget to add it to the result.
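Before we write any real code, here's a small Haskell sketch of just the space math from the list above. The helper spacesPerBlank is purely illustrative; the final solutions arrange this math a little differently.
-- Number of spaces for each blank, from left to right, in a non-final line.
-- charsInLine counts only word characters, not spaces.
spacesPerBlank :: Int -> Int -> Int -> [Int]
spacesPerBlank maxWidth charsInLine wordsInLine =
  let totalSpaces = maxWidth - charsInLine
      numBlanks = wordsInLine - 1
      -- The leftmost 'extra' blanks each get one more space than the rest.
      (base, extra) = quotRem totalSpaces numBlanks
  in replicate extra (base + 1) ++ replicate (numBlanks - extra) base
-- For "with us every" with max width 16: spacesPerBlank 16 11 3 == [3,2]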
Haskell Solution
We know enough now to jump into our Haskell solution. Our solution should be organized around a loop. Since we go through the input word-by-word, this should follow a fold pattern. So here’s our outline:
justifyText :: [String] -> Int -> [String]
justifyText inputWords maxWidth = ...
where
-- f = ‘final’
(fLine, fWordsInLine, fCharsInLine, result) = foldl loop ([], 0, 0, []) inputWords
loop :: ([String], Int, Int, [String]) -> String -> ([String], Int, Int, [String])
loop (currentLine, wordsInLine, charsInLine, currResult) newWord = ...
Let’s focus in on the choice we have to make in the loop. We need to determine if this new word fits in our current line. So we’ll get its length and add it to the number of characters in the line AND consider the number of words in the line. We count the words too since each word we already have requires at least one space!
-- (maxWidth is still in scope here)
loop :: ([String], Int, Int, [String]) -> String -> ([String], Int, Int, [String])
loop (currentLine, wordsInLine, charsInLine, currResult) newWord =
let newWordLen = length newWord
in if newWordLen + charsInLine + wordsInLine > maxWidth
then ...
else ...
How do we fill in these choices? If we don’t overflow the line, we just append the new word, bump the count of the words, and add the new word’s length to the character count.
loop :: ([String], Int, Int, [String]) -> String -> ([String], Int, Int, [String])
loop (currentLine, wordsInLine, charsInLine, currResult) newWord =
let newWordLen = length newWord
in if newWordLen + charsInLine + wordsInLine > maxWidth
then ...
else (newWord : currentLine, wordsInLine + 1, charsInLine + newWordLen, currResult)
The overflow case isn’t hard, but it does require us to have a function that can convert our current line into the final string. This function will also take the number of words and characters in this line. Assuming this function exists, we just make this new line, append it to result
, and then reset our other stateful values so that they only reflect the “new word” as part of our current line.
loop :: ([String], Int, Int, [String]) -> String -> ([String], Int, Int, [String])
loop (currentLine, wordsInLine, charsInLine, currResult) newWord =
let newWordLen = length newWord
resultLine = makeLine currentLine wordsInLine charsInLine
in if newWordLen + charsInLine + wordsInLine > maxWidth
then ([newWord], 1, newWordLen, resultLine : currResult)
else (newWord : currentLine, wordsInLine + 1, charsInLine + newWordLen, currResult)
makeLine :: [String] -> Int -> Int -> String
makeLine = ...
Before we think about the makeLine
implementation though, we just about have enough to fill in the rest of the “top” of our function definition. We’d just need another function for making the “final” line, since this is different from other lines. Then when we get our “final” state values, we’ll plug them into this function to get our final line, append this to the result, and reverse it all.
justifyText :: [String] -> Int -> [String]
justifyText inputWords maxWidth =
reverse (makeLineFinal fLine fWordsInLine fCharsInLine : result)
where
(fLine, fWordsInLine, fCharsInLine, result) = foldl loop ([], 0, 0, []) inputWords
loop :: ([String], Int, Int, [String]) -> String -> ([String], Int, Int, [String])
loop (currentLine, wordsInLine, charsInLine, currResult) newWord =
let newWordLen = length newWord
resultLine = makeLine currentLine wordsInLine charsInLine
in if newWordLen + charsInLine + wordsInLine > maxWidth
then ([newWord], 1, newWordLen, resultLine : currResult)
else (newWord : currentLine, wordsInLine + 1, charsInLine + newWordLen, currResult)
makeLine :: [String] -> Int -> Int -> String
makeLine = ...
makeLineFinal :: [String] -> Int -> Int -> String
makeLineFinal = ...
Now let’s discuss forming these lines, starting with the general case. We can start with a couple edge cases. This should never be called with an empty list. And with a singleton, we just left-align the word and add the right number of spaces:
makeLine :: [String] -> Int -> Int -> String
makeLine [] _ _ = error "Cannot makeLine with empty string!"
makeLine [onlyWord] _ charsInLine =
let extraSpaces = replicate (maxWidth - charsInLine) ' '
in onlyWord <> extraSpaces
makeLine (first : rest) wordsInLine charsInLine = ...
Now we’ll calculate the quotient and remainder to get the spacing sizes, as mentioned in our algorithm section. But how do we combine them? There are multiple ways, but the idea I thought of was to zip
the tail
of the list with the number of spaces it needs to append. Then we can fold it into a resulting list using a function like this:
-- (String, Int) is the next string and the number of spaces after it
combine :: String -> (String, Int) -> String
combine suffix (nextWord, numSpaces) =
nextWord <> replicate numSpaces ' ' <> suffix
Remember while doing this that we’ve accumulated the words for each line in reverse order. So we want to append each one in succession, together with the number of spaces that come after it.
To use this function, we can “fold” over the “tail” of our current line, while using the first word in our list as the base of the fold! Don’t forget the quotRem
math going on in here!
makeLine :: [String] -> Int -> Int -> String
makeLine [] _ _ = error "Cannot makeLine with empty string!"
makeLine [onlyWord] _ charsInLine =
let extraSpaces = replicate (maxWidth - charsInLine) ' '
in onlyWord <> extraSpaces
makeLine (first : rest) wordsInLine charsInLine =
let (baseNumSpaces, numWithExtraSpace) = quotRem (maxWidth - charsInLine) (wordsInLine - 1)
baseSpaces = replicate (wordsInLine - 1 - numWithExtraSpace) baseNumSpaces
extraSpaces = replicate numWithExtraSpace (baseNumSpaces + 1)
wordsWithSpaces = zip rest (baseSpaces <> extraSpaces)
in foldl combine first wordsWithSpaces
combine :: String -> (String, Int) -> String
combine suffix (nextWord, numSpaces) =
nextWord <> replicate numSpaces ' ' <> suffix
To make the final line, we can also leverage our combine
function! It’s just a matter of combining each word in our input with the appropriate number of spaces. In this case, almost every word gets 1 space except for the last one (which comes first in our list). This just gets however many trailing spaces we need!
makeLineFinal :: [String] -> Int -> Int -> String
makeLineFinal [] _ _ = error "Cannot makeLine with empty string!"
makeLineFinal strs wordsInLine charsInLine =
let trailingSpaces = maxWidth - charsInLine - (wordsInLine - 1)
in foldl combine "" (zip strs (trailingSpaces : repeat 1))
Putting all these pieces together, we have our complete solution!
justifyText :: [String] -> Int -> [String]
justifyText inputWords maxWidth =
reverse (makeLineFinal fLine fWordsInLine fCharsInLine : result)
where
(fLine, fWordsInLine, fCharsInLine, result) = foldl loop ([], 0, 0, []) inputWords
loop :: ([String], Int, Int, [String]) -> String -> ([String], Int, Int, [String])
loop (currentLine, wordsInLine, charsInLine, currResult) newWord =
let newWordLen = length newWord
resultLine = makeLine currentLine wordsInLine charsInLine
in if newWordLen + charsInLine + wordsInLine > maxWidth
then ([newWord], 1, newWordLen, resultLine : currResult)
else (newWord : currentLine, wordsInLine + 1, charsInLine + newWordLen, currResult)
makeLine :: [String] -> Int -> Int -> String
makeLine [] _ _ = error "Cannot makeLine with empty string!"
makeLine [onlyWord] _ charsInLine =
let extraSpaces = replicate (maxWidth - charsInLine) ' '
in onlyWord <> extraSpaces
makeLine (first : rest) wordsInLine charsInLine =
let (baseNumSpaces, numWithExtraSpace) = quotRem (maxWidth - charsInLine) (wordsInLine - 1)
baseSpaces = replicate (wordsInLine - 1 - numWithExtraSpace) baseNumSpaces
extraSpaces = replicate numWithExtraSpace (baseNumSpaces + 1)
wordsWithSpaces = zip rest (baseSpaces <> extraSpaces)
in foldl combine first wordsWithSpaces
makeLineFinal :: [String] -> Int -> Int -> String
makeLineFinal [] _ _ = error "Cannot makeLine with empty string!"
makeLineFinal strs wordsInLine charsInLine =
let trailingSpaces = maxWidth - charsInLine - (wordsInLine - 1)
in foldl combine "" (zip strs (trailingSpaces : repeat 1))
combine :: String -> (String, Int) -> String
combine suffix (nextWord, numSpaces) = nextWord <> replicate numSpaces ' ' <> suffix
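As a quick check against the example from the start of this post (assuming the definitions above are in scope; example is just an illustrative name):
example :: [String]
example = justifyText
  ["Study", "Haskell", "with", "us", "every", "Monday", "Morning", "for", "fun"] 16
-- example ==
--   [ "Study    Haskell"
--   , "with   us  every"
--   , "Monday   Morning"
--   , "for fun         "
--   ]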
Rust Solution
Now let’s put together our Rust solution. Since we have a reasonable outline from writing this in Haskell, let’s start with the simpler elements, makeLine
and makeLineFinal
. We’ll use library functions as much as possible for the string manipulation. For example, we can start makeLineFinal
by using join
on our input vector of strings.
pub fn make_line_final(
currentLine: &Vec<&str>,
max_width: usize,
charsInLine: usize) -> String {
let mut result = currentLine.join(" ");
...
}
Now we just need to calculate the number of trailing spaces, subtracting the number of characters in the joined string. We append this to the end by taking a blank space and using repeat
for the correct number of times.
pub fn make_line_final(
currentLine: &Vec<&str>,
max_width: usize,
charsInLine: usize) -> String {
let mut result = currentLine.join(" ");
let trailingSpaces = max_width - result.len();
result.push_str(&" ".repeat(trailingSpaces));
return result;
}
For those unfamiliar with Rust, the type of our input vector might seem odd. When we have &Vec<&str>
, this means a reference to a vector of string slices. String slices are portions of a String
that we hold a reference to, but they aren’t copied. However, when we join
them, we make a new String
result.
Also note that we aren’t passing wordsInLine
as a separate parameter. We can get this value using .len()
in constant time in Rust. In Haskell, length
is O(n)
so we don’t want to always do that.
Now for the general make_line
function, we have the same type signature, but we start with our base case, where we only have one string in our current line. Again, we use repeat
with the number of spaces.
pub fn make_line(
currentLine: &Vec<&str>,
max_width: usize,
charsInLine: usize) -> String {
let mut result = String::new();
let n = currentLine.len();
if (n == 1) {
result.push_str(currentLine[0]);
result.push_str(&" ".repeat(max_width - charsInLine));
return result;
}
...
}
Now we do the “math” portion of this. Rust doesn’t have a single quotRem
function in its base library, so we calculate these values separately.
pub fn make_line(
currentLine: &Vec<&str>,
max_width: usize,
charsInLine: usize) -> String {
let mut result = String::new();
let n = currentLine.len();
if (n == 1) {
result.push_str(currentLine[0]);
result.push_str(&" ".repeat(max_width - charsInLine));
return result;
}
let numSpaces = (max_width - charsInLine);
let baseNumSpaces = numSpaces / (n - 1);
let numWithExtraSpace = numSpaces % (n - 1);
let mut i = 0;
while i < n {
...
}
return result;
}
The while
loop we’ll write here is instructive. We use an index instead of a for each
pattern because the index tells us how many spaces to use. If our index is smaller than numWithExtraSpace
, we add 1 to the base number of spaces. Otherwise we use the base until the index n - 1
. This index has no extra spaces, so we’re done at that point!
pub fn make_line(
currentLine: &Vec<&str>,
max_width: usize,
charsInLine: usize) -> String {
let mut result = String::new();
let n = currentLine.len();
if (n == 1) {
result.push_str(currentLine[0]);
result.push_str(&" ".repeat(max_width - charsInLine));
return result;
}
let numSpaces = (max_width - charsInLine);
let baseNumSpaces = numSpaces / (n - 1);
let numWithExtraSpace = numSpaces % (n - 1);
let mut i = 0;
while i < n {
result.push_str(currentLine[i]);
if i < numWithExtraSpace {
result.push_str(&" ".repeat(baseNumSpaces + 1));
} else if i < n - 1 {
result.push_str(&" ".repeat(baseNumSpaces));
}
i += 1;
}
return result;
}
Now we frame our solution. Let’s start by setting up our state variables (again, omitting numWordsInLine
). We’ll also redefine max_width
as a usize
value for ease of comparison later.
pub fn full_justify(words: Vec<String>, max_width: i32) -> Vec<String> {
let mut currentLine: Vec<&str> = Vec::new();
let mut charsInLine = 0;
let mut result = Vec::new();
let mw = max_width as usize;
...
}
Now we’d like to frame our solution as a “for each” loop. However, this doesn’t work, for Rust-related reasons we’ll describe after the solution! Instead, we’ll use an index loop.
pub fn full_justify(words: Vec<String>, max_width: i32) -> Vec<String> {
let mut currentLine: Vec<&str> = Vec::new();
let mut charsInLine = 0;
let mut result = Vec::new();
let mw = max_width as usize;
let mut i = 0;
let n = words.len();
for i in 0..n {
...
}
}
We’ll get the word
by index on each iteration, and use its length to see if we’ll exceed the max width. If not, we can safely push it onto currentLine
and increase the character count:
pub fn full_justify(words: Vec<String>, max_width: i32) -> Vec<String> {
let mut currentLine: Vec<&str> = Vec::new();
let mut charsInLine = 0;
let mut result = Vec::new();
let mw = max_width as usize;
let mut i = 0;
let n = words.len();
for i in 0..n {
let word = &words[i];
if word.len() + charsInLine + currentLine.len() > mw {
...
} else {
currentLine.push(&words[i]);
charsInLine += word.len();
}
}
}
Now when we do exceed the max width, we have to push our current line onto result
(calling make_line
). We clear the current line, push our new word, and use its length for charsInLine
.
pub fn full_justify(words: Vec<String>, max_width: i32) -> Vec<String> {
let mut currentLine: Vec<&str> = Vec::new();
let mut charsInLine = 0;
let mut result = Vec::new();
let mw = max_width as usize;
let mut i = 0;
let n = words.len();
for i in 0..n {
let word = &words[i];
if word.len() + charsInLine + currentLine.len() > mw {
result.push(make_line(&currentLine, mw, charsInLine));
currentLine.clear();
currentLine.push(&words[i]);
charsInLine = word.len();
} else {
currentLine.push(&words[i]);
charsInLine += word.len();
}
}
...
}
After our loop, we’ll just call make_line_final
on whatever is left in our currentLine
! Here’s our complete full_justify
function that calls make_line
and make_line_final
as we wrote above:
pub fn full_justify(words: Vec<String>, max_width: i32) -> Vec<String> {
let mut currentLine: Vec<&str> = Vec::new();
let mut charsInLine = 0;
let mut result = Vec::new();
let mw = max_width as usize;
let mut i = 0;
let n = words.len();
for i in 0..n {
let word = &words[i];
if word.len() + charsInLine + currentLine.len() > mw {
result.push(make_line(&currentLine, mw, charsInLine));
currentLine.clear();
currentLine.push(&words[i]);
charsInLine = word.len();
} else {
currentLine.push(&words[i]);
charsInLine += word.len();
}
}
result.push(make_line_final(&currentLine, mw, charsInLine));
return result;
}
Why an Index Loop?
Inside our Rust loop, we have an odd pattern in getting the “word” for this iteration. We first assign word = &words[i]
, and then later on, when we push that word, we reference words[i]
again, using currentLine.push(&words[i])
.
Why do this? Why not currentLine.push(word)
? And then, why can’t we just do for word in words
as our loop?
If we write our loop as for word in words
, then we cannot reference the value word
after the loop. It is “scoped” to the loop. However, currentLine
“outlives” the loop! We have to reference currentLine
at the end when we make our final line.
To get around this, we would basically have to copy the word instead of using a string reference &str
, but this is unnecessarily expensive.
These are the sorts of odd “lifetime” quirks you have to learn to deal with in Rust. Haskell is easier in that it spares us from thinking about this. But Rust gains a significant performance boost with these sorts of ideas.
Conclusion
This was definitely the most involved problem we’ve dealt with so far. We learned a new paradigm (buffer and save), and got some experience dealing with some of the odd quirks and edge cases of string manipulation, especially in Rust. It was a fairly tricky problem, as far as list manipulation goes. For an easier example of a buffer and save problem, try solving Merge Intervals.
If you want to level up your Haskell problem solving skills, you need to take our course Solve.hs. This course will teach you everything you need to know about problem solving, data structures, and algorithms in Haskell. After this course, you’ll be in great shape to deal with these sorts of LeetCode style problems as they come up in your projects.
The Sliding Window in Haskell & Rust
In last week’s problem, we covered a two-pointer algorithm, and compared Rust and Haskell solutions as we have been for this whole series. Today, we’ll study a related concept, the sliding window problem. Whereas the general two-pointer problem can often be tackled by a single loop, we’ll have to use nested loops in this problem. This problem will also mark our first use of the Set
data structure in this series.
If you want a deeper look at problem solving techniques in Haskell, you should enroll in our Solve.hs course! You’ll learn everything you need for general problem solving knowledge in Haskell, including data structures, algorithms, and parsing!
The Problem
Today’s LeetCode problem is Longest Substring without Repeating Characters. It’s a lengthy problem name, but the name basically tells you everything you need to know! We want to find a substring of our input that does not repeat any characters within the substring, and then get the longest such substring.
For example, abaca
would give us an answer of 3, since we have the substring bac
that consists of 3 unique characters. However, abaaca
only gives us 2. There is no run of 3 characters where the three characters are all unique.
The Algorithm
The approach we’ll use, as mentioned above, is called a sliding window algorithm. In some ways, this is similar to the two-pointer approach last week. We’ll have, in a sense, two different pointers within our input. One dictates the “left end” of a window and one dictates the “right end” of a window. Unlike last week’s problem though, both pointers will move in the same direction, rather than converging from opposite directions.
The goal of a sliding window problem is “find a continuous subsequence of an input that matches the criteria”. And for many problems like ours, you want to find the longest such subsequence. The main process for a sliding window problem is this:
- Grow the window by increasing the “right end” until (or while) the predicate is satisfied
- Once you cannot grow the window any more, shrink the window by increasing the “left end” until we’re in a position to grow the window again.
- Continue until one or both pointers go off the end of the input list.
So for our problem today, we want to “grow” our sliding window as long as we can get more unique characters. Once we hit a character we’ve already seen in our current window, we’ll need to shrink the window until that duplicate character is removed from the set.
As we’re doing this, we’ll need to keep track of the largest substring size we’ve seen so far.
Here are the steps we would take with the input abaca
. At each step, we process a new input character.
1. Index 0 (‘a’) - window is “a” which is all unique.
2. Index 1 (‘b’) - window is “ab” which is all unique
3. Index 2 (‘a’) - window is “aba”, which is not all unique
3b. Shrink window, removing first ‘a’, so it is now “ba”
4. Index 3 (‘c’) - window is “bac”, which is all unique
5. Index 4 (‘a’) - window is “baca”, which is not unique
5b. Shrink window, remove ‘b’ and ‘a’, leaving “ca”
The largest unique window we saw was bac
, so the final answer is 3.
Haskell Solution
For a change of pace, let’s discuss the Haskell approach first. Our algorithm is laid out in such a way that we can process one character at a time. Each character either grows the window, or forces it to shrink to accommodate the character. This means we can use a fold!
Let’s think about what state we need to track within this fold. Naturally, we want to track the current “set” of characters in our window. Each time we see the next character, we have to quickly determine if it’s already in the window. We’ll also want to track the largest set size we’ve seen so far, since by the end of the string our window might no longer reflect the largest subsequence.
With a general sliding window approach, you would also need to track both the start and the end index of your current window. In this problem though, we can get away with just tracking the start index. We can always derive the end index by taking the start index and adding the size of the set. And since we’re iterating through the characters anyway, we don’t need the end index to get the “next” character.
This means our fold-loop function will have this type signature:
-- State: (start index, set of letters, largest seen)
loop :: (Int, S.Set Char, Int) -> Char -> (Int, S.Set Char, Int)
Now, using our idea of “beginning from the end”, we can already write the invocation of this loop:
largestUniqueSubsequence :: String -> Int
largestUniqueSubsequence input = best
where
(_, _, best) = foldl loop (0, S.empty, 0) input
loop :: (Int, S.Set Char, Int) -> Char -> (Int, S.Set Char, Int)
...
Using 0
for the start index right away is a little hand-wavy, since we haven’t actually added the first character to our set yet! But if we see a single character, we’ll always add it, and as we’ll see, the “adding” branch of our loop never increases this number.
With that in mind, let’s write this branch of our loop handler! If we have not seen the next character in the string, we keep the same start index (left side of the window isn’t moving), we add the character to our set, and we take the new size of the set as the “best” value if it’s greater than the original. We get the new size by adding 1 to the original set size.
largestUniqueSubsequence :: String -> Int
largestUniqueSubsequence input = best
where
(_, _, best) = foldl loop (0, S.empty, 0) input
loop :: (Int, S.Set Char, Int) -> Char -> (Int, S.Set Char, Int)
loop (startIndex, charSet, bestSoFar) c = if S.notMember c charSet
then (startIndex, S.insert c charSet, max bestSoFar (S.size charSet + 1))
else ...
Now we reach the tricky case! If we’ve already seen the next character, we need to remove characters from our set until we reach the instance of this character in the set. Since we might need to remove multiple characters, “shrinking” is an iterative process with a variable number of steps. This means it would be a while-loop in most languages, which means we need another recursive function!
The goal of this function is to change two of our stateful values (the start index and the character set) until we can once again have a unique character set with the new input character. So each iteration it takes the existing values for these, and will ultimately return updated values. Here’s its type signature:
shrink :: (Int, S.Set Char) -> Char -> (Int, S.Set Char)
Before we implement this, we can invoke it in our primary loop! When we’ve seen the new character in our set, we shrink the input to match this character, and then return these new stateful values along with our previous best (shrinking never increases the size).
largestUniqueSubsequence :: String -> Int
largestUniqueSubsequence input = best
where
(_, _, best) = foldl loop (0, S.empty, 0) input
loop :: (Int, S.Set Char, Int) -> Char -> (Int, S.Set Char, Int)
loop (startIndex, charSet, bestSoFar) c = if S.notMember c charSet
then (startIndex, S.insert c charSet, max bestSoFar (S.size charSet + 1))
else
let (newStart, newSet) = shrink (startIndex, charSet) c
in (newStart, newSet, bestSoFar)
shrink :: (Int, S.Set Char) -> Char -> (Int, S.Set Char)
shrink = undefined
Now we implement “shrink” by considering the base case and recursive case. In the base case, the character at this index matches the duplicate character we’re trying to remove. So we can return the same set of characters, but increase the index.
In the recursive case, we still increase the index, but now we remove the character at the start index from the set without replacement. (Note how we need a vector for efficient indexing here).
largestUniqueSubsequence :: String -> Int
largestUniqueSubsequence input = best
where
inputV = V.fromList input -- vector of the input for O(1) indexing in shrink
(_, _, best) = foldl loop (0, S.empty, 0) input
loop :: (Int, S.Set Char, Int) -> Char -> (Int, S.Set Char, Int)
loop (startIndex, charSet, bestSoFar) c = if S.notMember c charSet
then (startIndex, S.insert c charSet, max bestSoFar (S.size charSet + 1))
else
let (newStart, newSet) = shrink (startIndex, charSet) c
in (newStart, newSet, bestSoFar)
shrink :: (Int, S.Set Char) -> Char -> (Int, S.Set Char)
shrink (startIndex, charSet) c =
let nextC = inputV V.! startIndex
-- Base Case: nextC is equal to c
in if nextC == c then (startIndex + 1, charSet)
-- Recursive Case: remove the character at startIndex
else shrink (startIndex + 1, S.delete nextC charSet) c
Now we have a complete Haskell solution!
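As a quick check against the examples from the problem statement (assuming the definition above is in scope, with Data.Set as S and Data.Vector as V imported):
main :: IO ()
main = do
  print (largestUniqueSubsequence "abaca")  -- 3
  print (largestUniqueSubsequence "abaaca") -- 2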
Rust Solution
Now in our Rust solution, we’ll follow the same pattern we’ve been doing for these problems. We’ll set up our loop variables, write the loop, and handle the different cases in the loop. Because we had the nested recursive “shrink” function in Haskell, this will translate to a “while” loop in Rust, nested within our for-loop.
Here’s how we set up our loop variables:
pub fn length_of_longest_substring(s: String) -> i32 {
let mut best = 0;
let mut startIndex = 0;
let inputV: Vec<char> = s.chars().collect();
let mut charSet = HashSet::new();
for c in s.chars() {
...
}
}
Within the loop, we have the “easy” case, where the next character is not already in our set. We just insert it into our set, and we update best
if we have a new maximum.
pub fn length_of_longest_substring(s: String) -> i32 {
let mut best = 0;
let mut startIndex = 0;
let inputV: Vec<char> = s.chars().collect();
let mut charSet = HashSet::new();
for c in s.chars() {
if charSet.contains(&c) {
...
} else {
charSet.insert(c);
best = std::cmp::max(best, charSet.len());
}
}
return best as i32;
}
The Rust-specific oddity is that when we call contains
on the HashSet
, we must use &c
, passing a reference to the character. In C++ we could just copy the character, or it could be handled by the function using const&
. But Rust handles these things a little differently.
Now we get to the “tricky” case within our loop. How do we “shrink” our set to consume a new character?
In our case, we’ll actually just use the loop
functionality of Rust, which works like while (true)
, requiring a manual break
inside the loop. Our idea is that we’ll inspect the character at the “start” index of our window. If this character is the same as the new character, we will advance the start index (indicating we are dropping the old version), but then we’ll break
. Otherwise, we’ll still increase the index, but we’ll remove the other character from the set as well.
Here’s what this loop looks like in relative isolation:
if charSet.contains(&c) {
loop {
// Look at “first” character of window
let nextC = inputV[startIndex];
if (nextC == c) {
// If it’s the new character, we advance past it and break
startIndex += 1;
break;
} else {
// Otherwise, advance AND drop it from the set
startIndex += 1;
charSet.remove(&nextC);
}
}
} else {
...
}
The inner condition (nextC == c
) feels a little flimsy to use with a while (true)
loop. But it’s perfectly sound because of the invariant that if charSet
contains c
, we’ll necessarily find nextC == c
before startIndex
gets too large. We could also write it as a normal while
loop, but loop
is an interesting Rust-specific idea to bring in here.
Here’s our complete Rust solution!
use std::collections::HashSet;
pub fn length_of_longest_substring(s: String) -> i32 {
let mut best = 0;
let mut startIndex = 0;
let inputV: Vec<char> = s.chars().collect();
let mut charSet = HashSet::new();
for c in s.chars() {
if charSet.contains(&c) {
loop {
let nextC = inputV[startIndex];
if (nextC == c) {
startIndex += 1;
break;
} else {
startIndex += 1;
charSet.remove(&nextC);
}
}
} else {
charSet.insert(c);
best = std::cmp::max(best, charSet.len());
}
}
return best as i32;
}
Conclusion
With today’s problem, we’ve covered another important problem-solving concept: the sliding window. We saw how this approach could work even with a fold in Haskell, considering one character at a time. We also saw how nested loops compare across Haskell and Rust.
For more problem solving tips and tricks, take a look at Solve.hs, our complete course on problem solving, data structures, and algorithms in Haskell. You’ll get tons of practice on problems like these so you can significantly level up your skills!
Two Pointer Algorithms
We’re now on to part 5 of our series comparing Haskell and Rust solutions for LeetCode problems. You can also look at the previous parts (Part 1, Part 2, Part 3, Part 4) to get some more context on what we’ve learned so far comparing these two languages.
For a full look at problem solving in Haskell, check out Solve.hs, our latest course! You’ll get full breakdowns on the processes for solving problems in Haskell, from basic list and loop problems to advanced algorithms!
The Problem
Today we’ll be looking at a problem called Trapping Rain Water. In this problem, we’re given a vector of heights, which form a sort of 1-dimensional topology. Our job is to figure out how many units of water could be collected within the topology.
As a very simple example, the input [1,0,2]
could collect 1 unit of water. Here’s a visualization of that system, where x
shows the topology and o
shows water we collect:
  x
xox
We can never collect any water over the left or right “edges” of the array, since it would flow off. The middle index of our array though is lower than its neighbors. So we take the lower of these neighboring values, and we see that we can collect 1
unit of water in this system.
For a bigger example that collects water, we might have the input [4, 2, 1, 1, 3, 5]
. Here’s what that looks like:
          x
x o o o o x
x o o o x x
x x o o x x
x x x x x x
The total water here is 9.
A flat system like [2,2,2]
, or a system that looks like a peak [1,2,3,2,1]
cannot collect any water, so we should return 0 in these cases.
The Algorithm
There are a couple ways to solve this. One approach would be a two-pass solution, similar to what we used in Product of Array Except Self. We loop from the left side, tracking the maximum water we can store in each unit based on its left neighbors. Then we loop again from the right side and compare the maximum we can store based on the right neighbors to the prior value from the left. This solution is O(n)
time, but O(n)
space as well.
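To make that two-pass idea concrete, here's a short Haskell sketch of it (waterNaive is an illustrative name; it isn't the solution we'll build below):
-- For each index, the water above it is bounded by the smaller of the running
-- maximum from the left and the running maximum from the right.
waterNaive :: [Int] -> Int
waterNaive heights = sum
  [ max 0 (min leftMax rightMax - h)
  | (h, leftMax, rightMax) <- zip3 heights (scanl1 max heights) (scanr1 max heights)
  ]
-- waterNaive [1,0,2]       == 1
-- waterNaive [4,2,1,1,3,5] == 9
-- waterNaive [2,2,2]       == 0
Keeping both scans around is where the extra O(n) space comes from.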
A more optimal solution for this problem is a two-pointer approach that can use O(1)
additional space. In this kind of solution, we look at the left and right of the input simultaneously. Each step of the way, we make a decision to either increase the “left pointer” or decrease the “right pointer” until they meet in the middle. Each time we move, we get more information about our solution.
In this particular problem, we’ll track the maximum value we’ve seen from the left side and the maximum value we’ve seen from the right side. As we traverse each index, we update both sides for the current left and right indices if we have a new maximum.
The crucial step is to see that if the current “left max” is no larger than the current “right max”, we know how much water can be stored at the left index. This is just the left max minus the height at the left index. Then we can increment the left index.
If the opposite is true, we calculate how much water can be stored at the right index, and decrease the right index.
So we keep a running tally of these sums, and we end our loop when they meet in the middle.
Rust Solution
We can describe our algorithm as a simple while loop. This loop goes until the left index exceeds the right index. The loop needs to track 5 values:
- Left Index
- Right Index
- Left Max
- Right Max
- Total sum so far
So let’s write the setup portion of the loop:
pub fn trap(height: Vec<i32>) -> i32 {
let mut leftMax = -1;
let mut rightMax = -1;
let mut leftI = 0;
let mut rightI = height.len() - 1;
let mut total = 0;
while leftI <= rightI {
...
}
}
A subtle thing: the constraints on the LeetCode problem guarantee that the length is at least 1. But to handle length-0 cases, we would need a special case. Rust uses unsigned integers for vector lengths, so taking height.len() - 1 on a length-0 vector would underflow (panicking in debug builds, or wrapping to a huge number in release builds), and this would mess up our loop and indexing.
Within the while
loop, we run the algorithm.
- Adjust leftMax and rightMax if necessary.
- If leftMax is not larger, add leftMax - height[leftI] to total and increment leftI.
- Otherwise (rightMax is smaller), add rightMax - height[rightI] to total and decrement rightI.
And at the end, we return our total!
pub fn trap(height: Vec<i32>) -> i32 {
let n = height.len();
if n <= 1 {
return 0;
}
let mut leftMax = -1;
let mut rightMax = -1;
let mut leftI = 0;
let mut rightI = n - 1;
let mut total = 0;
while leftI <= rightI {
// Step 1
leftMax = std::cmp::max(leftMax, height[leftI]);
rightMax = std::cmp::max(rightMax, height[rightI]);
if leftMax <= rightMax {
// Step 2
total += leftMax - height[leftI];
leftI += 1;
} else {
// Step 3
total += rightMax - height[rightI];
rightI -= 1;
}
}
return total;
}
Haskell Solution
Now that we’ve seen our Rust solution with a single loop, let’s remember our process for translating this idea to Haskell. With a two-pointer loop, the way in which we traverse the elements of the input is unpredictable, thus we need a raw recursive function, rather than a fold or a map.
Since we’re tracking 5 integer values, we’ll want to write a loop
function that looks like this:
-- (leftIndex, rightIndex, leftMax, rightMax, sum)
loop :: (Int, Int, Int, Int, Int) -> Int
Knowing this, we can already “start from the end” and figure out how to invoke our loop from the start of our function:
trapWater :: V.Vector Int -> Int
trapWater input = loop (0, n - 1, -1, -1, 0)
where
n = V.length input
loop :: (Int, Int, Int, Int, Int) -> Int
loop = undefined
In writing our recursive loop, we’ll start with the base case. Once leftI
is the bigger index, we return the total.
trapWater :: V.Vector Int -> Int
trapWater input = loop (0, n - 1, -1, -1, 0)
where
n = V.length input
loop :: (Int, Int, Int, Int, Int) -> Int
loop (leftI, rightI, leftMax, rightMax, total) = if leftI > rightI then total
else ...
Within the else
case, we just follow our algorithm, with the same 3 steps we saw with Rust.
trapWater :: V.Vector Int -> Int
trapWater input = loop (0, n - 1, -1, -1, 0)
where
n = V.length input
-- (leftIndex, rightIndex, leftMax, rightMax, sum)
loop :: (Int, Int, Int, Int, Int) -> Int
loop (leftI, rightI, leftMax, rightMax, total) = if leftI > rightI then total
else
-- Step 1
let leftMax' = max leftMax (input V.! leftI)
rightMax' = max rightMax (input V.! rightI)
in if leftMax' <= rightMax'
-- Step 2
then loop (leftI + 1, rightI, leftMax', rightMax', total + leftMax' - input V.! leftI)
-- Step 3
else loop (leftI, rightI - 1, leftMax', rightMax', total + rightMax' - input V.! rightI)
And we have our Haskell solution!
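As a quick check against the examples from earlier (assuming Data.Vector is imported as V and the definition above is in scope; quickChecks is just an illustrative name):
quickChecks :: [Int]
quickChecks =
  [ trapWater (V.fromList [1, 0, 2])          -- 1
  , trapWater (V.fromList [4, 2, 1, 1, 3, 5]) -- 9
  , trapWater (V.fromList [2, 2, 2])          -- 0
  ]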
Conclusion
If you’ve been following this whole series so far, hopefully you’re starting to get a feel for comparing basic algorithms in Haskell and Rust (standing as a proxy for most loop-based languages). In general, we can write loops as recursive functions in Haskell, capturing the “state” of the list as the input parameter for that function.
In particular cases where each iteration deals with exactly one element of an input list, we can employ folds as a tool to simplify our functions. But the two-pointer algorithm we explored today falls into the general recursive category.
To learn the details of understanding these problem solving techniques, take a look at our course, Solve.hs! You’ll learn everything from basic loop and list techniques, to advanced data structures and algorithms!
Spatial Reasoning with Zigzag Patterns!
Today we’re continuing our study of Rust and Haskell solutions to basic coding problems. This algorithm is going to be a little harder than the last few we’ve done in this series, and it will get trickier from here!
For a complete study of problem solving techniques in Haskell, make sure to check out Solve.hs. This course runs the gamut from basic solving techniques to advanced data structures and algorithms, so you’ll learn a lot!
The Problem
Today’s problem is Zigzag Conversion. This is an odd problem that stretches your ability to think iteratively and spatially. The idea is that you’re given an input string and a number of “rows”. You need to then imagine the input word written as a zig-zag pattern, where you write the letters in order first going down, and then diagonally up to the right until you get back to the first row. Then it goes down again. Your output must be characters re-ordered in “row-order” after this zig-zag rearrangement.
This makes the most sense looking at examples. Let’s go through several variations with the string MONDAYMORNINGHASKELL
. Here’s what it looks like with 3 rows.
M   A   R   G   K
O D Y O N N H S E L
N   M   I   A   L
So to get the answer, we read along the top line first (MARGK
), then the second (ODYONNHSEL
), and then the third (NMIAL
). So the final answer is MARGKODYONNHSELNMIAL
.
Now let’s look at the same string in 4 rows:
M     M     G     L
O   Y O   N H   E L
N A   R I   A K
D     N     S
The answer here is MMGLOYONHELNARIAKDNS
.
Here’s 5 rows:
M       R       K
O     O N     S E
N   M   I   A   L
D Y     N H     L
A       G
The answer here is MRKOONSENMIALDYNHLAG
.
And now that we have the pattern, we can also consider 2 rows, which doesn’t visually look like a zig-zag as much:
M N A M R I G A K L
O D Y O N N H S E L
This gives the answer MNAMRIGAKLODYONNHSEL
.
Finally, if there’s only 1 row, you can simply return the original string.
The Algorithm
So how do we go about solving this? The algorithm here is a bit more involved than the last few weeks!
Our output order is row-by-row, so for our solution we should think in a row-by-row fashion. If we can devise a function that will determine the indices of the original string that belong in each row, then we can simply loop over the rows and append these results!
In order to create this function, we have to think about the zig-zag in terms of “cycles”. Each cycle begins at the top row, goes down to the bottom row, and then up diagonally to the second row. The next element to go at the top row starts a new cycle. By thinking about cycles, we’ll discover a few key facts:
- With n rows (n >= 2), a complete cycle has 2n - 2 letters.
- The top and bottom row get one letter per cycle.
- All other rows get two letters per cycle.
Now we can start to think mathematically about the indices that belong in each row. It’s easiest to think about the top and bottom rows, since they only get one letter each cycle. Each of these has a starting index (0
and n - 1
, respectively), and then we add the cycle length 2n - 2
to these starting indices until it exceeds the length.
The middle rows have this same pattern, only now they have 2 starting indices. They have the starting index from the “down” direction and then their first index going up and to the right. For the row with index i, the first (“down”) index is obviously i itself, but the second index is harder to see.
The easiest way to find the second index is backwards! The next cycle starts at 2n - 2
. So row index 1
has its second index at 2n - 2 - 1
, and row index 2
has its second index at 2n - 2 - 2
, and so on! The pattern of adding the “cycle number” will work for all starting indices.
Once we have the indices for each row, our task is simple. We build a string for each row and combine them together in order.
So suppose we have our 4-row example.
M     M     G     L
O   Y O   N H   E L
N A   R I   A K
D     N     S
The “cycle num” is 6 (2 * 4 - 2). So the first row has indices [0, 6, 12, 18]
. The fourth row starts with index 3, and so its indices also go up by 6 each time: [3, 9, 15]
.
The second row (index 1) has starting indices 1 and 5 (6 - 1
). So its indices are [1, 5, 7, 11, 13, 17, 19]
. Then the third row has indices [2, 4, 8, 10, 14, 16]
.
A vector input will allow us to efficiently use and combine these indices.
As a final note, the “cycle num” logic doesn’t end up working with only 1 row. The cycle length using our calculation would be 0, not 1 as it should. The discrepancy is because our “cycle num” logic really depends on having a “first” and “last” row. So if we only have 1 row, we’ll hardcode that case and return the input string.
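Before jumping into the implementations, here's a small Haskell sketch of this index math. The helper rowIndices is illustrative only (and assumes numRows >= 2, since we handle 1 row separately):
import Data.List (sort)

-- Indices of an input of length n that land in the given row (0-indexed),
-- using the cycle math described above.
rowIndices :: Int -> Int -> Int -> [Int]
rowIndices n numRows row =
  let cycleLen = 2 * numRows - 2
      starts = if row == 0 || row == numRows - 1
                 then [row]
                 else [row, cycleLen - row]
  in sort (concatMap (\s -> [s, s + cycleLen .. n - 1]) starts)
-- For the 4-row example (n = 20):
-- rowIndices 20 4 0 == [0,6,12,18]
-- rowIndices 20 4 1 == [1,5,7,11,13,17,19]
-- rowIndices 20 4 2 == [2,4,8,10,14,16]
-- rowIndices 20 4 3 == [3,9,15]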
Rust Solution
In our rust solution, we’ll accumulate our result string in place. To accomplish this we’ll do a few setup steps:
- Handle our base case (1 row)
- Get the string length and cycle number
- Make a vector of the input chars for easy indexing (Rust doesn’t allow string indexing)
- Initialize our mutable result string
pub fn convert(s: String, num_rows: i32) -> String {
if (num_rows == 1) {
return s;
}
let n = s.len();
let nr = num_rows as usize; // Convenience for comparison
let cycleLen: usize = (2 * nr - 2);
let sChars: Vec<char> = s.chars().collect();
let mut result = String::new();
...
}
Now we have to add the rows in order. Since the logic differs for the first and last rows, we have 3 sections: first row, middle rows, and last row. The first and last row are straightforward using our algorithm. Each is a simple while loop.
pub fn convert(s: String, num_rows: i32) -> String {
if (num_rows == 1) {
return s;
}
let n = s.len();
let nr = num_rows as usize; // Convenience for comparison
let cycleLen: usize = (2 * nr - 2);
let sChars: Vec<char> = s.chars().collect();
let mut result = String::new();
// First Row
let mut i = 0;
while i < n {
result.push(sChars[i]);
i += cycleLen;
}
// Middle Rows
...
// Last Row
i = (nr - 1);
while i < n {
result.push(sChars[i]);
i += cycleLen;
}
return result;
}
Now the middle rows section is similar. We loop through each of the possible rows in the middle. For each of these, we’ll do a while loop similar to the first and last row. These loops are different though, because we have to track two possible values, the “first” and “second” of each cycle.
If the “first” is already past the end of the vector, then we’re already done and can skip the loop. But even if not, we still need an “if check” on the “second” value as well. Each time through the loop, we increase both values by cycleLen
.
pub fn convert(s: String, num_rows: i32) -> String {
if (num_rows == 1) {
return s;
}
let n = s.len();
let nr = num_rows as usize; // Convenience for comparison
let cycleLen: usize = (2 * nr - 2);
let sChars: Vec<char> = s.chars().collect();
let mut result = String::new();
// First Row
let mut i = 0;
while i < n {
result.push(sChars[i]);
i += cycleLen;
}
// Middle Rows
for row in 1..(nr - 1) {
let mut first = row;
let mut second = cycleLen - row;
while first < n {
result.push(sChars[first]);
if second < n {
result.push(sChars[second]);
}
first += cycleLen;
second += cycleLen;
}
}
// Last Row
i = (nr - 1);
while i < n {
result.push(sChars[i]);
i += cycleLen;
}
return result;
}
And that’s our complete solution!
Haskell Solution
The Haskell solution follows the same algorithm, but we’ll make a few stylistic changes compared to Rust. In Haskell, we’ll go ahead and define specific lists of indices for each row. That way, we can combine these lists and make our final string all at once using concatMap
. This approach will let us demonstrate the power of ranges in Haskell.
We start our defining our base case and core parameters:
zigzagConversion :: String -> Int -> String
zigzagConversion input numRows = if numRows == 1 then input
  else ...
  where
    n = length input
    cycleLen = 2 * numRows - 2
    ...
Now we can define index-lists for the first and last rows. These are just ranges! We have the starting element, and we know to increment it by cycleLen
. The range should go no higher than n - 1
. Funny enough, the range can figure out that it should be empty in the edge case that our input is too small to fill all the rows!
zigzagConversion :: String -> Int -> String
zigzagConversion input numRows = if numRows == 1 then input
  else ...
  where
    n = length input
    cycleLen = 2 * numRows - 2
    firstRow :: [Int]
    firstRow = [0,cycleLen..n - 1]
    lastRow :: [Int]
    lastRow = [numRows - 1, numRows - 1 + cycleLen..n - 1]
    ...
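To see that empty-range behavior concretely, here’s a small worked example of my own (not an input from the problem itself), evaluating these ranges for a 2-character input with 3 rows:
-- Hypothetical example: input "AB", numRows = 3
-- n = 2, cycleLen = 2 * 3 - 2 = 4
firstRow = [0,4..1]  -- [0]
lastRow = [2,6..1]   -- [] (empty: the starting index is already past n - 1)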
In Rust, we used a while loop with two state values to calculate the middle rows. Hopefully you know from this series by now that such a while loop translates into a recursive function in Haskell. We’ll accumulate our list of indices in an accumulator argument, and keep the two stateful values as our other input parameters. We’ll combine all our lists together into one big list of int-lists, allRows
.
zigzagConversion :: String -> Int -> String
zigzagConversion input numRows = if numRows == 1 then input
  else ...
  where
    n = length input
    cycleLen = 2 * numRows - 2
    firstRow :: [Int]
    firstRow = [0,cycleLen..n - 1]
    lastRow :: [Int]
    lastRow = [numRows - 1, numRows - 1 + cycleLen..n - 1]
    middleRow :: Int -> Int -> [Int] -> [Int]
    middleRow first second acc = if first >= n then reverse acc
      else if second >= n then reverse (first : acc)
      else middleRow (first + cycleLen) (second + cycleLen) (second : first : acc)
    middleRows :: [[Int]]
    middleRows = map (\i -> middleRow i (cycleLen - i) []) [1..numRows-2]
    allRows :: [[Int]]
    allRows = firstRow : middleRows <> [lastRow]
    ...
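To convince ourselves that middleRow produces the right indices, here’s a short trace of my own (assuming a 14-character input with 4 rows, so cycleLen is 6) for the first middle row:
-- Hypothetical trace: n = 14, cycleLen = 6
middleRow 1 5 []
-- first = 1,  second = 5:  both in range, acc becomes [5,1]
-- first = 7,  second = 11: both in range, acc becomes [11,7,5,1]
-- first = 13, second = 17: second is past the end, so we keep only first
-- result: reverse (13 : [11,7,5,1]) = [1,5,7,11,13]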
Now we bring it all together with one final step. We make a vector from our input, and define a function to turn a single int-list into a single String. Then at the top level of our function (the original else
branch), we use concatMap
to bring these together into our final result String.
import qualified Data.Vector as V

zigzagConversion :: String -> Int -> String
zigzagConversion input numRows = if numRows == 1 then input
  else concatMap rowIndicesToString allRows
  where
    n = length input
    cycleLen = 2 * numRows - 2
    firstRow :: [Int]
    firstRow = [0,cycleLen..n - 1]
    lastRow :: [Int]
    lastRow = [numRows - 1, numRows - 1 + cycleLen..n - 1]
    middleRow :: Int -> Int -> [Int] -> [Int]
    middleRow first second acc = if first >= n then reverse acc
      else if second >= n then reverse (first : acc)
      else middleRow (first + cycleLen) (second + cycleLen) (second : first : acc)
    middleRows :: [[Int]]
    middleRows = map (\i -> middleRow i (cycleLen - i) []) [1..numRows-2]
    allRows :: [[Int]]
    allRows = firstRow : middleRows <> [lastRow]
    inputV :: V.Vector Char
    inputV = V.fromList input
    rowIndicesToString :: [Int] -> String
    rowIndicesToString = map (inputV V.!)
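As a quick sanity check in GHCi, using the standard LeetCode examples for this problem (the expected strings below come from those examples):
zigzagConversion "PAYPALISHIRING" 3  -- "PAHNAPLSIIGYIR"
zigzagConversion "PAYPALISHIRING" 4  -- "PINALSIGYAHRPI"
zigzagConversion "A" 1               -- "A" (our hardcoded base case)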
Conclusion
This comparison once again showed how while loops in Rust track with recursive functions in Haskell. We also saw some nifty Haskell features like ranges and tail recursion. Most of all, we saw that even with a trickier algorithm, we can keep the same basic shape of the solution whether we write it in a functional or an imperative style.
To learn more about these problem solving concepts, take a look at Solve.hs, our comprehensive course on problem solving in Haskell. You’ll learn about recursion, list manipulation, data structures, graph algorithms, and so much more!
Starting from the End: Solving “Product Except Self”
Today we continue our series exploring LeetCode problems and comparing Haskell and Rust solutions. We’re staying in the realm of list/vector manipulation, but the problems are going to start getting more challenging!
If you want to learn more about problem solving in Haskell, you should take a closer look at Solve.hs! In particular, you’ll learn how to translate common loop-based ideas into Haskell’s recursive style!
The Problem
Today’s problem is Product of Array Except Self. The idea is that we are given a vector of n integers. We are supposed to return another vector of n integers, where output[i]
is equivalent to the product of all the input integers except for input[i]
.
The key constraint here is that we are not allowed to use division. If we could use division, the answer would be simple! We would find the product of the input numbers and then divide this product by each input number to find the corresponding value. But division is more expensive than most other numeric operations, so we want to avoid it if possible!
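Just to make the contrast concrete, here’s a minimal Haskell sketch of that disallowed division approach (my own illustration, not part of the solutions below). Besides the cost of division, notice that it also breaks down if the input contains a zero:
-- Hypothetical sketch of the *disallowed* division approach
productViaDivision :: [Int] -> [Int]
productViaDivision xs = map (total `div`) xs
  where
    total = product xs

-- productViaDivision [3, 4, 5] == [20, 15, 12]
-- ...but it divides by zero if any element is 0.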
The Algorithm
The approach we’ll use in this article relies on “prefix products” and “suffix products”. We’ll make two separate vectors called prefixes
and suffixes
, where prefixes[i]
is the product of all numbers strictly before index i
, and suffixes[i]
is the product of all numbers strictly after index i
.
Then, we can easily produce our results. The value output[i]
is simply the product of prefixes[i]
and suffixes[i]
.
As an example, our input might be [3, 4, 5]
. The prefixes
vector should be [1, 3, 12]
, and the suffixes
vector should be [20, 5, 1]
. Then our final output should be [20, 15, 12]
.
prefixes: [1, 3, 12]
suffixes: [20, 5, 1]
output: [20, 15, 12]
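As a quick sanity check on these numbers, here’s a minimal Haskell sketch of my own using scanl and scanr (a different formulation from the tail-recursive solution we build below) that reproduces them in GHCi:
xs = [3, 4, 5]
prefixes = init (scanl (*) 1 xs)         -- [1, 3, 12]
suffixes = tail (scanr (*) 1 xs)         -- [20, 5, 1]
output = zipWith (*) prefixes suffixes   -- [20, 15, 12]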
Rust Solution
Here’s our Rust solution:
impl Solution {
    pub fn product_except_self(nums: Vec<i32>) -> Vec<i32> {
        let n = nums.len();
        let mut prefixes = vec![0; n];
        let mut suffixes = vec![0; n];
        let mut totalPrefix = 1;
        let mut totalSuffix = 1;
        // Loop 1: Populate prefixes & suffixes
        for i in 0..n {
            prefixes[i] = totalPrefix;
            totalPrefix *= nums[i];
            suffixes[n - i - 1] = totalSuffix;
            totalSuffix *= nums[n - i - 1];
        }
        let mut results = vec![0; n];
        // Loop 2: Populate results
        for i in 0..n {
            results[i] = prefixes[i] * suffixes[i];
        }
        return results;
    }
}
The two for-loops provide this solution with its shape. The first loop generates our vectors prefixes
and suffixes
. We keep track of a running tally of the totalPrefix
and the totalSuffix
. Each of these is initially 1.
let n = nums.len();
let mut prefixes = vec![0; n];
let mut suffixes = vec![0; n];
let mut totalPrefix = 1;
let mut totalSuffix = 1;
On each iteration, we assign the current “total prefix” to the prefixes
vector in the front index i
, and then the “total suffix” to the suffixes
vector in the back index n - i - 1
. Then we multiply each total value by the input value (nums
) from that index so it’s ready for the next iteration.
// Loop 1: Populate prefixes & suffixes
for i in 0..n {
    prefixes[i] = totalPrefix;
    totalPrefix *= nums[i];
    suffixes[n - i - 1] = totalSuffix;
    totalSuffix *= nums[n - i - 1];
}
And now we calculate the result, by taking the product of prefixes
and suffixes
at each index.
let mut results = vec![0; n];
// Loop 2: Populate results
for i in 0..n {
    results[i] = prefixes[i] * suffixes[i];
}
return results;
Haskell Solution
In Haskell, we can follow this same template. However, a couple differences stand out. First, we don’t use for-loops. We have to use recursion or recursive helpers to accomplish these loops. Second, when constructing prefixes
and suffixes
, we want to use lists instead of modifying mutable vectors.
When performing recursion and accumulating linked lists, it can be tricky to reason about which lists need to be reversed at which points in our algorithm. For this reason, it’s often very helpful in Haskell to start from the end of our algorithm.
Let’s write out a template of our solution that leaves prefixes
and suffixes
as undefined stubs. Then the first step we’ll work through is how to get the solution from that:
productOfArrayExceptSelf :: V.Vector Int -> V.Vector Int
productOfArrayExceptSelf inputs = solution ???
  where
    n = V.length inputs
    solution :: ??? -> V.Vector Int
    prefixes :: [Int]
    prefixes = undefined
    suffixes :: [Int]
    suffixes = undefined
So given prefixes
and suffixes
, how do we find our solution? The ideal case is that both these lists are already in reverse-index order with respect to the input vector (i.e. n - 1
to 0
). Then we don’t need to do an additional reverse to get our solution.
We can then implement solution
as a simple tail recursive helper function that peels one element off each input and multiplies them together. When we’re out of inputs, it returns its result:
productOfArrayExceptSelf :: V.Vector Int -> V.Vector Int
productOfArrayExceptSelf inputs = solution (prefixes, suffixes, [])
  where
    n = V.length inputs
    -- Loop 2: Populate Results
    solution :: ([Int], [Int], [Int]) -> V.Vector Int
    solution ([], [], acc) = V.fromList acc
    solution (p : ps, s : ss, acc) = solution (ps, ss, p * s : acc)
    solution _ = error "Prefixes and suffixes must be the same size!"
    prefixes :: [Int]
    suffixes :: [Int]
So now we’ve done “Loop 2” already, and we just have to implement “Loop 1” so that it produces the right results. Again, we’ll make a tail recursive helper, and this will produce both prefixes
and suffixes
at once. It will take the index, as well as the “total” prefix and suffix so far, and then two accumulator lists. At the end of this, we want both lists in reverse index order.
productOfArrayExceptSelf :: V.Vector Int -> V.Vector Int
productOfArrayExceptSelf inputs = solution (prefixes, suffixes, [])
  where
    n = V.length inputs
    -- Loop 2: Populate Results
    solution :: ([Int], [Int], [Int]) -> V.Vector Int
    prefixes :: [Int]
    suffixes :: [Int]
    (prefixes, suffixes) = mkPrefixSuffix (0, 1, [], 1, [])
    -- Loop 1: Populate prefixes & suffixes
    mkPrefixSuffix :: (Int, Int, [Int], Int, [Int]) -> ([Int], [Int])
    mkPrefixSuffix (i, totalPre, pres, totalSuff, suffs) = undefined
Now we fill in mkPrefixSuffix
as we would any tail recursive helper. First we satisfy the base case. This occurs once i
is at least n
. We’ll return the accumulated lists.
mkPrefixSuffix :: (Int, Int, [Int], Int, [Int]) -> ([Int], [Int])
mkPrefixSuffix (i, totalPre, pres, totalSuff, suffs) = if i >= n then (pres, reverse suffs)
  else ...
But observe that we’ll need to reverse the suffixes
! This becomes clear when we map out what each iteration of the loop looks like for a simple input. Doing this kind of “loop tracking” is a very helpful problem solving skill for walking through your code!
input = [3, 4, 5]
i = 0: (0, 1, [], 1, [])
i = 1: (1, 3, [1], 5, [1])
i = 2: (2, 12, [3, 1], 20, [5, 1])
i = 3: (3, 60, [12, 3, 1], 60, [20, 5, 1])
Our accumulated prefixes are [12, 3, 1], which is already in reverse-index order (the value for index 2 comes first). But the accumulated suffixes are [20, 5, 1], which is in forward-index order: both lists end in 1, even though those 1s belong to opposite ends of the input. Since solution expects both lists in reverse-index order, we reverse the suffixes.
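We can double-check this with a quick GHCi calculation of my own, using zipWith in place of the solution helper:
zipWith (*) [12, 3, 1] [20, 5, 1]            -- [240, 15, 1] (wrong pairing)
zipWith (*) [12, 3, 1] (reverse [20, 5, 1])  -- [12, 15, 20] (values for indices 2, 1, 0)
-- Prepending each product, as 'solution' does, then yields [20, 15, 12].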
Now that we’ve figured this out, it’s simple enough to fill in the recursive case using what we already know from “Loop 1” in the Rust solution. We get the “front” index of inputs
with i
, and the “back” index with n - i - 1
, use these to get the new products, and then save the old products in our list.
mkPrefixSuffix :: (Int, Int, [Int], Int, [Int]) -> ([Int], [Int])
mkPrefixSuffix (i, totalPre, pres, totalSuff, suffs) = if i >= n then (pres, reverse suffs)
  else
    let nextPre = inputs V.! i
        nextSuff = inputs V.! (n - i - 1)
    in mkPrefixSuffix (i + 1, totalPre * nextPre, totalPre : pres, totalSuff * nextSuff, totalSuff : suffs)
Here’s our complete Haskell solution!
import qualified Data.Vector as V

productOfArrayExceptSelf :: V.Vector Int -> V.Vector Int
productOfArrayExceptSelf inputs = solution (prefixes, suffixes, [])
  where
    n = V.length inputs
    -- Loop 2: Populate Results
    solution :: ([Int], [Int], [Int]) -> V.Vector Int
    solution ([], [], acc) = V.fromList acc
    solution (p : ps, s : ss, acc) = solution (ps, ss, p * s : acc)
    solution _ = error "Prefixes and suffixes must be the same size!"
    prefixes :: [Int]
    suffixes :: [Int]
    (prefixes, suffixes) = mkPrefixSuffix (0, 1, [], 1, [])
    -- Loop 1: Populate prefixes & suffixes
    mkPrefixSuffix :: (Int, Int, [Int], Int, [Int]) -> ([Int], [Int])
    mkPrefixSuffix (i, totalPre, pres, totalSuff, suffs) = if i >= n then (pres, reverse suffs)
      else
        let nextPre = inputs V.! i
            nextSuff = inputs V.! (n - i - 1)
        in mkPrefixSuffix (i + 1, totalPre * nextPre, totalPre : pres, totalSuff * nextSuff, totalSuff : suffs)
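A couple of quick GHCi checks against our worked example (these inputs are my own, built with V.fromList just for illustration):
productOfArrayExceptSelf (V.fromList [3, 4, 5])     -- yields [20,15,12]
productOfArrayExceptSelf (V.fromList [1, 2, 3, 4])  -- yields [24,12,8,6]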
Conclusion
In this comparison, we saw a few important differences in problem solving with a loop-based language like Rust compared to Haskell.
- For-loops have to become recursion in Haskell
- We want to use lists in Haskell, not mutable vectors
- It takes a bit of planning to figure out when to reverse lists!
This led us to a couple important insights when solving problems in Haskell.
- “Starting from the end” can be very helpful in plotting out our solution
- “Loop tracking” is a very helpful skill to guide our solutions
For an in-depth look at these sorts of comparisons, check out our Solve.hs course. You’ll learn all the most important tips and tricks for solving coding problems in Haskell! In particular you’ll get an in-depth look at tail recursion, a vital concept for solving problems in Haskell.
Learning from Multiple Solution Approaches
Welcome to the second article in our Rust vs. Haskell problem solving series. Last week we saw some basic differences between Rust loops and Haskell recursion. We also saw how to use the concept of “folding” to simplify a recursive loop function.
This week, we’ll look at another simple problem and consider multiple solutions in each language. We’ll consider what a “basic” solution looks like, using relatively few library functions. Then we’ll consider more “advanced” solutions that make use of library functionality, and greatly simplify the structure of our solutions.
To learn more about problem solving in Haskell, including the importance of list library functions, take a look at our course Solve.hs! You’ll write most of Haskell’s list API from scratch so you get an in-depth understanding of the functions that are available!
The Problem
This week’s problem is Reverse Words in a String. The idea is simple. Our input is a string, which naturally has “words” separated by whitespace. We want to return a string that has all the words reversed! So if the input is ”A quick brown fox”
, the result should be ”fox brown quick A”
.
Notice that all whitespace is truncated in our output. We should only have a single space between words in our answer, with no leading or trailing whitespace.
The Algorithm
The algorithmic idea is simple and hardly needs explanation. We gather characters from the input until we encounter whitespace. Then we append this buffered word to a growing result string, and we keep following this process until we run out of input.
There is one wrinkle, which is whether we want to accumulate our answer in the forward or reverse direction. This changes across languages!
In Haskell, it’s actually more efficient to accumulate the “back” of our resulting string first, meaning we should start by iterating from the front of the input. This is more consistent with linked list construction.
In Rust, we’ll iterate from the back of the input so that we can accumulate our result from the “front”.
Basic Rust Solution
In our basic solution, we’re going to consider a character-by-character approach. As outlined in our algorithm, we can accomplish this task with a single loop, with two stateful values. First, we have the “current” word we’re accumulating of non-whitespace characters. Second, we have the final “result” we’re accumulating.
It’s efficient to append to the end of strings, meaning we want to construct our result from front-to-back. This means we’ll loop through the characters of our string in reverse, as shown with .rev()
here:
pub fn reverse_words(s: String) -> String {
    let mut current = String::new();
    let mut result = String::new();
    for c in s.chars().rev() {
        ...
    }
}
Within the loop, we now just have to consider what to do with each character. If the character is not whitespace, the answer is simple. We just append this character to our “current” word. Because we’re looping through the input in reverse, our “current” word will also be in reverse!
pub fn reverse_words(s: String) -> String {
    let mut current = String::new();
    let mut result = String::new();
    for c in s.chars().rev() {
        if !c.is_whitespace() {
            current.push(c);
        } else {
            ...
        }
    }
}
So what happens when we encounter whitespace? There are a few conditions to consider:
- If “current” is empty, do nothing.
- If “result” is empty, append “current” (in reverse order) to result.
- If “result” is not empty, add a space and then append “current” in reverse.
- Regardless, clear “current” and prepare to gather a new string.
Here’s what the code looks like:
pub fn reverse_words(s: String) -> String {
    let mut current = String::new();
    let mut result = String::new();
    for c in s.chars().rev() {
        if !c.is_whitespace() {
            current.push(c);
        } else {
            // Step 1: Skip if "current" is empty
            if !current.is_empty() {
                // Steps 2/3: Only push a space if "result" is not empty
                if !result.is_empty() {
                    result.push(' ');
                }
                // Steps 2/3: Reverse "current" and append
                for b in current.chars().rev() {
                    result.push(b);
                }
                // Step 4: Clear "current"
                current.clear();
            }
        }
    }
}
There’s one final trick. Unless the input begins with whitespace, we’ll still have a non-empty current
at the end and we will not have appended it. So we do one final check, and once again append “current” in reverse order.
Here’s our final basic solution:
pub fn reverse_words(s: String) -> String {
    let mut current = String::new();
    let mut result = String::new();
    for c in s.chars().rev() {
        if !c.is_whitespace() {
            current.push(c);
        } else {
            // Step 1: Skip if "current" is empty
            if !current.is_empty() {
                // Steps 2/3: Only push a space if "result" is not empty
                if !result.is_empty() {
                    result.push(' ');
                }
                // Steps 2/3: Reverse "current" and append
                for b in current.chars().rev() {
                    result.push(b);
                }
                // Step 4: Clear "current"
                current.clear();
            }
        }
    }
    if !current.is_empty() {
        if !result.is_empty() {
            result.push(' ');
        }
        for b in current.chars().rev() {
            result.push(b);
        }
    }
    return result;
}
Advanced Rust Solution
Looping character-by-character is a bit cumbersome. However, since basic whitespace-related operations are so common, there are some useful library functions for dealing with them.
Rust also prioritizes the ability to chain iterative operations together. This gives us the following one-line solution!
pub fn reverse_words(s: String) -> String {
    s.split_whitespace().rev().collect::<Vec<&str>>().join(" ")
}
It has four stages:
- Split the input based on whitespace.
- Reverse the split-up words.
- Collect these words as a vector of strings.
- Join them together with one space in between them.
What is interesting about this structure is that each stage of the process has a separate type. Step 1 creates a SplitWhitespace
struct. Step 2 creates a Rev
struct. Step 3 then creates a normal vector, and step 4 concludes by producing a string.
The two preliminary structures are essentially wrappers with iterators to help chain the operations together. As we’ll see, the comparable Haskell solution only uses basic lists, and this is a noteworthy difference between the languages.
Basic Haskell Solution
Our “basic” Haskell solution will follow the same outline as the basic Rust solution, but we’ll work in the opposite direction! We’ll loop through the input in forward order, and accumulate our output in reverse order.
Before we even get started though, we can observe from our basic Rust solution that we duplicated some code! The concept of combining the “current” word and the “result” had several edge cases to handle, so let’s write a combine
function to handle these.
-- “current” is reversed and then goes in *front* of result
-- (Rust version put “current” at the back)
combine :: (String, String) -> String
combine (current, res) = if null current then res
  else reverse current <> if null res then "" else (' ' : res)
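For example, folding over the input “A quick” reaches exactly these states, which we can check in GHCi:
combine ("A", "")       -- "A" (first word flushed into an empty result)
combine ("kciuq", "A")  -- "quick A" (reversed word goes in front, with a space)
combine ("", "quick A") -- "quick A" (an empty current leaves the result alone)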
Now let’s think about our loop structure. We are going through the input, character-by-character. This means we should be able to use a fold, like we did last week! Whenever we’re using a fold, we want to think about the “state” we’re passing through each iteration. In our case, the state is the “current” word and the “result” string. This means our folding function should look like this:
loop :: (String, String) -> Char -> (String, String)
loop (current, result) c = ...
Now we just have to distinguish between the “whitespace” case and the non-whitespace case. If we encounter a space, we just combine the current word with the accumulated result. If we encounter a normal character, we append this to our current word (again, accumulating “current” in reverse).
loop :: (String, String) -> Char -> (String, String)
loop (currentWord, result) c = if isSpace c
  then ("", combine (currentWord, result))
  else (c : currentWord, result)
To complete the solution, we call foldl with our loop function, a starting state of two empty strings, and the input. Then we just have to remember to combine the final “current” word with the output! Here’s our complete “basic” solution.
import Data.Char (isSpace)

reverseWords :: String -> String
reverseWords input = combine $ foldl loop ("", "") input
  where
    combine :: (String, String) -> String
    combine (current, res) = if null current then res
      else reverse current <> if null res then "" else (' ' : res)
    loop :: (String, String) -> Char -> (String, String)
    loop (currentWord, result) c = if isSpace c
      then ("", combine (currentWord, result))
      else (c : currentWord, result)
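And a quick GHCi check of my own, with some extra whitespace thrown in to exercise the edge cases:
reverseWords "  A quick   brown fox  "  -- "fox brown quick A"
reverseWords "hello"                    -- "hello"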
Advanced Haskell Solutions
Now that we’ve seen a basic, character-by-character solution in Haskell, we can also consider more advanced solutions that incorporate library functions. The first improvement we can make is to lean on list functions like break
and dropWhile
.
Using break
splits off the first part of a list that does not satisfy a predicate. We’ll use this to gather non-space characters. Then dropWhile
allows us to drop the first series of characters in a list that satisfy a predicate. We’ll use this to get rid of whitespace as we move along!
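If you haven’t seen these before, here are a couple of GHCi examples of my own showing how they behave:
break isSpace "fox jumps"     -- ("fox", " jumps")
dropWhile isSpace "   jumps"  -- "jumps"
break isSpace ""              -- ("", "")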
So we’ll define this solution using a basic recursive loop rather than a fold, because each iteration will consume a variable number of characters. The “state” of this loop will be two strings: the remaining part of the input, and the accumulated result.
Since there’s no “current” word, our base case is easy. If the remaining input is empty, we return the accumulated result.
loop :: (String, String) -> String
loop ([], output) = output
...
Otherwise, we’ll follow this process:
- Separate the first “word” using break isSpace.
- Combine this word with the output (if it’s not null)
- Recurse with the new output, dropping the initial whitespace from the remainder.
Here’s what it looks like:
loop :: (String, String) -> String
loop ([], output) = output
loop (cs, output) =
  -- Step 1: Separate next word from rest
  let (nextWord, rest) = L.break isSpace cs
      -- Step 2: Make new output (account for edge cases)
      -- (Can’t use ‘combine’ from above because we aren’t reversing!)
      newOutput = if null output then nextWord
        else if null nextWord then output
        else nextWord <> (' ' : output)
  -- Drop spaces from remainder and recurse
  in loop (L.dropWhile isSpace rest, newOutput)
And completing the function is as simple as calling this loop with the base inputs:
reverseWords :: String -> String
reverseWords input = loop (input, "")
The Simplest Haskell Solution
The final (and recommended) Haskell solution uses the library functions words
and unwords
. These do exactly what we want for this problem! We separate words based on whitespace using words
, and then join them with a single space with unwords
. All we have to do in between is reverse
.
reverseWords :: String -> String
reverseWords = unwords . reverse . words
This has a similar elegance to the advanced Rust solution, but is much simpler to understand since there are no complex structs or iterators involved. The types of all functions involved simply relate to lists. Here are the signatures, specialized to String
for this problem.
words :: String -> [String]
reverse :: [String] -> [String]
unwords :: [String] -> String
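Seeing the pipeline in GHCi (with an example input of my own) makes the data flow clear:
words "  A quick   brown fox  "                -- ["A","quick","brown","fox"]
unwords (reverse ["A","quick","brown","fox"])  -- "fox brown quick A"
reverseWords "  A quick   brown fox  "         -- "fox brown quick A"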
Conclusion
A simple problem will often have many solutions, but in this case, each of these solutions teaches us something new about the language we’re working with. Working character-by-character helps us understand some of the core mechanics of the language, showing us how it works under the hood. But using library functions helps us see the breadth of available options we have for simplifying future code we write.
In our Solve.hs course, you’ll go through all of these steps with Haskell. You’ll implement list library functions, data structures, and algorithms from scratch so you understand how they work under the hood. Then, you’ll know they exist and be able to apply them to efficiently solve harder problems. Take a look at the course today!