James Bowen 6/30/25

Buffer & Save with a Challenging Example

Welcome back to our series comparing LeetCode problems in Haskell and Rust. Today we’ll learn a new paradigm that I call “Buffer and Save”. This will also be the hardest problem we’ve done so far! The core loop structure isn’t that hard, but there are a couple layers of tricks to massage our data to get the final answer.

This will be the last problem we do that focuses strictly on string and list manipulation. The next set of problems we do will all rely on more advanced data structures or algorithmic ideas.

For more complete practice on problem solving in Haskell, check out Solve.hs, our newest course. This course will teach you everything you need to know about problem solving, data structures, and algorithms in Haskell. You’ll get loads of practice building structures and algorithms from scratch, which is very important for understanding and remembering how they work.

The Problem

Today’s problem is Text Justification. The idea here is that we are taking a list of words and a “maximum width” and printing out the words grouped into equal-width lines that are evenly spaced. Here’s an example input and output:

Example Input (list of 9 strings):
[“Study”, “Haskell”, “with”, “us”, “every”, “Monday”, “Morning”, “for”, “fun”]
Max Width: 16

Output (list of 4 strings):
“Study    Haskell”
“with   us  every”
“Monday   Morning”
“for fun         ”

There are a few notable rules, constraints, and edge cases. Here’s a list to summarize them:

There is at least one word
No word is larger than the max width
All output strings must have max width as their length (including spaces)
The first word of every line is set to the left
The last line always has 1 space between words, and then enough spaces after the last word to reach the max width.
All other lines with multiple words will align the final word all the way to the right
The spaces in non-final lines are distributed as evenly as possible, but extra spaces go between words to the left.

The final point is potentially the trickiest to understand. Consider the second line above, with us every. The max width is 16, and we have 3 words with a total of 11 characters. This leaves us 5 spaces. Having 3 words means 2 blanks, so the “left” blank gets 3 spaces and the “right” blank gets 2 spaces.

If you had a line with 5 words, a max width of 30, and 16 characters, you would place 4 spaces in the left two blanks, and 3 spaces in the right two blanks. The relative length of the words does not matter.

Words in Line: [“A”, “good”, “day”, “to”, “endure”]

Output Line:
“A    good    day   to   endure”

The Algorithm

As mentioned above, our main algorithmic idea could be called “buffer and save”. We’ve been defining all of our loops based on the state we must maintain between iterations of the loop. The buffer and save approach highlights two pieces of state for us:

The strings we’ve accumulated for our answer so far (the “result”)
A buffer of the strings in the “current” line we’re building.

So we’ll loop through the input words one at a time. We’ll consider if the next word can be added to the “current” line. If it would cause our current line to exceed the maximum width, we’ll “save” our current line and write it out to the “result” list, adding the required spaces.

To help our calculations, we’ll also include two other pieces of state in our loop:

The number of characters in our “current” line
The number of words in our “current” line
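To make the loop state concrete, here is a minimal Rust sketch of just the “buffer and save” part, leaving out the spacing step. It only groups words into lines. The name group_words and its signature are assumptions made for this illustration, not part of the solutions later in this post.

```rust
/// Groups words into lines without justifying them yet.
/// `group_words` is a hypothetical helper used only for illustration.
fn group_words<'a>(words: &[&'a str], max_width: usize) -> Vec<Vec<&'a str>> {
    let mut result: Vec<Vec<&'a str>> = Vec::new(); // lines saved so far
    let mut buffer: Vec<&'a str> = Vec::new(); // words in the current line
    let mut chars_in_line = 0; // letters only, spaces not counted

    for &word in words {
        // Every word already in the buffer needs at least one space after it.
        if word.len() + chars_in_line + buffer.len() > max_width {
            // "Save": move the full buffer into the result and start a new line.
            result.push(std::mem::take(&mut buffer));
            chars_in_line = 0;
        }
        buffer.push(word);
        chars_in_line += word.len();
    }
    // The last line is still in the buffer when the loop ends.
    if !buffer.is_empty() {
        result.push(buffer);
    }
    result
}
```

With the example input and a max width of 16, this groups the words as Study/Haskell, with/us/every, Monday/Morning and for/fun. The number of words is read from buffer.len() here, so only the character count is tracked separately.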

Finally, there’s the question of how to construct each output line. Combining the math with list-mechanics is a little tricky. But the central idea consists of 4 simple steps:

Find the number of spaces (subtract number of characters from max width)
Divide the number of spaces by the number of “blanks” (number of words - 1)
The quotient is the “base” number of spaces per blank
The remainder is the number of blanks (starting from the left) that get an extra space
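These four steps can be sketched in isolation as a small Rust function that returns the width of each blank from left to right. The name gap_widths is hypothetical, and the sketch assumes a non-final line with at least two words.

```rust
/// Returns the width of each blank, left to right, for a non-final line.
/// `gap_widths` is a hypothetical helper; it assumes `num_words >= 2`
/// and `chars_in_line <= max_width`.
fn gap_widths(max_width: usize, chars_in_line: usize, num_words: usize) -> Vec<usize> {
    let blanks = num_words - 1;
    let spaces = max_width - chars_in_line;
    let base = spaces / blanks; // every blank gets at least this many spaces
    let extra = spaces % blanks; // the leftmost `extra` blanks get one more
    (0..blanks)
        .map(|i| if i < extra { base + 1 } else { base })
        .collect()
}
```

For the two examples from the problem description, gap_widths(16, 11, 3) gives [3, 2] and gap_widths(30, 16, 5) gives [4, 4, 3, 3].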

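The exact implementation of this idea differs between Haskell and Rust. Again, this comes down to ordering: we build the Haskell buffer by prepending to a list, so its words end up in reverse order, while a Rust vector pushes to the back and keeps them in reading order.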

The final line has a slightly different (but easier) process. And we should note that the final line will still be in our buffer when we exit the loop! So we shouldn’t forget to add it to the result.

Haskell Solution

We know enough now to jump into our Haskell solution. Our solution should be organized around a loop. Since we go through the input word-by-word, this should follow a fold pattern. So here’s our outline:

justifyText :: [String] -> Int -> [String]
justifyText inputWords maxWidth = ...
  where
    -- f = ‘final’
    (fLine, fWordsInLine, fCharsInLine, result) = foldl loop ([], 0, 0, []) inputWords

    loop :: ([String], Int, Int, [String]) -> String -> ([String], Int, Int, [String])
    loop (currentLine, wordsInLine, charsInLine, currResult) newWord = ...

Let’s focus in on the choice we have to make in the loop. We need to determine if this new word fits in our current line. So we’ll get its length and add it to the number of characters in the line AND consider the number of words in the line. We count the words too since each word we already have requires at least one space!

-- (maxWidth is still in scope here)
loop :: ([String], Int, Int, [String]) -> String -> ([String], Int, Int, [String])
loop (currentLine, wordsInLine, charsInLine, currResult) newWord =
  let newWordLen = length newWord
  in  if newWordLen + charsInLine + wordsInLine > maxWidth
        then ...
        else ...

How do we fill in these choices? If we don’t overflow the line, we just append the new word, bump the count of the words, and add the new word’s length to the character count.

loop :: ([String], Int, Int, [String]) -> String -> ([String], Int, Int, [String])
loop (currentLine, wordsInLine, charsInLine, currResult) newWord =
  let newWordLen = length newWord
  in  if newWordLen + charsInLine + wordsInLine > maxWidth
        then ...
        else (newWord : currentLine, wordsInLine + 1, charsInLine + newWordLen, currResult)

The overflow case isn’t hard, but it does require us to have a function that can convert our current line into the final string. This function will also take the number of words and characters in this line. Assuming this function exists, we just make this new line, append it to result, and then reset our other stateful values so that they only reflect the “new word” as part of our current line.

loop :: ([String], Int, Int, [String]) -> String -> ([String], Int, Int, [String])
loop (currentLine, wordsInLine, charsInLine, currResult) newWord =
  let newWordLen = length newWord
      resultLine = makeLine currentLine wordsInLine charsInLine
  in  if newWordLen + charsInLine + wordsInLine > maxWidth
        then ([newWord], 1, newWordLen, resultLine : currResult)
        else (newWord : currentLine, wordsInLine + 1, charsInLine + newWordLen, currResult)

makeLine :: [String] -> Int -> Int -> String
makeLine = ...

Before we think about the makeLine implementation though, we just about have enough to fill in the rest of the “top” of our function definition. We’d just need another function for making the “final” line, since this is different from other lines. Then when we get our “final” state values, we’ll plug them into this function to get our final line, append this to the result, and reverse it all.

justifyText :: [String] -> Int -> [String]
justifyText inputWords maxWidth = 
  reverse (makeLineFinal fLine fWordsInLine fCharsInLine : result)
  where
    (fLine, fWordsInLine, fCharsInLine, result) = foldl loop ([], 0, 0, []) inputWords

    loop :: ([String], Int, Int, [String]) -> String -> ([String], Int, Int, [String])
    loop (currentLine, wordsInLine, charsInLine, currResult) newWord =
      let newWordLen = length newWord
          resultLine = makeLine currentLine wordsInLine charsInLine
      in  if newWordLen + charsInLine + wordsInLine > maxWidth
            then ([newWord], 1, newWordLen, resultLine : currResult)
            else (newWord : currentLine, wordsInLine + 1, charsInLine + newWordLen, currResult)

    makeLine :: [String] -> Int -> Int -> String
    makeLine = ...

    makeLineFinal :: [String] -> Int -> Int -> String
    makeLineFinal = ...

Now let’s discuss forming these lines, starting with the general case. We can start with a couple edge cases. This should never be called with an empty list. And with a singleton, we just left-align the word and add the right number of spaces:

makeLine :: [String] -> Int -> Int -> String
makeLine [] _ _ = error "Cannot makeLine with empty string!"
makeLine [onlyWord] _ charsInLine =
  let extraSpaces = replicate (maxWidth - charsInLine) ' '
  in  onlyWord <> extraSpaces
makeLine (first : rest) wordsInLine charsInLine = ...

Now we’ll calculate the quotient and remainder to get the spacing sizes, as mentioned in our algorithm section. But how do we combine them? There are multiple ways, but the idea I thought of was to zip the tail of the list with the number of spaces it needs to append. Then we can fold it into a resulting list using a function like this:

-- (String, Int) is the next string and the number of spaces after it
combine :: String -> (String, Int) -> String
combine suffix (nextWord, numSpaces) =
  nextWord <> replicate numSpaces ' ' <> suffix

Remember while doing this that we’ve accumulated the words for each line in reverse order. So we want to append each one in succession, together with the number of spaces that come after it.

To use this function, we can “fold” over the “tail” of our current line, while using the first word in our list as the base of the fold! Don’t forget the quotRem math going on in here!

makeLine :: [String] -> Int -> Int -> String
makeLine [] _ _ = error "Cannot makeLine with empty string!"
makeLine [onlyWord] _ charsInLine =
  let extraSpaces = replicate (maxWidth - charsInLine) ' '
  in  onlyWord <> extraSpaces
makeLine (first : rest) wordsInLine charsInLine =
  let (baseNumSpaces, numWithExtraSpace) = quotRem (maxWidth - charsInLine) (wordsInLine - 1)
      baseSpaces = replicate (wordsInLine - 1 - numWithExtraSpace) baseNumSpaces
      extraSpaces = replicate numWithExtraSpace (baseNumSpaces + 1)
      wordsWithSpaces = zip rest (baseSpaces <> extraSpaces)
  in  foldl combine first wordsWithSpaces

combine :: String -> (String, Int) -> String
combine suffix (nextWord, numSpaces) =
  nextWord <> replicate numSpaces ' ' <> suffix

To make the final line, we can also leverage our combine function! It’s just a matter of combining each word in our input with the appropriate number of spaces. In this case, almost every word gets 1 space except for the last one (which comes first in our list). This just gets however many trailing spaces we need!

makeLineFinal :: [String] -> Int -> Int -> String
makeLineFinal [] _ _ = error "Cannot makeLine with empty string!"
makeLineFinal strs wordsInLine charsInLine =
  let trailingSpaces = maxWidth - charsInLine - (wordsInLine - 1)
  in  foldl combine "" (zip strs (trailingSpaces : repeat 1))

Putting all these pieces together, we have our complete solution!

justifyText :: [String] -> Int -> [String]
justifyText inputWords maxWidth = 
  reverse (makeLineFinal fLine fWordsInLine fCharsInLine : result)
  where
    (fLine, fWordsInLine, fCharsInLine, result) = foldl loop ([], 0, 0, []) inputWords

    loop :: ([String], Int, Int, [String]) -> String -> ([String], Int, Int, [String])
    loop (currentLine, wordsInLine, charsInLine, currResult) newWord =
      let newWordLen = length newWord
          resultLine = makeLine currentLine wordsInLine charsInLine
      in  if newWordLen + charsInLine + wordsInLine > maxWidth
            then ([newWord], 1, newWordLen, resultLine : currResult)
            else (newWord : currentLine, wordsInLine + 1, charsInLine + newWordLen, currResult)

    makeLine :: [String] -> Int -> Int -> String
    makeLine [] _ _ = error "Cannot makeLine with empty string!"
    makeLine [onlyWord] _ charsInLine =
      let extraSpaces = replicate (maxWidth - charsInLine) ' '
      in  onlyWord <> extraSpaces
    makeLine (first : rest) wordsInLine charsInLine =
      let (baseNumSpaces, numWithExtraSpace) = quotRem (maxWidth - charsInLine) (wordsInLine - 1)
          baseSpaces = replicate (wordsInLine - 1 - numWithExtraSpace) baseNumSpaces
          extraSpaces = replicate numWithExtraSpace (baseNumSpaces + 1)
          wordsWithSpaces = zip rest (baseSpaces <> extraSpaces)
      in  foldl combine first wordsWithSpaces

    makeLineFinal :: [String] -> Int -> Int -> String
    makeLineFinal [] _ _ = error "Cannot makeLine with empty string!"
    makeLineFinal strs wordsInLine charsInLine =
      let trailingSpaces = maxWidth - charsInLine - (wordsInLine - 1)
      in  foldl combine "" (zip strs (trailingSpaces : repeat 1))

    combine :: String -> (String, Int) -> String
    combine suffix (nextWord, numSpaces) = nextWord <> replicate numSpaces ' ' <> suffix

Rust Solution

Now let’s put together our Rust solution. Since we have a reasonable outline from writing this in Haskell, let’s start with the simpler elements, makeLine and makeLineFinal. We’ll use library functions as much as possible for the string manipulation. For example, we can start makeLineFinal by using join on our input vector of strings.

pub fn make_line_final(
        currentLine: &Vec<&str>,
        max_width: usize,
        charsInLine: usize) -> String {
    let mut result = currentLine.join(" ");
    ...
}

Now we just need to calculate the number of trailing spaces, subtracting the number of characters in the joined string. We append this to the end by taking a blank space and using repeat for the correct number of times.

pub fn make_line_final(
        currentLine: &Vec<&str>,
        max_width: usize,
        charsInLine: usize) -> String {
    let mut result = currentLine.join(" ");
    let trailingSpaces = max_width - result.len();
    result.push_str(&" ".repeat(trailingSpaces));
    return result;
}

For those unfamiliar with Rust, the type of our input vector might seem odd. When we have &Vec<&str>, this means a reference to a vector of string slices. String slices are portions of a String that we hold a reference to, but they aren’t copied. However, when we join them, we make a new String result.

Also note that we aren’t passing wordsInLine as a separate parameter. We can get this value using .len() in constant time in Rust. In Haskell, length is O(n) so we don’t want to always do that.

Now for the general make_line function, we have the same type signature, but we start with our base case, where we only have one string in our current line. Again, we use repeat with the number of spaces.

pub fn make_line(
        currentLine: &Vec<&str>,
        max_width: usize,
        charsInLine: usize) -> String {
    let mut result = String::new();
    let n = currentLine.len();
    if (n == 1) {
        result.push_str(currentLine[0]);
        result.push_str(&" ".repeat(max_width - charsInLine));
        return result;
    }
    ...
}

Now we do the “math” portion of this. Rust doesn’t have a single quotRem function in its base library, so we calculate these values separately.

pub fn make_line(
        currentLine: &Vec<&str>,
        max_width: usize,
        charsInLine: usize) -> String {
    let mut result = String::new();
    let n = currentLine.len();
    if (n == 1) {
        result.push_str(currentLine[0]);
        result.push_str(&" ".repeat(max_width - charsInLine));
        return result;
    }
    let numSpaces = (max_width - charsInLine);
    let baseNumSpaces = numSpaces / (n - 1);
    let numWithExtraSpace = numSpaces % (n - 1);
    let mut i = 0;
    while i < n {
        ...
    }
    return result;
}

The while loop we’ll write here is instructive. We use an index instead of a for each pattern because the index tells us how many spaces to use. If our index is smaller than numWithExtraSpace, we add 1 to the base number of spaces. Otherwise we use the base number, up to (but not including) index n - 1. The word at index n - 1 is the last one on the line, so nothing follows it and we’re done at that point!

pub fn make_line(
        currentLine: &Vec<&str>,
        max_width: usize,
        charsInLine: usize) -> String {
    let mut result = String::new();
    let n = currentLine.len();
    if (n == 1) {
        result.push_str(currentLine[0]);
        result.push_str(&" ".repeat(max_width - charsInLine));
        return result;
    }
    let numSpaces = (max_width - charsInLine);
    let baseNumSpaces = numSpaces / (n - 1);
    let numWithExtraSpace = numSpaces % (n - 1);
    let mut i = 0;
    while i < n {
        result.push_str(currentLine[i]);
        if i < numWithExtraSpace {
            result.push_str(&" ".repeat(baseNumSpaces + 1));
        } else if i < n - 1 {
            result.push_str(&" ".repeat(baseNumSpaces));
        }
        i += 1;
    }
    return result;
}

Now we frame our solution. Let’s start by setting up our state variables (again, omitting wordsInLine). We’ll also redefine max_width as a usize value for ease of comparison later.

pub fn full_justify(words: Vec<String>, max_width: i32) -> Vec<String> {
    let mut currentLine = Vec::new();
    let mut charsInLine = 0;
    let mut result = Vec::new();
    let mw = max_width as usize;
    ...
}

Now we’d like to frame our solution as a “for each” loop. However, this doesn’t work, for Rust-related reasons we’ll describe after the solution! Instead, we’ll use an index loop.

pub fn full_justify(words: Vec<String>, max_width: i32) -> Vec<String> {
    let mut currentLine = Vec::new();
    let mut charsInLine = 0;
    let mut result = Vec::new();
    let mw = max_width as usize;
    let n = words.len();
    for i in 0..n {
        ...
    }
}

We’ll get the word by index on each iteration, and use its length to see if we’ll exceed the max width. If not, we can safely push it onto currentLine and increase the character count:

pub fn full_justify(words: Vec<String>, max_width: i32) -> Vec<String> {
    let mut currentLine = Vec::new();
    let mut charsInLine = 0;
    let mut result = Vec::new();
    let mw = max_width as usize;
    let n = words.len();
    for i in 0..n {
        let word = &words[i];
        if word.len() + charsInLine + currentLine.len() > mw {
            ...
        } else {
            currentLine.push(&words[i]);
            charsInLine += word.len();
        }
    }
}

Now when we do exceed the max width, we have to push our current line onto result (calling make_line). We clear the current line, push our new word, and use its length for charsInLine.

pub fn full_justify(words: Vec<String>, max_width: i32) -> Vec<String> {
    let mut currentLine = Vec::new();
    let mut charsInLine = 0;
    let mut result = Vec::new();
    let mw = max_width as usize;
    let n = words.len();
    for i in 0..n {
        let word = &words[i];
        if word.len() + charsInLine + currentLine.len() > mw {
            result.push(make_line(&currentLine, mw, charsInLine));
            currentLine.clear();
            currentLine.push(&words[i]);
            charsInLine = word.len();
        } else {
            currentLine.push(&words[i]);
            charsInLine += word.len();
        }
    }
    ...
}

After our loop, we’ll just call make_line_final on whatever is left in our currentLine! Here’s our complete full_justify function that calls make_line and make_line_final as we wrote above:

pub fn full_justify(words: Vec<String>, max_width: i32) -> Vec<String> {
    let mut currentLine = Vec::new();
    let mut charsInLine = 0;
    let mut result = Vec::new();
    let mw = max_width as usize;
    let n = words.len();
    for i in 0..n {
        let word = &words[i];
        if word.len() + charsInLine + currentLine.len() > mw {
            result.push(make_line(&currentLine, mw, charsInLine));
            currentLine.clear();
            currentLine.push(&words[i]);
            charsInLine = word.len();
        } else {
            currentLine.push(&words[i]);
            charsInLine += word.len();
        }
    }
    result.push(make_line_final(&currentLine, mw, charsInLine));
    return result;
}

Why an Index Loop?

Inside our Rust loop, we have an odd pattern in getting the “word” for this iteration. We first assign word = &words[i], and then later on, when we push that word, we reference words[i] again, using currentLine.push(&words[i]).

Why do this? Why not currentLine.push(word)? And then, why can’t we just do for word in words as our loop?

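If we write our loop as for word in words, the loop takes ownership of the vector and moves each String into word. That String is dropped at the end of its iteration, so it is “scoped” to the loop body. However, currentLine “outlives” the loop! We have to reference currentLine at the end when we make our final line, so it cannot hold references to values that are already gone.

One way around this is to copy each word instead of using a string reference &str, but this is unnecessarily expensive. Borrowing from the vector avoids the copy. Indexing with &words[i] does this, and iterating over a borrow with for word in &words would do the same. (Inside the index loop, word is already a reference into words, so currentLine.push(word) would compile as well.)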

These are the sorts of odd “lifetime” quirks you have to learn to deal with in Rust. Haskell is easier in that it spares us from thinking about this. But Rust gains a significant performance boost with these sorts of ideas.
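As a small illustration of the lifetime point, here is a sketch of a for-each loop that iterates over borrowed words, so the buffer can still be used after the loop. The helper last_line is hypothetical and only returns the unpadded final line. It takes a slice, so with the owned words vector from full_justify the call would be last_line(&words, mw).

```rust
/// Returns the unpadded last line, using a for-each loop over borrowed words.
/// `last_line` is a hypothetical helper written only to show the borrow.
fn last_line(words: &[String], max_width: usize) -> String {
    let mut current_line: Vec<&str> = Vec::new();
    let mut chars_in_line = 0;
    // Iterating over a borrowed slice yields `&String` values that live
    // as long as `words` itself, not just for one iteration.
    for word in words {
        if word.len() + chars_in_line + current_line.len() > max_width {
            // A real solution would save the justified line here.
            current_line.clear();
            chars_in_line = 0;
        }
        current_line.push(word.as_str());
        chars_in_line += word.len();
    }
    // `current_line` is still valid here, after the loop has finished.
    current_line.join(" ")
}
```

No String is cloned until join builds the output, which matches the cost of the index-based version.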

Conclusion

This was definitely the most involved problem we’ve dealt with so far. We learned a new paradigm (buffer and save), and got some experience dealing with some of the odd quirks and edge cases of string manipulation, especially in Rust. It was a fairly tricky problem, as far as list manipulation goes. For an easier example of a buffer and save problem, try solving Merge Intervals.

If you want to level up your Haskell problem solving skills, you need to take our course Solve.hs. This course will teach you everything you need to know about problem solving, data structures, and algorithms in Haskell. After this course, you’ll be in great shape to deal with these sorts of LeetCode style problems as they come up in your projects.

James Bowen 6/23/25

The Sliding Window in Haskell & Rust

In last week’s problem, we covered a two-pointer algorithm, and compared Rust and Haskell solutions as we have been for this whole series. Today, we’ll study a related concept, the sliding window problem. Whereas the general two-pointer problem can often be tackled by a single loop, we’ll have to use nested loops in this problem. This problem will also mark our first use of the Set data structure in this series.

If you want a deeper look at problem solving techniques in Haskell, you should enroll in our Solve.hs course! You’ll learn everything you need for general problem solving knowledge in Haskell, including data structures, algorithms, and parsing!

The Problem

Today’s LeetCode problem is Longest Substring without Repeating Characters. It’s a lengthy problem name, but the name basically tells you everything you need to know! We want to find a substring of our input that does not repeat any characters within the substring, and then get the longest such substring.

For example, abaca would give us an answer of 3, since we have the substring bac that consists of 3 unique characters. However, abaaca only gives us 2. There is no run of 3 characters where the three characters are all unique.

The Algorithm

The approach we’ll use, as mentioned above, is called a sliding window algorithm. In some ways, this is similar to the two-pointer approach last week. We’ll have, in a sense, two different pointers within our input. One dictates the “left end” of a window and one dictates the “right end” of a window. Unlike last week’s problem though, both pointers will move in the same direction, rather than converging from opposite directions.

The goal of a sliding window problem is “find a contiguous subsequence of an input that matches the criteria”. And for many problems like ours, you want to find the longest such subsequence. The main process for a sliding window problem is this:

Grow the window by increasing the “right end” until (or while) the predicate is satisfied
Once you cannot grow the window any more, shrink the window by increasing the “left end” until we’re in a position to grow the window again.
Continue until one or both pointers go off the end of the input list.

So for our problem today, we want to “grow” our sliding window as long as we can get more unique characters. Once we hit a character we’ve already seen in our current window, we’ll need to shrink the window until that duplicate character is removed from the set.

As we’re doing this, we’ll need to keep track of the largest substring size we’ve seen so far.

Here are the steps we would take with the input abaca. At each step, we process a new input character.

1. Index 0 (‘a’) - window is “a” which is all unique.
2. Index 1 (‘b’) - window is “ab” which is all unique
3. Index 2 (‘a’) - window is “aba”, which is not all unique
3b. Shrink window, removing first ‘a’, so it is now “ba”
4. Index 3 (‘c’) - window is “bac”, which is all unique
5. Index 4 (‘a’) - window is “baca”, which is not unique
5b. Shrink window, remove ‘b’ and ‘a’, leaving “ca”

The largest unique window we saw was bac, so the final answer is 3.
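The trace above can be reproduced with a short Rust sketch that records the window after each character is processed. For brevity it searches the current window slice directly instead of keeping a set, so it is a simplification of the approach in this post rather than the solution itself. The name window_trace is hypothetical.

```rust
/// Records the window after each input character has been processed.
/// `window_trace` is a hypothetical helper for illustration only; it scans
/// the window slice instead of using a set.
fn window_trace(input: &str) -> Vec<String> {
    let chars: Vec<char> = input.chars().collect();
    let mut start = 0;
    let mut trace: Vec<String> = Vec::new();
    for end in 0..chars.len() {
        // Shrink from the left while the new character is already in the window.
        while chars[start..end].contains(&chars[end]) {
            start += 1;
        }
        // The window now covers indices start..=end and has no repeats.
        trace.push(chars[start..=end].iter().collect());
    }
    trace
}
```

For abaca this yields the windows a, ab, ba, bac and ca, and the longest of them has length 3.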

Haskell Solution

For a change of pace, let’s discuss the Haskell approach first. Our algorithm is laid out in such a way that we can process one character at a time. Each character either grows the window, or forces it to shrink to accommodate the character. This means we can use a fold!

Let’s think about what state we need to track within this fold. Naturally, we want to track the current “set” of characters in our window. Each time we see the next character, we have to quickly determine if it’s already in the window. We’ll also want to track the largest set size we’ve seen so far, since by the end of the string our window might no longer reflect the largest subsequence.

With a general sliding window approach, you would also need to track both the start and the end index of your current window. In this problem though, we can get away with just tracking the start index. We can always derive the end index by taking the start index and adding the size of the set. And since we’re iterating through the characters anyway, we don’t need the end index to get the “next” character.

This means our fold-loop function will have this type signature:

-- State: (start index, set of letters, largest seen)
loop :: (Int, S.Set Char, Int) -> Char -> (Int, S.Set Char, Int)

Now, using our idea of “beginning from the end”, we can already write the invocation of this loop:

largestUniqueSubsequence :: String -> Int
largestUniqueSubsequence input = best
  where
    (_, _, best) = foldl loop (0, S.empty, 0) input

    loop :: (Int, S.Set Char, Int) -> Char -> (Int, S.Set Char, Int)
    ...

Using 0 for the start index right away is a little hand-wavy, since we haven’t actually added the first character to our set yet! But if we see a single character, we’ll always add it, and as we’ll see, the “adding” branch of our loop never increases this number.

With that in mind, let’s write this branch of our loop handler! If we have not seen the next character in the string, we keep the same start index (left side of the window isn’t moving), we add the character to our set, and we take the new size of the set as the “best” value if it’s greater than the original. We get the new size by adding 1 to the original set size.

largestUniqueSubsequence :: String -> Int
largestUniqueSubsequence input = best
  where
    (_, _, best) = foldl loop (0, S.empty, 0) input

    loop :: (Int, S.Set Char, Int) -> Char -> (Int, S.Set Char, Int)
    loop (startIndex, charSet, bestSoFar) c = if S.notMember c charSet
      then (startIndex, S.insert c charSet, max bestSoFar (S.size charSet + 1))
      else ...

Now we reach the tricky case! If we’ve already seen the next character, we need to remove characters from our set until we reach the instance of this character in the set. Since we might need to remove multiple characters, “shrinking” is an iterative process with a variable number of steps. This means it would be a while-loop in most languages, which means we need another recursive function!

The goal of this function is to change two of our stateful values (the start index and the character set) until we can once again have a unique character set with the new input character. So each iteration it takes the existing values for these, and will ultimately return updated values. Here’s its type signature:

shrink :: (Int, S.Set Char) -> Char -> (Int, S.Set Char)

Before we implement this, we can invoke it in our primary loop! When we’ve seen the new character in our set, we shrink the input to match this character, and then return these new stateful values along with our previous best (shrinking never increases the size).

largestUniqueSubsequence :: String -> Int
largestUniqueSubsequence input = best
  where
    (_, _, best) = foldl loop (0, S.empty, 0) input

    loop :: (Int, S.Set Char, Int) -> Char -> (Int, S.Set Char, Int)
    loop (startIndex, charSet, bestSoFar) c = if S.notMember c charSet
      then (startIndex, S.insert c charSet, max bestSoFar (S.size charSet + 1))
      else
        let (newStart, newSet) = shrink (startIndex, charSet) c
        in  (newStart, newSet, bestSoFar)

    shrink :: (Int, S.Set Char) -> Char -> (Int, S.Set Char)
    shrink = undefined

Now we implement “shrink” by considering the base case and recursive case. In the base case, the character at this index matches the new character we’re trying to remove. So we can return the same set of characters, but increase the index.

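In the recursive case, we still increase the index, but now we remove the character at the start index from the set without replacement. Note that looking up the character at startIndex needs efficient indexing, so we read from inputV, a vector built from the input string (for example, with V.fromList input).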

import qualified Data.Set as S
import qualified Data.Vector as V

largestUniqueSubsequence :: String -> Int
largestUniqueSubsequence input = best
  where
    (_, _, best) = foldl loop (0, S.empty, 0) input

    -- Vector copy of the input, so shrink can index it efficiently
    inputV :: V.Vector Char
    inputV = V.fromList input

    loop :: (Int, S.Set Char, Int) -> Char -> (Int, S.Set Char, Int)
    loop (startIndex, charSet, bestSoFar) c = if S.notMember c charSet
      then (startIndex, S.insert c charSet, max bestSoFar (S.size charSet + 1))
      else
        let (newStart, newSet) = shrink (startIndex, charSet) c
        in  (newStart, newSet, bestSoFar)

    shrink :: (Int, S.Set Char) -> Char -> (Int, S.Set Char)
    shrink (startIndex, charSet) c =
      let nextC = inputV V.! startIndex
          -- Base Case: nextC is equal to c
      in  if nextC == c then (startIndex + 1, charSet)
            -- Recursive Case: Remove the character at startIndex
            else shrink (startIndex + 1, S.delete nextC charSet) c

Now we have a complete Haskell solution!

Rust Solution

Now in our Rust solution, we’ll follow the same pattern we’ve been doing for these problems. We’ll set up our loop variables, write the loop, and handle the different cases in the loop. Because we had the nested recursive “shrink” function in Haskell, this will translate to a “while” loop in Rust, nested within our for-loop.

Here’s how we set up our loop variables:

use std::collections::HashSet;

pub fn length_of_longest_substring(s: String) -> i32 {
    let mut best = 0;
    let mut startIndex = 0;
    let inputV: Vec<char> = s.chars().collect();
    let mut charSet = HashSet::new();
    for c in s.chars() {
        ...
    }
}

Within the loop, we have the “easy” case, where the next character is not already in our set. We just insert it into our set, and we update best if we have a new maximum.

pub fn length_of_longest_substring(s: String) -> i32 {
    let mut best = 0;
    let mut startIndex = 0;
    let inputV: Vec<char> = s.chars().collect();
    let mut charSet = HashSet::new();
    for c in s.chars() {
        if charSet.contains(&c) {
            ...
        } else {
            charSet.insert(c);
            best = std::cmp::max(best, charSet.len());                
        }
    }
    return best as i32;
}

The Rust-specific oddity is that when we call contains on the HashSet, we must use &c, passing a reference to the character. In C++ we could just copy the character, or it could be handled by the function using const&. But Rust handles these things a little differently.

Now we get to the “tricky” case within our loop. How do we “shrink” our set to consume a new character?

In our case, we’ll actually just use the loop functionality of Rust, which works like while (true), requiring a manual break inside the loop. Our idea is that we’ll inspect the character at the “start” index of our window. If this character is the same as the new character, we will advance the start index (indicating we are dropping the old version), but then we’ll break. Otherwise, we’ll still increase the index, but we’ll remove the other character from the set as well.

Here’s what this loop looks like in relative isolation:

if charSet.contains(&c) {
    loop {
        // Look at “first” character of window
        let nextC = inputV[startIndex];
        if (nextC == c) {
            // If it’s the new character, we advance past it and break
            startIndex += 1;
            break;
        } else {
            // Otherwise, advance AND drop it from the set
            startIndex += 1;
            charSet.remove(&nextC);
        }
    }
} else {
    ...
}

The inner condition (nextC == c) feels a little flimsy to use with a while (true) loop. But it’s perfectly sound because of the invariant that if charSet contains c, we’ll necessarily find nextC == c before startIndex gets too large. We could also write it as a normal while loop, but loop is an interesting Rust-specific idea to bring in here.
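As a rough sketch of that alternative, the shrinking step can be phrased as a plain while loop whose condition is the invariant itself. The function name longest_unique_window and the snake_case variable names are illustrative choices, not part of the original solution. This variant also removes and re-inserts the repeated character instead of leaving it in the set:

```rust
use std::collections::HashSet;

// Hypothetical variant of the solution above, using `while` instead of `loop`.
pub fn longest_unique_window(s: &str) -> usize {
    let input: Vec<char> = s.chars().collect();
    let mut seen: HashSet<char> = HashSet::new();
    let mut start = 0;
    let mut best = 0;
    for &c in &input {
        // Shrink from the left until `c` is no longer inside the window.
        while seen.contains(&c) {
            seen.remove(&input[start]);
            start += 1;
        }
        seen.insert(c);
        best = best.max(seen.len());
    }
    best
}
```

The while condition makes the termination argument explicit: the loop stops as soon as the old copy of c has been dropped from the window.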

Here’s our complete Rust solution!

use std::collections::HashSet;

pub fn length_of_longest_substring(s: String) -> i32 {
    let mut best = 0;
    let mut startIndex = 0;
    let inputV: Vec<char> = s.chars().collect();
    let mut charSet = HashSet::new();
    for c in s.chars() {
        if charSet.contains(&c) {
            loop {
                let nextC = inputV[startIndex];
                if (nextC == c) {
                    startIndex += 1;
                    break;
                } else {
                    startIndex += 1;
                    charSet.remove(&nextC);
                }
            }
        } else {
            charSet.insert(c);
            best = std::cmp::max(best, charSet.len());                
        }
    }
    return best as i32;
}

Conclusion

With today’s problem, we’ve covered another important problem-solving concept: the sliding window. We saw how this approach could work even with a fold in Haskell, considering one character at a time. We also saw how nested loops compare across Haskell and Rust.

For more problem solving tips and tricks, take a look at Solve.hs, our complete course on problem solving, data structures, and algorithms in Haskell. You’ll get tons of practice on problems like these so you can significantly level up your skills!

James Bowen 6/16/25

Two Pointer Algorithms

We’re now on to part 5 of our series comparing Haskell and Rust solutions for LeetCode problems. You can also look at the previous parts (Part 1, Part 2, Part 3, Part 4) to get some more context on what we’ve learned so far comparing these two languages.

For a full look at problem solving in Haskell, check out Solve.hs, our latest course! You’ll get full breakdowns on the processes for solving problems in Haskell, from basic list and loop problems to advanced algorithms!

The Problem

Today we’ll be looking at a problem called Trapping Rain Water. In this problem, we’re given a vector of heights, which form a sort of 1-dimensional topology. Our job is to figure out how many units of water could be collected within the topology.

As a very simple example, the input [1,0,2] could collect 1 unit of water. Here’s a visualization of that system, where x shows the topology and o shows water we collect:

  x
xox

We can never collect any water over the left or right “edges” of the array, since it would flow off. The middle index of our array though is lower than its neighbors. So we take the lower of these neighboring values, and we see that we can collect 1 unit of water in this system.

For a bigger example that collects water, we might have the input [4, 2, 1, 1, 3, 5]. Here’s what that looks like:

          x
x o o o o x
x o o o x x 
x x o o x x
x x x x x x

The total water here is 9.

A flat system like [2,2,2], or a system that looks like a peak [1,2,3,2,1] cannot collect any water, so we should return 0 in these cases.

The Algorithm

There are a couple ways to solve this. One approach would be a two-pass solution, similar to what we used in Product of Array Except Self. We loop from the left side, tracking the maximum water we can store in each unit based on its left neighbors. Then we loop again from the right side and compare the maximum we can store based on the right neighbors to the prior value from the left. This solution is O(n) time, but O(n) space as well.
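To make the two-pass idea concrete, here is a minimal sketch. It assumes non-negative heights (as in the LeetCode constraints), and the name trap_two_pass is hypothetical. It stores the running maximum from the left in a vector, then folds in the running maximum from the right during the second pass:

```rust
// Hypothetical helper: two-pass version using O(n) extra space.
// Assumes all heights are non-negative.
pub fn trap_two_pass(height: &[i32]) -> i32 {
    let n = height.len();
    // Pass 1: left_max[i] is the tallest bar in height[0..=i].
    let mut left_max = vec![0; n];
    let mut running = 0;
    for i in 0..n {
        running = running.max(height[i]);
        left_max[i] = running;
    }
    // Pass 2: walk from the right, tracking the tallest bar seen so far.
    let mut right_max = 0;
    let mut total = 0;
    for i in (0..n).rev() {
        right_max = right_max.max(height[i]);
        // Water above index i is capped by the lower of the two maxima.
        total += left_max[i].min(right_max) - height[i];
    }
    total
}
```

The left_max vector is the O(n) space cost that the two-pointer approach avoids.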

A more optimal solution for this problem is a two-pointer approach that can use O(1) additional space. In this kind of solution, we look at the left and right of the input simultaneously. Each step of the way, we make a decision to either increase the “left pointer” or decrease the “right pointer” until they meet in the middle. Each time we move, we get more information about our solution.

In this particular problem, we’ll track the maximum value we’ve seen from the left side and the maximum value we’ve seen from the right side. As we traverse each index, we update both sides for the current left and right indices if we have a new maximum.

The crucial step is to see that if the current “left max” is no larger than the current “right max”, we know how much water can be stored at the left index. This is just the left max minus the height at the left index. Then we can increment the left index.

If the opposite is true, we calculate how much water can be stored at the right index, and decrease the right index.

So we keep a running tally of these sums, and we end our loop when they meet in the middle.

Rust Solution

We can describe our algorithm as a simple while loop. This loop goes until the left index exceeds the right index. The loop needs to track 5 values:

Left Index
Right Index
Left Max
Right Max
Total sum so far

So let’s write the setup portion of the loop:

pub fn trap(height: Vec<i32>) -> i32 {
    let mut leftMax = -1;
    let mut rightMax = -1;
    let mut leftI = 0;
    let mut rightI = height.len() - 1;
    let mut total = 0;
    while leftI <= rightI {
        ...
    }
}

A subtle point: the constraints on the LeetCode problem guarantee that the length is at least 1, but handling a length-0 input would require a special case. Rust uses unsigned integers (usize) for vector lengths, so height.len() - 1 on an empty vector underflows. This panics in debug builds and wraps around to the maximum usize value in release builds, and either outcome breaks our loop and indexing.
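One way to guard against this, shown here only as a small sketch, is the standard checked_sub method on usize, which returns an Option instead of underflowing. The helper name last_index is made up for illustration:

```rust
// Hypothetical helper: index of the last element, or None for an empty slice.
pub fn last_index(height: &[i32]) -> Option<usize> {
    // checked_sub yields None instead of underflowing when len() is 0.
    height.len().checked_sub(1)
}
```

A caller could then return 0 early whenever this yields None, which has the same effect as an explicit length check.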

Within the while loop, we run the algorithm.

Adjust leftMax and rightMax if necessary.
If leftMax is not larger than rightMax, add to total from the left and increment leftI
Otherwise (rightMax is smaller), add to total from the right and decrement rightI

And at the end, we return our total!

pub fn trap(height: Vec<i32>) -> i32 {
    let n = height.len();
    if n <= 1 {
        return 0;
    }
    let mut leftMax = -1;
    let mut rightMax = -1;
    let mut leftI = 0;
    let mut rightI = n - 1;
    let mut total = 0;
    while leftI <= rightI {
        // Step 1
        leftMax = std::cmp::max(leftMax, height[leftI]);
        rightMax = std::cmp::max(rightMax, height[rightI]);
        if leftMax <= rightMax {
            // Step 2
            total += leftMax - height[leftI];
            leftI += 1;
        } else {
            // Step 3
            total += rightMax - height[rightI];
            rightI -= 1;
        }
    }
    return total;
}

Haskell Solution

Now that we’ve seen our Rust solution with a single loop, let’s remember our process for translating this idea to Haskell. With a two-pointer loop, the way in which we traverse the elements of the input is unpredictable, thus we need a raw recursive function, rather than a fold or a map.

Since we’re tracking 5 integer values, we’ll want to write a loop function that looks like this:

-- (leftIndex, rightIndex, leftMax, rightMax, sum)
loop :: (Int, Int, Int, Int, Int) -> Int

Knowing this, we can already “start from the end” and figure out how to invoke our loop from the start of our function:

trapWater :: V.Vector Int -> Int
trapWater input = loop (0, n - 1, -1, -1, 0)
  where
    n = V.length input

    loop :: (Int, Int, Int, Int, Int) -> Int
    loop = undefined

In writing our recursive loop, we’ll start with the base case. Once leftI is the bigger index, we return the total.

trapWater :: V.Vector Int -> Int
trapWater input = loop (0, n - 1, -1, -1, 0)
  where
    n = V.length input

    loop :: (Int, Int, Int, Int, Int) -> Int
    loop (leftI, rightI, leftMax, rightMax, total) = if leftI > rightI then total
      else …

Within the else case, we just follow our algorithm, with the same 3 steps we saw with Rust.

trapWater :: V.Vector Int -> Int
trapWater input = loop (0, n - 1, -1, -1, 0)
  where
    n = V.length input

    -- (leftIndex, rightIndex, leftMax, rightMax, sum)
    loop :: (Int, Int, Int, Int, Int) -> Int
    loop (leftI, rightI, leftMax, rightMax, total) = if leftI > rightI then total
      else
        -- Step 1
        let leftMax' = max leftMax (input V.! leftI)
            rightMax' = max rightMax (input V.! rightI)
        in  if leftMax' <= rightMax'
              -- Step 2
              then loop (leftI + 1, rightI, leftMax', rightMax', total + leftMax' - input V.! leftI)
              -- Step 3
              else loop (leftI, rightI - 1, leftMax', rightMax', total + rightMax' - input V.! rightI)

And we have our Haskell solution!

Conclusion

If you’ve been following this whole series so far, hopefully you’re starting to get a feel for comparing basic algorithms in Haskell and Rust (standing as a proxy for most loop-based languages). In general, we can write loops as recursive functions in Haskell, capturing the “state” of the list as the input parameter for that function.

In particular cases where each iteration deals with exactly one element of an input list, we can employ folds as a tool to simplify our functions. But the two-pointer algorithm we explored today falls into the general recursive category.

To learn the details of understanding these problem solving techniques, take a look at our course, Solve.hs! You’ll learn everything from basic loop and list techniques, to advanced data structures and algorithms!

James Bowen 6/9/25

Spatial Reasoning with Zigzag Patterns!

Today we’re continuing our study of Rust and Haskell solutions to basic coding problems. This algorithm is going to be a little harder than the last few we’ve done in this series, and it will get trickier from here!

For a complete study of problem solving techniques in Haskell, make sure to check out Solve.hs. This course runs the gamut from basic solving techniques to advanced data structures and algorithms, so you’ll learn a lot!

The Problem

Today’s problem is Zigzag Conversion. This is an odd problem that stretches your ability to think iteratively and spatially. The idea is that you’re given an input string and a number of “rows”. You need to then imagine the input word written as a zig-zag pattern, where you write the letters in order first going down, and then diagonally up to the right until you get back to the first row. Then it goes down again. Your output must be characters re-ordered in “row-order” after this zig-zag rearrangement.

This makes the most sense looking at examples. Let’s go through several variations with the string MONDAYMORNINGHASKELL. Here’s what it looks like with 3 rows.

M   A   R   G   K
O D Y O N N H S E L
N   M   I   A   L

So to get the answer, we read along the top line first (MARGK), then the second (ODYONNHSEL), and then the third (NMIAL). So the final answer is MARGKODYONNHSELNMIAL.

Now let’s look at the same string in 4 rows:

M     M     G     L
O   Y O   N H   E L
N A   R I   A K
D     N     S

The answer here is MMGLOYONHELNARIAKDNS.

Here’s 5 rows:

M       R       K
O     O N     S E
N   M   I   A   L
D Y     N H     L
A       G

The answer here is MRKOONSENMIALDYNHLAG.

And now that we have the pattern, we can also consider 2 rows, which doesn’t visually look like a zig-zag as much:

M N A M R I G A K L
O D Y O N N H S E L

This gives the answer MNAMRIGAKLODYONNHSEL.

Finally, if there’s only 1 row, you can simply return the original string.

The Algorithm

So how do we go about solving this? The algorithm here is a bit more involved than the last few weeks!

Our output order is row-by-row, so for our solution we should think in a row-by-row fashion. If we can devise a function that will determine the indices of the original string that belong in each row, then we can simply loop over the rows and append these results!

In order to create this function, we have to think about the zig-zag in terms of “cycles”. Each cycle begins at the top row, goes down to the bottom row, and then up diagonally to the second row. The next element to go at the top row starts a new cycle. By thinking about cycles, we’ll discover a few key facts:

With n rows (n >= 2), a complete cycle has 2n - 2 letters.
The top and bottom row get one letter per cycle.
All other rows get two letters per cycle.

Now we can start to think mathematically about the indices that belong in each row. It’s easiest to think about the top and bottom rows, since they only get one letter each cycle. Each of these has a starting index (0 and n - 1, respectively), and then we add the cycle length 2n - 2 to these starting indices until it exceeds the length.

The middle rows have this same pattern, only now they have 2 starting indices. They have the starting index from the “down” direction and then their first index going up and to the right. Using 0-based row indices, the first index for row i is simply i, but the second index is harder to see.

The easiest way to find the second index is backwards! The next cycle starts at 2n - 2. So row index 1 has its second index at 2n - 2 - 1, and row index 2 has its second index at 2n - 2 - 2, and so on! The pattern of repeatedly adding the cycle length (the “cycle num”) then works for every starting index.

Once we have the indices for each row, our task is simple. We build a string for each row and combine them together in order.

So suppose we have our 4-row example.

M     M     G     L
O   Y O   N H   E L
N A   R I   A K
D     N     S

The “cycle num” is 6 (2 * 4 - 2). So the first row has indices [0, 6, 12, 18]. The fourth row starts with index 3, and so its indices also go up by 6 each time: [3, 9, 15].

The second row (index 1) has starting indices 1 and 5 (6 - 1). So its indices are [1, 5, 7, 11, 13, 17, 19]. Then the third row has indices [2, 4, 8, 10, 14, 16].
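The index lists above can be reproduced with a short sketch. The helper name row_indices and its signature are assumptions made for illustration. It takes a 0-based row, the number of rows, and the string length, and it assumes row is less than num_rows:

```rust
// Hypothetical helper: indices of the original string that land in `row`.
// Assumes row < num_rows.
pub fn row_indices(row: usize, num_rows: usize, n: usize) -> Vec<usize> {
    if num_rows <= 1 {
        // With a single row, every index stays in order.
        return (0..n).collect();
    }
    let cycle_len = 2 * num_rows - 2;
    let mut indices = Vec::new();
    let mut first = row;
    while first < n {
        indices.push(first);
        // Only middle rows get a second letter per cycle, on the way back up.
        if row > 0 && row < num_rows - 1 {
            let second = first + cycle_len - 2 * row;
            if second < n {
                indices.push(second);
            }
        }
        first += cycle_len;
    }
    indices
}
```

For the 20-letter example with 4 rows, row_indices(1, 4, 20) yields the second row's list and row_indices(3, 4, 20) yields the fourth row's list.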

A vector input will allow us to efficiently use and combine these indices.

As a final note, the “cycle num” logic doesn’t end up working with only 1 row. The cycle length using our calculation would be 0, not 1 as it should. The discrepancy is because our “cycle num” logic really depends on having a “first” and “last” row. So if we only have 1 row, we’ll hardcode that case and return the input string.

Rust Solution

In our Rust solution, we’ll accumulate our result string in place. To accomplish this we’ll do a few setup steps:

Handle our base case (1 row)
Get the string length and cycle number
Make a vector of the input chars for easy indexing (Rust doesn’t allow string indexing)
Initialize our mutable result string

pub fn convert(s: String, num_rows: i32) -> String {
    if (num_rows == 1) {
        return s;
    }
    let n = s.len();
    let nr = num_rows as usize; // Convenience for comparison
    let cycleLen: usize = (2 * nr - 2);
    let sChars: Vec<char> = s.chars().collect();
    let mut result = String::new();
    ...
}

Now we have to add the rows in order. Since the logic differs for the first and last rows, we have 3 sections: first row, middle rows, and last row. The first and last row are straightforward using our algorithm. Each is a simple while loop.

pub fn convert(s: String, num_rows: i32) -> String {
   if (num_rows == 1) {
       return s;
   }
   let n = s.len();
   let nr = num_rows as usize; // Convenience for comparison
   let cycleLen: usize = (2 * nr - 2);
   let sChars: Vec<char> = s.chars().collect();
   let mut result = String::new();
   
   // First Row
   let mut i = 0;
   while i < n {
       result.push(sChars[i]);
       i += cycleLen;
   }

   // Middle Rows
   ...

   // Last Row
   i = (nr - 1);
   while i < n {
       result.push(sChars[i]);
       i += cycleLen;
   }
   return result;
}

Now the middle rows section is similar. We loop through each of the possible rows in the middle. For each of these, we’ll do a while loop similar to the first and last row. These loops are different though, because we have to track two possible values, the “first” and “second” of each cycle.

If the “first” is already past the end of the vector, then we’re already done and can skip the loop. But even if not, we still need an “if check” on the “second” value as well. Each time through the loop, we increase both values by cycleLen.

pub fn convert(s: String, num_rows: i32) -> String {
   if (num_rows == 1) {
       return s;
   }
   let n = s.len();
   let nr = num_rows as usize; // Convenience for comparison
   let cycleLen: usize = (2 * nr - 2);
   let sChars: Vec<char> = s.chars().collect();
   let mut result = String::new();
   
   // First Row
   let mut i = 0;
   while i < n {
       result.push(sChars[i]);
       i += cycleLen;
   }

   // Middle Rows
   for row in 1..(nr - 1) {
       let mut first = row;
       let mut second = cycleLen - row;
       while first < n {
           result.push(sChars[first]);
           if second < n {
               result.push(sChars[second]);
           }
           first += cycleLen;
           second += cycleLen;
       }
   }

   // Last Row
   i = (nr - 1);
   while i < n {
       result.push(sChars[i]);
       i += cycleLen;
   }
   return result;
}

And that’s our complete solution!

Haskell Solution

The Haskell solution follows the same algorithm, but we’ll make a few stylistic changes compared to Rust. In Haskell, we’ll go ahead and define specific lists of indices for each row. That way, we can combine these lists and make our final string all at once using concatMap. This approach will let us demonstrate the power of ranges in Haskell.

We start by defining our base case and core parameters:

zigzagConversion :: String -> Int -> String
zigzagConversion input numRows = if numRows == 1 then input
  else ...
  where
    n = length input
    cycleLen = 2 * numRows - 2

    ...

Now we can define index-lists for the first and last rows. These are just ranges! We have the starting element, and we know to increment it by cycleLen. The range should go no higher than n - 1. Funny enough, the range can figure out that it should be empty in the edge case that our input is too small to fill all the rows!
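For comparison, Rust’s ranges can express a similar idea with the standard step_by iterator adapter. This is only a sketch: the helper name first_and_last_rows is hypothetical, and it assumes at least 2 rows, because step_by panics when given a step of 0:

```rust
// Hypothetical helper mirroring the Haskell ranges for the first and last rows.
// Assumes num_rows >= 2, since step_by panics on a step of 0.
pub fn first_and_last_rows(n: usize, num_rows: usize) -> (Vec<usize>, Vec<usize>) {
    let cycle_len = 2 * num_rows - 2;
    // Start at 0 (or num_rows - 1) and jump by cycle_len while below n.
    let first_row: Vec<usize> = (0..n).step_by(cycle_len).collect();
    let last_row: Vec<usize> = (num_rows - 1..n).step_by(cycle_len).collect();
    (first_row, last_row)
}
```

As with the Haskell range, the last row comes out empty when the input is too short to reach it.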

zigzagConversion :: String -> Int -> String
zigzagConversion input numRows = if numRows == 1 then input
  else ...
  where
    n = length input
    cycleLen = 2 * numRows - 2

    firstRow :: [Int]
    firstRow = [0,cycleLen..n - 1]

    lastRow :: [Int]
    lastRow = [numRows - 1, numRows - 1 + cycleLen..n - 1]

    ...

In Rust, we used a while-loop with two state values to calculate the middle rows. Hopefully you know from this series now that this while loop translates into a recursive function in Haskell. We’ll accumulate our list of indices as a tail argument, and keep the two stateful values as our other input parameters. We’ll combine all our lists together into one big list of int-lists, allRows.

zigzagConversion :: String -> Int -> String
zigzagConversion input numRows = if numRows == 1 then input
  else ...
  where
    n = length input
    cycleLen = 2 * numRows - 2

    firstRow :: [Int]
    firstRow = [0,cycleLen..n - 1]

    lastRow :: [Int]
    lastRow = [numRows - 1, numRows - 1 + cycleLen..n - 1]

    middleRow :: Int -> Int -> [Int] -> [Int]
    middleRow first second acc = if first >= n then reverse acc
      else if second >= n then reverse (first : acc)
      else middleRow (first + cycleLen) (second + cycleLen) (second : first : acc)

    middleRows :: [[Int]]
    middleRows = map (\i -> middleRow i (cycleLen - i) []) [1..numRows-2]

    allRows :: [[Int]]
    allRows = firstRow : middleRows <> [lastRow]

    ...

Now we bring it all together with one final step. We make a vector from our input, and define a function to turn a single int-list into a single String. Then at the top level of our function (the original else branch), we use concatMap to bring these together into our final result String.

zigzagConversion :: String -> Int -> String
zigzagConversion input numRows = if numRows == 1 then input
  else concatMap rowIndicesToString  allRows
  where
    n = length input
    cycleLen = 2 * numRows - 2

    firstRow :: [Int]
    firstRow = [0,cycleLen..n - 1]

    lastRow :: [Int]
    lastRow = [numRows - 1, numRows - 1 + cycleLen..n - 1]

    middleRow :: Int -> Int -> [Int] -> [Int]
    middleRow first second acc = if first >= n then reverse acc
      else if second >= n then reverse (first : acc)
      else middleRow (first + cycleLen) (second + cycleLen) (second : first : acc)

    middleRows :: [[Int]]
    middleRows = map (\i -> middleRow i (cycleLen - i) []) [1..numRows-2]

    allRows :: [[Int]]
    allRows = firstRow : middleRows <> [lastRow]

    inputV :: V.Vector Char
    inputV = V.fromList input

    rowIndicesToString :: [Int] -> String
    rowIndicesToString = map (inputV V.!)

Conclusion

This comparison once again showed how while loops in Rust track with recursive functions in Haskell. We also saw some nifty Haskell features like ranges and tail recursion. Most of all, we saw that even with a trickier algorithm, we can still keep the same basic shape of our algorithm in a functional or imperative style.

To learn more about these problem solving concepts, take a look at Solve.hs, our comprehensive course on problem solving in Haskell. You’ll learn about recursion, list manipulation, data structures, graph algorithms, and so much more!

James Bowen 6/2/25

Starting from the End: Solving “Product Except Self”

Today we continue our series exploring LeetCode problems and comparing Haskell and Rust solutions. We’re staying in the realm of list/vector manipulation, but the problems are going to start getting more challenging!

If you want to learn more about problem solving in Haskell, you should take a closer look at Solve.hs! In particular, you’ll learn how to translate common ideas from loop-based languages into Haskell’s recursive style!

The Problem

Today’s problem is Product of Array Except Self. The idea is that we are given a vector of n integers. We are supposed to return another vector of n integers, where output[i] is equivalent to the product of all the input integers except for input[i].

The key constraint here is that we are not allowed to use division. If we could use division, the answer would be simple! We would find the product of the input numbers and then divide this product by each input number to find the corresponding value. But division is more expensive than most other numeric operations, so we want to avoid it if possible!

The Algorithm

The approach we’ll use in this article relies on “prefix products” and “suffix products”. We’ll make two separate vectors called prefixes and suffixes, where prefixes[i] is the product of all numbers strictly before index i, and suffixes[i] is the product of all numbers strictly after index i.

Then, we can easily produce our results. The value output[i] is simply the product of prefixes[i] and suffixes[i].

As an example, our input might be [3, 4, 5]. The prefixes vector should be [1, 3, 12], and the suffixes vector should be [20, 5, 1]. Then our final output should be [20, 15, 12].

prefixes: [1, 3, 12]
suffixes: [20, 5, 1]
output: [20, 15, 12]
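As a quick check of these numbers, here is a small sketch that builds the two vectors in separate passes. The helper name prefix_suffix_products is an assumption for illustration, and it is not the combined single-loop solution:

```rust
// Hypothetical helper: builds both product vectors so they can be inspected.
pub fn prefix_suffix_products(nums: &[i32]) -> (Vec<i32>, Vec<i32>) {
    let n = nums.len();
    let mut prefixes = vec![1; n];
    let mut suffixes = vec![1; n];
    // prefixes[i] is the product of everything strictly before index i.
    for i in 1..n {
        prefixes[i] = prefixes[i - 1] * nums[i - 1];
    }
    // suffixes[i] is the product of everything strictly after index i.
    for i in (0..n.saturating_sub(1)).rev() {
        suffixes[i] = suffixes[i + 1] * nums[i + 1];
    }
    (prefixes, suffixes)
}
```

Multiplying the two vectors element by element then gives the output vector.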

Rust Solution

Here’s our Rust solution:

impl Solution {
    pub fn product_except_self(nums: Vec<i32>) -> Vec<i32> {
        let n = nums.len();
        let mut prefixes = vec![0; n];
        let mut suffixes = vec![0; n];
        let mut totalPrefix = 1;
        let mut totalSuffix = 1;

        // Loop 1: Populate prefixes & suffixes
        for i in 0..n {
            prefixes[i] = totalPrefix;
            totalPrefix *= nums[i];
            suffixes[n - i - 1] = totalSuffix;
            totalSuffix *= nums[n - i - 1];
        }

        let mut results = vec![0; n];

        // Loop 2: Populate results
        for i in 0..n {
            results[i] = prefixes[i] * suffixes[i];
        }
        return results;
    }
}

The two for-loops provide this solution with its shape. The first loop generates our vectors prefixes and suffixes. We keep track of a running tally of the totalPrefix and the totalSuffix. Each of these is initially 1.

let n = nums.len();
let mut prefixes = vec![0; n];
let mut suffixes = vec![0; n];
let mut totalPrefix = 1;
let mut totalSuffix = 1;

On each iteration, we assign the current “total prefix” to the prefixes vector in the front index i, and then the “total suffix” to the suffixes vector in the back index n - i - 1. Then we multiply each total value by the input value (nums) from that index so it’s ready for the next iteration.

// Loop 1: Populate prefixes & suffixes
for i in 0..n {
    prefixes[i] = totalPrefix;
    totalPrefix *= nums[i];
    suffixes[n - i - 1] = totalSuffix;
    totalSuffix *= nums[n - i - 1];
}

And now we calculate the result, by taking the product of prefixes and suffixes at each index.

let mut results = vec![0; n];

// Loop 2: Populate results
for i in 0..n {
    results[i] = prefixes[i] * suffixes[i];
}
return results;

Haskell Solution

In Haskell, we can follow this same template. However, a couple differences stand out. First, we don’t use for-loops. We have to use recursion or recursive helpers to accomplish these loops. Second, when constructing prefixes and suffixes, we want to use lists instead of modifying mutable vectors.

When performing recursion and accumulating linked lists, it can be tricky to reason about which lists need to be reversed at which points in our algorithm. For this reason, it’s often very helpful in Haskell to start from the end of our algorithm.

Let’s write out a template of our solution that leaves prefixes and suffixes as undefined stubs. Then the first step we’ll work through is how to get the solution from that:

productOfArrayExceptSelf :: V.Vector Int -> V.Vector Int
productOfArrayExceptSelf inputs = solution ???
  where
    n = V.length inputs

    solution :: ??? -> V.Vector Int

    prefixes :: [Int]
    prefixes = undefined

    suffixes :: [Int]
    suffixes = undefined

So given prefixes and suffixes, how do we find our solution? The ideal case is that both these lists are already in reverse-index order with respect to the input vector (i.e. n - 1 to 0). Then we don’t need to do an additional reverse to get our solution.

We can then implement solution as a simple tail recursive helper function that peels one element off each input and multiplies them together. When we’re out of inputs, it returns its result:

productOfArrayExceptSelf :: V.Vector Int -> V.Vector Int
productOfArrayExceptSelf inputs = solution (prefixes, suffixes, [])
  where
    n = V.length inputs

    -- Loop 2: Populate Results
    solution :: ([Int], [Int], [Int]) -> V.Vector Int
    solution ([], [], acc) = V.fromList acc
    solution (p : ps, s : ss, acc) = solution (ps, ss, p * s : acc)
    solution _ = error "Prefixes and suffixes must be the same size!"

    prefixes :: [Int]

    suffixes :: [Int]

So now we’ve done “Loop 2” already, and we just have to implement “Loop 1” so that it produces the right results. Again, we’ll make a tail recursive helper, and this will produce both prefixes and suffixes at once. It will take the index, as well as the “total” prefix and suffix so far, and then two accumulator lists. At the end of this, we want both lists in reverse index order.

productOfArrayExceptSelf :: V.Vector Int -> V.Vector Int
productOfArrayExceptSelf inputs = solution (prefixes, suffixes, [])
  where
    n = V.length inputs

    -- Loop 2: Populate Results
    solution :: ([Int], [Int], [Int]) -> V.Vector Int

    prefixes :: [Int]
    suffixes :: [Int]
    (prefixes, suffixes) = mkPrefixSuffix (0, 1, [], 1, [])

    -- Loop 1: Populate prefixes & suffixes
    mkPrefixSuffix :: (Int, Int, [Int], Int, [Int]) -> ([Int], [Int])
    mkPrefixSuffix (i, totalPre, pres, totalSuff, suffs) = undefined

Now we fill in mkPrefixSuffix as we would any tail recursive helper. First we satisfy the base case. This occurs once i is at least n. We’ll return the accumulated lists.

mkPrefixSuffix :: (Int, Int, [Int], Int, [Int]) -> ([Int], [Int])
mkPrefixSuffix (i, totalPre, pres, totalSuff, suffs) = if i >= n then (pres, reverse suffs)
  else ...

But observe we’ll need to reverse suffixes! This becomes clear when we map out what each iteration of the loop looks like for a simple input. Doing this kind of “loop tracking” is a very helpful problem solving skill for walking through your code!

input = [3, 4, 5]
i = 0: (0, 1, [], 1, [])
i = 1: (1, 3, [1], 5, [1])
i = 2: (2, 12, [3, 1], 20, [5, 1])
i = 3: (3, 60, [12, 3, 1], 60, [20, 5, 1])

Our prefixes are [12, 3, 1], which is properly reversed, but the suffixes are [20, 5, 1]. We don’t want both lists ending in 1! So we reverse the suffixes.
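The running totals in this trace can also be generated mechanically from the loop shape used in the Rust solution. The following is a rough sketch with a hypothetical helper, trace_totals, that records only the index and the two running products (not the accumulated lists):

```rust
// Hypothetical tracing helper: records (i, total_prefix, total_suffix)
// at the start of each iteration, plus the final state after the loop.
pub fn trace_totals(nums: &[i32]) -> Vec<(usize, i32, i32)> {
    let n = nums.len();
    let mut total_prefix = 1;
    let mut total_suffix = 1;
    let mut states = Vec::new();
    for i in 0..n {
        states.push((i, total_prefix, total_suffix));
        total_prefix *= nums[i];
        total_suffix *= nums[n - i - 1];
    }
    states.push((n, total_prefix, total_suffix));
    states
}
```

For the input [3, 4, 5], the recorded tuples line up with the first, second, and fourth columns of the trace above.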

Now that we’ve figured this out, it’s simple enough to fill in the recursive case using what we already know from “Loop 1” in the Rust solution. We get the “front” index of input with i, and the “back” index with n - i - 1, use these to get the new products, and then save the old products in our list.

mkPrefixSuffix :: (Int, Int, [Int], Int, [Int]) -> ([Int], [Int])
mkPrefixSuffix (i, totalPre, pres, totalSuff, suffs) = if i >= n then (pres, reverse suffs)
  else
    let nextPre = inputs V.! i
        nextSuff = inputs V.! (n - i - 1)
    in mkPrefixSuffix (i + 1, totalPre * nextPre, totalPre : pres, totalSuff * nextSuff, totalSuff : suffs)

Here’s our complete Haskell solution!

productOfArrayExceptSelf :: V.Vector Int -> V.Vector Int
productOfArrayExceptSelf inputs = solution (prefixes, suffixes, [])
  where
    n = V.length inputs

    solution :: ([Int], [Int], [Int]) -> V.Vector Int
    solution ([], [], acc) = V.fromList acc
    solution (p : ps, s : ss, acc) = solution (ps, ss, p * s : acc)
    solution _ = error "Invalid solution!"

    prefixes :: [Int]
    suffixes :: [Int]
    (prefixes, suffixes) = mkPrefixSuffix (0, 1, [], 1, [])


    mkPrefixSuffix :: (Int, Int, [Int], Int, [Int]) -> ([Int], [Int])
    mkPrefixSuffix (i, totalPre, pres, totalSuff, suffs) = if i >= n then (pres, reverse suffs)
      else
        let nextPre = inputs V.! i
            nextSuff = inputs V.! (n - i - 1)
        in  mkPrefixSuffix (i + 1, totalPre * nextPre, totalPre : pres, totalSuff * nextSuff, totalSuff : suffs)

Conclusion

In this comparison, we saw a couple important differences in problem solving with a loop-based language like Rust compared to Haskell.

For-loops have to become recursion in Haskell
We want to use lists in Haskell, not mutable vectors
It takes a bit of planning to figure out when to reverse lists!

This led us to a couple important insights when solving problems in Haskell.

“Starting from the end” can be very helpful in plotting out our solution
“Loop tracking” is a very helpful skill to guide our solutions

For an in-depth look at these sorts of comparisons, check out our Solve.hs course. You’ll learn all the most important tips and tricks for solving coding problems in Haskell! In particular you’ll get an in-depth look at tail recursion, a vital concept for solving problems in Haskell.

James Bowen 5/26/25

Learning from Multiple Solution Approaches

Welcome to the second article in our Rust vs. Haskell problem solving series. Last week we saw some basic differences between Rust loops and Haskell recursion. We also saw how to use the concept of “folding” to simplify a recursive loop function.

This week, we’ll look at another simple problem and consider multiple solutions in each language. We’ll consider what a “basic” solution looks like, using relatively few library functions. Then we’ll consider more “advanced” solutions that make use of library functionality, and greatly simplify the structure of our solutions.

To learn more about problem solving in Haskell, including the importance of list library functions, take a look at our course Solve.hs! You’ll write most of Haskell’s list API from scratch so you get an in-depth understanding of the functions that are available!

The Problem

This week’s problem is Reverse Words in a String. The idea is simple. Our input is a string, which naturally has “words” separated by whitespace. We want to return a string that has the same words in reverse order! So if the input is “A quick brown fox”, the result should be “fox brown quick A”.

Notice that all extra whitespace is removed in our output. We should only have a single space between words in our answer, with no leading or trailing whitespace.

The Algorithm

The algorithmic idea is simple and hardly needs explanation. We want to gather letters from the input string until we encounter whitespace. Then we append this buffered word to a growing result string, and keep following this process until we run out of input.

There is one wrinkle, which is whether we want to accumulate our answer in the forward or reverse direction. This changes across languages!

In Haskell, it’s actually more efficient to accumulate the “back” of our resulting string first, meaning we should start by iterating from the front of the input. This is more consistent with linked list construction.

In Rust, we’ll iterate from the back of the input so that we can accumulate our result from the “front”.

Basic Rust Solution

In our basic solution, we’re going to consider a character-by-character approach. As outlined in our algorithm, we can accomplish this task with a single loop, with two stateful values. First, we have the “current” word we’re accumulating of non-whitespace characters. Second, we have the final “result” we’re accumulating.

It’s efficient to append to the end of strings, meaning we want to construct our result from front-to-back. This means we’ll loop through the characters of our string in reverse, as shown with .rev() here:

pub fn reverse_words(s: String) -> String {
    let mut current = String::new();
    let mut result = String::new();
    for c in s.chars().rev() {
        ...
    }
}

Within the loop, we now just have to consider what to do with each character. If the character is not whitespace, the answer is simple. We just append this character to our “current” word. Because we’re looping through the input in reverse, our “current” word will also be in reverse!

pub fn reverse_words(s: String) -> String {
    let mut current = String::new();
    let mut result = String::new();
    for c in s.chars().rev() {
        if !c.is_whitespace() {
            current.push(c);
        } else {
            ...
        }
    }
}

So what happens when we encounter whitespace? There are a few conditions to consider:

If “current” is empty, do nothing.
If “result” is empty, append “current” (in reverse order) to result.
If “result” is not empty, add a space and then append “current” in reverse.
Regardless, clear “current” and prepare to gather a new string.

Here’s what the code looks like:

pub fn reverse_words(s: String) -> String {
    let mut current = String::new();
    let mut result = String::new();
    for c in s.chars().rev() {
        if !c.is_whitespace() {
            current.push(c);
        } else {
            // Step 1: Skip if empty
            if !current.is_empty() {
                // Step 2/3 Only push a space if result is not empty
                if !result.is_empty() {
                    result.push(' ');
                }
                // Step 2/3 Reverse current and append
                for b in current.chars().rev() {
                    result.push(b);
                }
                // Step 4: Clear “current”
                current.clear();
            }
        }
    }
}

There’s one final trick. Unless the input begins with whitespace, we’ll still have a non-empty “current” at the end of the loop, and we will not have appended it. So we do one final check, and once again append “current” in reverse order.

Here’s our final basic solution:

pub fn reverse_words(s: String) -> String {
    let mut current = String::new();
    let mut result = String::new();
    for c in s.chars().rev() {
        if !c.is_whitespace() {
            current.push(c);
        } else {
            // Step 1: Skip if empty
            if !current.is_empty() {
                // Step 2/3 Only push a space if result is not empty
                if !result.is_empty() {
                    result.push(' ');
                }
                // Step 2/3 Reverse current and append
                for b in current.chars().rev() {
                    result.push(b);
                }
                // Step 4: Clear “current”
                current.clear();
            }
        }
    }
    if !current.is_empty() {
        if !result.is_empty() {
            result.push(' ');
        }
        for b in current.chars().rev() {
            result.push(b);
        }
    }
    return result;
}
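
The final solution repeats the “push a space, then append current in reverse” logic twice. As a rough sketch (not part of the original solution), that logic could be pulled into a helper. The names flush_word and reverse_words_with_helper are hypothetical and chosen only for this illustration.

```rust
/// Hypothetical helper (not in the original post): appends `current`,
/// reversed, to `result`, adding a separating space when needed.
fn flush_word(current: &mut String, result: &mut String) {
    if current.is_empty() {
        return;
    }
    if !result.is_empty() {
        result.push(' ');
    }
    // `current` was gathered back-to-front, so reverse it while appending.
    result.extend(current.chars().rev());
    current.clear();
}

pub fn reverse_words_with_helper(s: String) -> String {
    let mut current = String::new();
    let mut result = String::new();
    for c in s.chars().rev() {
        if c.is_whitespace() {
            flush_word(&mut current, &mut result);
        } else {
            current.push(c);
        }
    }
    // Flush the last buffered word (the first word of the input).
    flush_word(&mut current, &mut result);
    result
}
```

The behavior should match the version above. The only change is that the edge cases live in one place, which is the same idea the Haskell combine function uses later on.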

Advanced Rust Solution

Looping character-by-character is a bit cumbersome. However, since basic whitespace related operations are so common, there are some useful library functions for dealing with them.

Rust also prioritizes the ability to chain iterative operations together. This gives us the following one-line solution!

pub fn reverse_words(s: String) -> String {
    s.split_whitespace().rev().collect::<Vec<&str>>().join(" ")
}

It has four stages:

Split the input based on whitespace.
Reverse the split-up words.
Collect these words as a vector of strings.
Join them together with one space in between them.

What is interesting about this structure is that each stage of the process has a separate type. Step 1 creates a SplitWhitespace struct. Step 2 creates a Rev struct that wraps it. Step 3 then creates a normal vector, and step 4 concludes by producing a string.

The two preliminary structures are essentially lazy iterator wrappers that help chain the operations together. As we’ll see, the comparable Haskell solution only uses basic lists, and this is a noteworthy difference between the languages.
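
To make the intermediate types visible, here is a sketch of the same one-liner split into four bindings with explicit type annotations. The function name reverse_words_staged is made up for this illustration. The types are the standard library’s std::str::SplitWhitespace (returned by split_whitespace) and std::iter::Rev (returned by rev).

```rust
use std::iter::Rev;
use std::str::SplitWhitespace;

pub fn reverse_words_staged(s: String) -> String {
    // Stage 1: a lazy iterator over the whitespace-separated words.
    let words: SplitWhitespace<'_> = s.split_whitespace();
    // Stage 2: a wrapper that yields those words back-to-front.
    let reversed: Rev<SplitWhitespace<'_>> = words.rev();
    // Stage 3: the first point where the words are gathered into memory.
    let collected: Vec<&str> = reversed.collect();
    // Stage 4: join into a single owned String.
    collected.join(" ")
}
```

Nothing is actually split or reversed until collect runs. The first two stages only describe the work to be done.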

Basic Haskell Solution

Our “basic” Haskell solution will follow the same outline as the basic Rust solution, but we’ll work in the opposite direction! We’ll loop through the input in forward order, and accumulate our output in reverse order.

Before we even get started though, we can make an observation from our basic Rust solution that we duplicated some code! The concept of combining the “current” word and the “result” had several edge cases to handle, so let’s write a combine function to handle these.

-- “current” is reversed and then goes in *front* of result
-- (Rust version put “current” at the back)
combine :: (String, String) -> String
combine (current, res) = if null current then res
  else reverse current <> if null res then "" else (' ' : res)

Now let’s think about our loop structure. We are going through the input, character-by-character. This means we should be able to use a fold, like we did last week! Whenever we’re using a fold, we want to think about the “state” we’re passing through each iteration. In our case, the state is the “current” word and the “result” string. This means our folding function should look like this:

loop :: (String, String) -> Char -> (String, String)
loop (current, result) c = ...

Now we just have to distinguish between the “whitespace” case and the non-whitespace case. If we encounter a space, we just combine the current word with the accumulated result. If we encounter a normal character, we add it to the front of our current word (so “current” accumulates in reverse).

loop :: (String, String) -> Char -> (String, String)
loop (currentWord, result) c = if isSpace c
  then ("", combine (currentWord, result))
  else (c : currentWord, result)

Now to complete the solution, we just call ‘foldl’ with our ‘loop’ and the input, and we just have to remember to combine the final “current” word with the output! Here’s our complete “basic” solution.

import Data.Char (isSpace)

reverseWords :: String -> String
reverseWords input = combine $ foldl loop ("", "") input
  where
    combine :: (String, String) -> String
    combine (current, res) = if null current then res
      else reverse current <> if null res then "" else (' ' : res)

    loop :: (String, String) -> Char -> (String, String)
    loop (currentWord, result) c = if isSpace c
      then ("", combine (currentWord, result))
      else (c : currentWord, result)

Advanced Haskell Solutions

Now that we’ve seen a basic, character-by-character solution in Haskell, we can also consider more advanced solutions that incorporate library functions. The first improvement we can make is to lean on list functions like break and dropWhile.

The break function splits off the longest prefix of a list whose elements do not satisfy a predicate. We’ll use this to gather non-space characters. Then dropWhile allows us to drop the first series of characters in a list that satisfy a predicate. We’ll use this to get rid of whitespace as we move along!

So we’ll define this solution using a basic recursive loop rather than a fold, because each iteration will consume a variable number of characters. The “state” of this loop will be two strings: the remaining part of the input, and the accumulated result.

Since there’s no “current” word, our base case is easy. If the remaining input is empty, we return the accumulated result.

loop :: (String, String) -> String
loop ([], output) = output
...

Otherwise, we’ll follow this process:

Separate the first “word” using break isSpace.
Combine this word with the output (if it’s not null)
Recurse with the new output, dropping the initial whitespace from the remainder.

Here’s what it looks like:

loop :: (String, String) -> String
loop ([], output) = output
loop (cs, output) =
  -- Step 1: Separate next word from rest
  let (nextWord, rest) = L.break isSpace cs
  -- Step 2: Make new output (account for edge cases)
  -- (Can’t use ‘combine’ from above because we aren’t reversing!)
      newOutput = if null output then nextWord
                    else if null nextWord then output
                    else nextWord <> (' ' : output)
  -- Drop spaces from remainder and recurse
  in  loop (L.dropWhile isSpace rest, newOutput)

And completing the function is as simple as calling this loop with the base inputs:

reverseWords :: String -> String
reverseWords input = loop (input, "")
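
For comparison, here is a rough Rust sketch of the same “break, then drop whitespace” loop. It is an illustration rather than a recommended solution. The function name reverse_words_break is hypothetical. It uses str::find with char::is_whitespace plus split_at in place of break isSpace, and trim_start in place of dropWhile isSpace.

```rust
pub fn reverse_words_break(s: &str) -> String {
    // Like `dropWhile isSpace`: skip any leading whitespace.
    let mut rest = s.trim_start();
    let mut output = String::new();
    while !rest.is_empty() {
        // Like `break isSpace`: split at the first whitespace character.
        let split_idx = rest.find(char::is_whitespace).unwrap_or(rest.len());
        let (next_word, remainder) = rest.split_at(split_idx);
        // Put the new word in front of what we have so far.
        output = if output.is_empty() {
            next_word.to_string()
        } else {
            format!("{} {}", next_word, output)
        };
        rest = remainder.trim_start();
    }
    output
}
```

Building the result from the back like this suits a Haskell list, where adding to the front is cheap. In Rust, each format! call copies the whole accumulated string, which is why the earlier Rust solutions iterate in reverse and push onto the end instead.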

The Simplest Haskell Solution

The final (and recommended) Haskell solution uses the library functions words and unwords. These do exactly what we want for this problem! We separate words based on whitespace using words, and then join them with a single space with unwords. All we have to do in between is reverse.

reverseWords :: String -> String
reverseWords = unwords . reverse . words

This has a similar elegance to the advanced Rust solution, but is much simpler to understand since there are no complex structs or iterators involved. The types of all functions involved simply relate to lists. Here are the signatures, specialized to String for this problem.

words :: String -> [String]
reverse :: [String] -> [String]
unwords :: [String] -> String

Conclusion

A simple problem will often have many solutions, but in this case, each of these solutions teaches us something new about the language we’re working with. Working character-by-character helps us understand some of the core mechanics of the language, showing us how it works under the hood. But using library functions helps us see the breadth of available options we have for simplifying future code we write.

In our Solve.hs course, you’ll go through all of these steps with Haskell. You’ll implement list library functions, data structures, and algorithms from scratch so you understand how they work under the hood. Then, you’ll know they exist and be able to apply them to efficiently solve harder problems. Take a look at the course today!

James Bowen 5/19/25

Comparing Code: LeetCode Problems in Rust vs. Haskell

Today will be the first in a series where we’ll be exploring some LeetCode problems and comparing different solutions from Haskell and Rust. The main idea is to demonstrate how you might translate ideas between the recursive core of Haskell, and the loop-based framing of most other languages.

If you want to learn more about problem solving in Haskell, you should take a closer look at Solve.hs! This course will give you an in-depth walkthrough of problem solving ideas in Haskell, including how concepts compare to more typical languages.

The Problem

The first problem we’ll consider is called H-Index. In academia, a person has an “H-Index” of n if they have published at least n papers that have n or more citations. So the input to our problem is a list of integers, where each integer is the number of citations of a particular paper the author wrote. Our job is to calculate the author’s H-Index.

The Algorithm

This problem is fairly straightforward if you sort the input list. Once we do this, we can look at any index i and count the remaining entries (i.e. n - i). Because the list is sorted, we know that at least n - i papers have at least as many citations as the paper at index i.

So we can accomplish this task with a single loop over the sorted list. Throughout this loop, we’ll be tracking the maximum “H-Index” we’ve seen so far (maxH). At each iteration, we take the following steps:

Get the number of remaining papers (rem) and the citations at this index (next).
If rem is greater than or equal to next, update maxH to next if next is larger.
Otherwise, update maxH to rem if rem is larger.

The last step is a key edge case! If we have the list [1, 1, 1, 9, 9], we’ll get to index 3, with next being 9 and rem being 2. The remainder is smaller than next, but we would still update maxH to 2, because there are at least 2 papers remaining with 2 or more citations.
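
To see the edge case concretely, here is a small sketch that lists the candidate value at each index. The two branches of the algorithm amount to taking the smaller of next and rem, so that is what the sketch computes. The helper h_index_candidates is hypothetical and exists only to trace the example.

```rust
/// Hypothetical tracing helper: for each index of the sorted list, the
/// H-Index candidate is the smaller of `next` (citations at this index)
/// and `rem` (papers remaining from this index onward).
pub fn h_index_candidates(citations: &[i32]) -> Vec<i32> {
    let mut sorted = citations.to_vec();
    sorted.sort();
    let n = sorted.len();
    sorted
        .iter()
        .enumerate()
        .map(|(i, &next)| {
            let rem = (n - i) as i32;
            next.min(rem)
        })
        .collect()
}
```

For [1, 1, 1, 9, 9] the candidates are [1, 1, 1, 2, 1]. The largest of these is 2, and it comes from index 3, where rem is smaller than next.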

Rust Solution

Here’s our Rust solution:

pub fn h_index(citations: Vec<i32>) -> i32 {
    let mut cp = citations.clone();
    cp.sort();
    let n = cp.len();
    let mut maxH: i32 = 0;
    for i in 0..n {
        let next = cp[i];
        let rem: i32 = (n - i) as i32;
        if (rem >= next) {
            maxH = std::cmp::max(next, maxH);
        } else {
            maxH = std::cmp::max(rem, maxH);
        }
    }
    return maxH;
}

We have the first part, where we clone the input, sort it, and set up our loop variables:

pub fn h_index(citations: Vec<i32>) -> i32 {
    let mut cp = citations.clone();
    cp.sort();
    let n = cp.len();
    let mut maxH: i32 = 0;
    ...
}

Then we have the loop itself, where we have our two cases to consider:

for i in 0..n {
    let next = cp[i];
    let rem: i32 = (n - i) as i32;
    if (rem >= next) {
        // There are at least ‘next’ papers >= ‘next’
        maxH = std::cmp::max(next, maxH);
    } else {
        // ‘next’ > ‘rem’, so there are at least ‘rem’ papers >= ‘rem’
        maxH = std::cmp::max(rem, maxH);
    }
}

So this is pretty straightforward. Now how do we approach this kind of problem in Haskell?

Haskell Solution

Our Haskell solution will have the same structure, but instead of running a loop and indexing into a vector, we’ll use a linked list and call a recursive function. Let’s begin by getting the length and sorting our input:

import qualified Data.List as L

hIndex :: [Int] -> Int
hIndex inputs = ...
  where
    n = length inputs
    sorted = L.sort inputs

    ...

Now we need to think about our recursive loop function. At each iteration, we need access to the remaining number of values, the next citation value, and we need to pass along maxH. As with many list-based recursive functions, we’ll peel off one element of the input list each time. Ultimately we’ll return maxH from this loop when we hit our base case of an empty input list. So its type signature should look like this:

loop :: (Int, [Int], Int) -> Int

When writing a recursive function, we always handle the base case first:

loop :: (Int, [Int], Int) -> Int
loop (_, [], maxH) = maxH

Now in the recursive case, we can apply our algorithm, updating maxH if necessary:

loop :: (Int, [Int], Int) -> Int
loop (_, [], maxH) = maxH
loop (remaining, next : rest, maxH) = if remaining >= next
  then loop (remaining - 1, rest, max next maxH)
  else loop (remaining - 1, rest, max remaining maxH)

To finish up, all we need to do is call our loop function with the appropriate initial inputs (n, sorted, 0). Here’s our complete Haskell solution:

import qualified Data.List as L

hIndex :: [Int] -> Int
hIndex inputs = loop (n, sorted, 0)
  where
    n = length inputs
    sorted = L.sort inputs

    loop :: (Int, [Int], Int) -> Int
    loop (_, [], maxH) = maxH
    loop (remaining, next : rest, maxH) = if remaining >= next
      then loop (remaining - 1, rest, max next maxH)
      else loop (remaining - 1, rest, max remaining maxH)

Using a Fold

Now we can notice that our loop has a particular structure. We have one piece of accumulated state (maxH), and this changes based on each value in our list (combined with the remaining values). We can easily re-imagine this kind of loop using a fold. We just have to think of the folding function like this:

loop :: Int -> (Int, Int) -> Int
loop maxH (remaining, next) = if remaining >= next
  then max next maxH
  else max remaining maxH

This has the a -> b -> a structure of a left-fold function, where a is our accumulated maxH value, and the other values come from our list. The main benefit here is that our loop function no longer has to deal with the burden of handling a base case or passing the “shrinking” list as an argument to the next recursive call.

We can invoke this loop at the top level like so:

hIndex :: [Int] -> Int
hIndex inputs = foldl loop 0 (zip [n,n-1..1] sorted)
  where
    n = length inputs
    sorted = L.sort inputs

    loop :: Int -> (Int, Int) -> Int
    loop maxH (remaining, next) = if remaining >= next
      then max next maxH
      else max remaining maxH

We just have to zip the decreasing indices together with our sorted list. Now our recursive “loop” is more like a typical for-loop. We’re only considering one element at a time, and we’re updating the important state each time.
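
The same fold shape is available in Rust through iterators. Here is a rough sketch, assuming a made-up function name h_index_fold, that zips the decreasing “remaining” counts with the sorted citations and folds over the pairs.

```rust
pub fn h_index_fold(citations: Vec<i32>) -> i32 {
    let mut sorted = citations;
    sorted.sort();
    let n = sorted.len() as i32;
    // Pair each sorted value with the papers remaining: n, n-1, ..., 1.
    (1..=n)
        .rev()
        .zip(sorted.iter())
        .fold(0, |max_h, (remaining, &next)| {
            if remaining >= next {
                max_h.max(next)
            } else {
                max_h.max(remaining)
            }
        })
}
```

The closure plays the role of the Haskell loop function. It receives the accumulated maximum and one (remaining, next) pair, and returns the updated maximum.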

Conclusion

In this article, we compared a normal for-loop in Rust with a recursive solution in Haskell. We also saw how we could simplify this recursive formulation into a “fold” structure.

If you're interested in learning more about writing recursive functions in Haskell, check out our Solve.hs course. You’ll learn how to start thinking about problems in a functional way, and you’ll learn the step-by-step processes for tackling problems with basic recursion and folds like we saw in this example.

James Bowen 5/12/25

Hey ChatGPT, Write me a Haskell Course?

In last week’s article, I discussed how Monday Morning Haskell courses compare to other Haskell courses that I’ve seen out there online. Obviously I’m wildly biased, but I think MMH courses have some serious advantages.

But there’s still the elephant in the room…how do my courses compare to the possibility of using generative AI (e.g. ChatGPT) to learn Haskell instead? While AI is a great tool that has opened up a lot of doors in terms of learning complex concepts, human-developed courses still have some important advantages over the current way you would learn from a chatbot.

Analogy: Going to the Library

I’ll start my case by drawing an (imperfect) analogy. Suppose you are enrolled in a college and want to learn a particular subject, like physical chemistry. You could enroll in your school’s physical chemistry course. Or you could spend the same amount of time going to the library. After all, the library has tons of books on physical chemistry. So you could read all these books and gain the same level of insight, right?

In this example, most people would recognize the shortcomings of just going to the library. For example, you are now responsible for determining the curriculum and course of study.

You could, of course, look at the table of contents of an introductory book and just run with that way of organizing the material. But how much of that do you need to learn? Most college courses aren’t going all the way through a textbook, because the professor already has a good idea of what material is the most important, and has organized the course around that.

A professor will also know when and how to introduce supplemental material from other sources. If you’re just “learning from the library”, you’d be responsible for selecting which materials are the most important, and you probably aren’t qualified!

Also, while textbooks may have practice problems, and they may even have answers to those problems, you still have to do the work of figuring out which problems to study, and how many you need to study before you know the material. Taking a full course with assignments would solve this for you.

Finally, textbooks will rarely tell you about the human process of learning a particular subject. You probably aren’t going to read a sentence like “lots of students struggle to understand this, here’s a way of thinking about the problem that has helped a lot of them.” These are insights you’ll gain from working with a professor (who has taught real students) or other students in the class.

So let’s sum up the shortcomings of “learning from the library”:

Direction - You must take on the cognitive overhead of determining which areas of the subject to study.
Filtering - You must figure out how much detail is necessary, and how much practice you need to learn it.
Human Learning Insight - Textbooks are generally lacking in the actual insights and breakthroughs that help students understand particularly challenging ideas.

From Physical to Online Learning

Now let’s consider what changes about the analogy if instead of comparing physical learning environments, we think about the current online learning environment. Entering an online course is significantly easier than enrolling in a university course. You don’t have to wait for the start of the semester, or go to a physical location.

But using ChatGPT as your “library” is vastly easier than studying from textbooks. In a matter of minutes, you can get tons of information on your screen that would have taken hours or days of effort at the library. And best of all, you can get information on virtually any topic, rather than just those that have pre-existing textbooks.

But I would still claim that using chatbots for learning shares some of the drawbacks of “learning from the library”. And for these reasons, it’s still worthwhile to consider online courses where they exist instead of relying solely on ChatGPT. Some of these drawbacks might seem counterintuitive, but let’s think about it.

Direction of Study

You might think, “I don’t need to set my own direction, ChatGPT will do that for me!” And yes, you can ask it to lay out a syllabus for you (I did this myself in one of the examples below). This will give you a list of topics to study.

But it won’t just write out the whole course for you based on this initial syllabus in one go. You have to keep prompting it to provide you with the information you want. And it will get sidetracked, consistently asking you to go deeper and deeper down particular rabbit holes.

So it’s still up to you to determine how much you really want to study about particular topics, and you need to maintain the discipline to pull it back out and shift gears. A human-designed course puts these limits in there for you, so that you don’t need to carry that cognitive load.

Filtering

This brings us to the next issue of “filtering”. ChatGPT will provide you with a lot of information, all at once. You’ll ask a simple question, and get a very complicated answer with lots of tables comparing various different ways of looking at the question.

Sometimes, this is nice. It will expose you to ideas you wouldn’t have thought of otherwise. Sometimes though, it’s very distracting. It takes you away from the core of what you’re trying to learn. You have to make sure you aren’t getting dragged into an infinite loop of concepts.

The “practice” problem also exists. ChatGPT can keep coming up with practice problems, but it’s up to you to know how many you really need to study. In our case study below, we’ll also consider that it’s not necessarily the best tool for coming up with practice problems.

Again, a human-designed course does the filtering and measuring for you.

Human Insight

Once at my job, I was reviewing a teammate’s code that implemented a complicated algorithm. I told him, “after I looked closely at this one particular line, my understanding of this algorithm went from like 30% to 70%, so adding an explanatory comment here would be very helpful!”

This experience helped me understand the idea of “knowledge inflection points”. These are the key insights that really help you understand a topic. I’ve had several of these with various Haskell concepts, from monads, to folds, to data structures and certain algorithms. I’ve done my best to incorporate these insights into my course content.

An example from Solve.hs might be my understanding of “the common API” of Haskell data structures. This made it much easier for me to reason about using different structures in Haskell.

An AI probably wouldn’t frame the issue in the way I did, unless you already have the knowledge to prompt it. AIs don’t have the experience of “learning” a concept piece-by-piece, and knowing when things finally “clicked”. You could try asking the chatbot what insights help people learn a topic, but it will only be able to try piecing that information together from what other people have written. On the whole, it still doesn’t beat the experience of someone who’s been there.

Human insights around learning are always going to get baked into a human-designed course, whereas AI is not generally going to be thinking in these terms.

Case Study: Learning Concurrency

I wanted to share a couple case studies that highlight some of the promise but also some of the frustrations with using AI for learning. Here’s a link to an extensive, multi-day study I did with ChatGPT to learn about concurrency topics. It helped me review a lot of topics I had learned in college (10 years ago), and also learn many new things. But there were still some pain points.

The “filtering” problem should be very evident. For each prompt I gave, ChatGPT provided tons of information. It was entirely up to me to figure out how much of this I really needed to know in order to be satisfied.

The “direction” problem is also clear. I started by asking for an organizational outline, and the chatbot duly obliged. But as I dug into certain topics, its preference was to ask me to keep going deeper down certain knowledge paths. I had to consistently drag it back to the syllabus it originally designed.

There were also no clear insights on what the key knowledge was. Over the course of the study, I figured some of these out for myself. But again, I had to filter through a lot of data to get there.

Another drawback I haven’t mentioned yet is the “memory” issue. Chatbots have limited, token-based memory, so they’ll forget what you’ve already learned over even a medium length study. My concurrency study introduced the idea of a “lock-free queue” using compare-and-swap operations early on. ChatGPT reintroduces this idea later as if I had never heard of it. Human-designed courses will avoid this sort of behavior.

I didn’t ask for practice problems in this study, so let’s consider another case study where I was specifically looking to do this in Haskell.

Case Study: Dijkstra’s Algorithm

In this quick study, I asked ChatGPT to come up with a practice problem for learning Dijkstra’s algorithm. Some things were good about its response, but some things weren’t.

On the positive side, the code works, the tests work, and some of the follow-up suggestions are also pretty good. For example, putting a bound on the number of nodes your path can have, or allowing multiple starts are simple extensions that didn’t occur to me when I was writing problems.

My main gripe is that the problems are a bit too obvious as graph problems. It started essentially with “implement Dijkstra’s algorithm” rather than giving me a practice problem using Dijkstra’s algorithm. And when I asked for a “disguised graph problem”, it gave me the delivery problem which wasn’t much of a disguise.

Also, the code used PSQueue, rather than the more beginner-friendly Data.Heap. This package may be better for certain things, but the type operator it uses would be a bit more confusing for a novice.

The line-by-line explanations were pretty good on the whole, but I don’t know that they’re a perfect substitute for really good visual/slide-based instructions like you would find in one of my courses.

With enough prompt engineering, you could get around these issues. But that’s exactly my point. It’s nice to not have to keep coming up with new prompts to get what you’re looking for, especially when you get a long explanation after every question.

Conclusion

Generative AI is a massive innovation for learning, especially on subjects that don’t have a lot of good guide material. But extensive, well-thought-out, human-designed content still has some significant advantages. The content is informed by the personal experience of someone who has actually been in your shoes and has had to learn something the same way you’ll learn it. This is not something an AI can relate to.

Prompt engineering involves a lot of cognitive effort. You have to constantly be directing the flow of what you’re supposed to learn, filter out the unnecessary parts, and then you have to learn it! While the freedom of being able to learn almost anything can be desirable, it can also be exhausting to always be directing the flow. It can be much easier and more helpful to just follow the lead of what another person has done.

I’ve used generative AI for learning and will continue to do so. But when human-designed content is available, I’ll look there first, and consider using AI as a supplement where I feel there are gaps.

When it comes to generating content, I don’t like AI as much, certainly not as a general purpose content producer. But it certainly has its uses. Looking back on course creation, I wish I had used it for writing test cases, for example. Another idea might be translating my work into other languages.

I’ll continue to experiment with AI going forward. But a solid guiding principle is that you should be using AI to enhance yourself, and not replace yourself. I still believe that human content has an edge over AI content for the same subject matter, so I encourage you to take another look at our courses at Monday Morning Haskell Academy, and to subscribe to our mailing list for future updates and discounts!

James Bowen 5/5/25

Comparing Courses: MMH vs. The Rest

Due to some technical issues, our Spring Sale has been extended! You have until Monday, May 12 to get 20% off of all our courses and bundles with the code SOLVE25, and you can get an even bigger 30% discount if you subscribe to our mailing list.

Having now released the final portion of Solve.hs (probably my last course for a while) I wanted to consider the broader landscape of Haskell courses. What other courses are out there? Are they better than mine?

So I’ve actually purchased a few other Haskell courses, and spent a decent amount of time going through their material. I may not be the smartest person to write a Haskell course and I definitely don’t have the most industry experience with Haskell. But, having explored some of these other courses, I think there are some good reasons to consider my courses among the top tier in the Haskell community.

So on the last day of this sale, I wanted to explore a few areas where I think my courses stand out above the rest.

Breadth of Material

There’s a common thread among most Haskell material out there, including and especially courses. They will generally all cover the same topics. You can generally expect to see all of the following in a Haskell course:

Basic Syntax and Types
Typeclasses and polymorphism
Basic Recursion
Understanding Functors, Applicatives & Monads
Using the IO monad
Basic use of the Map type

In some cases, you’ll also see something like a basic web server. And there’s a good reason for this progression. I covered the same material in Haskell From Scratch!

But there’s generally a lack of material in a lot of cool and interesting areas. I’ve done my best to cover a lot of these areas throughout my courses. Here are some of those topics, and the corresponding courses that cover them.

Data structures (beyond lists and maps) - Solve.hs
Algorithms - Solve.hs
Parsing Complex Data - Making Sense of Monads, Solve.hs
Advanced Web Servers - Practical Haskell
Complex Effect Stacks - Effectful Haskell, Practical Haskell
Unit Testing Details - Practical Haskell
Machine Learning - The Haskell Brain

Simply put, I haven’t found a Haskell resource anywhere else that puts all these concepts together in a course-like environment. You could potentially find some blog posts that discuss them, or read the documentation, but this leads to the next point.

Detailed and Challenging Exercises

Reading by itself is rarely enough to retain knowledge, especially when it comes to programming. If you read a great article about unit testing in Haskell, you’ll probably forget all the details and have to go back to it the next time you actually want to use the ideas.

You can try to follow along with the article by writing the code in your own IDE. But you’ll still probably end up just copying things, which also isn’t the best way to learn.

You can even try to devise your own project to use the knowledge. But there’s often a significant cognitive effort involved in coming up with a new idea that fits these requirements…different enough from the article that you’re actually testing yourself, but similar enough that you can actually apply the concept.

Great programming courses should provide exercises so that you can try the techniques in your own environment, without a spoon-fed answer already available to you. They should remove the overhead of coming up with your own way to test yourself, while also providing rapid feedback on whether or not you’ve succeeded.

Many Haskell courses I’ve seen don’t satisfy these criteria. Some don’t have any exercises at all. And the ones that do often have at least one of the following issues:

Only 1-2 problems per lecture
Problems are too easy
Lack of test cases
No starter code (i.e. you’re only given a written description)
No toolchain integration (i.e. you’re just given a file, but no project to work with or limited build instructions)

Every course on Monday Morning Haskell Academy comes with detailed exercises to help you learn the material. You’ll usually get several problems per lecture (4-6), and the starter code for these problems comes with full toolchain integration and instructions, plus automated unit test cases.

Difficulty is always going to be a bit subjective, but for most lectures I’ve made an effort to have some easier problems as well as more challenging ones.

Lecture Content and Slides

Naturally, the core content of the course is the lecture materials, so it’s worth talking about that as well! Some Haskell courses rely strictly on written material, but most incorporate slides and audio presentation.

For the most part, course authors do a fine job with their slides. But I think I go above and beyond the norm by using bold text to highlight the most important parts of the code presented, and using colors to show the relationship between different elements on the same slide and across slides.

With our courses, you’re able to get the slides as a downloadable asset. And with the level of detail on them, they serve as a useful reference for you to quickly come back to, even without listening again to the lecture audio.

Other Guarantees

Finally, it’s worth noting that our courses and bundles all come with a 14-day money back guarantee. If you don’t like the materials, you can get a refund within 14 days with no questions asked.

Additionally, all our courses guarantee lifetime access to the content. There’s no recurring subscription. So if your life is too busy to go through the full course right now, you can always save it for later!

So you may as well take a look at our course listings now, since you can get a 20% discount using the code SOLVE25 (today only!). If you subscribe to our mailing list, you’ll get an extra 10% off as well.

Our new bundles (e.g. Beginners & Advanced) are a great way to save money while exploring the full breadth of Haskell materials and topics we have to offer. If you get MMH Complete, you’ll get lifetime access to all our course content, past, present and future! So don’t miss out, take advantage of the sale today!

James Bowen 4/28/25

New Course Bundles!

With the release of the final module of Solve.hs last week, we now have 7 finished courses available at Monday Morning Haskell Academy. With this many courses, it might be a little challenging to pick the right one.

While we have a little guide on our website to help you pick, I also wanted to make it a bit easier to select the courses for the right level of experience, and also provide some really great deals on our course material.

So last week, we released 3 new course bundles to help you save. The 3 levels are Beginner, Advanced, and Complete.

Beginner Bundle

Our Beginner Bundle includes a total of 4 courses.

This bundle is great if you’re just starting out with Haskell, even if you haven’t even installed it or written a line before! The first two courses in this bundle will help you install your toolchain and learn the language fundamentals. Then after that, you’ll learn about some trickier Haskell concepts, like monads and advanced problem solving techniques.

Along the way, you’ll also get the chance to write a couple small projects to build your skills and confidence. With the progression of these courses, you can really go from “Zero Knowledge” to “Confident Haskell User”.

Advanced Bundle

Our Advanced Bundle is for Haskellers who’ve mastered the basics and are trying to learn how to apply Haskell in some more “real-world” settings. The courses are:

In the first three courses, you’ll learn about things like machine learning, writing web servers, deploying applications, and managing complex effect stacks.

Then you’ll see that our newly completed Solve.hs course appears in both bundles. It bridges the gap between basic problem solving skills, like manipulating lists and strings, and more advanced ideas, like implementing data structures from scratch and writing complex algorithms. So even if you’ve got some decent skills already, you’ll definitely still find quite a few challenges in this course!

MMH Complete

Finally, MMH Complete will give you access to our entire library of courses. You’ll get all 7 courses, at a substantial discount! Plus you are guaranteed to receive any new course content we come up with in the future.

Discounts!

Speaking of discounts, here are the discounts you would get for each bundle vs. purchasing each course individually:

Beginner Bundle - 20% off
Advanced Bundle - 30% off
MMH Complete - 35% off

Plus, this week, you can get an extra 20% off all courses and bundles using the code SOLVE25. If you want an even bigger discount, you can subscribe to our newsletter. You’ll get monthly updates AND a code for 30% off all products.

So don’t miss out on these offers! Head to the courses page now! Next week, they’ll be going away!

James Bowen 4/21/25

Solve.hs Module 4 Now Available!

Back in 2023, I introduced Solve.hs, my newest course focused on problem solving in Haskell. This course was inspired by my experiences solving programming puzzles with Haskell, especially by the feeling of how different it was compared to other languages.

Solve.hs will teach you all the core knowledge you need around data structures and algorithms to tackle not only these kinds of puzzles (which often appear as interview questions), but also the mindset shifts you have to make when solving them in Haskell.

In 2023, I released the first two modules, which focused on data structures, with a special emphasis on how Haskell uses linked lists. These also explored the patterns that replace ‘for’ and ‘while’ loops from other languages.

Then in 2024 I released module 3, which explained all of the most essential algorithms in great detail, and showed how we have to implement them differently in Haskell.

Finally, today, I am releasing the fourth and final module for this course! This module explains parsing in great detail. You’ll learn:

Basic string manipulation techniques for simple parsing
How to use libraries to parse common data formats (e.g. JSON)
How to use the Megaparsec library to parse any other kind of structured data
How to write your own monadic parser
How to use regular expressions for parsing in Haskell

These skills can be important in puzzle solving challenges where your input is just a string. But they’re also applicable in a wide variety of “real world” projects!

For the next 2 weeks, you can get Solve.hs for 20% off with the code SOLVE25. You can also get an extra 10% discount by subscribing to our newsletter!

After these 2 weeks are up, you’ll not only lose the discount, but the price of the course will go up to reflect the added material from module 4. This course will never be cheaper, so grab it now by going to the course page!

James Bowen 7/1/24

Solve.hs Module 3 + Summer Sale!

After 6 months of hard work, I am happy to announce that Solve.hs now has a new module - Essential Algorithms! You’ll learn the “Haskell Way” to write all of the most important algorithms for solving coding problems, such as Breadth First Search, Dijkstra’s Algorithm, and more!

You can get a 20% discount code for this and all of our other courses by subscribing to our mailing list! Starting next week, the price for Solve.hs will go up to reflect the increased content. So if you subscribe and purchase this week, you’ll end up saving 40% vs. buying later!

So don’t miss out, head to the course sales page to buy today!

James Bowen 1/15/24

Functional Programming vs. Object Oriented Programming

Functional Programming (FP) and Object Oriented Programming (OOP) are the two most important programming paradigms in use today. In this article, we'll discuss these two different programming paradigms and compare their key differences, strengths and weaknesses. We'll also highlight a few specific ways Haskell fits into this discussion. Here's a quick outline if you want to skip around a bit!

What is a Programming Paradigm?
The Object Oriented Paradigm
The Functional Paradigm
Functional Programming vs. OOP
OOP Languages
FP Languages
Advantages of Functional Programming
Disadvantages of Functional Programming
A Full Introduction to Haskell

What is a Programming Paradigm?

A paradigm is a way of thinking about a subject. It's a model against which we can compare examples of something.

In programming, there are many ways to write code to solve a particular task. Our tasks normally involve taking some kind of input, whether data from a database or commands from a user. A program's job is then to produce outputs of some kind, like updates in that database or images on the user's screen.

Programming paradigms help us to organize our thinking so that we can rapidly select an implementation path that makes sense to us and other developers looking at the code. Paradigms also provide mechanisms for reusing code, so that we don't have to start from scratch every time we write a new program.

The two dominant paradigms in programming today are Object Oriented Programming (OOP) and Functional Programming (FP).

The Object Oriented Paradigm

In object oriented programming, our program's main job is to maintain objects. Objects almost always store data, and they have particular ways of acting on other objects and being acted on by other objects (these are the object's methods). Objects often have mutable data - many actions you take on your objects are capable of changing some of the object's underlying data.

Object oriented programming allows code reuse through a system called inheritance. Objects belong to classes which share the same kinds of data and actions. Classes can inherit from a parent class (or multiple classes, depending on the language), so that they also have access to the data from the base class and some of the same code that manipulates it.

The Functional Paradigm

In functional programming, we think about programming in terms of functions. This idea is rooted in the mathematical idea of a function. A function in math is a process which takes some input (or a series of different inputs) and produces some kind of output. A simple example would be a function that takes an input number and produces the square of that number. Many functional languages emphasize pure functions, which produce the exact same output every time when given the same input.
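As a rough sketch, here is how that squaring example might look in Haskell. The names `square` and `sumOfSquares` are just illustrative choices, not functions from any library.

```haskell
-- A pure function: the result depends only on the argument,
-- so 'square 4' is 16 every single time we call it.
square :: Int -> Int
square x = x * x

-- Pure functions combine into larger pure functions.
sumOfSquares :: Int -> Int -> Int
sumOfSquares a b = square a + square b
```

Neither function reads or changes anything outside of its inputs, which is what makes the "same input, same output" guarantee possible.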

In programming, we may view our entire program as a function. It is a means by which some kind of input (file data or user commands), is transformed into some kind of output (new files, messages on our terminal). Individual functions within our program might take smaller portions of this input and produce some piece of our output, or some intermediate result that is needed to eventually produce this output.

In functional programming, we still need to organize our data in some way. So some of the ideas of objects/classes are still used to combine separate pieces of data in meaningful ways. However, we generally do not attach "actions" to data in the same way that classes do in OOP languages.

Since we don't perform actions directly on our data, functional languages are more likely to use immutable data as a default, rather than mutable data. (We should note though that both paradigms use both kinds of data in their own ways).
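To make this concrete, here is a minimal Haskell sketch. The `Player` type and the `addPoints` function are hypothetical names invented for this illustration. The data type only groups fields together, and the "action" is a free-standing function that returns a new value instead of changing the old one.

```haskell
-- Plain data: just fields, with no methods attached.
data Player = Player
  { playerName :: String
  , playerScore :: Int
  } deriving (Show, Eq)

-- A separate function that builds a NEW Player with a higher score.
-- The Player passed in is left unchanged.
addPoints :: Int -> Player -> Player
addPoints n p = p { playerScore = playerScore p + n }
```

If we have `p = Player "Ann" 10`, then `addPoints 5 p` gives us a player with a score of 15, while `p` itself still has a score of 10.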

Functional Programming vs. OOP

The main point of separation between these paradigms is the question of "what is the fundamental building block of my program?" In object oriented programming, our programs are structured around objects. Functions are things we can do to an object or with an object.

In functional programming, functions are always first class citizens - the main building block of our code. In object oriented programming, functions can be first class citizens, but they do not need to be. Even in languages where they can be, they often are not used in this way, since this isn't as natural within the object oriented paradigm.
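Here's a small sketch of what "first class" means in practice, using Haskell. The names `applyTwice` and `addN` are made up for this example. Functions can be passed as arguments and returned as results, just like any other value.

```haskell
-- Takes a function as an argument and applies it two times.
applyTwice :: (a -> a) -> a -> a
applyTwice f = f . f

-- Returns a brand new function built from its argument.
addN :: Int -> (Int -> Int)
addN n = \x -> x + n
```

So `applyTwice (addN 3) 10` first builds an "add 3" function, then hands it to `applyTwice`, which uses it twice on 10.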

Object Oriented Programming Languages

Many of the most popular programming languages are OOP languages. Java, for a long time the most widely used language, is perhaps the most archetypal OO language. All code must exist within an object, even in a simple "Hello World" program:

class MyProgram {
  public static void main(String[] args) {
    System.out.println("Hello World!");
  }
}

In this example, we could not write our 'main' function on its own, without the use of 'class MyProgram'.

Java has a single basic 'Object' class, and all other classes (including any new classes you write) inherit from it, picking up basic behaviors like 'toString' and 'equals'. Java classes only allow single inheritance. This means that a class cannot extend multiple parent classes (though it can implement multiple interfaces). Thus, all Java classes you would use can be mapped out on a tree structure with 'Object' as the root of the tree.

Other object oriented languages use the general ideas of classes, objects, and inheritance, but with some differences. C++ and Python both allow multiple inheritance, so that a class can inherit behavior from multiple existing classes. While these are both OOP languages, they are also more flexible in allowing functions to exist outside of classes. A basic script in either of these languages need not use any classes. In Python, we'd just write:

if __name__ == "__main__":
  print("Hello World!")

In C++, this looks like:

#include <iostream>

int main() {
  std::cout << "Hello World!" << std::endl;
}

These languages also don't have such a strictly defined inheritance structure. You can create classes that do not inherit from anything else, and they'll still work.

FP Languages

Haskell is perhaps the language that is most identifiable with the functional paradigm. Its type system and compiler really force you to adopt functional ideas, especially around immutable data, pure functions, and tail call optimization. It also embraces lazy evaluation, which is aligned with FP principles, but not a requirement for a functional language.

Several other programming languages generally get associated with the functional paradigm, including Clojure, OCaml, Lisp, Scala and Rust. These languages aren't all functional in the same way as Haskell; there are many notable differences. Lisp bills itself specifically as a multi-paradigm language, and Scala is built to run on the JVM and interoperate with Java! Meanwhile Rust's syntax looks more object oriented, but its inheritance system (traits) feels much more like Haskell. However, on balance, these languages express functional programming ideas much more than their counterparts.

Amongst the languages mentioned in the object oriented section, Python has the most FP features. It is more natural to write functions outside of your class objects, and concepts like higher order functions and lambda expressions are more idiomatic than in C++ or Java. This is part of the reason Python is often recommended for beginners, with another reason being that its syntax makes it a relatively simple language to learn.

Advantages of Functional Programming

Fewer Bugs

FP code has a deserved reputation for having fewer bugs. Anecdotally, I certainly find I have a much easier time writing bug free code in Haskell than Python. Many bugs in object oriented code are caused by the proliferation of mutable state. You might pass an object to a method and expect your object to come back unchanged...only to find that the method does in fact change your object's state. With objects, it's also very easy for unstated pre-conditions to pop up in class methods. If your object is not in the state you expect when the method is called, you'll end up with behavior you didn't intend.

A lot of function-based code makes these errors impossible by imposing immutable objects as the default, if not making it a near requirement, as Haskell does. When the function is the building block of your code, you must specify precisely what the inputs of the function are. This gives you more opportunities to determine pre-conditions for this data. It also ensures that the return results of the function are the primary way you affect the rest of your program.

Functions also tend to be easier to test than objects. It is often tricky to create objects with the precise state you want to assess in a unit test, whereas to test a function you only need to reproduce the inputs.

More Expressive, Reasonable Design

The more you work with functions as your building blocks, and the more you try to fill your code with pure functions, the easier it will be to reason about your code. Imagine you have a couple dozen fields on an object in OO code. If someone calls a function on that object, any of those fields could impact the result of the method call.

Functions give you the opportunity to narrow things down to the precise values that you actually need to perform the computation. They let you separate the essential information from superfluous information, making it more obvious what the responsibilities are for each part of your code.

Multithreading

You can do parallel programming no matter what programming language you're using, but the functional programming paradigm aligns very well with parallel processing. To kick off a new thread in any language, you pretty much always have to pass a function as an argument, and this is more natural in FP. And with pure functions that don't modify shared mutable objects, FP is generally much easier to break into parallelizable pieces that don't require complex locking schemes.
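As a minimal sketch of this idea, the example below uses `forkIO` and `MVar` from the `base` library's `Control.Concurrent` modules. The names `sumTo` and `parallelSums` are hypothetical, chosen only for this illustration. Each thread runs a pure function on its own input, so there is no shared mutable object to lock.

```haskell
import Control.Concurrent (forkIO)
import Control.Concurrent.MVar (newEmptyMVar, putMVar, takeMVar)

-- A pure function: safe to run on any thread, since it touches no shared state.
sumTo :: Int -> Int
sumTo n = sum [1 .. n]

-- Start one thread per input, then collect the results in input order.
parallelSums :: [Int] -> IO [Int]
parallelSums inputs = do
  boxes <- mapM spawn inputs
  mapM takeMVar boxes
  where
    spawn n = do
      -- Each thread gets its own empty box to put its answer in.
      box <- newEmptyMVar
      -- '$!' forces the sum inside the worker thread, not in the caller.
      _ <- forkIO (putMVar box $! sumTo n)
      pure box
```

Notice that we pass an action to `forkIO` to kick off each thread. The only coordination is each thread writing its result once into its own `MVar`. To actually spread these threads across multiple cores, the program would also need to be compiled with GHC's `-threaded` option and run with `+RTS -N`.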

Disadvantages of Functional Programming

Intuition of Complete Objects

Functional programming can feel less intuitive than object oriented programming. Perhaps one reason for this is that object oriented programming allows us to reason about "complete" objects, whose state at any given time is properly defined.

Functions are, in a sense, incomplete. A function is not a what that you can hold as a picture in your head. A function is a how. Given some inputs, how do you produce the outputs? In other words, it's a procedure. And a procedure can only really be imagined as a concrete object once you've filled in its inputs. This is best exemplified by the fact that functions have no native 'Show' instance in Haskell.

>> show (+)
No instance for Show (Integer -> Integer -> Integer) arising from a use of 'show'

If you apply the '+' function to arguments (and so create what could be called an "object"), then we can print it. But until then, it doesn't make much sense. If objects are the building block of your code though, you could, hypothetically, print the state of the objects in your code every step of the way.

Mutable State can be Useful!

As much as mutable state can cause a lot of bugs, it is nonetheless a useful tool for many problems, and decidedly more intuitive for certain data structures. If we just imagine something like the "Snake" game, it has a 2D grid that remains mostly the same from tick to tick, with just a couple things updating. This is easier to capture with mutable data.

Web development is another area where mutable objects are extremely useful. Anytime the user enters information on the page, some object has to change! Web development in FP almost requires its own paradigm (see "Functional Reactive Programming"). Haskell can represent mutable data, but the syntax is more cumbersome; you essentially need a separate data structure. Likewise, other functional languages might make mutability easier than Haskell, but mutability is still, again, more intuitive when objects are your fundamental building block, rather than functions on those objects.
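One example of such a separate structure is `IORef` from `Data.IORef` in the `base` library. This is only a small sketch, and `countEvens` is a hypothetical function written for this illustration. It shows the extra ceremony: we have to create the reference, update it through helper functions, and read it back explicitly, all inside `IO`.

```haskell
import Data.IORef (modifyIORef', newIORef, readIORef)

-- Count the even numbers in a list by updating a mutable counter.
countEvens :: [Int] -> IO Int
countEvens xs = do
  -- Create the mutable cell with a starting value of 0.
  counter <- newIORef (0 :: Int)
  -- Bump the counter once for every even element.
  mapM_ (\x -> if even x then modifyIORef' counter (+ 1) else pure ()) xs
  -- Read out the final value.
  readIORef counter
```

In a language with mutable variables by default, this would just be a counter and a loop. In Haskell, the mutation is possible, but it is more visible in both the code and the type signature.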

We can see this even with something as simple as loops. Haskell doesn't perform "for-loops" in the same way as other languages, because most for loops essentially rely on the notion that there is some kind of state updating on each iteration of the loop, even if that state is only the integer counter. To write loops in Haskell, you have to learn concepts like maps and folds, which require you to get very used to writing new functions on the fly.
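Here is a short sketch of how two common loop shapes might translate. The function names `squares`, `total` and `sumSquares` are illustrative only. A loop that transforms each element becomes a map, and a loop that accumulates a running value becomes a fold.

```haskell
import Data.List (foldl')

-- "For each x, append x * x to the output" becomes a map.
squares :: [Int] -> [Int]
squares = map (\x -> x * x)

-- "Start total at 0; for each x, add x to total" becomes a fold.
-- The accumulator argument plays the role of the loop's state.
total :: [Int] -> Int
total = foldl' (+) 0

-- Combining the two gives the equivalent of one loop doing both jobs.
sumSquares :: [Int] -> Int
sumSquares = total . squares
```

The lambda passed to `map` and the `(+)` passed to `foldl'` are the "functions on the fly" that stand in for the loop body.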

A Full Introduction to Haskell (and its Functional Aspects)

So functional programming languages are perhaps a bit more difficult to learn, but can offer a significant payoff if you put in the time to master the skills. Ultimately, you can use either paradigm for most kinds of projects and keep your development productive. It's down to your personal preference which you try while building software.

If you really want to dive into functional programming though, Haskell is a great language, since it will force you to learn FP principles more than other functional languages. For a complete introduction to Haskell, you should take a look at Haskell From Scratch, our beginner-level course for those new to the language. It will teach you everything you need to know about syntax and fundamental concepts, while providing you with a ton of hands-on practice through exercises and projects.

Haskell From Scratch also includes Making Sense of Monads, our course that shows the more functional side of Haskell by teaching you about the critical concept of monads. With these two courses under your belt, you'll be well on your way to mastery of functional programming! Head over here to learn more about these courses!

James Bowen 1/8/24

How to Write Comments in Haskell

Comments are often a simple item to learn, but there are a few ways we can get more sophisticated with them! This article is all about writing comments in Haskell. Here's a quick outline to get you started!

What is a Comment?
Single Line Comments
Multi-Line Comments
Inline Comments
Writing Formal Documentation Comments
Intro to Haddock
Basic Haddock Comments
Creating Our Haskell Report
Documenting the Module Header
Module Header Fields
Haddock Comments Below
Commenting Type Signatures
Commenting Constructors
Commenting Record Fields
Commenting Class Definitions
A Complete Introduction to the Haskell Programming Language

What is a Comment?

A comment is a non-code note you write in a code file. You write it to explain what the code does or how it works, in order to help someone else reading it. Comments are ignored by a language's compiler or interpreter. There is usually some kind of syntax to comments to distinguish them from code. Writing comments in Haskell isn't much different from other programming languages. But in this article, we'll look extensively at Haddock, a more advanced program for writing nice-looking documentation.

Single Line Comments

The basic syntax for comments in Haskell is easy, even if it is unusual compared to more common programming languages. In languages like Java, Javascript and C++, you use two forward slashes to start a single line comment:
```
#include <iostream>

int main() {
  // This line will print the string value "Hello, World!" to the console
  std::cerr << "Hello, World!" << std::endl;
}
```
But in Haskell, single line comments start with two hyphens, '--':
```
-- This is our 'main' function, which will print a string value to the console
main :: IO ()
main = putStrLn "Hello World!"
```
You can have these take up an entire line by themselves, or you can add a comment after a line of code. In this simple "Hello World" program, we place a comment at the end of the first line of code, giving instructions on what would need to happen if you extended the program.
```
main :: IO ()
main = -- Add 'do' to this line if you add another 'putStrLn' statement!
  putStrLn "Hello World!"
```
Multi-Line Comments

While you can always start multiple consecutive lines with whatever a comment line starts with in your language, many languages also have a specific way to make multiline comments. And generally speaking, this method has a "start" and an "end" sequence. For example, in C++ or Java, you start a multi line comment block with the characters '/*' and end it with '*/':
```
/*
This function returns a new list
that is a reversed copy of the input. 

It iterates through each value in the input 
and uses 'push_front' on the new copy.
*/
std::list<int> reverseList(const std::list<int>& ints) {
  std::list<int> result;
  for (const auto& i : ints) {
    result.push_front(i);
  }
  return result;
}
```
In Haskell, it is very similar. You use the brace and a hyphen character to open ('{-') and then the reverse to close the block ('-}').
```
{- This function returns a new list
 that is a reversed copy of the input.

 It uses a tail recursive helper function.
-}
reverse :: [a] -> [a]
reverse = reverseTail []
  where
    reverseTail acc [] = acc
    reverseTail acc (x : xs) = reverseTail (x : acc) xs
```
Notice we don't have to start every line in the comment with double hyphens. Everything in there is part of the comment, until we reach the closing character sequence. Comments like these with multiple lines are also known as "block comments". They are useful because it is easy to add more information to the comment without adding any more formatting.

Inline Comments

While you generally use the brace/hyphen sequence to write a multiline comment, this format is surprisingly also useful for a particular form of single line comments. You can write an "inline" comment, where the content is in between operational code on that line.
```
reverse :: [a] -> [a]
reverse = reverseTail []
  where
    reverseTail {- Base Case -}      acc [] = acc
    reverseTail {- Recursive Case -} acc (x : xs) = reverseTail (x : acc) xs
```
The fact that our comment has a start and end sequence means that the compiler knows where the real code starts up again. This is impossible when you use double hyphens to signify a comment, since everything after them on that line is part of the comment.

Writing Formal Documentation Comments

If the only people using this code will be you or a small team, the two above techniques are all you really need. They tell people looking at your source code (including your future self) why you have written things in a certain way, and how they should work. However, if other people will be using your code as a library without necessarily looking at the source code, there's a much deeper area you can explore.

In these cases, you will want to write formal documentation comments. A documentation comment tells someone what a function does, generally without going into the details of how it works. More importantly, documentation comments are usually compiled into a format for someone to look at outside of the source code.

These sorts of comments are aimed at people using your code as a library. They'll import your module into their own programs, rather than modifying it themselves. You need to answer questions they'll have like "How do I use this feature?", or "What argument do I need to provide for this function to work"?

You should also consider having examples in this kind of documentation, since these can explain your library much better than plain statements. A simple code snippet often provides way more clarification than a long document of function descriptions.

Intro to Haddock

As I mentioned above, formal documentation needs to be compiled into a format that is more readable than source code. In most cases, this requires an additional tool. Doxygen, for example, is one tool that supports many programming languages, like C++ and Python. Haskell has a special tool called Haddock.

Luckily, you probably don't need to go through any additional effort to install Haddock. If you used GHCup to install Haskell, then Haddock comes along with it automatically. (For a full walkthrough on getting Haskell installed, you can read our Startup Guide). It also integrates well with Haskell's package tools, Stack and Cabal.

In this article we'll use it through Stack. So if you want to follow along, you should create a new Haskell project on your machine with Stack, calling it 'HaddockTest'. Then build the code before we add comments so you don't have to wait for it later:
```
>> stack new HaddockTest
>> cd HaddockTest
>> stack build
```
You can write all the code from the rest of the article in the file 'src/Lib.hs', which Stack creates by default.

Basic Haddock Comments

Now let's see how easy it is to write Haddock comments! To write basic comments, you just have to add a vertical bar character after the two hyphens:
```
-- | Get the "block" distance of two 2D coordinate pairs
manhattanDistance :: (Int, Int) -> (Int, Int) -> Int
manhattanDistance (x1, y1) (x2, y2) = abs (x2 - x1) + abs (y2 - y1)
```
It still works even if you add a second line without the vertical bar. All comment lines until the type signature or function definition will be considered part of the Haddock comment.
```
-- | Get the "block" distance of two 2D coordinate pairs
-- This is the sum of the absolute difference in x and y values.
manhattanDistance :: (Int, Int) -> (Int, Int) -> Int
manhattanDistance (x1, y1) (x2, y2) = abs (x2 - x1) + abs (y2 - y1)
```
You can also make a block comment in the Haddock style. It involves the same character sequences as multi line comments, but once again, you just add a vertical bar after the start sequence. The end sequence does not need the bar:
```
{-| Get the "block" distance of two 2D coordinate pairs
 This is the sum of the absolute difference in x and y values.
-}
manhattanDistance :: (Int, Int) -> (Int, Int) -> Int
manhattanDistance (x1, y1) (x2, y2) = abs (x2 - x1) + abs (y2 - y1)
```
No matter which of these options you use, your comment will look the same in the final document. Next, we'll see how to generate our Haddock document. To contrast Haddock comments with normal comments, we'll add a second function in our code with a "normal" single line comment. We also need to add both functions to the export list of our module at the top:
```haskell
module Lib
  ( someFunc
  , manhattanDistance
  , euclideanDistance
  ) where

...

-- Get the Euclidean distance of two 2D coordinate pairs (not Haddock)
euclideanDistance :: (Double, Double) -> (Double, Double) -> Double
euclideanDistance (x1, y1) (x2, y2) = sqrt ((x2 - x1) ^ 2 + (y2 - y1) ^ 2)
```

Now let's create our document!

Creating Our Haskell Report

To generate our document, we just use the following command:
```bash
>> stack haddock
```

This will compile our code. At the end of the process, it will also inform us about what percentage of the elements in our code used Haddock comments. For example:

25% (  1 /  4) in 'Lib'
  Missing documentation for:
    Module header
    someFunc (src/Lib.hs:7)
    euclideanDistance (src/Lib.hs:17)

As expected, 'euclideanDistance' is not considered to have a Haddock comment. We also haven't defined a Haddock comment for our module header. We'll do that in the next section. We'll also get rid of the 'someFunc' expression, which is just a stub.

The 'stack haddock' command also generates HTML files for us, most importantly an index file! They get generated in the '.stack-work' directory, usually in a folder that looks like '{project}/.stack-work/install/{os}/{hash}/{ghc_version}/doc/'. For example, the full path of my index file in this example is:

/home/HaddockTest/.stack-work/install/x86_64-linux-tinfo6/6af01190efdb20c14a771b6e2823b492cb22572e9ec30114989156919ec4ab3a/9.6.3/doc/index.html

You can open the file with your web browser, and you'll find a mostly blank page listing the modules in your project, which at this point should only be 'Lib'. If you click on 'Lib', it will take you to a page that looks like this:

We can see that all three expressions from our file are there, but only 'manhattanDistance' has its comment visible on the page. What's neat is that the type links all connect to documentation for the base libraries. If we click on 'Int', it will take us to the page for the 'base' package module 'Data.Int', giving documentation on 'Int' and other integer types.

Documenting the Module Header

In the picture above, you'll see a blank space between our module name and the 'Documentation' section. This is where the module header documentation should go. Let's see how to add this into our code. Just as Haddock comments for functions should go above their type signatures, the module comment should go above the module declaration. You can start it with the same format as you would have with other Haddock block comments:

{-| This module exposes a couple functions
    related to 2D distance calculation.
-}
module Lib
  ( manhattanDistance
  , euclideanDistance
  ) where

...

If you rerun 'stack haddock' and refresh your Haddock page, this comment will now appear under 'Lib' and above 'Documentation'. This is the simplest thing you can do to provide general information about the module.

Module Header Fields

However, there are also additional fields you can add to the header that Haddock will specifically highlight on the page. Suppose we update our block comment to have these lines:

{-|
Module: Lib
Description: A module for distance functions.
Copyright: (c) Monday Morning Haskell, 2023
License: MIT
Maintainer: person@mmhaskell.com

The module has two functions. One calculates the "Manhattan" distance, or "block" distance on integer 2D coordinates. The other calculates the Euclidean distance for a floating-point coordinate system.
-}
module Lib
  ( manhattanDistance
  , euclideanDistance
  ) where

...

At the bottom of the multi line comment, after all the lines for the fields, we can put a longer description, as you see. After adding this, removing 'someFunc', and making our prior comment on Euclidean distance a Haddock comment, we now get 100% marks on the documentation for this module when we recompile it:

100% (  3 /  3) in 'Lib'

And here's what our HTML page looks like now. Note how the fields we entered are populated in the small box in the upper right.

Note that the short description we gave is now visible next to the module name on the index page. This page still only contains the description below the fields.
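
For reference, here's a sketch of what the whole 'src/Lib.hs' file might look like at this point. It only assembles the pieces from above: the header fields, the export list without 'someFunc', and a Haddock comment on each function. The exact wording of the 'euclideanDistance' comment is my assumption of how you'd convert the earlier "normal" comment.

```haskell
{-|
Module: Lib
Description: A module for distance functions.
Copyright: (c) Monday Morning Haskell, 2023
License: MIT
Maintainer: person@mmhaskell.com

The module has two functions. One calculates the "Manhattan" distance, or "block" distance on integer 2D coordinates. The other calculates the Euclidean distance for a floating-point coordinate system.
-}
module Lib
  ( manhattanDistance
  , euclideanDistance
  ) where

-- | Get the "block" distance of two 2D coordinate pairs
-- This is the sum of the absolute difference in x and y values.
manhattanDistance :: (Int, Int) -> (Int, Int) -> Int
manhattanDistance (x1, y1) (x2, y2) = abs (x2 - x1) + abs (y2 - y1)

-- | Get the Euclidean distance of two 2D coordinate pairs
-- (assumed wording: the earlier normal comment, now with a vertical bar)
euclideanDistance :: (Double, Double) -> (Double, Double) -> Double
euclideanDistance (x1, y1) (x2, y2) = sqrt ((x2 - x1) ^ 2 + (y2 - y1) ^ 2)
```

That gives Haddock three documented items: the module header and the two exported functions.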

Haddock Comments Below

So far, we've been using the vertical bar character to place Haddock comments above our type signatures. However, it is also possible to place comments below the type signatures, and this will introduce us to a new syntax technique that we'll use for other areas. The general idea is that we can use a caret character '^' instead of the vertical bar, indicating that the item we are commenting is "above" or "before" the comment. We can do this either with single line comments or block comments. Here's how we would use this technique with our existing functions:

manhattanDistance :: (Int, Int) -> (Int, Int) -> Int
-- ^ Get the "block" distance of two 2D coordinate pairs
manhattanDistance (x1, y1) (x2, y2) = abs (x2 - x1) + abs (y2 - y1)

euclideanDistance :: (Double, Double) -> (Double, Double) -> Double
{- ^ Get the Euclidean distance of two 2D coordinate pairs
     This uses the Pythagorean formula.
-}
euclideanDistance (x1, y1) (x2, y2) = sqrt ((x2 - x1) ^ 2 + (y2 - y1) ^ 2)

The comments will appear the same in the final documentation.

Commenting Type Signatures

The comments we've written so far have described each function as a unit. However, sometimes you want to make notes on specific function arguments. The most common way to write these comments in Haskell with Haddock is with the "above" style. Each argument goes on its own line with a "caret" Haddock comment after it. Here's an example:

-- | Given a base point and a list of other points, returns
-- the shortest distance from the base point to a point in the list.
shortestDistance ::
  (Double, Double) -> -- ^ The base point we are measuring from
  [(Double, Double)] -> -- ^ The list of alternative points
  Double
shortestDistance base [] = -1.0
shortestDistance base rest = minimum $ (map (euclideanDistance base) rest)

It is also possible to write these with the vertical bar above each argument, but then you will need a second line for the comment.

-- | Given a base point and a list of other points, returns
-- the shortest distance from the base point to a point in the list.
shortestDistance ::
  -- | The base point we are measuring from
  (Double, Double) ->
  -- | The list of alternative points
  [(Double, Double)] -> 
  Double
shortestDistance base [] = -1.0
shortestDistance base rest = minimum $ (map (euclideanDistance base) rest)

It is even possible to write the comments before AND on the same line as inline comments. However, this is less common since developers usually prefer seeing the type as the first thing on the line.
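
Another layout you'll often come across puts the arrow at the start of each line instead of the end, so that every argument's caret comment sits directly after its type. Here's a minimal sketch of that style using the same function. The comment on the return value is my own addition for illustration, and 'euclideanDistance' is repeated so the snippet stands on its own.

```haskell
module Lib
  ( shortestDistance
  ) where

-- | Get the Euclidean distance of two 2D coordinate pairs
euclideanDistance :: (Double, Double) -> (Double, Double) -> Double
euclideanDistance (x1, y1) (x2, y2) = sqrt ((x2 - x1) ^ 2 + (y2 - y1) ^ 2)

-- | Given a base point and a list of other points, returns
-- the shortest distance from the base point to a point in the list.
shortestDistance
  :: (Double, Double)   -- ^ The base point we are measuring from
  -> [(Double, Double)] -- ^ The list of alternative points
  -> Double             -- ^ The shortest distance (-1.0 if the list is empty)
shortestDistance _ [] = -1.0
shortestDistance base rest = minimum (map (euclideanDistance base) rest)
```

Which layout you pick is mostly a matter of taste. The leading-arrow version just makes it a little easier to document the result type as well.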

Commenting Constructors

You can also use Haddock comments for type definitions. Here is an example of a data type with different constructors. Each gets a comment.

data Direction =
  DUp    | -- ^ Positive y direction
  DRight | -- ^ Positive x direction
  DDown  | -- ^ Negative y direction
  DLeft    -- ^ Negative x direction

Commenting Record Fields

You can also comment record fields within a single constructor.

data Movement = Movement
  { direction :: Direction -- ^ Which way we are moving
  , distance  :: Int       -- ^ How far we are moving
  }

An important note is that if you have a constructor on the same line as its fields, a single caret comment will refer to the constructor, not to its last field.

data Point =
  Point2I Int Int       |      -- ^ 2d integral coordinate
  Point2D Double Double |      -- ^ 2d floating point coordinate
  Point3I Int Int Int   |      -- ^ 3d integral coordinate
  Point3D Double Double Double -- ^ 3d floating point coordinate
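
If you want to document both a constructor and its individual fields, one option is to give each constructor record syntax and put every field on its own line. Here's a sketch of how that could look. The 'Shape' type and the 'area' function are hypothetical, made up just for this illustration. The constructors use the vertical bar style, and the fields use the caret style.

```haskell
module Lib
  ( Shape (..)
  , area
  ) where

-- | A hypothetical 2D shape type, used only for illustration.
data Shape
  = -- | A circle
    Circle
      { radius :: Double -- ^ Distance from the center to the edge
      }
  | -- | An axis-aligned rectangle
    Rectangle
      { width  :: Double -- ^ Size along the x-axis
      , height :: Double -- ^ Size along the y-axis
      }

-- | Compute the area of a shape.
area :: Shape -> Double
area (Circle r) = pi * r * r
area (Rectangle w h) = w * h
```

Since each field has its own line, there's no ambiguity about whether a comment belongs to the constructor or to a field.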

Commenting Class Definitions

As one final feature, we can add these sorts of comments to class definitions as well. With class functions, it is usually better to use "before" comments with the vertical bar. Unlike constructors and fields, an "after" comment will get associated with the argument, not the method.

{-| The Polar class describes objects which can be described
    in "polar" coordinates, with a magnitude and angle
-}
class Polar a where
  -- | The total length of the item
  magnitude :: a -> Double 
  -- | The angle (in radians) of the point around the z-axis
  angle :: a -> Double

Here's what all these new pieces look like in our documentation:

You can see the way that each comment is associated with a particular field or argument.
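
Earlier I mentioned that examples can explain a library better than plain statements. Haddock has markup for this as well. A comment line starting with '>>>' is rendered as a REPL-style example, and the lines right after it are shown as the result. Here's a minimal sketch on our first function. The particular inputs are just ones I picked for illustration.

```haskell
module Lib
  ( manhattanDistance
  ) where

-- | Get the "block" distance of two 2D coordinate pairs.
-- This is the sum of the absolute difference in x and y values.
--
-- >>> manhattanDistance (0, 0) (3, 4)
-- 7
manhattanDistance :: (Int, Int) -> (Int, Int) -> Int
manhattanDistance (x1, y1) (x2, y2) = abs (x2 - x1) + abs (y2 - y1)
```

The empty comment line before the example keeps it separate from the description paragraph.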

A Complete Introduction to the Haskell Programming Language

Of course, comments are useless if you have no code or projects to write them in! If you're a beginner to Haskell, the fastest way to get up to writing project-level code is our course, Haskell From Scratch! This course features hours of video lectures, over 100 programming exercises, and a final project to test your skills! Learn more about it on this page!

James Bowen 1/1/24

How to Write “Hello World” in Haskell

In this article we're going to write the easiest program we can in the Haskell programming language. We're going to write a simple example program that prints "Hello World!" to the console. It's such a simple program that we can do it in one line! But it's still the first thing you should do when starting a new programming language. Even with such a simple program there are several details we can learn about writing a Haskell program. Here's a quick table of contents if you want to jump around!

Writing Haskell "Hello World"
The Simplest Way to Run the Code
Functional Programming and Types
Requirements of an Executable Haskell Program
Using the GHC Compiler
Using GHCI - The Haskell Interpreter
A Closer Look at Our Types
Compilation Errors
A Quick Look At Type Classes
Echo - Another Example Program
A Complete Introduction to the Haskell Programming Language

Now let's get started!

Writing Haskell "Hello World"

To write our "Haskell Hello World" program, we just need to open a file named 'HelloWorld.hs' in our code editor and write the following line:

main = putStrLn "Hello World!"

This is all the code you need! Even with just this one line, there's still another way you could write it. You could use the function 'print' instead of 'putStrLn':

main = print "Hello World!"

These programs will both accomplish our goal, but their behavior is slightly different! But to explore this, we first need to run our program!

The Simplest Way to Run the Code

Hopefully you've already installed the Haskell language tools on your machine. The old way to do this was through Haskell Platform, but now you should use GHCup. You can read our Startup Guide for more instructions on that! But assuming you've installed everything, the simplest way to run your program is to use the 'runghc' command on your file:

>> runghc HelloWorld.hs

With the first version of our code using 'putStrLn', we'll see this printed to our terminal:

Hello World!

If we use 'print' instead, we'll get this output:

"Hello World!"

In the second example, there are quotation marks! To understand why this is, we need to understand a little more about types, which are extremely important in Haskell code.

Functional Programming and Types

Haskell is a functional programming language with a strong, static type system. Even something as simple as our "Hello World" program is comprised of expressions, and each of these expressions has a type. For that matter, our whole program has a type!

In fact, virtually every Haskell program has the same type: 'IO ()'. The IO type signifies any expression which can perform Input/Output activities, like printing to the terminal and reading user input. Most functions you write in Haskell won't need to do these tasks. But since we're printing, we need the IO signifier. The second part of the type is the empty tuple, '()'. This is also referred to as the "unit type". When used following 'IO', it is similar to having a 'void' return value in other programming languages. (Strictly speaking, 'main' can have the type 'IO a' for any result type 'a', but the result gets discarded, so 'IO ()' is the convention.)

Now, our 'main' expression signifies our whole program, and we can explicitly declare it to have this type by putting a type signature above it in our code. We give the expression name, two colons, and then the type:

main :: IO ()
main = putStrLn "Hello World!"

Our program will run the same with the type signature. We didn't need to put it there, because GHC, the Haskell compiler, can usually infer the types of expressions. With more complicated programs, it can get stuck without explicit type signatures, but we don't have to worry about that right now.

Requirements of an Executable Haskell Program

Now if we give our main function a type that isn't an IO action, we won't be able to run our program! Our file is supposed to be an entry point - the root of an executable program. And Haskell has several requirements for such files.

These files must have an expression named 'main'. This expression must be an IO action, and by convention it has the type 'IO ()'. Finally, if we put a module name on our code, that module name should be Main. Module names go at the top of our file, prefaced by "module", and followed by the word "where". Here's how we can explicitly declare the name of our module:

module Main where

main :: IO ()
main = putStrLn "Hello World!"

Like the type signature on our function 'main', the module name is something GHC can fill in for us if we leave it out. But let's try giving it a different module name:

module HelloWorld where

main :: IO ()
main = putStrLn "Hello World!"

For most Haskell modules you write, using the file name (minus the '.hs' extension) IS how you want to name the module. But runnable entry point modules are different. If we use the 'runghc' command on this code, it will still work. However, if we get into more specific behaviors of GHC, we'll see that Haskell treats our file differently if we don't use 'Main'.

Using the GHC Compiler

Instead of using 'runghc', a command designed mainly for one-off files like this, let's try to compile our code more directly using the Haskell compiler. Suppose we have used HelloWorld as the module name. What files does it produce when we compile it with the 'ghc' command?

>> ghc HelloWorld.hs
[1 of 1] Compiling HelloWorld       ( HelloWorld.hs, HelloWorld.o )
>> ls
HelloWorld.hi HelloWorld.hs HelloWorld.o

This produces two output files beside our source module. The '.hi' file is an interface file. The '.o' file is an object file. Unfortunately, neither of these is runnable! So let's try changing our module name back to Main.

module Main where

main :: IO ()
main = putStrLn "Hello World!"

Now we'll go back to the command line and run it again:

>> ghc HelloWorld.hs
[1 of 2] Compiling Main       ( HelloWorld.hs, HelloWorld.o )
[2 of 2] Linking HelloWorld
>> ls 
HelloWorld HelloWorld.hi HelloWorld.hs HelloWorld.o

This time, things are different! We now have two compilation steps. The first says 'Compiling Main', referring to our code module. The second says 'Linking HelloWorld'. This refers to the creation of the 'HelloWorld' file, which is executable code! (On Windows, this file will be called 'HelloWorld.exe'). We can "run" this file on the command line now, and our program will run!

>> ./HelloWorld
Hello World!

Using GHCI - The Haskell Interpreter

Now there's another simple way for us to run our code. We can also use the GHC Interpreter, known as GHCI. We open it with the command 'ghci' on our command line terminal. This brings us a prompt where we can enter Haskell expressions. We can also load code from our modules, using the ':load' command. Let's load our hello world program and run its 'main' function.

>> ghci
GHCI, version 9.4.7: https://www.haskell.org/ghc/   :? for help
ghci> :load HelloWorld
[1 of 2] Compiling Main          ( HelloWorld.hs, interpreted )
ghci> main
Hello World!

If we wanted, we could also just run our "Hello World" code in the interpreter itself:

ghci> putStrLn "Hello World!"
Hello World!

It's also possible to assign our string to a value and then use it in another expression:

ghci> let myString = "Hello World!"
ghci> putStrLn myString
Hello World!

A Closer Look at Our Types

A very useful function of GHCI is that it can tell us the types of our expressions. We just have to use the ':type' command, or ':t' for short. We have two expressions in our Haskell program: 'putStrLn', and "Hello World!". Let's look at their types. We'll start with "Hello World!":

ghci> :type "Hello World!"
"Hello World!" :: String

The type of "Hello World!" itself is a 'String'. This is the name given for a list of characters. We can look at the type of an individual character as well:

ghci> :type 'H'
'H' :: Char

What about 'putStrLn'?

ghci> :t putStrLn
putStrLn :: String -> IO ()

The type for 'putStrLn' looks like 'String -> IO ()'. Any type with an arrow in it ('->') is a function. It takes a 'String' as an input and it returns a value of type 'IO ()', which we've discussed. In order to apply a function, we place its argument next to it in our code. This is very different from other programming languages, where you usually need parentheses to apply a function on arguments. Once we apply a function, the type of the resulting expression is just whatever is on the right side of the arrow. So applying the function 'putStrLn' to our string, we get 'IO ()' as the resulting type!

ghci> :t putStrLn "Hello World!"
putStrLn "Hello World!" :: IO ()
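
The same idea applies to functions we write ourselves. Here's a small sketch with a hypothetical function 'greet' (not part of our original program) that has the type 'String -> String'. We apply it by placing the argument next to it. The parentheses in 'main' only group the expression, so that 'putStrLn' receives the result of 'greet "World"' as its single argument.

```haskell
-- 'greet' is a hypothetical helper, made up for this example.
greet :: String -> String
greet name = "Hello " ++ name ++ "!"

main :: IO ()
main = putStrLn (greet "World")
```

Applying 'greet' to a 'String' gives us back a 'String', which is exactly what 'putStrLn' needs as its input.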

Compilation Errors

For a different example, let's see what happens if we try to use an integer with 'putStrLn':

ghci> putStrLn 5
No instance for (Num String) arising from the literal '5'

The 'putStrLn' function only works with values of the 'String' type, while 5 has a type more like 'Int'. So we can't use these expressions together.

A Quick Look At Type Classes

However, this is where 'print' comes in. Let's look at its type signature:

ghci> :t print
print :: Show a => a -> IO ()

Unlike 'putStrLn', the 'print' function takes a more generic input. A "type class" is a general category describing a behavior. Many different types can perform the behavior. One such class is 'Show'. The behavior is that Show-able items can be converted to strings for printing. The 'Int' type is part of this type class, so we can use 'print' with it!

ghci> print 5
5
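
We can make our own types part of the 'Show' class too. Here's a minimal sketch with two hypothetical types, 'Color' and 'Coord', that aren't part of our program. The first asks the compiler to derive the behavior for us, and the second defines the class's 'show' function by hand.

```haskell
-- Both types are hypothetical, made up for this example.
data Color = Red | Green | Blue
  deriving (Show) -- The compiler writes the Show instance for us

data Coord = Coord Int Int

-- A hand-written instance, describing how to convert a Coord to a String
instance Show Coord where
  show (Coord x y) = "(" ++ show x ++ ", " ++ show y ++ ")"

main :: IO ()
main = do
  print Red
  print (Coord 1 2)
```

Since both types belong to 'Show', we can use 'print' with either of them.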

When we use 'show' on a string, Haskell adds quotation marks to the string. This is why it looks different to use 'print' instead of 'putStrLn' in our initial program:

ghci> print "Hello World!"
"Hello World!"

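To make the connection concrete, we can think of 'print' as calling 'show' on its input and then passing the result to 'putStrLn'. Here's a sketch of that idea. The name 'myPrint' is a hypothetical stand-in I'm using so we don't clash with the real 'print'.

```haskell
-- 'myPrint' is a hypothetical stand-in that mirrors how 'print' behaves.
myPrint :: Show a => a -> IO ()
myPrint = putStrLn . show

main :: IO ()
main = do
  putStrLn "Hello World!"  -- Prints the string as-is
  myPrint "Hello World!"   -- 'show' adds the quotation marks first
  myPrint (5 :: Int)       -- Works for any type in the Show class
```
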
Echo - Another Example Program

Our Haskell "Hello World" program is the most basic example of a program we can write. It only showed one side of the input/output equation. Here's an "echo" program, which first waits for the user to enter some text on the command line and then prints that line back out:

main :: IO ()
main = do
  input <- getLine
  putStrLn input

Let's quickly check the type of 'getLine':

ghci> :t getLine
getLine :: IO String

We can see that 'getLine' is an IO action returning a string. When we use the backwards arrow '<-' in our code, this means we unwrap the IO value and get the result on the left side. So the type of 'input' in our code is just 'String', meaning we can then use it with 'putStrLn'! Then we use the 'do' keyword to string together two consecutive IO actions. Here's what it looks like to run the program. The first line is us entering input, the second line is our program repeating it back to us!

>> runghc Echo.hs
I'm entering input!
I'm entering input!
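
Because 'input' is just a 'String', we can also pass it through an ordinary function before printing it. Here's a small variation on the echo program as a sketch. The 'shout' function is a hypothetical helper I made up for this example, and it doesn't involve 'IO' at all.

```haskell
import Data.Char (toUpper)

-- 'shout' is a hypothetical helper: upper-case the text and add a '!'
shout :: String -> String
shout input = map toUpper input ++ "!"

main :: IO ()
main = do
  input <- getLine        -- input :: String
  putStrLn (shout input)  -- shout input :: String
```

Only 'getLine' and 'putStrLn' need the 'IO' type here. The transformation in between is a normal function on strings.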

A Complete Introduction to the Haskell Programming Language

Our Haskell "Hello World" program is the most basic thing you can do with the language. But if you want a comprehensive look at the syntax and every fundamental concept of Haskell, you should take our beginners course, Haskell From Scratch.

You'll get several hours of video lectures, plus a lot of hands-on experience with 100+ exercise problems with automated testing.

All-in-all, you'll only need 10-15 hours to work through all the material, so within a couple weeks you'll be ready for action! Read more about the course here!

James Bowen 11/27/23

Black Friday Sale: Last Day!

We've come to Cyber Monday, marking the last day of our Black Friday sale! Today is your last chance to get big discounts on all of our courses. You can get 20% off by using the code BFSOLVE23 at checkout. Or you can subscribe to our mailing list to receive a 30% discount code. You must use these codes by the end of the day in order to get the discount!

Here's a final runthrough of the courses we have available, including our newest course, Solve.hs!

Solve.hs

We just released the first part of our newest course last week! These two detailed modules dive into the fundamentals of problem solving in Haskell. You'll get to rewrite the list type and most of its API from scratch, teaching you all the different ways you can write "loop" code in Haskell. Then you'll get an in-depth look at how data structures work in Haskell, including the quick process to learn a data structure from start to finish!

Course Page

Normal Price: $89 Sale Price: $71.20 Subscriber Price: $62.30

Haskell From Scratch

This is our extensive, 7-module beginners course. You'll get a complete introduction to Haskell's syntax and core concepts, including things like monads and tricky type conversions.

Course Page

Normal Price: $99 Sale Price: $79.20 Subscriber Price: $69.30

Practical Haskell

Practical Haskell is designed to break the idea that "Haskell is only an academic language". In our longest and most detailed course, you'll learn the ins and outs of communicating with a database in Haskell, building a web server, and connecting that server to a functional frontend page. You'll also learn about the flexibility that comes with Haskell's effect systems, as well as best practices for testing your code, including tricky test cases like IO based functions!

Course Page

Normal Price: $149 Sale Price: $119.20 Subscriber Price: $104.30

Making Sense of Monads

The first of our shorter, more targeted courses, Making Sense of Monads will teach you how to navigate monads, one of Haskell's defining concepts. This idea is a bit tricky at first but also quite important for unleashing Haskell's full power. The course is well suited to beginners who know all the basic syntax but want more conceptual practice.

Note that Making Sense of Monads is bundled with Haskell From Scratch. So if you buy the full beginners course, you'll get this in-depth look at monads for free!

Course Page

Normal Price: $29 Sale Price: $23.20 Subscriber Price: $20.30

Effectful Haskell

If Making Sense of Monads is best for teaching the basics of monads, Effectful Haskell will show you how to maximize the potential of this idea. You'll develop a more complete idea of what we mean by "effects" in your code. You'll see a variety of ways to incorporate them into your code and learn some interesting ideas about effect substitution!

Course Page

Normal Price: $39 Sale Price: $31.20 Subscriber Price: $27.30

Haskell Brain

Last, but not least, Haskell Brain will teach you how to perform machine learning tasks in Haskell with TensorFlow. There are a lot of steps involved in linking these two technologies. So while machine learning is a valuable skill to have in today's world, understanding the ways we can link software together is almost as valuable!

Course Page

Normal Price: $39 Sale Price: $31.20 Subscriber Price: $27.30

Conclusion

So don't miss out on this special offer! You can use the code BFSOLVE23 for 20% off, or you can subscribe to our mailing list to get a code for 30% off! This offer ends tonight, so don't wait!

James Bowen 11/24/23

Spotlight: Quick, Focused Haskell Courses

A couple days ago I gave a brief spotlight on the longer, more in-depth courses I've written. The newest of these is Solve.hs, with its focus on problem solving, and the original two I wrote were Haskell From Scratch and Practical Haskell.

After my first two courses, I transitioned towards writing a few shorter courses. These are designed to teach vital concepts in a shorter period of time. They all consist of just a single module and have a shorter total lecture time (1.5 to 2 hours each). You can finish any of them in a concentrated 1-2 week effort. Today I'll give a brief summary of each of these, listed from most abstract to most practical, and easiest to hardest.

Remember, all of these are on sale at 20% off using the code BFSOLVE23 at checkout! You can also subscribe to our mailing list to get an even bigger discount, at 30% off!

Making Sense of Monads

This is for those of you who have been writing Haskell long enough that you've got the hang of the syntax, but you still struggle a bit to understand monads. You might look at parts of Modules 4 and 5 from Haskell From Scratch and think they look useful, but you don't think you need the rest of the course.

Making Sense of Monads really "zooms in" on Module 5. It goes deeper in understanding all of the simpler structures that help us understand monads, and it gives a sizable amount of practice with writing monadic code. You'll also get a crash course on parsing (a common use of monadic operations), and write two fairly complex parsers. So it's a great option if you want a shorter but more concentrated approach on some of the basics!

Effectful Haskell

Effectful Haskell takes a lot of the core ideas and concepts in Making Sense of Monads and goes one step beyond into the more practical realm of applying monadic effects in a program. You'll learn more abstractly what an effect is, but then also the different ways to incorporate polymorphic effects into your Haskell program. You'll see how to use monads and monad classes to swap effectful behaviors in your program, and why this is useful.

This course culminates in a similar (but smaller) project to Practical Haskell, where you'll deploy an effectful web server to Heroku.

Haskell Brain

This course is the hardest and most practically-oriented of this series. You will take on the challenge of incorporating TensorFlow and machine learning into Haskell. This is easier said than done, because TensorFlow has many dependencies beyond the normal packages you can simply pick up on Hackage. So you'll gain valuable experience going through this installation process, and then we'll run through some of the main information you need to know when it comes to creating tensors in Haskell, and building moderately complex models.

Conclusion

So while these courses are shorter, they still pack a decent amount of material! And with the subscriber discount, you can get each of them for less than $30! This offer will only last until Monday though, so make up your mind quickly!

James Bowen 11/22/23

Spotlight: In-Depth Haskell Courses!

On Monday, I announced the release of Part 1 of Solve.hs, our newest course focused on problem solving. In addition, this entire week is our Black Friday sale, with two options to save on any of our courses, including the newest one. Your first option is to just use the code BFSOLVE23, which will get you 20% off. You can also get a special discount code by signing up for our mailing list, which will get you 30% off!

Some of our courses are shorter and more focused, while others are longer and cover more ground. Today on the blog I wanted to highlight the longer courses available at Monday Morning Haskell Academy. These go into a lot of detail on many different subjects, so they can fill a large gap in your Haskell awareness. However, each obviously has its own general subject area. Here they are, ordered from most beginner-level to most advanced.

Haskell From Scratch

Our very first course, Haskell From Scratch, is perfect for the curious reader of Haskell who has rarely (if ever) gotten around to writing any lines of Haskell code. Over the course of 7 modules, you'll learn everything you really need to get started writing some code. We'll go over a bit of basic setup, cover Haskell's core syntax extensively, and go over many of the important conceptual ideas that separate Haskell from other languages, like immutability, laziness, and monads. You will get many chances to try your new skills out with an abundant series of practice problems, as well as a final project!

Learn more about it on the course page!

Solve.hs

Solve.hs is our newest course, with the first two modules released this week! It will teach you the fundamental patterns of problem solving in Haskell. You'll start with an in-depth look at Haskell's lists and how we can use them to implement loop patterns. Then you'll learn A LOT of details about Haskell's data structures, including some implementation ideas that are unique to Haskell. It's a great choice if you have a solid grasp on the basic syntax and fundamental concepts of Haskell, but are looking to improve your fluency in the language.

Find out more by going here!

Practical Haskell

Practical Haskell is, by some distance, our longest and most detailed course, with close to 6 hours of lectures and an additional 4 hours of screencast content spread over 5 modules. You'll build different parts of a functioning website, starting with the database connections, then the server, and finally a frontend written in Elm. You'll also learn how to launch this server on Heroku, as well as some tips for best practices in testing.

Take a look on the MMH Academy page!

Conclusion

These courses are all great for helping you take the next big step in your Haskell journey. Until next Monday, you can get 20% off all of them using the code BFSOLVE23 when you check out! And you can get an even bigger discount (30%) if you subscribe to our mailing list! So don't miss out on this opportunity; you've only got 5 more days!

James Bowen 11/20/23

New Course: Solve.hs!

A few weeks ago I hinted that, while I hadn't been able to publish consistently for much of this year, I had been working on something important. And now, I'm finally ready for the big reveal!

Today I am excited to announce the release of Part 1 of my new course Solve.hs. This course is focused on problem solving in Haskell. This announcement also kicks off the Black Friday sale week for Monday Morning Haskell, where you can get 20% off any of our courses with the code BFSOLVE23. And if you subscribe to our mailing list, you'll get an even bigger discount, 30% off on everything, including Solve.hs!

Why Problem Solving?

I settled on problem solving as a great topic for a course because it occupies a good middle ground between, on the one hand, super basic setup and syntax (covered by Setup.hs and Haskell From Scratch, respectively), and on the other hand, more complex practical topics covered by courses like Practical Haskell, Effectful Haskell, and Haskell Brain.

Problem solving is a relatively quick and frictionless path to getting familiar with a language (especially one like Haskell that defies many conventions from more common languages). Once you have your toolchain set up, it's not hard to find problems that will quickly stretch your understanding of the language and even teach you some pretty tricky concepts! With problem solving, it's also easier to do a lot of different problems and get a lot of "reps" in, so to speak.

My own Haskell career trajectory went pretty quickly from learning the basics to jumping into practical applications like web servers. This is entirely doable, but it does run the risk of pigeon-holing you into some pretty specific areas of understanding, causing you to skip many of the fundamentals.

Problem solving is a great tool to fill this gap. It forces you to learn new techniques that you might not need on a particular project, but which will help you grow and which you can apply to future projects. Let's learn a bit more about the course itself.

The Course

This course release is exciting for me because, even just considering the first part, it's the longest and most detailed course I've written since Practical Haskell. It doesn't appear to be that long, consisting of only two modules. But by several important metrics, it actually rivals the 7-module Haskell From Scratch. It has nearly as much lecture material, about as many practice problems, and the amount of code you'll actually have to write for the exercises is quite a bit more!

Course	Lecture Time	Practice Problems	Solution LOC*
Haskell From Scratch	4h05m	~180	1316
Solve.hs	3h47m	~180	2587

*Solution LOC measures the GitHub diff comparing my answers branches to the original

Solve.hs is good for users with many levels of Haskell experience. Beginner and intermediate Haskellers will benefit a lot from learning the core patterns in this first part, and even experienced Haskellers will find some of the practice problems challenging. To see why, let's look at the two modules currently on offer!

Module 1: Lists and Loop Patterns

In the first module, we'll do extensive work with lists. Lists are one of the basic building blocks of Haskell - the simplest collection type you can have. To use them well, and so become "fluent" in Haskell, you need to familiarize yourself with a lot of different patterns of how to use them.

In Module 1, we'll explicitly name these patterns and draw helpful comparisons to code from other languages. This will help you finally understand how to answer questions like "how do I write a for-loop in Haskell?"

You'll learn large parts of the List API by implementing the type from scratch, as well as dozens of its helper functions. This will help you drill in the patterns of how lists work, and what operations we can (and cannot) do efficiently with them.

Module 2: Data Structures

Once you're well-versed in all the things Haskell can do with lists, it will be time to learn about other structures that allow for different efficient operations. This is the subject of Module 2. Now I've written plenty about Data Structures before. In fact there are a couple ebooks you can check out if you want.

But this module goes way deeper. In Solve.hs, we'll look into how some of these structures are implemented under the hood, and specifically how they're implemented differently in Haskell compared to other languages.

There are plenty of practice problems to help you get a feel for each structure. You'll learn about some of my favorite "derived structures" - abstract ideas that use the core structures under the hood, but express specific operations cleanly. On top of that, you'll get to implement a couple different data structures from scratch, which is super helpful for training yourself to understand how these structures perform in real life.

Part 2 - 2024

I wanted to release Part 1 of this course now, since these two modules on their own can be super impactful for beginning and intermediate Haskellers. Also, Advent of Code is coming up, a time when I have focused this blog's attention on problem solving anyway for the last couple years. So this course makes a perfect match for those of you who want to dive into Advent of Code this year, writing all your solutions in Haskell!

However, since problem solving is such a rich field with many areas to explore, I plan to add two more modules to this course in 2024. Module 3 will cover Essential Algorithms, with a special focus on graph algorithms in Haskell. Then Module 4 will go in-depth with Parsing, since many problem solving contests like Advent of Code do require you to parse your problem input, often from odd formats. I also intend to incorporate some material related to scale and performance testing.

After the full course release, Part 1 will continue to be its own option for purchase. There is, however, no risk in buying Part 1 now. Anyone who purchases Part 1 before the full release will receive a discount coupon for the full course based on what they paid for Part 1.

So, if you're interested, head to the course sales page to learn more and enroll in the course! But first make sure to subscribe to the Monday Morning Haskell mailing list, which will give you a 30% discount code for this and all of our other courses! This offer will only be good until Monday, November 27th, so don't wait!

James Bowen 11/13/23

Ballparking Solutions

In last week's article, we went over how to build a simple benchmarking library for ourselves by timing computations. Now most of the time, we're inclined to use benchmarking in a purely reactive way. We write a solution to a problem, and then document how long it takes.

But perhaps the most important use of benchmarking is proactive. In this article, we'll learn how to use ballpark estimates to guide our problem-solving process. This can save you a lot of time when you're working on performance-intensive problems, whether on Advent of Code, Hackerrank, or, of course, in the real world of software development!

The Uses of Ballparking

A ballpark estimate essentially allows us to provide an order of magnitude on the time we expect our solution to take. We should be able to abstractly define the runtime of our solution with Big-O notation. And given this designation, as well as a problem size, we should be able to determine if we'll get an answer within a second, a few seconds, a few minutes, or a few hours.

Using this ballpark estimate, we can then determine if our proposed solution is even feasible, without needing to implement the solution. Whether we're solving a problem on Advent of Code, Hackerrank, Leetcode, or in the real world, we can usually have some idea of the scale of a problem. AoC will let you observe the "large" problem input on which to test your code. Hackerrank will give you an idea of the bounds of various problem parameters.

These problem parameters can, in fact, tell us if a solution is acceptable before we even write that solution. For example, if my input problem size is 10000, roughly how long should my program take if it's O(n)? What about O(n log n)? What about O(n^2)?
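As a rough first pass at those questions, we can count abstract steps for each complexity class before timing anything. This is only a sketch: the name stepCounts is made up for illustration, and the numbers are step counts, not seconds, so they still need measured timings to become real estimates.

```haskell
-- Hypothetical helper (not part of the benchmarking library):
-- abstract step counts for a problem of size n.
stepCounts :: Int -> [(String, Double)]
stepCounts n =
  [ ("O(n)", x)
  , ("O(n log n)", x * logBase 2 x)
  , ("O(n^2)", x ^ (2 :: Int))
  , ("O(n^3)", x ^ (3 :: Int))
  ]
  where
    x = fromIntegral n
```

For n = 10000 this gives ten thousand steps for O(n), a little over a hundred thousand for O(n log n), a hundred million for O(n^2), and a trillion for O(n^3).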

Now it's up to situational judgment to determine what's "acceptable". As a general rule, no one wants to wait around for hours for their code to finish. And yet, in real world machine learning development, taking several hours to train a model on the latest data is quite commonplace. However, Hackerrank will reject your solution (or at least not give you full points) if it takes more than a few seconds on the largest input. In Advent of Code, meanwhile, you run the code locally, and you only need to get the answer once. So if your code takes 30 minutes or even an hour to run, you can still use it if you're sure the algorithm is correct (I've done this a few times).

Getting Ballpark Values

So how do we get an idea for these ballpark values? How do we estimate how an algorithm will perform on some data? Well we can use the library we built last time! All we have to do is select some known operations and time how long they take on varying sizes of inputs. So we can choose some candidate functions for different run times:

O(n) -> Take the sum of a list of integers
O(n log n) -> Sort a list of values
O(n^2) -> Take the intersection of two lists of size n (with the naive Data.List algorithm)
O(n^3) -> Populate a "cubic" multiplication table.

For each operation we can propose different sizes, and see what the times are. We'll do this with sum first. To start, let's write a function to generate a randomized list of integers, given a range and a size:

import Control.Monad (replicateM)
import System.Random (randomRIO)

randomList :: (Int, Int) -> Int -> IO [Int]
randomList rng n = replicateM n (randomRIO rng)

Now we'll generate random lists for many different sizes:

sizes :: [Int]
sizes = [10, 100, 1000, 10000, 100000, 1000000, 10000000]

main :: IO ()
main = do
  listsWithSizes <- zip sizes <$> mapM (randomList (1, 10000)) sizes
  ...

And now we just use our timeOp function from the last article on each list. We'll print out the time it takes, followed by the size:

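import Control.DeepSeq (NFData)
import Control.Monad (forM_)
import Data.Time.Clock (NominalDiffTime)
import qualified System.IO.Strict as S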

timeOp :: (NFData b) => (a -> b) -> a -> S.SIO NominalDiffTime
...

main :: IO ()
main = do
  listsWithSizes <- zip sizes <$> mapM (randomList (1, 10000)) sizes
  putStrLn "Sum"
  forM_ listsWithSizes $ \(sz, lst) -> do
    t <- S.run (timeOp sum lst)
    putStr (show t)
    putStrLn $ " (" <> show sz <> ")"

With these sizes, we get the following output:

Sum
0.000039492s (10)
0.000024104s (100)
0.000031986s (1000)
0.000257356s (10000)
0.001830953s (100000)
0.014561602s (1000000)
0.287729319s (10000000)

And so we can see that if our algorithm is O(n), we can get an answer in less than a second, even for extremely large input sizes! If we went up to 100 million, it would start taking multiple seconds.
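If you don't have last week's timeOp at hand, a rough stand-in can be sketched in plain IO. This is not the function from the previous article: the name timeOpIO, the Double return type, and the extra NFData constraint on the input are all assumptions made for this sketch. It forces the input before starting the clock, so that lazily generating the list is not counted as part of the operation.

```haskell
import Control.DeepSeq (NFData, force)
import Control.Exception (evaluate)
import GHC.Clock (getMonotonicTime)

-- Hypothetical stand-in for timeOp: elapsed wall-clock seconds for
-- fully evaluating (f x). Not the implementation from the last article.
timeOpIO :: (NFData a, NFData b) => (a -> b) -> a -> IO Double
timeOpIO f x = do
  -- Force the input first so we only measure the operation itself.
  x' <- evaluate (force x)
  start <- getMonotonicTime
  -- Force the whole result, not just its outermost constructor.
  _ <- evaluate (force (f x'))
  end <- getMonotonicTime
  pure (end - start)
```

Calling timeOpIO sum on one of the random lists above should give numbers of the same order of magnitude as the table, though exact values will vary by machine.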

Larger Size Examples

Obviously though, O(n) is generally the best case scenario we can get, and most problems don't have such a solution. So we can start trying more expensive solutions. For O(n log n), we'll do sorting. Now that we've got our template, it's easy to write this code:

import qualified Data.List as L

main :: IO ()
main = do
  listsWithSizes <- zip sizes <$> mapM (randomList (1, 10000)) sizes
  putStrLn "Sorting"
  forM_ listsWithSizes $ \(sz, lst) -> do
    t <- S.run (timeOp L.sort lst)
    putStr (show t)
    putStrLn $ " (" <> show sz <> ")"

And we get our results. With the same sizes, we see much longer times.

0.000011084s (10)
0.000028496s (100)
0.000264609s (1000)
0.004021375s (10000)
0.086308336s (100000)
1.691808146s (1000000)
39.815963511s (10000000)

We can sort 10 million items in under a minute, so an O(n log n) algorithm on such a large input would probably be fine for something like Advent of Code, but not for Hackerrank. If your size is limited to one hundred thousand or smaller, such an algorithm will probably work on Hackerrank.

As we increase the difficulty of the algorithm, we can't include as many test sizes, since we start getting prohibitively long times earlier! Let's try list intersection, which, using the naive algorithm in Data.List, is a quadratic algorithm. We'll only run up to 100000 items:

main = do
  listsWithSizes <- zip sizes <$> mapM (randomList (1, 10000)) (take 5 sizes)
  putStrLn "Intersect"
  forM_ listsWithSizes $ \(sz, lst) -> do
    t <- S.run (timeOp (L.intersect lst) lst)
    putStr (show t)
    putStrLn $ " (" <> show sz <> ")"

Here are the results:

Intersect
0.000009277s (10)
0.000092984s (100)
0.005050921s (1000)
0.396993992s (10000)
10.751072816s (100000)

So 10000 items might be doable. However, by intersecting a list with itself, we're being generous. The problem takes longer if the two lists are disjoint:

main = do
  listsWithSizes <- zip sizes <$> mapM (randomList (1, 10000)) (take 5 sizes)
  altLists <- mapM (randomList (10001, 20000)) (take 5 sizes)
  putStrLn "Intersect"
  forM_ (zip listsWithSizes altLists) $ \((sz, lst), lst') -> do
    t <- S.run (timeOp (L.intersect lst) lst')
    putStr (show t)
    putStrLn $ " (" <> show sz <> ")"

Now it takes longer:

Intersect
0.000009074s (10)
0.000167983s (100)
0.010436625s (1000)
1.109146166s (10000)
215.779740028s (100000)

We're over a second for 10000, and 100000 is now quite prohibitive. Keep in mind though, this problem also has low constant factors. So while 10000 seems doable, a good general rule of thumb is that you should look for something faster than O(n^2) for this problem size.
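Once we have one measurement, we can also extrapolate to a size we haven't run yet. The helper below is a sketch under a simplifying assumption: that running time grows exactly like n^k, ignoring constant factors and memory effects. The name extrapolate is made up for illustration.

```haskell
-- Hypothetical helper: predict the time at size n2 from a measured
-- time t1 at size n1, assuming the cost grows like n^k.
extrapolate :: Double -> Double -> Double -> Double -> Double
extrapolate k n1 t1 n2 = t1 * (n2 / n1) ** k
```

Using the disjoint-list measurement of 1.109146166s at size 10000, extrapolate 2 10000 1.109146166 100000 predicts roughly 111 seconds at size 100000. The measured value above was about 216 seconds, so the prediction is off by about a factor of two, but it lands in the right "minutes, not seconds" ballpark, which is all we need to rule the approach out.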

For one last example, we'll do a cubic algorithm. This function produces a cubic multiplication table:

cubicTable :: [Int] -> [((Int, Int, Int), Int)]
cubicTable ns = [((x, y, z), x * y * z) | x <- ns, y <- ns, z <- ns]

We have to majorly shrink the sizes for this to be even tractable! Let's just try 10, 100 and 200.

main = do
  -- Use the smaller sizes for the labels too, not the global 'sizes' list
  let cubicSizes = [10, 100, 200]
  listsWithSizes <- zip cubicSizes <$> mapM (randomList (1, 10000)) cubicSizes
  putStrLn "Cubic Table"
  forM_ listsWithSizes $ \(sz, lst) -> do
    t <- S.run (timeOp cubicTable lst)
    putStr (show t)
    putStrLn $ " (" <> show sz <> ")"

Already at size 200, we're taking several seconds.

Cubic Table
0.000397915s (10)
0.553257425s (100)
3.549556318s (200)

So if your problem size is larger than 100, you'll really need to find something better than a cubic algorithm.

With slower algorithms, it gets even worse. You may be able to get away with an exhaustive exponential or factorial algorithm, but only at size 10 or below. These kinds of algorithms are rarely tested in programming problems though, since they are the most brute force thing you can do.

Example: Fences Problem

Here's an example of applying this ballpark methodology. I run through this example in more detail in my Testing Series (including benchmarking). The problem is a Hackerrank question called John and Fences, where we are essentially looking to calculate the largest rectangular area we can find amidst fence segments of varying heights.

The solution here is recursive. We find the minimum height plank over an interval. We then either take that height multiplied by the length of the interval, or we recursively look at the sub-intervals strictly left and strictly right of that minimum plank. We return the maximum of these three options.

At each recursive step, we need to keep finding the minimum over a particular interval. Naively, this takes O(n) time, and we may have to do this step O(n) times, giving an O(n^2) algorithm.
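The naive version described above can be sketched over a plain list of heights. The name largestRectangle and the list representation are assumptions for illustration; this is not the code from the Testing Series.

```haskell
-- Naive sketch: largest rectangle under a row of fence planks.
-- Each call scans the whole interval for its minimum, so an input
-- such as a sorted list of heights leads to O(n^2) work overall.
largestRectangle :: [Int] -> Int
largestRectangle [] = 0
largestRectangle hs =
  maximum
    [ minH * length hs        -- rectangle spanning the whole interval
    , largestRectangle left   -- best rectangle strictly left of the minimum
    , largestRectangle right  -- best rectangle strictly right of the minimum
    ]
  where
    minH = minimum hs
    -- Split around the first occurrence of the minimum plank.
    (left, rest) = break (== minH) hs
    right = drop 1 rest
```

For the heights [2, 1, 5, 6, 2, 3], the best rectangle covers the planks of height 5 and 6, for an area of 10.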

However, we can observe the constraints on the problem, and find that n could be as high as 10000.

This is our clue that O(n^2) may not be fast enough. And indeed, to get full credit, you need to find an algorithm that is O(n log n). You can do this with a Segment Tree, which lets you calculate the minimum over any interval in your fence in logarithmic time. You can read the series for a complete solution.
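To give a feel for the structure, here is a minimal range-minimum segment tree over a list. It is only a sketch: the SegTree type and the build and queryMin names are hypothetical, indices are 0-based and inclusive, and the implementation in the series may look quite different.

```haskell
-- Hypothetical range-minimum segment tree.
data SegTree
  = Leaf Int Int                      -- index, value
  | Node Int Int Int SegTree SegTree  -- low index, high index, minimum, children

treeMin :: SegTree -> Int
treeMin (Leaf _ v) = v
treeMin (Node _ _ m _ _) = m

-- Build a tree over a list; an empty list has no tree.
build :: [Int] -> Maybe SegTree
build [] = Nothing
build xs = Just (go 0 xs)
  where
    go lo [v] = Leaf lo v
    go lo vs =
      let half = length vs `div` 2
          (ls, rs) = splitAt half vs
          l = go lo ls
          r = go (lo + half) rs
       in Node lo (lo + length vs - 1) (min (treeMin l) (treeMin r)) l r

-- Minimum over the inclusive index range [a, b], if it overlaps the tree.
queryMin :: Int -> Int -> SegTree -> Maybe Int
queryMin a b (Leaf i v)
  | a <= i && i <= b = Just v
  | otherwise = Nothing
queryMin a b (Node lo hi m l r)
  | b < lo || hi < a = Nothing    -- no overlap with this node
  | a <= lo && hi <= b = Just m   -- node fully inside the query
  | otherwise =
      case (queryMin a b l, queryMin a b r) of
        (Just x, Just y) -> Just (min x y)
        (Just x, Nothing) -> Just x
        (Nothing, y) -> y
```

Each query visits a logarithmic number of nodes, because any node fully inside the query range answers from its stored minimum without descending further.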

The power of ballparking comes from realizing this before you start writing your code, so that you can skip the slow implementation altogether.

Conclusion

For more tips and content on problem solving, make sure to subscribe to our mailing list! There's a special deal coming out next week that you won't want to miss!
