LC 139 - Word Break

Posted Sep 30, 2024

By Xav



Question

Given a string s and a dictionary of strings wordDict, return true if s can be segmented into a space-separated sequence of one or more dictionary words.

Note that the same word in the dictionary may be reused multiple times in the segmentation.

Example 1:

Input: s = "leetcode", wordDict = ["leet","code"]
Output: true

Explanation: Return true because “leetcode” can be segmented as “leet code”.

Example 2:

Input: s = "applepenapple", wordDict = ["apple","pen"]
Output: true

Explanation: Return true because “applepenapple” can be segmented as “apple pen apple”. Note that you are allowed to reuse a dictionary word.

Example 3:

Input: s = "catsandog", wordDict = ["cats","dog","sand","and","cat"]
Output: false

Constraints:

1 <= s.length <= 300
1 <= wordDict.length <= 1000
1 <= wordDict[i].length <= 20
s and wordDict[i] consist of only lowercase English letters.
All the strings of wordDict are unique.

Links

Question here and solution here

Solution

concept

top down (memoization)

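We can use DFS to check whether the remainder of the string can be segmented. The key part is the loop for j in range(i, len(s)) inside each DFS call: it tries every possible end position j for a word starting at index i, so every candidate prefix s[i:j+1] of the remaining substring is examined. Whenever such a prefix is in the dictionary, we recurse on j+1. Caching the result for each i means each start index is solved only once.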
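As a compact illustration of the idea above, here is a minimal sketch that uses functools.lru_cache as the memo instead of a hand-written dictionary. The function name word_break_top_down and the parameter name word_dict are made up for this example; the logic is the same DFS over start indices.

```python
from functools import lru_cache
from typing import List


def word_break_top_down(s: str, word_dict: List[str]) -> bool:
    """Top-down sketch: dfs(i) answers 'can s[i:] be segmented?'."""
    word_set = set(word_dict)

    @lru_cache(maxsize=None)  # memoize dfs(i) so each start index is solved once
    def dfs(i: int) -> bool:
        if i == len(s):  # consumed the whole string
            return True
        # try every end position j for a word that starts at i
        for j in range(i, len(s)):
            if s[i:j + 1] in word_set and dfs(j + 1):
                return True
        return False

    return dfs(0)


print(word_break_top_down("leetcode", ["leet", "code"]))  # True
print(word_break_top_down("catsandog", ["cats", "dog", "sand", "and", "cat"]))  # False
```

With s.length capped at 300 by the constraints, the recursion depth stays well inside Python's default limit.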

bottom up

We solve from the end of the string backwards. The subproblem is: can the remainder of the string starting at index i be broken into words from wordDict?
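To make the subproblem concrete, the sketch below returns the whole table rather than just the final answer, so you can see which suffixes are breakable. The helper name suffix_table is hypothetical and exists only for this illustration.

```python
from typing import List


def suffix_table(s: str, word_dict: List[str]) -> List[bool]:
    """dp[i] is True when s[i:] can be segmented into dictionary words."""
    dp = [False] * (len(s) + 1)
    dp[len(s)] = True  # base case: the empty suffix is trivially breakable

    for i in range(len(s) - 1, -1, -1):
        for word in word_dict:
            # startswith(word, i) is False when the word would run past the end
            if s.startswith(word, i) and dp[i + len(word)]:
                dp[i] = True
                break
    return dp


print(suffix_table("leetcode", ["leet", "code"]))
# [True, False, False, False, True, False, False, False, True]
```

Index 4 is True because "code" matches there and reaches the end; index 0 is True because "leet" matches and lands on index 4.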

code

  
from typing import List


class Solution:
    """
    brute force
    TLE
    """
    def wordBreak(self, s: str, wordDict: List[str]) -> bool:
        word_set = set(wordDict)

        def dfs(i):
            if i == len(s):
                return True
            
            for j in range(i, len(s)):
                if s[i:j+1] in word_set:
                    if dfs(j+1):
                        return True
            return False

        return dfs(0)
        
class Solution:
    """
    top down (memoization)
    """
    def wordBreak(self, s: str, wordDict: List[str]) -> bool:
        word_set = set(wordDict)
        cache = {len(s): True}  # cache[i]: whether s[i:] can be segmented

        def dfs(i):
            if i in cache:  # also covers the base case i == len(s)
                return cache[i]
            
            for j in range(i, len(s)):
                if s[i:j+1] in word_set:
                    if dfs(j+1):
                        cache[i] = True
                        return cache[i] # True
            cache[i] = False
            return cache[i] # False

        return dfs(0)
        
class Solution:
    """
    Brute force
    TLE
    Similar to the above, but tries each dictionary word at index i
    instead of iterating through every end index.
    This is slightly more efficient.
    """
    def wordBreak(self, s: str, wordDict: List[str]) -> bool:

        def dfs(i):
            if i == len(s):
                return True

            for w in wordDict:
                if ((i + len(w)) <= len(s) and
                     s[i : i + len(w)] == w
                ):
                    if dfs(i + len(w)):
                        return True
            return False

        return dfs(0)
        
class Solution:
    """
    top down (memoization) of the above solution
    """
    def wordBreak(self, s: str, wordDict: List[str]) -> bool:
        memo = {len(s) : True}
        def dfs(i):
            if i in memo:
                return memo[i]

            for w in wordDict:
                if ((i + len(w)) <= len(s) and
                     s[i : i + len(w)] == w
                ):
                    if dfs(i + len(w)):
                        memo[i] = True
                        return True
            memo[i] = False
            return False

        return dfs(0)

from collections import defaultdict


class Solution:
    """
    brute force, grouping dictionary words by length
    """
    def wordBreak(self, s: str, wordDict: List[str]) -> bool:
        len_dict = defaultdict(set)
        for w in wordDict:
            len_dict[len(w)].add(w) # len -> set of words
        
        def dfs(i):
            if i == len(s):
                return True
            if i > len(s):
                return False

            for j in range(i, len(s)):
                sub = s[i:j+1]  # slice once and reuse
                if len(sub) in len_dict and sub in len_dict[len(sub)]:
                    if dfs(j+1):
                        return True
            return False
        
        return dfs(0)

class Solution:
    """
    top down memoization from above solution
    """
    def wordBreak(self, s: str, wordDict: List[str]) -> bool:
        len_dict = defaultdict(set)
        for w in wordDict:
            len_dict[len(w)].add(w)
        cache = {}
        
        def dfs(i):
            if i in cache:
                return cache[i]
            if i == len(s):
                return True
            if i > len(s):
                return False

            for j in range(i, len(s)):
                sub = s[i:j+1]  # slice once and reuse
                if len(sub) in len_dict and sub in len_dict[len(sub)]:
                    if dfs(j+1):
                        cache[i] = True
                        return True
            cache[i] = False
            return False
        
        return dfs(0)

class Solution:
    """
    bottom up solution
    """
    def wordBreak(self, s: str, wordDict: List[str]) -> bool:
        # dp[i]: can s[i:] (from index i to the end) be segmented using wordDict?
        dp = [False] * (len(s) + 1)  # one extra slot for the base case, i.e. reaching the end
        dp[-1] = True

        for i in range(len(s)-1, -1, -1):
            for word in wordDict:
                if i + len(word) <= len(s) and s[i:i+len(word)] == word:
                    dp[i] = dp[i + len(word)]  # may be False; a later word can still set it to True
                if dp[i]:  # slight optimisation: stop at the first word that works
                    break

        return dp[0]

Complexity

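time: $O(nmt)$ for the memoized and bottom-up solutions that iterate over wordDict, where $n$ is the length of string s, $m$ is the number of words in wordDict, and $t$ is the maximum length of a word in wordDict
space: $O(n)$ for the memo or dp table (plus the recursion stack in the top-down versions)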
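To see why the brute-force versions hit TLE while the memoized ones stay within the bound above, here is a rough sketch that counts DFS calls with and without a memo. The helper count_calls and the test input are made up for illustration; the input is chosen so that no segmentation exists and the search must exhaust every branch.

```python
from typing import Dict, List


def count_calls(s: str, word_dict: List[str], use_memo: bool) -> int:
    """Count dfs invocations (hypothetical helper, for illustration only)."""
    memo: Dict[int, bool] = {}
    calls = 0

    def dfs(i: int) -> bool:
        nonlocal calls
        calls += 1
        if i == len(s):
            return True
        if use_memo and i in memo:
            return memo[i]
        result = False
        for w in word_dict:
            if s.startswith(w, i) and dfs(i + len(w)):
                result = True
                break
        if use_memo:
            memo[i] = result
        return result

    dfs(0)
    return calls


s = "a" * 20 + "b"  # the trailing "b" makes every attempt fail
words = ["a", "aa"]
print(count_calls(s, words, use_memo=False))  # grows roughly like Fibonacci in len(s)
print(count_calls(s, words, use_memo=True))   # at most 1 + len(s) * len(words)
```

Without the memo, the same suffix is re-explored through many different paths; with it, each index is expanded once and every later visit is a dictionary lookup.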


This post is licensed under CC BY 4.0 by the author.