The use of nested Hashtables

Sometimes a solution to a problem requires the use of nested hash-tables, that is, a hash table whose values are other hash tables. This problem exemplifies this technique. It comes from Leetcode, here it is: https://leetcode.com/problems/sentence-similarity/description/:

Given two sentences words1, words2 (each represented as an array of strings), and a list of similar word pairs pairs, determine if two sentences are similar.
For example, "great acting skills" and "fine drama talent" are similar, if the similar word pairs are pairs = [["great", "fine"], ["acting","drama"], ["skills","talent"]].
Note that the similarity relation is not transitive. For example, if "great" and "fine" are similar, and "fine" and "good" are similar, "great" and "good" are not necessarily similar.
However, similarity is symmetric. For example, "great" and "fine" being similar is the same as "fine" and "great" being similar.
Also, a word is always similar with itself. For example, the sentences words1 = ["great"], words2 = ["great"], pairs = [] are similar, even though there are no specified similar word pairs.
Finally, sentences can only be similar if they have the same number of words. So a sentence like words1 = ["great"] can never be similar to words2 = ["doubleplus","good"].

Here is the idea: pre-process pairs in such a way to add the words to a hash table. Each element of the hash table will be another hash table. For example, if pairs contains (w1, w2) and (w1, w3), we'll have a hash table that contains w1 as the key and the value will be another hash table with keys w2 and w3. That way the access to it will be very fast and it will cost only one pass thru words1 to solve the problem. Code is down below, cheers, Marcelo.



public class Solution
{
public bool AreSentencesSimilar(string[] words1, string[] words2, string[,] pairs)
{
if (words1.Length != words2.Length) return false;

//Process the pairs into an outer and inner hash table
Hashtable outerMap = new Hashtable();
for (int i = 0; i < pairs.Length / 2; i++)
{
for (int index = 0; index <= 1; index++)
{
int currentIndex = index;
int otherIndex = (index + 1) % 2;
if (!outerMap.ContainsKey(pairs[i, currentIndex]))
{
Hashtable innerMap = new Hashtable();
if (!innerMap.ContainsKey(pairs[i, otherIndex]))
innerMap.Add(pairs[i, otherIndex], true);
outerMap.Add(pairs[i, currentIndex], innerMap);
}
else
{
Hashtable innerMap = (Hashtable)outerMap[pairs[i, currentIndex]];
if (!innerMap.ContainsKey(pairs[i, otherIndex]))
innerMap.Add(pairs[i, otherIndex], true);
outerMap[pairs[i, currentIndex]] = innerMap;
}
}
}

//Process the strings
for (int i = 0; i < words1.Length; i++)
{
if (words1[i] == words2[i]) continue;
if (!outerMap.ContainsKey(words1[i])) return false;
Hashtable innerMap = (Hashtable)outerMap[words1[i]];
if (!innerMap.ContainsKey(words2[i])) return false;
}
return true;
}
}

Comments

  1. Hash tables are probably one of the most frequently used data structures and mostly for a good reason :) This problem is a good example of their usefulness. Technically we can use sets as values of the top level hash table, but in practice it's usually best to use a set implementation based on a hash table, so hash tables are everywhere :) Conveniently C++ provides a very concise way to use them:

    class Solution {
    public:
    bool areSentencesSimilar(const vector& words1, const vector& words2, const vector> pairs) {
    if (words1.size() != words2.size()) return false;
    unordered_map> similar;
    for (const auto& p : pairs) similar[p.first].insert(p.second);
    auto are_similar = [&](const string& word1, const string& word2) {
    return word1 == word2 || similar[word1].count(word2) != 0 || similar[word2].count(word1) != 0;
    };
    for (int i = 0; i < words1.size(); ++i) if (!are_similar(words1[i], words2[i])) return false;
    return true;
    }
    };

    Cheers,
    Taras

    Reply Delete

Post a Comment

[フレーム]

Popular posts from this blog

Quasi FSM (Finite State Machine) problem + Vibe

Not really an FSM problem since the state isn't changing, it is just defined by the current input. Simply following the instructions should do it. Using VSCode IDE you can also engage the help of Cline or Copilot for a combo of coding and vibe coding, see below screenshot. Cheers, ACC. Process String with Special Operations I - LeetCode You are given a string  s  consisting of lowercase English letters and the special characters:  * ,  # , and  % . Build a new string  result  by processing  s  according to the following rules from left to right: If the letter is a  lowercase  English letter append it to  result . A  '*'   removes  the last character from  result , if it exists. A  '#'   duplicates  the current  result  and  appends  it to itself. A  '%'   reverses  the current  result . Return the final string  result  after processing all char...

Shortest Bridge – A BFS Story (with a Twist)

Here's another one from the Google 30 Days challenge on LeetCode — 934. Shortest Bridge . The goal? Given a 2D binary grid where two islands (groups of 1s) are separated by water (0s), flip the fewest number of 0s to 1s to connect them. Easy to describe. Sneaky to implement well. 🧭 My Approach My solution follows a two-phase Breadth-First Search (BFS) strategy: Find and mark one island : I start by scanning the grid until I find the first 1 , then use BFS to mark all connected land cells as 2 . I store their positions for later use. Bridge-building BFS : For each cell in the marked island, I run a BFS looking for the second island. Each BFS stops as soon as it hits a cell with value 1 . The minimum distance across all these searches gives the shortest bridge. πŸ” Code Snippet Here's the core logic simplified: public int ShortestBridge(int[][] grid) { // 1. Mark one island as '2' and gather its coordinates List<int> island = FindAndMark...

Classic Dynamic Programming IX

A bit of vibe code together with OpenAI O3. I asked O3 to just generate the sieve due to laziness. Sieve is used to calculate the first M primes (when I was using Miller-Rabin, was giving me TLE). The DP follows from that in a straightforward way: calculate the numbers from i..n-1, then n follows by calculating the min over all M primes. Notice that I made use of Goldbach's Conjecture as a way to optimize the code too. Goldbach's Conjecture estates that any even number greater than 2 is the sum of 2 primes. The conjecture is applied in the highlighted line. Cheers, ACC. PS: the prompt for the sieve was the following, again using Open AI O3 Advanced Reasoning: " give me a sieve to find the first M prime numbers in C#. The code should produce a List<int> with the first M primes " Minimum Number of Primes to Sum to Target - LeetCode You are given two integers  n  and  m . You have to select a multiset of  prime numbers  from the  first   m  pri...