Autocomplete System Using Trie in Python


Autocomplete System Using Trie in Python

An autocomplete system is a feature that suggests possible completions for a given prefix based on previously entered or stored words. A Trie (prefix tree) is a suitable data structure for implementing an efficient autocomplete system because it allows for quick lookups, insertions, and deletions, making it ideal for prefix-based queries.

Program Structure

The autocomplete system implementation using a Trie consists of the following components:

1. TrieNode Class

This class represents a single node in the Trie. It includes the following attributes:

  • children: A dictionary mapping each character to its corresponding child TrieNode.
  • is_end_of_word: A boolean indicating whether the node marks the end of a valid word.

2. Trie Class

This class encapsulates the functionality of the Trie. It includes the following methods:

  • __init__(self): Initializes the Trie with an empty root node.
  • insert(self, word): Inserts a word into the Trie.
  • search(self, word): Searches for a word in the Trie.
  • starts_with(self, prefix): Checks if there is any word in the Trie that starts with a given prefix.
  • autocomplete(self, prefix): Returns a list of all words in the Trie that start with a given prefix.

Python Code Implementation


class TrieNode:
def __init__(self):
"""
Initialize a TrieNode.
"""
self.children = {} # Maps each character to the corresponding TrieNode
self.is_end_of_word = False # True if the node represents the end of a word

class Trie:
def __init__(self):
"""
Initialize the Trie.
"""
self.root = TrieNode()

def insert(self, word):
"""
Insert a word into the Trie.
:param word: The word to insert.
"""
node = self.root
for char in word:
if char not in node.children:
node.children[char] = TrieNode()
node = node.children[char]
node.is_end_of_word = True

def search(self, word):
"""
Search for a word in the Trie.
:param word: The word to search for.
:return: True if the word is found, False otherwise.
"""
node = self.root
for char in word:
if char not in node.children:
return False
node = node.children[char]
return node.is_end_of_word

def starts_with(self, prefix):
"""
Check if there is any word in the Trie that starts with the given prefix.
:param prefix: The prefix to check.
:return: True if there is any word with the given prefix, False otherwise.
"""
node = self.root
for char in prefix:
if char not in node.children:
return False
node = node.children[char]
return True

def autocomplete(self, prefix):
"""
Get all words in the Trie that start with the given prefix.
:param prefix: The prefix for autocomplete suggestions.
:return: List of words that start with the given prefix.
"""
def dfs(node, path, results):
"""
Depth-first search to find all words with the given prefix.
:param node: The current TrieNode.
:param path: The current path representing the word.
:param results: The list of found words.
"""
if node.is_end_of_word:
results.append("".join(path))
for char, next_node in node.children.items():
dfs(next_node, path + [char], results)

results = []
node = self.root
for char in prefix:
if char not in node.children:
return [] # No words with the given prefix
node = node.children[char]
dfs(node, list(prefix), results)
return results

Explanation

The autocomplete system is built using a Trie data structure, which allows efficient insertion, searching, and prefix-based queries. The key components of this implementation include the TrieNode and Trie classes.

1. TrieNode Class

The TrieNode class represents a single node in the Trie. Each node contains:

  • children: A dictionary mapping characters to their corresponding child TrieNodes.
  • is_end_of_word: A boolean flag indicating whether the node represents the end of a word.

2. Trie Class

The Trie class manages the Trie data structure and provides methods for inserting words, searching for words, checking for prefixes, and generating autocomplete suggestions.

Insertion

The insert method adds a word to the Trie by iterating over each character of the word and creating a new TrieNode if it doesn’t already exist in the Trie.

Search

The search method checks if a word exists in the Trie by traversing the Trie according to the word’s characters. It returns True if the word is found and ends at a valid word node, and False otherwise.

Prefix Check

The starts_with method checks if any word in the Trie starts with a given prefix. It returns True if the prefix is found, and False otherwise.

Autocomplete

The autocomplete method generates a list of words in the Trie that start with a given prefix. It uses a depth-first search (DFS) to explore all possible words that start with the prefix.

Usage Example


# Example usage of the Trie for an autocomplete system

# Initialize the Trie and insert words
trie = Trie()
words = ["apple", "app", "apricot", "banana", "bat", "batch", "batman"]
for word in words:
trie.insert(word)

# Autocomplete for the prefix "ap"
suggestions = trie.autocomplete("ap")
print("Autocomplete suggestions for 'ap':", suggestions)
# Output: Autocomplete suggestions for 'ap': ['apple', 'app', 'apricot']

# Autocomplete for the prefix "bat"
suggestions = trie.autocomplete("bat")
print("Autocomplete suggestions for 'bat':", suggestions)
# Output: Autocomplete suggestions for 'bat': ['bat', 'batch', 'batman']

This example demonstrates how to create a Trie, insert words into it, and use the autocomplete functionality to generate suggestions based on a given prefix.


By Aditya Bhuyan

I work as a cloud specialist. In addition to being an architect and SRE specialist, I work as a cloud engineer and developer. I have assisted my clients in converting their antiquated programmes into contemporary microservices that operate on various cloud computing platforms such as AWS, GCP, Azure, or VMware Tanzu, as well as orchestration systems such as Docker Swarm or Kubernetes. For over twenty years, I have been employed in the IT sector as a Java developer, J2EE architect, scrum master, and instructor. I write about Cloud Native and Cloud often. Bangalore, India is where my family and I call home. I maintain my physical and mental fitness by doing a lot of yoga and meditation.

Leave a Reply

Your email address will not be published. Required fields are marked *

error

Enjoy this blog? Please spread the word :)