A* Search Algorithm: Step-by-Step Guide + Visualizer

Mohammed Islam Hadjoudj

Expert Operations Research Engineer

1. What is the A* Algorithm?
2. The Problem with Dijkstra's Algorithm
3. The Secret Sauce: Heuristics (f = g + h)
4. Choosing the Right Heuristic (Manhattan vs Euclidean)
5. Step-by-Step Execution of A*
6. Real-World Applications in Game Dev & AI
7. Frequently Asked Questions (FAQ)

1. What is the A* Algorithm?

The A* (pronounced "A-Star") algorithm is a graph traversal and path search algorithm that is widely used in computer science due to its completeness, optimality, and optimal efficiency. Invented in 1968 by Peter Hart, Nils Nilsson, and Bertram Raphael of the Stanford Research Institute, it was originally designed to help the Shakey robot navigate through rooms.

Today, A* and its variants are among the most widely used techniques for routing and pathfinding. Whether a character in a strategy game is walking across a grid, or a GPS is calculating the fastest drive across a country, A* (or a variant of it) is likely doing the heavy lifting.

2. The Problem with Dijkstra's Algorithm

To understand why A* is brilliant, we must first understand what it improves upon. Before A*, the gold standard for finding the shortest path was Dijkstra's Algorithm.

Dijkstra's algorithm is guaranteed to find the shortest path. However, it is fundamentally "blind." When you ask Dijkstra to find a path from New York to Los Angeles, it will explore roads leading to Boston, Miami, and Chicago just as eagerly as it explores roads heading west. It expands nodes in order of their distance from the start, so the search spreads outward like a growing circle in all directions until it happens to reach the target.

In a large map, exploring every possible direction is incredibly slow and wastes massive amounts of processing power. We need an algorithm that is "smart" enough to know which direction the target is in, so it can prioritize exploring paths that head towards the goal.

Figure: Dijkstra and A* on two grids with the same start and goal. Dijkstra (left) shades about 120 cells, spreading outward in every direction, including behind the start. A* (right) shades only about 45 cells, a compact rectangle stretching from the start toward the goal. Both reach the goal along the same staircase-shaped shortest path, but Dijkstra expands every node closer than the goal, while A* concentrates on nodes that head toward it and explores far fewer cells.
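The effect is easy to measure. The sketch below counts expanded cells on an empty grid for two runs of the same search: one ordered only by distance from the start (Dijkstra's behavior) and one that also adds an estimate of the remaining distance, which is the idea the next section develops. The grid size, the coordinates, and the `countExpansions` helper are assumptions made for illustration, not part of any library.

```javascript
// Hypothetical helper: counts cells expanded on an empty W x H grid (4-way moves, cost 1).
// `estimate(x, y)` is an extra distance-to-goal guess added to each cell's priority;
// passing () => 0 makes the search behave like plain Dijkstra.
function countExpansions(W, H, start, goal, estimate) {
  const key = (x, y) => y * W + x;
  const g = new Map([[key(start.x, start.y), 0]]); // best known cost from the start
  const open = [{ x: start.x, y: start.y }];        // frontier (linear scan keeps the sketch short)
  const closed = new Set();
  const prio = (c) => g.get(key(c.x, c.y)) + estimate(c.x, c.y);
  let expanded = 0;

  while (open.length > 0) {
    // Pick the frontier cell with the lowest priority.
    let best = 0;
    for (let i = 1; i < open.length; i++) {
      if (prio(open[i]) < prio(open[best])) best = i;
    }
    const cur = open.splice(best, 1)[0];
    const k = key(cur.x, cur.y);
    if (closed.has(k)) continue; // stale duplicate entry
    if (cur.x === goal.x && cur.y === goal.y) {
      return { expanded, cost: g.get(k) };
    }
    closed.add(k);
    expanded++;

    for (const [dx, dy] of [[1, 0], [-1, 0], [0, 1], [0, -1]]) {
      const nx = cur.x + dx, ny = cur.y + dy;
      if (nx < 0 || ny < 0 || nx >= W || ny >= H) continue;
      const nk = key(nx, ny);
      if (closed.has(nk)) continue;
      const tentative = g.get(k) + 1;
      if (!g.has(nk) || tentative < g.get(nk)) {
        g.set(nk, tentative);
        open.push({ x: nx, y: ny });
      }
    }
  }
  return { expanded, cost: Infinity }; // goal unreachable
}

const start = { x: 5, y: 5 };
const goal = { x: 15, y: 5 };
const dijkstra = countExpansions(21, 11, start, goal, () => 0);
const guided = countExpansions(21, 11, start, goal,
  (x, y) => Math.abs(x - goal.x) + Math.abs(y - goal.y));

console.log("Dijkstra expanded:", dijkstra.expanded, "cost:", dijkstra.cost);
console.log("Guided expanded:", guided.expanded, "cost:", guided.cost);
```

Both runs report the same path cost of 10, but the guided run expands only the 10 cells on the straight line between start and goal, while the unguided run expands many more.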

3. The Secret Sauce: Heuristics (f = g + h)

A* solves the "blindness" problem by introducing a Heuristic. A heuristic is an educated guess. In pathfinding, it's a function that estimates how far a node is from the final destination.

A* assigns a score to every node it discovers. The node with the lowest score is the one it explores next. The core equation of A* is:

f(n) = g(n) + h(n)

n is the current node being evaluated on the graph.
g(n) is the Exact Cost. It is the cost of the best path found so far from the starting node to node n. (This is exactly what Dijkstra uses.)
h(n) is the Heuristic Estimate. It is the estimated cost of getting from node n to the final goal.
f(n) is the Estimated Total Cost of a path through n. This is the sum of g and h. A* always prioritizes the node with the lowest f score.

By adding the h(n) component, A* is pulled towards the goal like a magnet. Paths that lead away from the goal receive higher f scores, so they are explored last or not at all, which often drastically reduces the search space compared to Dijkstra's algorithm.

Figure: A grid with a green Start cell, a red Goal cell, and a blue node n between them. A solid green line marks the path already travelled from Start to n (g = 4), and a dashed line from n to the Goal marks the heuristic estimate (h = 6), giving f(n) = 4 + 6 = 10. g is the real distance already travelled from the Start; h is the estimated distance still to go. A* sorts its frontier by their sum, f = g + h.
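As a small worked example of the scoring rule, the sketch below picks the next node to expand from a made-up frontier. The node names and their g and h values are assumptions chosen for illustration; node A reuses the g = 4, h = 6 values from the figure.

```javascript
// Hypothetical frontier: g values are assumed to be known already,
// h values are assumed to come from some distance estimate.
const frontier = [
  { name: "A", g: 4, h: 6 },
  { name: "B", g: 3, h: 9 },
  { name: "C", g: 6, h: 5 },
];

// f(n) = g(n) + h(n)
const f = (node) => node.g + node.h;

// A* expands the frontier node with the lowest f next.
const next = frontier.reduce((best, node) => (f(node) < f(best) ? node : best));

console.log(next.name, f(next)); // A 10
```

Node B is the closest to the start (lowest g) and node C looks closest to the goal (lowest h), yet A wins because its sum is the smallest: 10 versus 12 and 11.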

4. Choosing the Right Heuristic

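The guarantees of A* depend on the heuristic function h(n). If your heuristic overestimates the distance to the goal, A* is no longer guaranteed to find the shortest path. If it underestimates, the path stays optimal, but the further the estimate falls below the true distance, the more nodes A* expands; with h(n) = 0 it behaves exactly like Dijkstra's algorithm. The ideal heuristic is as large as possible without ever overestimating, which is why choosing the right one for your specific game or map is critical.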
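One way to sanity-check a heuristic on a small map is to compare it with the true remaining cost for every cell. The sketch below does that with a breadth-first search from the goal; the 5x5 map and the `trueCosts` and `isAdmissible` helpers are hypothetical, and the check assumes 4-way movement with a cost of 1 per step.

```javascript
// Hypothetical 5x5 map: '#' is a wall, '.' is walkable; 4-way moves cost 1.
const map = [
  ".....",
  ".###.",
  "...#.",
  ".#.#.",
  ".#...",
];
const goal = { x: 4, y: 4 };

// True cost-to-goal for every reachable cell, via breadth-first search from the goal.
function trueCosts(map, goal) {
  const dist = new Map([[`${goal.x},${goal.y}`, 0]]);
  const queue = [goal];
  while (queue.length > 0) {
    const { x, y } = queue.shift();
    for (const [dx, dy] of [[1, 0], [-1, 0], [0, 1], [0, -1]]) {
      const nx = x + dx, ny = y + dy;
      // Skip cells that are off the map, walls, or already visited.
      if (map[ny]?.[nx] !== "." || dist.has(`${nx},${ny}`)) continue;
      dist.set(`${nx},${ny}`, dist.get(`${x},${y}`) + 1);
      queue.push({ x: nx, y: ny });
    }
  }
  return dist;
}

// On this map, a heuristic is admissible if it never exceeds the true cost.
function isAdmissible(h, map, goal) {
  for (const [cell, cost] of trueCosts(map, goal)) {
    const [x, y] = cell.split(",").map(Number);
    if (h({ x, y }, goal) > cost) return false;
  }
  return true;
}

const manhattan = (n, g) => Math.abs(n.x - g.x) + Math.abs(n.y - g.y);
const doubled = (n, g) => 2 * manhattan(n, g); // deliberately overestimates

console.log(isAdmissible(manhattan, map, goal)); // true
console.log(isAdmissible(doubled, map, goal));   // false
```

The doubled estimate fails immediately: a cell right next to the goal has a true cost of 1 but an estimate of 2.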

Manhattan Distance (For Grid Worlds without Diagonals)

If your game uses a grid (like Pac-Man or a traditional roguelike) and characters can only move Up, Down, Left, or Right, you should use the Manhattan Distance. It calculates the total number of blocks horizontally and vertically between the current node and the target.

// Manhattan distance: horizontal steps plus vertical steps.
function heuristic(node, goal) {
    return Math.abs(node.x - goal.x) + Math.abs(node.y - goal.y);
}

Euclidean Distance (For Open Worlds or Any-Angle Movement)

If characters can move in any direction (or if you are routing on a continuous map), you should use Euclidean Distance. This is the straight-line "as the crow flies" distance between two points, calculated using the Pythagorean theorem.

// Euclidean distance: Math.hypot(dx, dy) computes sqrt(dx*dx + dy*dy).
function heuristic(node, goal) {
    return Math.hypot(node.x - goal.x, node.y - goal.y);
}

Chebyshev Distance (For Grid Worlds with Diagonals)

If your game uses a grid but allows diagonal movement (where moving diagonally costs the same as moving straight), use the Chebyshev distance. It takes the larger of the horizontal and vertical differences.

// Chebyshev distance: the larger of the horizontal and vertical gaps.
function heuristic(node, goal) {
    return Math.max(Math.abs(node.x - goal.x), Math.abs(node.y - goal.y));
}

Figure: Three small grids comparing heuristics for a node that is 4 columns and 3 rows from the goal. Manhattan follows a 4-direction staircase path: 4 + 3 = 7. Euclidean follows a straight line: square root of 25 = 5. Chebyshev allows diagonal moves: max(4, 3) = 4. The same node-to-goal gap gives a different estimate depending on how movement is allowed: Manhattan (7) for 4-way grids, Euclidean (5) for any-angle movement, Chebyshev (4) for 8-way grids.
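The numbers in the figure can be reproduced directly. In this short sketch the coordinates are assumptions: the node is placed at the origin and the goal 4 columns and 3 rows away.

```javascript
// Assumed positions: node at the origin, goal 4 columns and 3 rows away.
const node = { x: 0, y: 0 };
const goal = { x: 4, y: 3 };

const dx = Math.abs(node.x - goal.x);
const dy = Math.abs(node.y - goal.y);

const manhattan = dx + dy;            // 4-way grids
const euclidean = Math.hypot(dx, dy); // any-angle movement
const chebyshev = Math.max(dx, dy);   // 8-way grids with equal diagonal cost

console.log({ manhattan, euclidean, chebyshev });
// manhattan is 7, euclidean is 5, chebyshev is 4
```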

Visualize the Difference

Watch Dijkstra and A* race to solve the same maze. Notice how Dijkstra floods the entire map, while A* creates a laser-focused path directly toward the exit.

Launch A* Visualizer

5. Step-by-Step Execution of A*

Here is exactly how the algorithm maintains its state and processes nodes during execution:

1. Initialization: Create two sets: an OPEN set (nodes discovered but not yet evaluated) and a CLOSED set (nodes already evaluated). Give the starting node a g cost of 0, calculate its h and f costs, and add it to the OPEN set.
2. Loop Start: Find the node in the OPEN set with the lowest f cost. Call this the Current node.
3. Check Goal: If the Current node is the goal, you are done. Follow the parent pointers backward from the goal to the start to reconstruct the path.
4. Move to Closed: Remove the Current node from the OPEN set and add it to the CLOSED set.
5. Expand Neighbors: Look at every neighbor adjacent to the Current node. For each neighbor:
- If the neighbor is an obstacle (wall) or is in the CLOSED set, ignore it. (Skipping CLOSED nodes is safe when the heuristic is consistent, which is the case for the distance heuristics above when they match the movement rules.)
- Calculate a tentative g cost for the neighbor (Current g + distance to neighbor).
- If the neighbor is not in the OPEN set, record its g cost, calculate its h and f costs, set its parent to Current, and add it to the OPEN set.
- If the neighbor is already in the OPEN set, check whether this new path to it is better (has a lower g cost). If it is, update its parent, g cost, and f cost.
6. Repeat: Loop back to Step 2. If the OPEN set becomes empty and the goal hasn't been reached, there is no valid path.
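The loop above can be sketched in a few dozen lines. This is a minimal illustration rather than a production implementation, and several choices are assumptions not taken from the article: the grid is an array of strings with '#' as walls, movement is 4-way with a cost of 1 per step, the heuristic is Manhattan distance, and the OPEN set is a plain array scanned linearly instead of a priority queue.

```javascript
// Minimal A* sketch on a grid of strings: '#' is a wall, anything else is walkable.
function aStar(grid, start, goal) {
  const key = (p) => `${p.x},${p.y}`;
  const h = (p) => Math.abs(p.x - goal.x) + Math.abs(p.y - goal.y);

  const open = [start];                 // OPEN set
  const closed = new Set();             // CLOSED set
  const g = new Map([[key(start), 0]]); // best known cost from the start
  const parent = new Map();             // parent pointers for path reconstruction

  while (open.length > 0) {
    // Pick the OPEN node with the lowest f = g + h.
    let bestIdx = 0;
    for (let i = 1; i < open.length; i++) {
      const fi = g.get(key(open[i])) + h(open[i]);
      const fb = g.get(key(open[bestIdx])) + h(open[bestIdx]);
      if (fi < fb) bestIdx = i;
    }
    const current = open[bestIdx];

    // Goal check: walk the parent pointers back to the start.
    if (current.x === goal.x && current.y === goal.y) {
      const path = [current];
      while (parent.has(key(path[0]))) path.unshift(parent.get(key(path[0])));
      return path;
    }

    // Move Current from OPEN to CLOSED.
    open.splice(bestIdx, 1);
    closed.add(key(current));

    // Expand neighbors.
    for (const [dx, dy] of [[1, 0], [-1, 0], [0, 1], [0, -1]]) {
      const next = { x: current.x + dx, y: current.y + dy };
      const cell = grid[next.y]?.[next.x]; // undefined when off the grid
      if (cell === undefined || cell === "#" || closed.has(key(next))) continue;

      const tentativeG = g.get(key(current)) + 1;
      const known = g.get(key(next));
      if (known === undefined) {
        open.push(next);                // newly discovered node
      } else if (tentativeG >= known) {
        continue;                       // existing path is at least as good
      }
      g.set(key(next), tentativeG);     // f is derived from g and h when selecting
      parent.set(key(next), current);
    }
  }
  return null; // OPEN set exhausted: no valid path
}

// Hypothetical test map; 'S' and 'G' only mark the start and goal for the reader.
const grid = [
  "S..#.",
  ".#.#.",
  ".#...",
  ".###.",
  "....G",
];
const path = aStar(grid, { x: 0, y: 0 }, { x: 4, y: 4 });
console.log(path.length - 1); // number of steps on the path found
console.log(path.map((p) => `(${p.x},${p.y})`).join(" "));
```

For larger maps, replacing the linear scan with a binary heap keyed on f is the usual first optimization, since finding the lowest f otherwise costs time proportional to the size of the OPEN set on every iteration.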

6. Real-World Applications in Game Dev & AI

A* is the backbone of movement in digital spaces.

Real-Time Strategy (RTS) Games: When you drag-select 50 units in a game like StarCraft or Age of Empires and click a point on the map, the game has to find 50 individual paths that route around buildings, rivers, and other units. Because running plain A* for every unit is expensive, engines often use variants such as Hierarchical A*, or alternatives such as flow fields.
RPG and Stealth Games: NPCs use A* on a "NavMesh" (Navigation Mesh: a simplified graph of walkable surfaces) to chase the player, patrol corridors, or find cover.
Robotics: Automated warehouse robots (like those used by Amazon) can use A*-style planners to navigate warehouse floors without colliding with shelving units or other robots.
GPS Navigation: While standard A* is too slow for continental maps, modified versions such as ALT (A*, Landmarks, and Triangle inequality) are used to quickly calculate optimal driving routes.
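To give a feel for how a landmark-based heuristic such as ALT works, here is a minimal sketch. It assumes an undirected graph and a hypothetical `distFromLandmark` table of precomputed shortest-path distances; the numbers come from a made-up line graph (L1 -2- a -5- b -2- goal -1- L2) and the function name is illustrative only.

```javascript
// Hypothetical precomputed tables: distFromLandmark[L][v] is the shortest-path
// distance between landmark L and node v on an undirected graph.
// Values below come from the line graph L1 -2- a -5- b -2- goal -1- L2.
const distFromLandmark = {
  L1: { a: 2, b: 7, goal: 9 },
  L2: { a: 8, b: 3, goal: 1 },
};

// Triangle inequality: d(v, goal) >= |d(L, goal) - d(L, v)| for every landmark L,
// so the largest such bound never overestimates the remaining distance.
function landmarkHeuristic(v, goal) {
  let best = 0;
  for (const table of Object.values(distFromLandmark)) {
    best = Math.max(best, Math.abs(table[goal] - table[v]));
  }
  return best;
}

console.log(landmarkHeuristic("a", "goal")); // 7
console.log(landmarkHeuristic("b", "goal")); // 2
```

On this toy graph the bounds happen to equal the true distances (7 and 2); on a real road network they are lower bounds whose quality depends on where the landmarks are placed.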

7. Frequently Asked Questions (FAQ)

How does the A* algorithm differ from Dijkstra's?

A* uses a heuristic function (an estimate of the distance to the goal) to guide its search, whereas Dijkstra's algorithm is blind and explores in all directions equally. With an informative heuristic, A* finds the same shortest path while expanding far fewer nodes.

What is an admissible heuristic in A*?

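An admissible heuristic is one that never overestimates the actual cost to reach the goal. With an admissible heuristic, A* is guaranteed to find a shortest path. If the implementation never revisits nodes in the CLOSED set, the heuristic should also be consistent, meaning its estimate never drops by more than the cost of the step just taken.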

When should I choose Manhattan vs. Euclidean distance?

Use Manhattan distance for grids where movement is restricted to 4 directions (up, down, left, right). Use Euclidean distance for continuous environments that allow movement at any angle.
