Pac-Man, now with ghosts.
Minimax, Expectimax.
For those of you not familiar with Pac-Man, it's a game in which Pac-Man (the yellow circle with a mouth in the above figure) moves around a maze and tries to eat as many food pellets (the small white dots) as possible while avoiding the ghosts (the other two agents with eyes in the above figure). If Pac-Man eats all the food in a maze, it wins. The big white dots at the top-left and bottom-right corners are capsules, which give Pac-Man the power to eat ghosts for a limited time, but you won't be worrying about them in the required part of the assignment. You can get familiar with the setting by playing a few games of classic Pac-Man, which you will do just after this introduction.
In this assignment, you will design agents for the classic version of Pac-Man, including ghosts. Along the way, you will implement both minimax and expectimax search.
The base code for this assignment contains many files, which are listed towards the end of this page; however, you do not need to go through these files to complete the assignment. They are present only to guide the more adventurous among you to the heart of Pac-Man. As in previous assignments, you will only be modifying submission.py.
Ensure that you're using Python version 3.12. If you have a different version, you might experience GUI-related issues. Check your Python version by running:
python --version
To install Miniconda: on Windows, run the downloaded .exe file to start the installation; on Linux, run chmod +x Miniconda3-latest-Linux-x86_64.sh followed by ./Miniconda3-latest-Linux-x86_64.sh; on macOS, run the downloaded .pkg file to start the installation. After installing Miniconda, set up your environment with the following commands:
conda create --name cs221 python=3.12
conda activate cs221
This homework does not require any additional packages, so feel free to reuse the cs221 environment you installed earlier for hw1 and hw2.
We've created a LaTeX template here for you to use that contains the prompts for each question.
The included grader.py is useful for verifying whether your solution crashes due to bugs or for checking Pac-Man's behavior, but it will not give reliable information on whether your submission will time out on any of the tests.
We included a number of 0-point basic tests that will replicate the behavior of the hidden tests,
but only give feedback on whether or not your solution times out. To properly ensure that your implementation will not
time out, please make sure to do test submissions on Gradescope and observe the results on these 0-point tests.
First, play a game of classic Pac-Man to get a feel for the assignment:
python pacman.py
You can always add --frameTime 1 to the command line to run in "demo mode", where the game pauses after every frame.
Now, run the provided ReflexAgent in submission.py:
python pacman.py -p ReflexAgent
Note that it plays quite poorly even on simple layouts:
python pacman.py -p ReflexAgent -l testClassic
You can also try out the reflex agent on the default mediumClassic layout with one ghost or two:
python pacman.py -p ReflexAgent -k 1
python pacman.py -p ReflexAgent -k 2
Note: You can never have more ghosts than the layout permits.
Options: Default ghosts are random; for fun, you can also play against slightly smarter directional ghosts using -g DirectionalGhost. You can play multiple games in a row with -n followed by an integer indicating the number of games to play. Turn off graphics with -q to run lots of games quickly.
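For example, the following command (using only the flags described above) plays 10 quick games on the default layout against two directional ghosts with graphics turned off:
python pacman.py -p ReflexAgent -g DirectionalGhost -k 2 -n 10 -q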
Now that you are familiar enough with the interface, inspect the ReflexAgent
code carefully (in submission.py
) and make sure you understand what it's doing. The reflex agent code provides some helpful examples of methods that query the GameState
: A GameState
object specifies the full game state, including the food, capsules, agent configurations, and score changes: see submission.py
for further information and helper methods, which you will be using in the actual coding part. The description below is exhaustive and very detailed, for the sake of completeness and to save you from digging deeper into the starter code. The actual coding part is very small, so please be patient if it feels like there is too much writing.
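For orientation, here is a rough sketch of the kinds of queries you might make on a GameState inside getAction. The method names are the ones used in the starter code (double-check them in submission.py and pacman.py); the variable names are purely illustrative:
legalMoves = gameState.getLegalActions(0)                   # legal actions for agent 0 (Pac-Man)
successor = gameState.generateSuccessor(0, legalMoves[0])   # state after Pac-Man takes one action
numAgents = gameState.getNumAgents()                        # Pac-Man plus the number of ghosts
pacmanPosition = gameState.getPacmanPosition()
ghostPositions = gameState.getGhostPositions()
foodGrid = gameState.getFood()                              # grid of booleans; foodGrid.asList() gives coordinates
currentScore = gameState.getScore()
gameOver = gameState.isWin() or gameState.isLose()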
Note: If you wish to run the game in the terminal using a text-based interface,
check out the terminal
directory.
Note 2: If action tiebreaking is done deterministically in Problems 1, 2, and 3, runs on the mediumClassic map may result in mostly losses. This is alright, since the grader test cases don't run on these layouts.
Formally, consider the limited depth tree minimax search with evaluation functions taught in class. Suppose there are $n+1$ agents on the board, $a_0,\ldots , a_n$, where $a_0$ is Pac-Man and the rest are ghosts. Pac-Man acts as a max agent, and the ghosts act as min agents. A single depth consists of all $n+1$ agents making a move, so depth 2 search will involve Pac-Man and each ghost moving two times. In other words, a depth of 2 corresponds to a height of $2(n+1)$ in the minimax game tree (see diagram below).
Comment: In reality, all the agents move simultaneously. In our formulation, actions at the same depth happen at the same time in the real game. To simplify things, we process Pac-Man and ghosts sequentially. You should just make sure you process all of the ghosts before decrementing the depth.
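Purely as an illustration of this convention (the variable names here are ours, not from the starter code), the turn and depth bookkeeping might look like:
nextAgent = (agentIndex + 1) % gameState.getNumAgents()  # agent 0 is Pac-Man, agents 1..n are the ghosts
nextDepth = depth - 1 if nextAgent == 0 else depth       # decrement depth only after the last ghost has moved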
Before diving into the recurrence, let's understand our notation. In the recurrence for $V_{\text{minmax}}(s,d)$, $s$ represents the current state, and $d$ represents the current depth in the game tree, with $d = d_{\text{max}}$ indicating the root of the tree (initial state) and decreasing as we go deeper into the tree.
Write the recurrence for $V_{\text{minmax}}(s,d)$ in math as a piecewise function. You should express your answer in terms of the following functions:
Hint: It will be helpful to review the lecture slides about "Depth-limited search".
Implement the MinimaxAgent class in submission.py using the above recurrence.
Remember that your minimax
agent (Pac-Man) should work with any number of ghosts, and your minimax tree should have
multiple min layers (one for each ghost) for every max layer.
Your code should be able to expand the game tree to any given depth. Score the
leaves of your minimax tree with the supplied
self.evaluationFunction
, which defaults to
scoreEvaluationFunction
. The class
MinimaxAgent
extends MultiAgentSearchAgent
, which
gives access to self.depth
and
self.evaluationFunction
. Make sure your minimax code makes
reference to these two variables where appropriate, as these variables are
populated from the command line options.
Implementation Hints
Use self.index in your minimax implementation to refer to Pac-Man's index. Notice that only Pac-Man will actually be running your MinimaxAgent.
Your minimax code will operate on GameStates, either passed in to getAction or generated via GameState.generateSuccessor. In this assignment, you will not be abstracting to simplified states.
You may find the existing code in ReflexAgent and MinimaxAgent useful.
The default evaluation function, scoreEvaluationFunction, simply returns GameState.getScore.
The evaluation function for this part is already written (self.evaluationFunction), and you should call this function without changing it. Use self.evaluationFunction in your definition of $V_\text{minmax}$ wherever you used $\text{Eval}(s)$ in part 1a. Recognize that now we're evaluating states rather than actions; look-ahead agents evaluate future states, whereas reflex agents evaluate actions from the current state.
The minimax values of the initial state in the minimaxClassic layout are 9, 8, 7, and -492 for depths 1, 2, 3, and 4, respectively (passed in via the -a depth=[depth] argument). You can use these numbers to verify whether your implementation is correct. To verify, you can print your calculated minimax value in getAction
and check if the value of the initial state (first value that appears) is equal to the value listed above. Note that your Pac-Man agent will often win, despite the dire prediction of depth 4 minimax search, whose command is shown below. With depth 4, our Pac-Man agent wins 50-70% of the time. Depths 2 and 3 will give a lower win rate. Be sure to test on a large number of games using the -n
and -q
flags. Check the instructions in "Warmup" for more details on running multiple games in a row.
python pacman.py -p MinimaxAgent -l minimaxClassic -a depth=4
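As a sanity check (illustrative only; rootValue stands in for whatever variable holds your computed root value), you can temporarily print the value at the end of getAction and compare the first value printed against the numbers above:
print('minimax value of the initial state:', rootValue)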
These questions and observations are here for you to ponder; you do not need to include them in your write-up.
On larger boards such as openClassic and mediumClassic (the default), you'll find Pac-Man to be good at not dying, but quite bad at winning. It will often thrash around without making progress, and might even thrash around right next to a dot without eating it. Don't worry if you see this behavior. Why does Pac-Man thrash around right next to a dot?
Now implement alpha-beta pruning in the AlphaBetaAgent class to explore the minimax tree more efficiently. Again, your algorithm will be slightly more general than the pseudo-code in the slides, so part of the challenge is to extend the alpha-beta pruning logic appropriately to multiple ghost agents.
You should see a speed-up: Perhaps depth 3 alpha-beta will run as fast as
depth 2 minimax. Ideally, depth 3 on mediumClassic
should run in
just a few seconds per move or faster. To ensure your implementation does not time out,
please observe the 0-point test results of your submission on Gradescope.
python pacman.py -p AlphaBetaAgent -a depth=3
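For reference, the pruning idea from the two-player pseudo-code in the slides looks roughly like the toy sketch below, run here on a fixed game tree given as nested lists (this is only the two-agent version with our own toy representation; extending it to one max layer followed by several min layers, and to self.depth and self.evaluationFunction, is the point of the problem):
def alphaBeta(node, alpha, beta, maximizing):
    # Toy sketch: leaves are numbers, internal nodes are lists of children.
    if not isinstance(node, list):  # leaf: return its value
        return node
    if maximizing:
        value = float('-inf')
        for child in node:
            value = max(value, alphaBeta(child, alpha, beta, False))
            alpha = max(alpha, value)
            if beta <= alpha:
                break  # prune: the minimizer will never let play reach this branch
        return value
    else:
        value = float('inf')
        for child in node:
            value = min(value, alphaBeta(child, alpha, beta, True))
            beta = min(beta, value)
            if beta <= alpha:
                break  # prune: the maximizer already has a better option elsewhere
        return value

print(alphaBeta([[3, 5], [2, 9]], float('-inf'), float('inf'), True))  # prints 3; the leaf 9 is never visited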
The AlphaBetaAgent
minimax values should be identical to the
MinimaxAgent
minimax values, although the actions it selects can
vary because of different tie-breaking behavior. Again, the minimax values of
the initial state in the minimaxClassic
layout are 9, 8, 7, and
-492 for depths 1, 2, 3, and 4, respectively. Running the command given above this
paragraph, which uses the default mediumClassic
layout,
the minimax values of the initial state should be 9, 18, 27, and 36
for depths 1, 2, 3, and 4, respectively. Again, you can verify by printing the computed minimax
value of the initial state passed into getAction
. Note that when comparing the time performance of the AlphaBetaAgent to the MinimaxAgent, make sure to use the same layouts for both. You can manually set the layout by adding, for example, -l minimaxClassic to the command given above this paragraph.
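For instance, to compare timings on the same layout, you might run:
python pacman.py -p MinimaxAgent -l minimaxClassic -a depth=3
python pacman.py -p AlphaBetaAgent -l minimaxClassic -a depth=3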
Now implement the ExpectimaxAgent, where your Pac-Man agent no longer assumes that ghost agents take actions that minimize Pac-Man's utility. Instead, Pac-Man tries to maximize its expected utility and assumes it is playing against multiple RandomGhosts, each of which chooses from getLegalActions uniformly at random.
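Since each ghost picks uniformly at random, the value of a ghost (chance) node is just the unweighted average of its successors' values; illustratively (recurse stands in for your own expectimax recursion, and the variable names are ours):
ghostActions = gameState.getLegalActions(ghostIndex)
value = sum(recurse(gameState.generateSuccessor(ghostIndex, action)) for action in ghostActions) / float(len(ghostActions))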
You should now observe a more cavalier approach to close quarters with ghosts. In particular, if Pac-Man perceives that it could be trapped but might escape to grab a few more pieces of food, it will at least try.
python pacman.py -p ExpectimaxAgent -l trappedClassic -a depth=3
You may have to run this scenario a few times to see Pac-Man's gamble pay off. Pac-Man will win about half the time on average; for this particular command, the final score is -502 if Pac-Man loses and 532 or 531 (depending on your tie-breaking method and the particular trial) if it wins. You can use these numbers to validate your implementation.
Why does Pac-Man's behavior as an expectimax agent differ from its behavior as a minimax agent (i.e., why doesn't it head directly for the ghosts)? We'll ask you for your thoughts in Problem 5.
Write a better evaluation function for Pac-Man in the provided betterEvaluationFunction. The evaluation function should evaluate states rather than actions.
You may use any tools at your disposal for evaluation, including any util.py
code
from the previous assignments. With depth 2 search, your evaluation function
should clear the smallClassic
layout with two random ghosts more
than half the time for full (extra) credit and still run at a reasonable rate.
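One common pattern, sketched here purely as an illustration and by no means a winning evaluation function (the features and weights below are our own inventions), is a weighted combination of simple hand-designed features of the state:
def manhattan(a, b):
    # Manhattan distance between two (x, y) positions.
    return abs(a[0] - b[0]) + abs(a[1] - b[1])

def betterEvaluationFunction(currentGameState):
    # Illustrative sketch: combine the current score, distance to the closest food, and ghost proximity.
    pacmanPosition = currentGameState.getPacmanPosition()
    foodDistances = [manhattan(pacmanPosition, food) for food in currentGameState.getFood().asList()]
    ghostDistances = [manhattan(pacmanPosition, ghost) for ghost in currentGameState.getGhostPositions()]
    closestFood = min(foodDistances) if foodDistances else 0
    closestGhost = min(ghostDistances) if ghostDistances else float('inf')
    return currentGameState.getScore() - 1.5 * closestFood - 2.0 / (closestGhost + 1.0)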
python pacman.py -l smallClassic -p ExpectimaxAgent -a evalFn=better -q -n 20
For this question, we will run your Pac-Man agent 20 times with a time limit of 10 seconds, using your implementations from questions 1-3. We will calculate the average score you obtained in the winning games, provided you win more than half of the 20 games. You obtain 1 extra point per 100-point increase above 1200 in your average winning score, up to a maximum of 5 points. In
grader.py
, you can see how extra credit is awarded.
For example, you get 2 points if your average winning score is between 1400 and 1500.
In addition, the top 3 people in the class will get additional points of extra credit: 5 for the winner, 3 for the runner-up, and 1 for third place.
Note that late days can only be used for non-leaderboard extra credit. If you want to get extra credit from the leaderboard, please submit before the normal deadline.
Hints and Observations
Your betterEvaluationFunction should run within the same time limit as the other problems.
Describe your evaluation function in pacman.pdf, not in code comments. Note that you can attempt this question only if you have implemented a different evaluation function in part (a) above.
Before diving into the problem, it would be beneficial to refer to the AI alignment module to gain deeper insights and context.
In this problem we'll revisit the differences
between our minimax and expectimax agents, and
reflect upon the broader consequences of AI misalignment: when our agents don't do what we want them to do, or
technically do, but cause unintended consequences along the way. Going back to Problem 3, consider the following runs of the
minimax and expectimax agents on the small
trappedClassic
environment:
python pacman.py -p MinimaxAgent -l trappedClassic -a depth=3
python pacman.py -p ExpectimaxAgent -l trappedClassic -a depth=3
Be sure to run each command a few times, as there is some
randomness in the environment and the agents' behaviors, and pay attention, as
the episode lengths can be quite short. You can always add --frameTime 1
to the
command line so the game pauses after every frame. What you should see is that the minimax
agent will always rush towards the closest ghost, while the expectimax agent
will occasionally be able to pick up all of the pellets and win the episode.
(If you don't see this behavior, your implementations could be incorrect!)
Then answer the following questions:
Describe a modification to the default evaluation function $\text{Eval}(s)$ (i.e., scoreEvaluationFunction) and/or the default utility function $\text{Utility}(s)$ (i.e., the final game score) that would prevent the minimax agent from dying instantly in the trappedClassic environment and make its behavior more closely resemble that of the expectimax agent.
You do not have to implement the change, refer to specific methods of the GameState object, or give concrete numbers; just describe the hypothetical change in the functions. An answer that suggests changes to how the game score itself is computed (which both $\text{Eval}(s)$ and $\text{Utility}(s)$ depend upon) will also be accepted.
Go Pac-Man Go!
Files:
submission.py | Where all of your multi-agent search agents will reside, and the only file that you need to concern yourself with for this assignment.
pacman.py | The main file that runs Pac-Man games. This file also describes a Pac-Man GameState type, which you will use extensively in this assignment.
game.py | The logic behind how the Pac-Man world works. This file describes several supporting types like AgentState, Agent, Direction, and Grid.
util.py | Useful data structures for implementing search algorithms.
graphicsDisplay.py | Graphics for Pac-Man.
graphicsUtils.py | Support for Pac-Man graphics.
textDisplay.py | ASCII graphics for Pac-Man.
ghostAgents.py | Agents to control ghosts.
keyboardAgents.py | Keyboard interfaces to control Pac-Man.
layout.py | Code for reading layout files and storing their contents.
search.py, searchAgents.py, multiAgentsSolution.py | These files are not relevant to this assignment and you do not need to modify them.