ibsg: an Information Bottleneck analysis of sim-max signaling games

This repository contains code for constructing sim-max games, simulating dynamics, and quantifying the evolved signaling languages' efficiency w.r.t the Information Bottleneck bound.

Replicating

The full experiment involves fairly extensive parameter sweeping. This codebase is integrated with a SLURM job scheduler to efficiency run many simulations in parallel. The top-level script to run the full experiment is in hpc/scripts/array.sub. This reads over 800 parameter combinations specified in hpc/configs/sim.txt and submits one job per line.

To run with different parameter configurations, you can use the [hpc/misc/helper.py] script to automatically write to the lines of a config file with the combinations you want.

If you want to explore running locally (e.g., for just a few iterations of a dynamics and only a few parameter combinations), read the sections below.

Setting up an experiment

This codebase uses hydra to organize configurations and outputs:

The conf folder contains the main config.yaml file, which can be overriden with additional YAML files or command-line arguments.
Running the shell script scripts/run.sh will generate folders and files in outputs, where a .hydra folder will be found with a config.yaml file. Reference this file as an exhaustive list of config fields to override.

Example

Here is an example command that will execute an experiment, overriding the hydra config defaults.

TODO

Description of command line args, in order of appearance:

./scripts/run.sh -m
- The -m flag indicates to hydra that we are performing a 'multirun' sweep over configs.

For each of the N jobs, unique folders will be generated and outputs will be written to them under multirun. These folders are hierarchically organized by the parameters described above.

Requirements

Step 1. Create the conda environment:

Get the required packages by running

conda env create -f environment.yml

Step 2. Install ULTK via git:

Additionally, this project requires the unnatural language toolkit (ULTK). Install it via git with

python -m pip install 'git+https://github.com/CLMBRs/ultk.git'

Replicating the experimental results

The main experimental results can be reproduced by running ./scripts/run.sh.

This will perform four basic steps by running the following scripts (with the appropriate config overrides):

Compute the IB curve

python src/curve.py
Simulate evolution

python src/run_simulations.py

Run one or more trials of an evolutionary dynamics simulation on a sim-max game, and save the resulting data.
Measure efficiency and other data of trajectories

python src/measure.py

Measure the optimality of emergent systems w.r.t the IB bound, record most similar theoretically optimal systems, and save resulting data. Optionally:
Collect the results of all multisweeps into one dataframe

python src/get_all_data.py

Search recursively for the results of simulation sweeps and IB curve estimations, collect them into dataframes annotated with their appropriate parameters as columns, and write as one (long) dataframe to all_data in tidy data format. This can be used to produce plots as in this notebook.
- NOTE: It is important to run this script on the same machine used to run simulations or estimate curves. The reason for this is because this script will look for numpy binaries (e.g., betas.npy or optimal_encoders.npy) generated during those steps. However, all numpy binaries are excluded from git history via the .gitignore due to large file limits. So, if you ran experiments using a remote server, you must run this script on the same server if you want perform data analysis on those experiments.

References

This codebase represents the efforts of an extension of the following paper:

Imel, N. (2023). "The evolution of efficient compression in signaling games." Proceedings of the 45th Annual Meeting of the Cognitive Science Society.

and uses code from the following repositories:

The above paper's codebase,
Noga Zaslavsky's ib-color-naming model,
Michael Franke and Pedro Correia's vagueness-games simulations.

Name		Name	Last commit message	Last commit date
Latest commit History 183 Commits
.vscode		.vscode
analysis_data		analysis_data
conf		conf
data		data
hpc		hpc
outputs/universe=100/num_signals=100/prior=1e-08/dist=squared_dist/meaning_certainty=0.5		outputs/universe=100/num_signals=100/prior=1e-08/dist=squared_dist/meaning_certainty=0.5
paper_movies		paper_movies
scripts		scripts
src		src
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ibsg: an Information Bottleneck analysis of sim-max signaling games

Replicating

Setting up an experiment

Requirements

Replicating the experimental results

References

About

Releases

Packages

Languages

nathimel/ibsg

Folders and files

Latest commit

History

Repository files navigation

ibsg: an Information Bottleneck analysis of sim-max signaling games

Replicating

Setting up an experiment

Requirements

Replicating the experimental results

References

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages