Additional Python packages
Miniforge is a very streamlined Python Distribution containing only the most necessary packages. In addition to these, there are many other packages we need to do scientific computing. We will install those in this section of the guide.
Note
Miniforge (also Mambaforge, Miniconda) is based on the Anaconda Python Distribution, which includes a large number of packages related to scientific computing by default. However, it also means, that it is rather large, and many of those packages are of no interest to most economists. (If you're interested in all the packages included, click here and go to the Python 3.9 tab.) For this reason, start from a barebones distribution and only install what we require instead.
The following recipe works for all operating systems.
There are a multitude of package management tools that work with Python packages. In this course, we use conda
, which is the default one for Anaconda-based Python distributions. It combines dependency management and virtual environments in one program. Furthermore, it is designed with the scientific workflow in mind. For those packages that are not available in the conda repositories, we fall back to pip
, Python's standard package manager.
Note
Mixing conda
and pip
is generally advised against, but sometimes it cannot be avoided. However, by using conda
first and pip
second, and utilizing virtual environments, you are mostly safe. You can find more info here.
First let us update conda
by typing the following into the terminal
conda update conda --yes
Next, install a handful of packages using the following command
conda install numpy pandas
Press enter to accept the proposed package plan with 'y' and return
.
This command installs those two packages into the default (base) environment. It is fine, as we would like those to be shared across all environments we later create. However, it is generally not the case, as we want dependencies for our different projects well separated. For example, packages for this course should be wholly independent from packages you are using to write a paper. The next command creates a conda
environment called pp4rs
and installs all the packages required for the course. It might take a while.
conda env create -f https://pp4rs.github.io/2024-uzh-installation-guide/assets/pp4rs_environment.yml
Generally, you should have such a file for each of your projects in order to easily reproduce the environment it was built in any time in the future.
Note
You can find more info on how to create such environment specifications here
Now it is time to test your Python installation. Type into your shell the following
conda activate pp4rs_backup
You should now see that the CLI says (pp4rs_backup)
in the beginning.
To see which packages are install in this new pp4rs environment, run:
conda list
It should return a list with all of the packages that are installed in the pp4rs_backup
environment. In order to use them, you will need to activate this environment in the beginning of every new shell session.
You can always see which environment you are currently using, by looking at the beginning of the prompt in your terminal. If you open a new terminal it will say (base)
, meaning your base environment. To use the pp4rs environment, you need to activate it.