Python @ LUNARC
Python is available on the cluster. Depending on the users experience and needs, different distributions are available. Below is an non-exhaustive list of installed distributions:
Anaconda2 Anaconda3 Python Biopython
Information, such as available versions, on these distributions can be had using the command
module spider <distribution-name>
where one has to replace the
distribution-name with the name of the distribution on is interested in. For example, when being interested in a specific version of the Python distribution.
module spider Python
will list all the Python versions of the Python distribution we have installed. To activate the version, one needs to enquire about the details of a specific version, load the prerequisites (e.g. compiler, MPI library, etc.) and the Python distribution. E.g. when being interested in Python version 3.6.6 the command
module spider Python/3.6.6
will provide information for which combination of compiler, CUDA and/or MPI library that version of Python is available. Note that the latest
Pythonmodules lack packages such as MPI4PY, numpy and scipy. For this you need to load a
SciPy-bundle. See below for details.
Please note to utilise a distribution that requires CUDA, you need to have access to an LU Local or LVIS (HPC Desktop On-Demand) project. For more information on how to use the module system consult our separate guide on using the installed software.
Depending on the needs of the user, we recommend different distributions.
For users without special requirements, the recommended distribution is the Anaconda. Anaconda is provided for Python versions 2 and 3. The modules are named Anaconda2 resp. Anaconda3. These are also the recommended distribution for users who whish to use the interactive environment spyder, though spyder is also provided for a few selected versions of the Python distribution.
To get the full functionality of the Anaconda environment should use the following commands:
module load Anaconda3 source conda_config.sh
This enables the full functionality of the conda command.
Creating conda environments
To be able to create reproducable and custom environments with specific versions of Python, Numpy or any other packages. It is encouraged to create a specific conda environment for this. To create a new conda environment make sure that you have activated the conda command with
Now you can create a new environment with the following command:
conda create -n myenv
This will start the process of creating a new environment:
Collecting package metadata (current_repodata.json): done Solving environment: done ## Package Plan ## environment location: /home/bmjl/.conda/envs/myenv Proceed ([y]/n)?
Press y to start the process. When the environment has been created the following is shown:
Preparing transaction: done Verifying transaction: done Executing transaction: done # # To activate this environment, use # # $ conda activate myenv # # To deactivate an active environment, use # # $ conda deactivate
Activate the new environment using
conda activate myenv
When activated all package installations, both conda and pip, goes into this environment and will not interfere with the base conda environment.
Users who are planning to do larger scale calculations potentially using MPI4PY or requiring specific compilers and libraries, the recommended distribution is named Python. Please note that this these require loading of pre-requisites, such as compilers, MPI and/or CUDA libraries, as detailed in the Overview section.
Please note that there are several versions of the toolchains such as the foss or intel toolchain. Each with multiple versions of Python. Please choose the one appropriate for you. Note that after having chosen your toolchain use module avail Python to see versions available.
For more information on available toolchains see compiling-code-and-using-toolchains
Python distributions requiring a GCCcore module
Starting in 2019, the pre-requisites and site-package contents of the distributions named Python have changed significantly. The modules can be loaded after loading a GCCcore compiler, which is show in the output of the
module spider command. These Python distributions do not contain site-packages depending on an MPI or BLAS library, such as MPI4PY, numpy or scipy. To access such MPI and/or BLAS dependent packages loading a SciPy-bundle-module is required, which require a compiler and an MPI module. Use
module spider SciPy-bundle to see what is available.
By default the SciPy-bundle will load a Python 3 module. If you require a Python 2 module, load that prior to loading SciPy-bundle.
For users working in bioinformatics on of the Biopython packages is most likely the desired one.
Python site-packages (aka. Python packages)
The distribuitons provided by LUNARC have large number of site-packages (aka. Python packages) installed. Users are encouraged to list the set of installed packages to verify that all needed packages are in place.
Anaconda users can check the installed packages with:
All other installations are using the pip framework:
LUNARC installed packages
LUNARC staff will install packages in the various distributions (mainly in the non-anaconda based ones). If you have a packages that may be of use for other users, or if you are having trouble installing a package, please send a support request for the installation of the package and LUNARC staff will consider it for inclusion in the distribution.
User installed packages
Users are also able to install their own packages by creating their own local repository.
Anaconda Python distributions
In Anaconda (available as a loadable module) the users are free to create their own environment. The user cen either use the pre-installed environment or create an empty environment and populate it with packages that the user chooses using conda create and conda install. Please review: Manage environments (new window) for information on how to use environments in conda.
Please note: The .conda directory where the installed-files are located may become large which may become a problem as the default location for this directory is in /home/< username >. If space is a problem, users should consider using /lunarc/nobackup instead.
In all other installations (default or loadable module) users can install their own packages using pip
pip install --prefix=$HOME/local package_name
This will install the package package_name in the users home-directory. Make sure the installation location of your packages gets added to your PYTHONPATH environment variable (e.g. private module file, sourceing a script, ...). For this to work you need a Python package with a recent version of
pip. The use of the
--install-option to redirect the installation location is no longer recommended.
Again, please check using
pip list or `conda list if the package is already installed in the selected distribution.