Practical

and/or

Take advantage of the knowledge of your peers and practice your skills with the press_schechter code.
Some ideas include:

Add inline documentation and build docs using Sphinx
- Tip: use the Napolean extension and Numpy or Googledoc style docstrings.
- Bonus: Fork of the code_prac_hwsa2021 repo on Github and serve the docs online using Github Pages.
Develop unit tests for the code
- Bonus: Use coverage.py tool to measure your unit test coverage and aim for >90% coverage.
- Bonus bonus: Fork the code_prac_hwsa2021 repo and use Github Actions to automatically test the code on every push.
Add type annotations to the code then use mypy to check these.
If you manage to speed up the press_schechter function enough (or ask me for a faster version), try making an interactive tool for exploring the PS mass function using Jupyter Notebooks and ipywidgets (or a tool of your choice).

Optimising Python code

HWSA 2021

Simon Mutch

Session outline

The optimisation cycle

cProfile

cProfile

cProfile

cProfile

line_profiler

line_profiler

Testing

Pytest

Regression tests with pytest

Regression tests with pytest

Regression tests with pytest

Regression tests

Let's go fast!

Things to know about Python and speed

Python is an interpreted language and (almost) everything is an object

The most important scientific python optimisation rule

For loops should be avoided if possible. Take advantage of Numpy's ufuncs (which will vectorize it and do it more efficiently).

An aside...

A common misconception:

I've removed all the loops I can, but I still need more speed!!!

Memoisation

The idea

Works well when

Memoisation

A (very contrived) example1

Memoisation

A (very contrived) example1

Memoisation

However...

joblib.Memory

Going lower level...

Numba

Where Numba might not be enough(WARNING: very subjective!)

Parallelisation

ExtraNative parallelisation in Python is expensive

The GIL

Parallelisation with Numba

Practical time!

Practical

Main suggestion

Practical

and/or

Remember

A (very contrived) example¹

A (very contrived) example¹

Where Numba might not be enough
(WARNING: very subjective!)

Extra
Native parallelisation in Python is expensive