=================
Coding 
=================

General 
-------
* Software developers who train for interviews use `HackerRank <https://www.hackerrank.com/interview/interview-preparation-kit>`_, but it also has nice problems if you just want to improve your computational thinking and coding skills in general
* `A Common-Sense Guide to Data Structures and Algorithms: Level Up Your Core Programming Skills <https://pragprog.com/book/jwdsal/a-common-sense-guide-to-data-structures-and-algorithms>`_, written by Jay Wengrow, is a fast and easy read to get an introduction to data structures, time complexity and some important sorting and searching algorithms
* For file management, I sometimes find it useful to use

  * `tree <http://mama.indstate.edu/users/ice/tree/>`_
  * and, for copying files etc., `Midnight Commander <https://midnight-commander.org/>`_
* `Visualization of latency numbers every programmer should know <https://people.eecs.berkeley.edu/~rcs/research/interactive_latency.html>`_ (this shows why you want to be in L1 cache; also nice is the `humanized version with SSD random read as a "normal weekend" and L1 cache as a heartbeat <https://gist.github.com/hellerbarde/2843375>`_)
* Bloom filters are interesting data structures for compact, constant-time membership lookups (at the price of occasional false positives). I found this blog article instructive: https://prakhar.me/articles/bloom-filters-for-dummies/
* For SQL design and code the `WWW SQL Designer tool <http://ondras.zarovi.cz/sql/demo/>`_ is pretty useful
* `Best Practices for Scientific Computing <https://arxiv.org/abs/1210.0530>`_ is a good intro to software development for scientists
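
To make the Bloom filter idea mentioned above a bit more concrete, here is a minimal sketch in Python. The class name, the default sizes and the use of salted SHA-256 hashes are my own assumptions for illustration; a real implementation would pack the bits and use faster hash functions.

.. code-block:: python

    import hashlib


    class BloomFilter:
        """Minimal Bloom filter: a bit array plus k hash functions."""

        def __init__(self, num_bits=1024, num_hashes=3):
            self.num_bits = num_bits
            self.num_hashes = num_hashes
            # One byte per bit keeps the sketch readable (not memory-optimal).
            self.bits = bytearray(num_bits)

        def _positions(self, item):
            # Derive k positions by salting a single hash with the index i.
            for i in range(self.num_hashes):
                digest = hashlib.sha256(f"{i}:{item}".encode()).hexdigest()
                yield int(digest, 16) % self.num_bits

        def add(self, item):
            for pos in self._positions(item):
                self.bits[pos] = 1

        def __contains__(self, item):
            # False means "definitely not added"; True means "probably added".
            return all(self.bits[pos] for pos in self._positions(item))


    bf = BloomFilter()
    for word in ("apple", "banana"):
        bf.add(word)

    print("apple" in bf)  # True: items that were added are always found

Lookups never miss an item that was added, but they can report an item that was never added. The false-positive rate grows as the bit array fills up.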

Python
-------
* `Why scientists should learn to program in Python <https://www.cambridge.org/core/journals/powder-diffraction/article/why-scientists-should-learn-to-program-in-python-/EB88FFCC7384998768AFDAE219EF6EFA>`_ is a neat introduction to Python for natural scientists

Jupyter notebook tricks
```````````````````````
* Magics are great: :code:`%env` lists environment variables, :code:`!` runs shell commands, and :code:`%lsmagic` lists all available magics. You can even do `profiling <http://pynash.org/2013/03/06/timing-and-profiling/>`_. Another nice one is :code:`%pastebin`, with which you can upload selected line numbers to a pastebin
* IPython widgets can be nice for making simple interactive plots (e.g. for educational purposes). The `Domino Data Lab blog has a nice overview and an interactive ping plot <https://blog.dominodatalab.com/interactive-dashboards-in-jupyter/>`_
* `numpy-html <https://pypi.org/project/numpy-html/>`_ is quite neat for rendering NumPy arrays in notebooks.
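
Outside of a notebook, the timing and profiling magics have rough counterparts in the standard library modules :code:`timeit` and :code:`cProfile`. The following is a minimal sketch; :code:`slow_sum` is a hypothetical function that only serves as something to measure.

.. code-block:: python

    import cProfile
    import io
    import pstats
    import timeit


    def slow_sum(n):
        """Hypothetical workload: sum of squares with a plain Python loop."""
        total = 0
        for i in range(n):
            total += i * i
        return total


    # Roughly what %timeit does: run the statement repeatedly and time it.
    seconds = timeit.timeit(lambda: slow_sum(10_000), number=100)
    print(f"100 calls took {seconds:.4f} s")

    # Roughly what %prun does: collect per-function call statistics.
    profiler = cProfile.Profile()
    profiler.runcall(slow_sum, 100_000)

    report = io.StringIO()
    pstats.Stats(profiler, stream=report).sort_stats("cumulative").print_stats(5)
    print(report.getvalue())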

Jupyter notebooks on HPC environments
`````````````````````````````````````
Using configuration file
*************************

In some cases it can be useful to submit the IPython kernel to a compute node of the cluster, instead of running
it on the head node, and then to access the kernel from the browser (either from the head node or from your local machine).

You can achieve this by changing a few settings in your :code:`~/.jupyter/jupyter_notebook_config.py` file. You can
create this file using :code:`jupyter notebook --generate-config`.

In this file you can then set the following:

::

    c.NotebookApp.ip = '*'
    c.NotebookApp.open_browser = False
    c.NotebookApp.port = XXXXX

Here, :code:`XXXXX` is a port number greater than 8888. You might also want to set a password instead of a token
(you can do this in the config file or by running :code:`jupyter-notebook password`).
In your submission script you can then add :code:`hostname` to print the hostname, which you can then use to access
the notebook at :code:`hostname:XXXXX`. In some cases you might also want to hardcode :code:`c.NotebookApp.ip` to
the IP of a particular compute node and then simply bookmark this address.


Using tunneling
***************
Add the following lines to your submission script (in this case, setting a password is really useful):

::

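    # Jupyter may otherwise try to use a runtime directory that is not writable on the compute node
    unset XDG_RUNTIME_DIR
    # IP address of the compute node and a random port between 8888 and 9999
    NODEIP=$(hostname -i)
    NODEPORT=$(shuf -i 8888-9999 -n 1)
    # Write the address to the job's output file
    echo "$NODEIP:$NODEPORT"

    jupyter-notebook --ip="$NODEIP" --port="$NODEPORT" --no-browser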

:code:`shuf -i 8888-9999 -n 1` just picks a random port (binding to port 0 and letting the OS choose is better
practice, but this is easier). You can then look into the output file that your
scheduler produced; it should contain :code:`NODEIP` and :code:`NODEPORT`, which you can use to
establish a tunnel connection from your local machine using

::

    ssh -N -L 8888:$NODEIP:$NODEPORT <username>@<machinename>

On your local machine you can now access the Jupyter server at :code:`http://localhost:8888`. If you have to use
Windows on your local machine, you can set up tunneling using MobaXterm. On some schedulers you may want to enable
direct writing of output files; on the most recent version of PBS Pro, this is possible with :code:`qsub -koed`.

Note that in both approaches the kernel will of course die after the walltime is exceeded.
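
The remark in the tunneling section about binding to port 0 and letting the OS choose can be sketched in a few lines of Python. The function name :code:`find_free_port` is my own. Note that the port is released again when the socket is closed, so another process could in principle grab it before Jupyter starts.

.. code-block:: python

    import socket


    def find_free_port():
        """Ask the OS for a currently unused TCP port by binding to port 0."""
        with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
            sock.bind(("", 0))
            # getsockname() returns (address, port) for an AF_INET socket.
            return sock.getsockname()[1]


    port = find_free_port()
    print(port)

Saved as a script (say, a hypothetical :code:`free_port.py`), this could replace the :code:`shuf` line with :code:`NODEPORT=$(python free_port.py)`.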

Speaking of Jupyter notebooks, I like the `jupyterthemes package <https://github.com/dunovank/jupyter-themes>`_.

Online
``````
* The `talks by Raymond Hettinger <https://www.youtube.com/playlist?list=PLRVdut2KPAguz3xcd22i_o_onnmDKj3MA>`_ are always valuable and fun
* Great information is in `The Hitchhiker's Guide to Python <https://docs.python-guide.org/>`_
* `Bernd Klein <https://www.python-course.eu/python3_course.php>`_ also has good information on advanced topics such as metaclasses
  or memoization with decorators
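
As a small illustration of memoization with decorators, here is a minimal sketch. The decorator name :code:`memoize` is my own, and it only handles hashable positional arguments.

.. code-block:: python

    import functools


    def memoize(func):
        """Cache the results of func, keyed by its positional arguments."""
        cache = {}

        @functools.wraps(func)
        def wrapper(*args):
            if args not in cache:
                cache[args] = func(*args)
            return cache[args]

        return wrapper


    @memoize
    def fib(n):
        """Naive recursive Fibonacci; fast only because results are cached."""
        return n if n < 2 else fib(n - 1) + fib(n - 2)


    print(fib(30))  # 832040

In practice, :code:`functools.lru_cache` from the standard library does the same job and also supports keyword arguments and a size limit.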

Faster Python
`````````````
* Consider trying `PyPy <http://pypy.org/features.html>`_ instead of CPython (check the `benchmarks <http://speed.pypy.org/>`_).
* Nice introduction to vectorization, NumPy and Numba (and to what the bottlenecks in Python are) by `Donald Whyte <https://www.youtube.com/watch?v=NoJr08FNQeg>`_

C
--

Online
``````
* `Build Your Own Lisp <http://www.buildyourownlisp.com/>`_ is a nice way to get
  started with C and learn about Lisps


Editors/IDEs
------------

VIM
```
Vim is a really powerful editor, but you need to spend some time learning and
configuring it. 

Configuration
*************

Some useful settings for the :code:`.vimrc` file are:

* Syntax highlighting: :code:`syntax enable` is probably self-explanatory
* search
  :: 

       set incsearch           " lookahead search
       set ignorecase          " in most cases I want to be case-insensitive
       set smartcase           " unless I explicitly use uppercase
       set hlsearch            " highlight matches

* Indentation
  ::

       set tabstop=4           " number of spaces per <TAB>
       set expandtab           " convert <TAB> key-presses to spaces in insert mode
       set shiftwidth=4        " number of spaces per (auto)indent step, e.g. for >> and <<

       set autoindent          " copy indent from current line when starting a new line
       set smartindent         " even better autoindent ('smart' insert after e.g. {) 

* Persistent undo
  ::

       if has('persistent_undo')
         " Save all undo files in a single location (less messy, more risky)...
         set undodir=$HOME/.VIM_UNDO_FILES

         " Save a lot of back-history...
         set undolevels=5000

         " Actually switch on persistent undo
         set undofile

       endif

* I am paranoid: I want to lose at most 10 keystrokes
  ::

     set updatecount=10

* If you do not want to type the whole search/replace syntax (vide infra), remap it
  ::
     
     nmap  S  :%s//g<LEFT><LEFT>

  Now you only need to type
  ::
     
     SX/Y<CR>

  for global search/replace on all lines.


If you want to see a really crazy setup, check out 
`Damian Conway's vim setup <https://github.com/thoughtstream/Damian-Conway-s-Vim-Setup>`_. 
There you can also find how to create the `Star Wars intro in vim <https://github.com/thoughtstream/Damian-Conway-s-Vim-Setup/blob/master/plugin/SWTC.vim>`_. 

Plugins 
*******
* `schlepp <https://github.com/zirrostig/vim-schlepp>`_: makes it easier to move text around in visual block mode
* `fatfinger <https://github.com/chip/vim-fat-finger>`_: corrects common misspellings
* `python syntax highlighting <https://www.vim.org/scripts/script.php?script_id=790>`_
* `flake8 <https://github.com/nvie/vim-flake8>`_ for PEP8 style and error checking
* if you are used to :code:`<TAB>` completion, you might like `supertab <https://www.vim.org/scripts/script.php?script_id=1643>`_
* `jedi-vim <https://github.com/davidhalter/jedi-vim>`_ for some nice Python autocompletion

Commands 
*********
* Use :code:`$` to get to the end of the line
* Use different navigation levels: :code:`b` and :code:`w` move by word, :code:`{` by paragraph, and :code:`(` by sentence
* Search/Replace (the :code:`g` flag replaces all matches in a line, not only the first)
     
     * all lines :code:`:%s/foo/bar/g` 
     * this line :code:`:s/foo/bar/g`

PyCharm
```````
PyCharm is the IDE I use for larger Python projects. Some useful features are:


Sublime
```````
Sublime is a lot faster than PyCharm and supports basically all languages. For setting it up, the `realpython blog <https://realpython.com/setting-up-sublime-text-3-for-full-stack-python-development/>`_ has some useful package recommendations (the package manager in particular is really good). In addition to that, I would recommend `PyYapf <https://github.com/jason-kane/PyYapf>`_ and the `Flake8 linter <https://github.com/SublimeLinter/SublimeLinter-flake8>`_.


Development process
-------------------
Starting a project
``````````````````
The easiest way to start a (python) project is to use a `cookiecutter <https://github.com/audreyr/cookiecutter>`_ 
that creates the basic project structure and also some configuration files for you. 
A nice one in the field of molecular simulations is the 
`cookiecutter for computational molecular sciences python packages <https://github.com/MolSSI/cookiecutter-cms>`_ 

CI/CD
`````

Docker 
******
On HPC environments, where you don't have root rights, `Singularity <https://www.sylabs.io/docs/>`_ might be the
way to go. There is also an `image to convert Docker images to Singularity images <https://github.com/singularityware/docker2singularity>`_.

Git(hub)
********


Pre-Commit 
``````````

Documentation 
`````````````
* `reStructuredText Quick Reference <http://docutils.sourceforge.net/docs/user/rst/quickref.html>`_: useful when writing Sphinx docs

Schedulers
``````````

Slurm
*****

* Kill all jobs:
  :: 

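    squeue -h -u $USER | awk '$1 ~ /^5/ {print $1}' | xargs -n 1 scancel

  Replace :code:`5` with the leading digits that all of your job IDs share. To cancel all of your jobs regardless of their IDs, :code:`scancel -u $USER` is enough.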