This is a spare-time blog for collecting little recipes for data handling and data analysis, mainly with Hadoop, Python and R. The selection of topics is driven by

  • the time it took to find relevant information online
  • the likelihood of forgetting it and having to look for it again

Disclaimer: this site has no relationship to commercial entities with potentially similar names.

