About
This is a spare-time blog that collects small recipes for data handling and data analysis, mainly with Hadoop, Python, and R. The selection of topics is driven by:
  • the time it took to find relevant information online
  • the likelihood of forgetting and having to look for it again
Disclaimer
This personal blog has no relationship to commercial entities with potentially similar names.