Natural language processing (NLP) is a field of study dedicated to analyze of natural languages. In particular, using statistics and algorithms. This blog post provides a quick reference guide on what, when and how each of the methods RegExWord TokenizationBag of WordsWords as VectorsTf-idfName Entity Recognition can be used. Regular Expressions (RegEx) What it is … Continue reading My quick reference guide for a few natural language processing techniques
Blog
If you have Anaconda installed, you can run Jupyter notebooks by typing $python -m notebook In case you need to add a R kernel, you will need to install the iterative kernels using the following commands: Update conda using $ conda update ipython-notebook Install the interactive R kernel with: $ conda install -c r r-essentials … Continue reading Adding R kernel to Jupyter notebook
This is an interesting article about how to dockerize Python codes using Docker containers in a Linux environment. This is article has been reproduced. You may find the source here: https://runnable.com/docker/python/dockerize-your-python-application A cheat sheet for using docker is available here: https://www.docker.com/sites/default/files/Docker_CheatSheet_08.09.2016_0.pdf Dockerize your Python Application Dockerfiles enable you to create your own images. A Dockerfile describes the software … Continue reading Adding a Python Code to a Docker Container
In 2016, the 2015 global wine market was valued in €28.3 billion [6]. With such a large value, it makes sense to employ data science techniques to understand what physical and chemical properties affect wine quality. In this series of posts, I will work with the chemical components of the Vinho Verde wine (using the … Continue reading Wine quality part 1 of 3: data analysis
A friend of mine recommended an interesting article from the Harvard Business Review (HBR) about using Data and Analytics across different parts of organisations (I have copied the content below). It is worth to remember that there are methodologies to correctly employ data for business strategies. See, for instance, the CRISP-DM methodology illustrated below. … Continue reading From HBR: How to Integrate Data and Analytics into Every Part of Your Organization