Data Science

Statistics, data preparation, databases, you name it

colin working on a remote jupyter notebook

Remote jupyter notebooks with ssh port forwarding

Access your jupyter notebook server remotely

fit to the COVID-19 data and extrapolation, with uncertainties

COVID-19 Analysis: Uncertainties

Learn how to analyse the COVID-19 data from JHU by yourself, with a proper treatment of the uncertainties.

connecting to MongoDB from a jupyter notebook

MongoDB in 10' : Fill with python, Read with pandas

Set up your first MongoDB server, store data with python, and analyze it with pandas in a jupyter notebook.

remote MongoDB connection

MongoDB : Remote Access (raspberry pi)

With this simple recipe, you will learn how to manage ports on your Raspberry Pi (or any Debian-based system) and how to enable remote access to your database server. Installing MongoDB on the Pi is also covered.

anaconda logo

Install Anaconda for Machine Learning and Data Science in Python

General instructions to install Anaconda, which is used in most posts on this blog.

word encoding for natural language processing

Text Preprocessing For Machine Learning (yelp dataset part 2)

The Yelp dataset is large, and it comes in text format. Here are detailed explanations and all the code needed to convert it to a NumPy array for machine learning.


Simple Text Mining with Pandas

Prepare the Yelp dataset (shop reviews) for natural language processing.

screenshot of this blog with perfect jupyter integration

Jupyter Notebooks in Web Pages

Learn how to embed jupyter notebooks in a web page, with syntax-highlighted code and properly rendered equations.