Shallow vs Deep Copy in Python

Posted by Jack McKew on Fri 09 October 2020 in Data Science • Tagged with datascience, python • 4 min read

One of the most crucial parts of any programming language is managing variables. We create, modify, compare and delete variables to build the more complex systems that eventually make up the software we use. This is typically done using the = operator (eg x = 5 …
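As a rough sketch of the distinction in the title (the nested list and variable names here are illustrative assumptions, not taken from the full post), the standard library `copy` module shows how plain assignment, a shallow copy and a deep copy differ:

```python
import copy

original = [[1, 2], [3, 4]]      # hypothetical nested data

alias = original                 # = binds a second name to the same list
shallow = copy.copy(original)    # new outer list, inner lists still shared
deep = copy.deepcopy(original)   # new outer list and new inner lists

original[0].append(99)           # mutate a nested list in place

print(alias is original)  # True
print(shallow[0])         # [1, 2, 99]
print(deep[0])            # [1, 2]
```

Only the deep copy is unaffected by the change to the nested list.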


Continue reading

Types of Averages (Means)

Posted by Jack McKew on Fri 21 August 2020 in Data Science • Tagged with datascience, python • 4 min read

The most common analytical task is to take a bunch of numbers in a dataset and summarise them with fewer numbers, preferably a single number. Enter the 'average': sum all the numbers and divide by the count of the numbers. In mathematical terms this is known as the 'arithmetic mean', and …
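The "sum and divide by the count" recipe can be sketched in a couple of lines. The sample values below are made up for illustration, and `statistics.mean` from the standard library is used only as a cross-check:

```python
from statistics import mean

values = [2, 4, 4, 10]  # hypothetical sample data

# Arithmetic mean: sum all the numbers, then divide by how many there are.
arithmetic_mean = sum(values) / len(values)

print(arithmetic_mean)                  # 5.0
print(arithmetic_mean == mean(values))  # True
```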


Continue reading

Dataclasses vs Attrs vs Pydantic

Posted by Jack McKew on Fri 07 August 2020 in Data Science • Tagged with datascience, python • 6 min read

Python 3.7 introduced dataclasses, a handy decorator that makes creating classes much easier and more seamless. This post compares a regular class, a 'dataclass' and a class using attrs. Dataclasses were based on attrs, which is a Python package that also aims to make creating …
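To give a feel for the comparison, here is a minimal sketch of a regular class next to a dataclass. The class and field names are assumptions made for illustration, and attrs is left out because it is a third-party package:

```python
from dataclasses import dataclass


class RegularPoint:
    """A hand-written class: only __init__ is defined."""

    def __init__(self, x: float, y: float):
        self.x = x
        self.y = y


@dataclass
class DataPoint:
    """The decorator generates __init__, __repr__ and __eq__ from the fields."""

    x: float
    y: float


# Dataclass instances compare by field values.
print(DataPoint(1.0, 2.0) == DataPoint(1.0, 2.0))        # True
# Regular instances fall back to identity comparison.
print(RegularPoint(1.0, 2.0) == RegularPoint(1.0, 2.0))  # False
```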


Continue reading

Generators in Python

Posted by Jack McKew on Thu 30 July 2020 in Data Science • Tagged with datascience, python • 4 min read

Generators are a special type of function in Python that let you 'lazy load' data; a function becomes a generator when it uses the yield statement. Lazy loading is when you access just the portion of a data set that you are interested in (eg, the part you are working with), as …
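A minimal sketch of the idea, assuming a hypothetical helper named `read_in_chunks` (not from the full post): the `yield` statement makes the function a generator, so each chunk is produced only when it is asked for.

```python
def read_in_chunks(items, chunk_size):
    """Yield successive slices of items, one chunk per request."""
    for start in range(0, len(items), chunk_size):
        yield items[start:start + chunk_size]


chunks = read_in_chunks(list(range(10)), 4)

# Nothing is sliced until next() (or a for loop) asks for a chunk.
print(next(chunks))  # [0, 1, 2, 3]
print(next(chunks))  # [4, 5, 6, 7]
```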


Continue reading

Street Suffix Analysis & Colouring with Python

Posted by Jack McKew on Fri 24 July 2020 in Data Science • Tagged with datascience, python • 5 min read

Street Suffix Visualisation with Python

Ever thought about how roads and streets are named where you live? How many are roads versus how many are streets? Is there a specific pattern to it where you live or is it just random? This post is going to go into how to …


Continue reading

Sentiment Analysis & Text Cleaning in Python with Vader

Posted by Jack McKew on Fri 17 July 2020 in Data Science • Tagged with datascience, python • 8 min read

Sentiment Analysis in Python with Vader

Sentiment analysis is the interpretation and classification of emotions (positive, negative and neutral) within text data using text analysis techniques. Essentially, it tries to judge the amount of emotion in the written words & determine what type of emotion it is. In this post we'll go into how …


Continue reading

Profitable Python Podcast - Show Notes

Posted by Jack McKew on Fri 10 July 2020 in Software • Tagged with software, datascience • 4 min read

I was recently a guest on the Profitable Python podcast with host Ben McNeill; the episode can be found at: https://anchor.fm/profitablepythonfm/episodes/Pandas-Alive--Jack-McKew-efui92/a-a2idber. This blog post serves as the show notes. If I've missed anything, please drop a comment below!

Projects Mentioned

A project where the …


Continue reading

Translating Text in Python

Posted by Jack McKew on Fri 29 May 2020 in Data Science • Tagged with datascience, python • 9 min read

Working with data in a connected digital world means you will possibly encounter data in a language other than your own. In this post we'll go into ways to translate this data in Python.

First off we need some sample text, and what is better to read about than pizza!

In …

Continue reading

Creating Animated Plots with Pandas_Alive

Posted by Jack McKew on Thu 21 May 2020 in Data Science • Tagged with datascience, python, data-viz, datavisualisation • 5 min read

In this tutorial we'll learn how to create a series of animations using Pandas_Alive. This post is rendered in the style of a Jupyter Notebook. Find the source here: https://github.com/JackMcKew/jackmckew.dev/tree/master/content/2020/pandas_alive/notebooks/pandas_alive_demo.ipynb.

Pandas_Alive was created by me! I set …


Continue reading

COVID-19 Confirmed Cases NSW Australia - Animated Statistics over Time

Posted by Jack McKew on Thu 14 May 2020 in Data Science • Tagged with datascience, python, data-viz, datavisualiation • 1 min read

Recently, I wanted to build a visualisation of the confirmed cases of COVID-19 in my home state, NSW. This post covers the release of the visualisation on YouTube, and I hope to write future post(s) about building this visualisation & developing Pandas_Alive. Would love to hear …


Continue reading