Data Science
Data Science
Fighting the COVID-19 pandemic with data and context
The COVID-19 outbreak is in many ways an outlier. It emerged with unusual speed, spread rapidly throughout the globe and has elicited a public health response that is unprecedented in recent…
Data Science
COVID-19 and the first war of data science - Starschema Blog - Medium
In the subtitle of his remarkable history about the race for the nuclear bomb, science writer and historian of science Jim Baggott referred to World War II as the “first war of physics”. Today, the…
Data Science
Arguing with Edward Snowden - Starschema Blog - Medium
I’ve recently read Edward Snowden’s Permanent Record during my holiday. I think it is a great book that I highly recommend for basically anyone, however it is particularly interesting for IT-folks…
Data Science
Text preprocessing in different languages for Natural Language Processing in Python
In the first part, I outlined text pre-processing principles based on a framework from an academic article. The underlying goal of all these techniques was to reduce text data dimensionality but…
Data Science
Predictive maintenance helped win a war. Now, it can help you outpace the competition.
The year is 1943. Britain’s survival still hangs by a thin, precarious thread, despite America joining the war effort. Just a few months ago, in February 1942, two German battleships, the…
Data Science
A comprehensive guide to text pre-processing with python
This is Part 1 of a pair of tutorials on text pre-processing in python. In this first part, I’ll lay out the theoretical foundations. In the second part, I’ll demonstrate the steps described below…
Data Science
Answering the big questions (this time, in chemistry)
Why do some molecules have undesirable biological effects, but others don’t? A model can tell us which do and which don’t, but model introspection can go one step further: it can tell us why.
Data Science
Create a map of Budapest districts colored by income using folium in Python
Ever wondered how to draw a map of less common geographical areas? And color them based on some data? This pair of tutorials shows how to build this from scratch! First, you need to construct the…
Data Science
Draw a map of the districts of Budapest using the Overpass API of OpenStreetMap and Python
Ever wondered how to draw a map of less common geographical areas? Perhaps even colour them based on some data? This is the first in a series of two tutorials that show you how to build this from…
Data Science
JIT fast! Supercharge tensor processing in Python with JIT compilation
At Starschema, we’re constantly looking for ways to speed up some of the computationally intensive tasks we’re dealing with. Since a good amount of our work involves image processing, this means…
Data Science
Digging deeper into ensemble learning - Starschema Blog - Medium
Have you ever wondered how combining weak predictors can yield a strong predictor? Ensemble Learning is the answer! This is the second of a pair of articles in which I will explore ensemble learning…
Data Science
Combine your machine learning models for better out-of-sample accuracy
Have you ever wondered how combining weak predictors can yield a strong predictor? Ensemble Learning is the answer! This is the first of a pair of articles in which I will explore ensemble learning…
Data Science
Growing Neural Gas for Good: quantifying hard retinal exudates in diabetic retinopathy using GNGs
Diabetic retinopathy is a major cause of blindness in the developed world. Read how an uncommon neural network algorithm can be used to quantify the extent of disease.
Data Science
Quantifying hard retinal exudates using Growing Neural Gas algorithms
Diabetic retinopathy is a major cause of blindness in the developed world. Read how an uncommon neural network algorithm can be used to quantify the extent of disease.
Data Science
Self-Organising Feature Maps for fun and profit - Starschema Blog - Medium
This is Part 2 of a three-part series on competitive neural networks. You can find Part 1, an introduction to competitive neural networks, here. Part 3, which looks at a different competitive…
Data Science
Self-Organising Feature Maps for fun and profit - Chris von Csefalvay - Medium
This is Part 2 of a three-part series on competitive neural networks. You can find Part 1, an introduction to competitive neural networks, here. Part 3, which looks at a different competitive…
Data Science
Funderstanding competitive neural networks - Starschema Blog - Medium
Funderstanding is a little term I came up with a few years ago for fun ways of understanding complex concepts. The typical university way of teaching something is by laying the theoretical…