Data Engineering

Getting through a challenging age with data - Starschema Blog - Medium
Following the implosion of the US housing bubble, when the mid-2000s Great Recession began to hit markets worldwide, many enterprises found themselves in dire straits. In the ensuing crisis…
Deploying TabPy in Enterprise: Scaling and Hardening in Kubernetes
There are so many tutorials out in the wild about how to take an application, containerize it and run it in your enterprise’s on-prem/public cloud securely, but hey, this will be yet another one…
Mining your Tableau logs with Apache Drill - Starschema Blog - Medium
Tableau Server and Desktop logs each and every action you perform. The log data is a gold mine for people eager to understand what is happening under the hood and why. However, there is no easy way…
Text preprocessing in different languages for Natural Language Processing in Python
In the first part, I outlined text pre-processing principles based on a framework from an academic article. The underlying goal of all these techniques was to reduce text data dimensionality but…
Scaling out Tableau Extracts — Building a distributed Tableau Hyper Cluster
Tableau Hyper Database (“Extract”) is a great engine; it’s one of the reasons people are obsessed with Tableau analytics. However, being a single node database server, it has its limits (performance…

This website uses cookies

To provide you with the best possible experience on our website, we may use cookies, as described here. By clicking accept, closing this banner, or continuing to browse our websites, you consent to the use of such cookies.

I agree