k-means data mining algorthim

k-means data mining algorithm in plain English

The k-means data mining algorithm is part of a longer article about many more data mining algorithms.What does it do? k-means creates $latex k$ groups from a set of objects so that the members of a group are more similar. …

Space Needle

PyData Seattle 2015 – Simulation, PySpark and Deep Learning Tutorials

One month ago, I got an email from a colleague: The subject of his email was a single word: PyData The email had a single EventBrite.com URL. Intrigued, I clicked… and that’s when my adventure into PyData began. In this post, …

Gadget Graph

What is an exchangeability martingale?

Exchangeability martingales can be used to find strange or odd data points. The mechanics of exchangeability martingales is still a bit of a mystery to me. However, posted below are the basic components and how each step gets us to …

Broken Chain

Broken link checker analysis with R

You’ve found a few broken links on your blog or web site, and now you wonder whether there are more. Here’s a quick way to get a sense of how big the problem is. You could basically take 2 different …

teamwork

R set working directory using copy and paste

After using Windows Explorer to navigate to a directory, I just want to copy and paste the directory in R, but I get this error: I used the setwd() function and pasted the directory, but there is an answer! By …

overflow

SQL stress test: Simulate full database

Simulating a full database is a beautiful stress test, but how the heck can this be done quickly with SQL Server? A full database is likely never encountered during development and probably completely forgotten during testing (it was in our …

data_analysis

Newborn app using Twitter and R (data analysis)

In two previous posts (intro, data entry), I described a simple, quick and easy solution for recording newborn feedings, dirty diapers and just about anything of interest. Here’s the second part of the implementation: the data analysis.With all the data …

twitter

Newborn app using Twitter and R (entering the data)

In a previous post, I described a simple, quick and easy solution for recording newborn feedings, dirty diapers and just about anything of interest. Here’s the first part of the implementation: entering the data.The basic idea is everyone logs data …

baby_hand

Newborn app using Twitter and R

So you want to track your baby’s feedings and dirty diapers?Most digital solutions for newborn feeding trackers are either pricey or don’t make it easy for more than one person recording the feedings. I put together a convenient solution with …