Some fun with Kaggle

Kaggle Credit Scoring data science competition

What is Kaggle?

Kaggle is the most popular platform for hosting data science and machine learning competitions. A whole community of kagglers grew around the platform, ranging from those just starting out all the way to Geoffrey Hinton.

In 2017, Kaggle was acquired by Google and integrated with Google Cloud Platform. Now, both the competition data can be hosted in the cloud, and the compute can happen there as well. Kagglers have a possibility to run their competition code on GCP by creating the so-called Kaggle kernels (interactive Python or R notebooks), which make it possible to share code and submit competition entries.

Kaggle already hosted other competitions organized by the financial sector companies: forecasting stock movements based on newspredicting value of a transaction for a customer or predicting real estate value fluctuations. Financial data is most often tabular in nature. Recently, image classification and segmentation competitions were outnumbering the ones based on tabular and time-series data: pneumonia detection on X-Rayworking with satellite imageryseismic images, or just ordinary photographs.

The competition organized by Home Credit ended up being incredibly popular and by the number of competition entries it was the biggest competition ever on kaggle (by the number of participants, it’s second only to the playground challenge on predicting the survival on Titanic, which is a very romantic, but completely moot problem).

In fact, the Home Credit’s competition might right now hold the status of the biggest data science competition ever, if we consider the fact that Kaggle’s counterparts are not nearly as popular. CrowdAI focuses more on academia (e.g., for NIPS benchmarks) and governmental institutions, and are focusing on companies, and all of them combined do not attract nearly as many participants as Kaggle does.

Published by Indrek Ulst

I go up to eleven. Starting his career as a freelance web developer back in 2000 at age 15, Indrek is phenomenally good at it. There is hardly anybody who can implement a web user interface or Android app faster than him.

Leave a comment