This tutorial uses Spectral Coclustering to segment groups of data science practitioners, based on data from the Kaggle ML and DS survey. Coclustering groups rows and columns simultaneously, which helps us focus on the features most pertinent to a given segment. We apply Spectral Coclustering to the Kaggle survey dataset and delve into some of the practical considerations. The code for the tutorial is available on GitHub.
Sep 19, 2020