
My data science cohort made its first foray into the world of Kaggle the other week. Kaggle is an online data science community with members all over the world. Not to be confused with Kegel, a male doctor in the 1940’s who invented vagina flexing (finally). Kaggle boasts tons of resources for discussing and learning about data science, but the real coup d’état is that it hosts machine learning and predictive analytics competitions. …
I recently had the opportunity to view a presentation by Dr. Stephanie Eckman on data quality. Eckman is a PhD in Survey Statistics & Methodology, and she addressed an audience of Data Science students with messages about data integrity, experimental design, and how ‘objective’ algorithms can be rife with biases. A main role of data scientists is to produce models and intelligible visualizations of massive datasets for the less data-savvy audience. These are valuable skills, and much of a data science curriculum focuses on how to master the tools to create the most informative and visually compelling products. Eckman, however…
