Machine-Learning
31 posts
From Weekend Hack to Core Tool: The category_encoders Journey
Explore category_encoders' journey from a weekend Python experiment to a widely used data science library, now part of scikit-learn-contrib.
Investment Review: Seer.ai
A review of my angel investment in Seer.ai, exploring how they align with my investment thesis and their unique value proposition in AI-powered analytics.
I Made a Model, Now What?
Practical insights from a PyData Atlanta talk on successfully deploying and maintaining machine learning models in production environments.
Elote: a python package of rating systems
Introducing Elote, a Python package implementing various rating systems like Elo and Glicko. Learn its core concepts and see how to use it for ranking.
Category Encoders v1.2.5 Release
Category Encoders v1.2.5 brings community updates including stable binary/BaseN encoding, new leave-one-out encoding, and pandas compatibility fixes.
Category Encoders v1.2.4 Release
Category Encoders v1.2.4 is out! Includes pandas categorical type support, improved missing value handling, better error messages, BaseN fixes, and docs.
Data Science Things Roundup #10
A curated collection of data science articles and tools exploring network analysis, StashPy for log processing, and Bayesian survival analysis techniques.
Data Science Things Roundup #9
Data Science Things Roundup #9: Highlighting Pedro Domingos'' ML paper, Spyre (Shiny for Python), and BetaGo, an AlphaGo-inspired Go bot framework.
Data Science Things Roundup #8
Data Science Things Roundup #8: Dive into LIME for model interpretation, sklearn-expertsys for interpretable classifiers, and the value of nearest neighbors.
BaseN Encoding Grid Search in Category Encoders
Explore category_encoders' BaseN encoder for representing categorical data. Learn how to use scikit-learn's grid search to find the optimal encoding base.