Data-Science
56 posts
Revisiting Python support in Apache Flink
An early-2018 look at Apache Flink's Python support, covering compatibility, batch vs. streaming capabilities, and future developments such as Streaming API support.
Tendencies of Data Engineers and Scientists
Explore the relationship dynamics and challenges between data engineering and data science teams, including their approaches, collaboration, and best practices.
I Made a Model, Now What?
Practical insights from a PyData Atlanta talk on successfully deploying and maintaining machine learning models in production environments.
On taking things too seriously: holiday edition
Building a college football (CFB) bowl game prediction system with the Python packages elote, keeks, and keeks-elote. Combines rating systems, betting strategies, and backtesting for analysis.
Elote: a python package of rating systems
Introducing Elote, a Python package implementing various rating systems like Elo and Glicko. Learn its core concepts and see how to use it for ranking.
Ripyr: sampled metrics on datasets using python's asyncio
An introduction to ripyr, a Python library for streaming through large datasets and parsing basic metrics using asyncio and type hinting.
Category Encoders v1.2.5 Release
Category Encoders v1.2.5 brings community updates including stable binary/BaseN encoding, new leave-one-out encoding, and pandas compatibility fixes.
Data Science Things Roundup #11
A collection of interesting data science articles and projects, including SEC keynotes, Bayesian inference, and visualization tools.
Category Encoders v1.2.4 Release
Category Encoders v1.2.4 is out! Includes pandas categorical type support, improved missing value handling, better error messages, BaseN fixes, and docs.
Data Science Things Roundup #10
A curated collection of data science articles and tools exploring network analysis, StashPy for log processing, and Bayesian survival analysis techniques.