Open Source Projects

For many years I’ve been building and maintaining open source software that helps developers work more efficiently and effectively. My projects span a range of domains including machine learning tools, data analysis libraries, development tooling, and geospatial utilities. With a focus on Python development, I’ve contributed to the scientific computing ecosystem through projects like Category Encoders (now part of scikit-learn-contrib) and created developer tools like Git Pandas for repository analysis.

Here are some of the open source projects I maintain:

Keeks

A Python library for key-value store benchmarking and performance testing.

Documentation | GitHub

Related Posts

Cookiecutter PIP Project

A modern, production-ready cookiecutter template for Python packages that follows best practices. This template sets up a complete development environment with all the tools needed to build, test, and publish professional Python libraries.

GitHub

Git Pandas

A Python library that wraps GitPython to produce pandas DataFrames for Git repository analysis. It enables data-driven analysis of Git repositories, covering commit history, blame information, and project-level insights.

Documentation | GitHub

Related Posts

Elote

A Python library implementing various rating and ranking algorithms.
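To give a sense of what a rating algorithm does, here is a minimal sketch of the classic Elo update, one well-known example of this family. It is a standalone illustration, not Elote's API: the function names `expected_score` and `elo_update` and the default `k_factor=32` are assumptions chosen for the example.

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Probability that A beats B under the standard Elo logistic curve."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def elo_update(rating_a: float, rating_b: float, score_a: float,
               k_factor: float = 32.0) -> tuple[float, float]:
    """Return updated (rating_a, rating_b) after one match.

    score_a is 1.0 if A won, 0.0 if A lost, and 0.5 for a draw.
    k_factor (hypothetical default) controls how far ratings move per match.
    """
    exp_a = expected_score(rating_a, rating_b)
    exp_b = 1.0 - exp_a
    new_a = rating_a + k_factor * (score_a - exp_a)
    new_b = rating_b + k_factor * ((1.0 - score_a) - exp_b)
    return new_a, new_b


if __name__ == "__main__":
    # Two evenly rated competitors; A wins.
    print(elo_update(1500.0, 1500.0, score_a=1.0))
```

Because the two expected scores sum to 1, the points one competitor gains are exactly the points the other loses, so the total rating in the pool stays constant.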

Documentation | GitHub

Related Posts

Stargazers

A modern CLI tool to fetch, analyze, and summarize the stargazers or forkers of any public GitHub repository.

GitHub

Related Posts

PyGeoHash

A Python library for working with geohashes, providing encoding, decoding, and distance calculations.
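For readers unfamiliar with geohashes, the encoding step can be sketched in a few lines of plain Python. This is a from-scratch illustration of the standard geohash algorithm, not PyGeoHash's implementation or API; the function name `geohash_encode` and its parameters are assumptions made for the example.

```python
# Standard geohash base32 alphabet (omits a, i, l, o).
BASE32 = "0123456789bcdefghjkmnpqrstuvwxyz"


def geohash_encode(lat: float, lon: float, precision: int = 12) -> str:
    """Encode a latitude/longitude pair as a geohash string.

    The algorithm repeatedly bisects the longitude and latitude ranges,
    alternating between them (longitude first), and emits one bit per
    bisection. Every 5 bits become one base32 character.
    """
    lat_range = [-90.0, 90.0]
    lon_range = [-180.0, 180.0]
    chars = []
    bits = 0
    bit_count = 0
    use_lon = True  # even-numbered bits refine longitude

    while len(chars) < precision:
        rng, value = (lon_range, lon) if use_lon else (lat_range, lat)
        mid = (rng[0] + rng[1]) / 2.0
        if value >= mid:
            bits = (bits << 1) | 1
            rng[0] = mid  # keep the upper half
        else:
            bits = bits << 1
            rng[1] = mid  # keep the lower half
        use_lon = not use_lon
        bit_count += 1
        if bit_count == 5:
            chars.append(BASE32[bits])
            bits = 0
            bit_count = 0

    return "".join(chars)


if __name__ == "__main__":
    print(geohash_encode(42.6, -5.6, precision=5))
```

A useful property of this scheme is that a shorter geohash is a prefix of a longer one for the same point, so truncating a hash yields a larger enclosing cell.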

Documentation | GitHub

Related Posts

Category Encoders

A scikit-learn-contrib library providing encoders for categorical variables for use in machine learning workflows. I authored this project but am no longer the day-to-day maintainer.
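As a rough idea of what encoding a categorical variable means, the sketch below implements one-hot encoding, one of the simplest schemes, in plain Python. It is illustrative only and does not use or mirror the Category Encoders API; the function name `one_hot_encode` is hypothetical.

```python
def one_hot_encode(values: list[str]) -> tuple[list[str], list[list[int]]]:
    """One-hot encode a list of category labels.

    Returns the sorted list of distinct categories (the column order)
    and one row per input value, with a 1 in the matching column.
    """
    categories = sorted(set(values))
    column = {category: i for i, category in enumerate(categories)}
    rows = []
    for value in values:
        row = [0] * len(categories)
        row[column[value]] = 1
        rows.append(row)
    return categories, rows


if __name__ == "__main__":
    cats, encoded = one_hot_encode(["red", "green", "red", "blue"])
    print(cats)
    for row in encoded:
        print(row)
```

Each category becomes its own numeric column, which lets models that expect numeric input consume categorical data.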

Documentation | GitHub

Related Posts