Journal
Posts and long-form series on AI, startups, venture capital, and more.
All Posts
Data Science Things Roundup #7
Data Science Things Roundup #7: Python-focused edition featuring Intel's Python Distribution, Go-Python for extensions, and PyFilesystem for unified access.
Category Encoders now on conda-forge
The category_encoders Python package is now available on conda-forge, making installation easier for Conda users. Learn about the package and feedstock.
Data Science Things Roundup #6
Data Science Things Roundup #6: Focuses on calendar visualizations with D3.js Calendar Heatmap and Bostock's Calendar View, plus insights on coding interviews.
Introducing unified glob-syntax in git-pandas
Explore the unified glob syntax (`include_globs`, `ignore_globs`) in git-pandas v2.0, which offers flexible file pattern specification and improved usability.
Parallelizing cumulative blame in git-pandas with joblib
Boost git-pandas cumulative blame analysis performance with joblib. Parallel processing via multithreading speeds up this costly operation.
Growing Pedal Wrencher
How I worked to grow Pedal Wrencher after the launch hype faded, through blogging, user education, profile and registration redesigns, and social media.
Exit Interviews in Startups
Exit interviews are crucial for startups. Discover methods to gather honest feedback, identify systemic issues, and enhance retention from departing employees.
When do I work on what?
Use git-pandas to analyze and visualize work patterns across open source vs. closed source projects. Compare commit times with punchcard plots, with code included.
Building an Engineering Team Around Ownership
Clear ownership, not just cross-functional talent, defines high-performing engineering teams. Learn how role boundaries prevent micromanagement and build respect.
Estimating the time spent on a project with git-pandas
Learn how to estimate project development time from commit history with git-pandas, and how the approach compares to git_time_extractor, git-hours, and glass.