Journal
Posts and long-form series on AI, startups, venture capital, and more.
All Posts
Data Science Things Roundup #6
Data Science Things Roundup #6: Focuses on calendar visualizations with D3.js Calendar Heatmap and Bostock's Calendar View, plus insights on coding interviews.
Introducing unified glob-syntax in git-pandas
Explore the unified glob syntax (`include_globs`, `ignore_globs`) for git-pandas v2.0, offering flexible file pattern specification and usability.
Parallelizing cumulative blame in git-pandas with joblib
Boost git-pandas cumulative blame analysis performance with joblib. Parallel processing via multithreading speeds up this costly operation.
Exit Interviews in Startups
Exit interviews are crucial for startups. Discover methods to gather honest feedback, identify systemic issues, and enhance retention from departing employees.
When do I work on what?
Use git-pandas to analyze and visualize work patterns across open source vs. closed source projects. Compare commit times with punchcard plots. Learn the code.
Building an Engineering Team Around Ownership
Clear ownership, not just cross-functional talent, defines high-performing engineering teams. Learn how role boundaries prevent micromanagement & build respect.
Estimating the time spent on a project with git-pandas
Learn how to estimate project development time using commit history with git-pandas. Compares to git_time_extractor, git-hours, and glass.
Data Science Things Roundup #5
Explore Deep Q-Learning for Space Invaders, insights from Elasticsearch in production, and improved Python package management strategies.
Automating documentation workflow with sphinx and github pages
Explore a comprehensive guide on automating the deployment of Sphinx documentation to GitHub Pages, streamlining your workflow with efficient practices.
Pypi-publisher: a simple cli for publishing python libraries
Introducing pypi-publisher (ppp): a CLI tool simplifying Python library publishing. Handles .pypirc updates, linting, git tags, and PyPI sdist uploads.