Data-Analysis
13 posts
Metrics That Actually Matter (And the Ones That Don't)
Why most startup dashboards are full of vanity metrics that feel important but don't drive better decisions, and what to track instead.
Data-Driven Decisions vs. Data-Justified Decisions
The difference between using data to discover truth and using data to support conclusions you've already reached, and why most companies get this backwards.
Juggling Projects? Analyze Multiple Repos at Once with GitPandas
Managing multiple repositories is easier with the right tools. This post shares tips and best practices for handling multi-repo projects efficiently.
Digging into Code Churn with GitPandas
Quantify code churn in your Git repositories with the gitpandas Python library. Analyze file change rates and spot areas of high activity or instability.
Who Holds the Keys? Calculating Bus Factor with GitPandas
Explore the concept of bus factor and how gitpandas can help you quantify knowledge distribution and risk in your software project.
git-pandas Caching: Faster Analysis
Boost git repository analysis speed! Learn how git-pandas now uses caching to dramatically improve performance for repeated queries on large codebases.
Using survival analysis and git-pandas to estimate code quality
Apply survival analysis with git-pandas to measure code quality in Git repositories by analyzing code longevity and contributor patterns over time.
Create organization-wide punchcards with git-pandas
Learn how git-pandas enables creating organization-wide punchcard visualizations, aggregating commit activity across multiple repositories for a unified view.
Gitpandas v0.0.6: python 2.7, fileowners, file-wise blame and examples
Overview of git-pandas v0.0.6 release, highlighting new features like Python 2.7 support, file-wise blame, file owner determination, and other improvements.
Git-Pandas v0.0.5: coverage.py, risk, and more
Git-pandas v0.0.5 is out! Adds coverage.py support, file change rate metrics for risk analysis, API updates, time-based filtering for commits.