Exploring Ideas: A Blog on Technology, Startups, Food, and More
Welcome to my blog, where I share thoughts and insights on technology, startups, and life in Atlanta. Browse the articles below or explore by topic.
Analyzing GitPython and Pandas With GitPandas
November 19, 2015
A couple of weeks ago I posted about a new open source Python library I started called git-pandas. The GitHub page for it is https://github.com/wdm0006/git-pandas. The basic idea is to provide an interface to a git repository, or a collection of git repositories, via pandas DataFrames. With this, we can do some interesting analysis. In this example we will analyze the two projects that make git-p...
Create a pip-installable python package in 2 minutes
November 12, 2015
Between cookiecutter, cookiecutter-pipproject, github.com and some reasonably fast typing, you can now get version 0 of a project up to PyPI and pip-installable in pretty much no time at all (not that you should actually do that, but you know what I mean). The down and dirty:

    pip install cookiecutter
    cookiecutter https://github.com/wdm0006/cookiecutter-pipproject.git
    cd projectname
    git init
    git ad...
Blame the world with git-pandas
November 10, 2015
I’ve just pushed the first release of git-pandas to PyPI, so you can now:

    pip install git-pandas

Then in a few lines:

    from gitpandas.project import ProjectDirectory
    projd = ProjectDirectory(working_dir='foo/bar/')
    blame = projd.blame(extensions=['py'], ignore_dir=['lib'])
    print(blame)

Get the aggregated git blame of every project within a directory and its subdirectories, in the form of a pandas D...
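To make the idea of an "aggregated blame" concrete, here is a minimal sketch that counts blamed lines per author from the text that `git blame --line-porcelain` emits. This is not how git-pandas implements it; the function name `count_lines_by_author` and the sample data are hypothetical and exist only for illustration.

```python
from collections import Counter

# Hypothetical sample shaped like `git blame --line-porcelain` output:
# each blamed line has a header (sha, author, ...) followed by the
# source line itself, which git prefixes with a tab character.
SAMPLE = "\n".join([
    "1111111111111111111111111111111111111111 1 1 1",
    "author Alice",
    "author-mail <alice@example.com>",
    "\timport os",
    "2222222222222222222222222222222222222222 2 2 1",
    "author Bob",
    "author-mail <bob@example.com>",
    "\tauthor = 'not a header line'",
    "1111111111111111111111111111111111111111 3 3 1",
    "author Alice",
    "author-mail <alice@example.com>",
    "\tprint(author)",
])


def count_lines_by_author(porcelain: str) -> Counter:
    """Count blamed lines per author name in line-porcelain text."""
    counts = Counter()
    prefix = "author "
    for line in porcelain.splitlines():
        # Source lines start with a tab and "author-mail" has no space
        # after "author", so only the author-name header matches here.
        if line.startswith(prefix):
            counts[line[len(prefix):]] += 1
    return counts


if __name__ == "__main__":
    for author, n_lines in count_lines_by_author(SAMPLE).most_common():
        print(author, n_lines)
```

On a real repository, the same function could be fed the output of `git blame --line-porcelain <file>` for each file of interest, with the resulting counters summed across files and projects.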
Data Science vs. Data Engineering
October 31, 2015
Data Science is a relatively young term for a relatively old field. In general, it tends to mean applied statistics plus some other skill set: stats + computer science, stats + software engineering, stats + data visualization, etc. There’s ongoing debate about the term itself, with some arguing that data science is more of an evolution of statistics than a separate field. With the growth of large data p...
TAG YP Technologist of the Year: The Results
September 23, 2015
I’m thrilled to announce that I was selected as the winner of the Technology Association of Georgia’s Young Professional Technologist of the Year award. It’s an incredible honor to receive this recognition from such a prestigious organization. The awards ceremony was a wonderful celebration of Atlanta’s vibrant tech community. The event featured great networking opportunities, excellent hospitalit...
TAG Young Professional Technologist of the Year
September 19, 2015
I’m honored and humbled to share that I’ve been named the Technology Association of Georgia’s Young Professional Technologist of the Year. This recognition from TAG, one of the largest state technology associations in North America, is a testament to the incredible work we’re doing at Predikto and the vibrant tech community in Atlanta.

About TAG Young Professionals

The Technology Association of Ge...
Data Science Things Roundup #3
September 10, 2015
Time again for the 3rd edition of the data science things roundup, where I share a few data science things I’ve come across recently. Check out previous editions here and here.

Self Organizing Maps with TensorFlow

Google’s open sourcing of TensorFlow late last year caused a pretty big splash in the machine learning and data science communities, and since then a ton of tutorials, examples and proje...
Data Science Things Roundup #2
May 20, 2015
This is the second edition of the now-regular series of posts: Data Science Things Roundup, where I round up data science things (as you’ve probably guessed). Last week we had a scikit-learn extension, a GUI framework for Python CLIs and some writing about how Kaggle winners won their competitions. This week is a bit more data-science-y, so dig in.

Lifelines

If you haven’t already checked out lifel...
Data Science Things Roundup #1
February 15, 2015
This is the first in a new series of posts. There are a few of these around the internet that I like, notably ds_ldn’s Data Machina, RJMetrics’ Data Science Roundup, and Jeremy Singer-Vine’s Data is Plural. Mine will probably be way less consistent, so if you like this, then for sure subscribe to those as well. Anyway, here are a few things I’ve stumbled across related to data science or Python recently...
Solving Inherent Stickiness in SaaS: The Power of Convexity
November 2, 2014
In the startup community, especially among SaaS companies, there’s frequent discussion about making products “sticky”. While Health IT and on-premises solutions are often cited as naturally sticky, we need to examine what stickiness really means and how to achieve it sustainably.

Understanding Stickiness

Traditionally, a product’s stickiness is measured by its barrier to exit. Health IT is conside...