Exploring Ideas: A Blog on Technology, Startups, Food, and More
Welcome to my blog where I share thoughts and insights on technology, startups, and life in Atlanta. Browse through the articles below or explore by topic.
Visualize all of your git repositories with gitnoc and git-pandas
December 13, 2015
In a few past posts, I’ve shown you some of the functionality of one of my projects: git-pandas. You can do aggregate analysis of all of those, or you can even do a cumulative blame across them all. But with a little bit of extra code, you can start to see where aggregated, higher level analysis of git repositories can be useful for a team. GitNOC is a flask/d3/redis based app that, with git-panda...
CyberLaunch: An Accelerator for Machine Learning Companies
December 8, 2015
Atlanta has a new accelerator focused on machine learning and information security companies, called CyberLaunch. It is a rebranding of a previous accelerator called SparkLabs, and is run by the same team. The accelerator is focused on companies that are building products in the machine learning and information security spaces, and is looking to invest in companies that are building products that ...
Data Science Things Roundup #4
December 5, 2015
Time for another edition of the data science things roundup, where I round up some data science things for ya’ll. Todays collections are uncharacteristically R heavy. It’s usually pretty python and machine learning heavy, so if you find something you like here, be sure to check out previous editions as well. Without further adieu: Scikit-Learn Groups Scikit-learn groups (skl-groups) is a python li...
Beyond One-Hot: An Exploration of Categorical Variables
November 29, 2015
In machine learning, data are king. The algorithms and models used to make predictions with the data are important, and very interesting, but ML is still subject to the idea of garbage-in-garbage-out. With that in mind, let’s look at a little subset of those input data: categorical variables. Categorical variables (wiki) are those that represent a fixed number of possible values, rather than a con...
Analyzing GitPython and Pandas With GitPandas
November 19, 2015
A couple of weeks ago I posted about a new open source python library I started called git-pandas. The github page for it is here: https://github.com/wdm0006/git-pandas The basic idea is to provide an interface to a git repository or collection of git repositories via pandas DataFrames. With this, we can do some interesting analysis. In this example we will analyze the two projects that make git-p...
Create a pip-installable python package in 2 minutes
November 12, 2015
Between cookiecutter, cookiecutter-pipproject, github.com and some reasonably fast typing, you can now get version 0 of a project up to pypi and pip installable in pretty much no time at all (not that you should actually do that, but you know what I mean). The down and dirty: pip install cookiecutter cookiecutter https://github.com/wdm0006/cookiecutter-pipproject.git cd projectname git init git ad...
Blame the world with git-pandas
November 10, 2015
I’ve just pushed the first release of git-pandas to pypi, so you can now: pip install git-pandas Then in a few lines: from gitpandas.project import ProjectDirectory projd = ProjectDirectory(working_dir='foo/bar/') blame = projd.blame(extensions=['py'], ignore_dir=['lib']) print(blame) Get the aggregated git blame of every project within a directory and its subdirectories, in the form of a pandas D...
Data Science vs. Data Engineering
October 31, 2015
Data Science is a relatively young term for a relatively old field. In general, it tends to be applied statistics plus some other skill-base - stats+computer science, stats+software engineering, stats+data visualization, etc. There’s ongoing debate about the term itself, with some arguing that data science is more of an evolution of statistics than a separate field. With the growth of large data p...
TAG YP Technologist of the Year: The Results
September 23, 2015
I’m thrilled to announce that I was selected as the winner of the Technology Association of Georgia’s Young Professional Technologist of the Year award. It’s an incredible honor to receive this recognition from such a prestigious organization. The awards ceremony was a wonderful celebration of Atlanta’s vibrant tech community. The event featured great networking opportunities, excellent hospitalit...
TAG Young Professional Technologist of the Year
September 19, 2015
I’m honored and humbled to share that I’ve been named the Technology Association of Georgia’s Young Professional Technologist of the Year. This recognition from TAG, one of the largest state technology associations in North America, is a testament to the incredible work we’re doing at Predikto and the vibrant tech community in Atlanta. About TAG Young Professionals The Technology Association of Ge...
Subscribe to the Newsletter
Get the latest posts and insights delivered straight to your inbox.