1. Open Food Facts Parquet

    For a recent project, I was browsing some food data from Open Food Facts. I haven't had a chance to really dig into it yet, but it seems like an interesting data source and a decent dataset to help me get more comfortable playing around with spark.


  2. Data Pipeline configuration oddities

    I've been working for a little while on a gem to configure and deploy AWS Data Pipelines. At my day job, we use Data Pipeline to schedule various types of repeating jobs. If you're interested, you can see more details on the company blog. To summarize, we wanted a library to configure Data Pipelines as Ruby objects so we could easily compose, reuse, version control, etc. …


  3. Extracting ADB backups

    Sometimes when developing, reversing, or otherwise working with Android apps, it can be useful to see what's in your (or your target's) data directory. Typically, you either have root or the app's signing keys so it's trivial to see what's going on. However, there are couple situations where you might not be rooted or can't directly read the app's data directory. For example, you might be working on a friend's device or debugging a library that is included in an app for which you don't have the signing keys. …


  4. It's a Jekyll!

    "Jekyll, Jekyll, Jekyll, it's a Jekyll, it looks like a jekyll!" …