Description: Over the past ten years I have gained a lot of experience with applying various data science techniques to many different data sources using the R programming language. I started consulting five years ago and have worked on dozens of projects for clients, ranging from small R scripts to full data pipelines.
Software Engineer
Department of Oncology, Old Road Campus Research Building, Roosevelt Drive, Headington, Oxford. The Department of Oncology is seeking a Software Engineer to work on development projects in the Applied Computational Genomics Group. The Medical Sciences Division is an internationally recognised centre of excellence for biomedical and clinical research and teaching. The Department of Oncology, within […]
Calling Python from R with rPython
Python has generated a good bit of buzz over the past year as an alternative to R. Personal biases aside, an expert makes the best use of the available tools, and sometimes Python is better suited to a task. As a case in point, I recently wanted to pull data via the Reddit API. There […]
Creating your personal, portable R code library with GitHub
As I discussed in a previous post, I have a few helper functions I’ve created that I commonly use in my work. Until recently, I manually included these functions at the start of my R scripts by either the tried-and-true copy-and-paste method, or by extracting them from a local file with the source() […]
Global Indicator Analyses with R
I was recently asked by a client to create a large number of “proof of concept” visualizations that illustrated the power of R for compiling and analyzing disparate datasets. The client was specifically interested in automated analyses of global data. A little research led me to the WDI package.
Statistical Programmer
The position you are applying for is connected with a large bank initiative called Enterprise Target State Fraud Detection and Monitoring. Treasury Services is the largest wholesale bank in the world, and as a result you have the rare opportunity to analyze the largest and most important transactional data set in existence anywhere. We use […]
R for Dummies Twitter Contest
ProgrammingR recently featured a review of De Vries and Meys’ R for Dummies. We were very impressed with the book and felt it would be a good addition to most R programmers’ libraries. To that end, we’re giving away a fresh new copy to a random Twitter user who retweets our newest article covering SPARQL, the […]
SPARQL with R in less than 5 minutes
In this article we’ll get up and running on the Semantic Web in less than 5 minutes using SPARQL with R. We’ll begin with a brief introduction to the Semantic Web, then cover some simple steps for downloading and analyzing government data via a SPARQL query with the SPARQL R package.
Bioinformatician at OpenAnalytics
OpenAnalytics is an international company providing data analysis services and products across geographies and industrial sectors. We are currently looking to expand our bioinformatics team with a candidate who fits the following profile: Bioinformatician: MSc / PhD in Computational Biology, Computer Science or Bioinformatics. A detailed description can be found at http://www.openanalytics.eu/jobs. Strong R and […]
R for Dummies – De Vries and Meys (2012)
The For Dummies series has been around since 1991. (A bit of trivia: DOS for Dummies was the first title.) I’ve owned a few books in the series and have been adequately impressed with most of them, but when I learned there was an R for Dummies I was immediately skeptical. Possibly I was skeptical […]