This document discusses the motivation for creating reproducible science through documenting data provenance using an R implementation. It describes challenges such as standard R tools not collecting provenance and specialized tools having a steep learning curve. It then presents an approach where R scripts are instrumented to collect provenance information as data and process dependencies are executed, generating a directed acyclic graph database of the provenance that can be explored and visualized.