I. Introduction
In the course of scientific exploration, simulation, or analysis, various files are created (e.g., input data files, parameter files, intermediate files, output files). Over time, these files accumulate to the point that it becomes difficult for researchers to determine which files are related to each other. In addition, multiple versions of the same files are created due to adjustments in parameters or processing steps. Thus, it becomes difficult for researchers to remember which files contain which changes. On top of this, many of the current provenance techniques assist with recording provenance for current or future analyses, but reconstructing provenance on past analyses without execution logs is difficult.