One of the objectives of the WFIRST Coronagraph Instrument is to demonstrate post-processing algorithms for space-based exoplanet imaging data in the regime of very high contrast ratios (< 1E-8). We analyzed two 15-hour time series of laboratory images acquired with prototype coronagraph designs on a vacuum chamber testbed at JPL. These data sets enabled us to test the performance of reference differential imaging (RDI) PSF subtraction as a function of time and Euclidean (L2) speckle pattern distance. Starting from an analytical relationship between speckle pattern stability and classical subtraction gain, we compared the classical subtraction result to those of the KLIP RDI algorithm. In both data sets, KLIP RDI contrast gains are typically 3-4× greater than those of classical subtraction. These results suggest that a diverse reference PSF library will enable a significant gain in sensitivity beyond that suggested by the raw, frame-to-frame speckle pattern stability.