The analyzed text presents a methodological critique of a prominent 2015 study by economists Giovanni Peri, Kevin Shih, and Chad Sparber. By utilizing R and a Shiny dashboard to replicate the original Stata-based findings, the author challenges the study's core conclusions by altering specific econometric specifications. Below is an evaluation of the primary arguments raised in this critique.
The author successfully replicates the study's baseline 2-Stage Least Squares (2SLS) regression slopes and standard errors within a remarkably tight margin (mostly under 1%), utilizing standard R packages such as ivreg, sandwich, and lmtest.
The core of the author's critique rests on a structural data discrepancy: the original study mixes a 10-year growth period (1990–2000) with two 5-year growth periods (2000–2005 and 2005–2010) without annualizing or standardizing the rates.
Methodological Validity: High. In linear regression models tracking growth, failing to normalize time blocks means the 10-year period artificially inflates the magnitude of change for both the independent and dependent variables. The author demonstrates this effectively via two methods:
The analysis criticizes the study's inclusion of i.metarea (metropolitan area fixed effects), pointing out that adding 218 dummy variables to a dataset of only 657 observations yields roughly 3 observations per parameter. This violates Harrell's general rule of thumb, which requires a minimum of 10 observations per parameter to avoid overfitting.
The author correctly points out that when standardizing growth rates across intervals, using the natural logarithm of growth rates is mathematically necessary to maintain linearity across different base periods. As demonstrated in the author's final tables, applying log transforms ensures that the regression slopes stay perfectly constant regardless of whether the data is scaled to a 1-year or 5-year baseline, changing only the intercept. This adjustment grounds the critique in sound financial and economic mathematics.
The critique is analytically robust and highly persuasive. By combining open-source replication, simulated falsification, and parameter stress-testing, the author successfully demonstrates that the foundational claims of the paper—namely, that foreign STEM workers drastically boost native wages without harming employment—rely on fragile econometric specifications, over-parameterization, and a fundamental failure to normalize varying time intervals.