1. Introduction
The rise and partial ebbing of U.S. inflation since the COVID Recession (Figure 1) has spawned studies assessing the roles played by supply chain disruptions and excess demand growth. Some analysts argue that supply chain disruptions can largely account for the recent swings in inflation. However, most analysts argue that the combination of supply constraints and a strengthening of aggregate demand from stimulatory monetary and fiscal policy actions had created excessive upward pressures on inflation (see Blanchard et al. Reference Blanchard, Domash and Summers2022; Bordo and Levy, Reference Bordo and Levy2022; Bolhuis et al. Reference Bolhuis and Summers2022; DeSoyres et al. Reference DeSoyres, Santacreu and Young2022; Farie-e-Castro, Reference Faria-e-Castro2025). Other studies decompose these influences using VARs, such as Gordon and Clark (Reference Gordon and Clark.2023), Hall et al. (Reference Hall, Tavlas and Wang2023), and Liu and Nguyen (Reference Liu and Nguyen2023) or use model-based estimates (Bernanke and Blanchard Reference Bernanke and Blanchard2025; Koch and Noureldin, Reference Koch and Nourelding2023) or model-based calibrations (Di Giovanni et al. Reference Di Giovanni, Kalemli-Ozcan, Silva and Yildirim2022). Beyond food and energy cost shocks, the latter studies consider shocks emanating from either supply chain disruptions or COVID-related shifts in the composition of demand that spawned imbalances between supply and demand.

Figure 1. U.S. inflation and supply chain pressures post-Global Financial Crisis (GFC).
Sources: BEA and FRBNY.
The view that supply shocks alone cannot explain U.S. inflation is consistent with nominal GDP exceeding its pre-COVID trend and core PCE inflation exceeding a 2 percent path since late 2021 (Bordo and Duca, Reference Bordo and Duca2025). And while the inflation surge in 2020-21 coincided with the rise in the Federal Reserve Bank of New York’s (FRBNY) global index of supply pressures (Figure 1), the price level did not fully return to its pre-pandemic path consistent with 2 percent inflation after supply pressures fell back to their pre-pandemic range. Actual nominal GDP growth and inflation imply that aggregate demand contributed to inflation as the pandemic unwound.
Although broad Divisia money growth surged in 2020 (Figure 2) many macro-economists ignored this signal because of perceived instability in the demand for conventionally defined monetary aggregates.Footnote 1 By weakening the link between money (
$M$
) and GDP, such instability undermined the link between money and inflation. Hallman et al.’s (Reference Hallman, Porter and Small1991) original P-Star model of inflation rearranges the quantity theory of money equation,
$MV=PY$
, where
$Y$
is real income and
$P$
is the price level, to infer the long-run price level,
$P^{*}=MV^{*}/Y^{*}$
, where
$Y^{*}$
is long-run or trend real income,
$M$
is a monetary aggregate (originally simple-sum M2), and
$V^{*}$
is the long-run equilibrium velocity of money toward which velocity gradually converges (error-corrects). In this framework, inflation depends not only on its own lags, but also on the price gap, the extent to which the price level deviates from its long-run value. The price gap can be decomposed into the gap between long-run equilibrium velocity and its current level and the cyclical component of real income. That is,
$\ln (P^{*}/P)=\ln (V^{*}/V)+\ln (Y/Y^{*})$
, where the first term is the velocity gap and the latter is the cyclical component of real income.

Figure 2. U.S. broad Divisia money growth surges in the COVID-19 recession in sharp contrast to the great recession.
Sources: Center for Financial Stability and authors’ calculations.
If long-run velocity is unstable or difficult to model, the velocity gap will be less informative, thereby undermining the usefulness of the P-Star approach. Belongia and Ireland (Reference Belongia and Ireland2015, Reference Belongia and Ireland2017) estimate P-Star models for inflation and nominal income using a one-sided (i.e., real-time) HP filter to model trend velocity.Footnote 2 An innovative aspect of their work is that they use Divisia monetary aggregates rather than conventional money measures to define velocity. Building on Bordo and Duca’s (Reference Bordo and Duca2025) analysis of GDP measures of velocity, we employ a small-scale error-correction model to estimate trend consumption velocity and incorporate the resulting velocity gap into a P-Star model of PCE inflation.Footnote 3 Our approach focusses on the percentage fees (loads) for buying or selling mutual funds, declines in which in turn led to a permanent decline in the demand for money in the late 1980s and 1990s–especially small time deposits (Duca, Reference Duca2000; Bordo and Duca, Reference Bordo and Duca2025). This caused an upward level shift in velocity that led the P-Star model to underpredict inflation, discrediting simple-sum M2 versions of this framework (see Becsi and Duca, Reference Becsi and Duca1994; Ireland, Reference Ireland2022, Reference Ireland2023). Simple sum monetary aggregates assume perfect substitutability between their components implying that they may be distorted by optimal portfolio reallocation in response to interest rate changes and, moreover, M2 may be too narrow to capture some of the substitutions involved (see Barnett, Reference Barnett1982; Barnett et al. Reference Barnett, Liu, Mattson and van den Noort2013).
We find that the consumption velocity of Divisia M3 has a stable long-run relationship with the mutual fund cost variable we employ when we control for the effects of the Commodity Futures Modernization Act (CFMA). We accomplish the latter by estimating a broken constant version of the standard cointegration framework from Johansen et al. (Reference Johansen, Rosconi and Nielson2000). We then use the estimated long-run relationship to calculate the velocity gap out-of-sample, which we incorporate into a P-Star framework from 2013 to 2024. Following Ireland (Reference Ireland2024), we center our analysis on PCE-based measures of inflation. We add a role for supply pressures using an index from the FRBNY, and we add dummies for extreme declines in energy prices and for 2020 to account for the unprecedented scale of the pandemic shock.
As in Bordo and Duca (Reference Bordo and Duca2025), the acute initial phase of the pandemic is associated with a very steep drop in Divisia M3’s velocity reflecting both the scale of the real economic shock and the massive money growth shown in Figure 2. Divisia M3’s level abruptly shifted upwards relative to its pre-pandemic trend and its annual growth rate was over 18% between 2020Q2 and 2021Q1 and was over 9% for the following three quarters. Divisia’s annual growth fell rapidly during 2022 and turned negative in 2023 as the Fed tightened monetary policy. Divisia M3’s velocity correspondingly declined sharply in 2020Q2, rebounded sharply in 2020Q3, and then gradually rose, surpassing its 2020Q1 level in mid-2023. It took a while for inflation to reach its peak and largely ebb as shown in Figure 1. As COVID-19 waned, much pent-up aggregate demand was released that bolstered inflation due to both supply- and demand-side factors in 2021 to early 2023, reflected in the unwinding of earlier pandemic-related declines in velocity. The disinflationary impact of monetary policy tightening took hold starting in early 2023.
We find that the P-Star model is able to explain much of the increase and subsequent ebbing of inflation in terms of the deviation of velocity from its long-run equilibrium level even after controlling for supply pressures. In fact, we find that while the estimated contribution of the index of global supply pressure to the temporary run-up of core inflation is notable, more of it stems from aggregate demand pressures reflected by the price and velocity gap variables from the P-star model.Footnote 4 Finally, we show that our cointegration model of velocity significantly enhances the performance of the P-Star model relative to using a one-sided HP filter to calculate trend velocity.
The paper is organized as follows. Section 2 discusses Divisia monetary aggregates and presents our empirical specification of consumption velocity. Section 3 presents results from estimating velocity models. Section 4 lays out our P-Star model, which models core and overall PCE inflation from 2013 to 2024. Those models track aggregate demand pressures by incorporating detrended real consumption and the gaps between velocity and its estimated long-run equilibrium value, where the latter uses velocity coefficients estimated through 2012 to gauge equilibrium velocity since 2013. While we find that global supply pressures explain some of the large swings in core inflation since 2020, our results imply that swings in aggregate demand pressures driven by money growth played a larger role. Section 5 concludes by discussing our findings in a broader context.
2. The demand for Divisia monetary services
This section begins with a brief comparison of Divisia versus simple-sum measures of money before discussing the determinants of the demand for Divisia monetary services. Based on this discussion, Section 3 specifies estimable models of the velocity of Divisia money.
2.1. Broad Divisia measures of money versus narrow Divisia and simple-sum measures of money
Conventional (simple-sum) monetary aggregates implicitly assume that different monetary components are perfect substitutes by adding them up when aggregating. This is inconsistent with empirical estimates of elasticities of substitution from demand models for the underlying components (e.g., Jones et al. Reference Jones, Dutkowsky, Elger and Fleissig2008a, b; Fleissig and Jones, Reference Fleissig and Jones2015; Jadidzadeh and Serletis, Reference Jadidzadeh and Serletis2019; Xu and Serletis, Reference Xu and Serletis2022). Divisia monetary measures do not make implicit assumptions about substitution elasticities between the monetary assets that they include. However, the oft stated ability of Divisia aggregates to internalize pure substitution effects between its components does require weak separability without which demand for the aggregate can become unstable (see Barnett, Reference Barnett1982). Jones et al. (Reference Jones, Dutkowsky and Elger2005) found that separability tests favored broad over narrow aggregates when the monetary assets were adjusted for retail sweep accounts but not for commercial sweep accounts. Jadidzadeh and Serletis (Reference Jadidzadeh and Serletis2019) tested various separability structures within the Divisia M4 monetary aggregate published by the Center for Financial Stability (CFS), but rejected all of them, leading them to favor the broadest available monetary aggregate. Hjertstrand et al. (Reference Hjertstrand, Swofford and Whitney2016) found that M1 and a broad aggregate similar to M4 were weakly separable from consumption and leisure, but the evidence for M2 and M3 was weaker. We focus on Divisia M3, noting that movements in it are very similar to CFS’s Divisia M4- and Divisia M4.Footnote 5
The demand for Divisia money can be affected by changes in the liquidity of assets outside of it. This limitation suggests that broader Divisia indexes (e.g., M3) may be preferable to Divisia M2, because they are better able to internalize shifts between liquidity provided by insured deposits in M2 vis-à-vis uninsured liabilities issued by banks (e.g., large time deposits), institutional money funds, and repurchase agreements (RPs) outside of M2.Footnote 6 While the velocity of broad Divisia money may be affected by shifts in the liquidity of assets not spanned by them, the factors driving those changes are less numerous than for Divisia M2 and are feasible to track for Divisia M3 (Bordo and Duca, Reference Bordo and Duca2025). Indeed, consumption velocity (nominal PCE divided by money) is more mean reverting for Divisia M3 than for simple sum or Divisia M2 over the past four decades (Figure 3). As shown in Figure 3, Divisia M3 velocity does not exhibit any clear discernable trend.

Figure 3. Since the mid-1980s, the consumption velocity of broader Divisia money (M3) is more stable than that of simple-sum and Divisia M2 in the U.S.
Sources: CFS, Federal Reserve, and authors’ calculations. Shaded areas are NBER recessions.
2.2. Pre-pandemic long-run demand for Divisia monetary services
The long-run demand for money depends on overall transactions, as well as factors altering the relative liquidity of money versus that of other assets. We now provide an overview of the determinants of money demand that we include in our error-correction model of velocity. Our focus on stock loads and CFMA is very similar to Bordo and Duca (Reference Bordo and Duca2025), although we use consumption as the scale variable rather than GDP and consider several opportunity cost variables.
2.2.1. Tracking the transactions demand for Divisia M3
Since Friedman (Reference Friedman and Friedman1956) redefined the quantity theory of money by replacing overall transactions with nominal GDP, most money demand models have tracked the transactions demand for money with nominal GDP. However, for five reasons and in line with Ireland (Reference Ireland2024), we use a version of the P-Star model that relates PCE inflation to the gap between current and long-run consumption velocity rather than GDP velocity. First, we focus on modeling PCE measures of inflation, which the Federal Reserve monitors and targets. Second, consistent with Friedman (Reference Friedman and Friedman1956, Reference Friedman1957) and later empirical models of money demand, the demand for money is more linked to permanent income and consumption than to GDP, and the Life-Cycle/Permanent Income Hypotheses suggest that PCE can proxy for permanent income (Cochrane, Reference Cochrane1994). Third, the bulk of monetary assets within the Divisia aggregates are held by households, whose main spending is tracked by PCE. These reasons suggest that consumption velocity (
$\textit{PCE}/M$
) should be less variable than GDP velocity (
$\textit{GDP}/M$
), which is consistent with a lower standard deviation of consumption versus GDP velocity. Fourth, large swings in net exports are associated with nominal spending and output diverging. If spending is more pertinent to money demand than output (GDP), then consumption may better track the transactions demand for money. Fifth, and likely reflecting the impact of the second through fourth of these considerations when we use PCE instead of GDP, we do not need any non-COVID short-run shock variables in our cointegration models to avoid serial correlation in the residuals.
2.2.2. Factors affecting the liquidity provided by assets not covered by Divisia measures
The literature on Divisia indexes tends to overlook how and why the demand for broad indexes can shift in the long run and vary in the short run owing to factors other than income (consumption) and user costs or interest rates. We show that by addressing these issues Divisia M3 can be useful in tracking inflation. In doing so, our analysis helps address the skepticism about tracking money growth in light of past instability in the demand for simple-sum monetary aggregates such as M2.
Our specifications of the demand for monetary services from liquid assets emphasize two major factors that shift the demand between conventional monetary assets, and assets that traditionally have been viewed as less liquid. The first is the cost for the marginal agent holding money of shifting between monetary assets and stocks. The bulk of M2 and M3 assets are held by middle- and upper-middle income households, for whom stock mutual funds have been the main vehicle to hold a diversified portfolio of stocks (Duca, Reference Duca2000). Lower proportional transfer costs reduce money demand in contrast to the implications of the overly stylized Baumol-Tobin framework (see Brunner and Meltzer, Reference Brunner and Meltzer1967; Duca, Reference Duca2000). Following Anderson et al. (Reference Anderson, Bordo and Duca2017), who extend Duca’s (Reference Duca2000) analysis of simple-sum M2 velocity, we assume the transactions costs of switching between monetary assets and stock mutual funds is proportional to the average load fee for purchasing or selling a stock mutual fund in a one-year horizon (SLoad), based on extending Anderson et al.’s (Reference Anderson, Bordo and Duca2017) series. This variable is discussed at length in Anderson et al. (Reference Anderson, Bordo and Duca2017) and in Bordo and Duca (Reference Bordo and Duca2025). The data we use are identical to those calculated by the latter. As these costs fall, the liquidity of stock funds rises which lowers the demand for Divisia indexes that omit such assets.
The second major factor that affected the demand for Divisia indexes is a financial regulation that induced changes in financial intermediation by nonbank financial intermediaries (“shadow banks”) that issue higher-liquidity short-term debt to fund lower-liquidity (often long maturity) assets. The passage of the Commodity Futures Modernization Act (CFMA) in late 2000 made it feasible for credit default swaps (CDS) to be widely used to reduce the tail risk of assets (e.g., non-government issued MBS), after which CDS issuance became notable (Duca and Ling, Reference Duca and Ling2020).Footnote 7 Such enhancements allowed many nonbanks (e.g., conduits, investment banks, and special investment vehicles) to use derivatives to protect their portfolios from tail risk and helped them obtain investment grade ratings on the short-term debt that they used to fund investments. In turn, the investment grade short-term debt was purchased by banks and institutional money market mutual funds, the latter of which tripled their liabilities between the passage of CFMA and mid-2009. In periods of low financial market stress, CFMA also induced increased repo activity, as formerly ineligible paper became acceptable as collateral, thereby expanding the pool of assets used in repurchase agreements and the monetary services they provide. In seven years, institutional money funds and security broker RPs soared from 13 percent of M3 (OECD) to 25 percent until Lehman failed. By inducing large increases in highly liquid institutional money funds and RPs, CFMA was associated with an upward level shift in the broader Divisa aggregates and a corresponding downward shift in their velocities starting in 2001 when the law took effect. This is evident in the sharp jump in CDS issuance in early 2001 and likely reflects that the law’s passage was highly anticipated in 2000Q4, when the bill passed the House by a large majority and by unanimous consent by the U.S. Senate. We account for CFMA by allowing for a break in the constant in a model of long-run velocity coinciding with the passage of CFMA as detailed below.
2.2.3. Opportunity cost of money
Monetary aggregation theory builds upon the user cost of monetary assets; see Barnett (Reference Barnett1978) and Donovan (Reference Donovan1978). From the user cost formula, the opportunity cost of the monetary services derived from holding a particular asset corresponds to the foregone interest from holding that asset relative to a non-monetary “benchmark” asset. The growth rate of a Divisia index is the average expenditure share weighted sum of the growth rates of its component monetary assets, where the expenditure on each monetary asset depends on its user cost. Thus, monetary asset user costs are a key feature of how Divisia aggregates are computed. In addition, money demand models of Divisia aggregates often feature an opportunity cost variable. Some authors, such as Belongia and Ireland (Reference Belongia and Ireland2016, Reference Belongia and Ireland2018, Reference Belongia and Ireland2019), emphasize the use of aggregation-theoretic dual user cost measures, such as those produced by CFS, to model Divisia money demand.Footnote 8
Others, however, use a standard interest rate as an opportunity cost variable. Barnett et al. (Reference Barnett, Ghosh and Adil2022), for example, estimate cointegration relationships between money, output, interest rates, and real effective exchange rates for several countries including the U.S. They note that short-term interest rates are typically used to model narrow money measures, but long-term interest rates are typically used for broad money, such as Divisia M3 explaining that “[t]he conventional rationale for this choice is that many components of broad money, such as time deposits, are associated with long-term interest rates.” Following this reasoning, we assess both the Divisia M3 dual user cost index (Usercost) and the ten-year Treasury yield (Bond) as additional variables in our models.Footnote 9 As shown in Figure 4, the Usercost series is more variable and mean reverting than the slower moving stock load series, which experiences substantial declines in the 1980s and 1990s.

Figure 4. Stock fund loads and the CFS measure of Divisia M3 user cost.
Sources: CFS and Bordo and Duca (Reference Bordo and Duca2025).
The zero lower bound on short-term interest rates binds in much of the latter part of our sample (2008–2015 and 2020–2021), which by compressing user costs to a low positive range, could complicate the interpretation of dual user costs over those periods. Mattson and Valcarcel (Reference Mattson and Valcarcel2016) show that the fall of the federal funds rate “likely passed through to the user costs of many of the assets” in Divisia M4 resulting in “unprecedented compression in user costs” and to convergence between Divisia and simple-sum M4 starting in 2009. Nevertheless, the dual user cost does not hit the zero lower bound for any of the CFS Divisia measures (Chen and Valcarcel, Reference Chen and Valcarcel2024).Footnote 10
2.3. Modelling the consumption velocity of Divisia M3
Our analysis focuses on the consumption velocity of CFS’s Divisia M3 aggregate, which measures the monetary services from the components of the Federal Reserve’s simple-sum M2 aggregate as well as from large time deposits, institutional money market mutual funds, and repurchase agreements (RPs). The Federal Reserve’s simple sum M3 aggregate, which was published through early 2006, added institutional money funds, total large time deposits, total repurchase agreements and total Eurodollar deposits to M2; making its components roughly comparable, but not identical, to CFS’s Divisia M3 measure. As Bordo and Duca (Reference Bordo and Duca2025) stress, Divisia M3 helps internalize substitution between the liquidity provided by M2 and non-M2 liabilities, where the latter help track shifts in the role of liquidity provided by instruments funding the shadow banking sector. Of the three broad Divisia measures (M3, M4-, and M4) constructed by CFS, Bordo and Duca (Reference Bordo and Duca2025) were slightly better able to track the GDP velocity of Divisia M3.
The preceding discussion implies that the long-run demand for the log of real Divisia M3 (
$\textit{LDM}$
), takes the form:

where
$\textit{CFMA}$
is a level shift dummy equal to one following passage of CFMA and zero for earlier periods. The remaining variables are the logs of the stock load variable (
$\textit{LSLoad}$
), real consumption (
$\textit{LPCE}$
), and the opportunity cost (
$\textit{LOC}$
), which is either the long-term bond yield or the dual user cost index. For consistency, Divisia M3 should be converted to real terms using the PCE deflator. As discussed above, CFMA was associated with increased money holdings implying that
$\gamma _{1}\gt 0$
. Higher transfer costs should increase money demand implying that
$\gamma _{2}\gt 0$
by reducing the liquidity of assets outside of Divisia M3 and higher opportunity costs are expected to reduce money demand implying
$\gamma _{4}\lt 0$
. Finally, the elasticity of money demand with respect to consumption,
$\gamma _{3}$
, should be positive. For later reference, omitting the opportunity cost variable results in the simplified expression:

If the consumption elasticity of long-run Divisia money demand is unity,
$\gamma _{3}=1$
, (as Bordo and Duca (Reference Bordo and Duca2025) impose for GDP velocity), then eqs. (1) and (1’) can be rearranged into corresponding models of long-run consumption velocity (in logs):

and

respectively, since
$\textit{LV}=\textit{LPCE}-\textit{LDM}$
. Assuming that CFMA bolstered Divisia M3, as argued earlier, then it would have correspondingly lowered its velocity implying that
$\theta _{1}\lt 0$
. Previous arguments likewise imply that
$\theta _{2}\lt 0$
(higher transfer costs raise money demand and hence lower velocity) and
$\theta _{3}\gt 0$
(higher opportunity costs lower money demand and hence increase velocity).
We posit that velocity (real money) adjusts gradually to the deviation of it from its long-run equilibrium,
$\textit{LV}_{t}-\textit{LV}_{t}^{*}$
(
$\textit{LDM}_{t}-\textit{LDM}_{t}^{*}$
), which gives rise to an error-correction model. However, including the CFMA level shift dummy in the long-run relationship must be properly incorporated into the estimation, and there can be transitional short-run dynamics immediately following passage of the Act. To handle these issues, we estimate broken constant models from Johansen et al. (Reference Johansen, Rosconi and Nielson2000). These models generalize the standard cointegration models (see, e.g., Johansen, Reference Johansen1996) to allow for structural breaks in the deterministic components where the breakpoints are known. Since we are concerned with breaks associated with the adoption of the CFMA in our analysis, this is an attractive framework for our empirical design.
2.4. Technical details
The standard VECM framework consists of a model in the form:

where
$k$
is the number of lags of the underlying VAR model (in levels) and the vector X includes LV, LSload, and LOC. Let
$p$
be the number of variables in the VAR. Table 1 reports unit root tests for each of the variables over our sample period (1985–2022). These tests indicate that LV and LSLoad are I(1), but that both LBond and LUsercost may be I(0). While the VECM model can include stationary variables, doing so would increase the rank of the system. The table also includes tests for real Divisia M3 and real PCE (both in logs), which are used in Appendix 1.
Table 1. Unit root tests

Notes: +, * and ** denote 90%, 95% and 99% significance. Lag length or bandwidth statistics are in parentheses. Tests for SupPress begin in 1998 when the series starts. Lag lengths for ADF tests selected using the SIC. Phillips–Perron tests use the quadratic spectral kernel with an Andrews Bandwidth. All tests include an intercept and a time trend.
Under cointegration,
$\Pi$
is of reduced rank and is usually written as
$\Pi =\alpha \beta '$
, with
$\alpha ,\beta$
both being
$p$
by
$r$
. In that case,
$\beta 'X_{t-1}$
represents the (
$r$
) cointegration relations and
$\alpha$
represents the corresponding impact coefficients of those relations on the first differences of each of the system variables in the VAR.
$\Pi _{1}$
is typically imposed to be of the form
$\Pi _{1}=\alpha\gamma '$
, so that the model does not have a quadratic trend in levels (see Johansen et al. Reference Johansen, Rosconi and Nielson2000). Consequently, with the reduced rank condition imposed, the cointegration relations would be trend stationary, since
$\Pi X_{t-1}+\Pi _{1}t=\alpha (\beta 'X_{t-1}+\gamma 't)$
. In applications,
$\gamma =0$
can be further imposed, which eliminates the trend from the cointegration space, but not from the levels. Assuming
$\gamma =0$
, if it is further imposed that
$\mu =\alpha \rho '$
then the constant is restricted to the cointegrating relationship, and the model will not have a linear trend in any direction (Johansen et al. Reference Johansen, Rosconi and Nielson2000, p. 218).
Johansen et al. (Reference Johansen, Rosconi and Nielson2000) generalize this framework by introducing indicator variables for different sample periods:

where
$j=1,\ldots ,q$
. Setting
$T_{0}=0$
for convenience, the effective sample is
$t=1+k,\ldots ,T_{q}=T$
with
$X_{1},\ldots ,X_{k}$
being used as initial observations and there are
$q$
subsamples with effective sample periods running from
$1+k$
to
$T_{1}$
,
$T_{1}+1+k$
to
$T_{2}$
, and so on. Following Johansen et al. (Reference Johansen, Rosconi and Nielson2000, p. 219), we define the vector of sub-sample indicator variables as
$E_{t}=(E_{1,t},E_{2,t},\ldots ,E_{q,t})'$
.
Using this notation, Johansen et al. (Reference Johansen, Rosconi and Nielson2000) proposes several models that allow for breaks in the deterministic components. We impose
$\gamma =0$
, which eliminates the possibility of trends entering the long-run relationship and we focus on the broken constant specification from Johansen et al. (Reference Johansen, Rosconi and Nielson2000, p. 225). That specification is as follows:

where
$D_{j,t-i}=1$
if
$t=T_{j-1}+i$
(and zero otherwise) and
$\mu =(\mu _{1},\ldots ,\mu _{q})$
is a
$p$
by
$q$
matrix. The term,
$\mu E_{t}$
, allows for the constant in the VAR to differ across subsamples. As Johansen et al. (Reference Johansen, Rosconi and Nielson2000, p. 220) explain, the model dummies out the
$k$
initial observations following each breakpoint,
$X_{{T_{1}}+1},\ldots ,X_{{T_{1}}+k}$
,
$X_{{T_{2}}+1},\ldots ,X_{{T_{2}}+k},\ldots$
rendering “the corresponding residuals zero thereby essentially eliminating the corresponding factors from the likelihood function, and hence producing the conditional likelihood function given the initial values in each period.” The relevant reduced rank cointegration hypothesis (Johansen et al. (Reference Johansen, Rosconi and Nielson2000, p. 225) is

so that
$(\Pi ,\mu )=\alpha (\beta ',\rho ')$
, where
$\alpha ,\beta$
are both
$p$
by
$r$
matrices and
$\rho$
is
$q$
by
$r$
. With the reduced rank condition imposed,
$\Pi X_{t-1}+{\unicode{x03BC}} E_{t}=\alpha (\beta 'X_{t-1}+\rho 'E_{t}),$
corresponding to a set of
$r$
cointegration relations with different constants in each of the
$q$
subsamples. Within the same specification, Johansen et al. (Reference Johansen, Rosconi and Nielson2000, p. 219) also consider an alternative reduced rank hypothesis:

Johansen et al. (Reference Johansen, Rosconi and Nielson2000, pp. 218–220) consider the following specification as well:

For this specification, the reduced rank cointegration hypothesis is

so that

where
$\gamma _{j}$
is 1 by
$r$
for all
$j$
. Imposing the reduced rank condition, the cointegration relations have broken trends denoted by
$\gamma _{j}'t$
in each subsample
$j$
. We do not consider this specification further in this paper.
Johansen et al. (Reference Johansen, Rosconi and Nielson2000) derive likelihood ratio tests for
$H_{c}(r)$
and
$H_{\ell c}(r)$
against
$H_{\ell c}(p)=H_{c}(p)$
and for
$H_{\ell }(r)$
against
$H_{\ell }(p)$
, which are in the usual trace test form: i.e.
$\textit{LR}\{H_{c}(r)|H_{c}(p)=-T\sum\nolimits_{i=r+1}^{p}\ln (1-\hat{\lambda }_{i})$
. Corresponding tables of critical values for
$H_{c}(r)$
and
$H_{\ell }(r)$
can be found in Giles and Godwin (Reference Giles and Godwin2012). The critical values depend on the relative breakpoints,
$v_{j}=T_{j}/T$
, where
$T_{j}$
is the last observation of the jth subsample and
$T$
is the full sample size. Johansen et al. (Reference Johansen, Rosconi and Nielson2000, p. 219) find the
$H_{\ell c}(r)$
hypothesis to be “less attractive” than the other two hypotheses, however, in part due to their finding that “the asymptotic analysis is heavily burdened with nuisance parameters” in that case. Consequently, we focus on testing the
$H_{c}(r)$
hypothesis.
In our application, the only structural break we consider is associated with the adoption of CFMA. To align with our notation, we set the number of sub-samples,
$q$
, equal to 2 and set
$T_{1}$
to be 2000Q3 corresponding to the end of the pre-CFMA period, since CFMA passed in 2000Q4. As in eq. (4),
$E_{2,t}$
is an indicator variable that is one from
$T_{1}+1+k$
through the end of the sample, where
$k$
is the number of lags in the VAR, and zero for earlier periods. The inclusion of the
$D_{j,t-i}$
dummies in eq. (5) implies that we do not need to model the transitional impacts of CFMA and that
$E_{2,t}$
is, therefore, effectively an indicator for the post-CFMA period. We follow Johansen et al. (Reference Johansen, Rosconi and Nielson2000) as laid out above with one exception. We set
$E_{1,t}=1$
over the entire sample rather than over just the sub-sample covering the pre-CFMA period. This allows us to present the results in terms of a constant for the full sample and a shift in the constant for the post-CFMA period as in eq. (2). This minor change does not affect the estimation of any other parameters nor does it affect the test procedures employed. We do this to simplify exposition regarding the effect of CFMA.
Estimating eq. (5) and setting
$r=1$
yields coefficients on the X vector components for a single long-run relationship,
$\beta 'X_{t-1}$
, with constant term,
$\rho 'E_{t}$
. The corresponding impact coefficients,
$\alpha$
, determine how the endogenous variables each adjust to the long-run relationship. Since we focus on velocity, the impact of the long-run relationship on the stock load, the long-term Treasury rate, or the user cost are not of particular interest and are not reported. Similarly, we only report first-order short-run dynamics for the change in velocity, as captured by the corresponding row of
$\Gamma _{1}$
(full results are available upon request). By normalizing the
$\beta$
vector on the element corresponding to log velocity,
$\beta 'X_{t-1}$
can be represented in the following form:

Ignoring the constants in the cointegration, we can define long-run equilibrium velocity as

so that the corresponding error-correction term is as follows:

which are the deviations from long-run equilibrium. Incorporating the constants in the cointegration relations results in

as in eq. (2).Footnote 11 As discussed previously, theoretical priors imply
$\theta _{1},\, \theta _{2}\lt 0$
and
$\theta _{3}\gt 0$
.
Our model of consumption velocity builds on Bordo and Duca (Reference Bordo and Duca2025) who model the GDP velocity of broad Divisia but differs in three important respects: First, we use PCE to track the aggregate transactions instead of GDP as discussed above. Second, we adopt the technically more correct approach of estimating the broken constant specification rather than treating CFMA as a regular variable within the VECM as in Bordo and Duca (Reference Bordo and Duca2025). The trace test for the broken constant model can be compared to tables of critical values and are interpreted in the usual way as described above. Third, Bordo and Duca (Reference Bordo and Duca2025) model the impact of COVID-19 with a continuous variable that they construct from an index of government mobility restrictions and vaccination rates. For the sample that includes post-2019 observations, we instead just include simple impact dummies for each of the quarters of 2020, which are sufficient to produce stable long-run relationships over our full sample. Modelling velocity in terms of PCE is more attractive than GDP, because the Fed’s price stability goal is in terms of PCE inflation. We later incorporate estimates from our model of consumption velocity into a P-Star model of PCE inflation.
3. Estimation
3.1. Sample periods
We estimate our consumption velocity model over four sample periods that all start in 1985Q3 but have different end-of-sample periods. Earlier observations are used as initial starting values in the models. We generally find clean residuals for our estimated models with lag lengths of 7 quarters or less as described below. For 7 lags, the 1985Q3 start-of-sample implies that eq. (5) is estimated using only data (in differences) going back to 1984Q1. The starting point is chosen to avoid using earlier data that could be affected by deposit deregulation and financial innovations that led to the introduction of new types of monetary assets.
During the 1970s and early 1980s, there were shifts away from savings and time deposits with binding deposit interest rate ceilings (whose measured user costs were high, because Regulation Q capped deposit interest rates below market levels), to money market mutual funds (MMMFs), and later to money market deposit accounts (MMDAs). Super NOW accounts and MMDAs were both introduced beginning in late 1982. Super NOWs, MMDAs, and MMMFs all paid higher interest rates than regulated deposits and hence had user costs that were relatively low and MMDAs and MMMFs also had checking features unlike traditional savings and small-time deposits. In light of this, the shifts out of regulated deposits would be weighted by a Divisia index using expenditure shares based on relatively high user costs. In contrast, the movement of funds into the assets paying higher rates would receive expenditure share weights based on relatively low user costs. As a result, these shifts would be associated with a distorted decline in measured Divisia (but not simple-sum) money (see Bordo and Duca, Reference Bordo and Duca2025, for a discussion). Indeed, over this period, there was a corresponding upward level shift in Divisia velocity in the late-1970s and early 1980s. Chen and Valcarcel (Reference Chen and Valcarcel2024) find structural breaks in Divisia M2 and Divisia M3 around 1980 and subsequent breaks in 2012 and 1988 respectively. They also find structural breaks in the corresponding user cost measures in 2008, and that “[o]verall, these statistical tests show a preponderance of evidence for a structural break in 1980.”Footnote 12
The earliest end of sample period we consider is 2005Q4 (Model 1), which is just before the bursting of the subprime mortgage bubble and the GFC. Model 2 ends in 2012Q4 to allow ample time for the adjustment of money to the staggered implementation of new financial regulations under the Dodd-Frank Act, through which the U.S. implemented the Basel III reforms. Model 3 ends in 2019Q4, to span the pre-COVID period, while Model 4 uses a full sample ending in 2022Q4. The full sample is based on the availability of the underlying data used to construct the stock load variable. The full sample also differs in that we include dummies for each quarter in 2020 to control for the unprecedented impact of the pandemic during its acute initial phase.
3.2. Estimation results
Our specification of long-run equilibrium consumption velocity is spelled out in eq. (13) with the corresponding error-correction term in eq. (12) which enters into the short-run dynamics of the model. We estimate three versions of the model that differ in terms of opportunity costs. We first assess models that include stock fund loads and the long-term bond yield. We then assess models including stock fund loads and the Divisia M3 user cost measure from the CFS. Finally, we present models that just include stock mutual fund loads. In our estimations, we select lag lengths using two criteria: First, we estimate a simple unrestricted VAR in levels and employ LM tests for serial correlation (up to four quarters) for lag lengths from 2 to 8. From models having clean residuals, we select the lag length that minimizes the Schwartz Information Criterion (SIC).
Table 2. Quarterly models of U.S. consumption velocity including long treasury yields.

Notes: Significance is determined using critical values for the trace test from Giles and Godwin (Reference Giles and Godwin2012) for various values of
$v_{1}$
as described in footnote 13. +, *, and ** denote 90%, 95%, and 99% significance respectively.
$\dagger$
indicates significance at the 90% level for the lower bracketing value only. Long-term coefficient estimates are for
$r=1$
. Absolute t-statistics are in parentheses. For the short-run model, all coefficients are for the velocity equation.
${\Gamma} _{1}(\mathrm{i},\mathrm{j})$
is the
$i,j$
element of
${\Gamma} _{1}$
in eq. (5). Short run coefficients corresponding to higher order lags (
${\Gamma} _{2},\, \ldots {\Gamma} _{k-1}$
) are omitted to save space.
3.2.1. Estimation results using the long-term bond rate
Table 2 reports results for models that use the long-term bond rate as the opportunity cost variable using the lag length prescribed by our procedure (described above) over the four different sample periods. For these models, there are three variables (
$p=3$
), so we can use the trace test to evaluate
$H_{c}(r)$
for
$r=0,1,2$
. If
$r=0$
, then the system is difference stationary. If
$0\lt r\lt p$
, then the model has
$r$
cointegrating vectors and if
$r=p$
, then the system is stationary. Throughout this section, we report results in terms of the coefficients of the long-run equilibrium velocity relationship, eq. (13), since these are easily interpretable. For the shortest subsample, 1985Q3 to 2005Q4 (Model 1), the trace test rejects
$r=0$
at the 99% level.Footnote 13 The rank (
$r$
) is ambiguous though, since the trace test for
$r=1$
is borderline at the 90% significance level and the test for
$r=2$
is rejected at the 90% level suggesting the possibility that the system is stationary. Our main objective is to derive long-run values for consumption velocity for use in the P-star model and we will focus on a single cointegrating relationship throughout this section. Setting
$r=1$
, which corresponds to a single cointegrating relation, results in reasonably signed coefficient estimates for the load variable and for the long-term rate. The coefficient on LSLoad is −0.643 with a t-stat of 16.17 implying that higher loads raise money demand and thereby reduce velocity. The coefficient on LBond is 0.539 with a t-stat of 10.64 implying that higher long-term rates lower money demand and correspondingly raise velocity. The coefficient on CFMA is positive, contrary to expectations. Although it is small, its t-stat is 1.87. The model performs poorly out-of-sample in that the estimated error-correction term begins to trend significantly during the GFC and remains elevated through the onset of the pandemic.Footnote 14
Next, we consider the subsample from 1985Q3 to 2012Q4 (Model 2). For this sample, we find that there is at least one cointegrating vector (
$r=0$
is rejected at the 99% confidence level), but we cannot rule out two vectors at the 90% confidence level, so once again, there is some ambiguity about the rank. For this sample, imposing one cointegrating vector (setting
$r=1$
) yields different results than the pre-GFC sample (Model 1). The coefficient on LSload is −0.316, which is again consistent with theory. While the coefficient on the long-term rate is still positive (0.011), it is close to zero and has a t-stat of just 0.38. The coefficient on CFMA is −0.114, consistent with the view that CFMA promulgated the widespread use of credit default swaps. By enhancing the liquidity of assets held by shadow banks, CFMA thereby increased the demand for shadow bank liabilities in Divisia M3 and correspondingly lowered its velocity.
For the pre-pandemic subsample (Model 3), we find a single cointegrating relationship (
$r=1$
) with sensible coefficients on LSload (−0.277) and CFMA (−0.12) with high t-stats. The coefficient on the long-term Treasury rate, however, is −0.026 with a t-stat of 2.35. While the coefficient is small, its sign is inconsistent with theory.Footnote 15 Finally, we estimate the model over the full sample through 2022Q4 adding dummy variables for each of quarter in 2020 to control for the onset of the pandemic. For the full sample,
$r=0$
is rejected at the 95% significance level and
$r=1$
cannot be rejected at the 90% level again indicating a single cointegrating relation. In this sample, the coefficient on LSload is −0.3 and the coefficient on CFMA is −0.126. The coefficient on the long-term rate is −0.043, however, with a t-stat of 1.62. The COVID dummies effectively delete the observations from 2020 from the likelihood function and their coefficients pick up the steep drop in velocity in 2020Q2 and its partial recovery in 2020Q3.
We conclude that the long-term bond yield only has the anticipated effect on velocity in the pre-GFC sample, but the model did not perform well out-of-sample in that case. In the other three samples, the coefficient on the long-term bond yield is small and often has a counterintuitive sign. Thus, the results do not favor including the long-term Treasury yield as a long-run determinant of consumption velocity.
3.2.2. Estimation results using the dual user cost measure
As Barnett et al. (Reference Barnett, Ghosh and Adil2022) explain, the Divisia dual user cost index is the opportunity cost measure that is most internally consistent with a Divisia monetary aggregate, since it is constructed from the same underlying data on user costs and asset stocks and is derived within the same aggregation-theoretic framework. Moreover, if a long-term rate was the appropriate alternative rate of return, then the user costs underlying Divisia and its dual should employ such a rate as the benchmark.Footnote 16 While the dual user cost is very attractive on theoretical grounds, we were unable to obtain reasonable results using that variable. We now summarize our findings in this regard.
Our lag selection procedure did not produce clean residuals for any lag length between 2 and 8 quarters for the 1985Q3 to 2012Q4 sample (Model 2) when the user cost is used as the opportunity cost variable. Consequently, we estimated the model using several different lags and compared results. At 4 lags, we found that
$r=1$
and the coefficient on LSLoad is −0.271 and the coefficient on CFMA is −0.16, which differ somewhat from the results using the bond rate, but seem reasonable. The coefficient on LUsercost, however, was −0.118 with a t-stat of 4.25. According to the theory, we would expect a higher user cost to negatively affect money demand and, consequently, to positively affect velocity. At 6 lags, there was some ambiguity about the number of cointegrating vectors (
$r=1$
would be rejected at the 90% level). We found negative coefficients for both LSLoad (−0.244) and CFMA (−0.221) when setting
$r=1$
but again found a counterintuitive negative coefficient for LUsercost (−0.222) with a t-stat of 5.65. For the pre-pandemic sample, 1985Q3-2019Q4 (Model 3), we estimated the model using the same lag lengths for comparison. At 4 lags,
$r=1$
is rejected at the 90% level, but if we nevertheless impose
$r=1$
, then the estimated cointegration vector is very similar to the estimates for the shorter sample period. At 6 lags,
$r=1$
was rejected at the 95% level, but setting
$r=1$
yielded coefficients of −0.252 for LSLoad, −0.192 for CFMA, and −0.169 for LUsercost, which had a t-stat of 5.79.Footnote 17
Thus, while the dual user cost is theoretically more attractive, we find that it has a counter-intuitive negative effect on velocity in both sample periods for a range of lag lengths. The exact reason why this happens is beyond the scope of the current paper, but it potentially reflects the impact of the zero-lower bound on the behavior of the underlying user costs following the GFC and continuing for a number of years and also following the onset of the pandemic as discussed earlier. Another factor, which applies to the long-term yield as well, is the potential stationarity of the dual user cost variable. In any case, given these findings, we do not pursue models including the dual user cost further in our analysis.
3.2.3. Estimation results excluding the opportunity cost variables
Building on our previous results, we now estimate more parsimonious models that include just the stock load variable as an explanatory factor while accounting for CFMA as in Bordo and Duca (Reference Bordo and Duca2025), although we estimate our models differently as described above. In these models,
$p=2$
. As reported in Table 3, for Models 1 and 2, which use the pre-GFC (1985Q3 to 2005Q4) and the 1985Q3 to 2012Q4 subsamples respectively, we find a significant cointegrating vector (
$r=0$
is rejected at the 95% significance level or higher and
$r=1$
cannot be rejected at the 90% significance level). For Model 3, which uses the pre-pandemic subsample (1985Q3 to 2019Q4),
$r=0$
is rejected at the 99% significance level, but
$r=1$
is rejected at the 90% significance level, so there is some ambiguity about whether the system is cointegrated or just stationary for that sample. Setting
$r=1$
and normalizing on velocity yields similar coefficient estimates across all three sub-samples in Models 1–3, with coefficients on the load variable near −0.3 and those on CFMA near −0.1. The estimates for these sample periods are all based on VARs with 6 lags. We also estimated VARs with 4 and 8 lags and found similar coefficient estimates.Footnote 18
Table 3. Quarterly models of U.S. consumption velocity excluding long treasury yields.

Notes: Significance is determined using critical values for the trace test from Giles and Godwin (Reference Giles and Godwin2012) for various values of
$v_{1}$
as described in footnote 13. +, *, and ** denote 90%, 95%, and 99% significance respectively. Long-term coefficient estimates are for
$r=1$
. Absolute t-statistics are in parentheses. For the short-run model, all coefficients are for the velocity equation.
${\Gamma} _{1}(\mathrm{i},\mathrm{j})$
is the
$i,j$
element of
${\Gamma} _{1}$
in eq. (5). Short run coefficients corresponding to higher order lags (
${\Gamma} _{2},\, \ldots {\Gamma} _{k-1}$
) are omitted to save space.
The estimated impact coefficient on the error-correction term for log velocity is −0.071 with a t-stat of 2.56 for the pre-GFC subsample indicating that velocity significantly error-corrects. The corresponding coefficients are around −0.12 with t-stats above 4 for the other two subsamples indicating faster adjustment of velocity. The model also performs well out-of-sample insofar as the error-correction terms based on the subsample ending in 2012Q4 are very similar to the ones estimated over the pre-pandemic sample period from 2013 on. This is an improvement vis-à-vis the models that included the long-term rate. When we included the long-term bond rate, we found that the error-correction terms based on the estimates from the subsample ending in 2012Q4 were higher than the ones estimated over the pre-pandemic sample period from 2013 on. Turning to the full sample (Model 4), we identify a significant cointegrating vector (i.e.,
$r=1)$
with sensible coefficients on LSload (−0.320) and CFMA (−0.091) and the impact coefficient on the error-correction term for log velocity (−0.09) is reasonable as well.
To provide more perspective on our results, Figure 5 plots the estimated long-run equilibrium consumption velocities from samples ending in 2012Q4 (Model 2) and 2022Q4 (Model 4). These line up well with each other and with long-run movements in log velocity. The slight lead of the long-run equilibrium relative to actual log velocity reflects the importance of accounting for partial adjustment toward long-run equilibrium for tracking velocity in the short-run and buttresses the credibility of the error-correction models. As explained above, velocity fell initially during the onset of the pandemic when the money supply surged.

Figure 5. Estimated equilibrium consumption velocity trends with long-run velocity.
Sources: CFS, BEA, and author’s calculations.
Overall, including stock loads without either the bond yield or the dual user cost measure works well for modeling consumption velocity. Our model also performs well if we estimate our specification allowing for an unrestricted broken constant (corresponding to
$H_{\ell c}(1)\,$
rather than
$H_{c}(1)$
). Furthermore, we found that estimating real Divisia M3 money demand versions of our model yielded similar results when imposing a single cointegrating vector. These additional results are presented in Appendix 1, parts A and B respectively. Accordingly, we adopt our parsimonious model of long-run equilibrium velocity for our P-Star models of inflation.Footnote 19
3.2.4. Broader levels of aggregation
Finally, we briefly consider the simple velocity model using the broader Divisia M4- and M4 aggregates, both of which include commercial paper. We estimated Models 1–3 excluding opportunity cost variables using both of these aggregates for lag lengths of 4, 6, and 8 similar to Divisia M3 as discussed above. For M4, we found generally similar coefficient estimates for the pre-GFC sample period (Model 1) and the sample period ending in 2012Q4 (Model 2) across the various lag lengths. For these two sample periods, the estimated coefficients on the load variable ranged from −0.25 to −0.21 and the estimated coefficients on CFMA ranged from −0.09 to −0.075 (with t-stats above 6.0 in all cases) consistent with our theoretical priors. The estimated coefficients differed for the pre-pandemic sample period (Model 3). At 4 lags, we found that the coefficient on the load variable was −0.27 (t-stat. of 5.93) and the coefficient on CFMA was −0.054 (with a t-stat of 2.52). At 6 and 8 lags, the coefficients on the load variable were around −0.3 with t-stats above 5.75 and the coefficients on CFMA were −0.038 and −0.021 (with t-stats of 1.49 and 0.97) respectively.Footnote 20 Thus, there is evidence of parameter instability especially regarding the impact of CFMA between the sample period ending in 2012Q4 and the pre-pandemic sample period.
For M4-, there is additional evidence of parameter instability. For the pre-GFC sample period (Model 1), the coefficient estimates at 6 and 8 lags are around −0.28 for the load variable and around −0.08 for CFMA. At 4 lags, the corresponding estimates are −0.153 for the load variable and −0.103 for CFMA. For Model 2, however, the coefficient estimates ranged from −0.322 to −0.275 for the load variable and from −0.056 to −0.03 for CFMA. At 6 lags, the coefficient estimate on CFMA (−0.03) had a t-stat of just 0.92. For the pre-pandemic sample period (Model 3), the estimated coefficients on the load variable were around −0.37, but the coefficients on CFMA were wrongly signed (estimates ranged from 0.002 to 0.024), although the t-stats were below 0.7 for all three lag lengths.Footnote 21
Thus, for Divisia M4, and even more so for Divisia M4-, there is some evidence of parameter instability particularly with respect to the impact of CFMA on long-run velocity vis-à-vis Models 2 and 3. Throughout the remainder of the paper, our focus will be on prediction via the P-Star model and the relative stability of the parameter estimates for Divisia M3 is an attractive feature in that respect. Consequently, we will focus on Divisia M3 for subsequent analysis.
A plausible explanation for the difficulty in modeling the demand for Divisia M4- and Divisia M4 is that they include the monetary services from assets that have not been transformed by intermediaries into broadly accepted means of payment. As described earlier, Divisia M4- augments Divisia M3 by including commercial paper. When scaled by trend potential nominal GDP, commercial paper holdings shifted up with the rise of asset-backed commercial paper from the mid-1990s to the mid-2000s (Figure 6).Footnote 22 This occurred when shadow banks made greater use of leverage in funding their assets by issuing commercial paper much, but not all, of which was held by money funds. This rise unwound sharply starting with the onset of the Global Financial Crisis and amplified after Lehman Brothers failed. Even though the opportunity cost of commercial paper is small, the large swings in asset holdings could have loosened the overall link between Divisia M4- and nominal spending. In addition to including commercial paper, M4 also includes Treasury bills held outside of banks and money funds, which may further affect the ability to model the velocity of M4.

Figure 6. Commercial paper not held by money funds scaled by potential nominal GDP.
Sources: Financial Accounts of the U.S., CBO, and authors’ calculations.
4. Using Divisia M3 in a P-Star framework for predicting inflation
4.1. A P-star model for consumer inflation
As discussed previously, P-Star models are derived from the equation of exchange:

where
$P_{t}$
is the price level,
$Y_{t}$
is real income,
$M_{t}$
is nominal money, and
$V_{t}$
is velocity. As in Belongia and Ireland (Reference Belongia and Ireland2015, Reference Belongia and Ireland2017, Reference Belongia and Ireland2021) and Ireland (Reference Ireland2024), the long-run price-level target is:

In natural logs (denoted by lower case letters), eq. (14) takes the following form:

and, similarly for eq. (15). If the log of the price level has a unit root (is integrated of order 1), then an error-correction model of inflation (
$\pi ={\Delta} p$
) could be specified as:

where γ is expected to be positive,
$q$
is the lag length, and
$\varepsilon$
is an i.i.d. residual. Equation (17) is the form of P-star estimated by, for example, Kamal (Reference Kamal2014). However, for the sample period used by Hallman et al. (Reference Hallman, Porter and Small1991),
$\pi$
was not stationary and had a unit root. Appealing to the inflation-augmented Phillips Curve, Hallman et al. (Reference Hallman, Porter and Small1991) cast their P-Star model in terms of the change in inflation (
${\Delta} ^{2}p={\Delta} \pi$
), which was stationary over their sample periods resulting in:

Their model is equivalent to regressing the change in inflation on its lags and the price gap:

One can decompose the “price gap” in logs (
$p^{*}-p$
) as in Belongia and Ireland (Reference Belongia and Ireland2017) as:

which implies that eq. (19) can be transformed into:

Ireland (Reference Ireland2024) estimates versions of (21) that allow the coefficients on the two gap terms to differ.Footnote 23
Following Ireland (Reference Ireland2024), we treat
$y$
as real PCE and use the PCE price deflator to measure inflation, so that
$y^{*}$
is trend or equilibrium real consumption; see also Belongia and Ireland (Reference Belongia and Ireland2017). The change in the log of the PCE deflator (ΔLPPCE) is stationary over 1985 to 2024Q3; see Table 1. Consequently, we estimate a specification corresponding to eq. (17) while incorporating the decomposition of the gap term in eq. (20), which results in our general estimating equation:

which allows for the coefficients on the two gap terms to differ as in Ireland (Reference Ireland2024). We consider specifications where
$\gamma ^{v}=\gamma ^{y}$
is imposed and ones where it is not. In estimating the latter specifications, we report the actual regression coefficient on the consumption gap, which we expect should be negative.
We use our velocity model developed in Section 3.2.3 to estimate long-term equilibrium log velocity,
$v^{*}$
. To implement the P-star model, we treat the 1985Q3 to 2012Q4 period as our “in sample” and then calculate the “velocity gap,”
$v^{*}-v$
, out of sample from 2013Q1 to 2024Q3. Here, we set the stock load variable at its 2022Q4 value in 2023-24 in order to cover more of the period encompassing the partial winding down of inflation. As Figure 5 shows, velocity tends to be above its long-run equilibrium following the GFC until just before the pandemic. Consequently, the velocity gap is generally negative throughout the 2010s implying downward pressure on inflation due to monetary factors given eq. (22). Although not conclusive, this is certainly suggestive given that core inflation was nearly always below the Fed’s 2% target throughout this period. We select the lag length by the SIC criteria following Ireland (Reference Ireland2024).
Belongia and Ireland (Reference Belongia and Ireland2015) use the one-sided HP filter (Stock and Watson, Reference Stock and Watson.1999) to detrend real income and velocity.Footnote 24 We too use a one-sided HP filter to calculate equilibrium log consumption in order to compute
$y^{*}-y$
, which we will refer to as cgap in the tables. As explained above, however, we modify the Belongia-Ireland (B-I) framework by using our parsimonious model of equilibrium velocity to calculate the velocity gap,
$v^{*}-v$
, which we will refer to as vgap. Later, we compare results to using the one-sided HP filter to also detrend log velocity. Finally, we use pgap, which equals vgap minus cgap, to refer to the price gap. A potential advantage of our model-based approach for velocity is that some detrending filters may not perform well over the pandemic or in the lead up to it. For example, Kamber et al. (Reference Kamber, Morley and Wong2025) found that for the output gap, “the real-time HP filter estimates in 2018–2019 suggested that the economy was very close to trend, while the estimates for the same time period have been subsequently revised upwards...” In contrast they found that their refined Beveridge-Nelson filter “would have provided a better prediction of the final-vintage HP filter estimates than a one-sided HP filter” and also had “good revision properties during the pandemic.”Footnote 25 Our model of equilibrium velocity is estimated over an in-sample period that excludes COVID-19 and its out-of-sample performance is impressive as seen in Figure 5. Nevertheless, we are still subject to this criticism as it relates to detrended consumption.
4.2. Modeling overall and core PCE inflation
We use the framework in eq. (22) to model overall and core PCE inflation: ΔLPPCE and ΔLPcorePCE respectively. We use the estimated coefficients from Model 2 in Table 3 corresponding to the 1985Q3-2012Q4 sample period, to calculate
$v^{*}$
. Using these coefficients, we calculate a static forecast of
$v^{*}$
out-of-sample from 2013 until the end of our sample period and the corresponding velocity gap term,
$v^{*}-v$
(vgap). As previously noted, we extend the out-of-sample period to 2024Q3. The in-sample period ends in 2012Q4, which reflects two factors. First, as Figure 5 shows, this provides ample time for both equilibrium and actual velocity to recover from the large negative shock associated with the subprime and global financial crisis (GFC) of 2007–11. Moreover, the regulatory response to the GFC took the form of the Dodd-Frank Act, which was passed in mid-2010, and which came out in phases and affected money creation. We then estimate the P-Star model eq. (22) from 2013Q1 to 2024Q3. All regressions include separate dummies for 2020Q2, 2020Q3, and 2020Q4 to control for the initial phase of the pandemic.
To eq. (22), we add two variables to control for other unusual shocks. All of our estimated models include DPEnergy, which is a dummy equal to 1 in any quarter in which the PCE price index for energy goods and services falls by more than 10 percent relative to a PCE price index which excludes energy goods and services. Including this dummy addresses large outliers that are reflective of global shocks to energy prices while avoiding introducing multicollinearity with smoother energy price variables, such as the continuous percent change in relative energy prices that may partly reflect general trends in overall inflation. The dummy picks up 2020Q2, but we set it to zero in that period to avoid conflating the coefficient on it with the worst quarter of the pandemic.Footnote 26 Consequently, the dummy is de facto a dummy for 2015Q1, when energy prices sagged during a major downshift in the long-term growth rate of the Chinese economy. Owing to its construction, DPEnergy is expected to have a negative coefficient, which is expected to be larger in magnitude for overall than for core PCE inflation. The coefficient in core PCE inflation models may be significant and negative owing to the pass-through of energy price drops into the prices of other consumer items.
To shed light on how disruptions to supply chains affected inflation during the recovery from the COVID Recession, we test versions of eq. (22) that also include the t-1 level of the quarterly average of the FRBNY’s Index of Global Supply Chain Pressures (SupPress). This index draws information from several indicators of supply pressures and effort is made to remove the influence of demand side factors from the index. During the pandemic, spikes in SupPress reflect breakdowns in market functioning associated with supply chain disruptions that are not easily captured by our measure of the consumption gap and are plausibly inflationary in nature. Over a sample from when SupPress begins in 1998Q1 to 2024Q3, the unit root tests (Table 1) indicate that it is I(0), implying that the level of SupPress could be tested in a regression with other variables that are stationary. We separately add the t-1 change and the t-1 level of the index to different models because it is unclear a priori which should enter the P-Star model. For overall PCE inflation, we include both the t-1 and t-2 lags of ΔSupPress when testing for the effects of changes.
Table 4. Quarterly P-star models of U.S. PCE (consumer) inflation, 2013Q1-2024Q3.

Notes: +,*,** denote 90%, 95% & 99% significance. Absolute t-statistics are in parentheses. vgap is formed using estimates of v * from Model 2, Table 3, estimated over 1985Q3-2012Q4. cgap is formed from real PCE and an estimate of trend real PCE from a one-sided HP filter.
Table 5. Quarterly P-Star models of U.S. core PCE (consumer) inflation, 2013Q1-2024Q3.

Notes: +,*,** denote 90%, 95% & 99% significance. Absolute t-statistics are in parentheses. vgap is formed using estimates of v * from Model 2, Table 3, estimated over 1985Q3-2012Q4. cgap is formed from real PCE and an estimate of trend real PCE from a one-sided HP filter.
We estimate six P-Star models for quarterly overall PCE inflation and core PCE inflation (annualized) over a 2013Q1-2024Q3 sample, which are reported in Tables 4 and 5 respectively. By extending the sample through 2024Q3, we are able to bolster the limited number of observations and to pull in more data from the recent period when SupPress moves a lot. In each table, Model 1 is a P-Star model that follows eq. (22) allowing for differences in the coefficients on vgap and cgap as in Ireland (Reference Ireland2024). Model 2 combines the velocity (vgap) and consumption (cgap) gaps into the price gap (pgap = vgap − cgap) thus imposing equality of the coefficients (in absolute value) as in the standard formulation of the model (e.g., Belongia and Ireland, Reference Belongia and Ireland2017). Given the potential errors in measuring consumption gaps in real time during the pandemic, it may be more accurate to focus on the overall price gap rather than its subcomponents. Models 1 and 2 omit SupPress. Models 3 and 4 repeat Models 1 and 2, respectively, except that they add the lagged level of SupPress, while Models 5 and 6 add the lagged change in SupPress to the models of core inflation and both the first and second lags of the change in SupPress to the models of overall PCE inflation.
4.3. Estimation results for overall and core PCE inflation
Several notable patterns emerge in Tables 4 and 5. In the even-numbered models, the combined price gap term is statistically significant. In the odd-numbered models that decompose the price gap, the velocity gap terms are significant with the expected positive sign.Footnote 27 The real consumption gaps have insignificant coefficients in all cases with unexpected positive coefficients in two cases. Moreover, the coefficient on the consumption gap is lower in absolute magnitude than the coefficient on the velocity gap across all specifications. The lagged level of the supply chain index has a positive and significant coefficient for all models in Tables 4 and 5, while the coefficient on the lagged change in the index is significant and positive as expected for models of core PCE inflation in Table 5. For overall PCE inflation, the first lag of the change is insignificant, but positive, while the second lag is significant and positive. The evidence that the velocity and price gaps have statistically significant and positive effects, while the coefficients on the level and first difference of SupPress are positive and significant, together imply roles for both aggregate demand and supply pressures in driving the rise and partial ebbing of inflation during the COVID recovery. The COVID impact dummies for 2020Q2 and 2020Q4 are highly significant in all specifications.
Figure 7 illustrates the estimated role of these factors on core PCE inflation using Model 4 from Table 5. Actual core inflation is plotted along with an adjustment for the COVID dummy effects. The figure plots the estimated effect of global supply pressures and the sum of the estimated effects of the lag of pgap and of the level of supply pressures, where the latter is the product of the lag of SupPress and its estimated coefficient. The gap between the latter two lines reflects how much the lagged deviation between equilibrium prices and actual prices (pgap) affects inflation beyond that captured by including five quarterly lags of inflation (and the energy dummy). It is unclear how much of the lagged changes in inflation reflect past aggregate demand versus supply shocks. The chart illustrates how the standard P-Star price gap and changes in supply conditions are directly linked to recent movements in core inflation rates. While the estimated contribution of supply pressures to the temporary run-up of core inflation is notable, more of it is attributable to aggregate demand pressures reflected in the gap between the two lower plotted lines.
Figure 8 repeats Figure 7 except it uses estimated coefficients from Model 6 in Table 5 and also gauges supply chain pressure effects using the change instead of the level of SupPress. The only notable difference in results is that the estimated effect of global supply chain pressures is much smaller, implying that there was an even larger relative role for aggregate demand pressures in explaining the 2021−22 run-up in core PCE inflation and its retreat in 2023 to mid-2024.

Figure 7. Effects of the price gap and supply chain (level) pressures on U.S. core inflation.
Sources: BEA, FRBNY, and authors’ calculations.

Figure 8. Effects of the price gap and supply chain (changes) pressures on U.S. core inflation.
Sources: BEA, FRBNY, and authors’ calculations.
4.4. Estimation results using the HP filter to model long-run velocity
Tables 6 and 7 report results from overall and core PCE inflation models that correspond to the models in Tables 4 and 5, respectively, except that they use a one-sided HP filter to calculate the velocity gap, as in Belongia and Ireland (Reference Belongia and Ireland2015, Reference Belongia and Ireland2017, Reference Belongia and Ireland2021) and Ireland (Reference Ireland2024), and consistent with how we calculated cgap. We use vgapHP and pgapHP to distinguish these from the earlier estimates in Tables 4 and 5. We refer to models in Tables 4 and 5 as our “modified approach.”
Table 6. Quarterly P-star models of U.S. PCE inflation (HP filter), 2013Q1-2024Q3.

Notes: +,*,** denote 90%, 95% & 99% significance. Absolute t-statistics are in parentheses. vgapHP is formed from velocity and an estimate of trend velocity from a one-sided HP filter. cgap is formed from real PCE and an estimate of trend real PCE from a one-sided HP filter.
Table 7. Quarterly P-star models of U.S. core PCE inflation (HP filter), 2013Q1-2024Q3.

Notes: +,*,** denote 90%, 95% & 99% significance. Absolute t-statistics are in parentheses. vgapHP is formed from velocity and an estimate of trend velocity from a one-sided HP filter. cgap is formed from real PCE and an estimate of trend real PCE from a one-sided HP filter.
There are several patterns across the overall PCE inflation results in Tables 4 and 6. First, in the odd numbered models, the velocity gap is either less significant or not at all for the HP filtered velocity gaps (Table 6), whereas vgap is always significant at the 99% level in our modified models (Table 4). Second, and similarly, the price gap terms in the even-numbered models are more significant in the modified models in Table 4 than in the corresponding models in Table 6 except for Model 6. Third, the fit of the modified models in Table 4 is uniformly better (including for Model 6) as evidenced by the higher adjusted R2’s across the board that range from 0.092 higher for Model 4 (0.751 vs. 0.843) to 0.188 higher for Model 5 (0.683 vs. 0.871).Footnote 28
The differences in significance of the price and velocity gaps and overall fit of the model arise from the differences in how the velocity gap is calculated. These are compared in Figures 9 and 10. During much of the period from 2010 to 2019, the one-sided HP filter trend is above velocity leading to small, but often positive, velocity gaps, indicating upward pressure on inflation. In contrast, the velocity gap from our model is generally slightly negative over 2010-19 (since velocity is usually above its long-run equilibrium over this period), implying that velocity was putting downward pressure on inflation in a period when inflation was generally slightly below the Fed’s 2 percent target.

Figure 9. HP trend velocity lags actual U.S. velocity in the early-2020s.
Sources: CFS, Federal Reserve, BEA, and author’s calculations.

Figure 10. Estimated velocity gap implies more plausible swings in U.S. inflationary pressures than an HP filter-based measure.
Sources: CFS, Federal Reserve, BEA, and author’s calculations.
From 2020 through 2024, the one-sided HP trend lags behind the actual u-shaped velocity trajectory as shown in Figure 9. This produces a short-lived upward spike in the velocity gap as shown in Figure 10 after which the velocity gap turns negative in mid-2021 and bottoms out in early 2023—suggesting downward pressure on inflation precisely when inflation was surging. In contrast, our velocity gap also spikes and then declines, but much more gradually, and remains positive through early 2023. Then, small negative velocity gaps and negative money growth accompany the sharp deceleration of inflation since early 2023. Thus, our velocity gap measure appears more consistent with actual inflation.
Similar qualitative differences arise when modeling core PCE inflation, Tables 5 and 7, except that they are smaller in magnitude in several ways. While the modified models in Table 5 fit better across the board, the improvement in fit relative to the corresponding models in Table 7 is not as large for core inflation. This partly reflects the fact that vgapHP and pgapHP are both statistically significant at the 99 percent level in two of core PCE inflation models in Table 7 and have notably higher t-statistics than in the corresponding models of overall inflation in Table 6. Despite some of these nuanced differences, however, the model fits are better, the velocity and price gap terms are more statistically significant, and the magnitude of the effects of supply pressures is smaller for core inflation when the velocity gap is calculated out-of-sample from our error-correction model versus when it is calculated via a one-sided HP filter.
Taken together, these results suggest that our modified model enhances the ability of the P-star model to gauge inflationary pressures over this period and that by doing so, it helps us to avoid overstating the role of supply pressures in contributing to the rise and the partial ebbing of inflation over 2021-2024.
5. Conclusion
Broad Divisia measures of money services were ignored by many macroeconomists during the COVID Recession and the recovery from it because of past instability of the demand for simple-sum monetary aggregates. We show that the velocity of U.S. broad Divisia M3 is well modeled by, and is a stable function of, stock mutual fund loads and the shift in derivatives legislation since 1985. We also demonstrate that deviations of actual from equilibrium velocity help explain recent inflation in the U.S. We estimate a parsimonious error-correction model for the consumption velocity of Divisia M3 and use it to calculate the deviation of velocity from its long-run equilibrium out of sample. We then incorporate the corresponding velocity gap into the P-star model advocated by Belongia and Ireland (Reference Belongia and Ireland2015, Reference Belongia and Ireland2017) and Ireland (Reference Ireland2024) among others allowing us to incorporate structural factors altering the consumption velocity of Divisia M3. We find that our approach enhances the ability of the P-star model to gauge inflation pressures resulting from the velocity gap. In our modified framework, we find that only some of the 2021−22 surge in U.S. inflation is attributable to global supply pressures, with a larger contribution coming from aggregate demand. From a broader perspective, our findings are consistent with recent studies, which imply a role for both strong aggregate demand growth and negative supply factors in driving up U.S. core inflation during 2021−22 (e.g., Bernanke and Blanchard, Reference Bernanke and Blanchard2025; Di Giovanni et al. Reference Di Giovanni, Kalemli-Ozcan, Silva and Yildirim2022).
Movements in Divisia M3 and its short-run determinants plausibly reflect the confluence of macroeconomic factors. These can include some fiscal policy actions that were indirectly supported by Fed balance sheet actions, as well as shifts in risk aversion associated with the unfolding of the pandemic. Indeed, there are strong parallels during and after the pandemic between the build-up and unwinding of excess saving in response to tax cuts and changes in risk aversion from a Keynesian perspective, and a rapid rise and then fall of real monetary services from a more monetary view. This argues against a simple and overly reductive monetarist interpretation of our findings. Instead, as Nelson (Reference Nelson2003) argued, “a spectrum of yields matters for the determination of aggregate demand and money demand” so that “money conveys information about monetary conditions not summarized by the short-term interest rate.” Nelson’s (Reference Nelson2003) point is strengthened by the subsequent use of unconventional monetary policy tools in the Great and COVID Recessions that affected long-term Treasury yields and of private bond backstops that capped private credit risk premia in the COVID Recession (see Bordo and Duca, Reference Bordo and Duca2021, Reference Bordo and Duca2022). Further, Divisia money measures are better suited for an indicator role than simple sum monetary aggregates that assume perfect substitutability and which make no use of monetary asset user costs in their construction. We argue that broad Divisia measures are more reflective of variation in overall aggregate demand pressures and their lagged effects on the macroeconomy. From this perspective, the usefulness of broad Divisia money as an important U.S. macroeconomic indicator is consistent with the central themes and the title of Don Patinkin’s (Reference Patinkin1956) classic book, Money, Interest, and Prices, in which prices, quantities, and lagged adjustment play key roles in the macroeconomy.Footnote 29
Acknowledgements
The views expressed are those of the authors and are not necessarily those of the Federal Reserve Bank of Dallas or the Federal Reserve System. We thank two anonymous referees for helpful comments and suggestions. We thank participants from sessions at the Eastern Economic Association’s 2025 Annual Conference and at the 29th (2025) International Conference on Macroeconomic Analysis and International Finance for helpful comments. We also acknowledge our intellectual debt to Richard Anderson, Milton Friedman, Stephen Goldfeld, Richard Porter, Anna Schwartz, David Small, and Jim Tobin. Any errors are our own.
Appendix 1: Robustness checks on the preferred model of consumption velocity
A. Unrestricted constant version of the consumption velocity model
As a robustness check, we estimate a model corresponding to eq. (5), with
$\Pi$
being of reduced rank, but leaving
$\mu$
unrestricted as in
$H_{\ell c}(r)$
. This results in a model of the form:

We estimate eq. (5’) with
$r=1$
for the samples ending in 2012Q4 and 2019Q4. For this specification, the constant in the cointegration relations equals the estimated constants in eq. (5’) multiplied by
$\alpha (\alpha \alpha ')^{-1}$
following Johansen (Reference Johansen1991, p. 1553). For both samples, the coefficient on LSload is about −0.29, the impact coefficient for velocity is around −0.15, and the coefficients on CFMA are −0.107 and −0.101 for the samples ending in 2012Q4 and 2019Q4, respectively. The error-correction terms are also very similar to those estimated from the less general model.
B. Estimating an equivalent money demand model of real Divisia M3
The consumption velocity models effectively impose a unitary income elasticity of money demand. As a robustness check, one can instead estimate real money demand models by separating the consumption velocity into real Divisia M3 (LDM) and real PCE (LPCE), where real Divisia M3 equals nominal Divisia M3 divided by the implicit PCE price deflator. In this case, we would be interested in the long-run money demand function corresponding to eq. (1’) in the main text, which we repeat here for the reader’s convenience:

We estimate this specification over the 1985Q3-2012Q4 and 1985Q3-2019Q4 sample periods using the same number of lags as for the corresponding velocity models for consistency. For the pre-pandemic sample period,
$r=0$
is rejected at the 99% significance level,
$r=1$
is rejected at the 95% significance level, and
$r=2$
cannot be rejected at the 90% significance level. Despite these results, we are primarily interested in the case where there is a single cointegrating vector. Setting
$r=1$
, we normalized the estimated cointegration vector on LDM and represent the other coefficients in terms of a long-run real money demand function as in eq. (1’). If the estimated consumption elasticity is close to one, then we would expect the estimated coefficients on LSLoad and CFMA to be similar to their estimated values in the velocity models, but with opposite signs, since
$\textit{LV}=\textit{LPCE}-\textit{LDM}$
as explained in the main text. We found that the estimated consumption elasticity is 1.004, the estimated coefficient on the stock load is 0.285, and the estimated coefficient on CFMA is 0.124. We also estimated the specification corresponding to eq. (5’), as described in part A of this Appendix. Setting
$r=1$
, we found nearly identical estimated coefficients for LPCE and LSLoad and an estimated coefficient on CFMA of 0.113. Thus, despite the results regarding the rank of the system, if we impose a single cointegrating vector, we find that the model is very consistent with our velocity models presented in the main text.
For the subsample ending in 2012Q4,
$r=0$
is rejected at the 99% significance level,
$r=1$
is rejected at the 90% significance level, but not at the 95% level, and
$r=2$
cannot be rejected at the 90% significance level. Again setting
$r=1$
, we found that the estimated consumption elasticity is 1.16, the estimated coefficient on the stock load is 0.364, and the estimated coefficient on CFMA is 0.097. For the specification corresponding to eq. (5’) over this sample period with
$r=1$
, we found estimated coefficients of 1.13 on LPCE, 0.351 on LSLoad and 0.093 on CFMA. Once again, the estimated coefficients are generally in line with those of the velocity model when a single cointegrating vector is imposed, although we find a consumption elasticity of money demand that is close to but slightly higher than unity making the results somewhat less compelling.