Abstract This article shows that accounting for variation in mistakes can be crucial for welfare analysis. Focusing on consumer under-reaction to not-fully-salient sales taxes, we show theoretically that the efficiency costs of taxation are amplified by differences in under-reaction across individuals and across tax rates. To empirically assess the importance of these issues, we implement an online shopping experiment in which 2,998 consumers purchase common household products, facing tax rates that vary in size and salience. We replicate prior findings that, on average, consumers under-react to non-salient sales taxes—consumers in our study react to existing sales taxes as if they were only 25% of their size. However, we find significant individual differences in this under-reaction, and accounting for this heterogeneity increases the efficiency cost of taxation estimates by at least 200%. Tripling existing sales tax rates nearly doubles consumers’ attention to taxes, and accounting for this endogeneity increases efficiency cost estimates by 336%. Our results provide new insights into the mechanisms and determinants of boundedly rational processing of not-fully-salient incentives, and our general approach provides a framework for robust behavioural welfare analysis. 1. Introduction When incentive schemes are complex, or when decision-relevant attributes are not fully salient, consumers may make mistakes. A growing body of work documents inattention to, or incorrect beliefs about, financial incentives such as sales taxes (Chetty et al., 2009), shipping and handling charges (Hossain and Morgan, 2006), energy prices (Allcott, 2015), and out-of-pocket insurance costs (Abaluck and Gruber, 2011). Such studies typically estimate the “average mistake”, usually because inferring mistakes at the individual level is difficult or impossible with available data. Correspondingly, policy analysis building on these results often studies a representative agent committing the “average mistake”, and thus assumes that mistakes are homogeneous. In this article, we demonstrate that accounting for variation in mistakes can substantially impact policy analysis. We highlight two crucial ways in which variation in mistakes matters. First, the variation in mistakes across consumers matters: the greater the individual differences, the lower the allocational efficiency of the market, because these differences drive a wedge between who buys the product and who benefits from it the most. Second, the variation in mistakes across different incentives matters: this variation creates a debiasing channel that can accentuate the demand response to policy changes. In the theoretical component of this article, we formalize the role of these two channels in shaping the efficiency cost of taxation when consumers misreact to sales taxes. In the empirical component of this article, we directly examine these two dimensions of variation in a large-scale online shopping experiment and demonstrate that their quantitative impact on welfare analysis is substantial. To formalize these arguments, we begin with a model—building on and generalizing Chetty (2009), Chetty et al. (2009, henceforth CLK), and Finkelstein (2009)—of consumers who choose whether or not to purchase a good in the presence of a sales tax. The sales tax is potentially non-salient, and consumers may not correctly account for its presence in their purchasing decisions. Breaking from earlier theoretical treatments of tax salience, we allow for arbitrary heterogeneity in both consumers’ valuations for the products and consumers’ misreaction to the tax. We present a series of results that generalize the canonical Harberger (1964) formula for the efficiency costs of taxation. We find that the efficiency cost of imposing a small tax in a previously untaxed market is increasing in the mean of the weights that the marginal consumers place on the tax when making purchasing decisions—thus, as in CLK, homogeneous under-reaction reduces efficiency costs.1 However, we additionally show that inefficiency is increasing in the variance of misreactions to a degree of equal quantitative importance. The result arises because variation in mistakes across consumers generates misallocation of products. When under-reaction to the tax is homogeneous, the product is always purchased by those consumers who value it the most, and thus the market preserves the efficient sorting that is obtained with fully optimizing consumers. However, when consumers vary in their misreaction, purchasing decisions depend on both their valuation of the good and on their propensity to ignore the tax, thus breaking the efficient sorting property. The consequences of misallocation are particularly stark when supply is inelastic relative to demand and thus the equilibrium quantity purchased is relatively unaffected by taxation—a situation in which efficiency costs are low when consumers optimize perfectly but can be substantial in the presence of varying mistakes. When evaluating “small” taxes, the mean and variance of marginal consumers’ misreaction—together with the price elasticity of demand—are sufficient statistics for computing efficiency costs. When considering increasing pre-existing taxes, however, accounting for how misreaction changes with the tax rate is crucial. If increases in the tax rate increase attention, and thus “debias” consumers, the distortionary effects of tax increases can be substantially higher than would otherwise be expected under the hypothesis that attention is exogenous. Intuitively, this is because consumers act as if prices have increased not only by the salient portion of the new tax, but also by a portion of the existing tax that they had previously ignored, but now do not.2 Taken together, these theoretical results show that empirical estimates of the variation in mistakes are crucial for welfare analysis. However, measurement of variation in mistakes requires data sets containing richer information than simple aggregate demand responses. This motivates our experimental design. Our experiment studies the behaviour of 2,998 consumers—approximately matching the U.S. adult population on household income, gender, and age—drawn from the forty-five U.S. states with positive sales taxes. The experiment utilizes an online pricing task with twenty different non-tax-exempt household products (such as cleaning supplies), and with between- and within-subject variation of three different decision environments. The decision environments induce exogenous variation in the tax applied to purchases, featuring either (1) no sales taxes, (2) standard sales taxes identical to those in the consumer’s city of residence, or (3) high sales taxes that are triple those in the consumers’ city of residence. Decisions in the experiment are incentive compatible: study participants use a $20 budget to potentially buy one of the randomly chosen products, and purchased products are shipped to their homes. We begin our empirical analysis by estimating the average amount by which study participants under-react to taxes. Following CLK, we measure under-reaction by estimating the implicit weight placed on taxes, denoted by $$\theta$$. This measure constitutes a sufficient statistic for welfare analysis when mistakes are homogeneous. In the standard-tax condition, we estimate an average $$\theta$$ of 0.25: study participants react to the taxes as if they are only 25% of their size. This result is quantitatively similar to that of CLK, who find an average $$\theta$$ of 0.35 in an analysis of grocery store purchases and an average $$\theta$$ of $$0.06$$ in an analysis of demand for alcoholic beverages. Our estimates fall within the confidence intervals of this previous work, and our design affords significantly greater statistical power. In the triple-tax condition, in contrast, study participants react to the taxes as if they are just under 50% of their actual size. Across specifications, this increase in weight placed on the tax is significant at least at the 5% level, and provides initial evidence that consumers are more attentive to higher taxes. Complementing this evidence, we also show that consumers are on average more likely to under-react to taxes on particularly cheap products (priced below $5), than they are to taxes on more expensive products (priced above $5). Having established variation of misreaction across tax rates, in the second part of our empirical analysis we focus on variation of misreaction across consumers. This analysis is directly motivated by the efficiency cost formulas that we derive, which show that the efficiency cost of a small tax $$t$$ on a product sold at price $$p$$ depends on the variance of under-reaction by consumers who are on the margin at $$p$$ and $$t$$. The corresponding statistic of interest is thus the average—computed with respect to the distribution of $$p$$ and $$t$$ in the experiment—of $$Var[\theta|p,t]$$. We bound this statistic through a novel combination of a “self-classifying” survey question and experimental behaviour, in a way that requires no assumptions about truth-telling or metacognition. Our estimates of the bound imply that for taxes that are the size of those observed in the U.S., the variance of consumer mistakes increases the efficiency cost estimate by over 200% relative to what would be inferred under the assumption that consumers are homogeneous in their mistakes. This article relates to three distinct literatures. First, beyond extending and generalizing the existing work on tax salience ($$e.g.$$ CLK, Finkelstein, 2009; Feldman and Ruffle, 2015; Feldman et al., 2015), the article broadly contributes to a growing theoretical and empirical literature in “behavioral public economics” (see Chetty (2015) for a review, and Mullainathan et al. (2012) and Farhi and Gabaix (2015) for general theoretical frameworks). Some of our own previous work on corrective taxation in energy markets has emphasized the importance of welfare estimates that are robust to heterogeneous bias (Allcott and Taubinsky, 2015; Allcott et al., 2014).3 This article focuses on an importantly different domain and is the first, to our knowledge, to explicitly formalize the welfare-relevant statistics of mistake variation and to empirically measure those statistics. These results have immediate applications to the literature on tax misunderstanding;4 however, our framework for analysing variation in mistakes is broadly portable, and can serve as a template for empirical analysis of other psychological biases, and in other domains of behaviour. Second, our experimental findings are also relevant to the growing literature on firm and consumer interactions in markets with shrouded attributes (Gabaix and Laibson, 2006; Veiga and Weyl, 2016; Heidhues et al., 2017). The predictions of these models rely on particular assumptions about the heterogeneity of attention to the shrouded attributes, as well as how the inattention depends on the size of the shrouded attribute. Our estimates can thus help guide the quantitative predictions of these models.5 Third, our work contributes to the literature on boundedly rational value computation (see $$e.g.$$Gabaix, 2014; Woodford, 2012; Caplin and Dean, 2015a; Chetty, 2012). To the best of our knowledge, our result that consumers under-react less to higher tax rates provides one of the first experimental demonstrations in a naturalistic setting of imperfect processing of a financial attribute responding to economic incentives.6 The article proceeds as follows. Section 2 presents our theoretical framework. Section 3 presents our experimental design. Section 4 quantifies average under-reaction across different taxes, while Section 5 quantifies the variance of under-reaction across consumers. Section 6 utilizes our theoretical framework to discuss the welfare implications of our empirical estimates. Section 7 concludes. 2. Theory This section analyses the tax policy implications of variation in consumers’ inattention to or misunderstanding of tax instruments. Specifically, we generalize Harberger’s (1964) canonical formulas for the efficiency costs of taxation, as well as CLK’s formulas for the case of homogeneous consumers. The formulas we develop transparently highlight the importance of accounting for the variation of mistakes across both consumers and tax sizes. The results can be immediately applied to questions about optimal Ramsey or Pigouvian taxes—which we summarize in Section 2.5 and elaborate on in Online Appendix B—and also apply more broadly to consideration of any kind of imperfectly understood policy instrument. All proofs are contained in Online Appendix C. 2.1. Set-up 2.1.1. Consumer and producer behaviour Consumers: There is a unit mass of consumers who have unit demand for a good $$x$$ and spend their remaining income on an untaxed composite good $$y$$ (the numeraire). A person’s utility is given by $$u(y)+vx$$, where $$x\in\{0,1\}$$ denotes whether or not the good is purchased, and $$v$$ is the person’s utility from $$x$$. Let $$Z$$ denote the budget (assumed identical across consumers), $$p$$ the posted price of the product, and $$t$$ the tax set by the policymaker.7 We assume throughout that $$Z>>p+t$$. A fully optimizing consumer chooses $$x=1$$ if and only if $$u(Z-p-t)+v\geq u(Z)$$. However, we allow consumers to not process the tax fully. Instead, a consumer chooses $$x=1$$ when $$u(Z-p-\theta t)+v\geq u(Z)$$, where $$\theta$$—which may covary with $$v$$ or be endogenous to $$t$$—denotes how much the consumer under- (or over-) reacts to the tax.8 Because we make minimal assumptions about the distribution of $$\theta$$, this modelling approach encompasses a number of psychological biases that may lead consumers to make mistakes in incorporating the sales tax into their decisions. These include: Exogenous inattention to the tax, so that consumers always react to the tax as if it’s a constant fraction $$\theta$$ of its size (Gabaix and Laibson, 2006; DellaVigna, 2009). Endogenous inattention to the tax, or boundedly rational processing more broadly, so that consumers pay more attention to higher taxes (Chetty et al., 2007; Gabaix, 2014). Incorrect beliefs, where a person perceives a tax $$t$$ as $$\hat{t}$$. In this case, $$\theta=\hat{t}/t$$. Rounding heuristics. Forgetting about the tax. Any combination of the above biases. In practice, multiple mechanisms are likely to be in play. Existing data provides little guidance on which mechanisms are the most important (CLK) or on the shape of the distribution of $$\theta$$. Gabaix’s (2014) anchoring and adjustment model of attention, for example, predicts that each consumer will have a $$\theta\in[0,1)$$, with that value depending on the size of the tax. Other theories of inattention may predict binary attention $$\theta\in\{0,1\}$$. Incorrect beliefs and rounding heuristics can generate a variety of different values of $$\theta$$, with instances in which $$\theta>1$$. We develop our theoretical and empirical framework to be robust to all of these possible mechanisms. Instead of defining $$\theta$$ in relation to a specific mechanism, we define it by the behaviour that these mechanisms generate: a difference in willingness to pay depending on the presence of a tax. For a given consumer, define $$p_{max}(t)$$ to be the highest posted price at which the consumer would purchase $$x$$ at a tax $$t$$. Then $$\theta:=\frac{p_{max}(0)-p_{max}(t)}{t}$$. We make no assumptions about the relation between $$\theta$$ and $$v$$ other than that their joint distribution $$F_{t}(v,\theta)$$ generates smooth, downward-sloping aggregate demand curves,9 that $$\theta\geq0$$ and is bounded, and that the marginal distribution of $$v$$ does not depend on $$t$$. By allowing the distribution of $$\theta$$ to depend on $$t$$ we capture the possibility that attention to taxes may depend on the tax rate. With minor abuse of notation, we define $$E[\theta|p,t]$$ and $$Var[\theta|p,t]$$ to be the mean and variance of $$\theta$$ of consumers who are indifferent about purchasing the product at $$(p,t)$$. We let $$D(p,t)$$ denote aggregate demand for $$x$$ as a function of posted price $$p$$ and sales tax $$t$$. We let $$D_{p}$$ and $$D_{t}$$ denote partial derivatives with respect to the $$p$$ and $$t$$, and we let $$\varepsilon_{D,p}(p,t)=-D_{p}(p,t)\frac{p+t}{D(p,t)}$$ and $$\varepsilon_{D,t}(p,t)=-D_{t}(p,t)\frac{p+t}{D(p,t)}$$ denote the elasticities with respect to $$p$$ and $$t$$. We often suppress the arguments $$p,t$$ in the elasticity to economize on notation. To focus our analysis on mistakes arising solely from incorrect reactions to the sales tax, we assume that (1) in the absence of taxes, consumers optimize perfectly and (2) consumers’ utility depends only on the final consumption bundle $$(x,y)$$.10 Welfare analysis under these two assumptions and our choice-based definition of $$\theta$$ is an application of Bernheim and Rangel’s (2009) approach to welfare analysis: we view choice in the presence of taxes as provisionally suspect, and we use consumer choice in the absence of taxes as the welfare-relevant frame. We relax the first assumption in Online Appendix B, following models such as those in Lockwood and Taubinsky (2017) and Farhi and Gabaix (2015). Producers: We define production identically to CLK: price-taking firms use $$c(S)$$ units of the numeraire $$y$$ to produce $$S$$ units of $$x$$. The marginal cost of production is weakly increasing: $$c'(S)>0$$ and $$c''(S)\geq0$$. The representative firm’s profit at pretax price $$p$$ and level of supply $$S$$ is $$pS-c(S)$$. Producers optimize perfectly so that the supply function for good $$x$$ is implicitly defined by the marginal condition $$p=c'(S(p))$$. Let $$\varepsilon_{S,p}=-\frac{\partial S}{\partial p}\frac{p}{S(p)}$$ denote the price elasticity of supply. We define $$\varepsilon_{D,t}^{TOT}=-\frac{d}{dt}D(p,t)\cdot\frac{p+t}{D}$$ to be the total percentage change in equilibrium demand (taking into account changes in producer prices) caused by a 1% change in the tax.11 2.1.2. Efficiency cost of taxation We follow Auerbach (1985) in defining the excess burden of a tax for a market with heterogeneous consumers. We let $$x_{i}^{*}(p,t,Z)$$ denote consumer $$i$$’s choice of $$x\in\{0,1\}$$ and we let $$V_{i}(p,t,Z)=u(y-px_{i}^{*}(p,t,Z)-tx_{i}^{*}(p,t,Z))+v_{i}x_{i}^{*}(p,t,Z)$$ denote the consumer’s indirect utility function. We denote the consumer’s expenditure function by $$e_{i}(p,t,V)$$, which is the minimum wealth necessary to attain utility $$V$$ under a price $$p$$ and tax $$t$$. Let $$R_{i}(t,Z)=tx_{i}^{*}$$ denote the revenue collected from this consumer. Excess burden is given by: \[ EB=\int_{i}\left[Z-e(p_{0},0,V_{i}(p(t),t,Z))-R_{i}(t,Z)\right]+\pi_{0}-\pi_{1} \] where $$\pi_{0}-\pi_{1}$$ is the change in producer profits, $$p_{0}$$ is the equilibrium market price in the absence of taxes, and $$p(t)$$ is the equilibrium price at tax $$t$$. That is, excess burden is the sum of the change in consumer surplus and producer surplus minus government revenue. With quasilinear utility and fixed producer prices ($$i.e.$$ perfectly elastic supply), this is simply $$\int_{i}(v_{i}-p_{0})(x_{i}^{*}(p_{0},t)-x_{i}^{*}(p_{0},0)$$: the loss in surplus that accrues from discouraging transactions in which the value of the product $$v$$ exceeds its marginal cost of production. To clarify the key determinants of total excess burden, we write it as a function of two arguments, $$t$$ and $$F_{t}$$, to clarify its dependence on both the tax and the distribution of $$\theta$$. The efficiency costs of increasing a tax from $$t_{1}$$ to $$t_{2}$$ can be decomposed into two effects: \begin{equation} EB(t_{2},F_{t_{2}})-EB(t_{1},F_{t_{1}})=\underbrace{\left[EB(t_{2},F_{t_{2}})-EB(t_{1},F_{t_{2}})\right]}_{\text{Direct distortion effects}}+\underbrace{\left[EB(t_{1},F_{t_{2}})-EB(t_{1},F_{t_{1}})\right]}_{\text{"Nudge channel" distortion effects}} \end{equation} (1) The first effect corresponds to the direct distortionary effect of the tax, holding the distribution of bias constant. The second effect is the indirect effect that a tax has on excess burden by altering the distribution of consumer bias. The second effect can be understood more broadly as the efficiency costs of a nudge that changes the distribution of consumer bias. To provide a clear exposition of the economics of each of these two effects, we study the two effects in isolation before combining them into one formula. 2.2. Direct efficiency costs For the results presented in the body of the article, we assume that $$u$$ is linear ($$i.e.$$ no income effects are present), but we discuss the implications of income effects at the end of the section, and in more detail in Online Appendix A.2. Proposition 1. Suppose that $$F_{t}$$ does not depend on $$t$$. Let $$p(t)$$ denote the equilibrium price as a function of $$t$$. Then \begin{eqnarray} \frac{d}{dt}EB(t,F_{t}) & = & -E[\theta|p,t]t\frac{d}{dt}D(p(t),t)-Var[\theta|p,t]tD_{p}(p(t),t)\nonumber \\ & = & E[\theta|p,t]tD(p(t),t)\frac{\varepsilon_{D,t}^{TOT}}{p(t)+t}+\frac{Var[\theta|p,t]}{E[\theta|p,t]}tD(p(t),t)\frac{\varepsilon_{D,t}}{p(t)+t} \end{eqnarray} (2) Proposition 1 provides a general formula for the (direct) excess burden of a small tax $$t$$ when consumers are arbitrarily heterogeneous. When $$Var[\theta|p,t]=0$$, the formula reduces to the formula provided in CLK, which shows that the excess burden of the tax is proportional to $$E[\theta|p,t]$$. In the simple framework without income effects, the more the consumers ignore the tax, the less the consumers are discouraged from purchasing the product because of the tax, and thus the smaller the excess burden. The formula, as written, does not feature the covariance between $$\theta$$ and $$v$$ or between $$\theta$$ and elasticities. However, we note that those covariances determine which consumers are on the margin, and are thus incorporated into our $$E[\theta|p,t]$$ and $$Var[\theta|p,t]$$ terms. The general formula illustrates that it is not just how much people under-react to the tax on average that matters, but also the variance of marginal consumers’ under-reactions. To take a stark example, suppose that $$E[\theta]=0.25$$ for consumers on the margin. When all consumers are homogeneous with $$\theta=0.25$$, equation (2) shows that the excess burden from a marginal increase in the tax is $$(0.25)tD(p,t)\frac{\varepsilon_{D,t}^{TOT}}{p+t}$$; that is, the true excess burden is one-quarter of what the neoclassical analyst would compute using the tax elasticity of demand. Now, suppose that 25% of the marginal consumers have $$\theta=1$$ while 75% have $$\theta=0$$, so that $$E[\theta]=0.25$$ and $$Var[\theta]=(0.75)(0.25)$$. In this case, we still have $$E[\theta]=0.25$$, but equation (2) implies that the excess burden is now at least$$tD(p,t)\frac{\varepsilon_{D,t}^{TOT}}{p+t}$$, since $$\varepsilon_{D,t}\geq\varepsilon_{D,t}^{TOT}$$. Interestingly, this is greater than or equal to the inference that would be made by an analyst who assumes that consumers optimize perfectly and thus uses the tax elasticity of demand as a sufficient statistic for calculating excess burden. The intuition for this result is that heterogeneity in consumers’ mistakes creates a market failure that is conceptually distinct from the effect of a homogeneous mistake. If consumers are homogeneous in their under-reaction to the tax, then for any quantity of products purchased, the allocation of products to consumers is efficient: the product is still purchased by consumers who derive the most value from it. When consumers are heterogeneous in their under-reaction, however, there is misallocation: the consumers purchasing the product are now not just the consumers who derive the most value from it, but also consumers who under-react to taxes the most. There is thus an additional efficiency cost from an inefficient match between consumers and products.12 Another important insight from Proposition 1 is that the efficiency costs arising from misallocation depend on the elasticity of the demand curve, rather than on the elasticity of the equilibrium quantity of $$x$$ in the market. Thus, measurement of (changes of) the equilibrium quantity is not sufficient to calculate efficiency costs, even when combined with estimates of average under-reaction—this is in stark contrast to standard efficiency cost of taxation results, as well as Chetty (2009)’s results that allow endogenous producer prices but assume homogeneous under-reaction. This is most clear in the case of inelastic supply: Corollary 1. Suppose that supply is inelastic $$(\varepsilon_{S,p}=0)$$ and that $$F_{t}$$ does not depend on $$t$$. Then \[ \frac{d}{dt}EB=\frac{Var(\theta|p,t)}{E(\theta|p,t)}tD(p(t),t)\frac{\varepsilon_{D,t}}{p(t)+t} \] Corollary 1 shows that when supply is inelastic—and thus the equilibrium quantity produced by the market does not change—the excess burden of a small tax $$t$$ depends only on the variance of bias and the price elasticity of demand. Intuitively, this is because all of the efficiency cost is generated by misallocation, the extent of which is proportional to the variance of $$\theta$$—which quantifies the extent of individual differences—and the price elasticity of demand—which determines how much the individual differences translate to different purchase decisions. That efficiency costs can be significant even when supply is inelastic is in sharp contrast to standard results in public finance that efficiency costs should be zero if taxes do not distort the equilibrium quantity. More generally, the results imply that when consumers are heterogeneous in their under-reaction, efficiency costs will be significantly higher than in the standard model when supply is relatively inelastic compared to demand.13 The formula in Proposition 1 can also be used to extend the classic Harberger (1964) second-order approximations of the efficiency costs of taxation. We begin by quantifying the efficiency costs of introducing a small tax $$t$$ into a previously untaxed market. Although Proposition 1 characterizes only direct efficiency costs, it can be used to provide a complete characterization of the excess burden of introducing a small tax $$t$$ in a previously untaxed market. Because the nudge channel distortion effect is irrelevant when there are no pre-existing taxes (as per equation 1, $$EB(0,F_{t})-EB(0,F_{0})=0$$), in this case the only relevant efficiency costs are the direct efficiency costs. We thus have: Proposition 2. The excess burden of imposing a small tax (so terms of order $$t^{3}$$ or higher are negligible) in a previously untaxed market is \[ EB(t,F_{t})\approx\frac{1}{2}t^{2}D\left[E[\theta|p,t]\frac{\varepsilon_{D,t}^{TOT}}{p(t)+t}+Var[\theta|p,t]\frac{\varepsilon_{D,p}}{p(t)+t}\right] \] The nudge distortion channel is not irrelevant when there are pre-existing taxes, but we now use Proposition 1 to characterize the direct efficiency costs of increasing pre-existing taxes. We maintain the standard assumptions of the “Harberger Trapezoid” formula (Harberger, 1964) that for all $$k\geq2$$, the terms $$t(\Delta t)^{k}D_{pp}$$, $$t(\Delta t)^{k}S_{pp}$$, $$(\Delta t)^{k+1}$$ are negligible. This assumption corresponds to cases in which the demand and supply curves are approximately linear, to cases in which both the pre-existing tax $$t$$ and the change $$\Delta t$$ are sufficiently small, or a suitable combination of the two. We also introduce one more technical assumption about smoothness in the family of conditional distributions $$F(v|\theta)$$: Assumption A. For each $$\theta$$ in the support of the distribution $$F$$, the conditional distribution $$F(v|\theta)$$ has a differentiable density function. Proposition 3. Suppose that $$F_{t_{1}}=F_{t_{2}}\equiv F$$ for $$t_{2}=t_{1}+\Delta t$$. Then, if for all $$k\geq2$$ the terms $$t(\Delta t)^{k}D_{pp}$$, $$t(\Delta t)^{k}S_{pp}$$, $$(\Delta t)^{k+1}$$ are negligible, and if assumption A holds, the excess burden of increasing the tax from $$t_{1}$$ to $$t_{2}$$ is \begin{eqnarray*} EB(t_{2},F)-EB(t_{1},F) & \approx &- \left(t_{1}\Delta t+\frac{(\Delta t)^{2}}{2}\right)\left(E[\theta|p(t_{1}),t_{1}]\frac{d}{dt}D(p(t),t)|_{t=t_{1}}\right.\nonumber\\ &&\left.\quad +Var[\theta|p(t_{1}),t_{1}]D_{p}(p(t_{1}),t_{1})\vphantom{\frac{d}{dt}}\right)\\ & = & \left(t_{1}\Delta t+\frac{(\Delta t)^{2}}{2}\right)\frac{D(p(t_{1}),t_{1})}{p(t_{1})+t_{1}}\left(E[\theta|p(t_{1}),t_{1}]\varepsilon_{D,t}^{TOT}+Var[\theta|p(t_{1}),t_{1}]\varepsilon_{D,p}\right) \end{eqnarray*} Like Proposition 1, Proposition 3 shows that the standard formula is modified in two ways. First, the change in the equilibrium quantity, $$\frac{d}{dt}D(p(t),t)|_{t=t_{1}}$$, is now multiplied by the average $$\theta$$ of marginal consumers. Second, increasing taxes increases misallocation of products to consumers, which leads to a new term given by the product of the variance of $$\theta$$ and the price elasticity of demand. 2.3. Indirect efficiency costs: the consequences of debiasing In this section, we provide “Harberger-type” formulas for the efficiency costs (or benefits) of changing the distribution of $$\theta$$. We keep the tax fixed, and we consider a family of distributions $$F_{n}(\theta,v)$$ that are smooth functions of $$n$$ for all $$\theta,v$$. We think of $$n$$ as the “nudge parameter”, and we ask how the excess burden of a tax changes as we shift this parameter by a small amount from $$n$$ to $$n+\Delta n$$. The formulas here serve as an intermediate step to the final formulas that we derive in Section 2.4, but we also view them to be of independent interest as a novel extension of the standard public finance toolbox. We provide results under two additional assumptions: Assumption B. $$F_{n}(h(\theta,n),v)=F_{0}(\theta,v)$$, where $$h$$ is differentiable in $$\theta$$ and $$n$$, and $$\frac{\partial}{\partial n}h$$ is bounded. Assumption C. The terms $$t^{k+1}\frac{\partial^{k}}{\partial p^{k}}D$$ are negligible for all $$k\geq2$$. Assumption B requires that the nudge smoothly changes the distribution of $$\theta$$. Assumption C is a variation of the standard Harberger formula assumption that the term $$t(\Delta t)^{k}D_{pp}$$ is negligible, but is a slightly stronger requirement on how small $$t$$ or $$D_{pp}$$ needs to be. To appreciate the need for placing additional structure on the distributions, consider the difficulty of generally estimating efficiency costs in the seemingly simple case in which $$\theta$$ takes on just two possible values, $$\theta_{1}$$ and $$\theta_{2}$$, and is distributed independently of $$v$$. Let $$EB_{i}(t)$$ denote the excess burden arising from the type $$\theta_{i}$$ consumers. The efficiency cost of increasing the measure of type $$\theta_{2}$$ consumers by some small amount $$dn$$ is then $$(EB_{2}(t)-EB_{1}(t))dn$$. But if $$t$$ is not small and the demand curve of each $$\theta$$ is highly nonlinear so that each $$\theta$$ type’s price elasticity is different, we have no way of quantifying $$EB_{2}(t)-EB_{1}(t)$$ in terms of observables. Further structure is needed to relate the demand curves of the different $$\theta$$ types in terms of observables. The additional structure provided by Assumptions B and C essentially ensures a good fit from a quadratic approximation for the efficiency costs corresponding to each $$\theta$$ type, and that the price elasticities of demand are not too different across the $$\theta$$ types. For the results in this section, we let $$D^{F_{n}}$$ denote the demand curve under $$F_{n}$$ and let $$E_{F_{n}}$$ denote the expectation operator with respect to $$F_{n}$$. To simplify exposition, we will also assume that producer prices are fixed. Proposition 4. Suppose that producer prices are fixed $$(\varepsilon_{S,p}=\infty)$$, and that Assumptions A–C are satisfied. Then $$\frac{d}{dn}EB(t,F_{n})\approx-\frac{d}{dn}\left(E_{F_{n}}[\theta^{2}|p,t]\right)\frac{t^{2}}{2}D_{p}^{F_{n}}$$. If for all $$k\geq 3$$ the terms $$(\Delta n)^k$$ are negligible then \[ EB(t,F_{n+\Delta n})-EB(t,F_{n})\approx-\frac{1}{2}t^{2}\left(E_{F_{n+\Delta n}}[\theta^{2}|p,t]-E_{F_{n}}[\theta^{2}|p,t]\right)D_{p}^{F_{n}} \] The intuition behind Proposition 4 is straightforward. As we have already established, efficiency costs depend on both the mean and the variance of $$\theta$$. Consequently, the welfare impacts of a nudge should correspond to how the nudge impacts the mean and variance of $$\theta$$. This is exactly the result of Proposition 4, as $$E[\theta^{2}|p,t]=E[\theta|p,t]^{2}+Var[\theta|p,t]$$. 2.4. Total efficiency costs We now combine our results from Sections 2.2 and 2.3 to quantify the total efficiency costs of taxation. As in Section 2.3, we assume fixed producer prices to simplify exposition. Proposition 5. Consider two taxes $$t_{1}$$ and $$t_{2}=t_{1}+\Delta t$$. Suppose that producer prices are fixed $$(\varepsilon_{S,p}=\infty)$$ and that Assumptions A–C are satisfied for the family of distributions $$F_{t}$$ indexed by the tax $$t$$. Suppose also that for $$k\geq2$$, the terms $$t(\Delta t)^{k}D_{pp}$$ and $$(\Delta t)^{k}$$ are negligible. Then \begin{eqnarray} EB(t_2, F_{t_2})-EB(t_1, F_{t_1}) & \approx & -\left(t_{1}(\Delta t)+\frac{(\Delta t)^{2}}{2}\right)\left(E[\theta|p,t_{2}]^{2}+Var[\theta|p,t_{2}]\right)D_{p}\\ \end{eqnarray} (3) \begin{eqnarray} & - & \left(\frac{t_{1}^{2}}{2}\right)\left(E[\theta^{2}|p,t_{2}]-E[\theta^{2}|p,t_{1}]\right)D_{p} \end{eqnarray} (4) Proposition 5 is essentially a combination of our earlier results about the direct efficiency costs of a tax and our results about the efficiency costs of a nudge. Equation (3) corresponds to the direct efficiency costs (as in Proposition 3), while (4) corresponds to the nudge channel efficiency costs (as in Proposition 4). The formula in Proposition 5 is written in its most compact form using the price elasticity of demand. One might be tempted to think that using tax elasticities could eliminate additional terms corresponding to costs of debiasing, since the tax elasticity captures both the direct and indirect effects that increasing a tax has on demand. However, simply using the tax-elasticity version of the direct efficiency costs formula in Proposition 3 will still not account for all of the efficiency costs, because it is not just the change in demand that matters, but also how the valuations $$v$$ of the marginal types change. We clarify in the corollary below. Corollary 2. Under the assumptions of Proposition 5, and the assumption that the approximations $$E[\theta|p,t_{2}]-E[\theta|p,t_{1}]\approx\Delta t\frac{d}{dt}E[\theta|p,t]|_{t=t_{1}}$$ and $$Var[\theta|p,t_{2}]-Var[\theta|p,t_{1}]\approx\Delta t\frac{d}{dt}Var[\theta|p,t]|_{t=t_{1}}$$ are valid, efficiency costs can also be expressed as \begin{eqnarray*} && EB(t_2, F_{t_2})-EB(t_1, F_{t_1}) \approx \nonumber\\ &&\quad \frac{\left(t_{1}(\Delta t)+\frac{(\Delta t)^{2}}{2}\right)D}{p+t_{1}}\left(\frac{E[\theta|p,t_{1}]+E[\theta|p,t_{2}]}{2}\varepsilon_{D,t}+\frac{Var[\theta|p,t_{1}]+Var[\theta|p,t_{2}]}{2}\varepsilon_{D,p}\right)\\ &&\quad + \frac{1}{2}t_1 (\Delta t+t_1) \frac{D}{p+t_{1}}\left(Var[\theta|p,t_{2}]-Var[\theta|p,t_{1}]\right)\varepsilon_{D,p}\\ &&\quad + \frac{t_1(\Delta t)}{4}\frac{D}{p+t_{1}}\left(E[\theta|p, t_2]^2-E[\theta|p, t_1]^2 \right)\varepsilon_{D,p} \end{eqnarray*} To illustrate the formula in the corollary, suppose that $$\theta$$ is homogeneous, so that $$Var[\theta|p,t]=0$$. In this case, efficiency costs are not simply given by $$\left(t_{1}(\Delta t)+(\Delta t)^{2}/2\right)\frac{D}{p+t_{1}}E[\theta|p,t_{1}]\varepsilon_{D,t}$$, as would be prescribed by Proposition 3. There are additional efficiency costs, arising from the nudge effect, given by $$\frac{t_1(\Delta t)}{4}\frac{D}{p+t_{1}}\left(E[\theta|p,t_{2}]^{2}-E[\theta|p,t_{1}]^{2}\right)\varepsilon_{D,p}$$. In the simple case of $$Var[\theta|p,t]=0$$, these additional efficiency costs correspond to the fact that the value of the product to the marginal consumer under $$t_{2}$$ is not simply $$p+E[\theta|p,t_{1}](t_{1}+\Delta t)$$, as it would be if taxes did not change under-reaction, but is instead $$p+E[\theta|p,t_{2}](t_{1}+\Delta t)$$. That is, in contrast to the standard model, the value of the product to the marginal consumer is a convex, rather than a linear function of the tax when $$E[\theta|p,t]$$ is increasing in $$t$$. 2.5. Extensions and optimal tax implications Optimal ramsey and pigouvian taxes: The formulas we present for quantifying how changes in the tax affect welfare or excess burden have direct implications for optimal taxes. In Online Appendix B, we derive optimal tax formulas in a Ramsey framework, using a more general model that allows for other market frictions arising from either externalities or other imperfections in consumer choice ($$i.e.$$ the possibility that consumers misoptimize even in the absence of taxes or that they spend their remaining income suboptimally on the composite untaxed good). In formalizing the implications of our excess burden calculations for optimal taxes, the results in the Appendix generate several new insights. First, when there are no other market frictions and taxes are used only to meet a fixed revenue requirement, the optimal tax system may deviate from the canonical Ramsey inverse elasticity rule in several ways. If people under-react less to taxes on more expensive products, that implies that other things equal, the tax rates on bigger ticket items should be smaller. Holding product prices constant, the inverse elasticity rule is also dampened if $$\theta$$ is on average increasing in the tax. This is because increasing taxes increases deadweight loss through the additional debiasing channel.14 Second, we characterize how taxes depend on other market imperfections, and consider whether a less salient tax is optimal for the policymaker, building on the analysis in Farhi and Gabaix (2015). When there is no variation in $$\theta$$, under-reaction to the tax is always beneficial, even in the presence of externalities (or internalities). Because the consumers who buy the product are still those who value it the most, any not-fully-salient tax can still be set high enough to achieve the socially optimal consumption of $$x$$. With variation in $$\theta$$, however, the more salient tax is better if the externality is sufficiently large relative to the value of public funds. This is because introducing a not-fully-salient tax causes misallocation and therefore cannot achieve the socially optimal consumption of $$x$$. Our general message about the importance of taking into account the misallocation arising from heterogeneity in $$\theta$$ is thus particularly relevant in the presence of other market frictions. Income effects: We have thus far assumed that $$u(y)$$ is linear, imposing an absence of income effects. This is a reasonable assumption for small-ticket items for which $$p$$ and $$t$$ are small relative to income. Relaxing this assumption complicates our analyses, but follows the same principles as the baseline excess burden formula without income effects. As we show in Online Appendix A.2, the formulas we derive in the body of the article still hold in the presence of income effects when either (1) the taxed product is a small share of consumers’ expenditures or (2) the taxed product is purchased on a reasonably frequent basis, and the consumer can observe his budget in between the purchases. Thus, for common household commodities, we believe that our results hold robustly in the presence of income effects. However, for infrequent, large-ticket purchases there can still be efficiency costs when consumers ignore the tax fully. This can occur when a consumer spends more money than he realizes on the product in question, and then consumes inefficiently too little $$y$$ in the future after he is surprised by a smaller budget. For large-ticket purchases, this process of budget adjustment can become quantitatively important, and we note that this process is not incorporated into the analyses presented here. For related discussion, see Reck (2014). Distributional concerns: In Online Appendix A.3 we also extend our framework to incorporate distributional concerns. We show that with redistributive concerns, the relative regressivity costs of not-fully-salient sales taxes, as compared to fully salient sales taxes, are determined by how the mistakes—given by $$(\theta_{i}-1)^{2}$$ and reflecting either under- or over-reaction to the tax—covary across the income distribution. 2.6. Identification from aggregate demand data What kinds of data sets identify the statistics necessary for welfare analysis? CLK and Chetty (2009) show that for a representative consumer, the generalized demand curve $$D(p,t)$$ identifies excess burden when pre-existing taxes are small. Under these assumptions, $$\theta$$ is identified by the average degree of under-reaction to taxes relative to prices, $$D_{t}(p,t)/D_{p}(p,t)$$. In Online Appendix A.1 we prove two main results about identification of efficiency costs under more general assumptions. First, we focus on the case in which $$F(\theta|p,t)$$ is degenerate for all $$p,t$$, and show that when $$\theta$$ is endogenous to the tax rate, locally estimated elasticities no longer identify $$\theta$$ or excess burden, although full knowledge of $$D(p,t)$$ does. Intuitively, this is because the ratio of demand responses $$D_{t}/D_{p}$$ is roughly equal to $$E[\theta|p,t]+\frac{d}{dt}E[\theta|p,t]t$$, and thus identifies $$E[\theta|p,t]$$ only when the distribution of $$\theta$$ does not depend on $$t$$. Thus data sets containing only local variation in $$t$$ are not sufficient for questions about the efficiency costs of non-negligible increases in sales taxes. Second, we show that if $$\theta$$ can be heterogeneous, conditional on $$p$$ and $$t$$, then $$D(p,t)$$ can never identify the dispersion, and thus welfare. While the average $$\theta$$ is identified by $$D_{t}/D_{p}$$ for small taxes, the variance of $$\theta$$ is left completely unidentified. These results show that key questions about the variation of under-reaction to taxes cannot be identified from existing data sources. This motivates our experimental design. 3. Experimental Design Platform: The experiment was implemented through ClearVoice Research, a market research firm that maintains a large and demographically diverse panel of participants over the age of 18. This platform is frequently used by firms that ship products to consumers to elicit product ratings, but is additionally available to researchers for academic use (for other examples of research using this platform, see, $$e.g.$$Benjamin et al., 2014; Rees-Jones and Taubinsky, 2016). Two key features of this platform make it appropriate for our experimental design. First, ClearVoice provides samples that match the U.S. population on basic demographic characteristics. Second, ClearVoice maintains an infrastructure for easily shipping products to consumers, which facilitates an incentive-compatible online-shopping experiment. Overview:Figure 1 provides a synopsis of the experimental design. The design had four parts: (1) elicitation of residential information, (2) module 1 shopping decisions, (3) module 2 shopping decisions, and (4) end-of-study survey questions. The design is both within-subject—we vary tax rates for a given consumer between modules 1 and 2—and between-subject—consumers face different tax rates in module 1. Decisions are incentivized: study participants have a chance to receive a $20 shopping budget to actually enact their purchasing decisions, and ClearVoice ships any products purchased. Subjects retain any unspent portion of the budget. The within-subject aspect of the design increases statistical power and provides identification that is not possible from between-subject aggregate data. Figure 1 View largeDownload slide Experimental design Notes: This figure summarizes our experimental design. For full details, see the accompanying discussion in Section 3. Figure 1 View largeDownload slide Experimental design Notes: This figure summarizes our experimental design. For full details, see the accompanying discussion in Section 3. Each consumer was randomly assigned to one of three arms: (1) the “no-tax arm”, (2) the “standard-tax arm”, and (3) the “triple-tax arm”. The standard- and triple-tax arms were implemented to provide within-subject comparisons of purchasing decisions with and without taxes. The no-tax arm was implemented to identify any order effects on valuations over the course of the experiment and to help test for demand or anchoring effects.15 Each module consisted of a series of shopping decisions involving twenty common household products. In module 1, consumers made shopping decisions with either a zero tax rate (no-tax arm), a standard tax rate corresponding to their city of residence (standard-tax arm), or a tax rate equal to triple their standard tax rate (triple-tax arm). In module 2, consumers in all three arms made decisions in the absence of any sales taxes. The same twenty products were used in each module and in each arm of the experiment. The order in which the twenty products were presented was randomly determined, and independent between the two modules. Our experimental design involves language about the sales tax rate that study participants pay in their city of residence. To avoid confusion, we asked ClearVoice to only recruit panel members from states that have a positive sales tax. This excluded panel members from Alaska, Montana, Delaware, New Hampshire, and Oregon. The remaining forty-five states are all represented in our final sample. Prior to learning the details of the experiment, consumers were asked to report their state, county, and city of residence. To correctly determine the money spent in the experiment, this information was matched to a data set of tax rates in all cities in the U.S.16 This design is closely related to several recent experimental studies of tax salience ($$e.g.$$Feldman and Ruffle, 2015; Feldman et al., 2015), but differs in important ways. Our design combines within-subject manipulation of tax rates with a pricing mechanism that elicits full and precise demand curves. This design, combined with our unusually large sample size, allows us to infer the sufficient statistics of our general welfare formulas—an exercise not possible with previous experimental designs. Purchasing decisions: Each product appeared on a separate screen. For each product, consumers saw a picture and a product description drawn from Amazon.com. Consumers then used a slider to select the highest tag price at which they would be willing to purchase the product. It was explained that “The tag price is the price that you would find posted on an item as you walk down the aisle of the store; this is different from the final amount that you would pay when you check out at the register, which would be the tag price, plus any relevant sales taxes.” Figure 2 shows examples of the decision screen. Figure 2 View largeDownload slide Decision format Notes: Panel (a) shows an example of a pricing decision from modules where taxes apply. Consumers indicate the highest tag price at which they would buy the product. As in typical shopping environments—and as was explained in the experimental instructions—the final price that applies at “check out” is the tag price plus sales taxes. Panel (b) shows an example of a pricing decision from modules where taxes do not apply. As can be seen in the prompt, respondents are instructed to consider the case where no sales tax is added at the register. Figure 2 View largeDownload slide Decision format Notes: Panel (a) shows an example of a pricing decision from modules where taxes apply. Consumers indicate the highest tag price at which they would buy the product. As in typical shopping environments—and as was explained in the experimental instructions—the final price that applies at “check out” is the tag price plus sales taxes. Panel (b) shows an example of a pricing decision from modules where taxes do not apply. As can be seen in the prompt, respondents are instructed to consider the case where no sales tax is added at the register. If a study participant selected the highest price on the slider, $15, he was directed to an additional screen where he was asked a hypothetical free-response question about the highest tag price at which they would be willing to buy the product. The three different decision environments were described to consumers as follows: No-tax decision environment: In the no-tax decisions, consumers were told that “In contrast to what shopping is like at your local store, no sales tax will be added to the tag price at which you purchase a product.” It was explained that “You can imagine this to be like the case if there were no sales tax, or if sales tax were already included in the prices posted at a store.” As depicted in Figure 1, the no-tax decisions constituted the second module that consumers encountered in each experimental arm, and also the first module that consumers encountered in the no-tax arm. Standard-tax decision environment: For the standard-tax decisions, the instructions prior to decisions were that “The sales tax in this section of the study is the same as the standard sales tax that you pay (for standard nonexempt items) in your city of residence, [city], [state].” The standard-tax decisions constituted the first module of the standard-tax arm. Triple-tax decision environment: For the triple-tax decisions, the instructions prior to decisions were that “The sales tax in this section of the study is equal to triple the standard sales tax that you pay (for standard nonexempt items) in your city of residence, [city], [state].” The triple-tax decisions constituted the first module that consumers encountered in the triple-tax arm. To make this experimental shopping experience as close as possible to the normal shopping experience and to enable tests for incorrect beliefs, consumers were not told what tax rate applies in their city of residence. Once consumers read the instructions (and answered the comprehension questions), they were never reminded of the taxes again in the tax modules. In contrast, the no-tax modules emphasized the absence of taxes to ensure that choices in those models reflect consumers’ true willingness to pay for the products. Product selection: To arrive at the final list of twenty household products, we began with a list of seventy-five potential items in the $0–$15 price range compiled by a research assistant. From this list, we eliminated items that were tax exempt in at least one state. We then ran a pre-test with ClearVoice to elicit (hypothetical) willingness to pay for the items. We selected twenty items that had unimodal distributions of valuations and had the least censoring at $0 and $15. Online Appendix F lists the products, prices, and Amazon.com product descriptions that were displayed to study participants. Incentive compatibility: Decisions in the experiment were incentive compatible. All study participants who passed the necessary comprehension questions (described below) had a 1/3 chance of being selected to receive a $20 budget; accounting for the probability of failing the comprehension check, this chance was approximately 1/4. Participants were informed of this incentive structure prior to making any decisions, but they did not know if they received the budget until they completed the experiment. If they did not receive the budget, they simply received a compensation of $1.50 and no products from the study. Consumers who were selected to receive the $20 budget had one out of the forty decisions (from modules 1 and 2 combined) selected to be played out. Outcomes were determined using the Becker–DeGroot–Marshak (BDM) mechanism. A random tag price, between $0 and $15, was drawn. If the randomly generated price was below the maximum tag price the consumer was willing to pay, then the product was sold to the consumer at that tag price $$p$$, and a final amount of $$p(1+\tau)$$ (where $$\tau$$ is the experimentally induced tax rate) was subtracted from this consumer’s budget. The product was shipped to the consumer by ClearVoice, and the remainder of the budget was included in experimental compensation. Participants received a full explanation of the BDM mechanism, and were also told that it was in their best interest to always be honest about the highest tag price at which they would want to buy the product. Comprehension questions: It is important to ensure that study participants understand the experimental tax rate that applies to their decisions, so that the appearance of under-reaction is not generated by a simple failure to read experimental instructions. In both module 1 and module 2, we thus gave study participants a multiple-choice comprehension question designed to confirm their understanding of the applicable experimental tax rate. This question presents an item being purchased for a $5 tag price, and asks the respondent to choose the amount of money that would be deducted from their budget from several tag-price/tax combinations. In both modules, the quiz question appeared on the same screen as the instructions for that module. Subjects who fail these questions are generally excluded from our analyses; however, we demonstrate that our main analyses are robust to alternative treatments of these subjects in Section 4.7. Survey questions: After completing the main part of the experiment, study participants received a short set of questions eliciting household income, marital status, financial literacy, ability to compute taxes, and health habits. We discuss these questions in further detail in the analysis. ClearVoice also collects and shares various demographic information on its panel members, including educational attainment, occupation, age, sex, and ethnicity. We report these basic demographics in Section 4.1. 4. Quantifying under-reaction Across Different Tax Sizes 4.1. Sample selection, demographics, and balance In this section, we discuss the creation of our final sample for analysis. We then analyse the demographic properties and balance of that sample. A total of 4,328 consumers completed the experiment. For our primary analyses, we restrict our sample to the 3,066 respondents who correctly answered the instruction-comprehension questions regarding the tax rate that applied in both module 1 and module 2. Unsurprisingly, the 29% of consumers who failed these comprehension questions do not react to the differences in taxes across conditions. Thus, while these respondents would contribute to evidence of under-reaction to sales taxes, we believe the misoptimization these consumers exhibit is likely due to misunderstanding of our experimental manipulation. This type of misunderstanding is conceptually distinct from misunderstanding a given tax rate and is not the object of interest in our theoretical analysis.17 Out of the remaining 3,066 consumers, thirty consumers were not willing to buy at any positive price in at least one of their decisions. Because our primary estimates are formed using the logarithm of the ratio of module 1 and module 2 prices, we cannot use at least one observation for each of these thirty consumers. We thus exclude them from analysis as well. We additionally exclude ten consumers who reported living in a state with no sales tax.18 In part due to our pre-test for product selection, only 0.9% of all responses were censored at $15. For responses that were censored, we use consumers’ uncensored responses to the hypothetical question about the maximum tag price. However, this question did not force a response, and twenty-eight consumers did not provide an answer to this question upon encountering it. We exclude these consumers as well, leaving us with a final sample size of 2,998. Table 1 presents a summary of the demographics of our final sample. All participants in the final sample are over the age of 18, and all but thirty-one participants are over the age of 21. Experimental recruitment was targeted to generate a final sample approximating the gender, income, and age distribution of the U.S. As a result, our sample—which is 48% male, has a median income of $50,000, and average age of 50—is similar to the U.S. population on these basic demographics. Despite this favourable comparison, we caution the reader that the nature of recruitment into the ClearVoice panel likely induces selection on unobservable characteristics. Table 1 Demographics by experimental arm All No tax Std. tax Triple tax $$F$$-test $$p$$-value Age 50.49 50.80 50.43 50.20 0.66 (14.63) (14.28) (14.84) (14.82) Household income ($1,000s) 63.04 61.86 63.67 63.78 0.68 (56.29) (55.00) (58.21) (55.77) Household size 2.40 2.40 2.38 2.42 0.86 (1.52) (1.60) (1.50) (1.46) Married 0.34 0.33 0.36 0.34 0.35 Male 0.48 0.47 0.51 0.47 0.15 Education Highschool degree or higher 0.95 0.94 0.95 0.95 0.74 College degree or higher 0.41 0.41 0.40 0.40 0.84 Post-graduate education 0.17 0.18 0.16 0.16 0.49 Ethnicity Asian 0.03 0.03 0.04 0.03 0.88 Caucasian 0.77 0.76 0.77 0.78 0.57 Hispanic 0.03 0.04 0.03 0.03 0.47 African American 0.07 0.08 0.07 0.08 0.83 Other 0.02 0.03 0.02 0.02 0.39 Tax rate in city of residence 7.32 7.36 7.31 7.30 0.36 (1.15) (1.16) (1.13) (1.15) $$N$$ (Final sample) 2,998 1,102 982 914 Comprehension test pass rate (%) 71 78 70 65 <$$0.01$$ All No tax Std. tax Triple tax $$F$$-test $$p$$-value Age 50.49 50.80 50.43 50.20 0.66 (14.63) (14.28) (14.84) (14.82) Household income ($1,000s) 63.04 61.86 63.67 63.78 0.68 (56.29) (55.00) (58.21) (55.77) Household size 2.40 2.40 2.38 2.42 0.86 (1.52) (1.60) (1.50) (1.46) Married 0.34 0.33 0.36 0.34 0.35 Male 0.48 0.47 0.51 0.47 0.15 Education Highschool degree or higher 0.95 0.94 0.95 0.95 0.74 College degree or higher 0.41 0.41 0.40 0.40 0.84 Post-graduate education 0.17 0.18 0.16 0.16 0.49 Ethnicity Asian 0.03 0.03 0.04 0.03 0.88 Caucasian 0.77 0.76 0.77 0.78 0.57 Hispanic 0.03 0.04 0.03 0.03 0.47 African American 0.07 0.08 0.07 0.08 0.83 Other 0.02 0.03 0.02 0.02 0.39 Tax rate in city of residence 7.32 7.36 7.31 7.30 0.36 (1.15) (1.16) (1.13) (1.15) $$N$$ (Final sample) 2,998 1,102 982 914 Comprehension test pass rate (%) 71 78 70 65 <$$0.01$$ Notes: This table presents the means and standard deviations of demographic variables in each of the three arms in our final sample. To test whether each characteristic is equally distributed across arms, we regress that characteristic on dummies for arms of the study, using OLS with robust standard errors, and report the $$F$$-test $$p$$-value for equality across arms. Omnibus tests also show that there are no significant differences in demographics between Arm 1 versus Arm 2 ($$F$$-test, $$p=0.49$$), Arm 2 versus Arm 3 ($$F$$-test, $$p=0.94$$), or Arm 1 versus Arm 3 ($$F$$-test, $$p=0.36$$). Table 1 Demographics by experimental arm All No tax Std. tax Triple tax $$F$$-test $$p$$-value Age 50.49 50.80 50.43 50.20 0.66 (14.63) (14.28) (14.84) (14.82) Household income ($1,000s) 63.04 61.86 63.67 63.78 0.68 (56.29) (55.00) (58.21) (55.77) Household size 2.40 2.40 2.38 2.42 0.86 (1.52) (1.60) (1.50) (1.46) Married 0.34 0.33 0.36 0.34 0.35 Male 0.48 0.47 0.51 0.47 0.15 Education Highschool degree or higher 0.95 0.94 0.95 0.95 0.74 College degree or higher 0.41 0.41 0.40 0.40 0.84 Post-graduate education 0.17 0.18 0.16 0.16 0.49 Ethnicity Asian 0.03 0.03 0.04 0.03 0.88 Caucasian 0.77 0.76 0.77 0.78 0.57 Hispanic 0.03 0.04 0.03 0.03 0.47 African American 0.07 0.08 0.07 0.08 0.83 Other 0.02 0.03 0.02 0.02 0.39 Tax rate in city of residence 7.32 7.36 7.31 7.30 0.36 (1.15) (1.16) (1.13) (1.15) $$N$$ (Final sample) 2,998 1,102 982 914 Comprehension test pass rate (%) 71 78 70 65 <$$0.01$$ All No tax Std. tax Triple tax $$F$$-test $$p$$-value Age 50.49 50.80 50.43 50.20 0.66 (14.63) (14.28) (14.84) (14.82) Household income ($1,000s) 63.04 61.86 63.67 63.78 0.68 (56.29) (55.00) (58.21) (55.77) Household size 2.40 2.40 2.38 2.42 0.86 (1.52) (1.60) (1.50) (1.46) Married 0.34 0.33 0.36 0.34 0.35 Male 0.48 0.47 0.51 0.47 0.15 Education Highschool degree or higher 0.95 0.94 0.95 0.95 0.74 College degree or higher 0.41 0.41 0.40 0.40 0.84 Post-graduate education 0.17 0.18 0.16 0.16 0.49 Ethnicity Asian 0.03 0.03 0.04 0.03 0.88 Caucasian 0.77 0.76 0.77 0.78 0.57 Hispanic 0.03 0.04 0.03 0.03 0.47 African American 0.07 0.08 0.07 0.08 0.83 Other 0.02 0.03 0.02 0.02 0.39 Tax rate in city of residence 7.32 7.36 7.31 7.30 0.36 (1.15) (1.16) (1.13) (1.15) $$N$$ (Final sample) 2,998 1,102 982 914 Comprehension test pass rate (%) 71 78 70 65 <$$0.01$$ Notes: This table presents the means and standard deviations of demographic variables in each of the three arms in our final sample. To test whether each characteristic is equally distributed across arms, we regress that characteristic on dummies for arms of the study, using OLS with robust standard errors, and report the $$F$$-test $$p$$-value for equality across arms. Omnibus tests also show that there are no significant differences in demographics between Arm 1 versus Arm 2 ($$F$$-test, $$p=0.49$$), Arm 2 versus Arm 3 ($$F$$-test, $$p=0.94$$), or Arm 1 versus Arm 3 ($$F$$-test, $$p=0.36$$). We find no evidence of selection on demographic covariates across experimental arms. We fail to reject the null hypotheses of equality of the demographics in Table 1 when comparing Arm 1 versus Arm 2 ($$F$$-test, $$p=0.49$$), Arm 2 versus Arm 3 ($$F$$-test, $$p=0.94$$), or Arm 1 versus Arm 3 ($$F$$-test, $$p=0.36$$). In contrast to the demographic results, there are statistically significant cross-arm differences in the likelihood that consumers pass the comprehension questions regarding the tax rate that apply in the experiment. The likelihoods of correctly answering both comprehension questions are 78%, 70%, and 65% in the no-tax, standard-tax, and triple-tax arms, respectively.19 The null hypothesis of equal pass rates is rejected for any pair of arms at the 5% significance level. The differential selection introduced by these differing pass rates introduces a potentially important confound to cross-arm inference. However, we will show in Section 4.7 that our primary results are robust to both worst-case assumptions about differential selection and to the reinclusion of those failing the test. 4.2. Summary of behaviour We begin with a graphical summary of the data. Figure 3 provides a summary of the demand curves as functions of before- or after-tax prices. To construct the figure, we start with demand curves $$D_{k}^{{\rm C},m}(p)$$ for each product $$k$$, where $$\text{C}\,\,{\in}\{{0x, 1x, 3x}\}$$ denotes the experimental arm, $$m$$ denotes the module, and $$p$$ the before-tax price. Because there are twenty products, we summarize the data by plotting the average demand curves$$D_{avg}^{\text{C},m}(p):=\frac{1}{20}\sum_{k}D_{k}^{\text{C},m}(p)$$ for each arm $$C$$ and module $$m$$. Figure 3 View largeDownload slide Average demand curves in the first and second stages of the experiment Notes: This figure plots demand curves from the first and second modules of the experiment, averaging across all twenty products. In the first stage, consumers face either no taxes, standard taxes, or triple their standard taxes. In the second stage, consumers in all three arms face no additional taxes. To construct the figures, we start with the demand curves, denoted $$D_{k}^{\text{C,}m}(p)$$, for each product $$k$$. $$\text{C}\,\,{\in}\{{0x, 1x, 3x}\}$$ denotes the no-tax, standard-tax, or triple-tax experimental arm, $$m$$ denotes the module (stage), and $$p$$ the relevant price. The average demand curves are calculated as $$D_{avg}^{\text{C},m}(p):=\frac{1}{20}\sum_{k}D_{k}^{\text{C},m}(p)$$. Panel (a) plots average demand as a function of the after-tax prices in module 1. For comparison, panel (b) plots the counterfactual average demand in module 1 that would be expected if consumers react to taxes fully. We reconstruct the demand curves by assuming that if a fraction $$D(p)$$ of consumers are willing to buy at price $$p$$ in the no-tax arm, then a fraction $$D\left(\frac{p}{1+\tau}\right)$$ of consumers are willing to buy at a (before-tax) price $$p$$ when facing tax rate $$\tau$$. Panel (c) plots demand as a function of the after-tax prices in module 1. Panel (d) plots demand as a function of the tax-free prices in module 2. Figure 3 View largeDownload slide Average demand curves in the first and second stages of the experiment Notes: This figure plots demand curves from the first and second modules of the experiment, averaging across all twenty products. In the first stage, consumers face either no taxes, standard taxes, or triple their standard taxes. In the second stage, consumers in all three arms face no additional taxes. To construct the figures, we start with the demand curves, denoted $$D_{k}^{\text{C,}m}(p)$$, for each product $$k$$. $$\text{C}\,\,{\in}\{{0x, 1x, 3x}\}$$ denotes the no-tax, standard-tax, or triple-tax experimental arm, $$m$$ denotes the module (stage), and $$p$$ the relevant price. The average demand curves are calculated as $$D_{avg}^{\text{C},m}(p):=\frac{1}{20}\sum_{k}D_{k}^{\text{C},m}(p)$$. Panel (a) plots average demand as a function of the after-tax prices in module 1. For comparison, panel (b) plots the counterfactual average demand in module 1 that would be expected if consumers react to taxes fully. We reconstruct the demand curves by assuming that if a fraction $$D(p)$$ of consumers are willing to buy at price $$p$$ in the no-tax arm, then a fraction $$D\left(\frac{p}{1+\tau}\right)$$ of consumers are willing to buy at a (before-tax) price $$p$$ when facing tax rate $$\tau$$. Panel (c) plots demand as a function of the after-tax prices in module 1. Panel (d) plots demand as a function of the tax-free prices in module 2. Panel (a) of Figure 3 shows that consumers do react to sales taxes in module 1, as their willingness to buy at a given before-tax price is decreasing in the size of the sales tax. However, as shown in panel (b), consumers do not react to taxes as much as perfect optimization would require. In this panel, we construct the demand curves that would be expected if consumers reacted to the taxes fully, and find substantially larger differences than those observed in panel (a). As demonstrated in panel (c), this discrepancy results in differences in demand curves across treatment arms when they are plotted as a function of after-tax price: consumers are willing to buy at higher final prices in the presence of larger taxes. While consumers in the different treatment arms behave differently in module 1, panel (d) shows that all treatment arms exhibit similar demand patterns in module 2. This pattern is confirmed by several statistical tests. For our first test, we compute an average pre-tax price $$\bar{p}^{i}=\frac{1}{20}\sum_{k}p^{ik}$$ for each consumer $$i$$, and then compare the distributions of $$\bar{p}^{i}$$. Kolmogorov–Smirnov tests find no differences in the $$\bar{p}_{i}$$ between the no-tax and standard-tax arms ($$p=0.73$$), between the no-tax and triple-tax arms ($$p=0.29$$), and between the standard-tax and triple-tax arms ($$p=0.50$$).20 OLS and quantile regressions comparing the average willingness to pay in module 2 across experimental arms similarly detect no differences (see Online Appendix E.1). Since all three treatment arms face the same no-tax environment in module 2, this similarity of demand behaviour is reassuring: it suggests that the willingness to pay elicited in module 2 is not contaminated by earlier cross-arm differences, as could arise in the presence of anchoring or demand effects.21 4.3. Econometric framework We now present our baseline econometric framework for studying how under-reaction to taxes varies by experimental condition and by observable characteristics. Let $$p_{1}^{ik}$$ be the highest tag price a subject $$i$$ is willing to pay in module 1 for product $$k$$, and define $$p_{2}^{ik}$$ analogously for module 2. Note that in the absence of noise or order effects, $$p_{2}^{ik}/p_{1}^{ik}=1+\theta_{ik}\tau_{i}$$, where $$1-\theta_{ik}$$ is the degree of under-reaction to the tax on product $$k$$ by consumer $$i$$. Thus for a consumer $$i$$ in either the standard- or triple-tax arms, $$\frac{y_{ik}}{\tau_{i}}\approx\theta_{ik}$$, where $$\tau_{i}$$ is the tax rate faced by the consumer in module 1 and $$y_{ik}=\log(p_{2}^{ik})-\log(p_{1}^{ik})$$. Of course, $$\frac{y_{ik}}{\tau_{i}}$$ provides a noisy estimate of $$\theta_{ik}$$ because study participants’ reported values for the product fluctuate. Furthermore, this measure may be biased if average perceived values of the products vary between module 1 and module 2 even in the absence of tax changes. This phenomenon—which we refer to as order effects—is commonly found in pricing experiments (see, $$e.g.$$Andersen et al., 2006; Clark and Friesen, 2008), and the no-tax arm of the experiment was designed to allow us to identify and econometrically accommodate these effects. In this arm, we find that participants’ valuations declined by an average of 42 cents from module 1 to module 2 ($$p<0.001$$). Our econometric approach incorporates these order effects and allows them to depend on any estimated covariates, but we assume that order effects do not vary by experimental arm. This assumption, labelled A1 below for reference, allows us to extrapolate the estimated order effects from the no-tax arm to the other tax arms, in which the identification of order effects would otherwise be confounded with the variation in tax rates between module 1 and 2. A1 For any vector of covariates $$X_{ik}$$, $$E[y_{ik}-\log(1+\theta_{ik}\tau_{i})|X_{ik}]$$ does not depend on $$\tau_{i}$$. For a vector of covariates $$X_{ik}$$ we will estimate the following model: \begin{eqnarray*} E[y_{ik}|X] & = & \log(1+\theta_{ik}\tau_{i})+\beta X_{ik}\\ E[\theta_{ik}|X_{ik}]\approx E\left[\frac{\log(1+\theta_{ik}\tau_{i})}{\tau_{i}}|X_{ik}\right] & = & \alpha X_{ik} \end{eqnarray*} The model above implies the following moment conditions: \begin{eqnarray} E[X_{ik}'y_{ik}] & = & X_{ik}'\beta X_{ik}\,\,\,\,\text{ for no-tax arm}\\ \end{eqnarray} (5) \begin{eqnarray} E\left[X_{ik}'\left(\frac{y_{ik}-\beta X_{ik}}{\tau_{i}}\right)\right] & = & X_{ik}'\alpha X_{ik}\,\,\,\,\text{for std./triple-tax arms} \end{eqnarray} (6) Equation (5) identifies any order effects in the data using the no-tax arm. These order effects are partialled out from $$y_{ik}$$ in the standard and triple-tax arms in Equation (6), which allows us to estimate $$E[\theta_{ik}]$$ as a linear function of covariates $$X_{ik}$$. When estimating Equations (5) and (6) for either the standard or triple-tax arm separately, the system of equations is exactly identified. When pooling data from multiple treatment arms, we will assume that Equation (6) holds independently for each arm, but with a common $$\alpha$$. The system is thus over-identified, and we use the two-step generalized method of moments (GMM) estimator to obtain an approximation to the efficient weighting matrix. We will often condition on $$p_{2}^{ik}\geq\underline{p}$$ (typically $$p_{2}^{ik}\geq1$$)—$$i.e.$$ focusing analysis on those with non-negligible willingness to pay—as a means of increasing precision. Because most of our analysis takes $$p_{2}/p_{1}$$ as an object of interest, noisiness in responses can generate dramatic variation in this quantity when valuations approach zero. All of our point estimates are robust to the inclusion of all data. Although our approach may seem somewhat complicated, we show in Online Appendix E.6 that all of our main results are robust to a simpler approach, using OLS to regress $$y_{ik}$$ on the tax rate. As we elaborate in that Appendix, however, we prefer our GMM approach as it avoids the need to assume that mean under-reaction is constant across tax sizes within an experimental arm—an assumption that our results refute.22 4.4. Average under-reaction to taxes by experimental arm Table 2 presents our estimates of average $$\theta$$ in each arm using the econometric framework presented in Section 4.3. We provide estimates using all data, as well as conditioning on $$p_{2}^{ik}\geq1$$ and $$p_{2}^{ik}\geq5$$. Across all specifications, we estimate an average $$\theta$$ of approximately $$0.25$$ in the standard-tax arm and an average $$\theta$$ of approximately $$0.5$$ in the triple-tax arm.23 Due to advantages of our design, these estimates are notably more precise than those of prior work and strongly reject the null hypotheses that consumers completely neglect or completely attend to taxes.24 All the estimates are more precise in the second and third columns than in the first column, as the ratio $$p_{2}^{ik}/p_{1}^{ik}$$ is naturally most noisy when a consumer attaches low value to the product. We will thus continue conditioning on $$p_{2}^{ik}\geq1$$ throughout the rest of our analysis. Table 2 Average $$\theta$$ (weight placed on tax) by experimental arm 1 2 3 All $$p_2\geq 1$$ $$p_2\geq5$$ Std. tax avg. $$\theta$$ 0.261** 0.250*** 0.226** (0.111) (0.095) (0.094) Triple tax avg. $$\theta$$ 0.481*** 0.475*** 0.535*** (0.045) (0.039) (0.041) Observations 59,960 58,478 32,810 Difference $$p$$-value 0.03 0.01 <0.001 1 2 3 All $$p_2\geq 1$$ $$p_2\geq5$$ Std. tax avg. $$\theta$$ 0.261** 0.250*** 0.226** (0.111) (0.095) (0.094) Triple tax avg. $$\theta$$ 0.481*** 0.475*** 0.535*** (0.045) (0.039) (0.041) Observations 59,960 58,478 32,810 Difference $$p$$-value 0.03 0.01 <0.001 Notes: This table displays GMM estimates of average $$\theta$$ by experimental arm, applying the methodology discussed in Section 4.3. $$\theta$$ is defined as the “weight” that consumers place on the sales tax, with $$\theta=0$$ corresponding to complete neglect of the tax and $$\theta=1$$ corresponding to full optimization. Column (1) uses all data, column (2) conditions on module 2 price ($$p_{2}$$) being greater than 1, column (3) conditions on module 2 price ($$p_{2}$$) being greater than 5. Standard errors, clustered at the subject level, are reported in parentheses. * $$p<0.1$$, ** $$p<0.0$$5, *** $$p<0.01$$. Table 2 Average $$\theta$$ (weight placed on tax) by experimental arm 1 2 3 All $$p_2\geq 1$$ $$p_2\geq5$$ Std. tax avg. $$\theta$$ 0.261** 0.250*** 0.226** (0.111) (0.095) (0.094) Triple tax avg. $$\theta$$ 0.481*** 0.475*** 0.535*** (0.045) (0.039) (0.041) Observations 59,960 58,478 32,810 Difference $$p$$-value 0.03 0.01 <0.001 1 2 3 All $$p_2\geq 1$$ $$p_2\geq5$$ Std. tax avg. $$\theta$$ 0.261** 0.250*** 0.226** (0.111) (0.095) (0.094) Triple tax avg. $$\theta$$ 0.481*** 0.475*** 0.535*** (0.045) (0.039) (0.041) Observations 59,960 58,478 32,810 Difference $$p$$-value 0.03 0.01 <0.001 Notes: This table displays GMM estimates of average $$\theta$$ by experimental arm, applying the methodology discussed in Section 4.3. $$\theta$$ is defined as the “weight” that consumers place on the sales tax, with $$\theta=0$$ corresponding to complete neglect of the tax and $$\theta=1$$ corresponding to full optimization. Column (1) uses all data, column (2) conditions on module 2 price ($$p_{2}$$) being greater than 1, column (3) conditions on module 2 price ($$p_{2}$$) being greater than 5. Standard errors, clustered at the subject level, are reported in parentheses. * $$p<0.1$$, ** $$p<0.0$$5, *** $$p<0.01$$. The difference in average $$\theta$$ between the arms is significant at the 5% level when using all data or when conditioning on $$p_{2}^{ik}\geq1$$, and it is significant at the 0.1% level when conditioning on $$p_{2}^{ik}\geq5$$.25 4.5. Further tests of endogenous attention Our baseline results suggest that consumers attend more to higher taxes. However, several important caveats apply. Consumers might overreact to the triple tax if they are surprised by the unusual scenario (Bordalo et al., 2017). This suggests that a measurement of average $$\theta$$ shortly after a real or experimentally induced tax change might overestimate the degree of attention that would be realized after the resolution of surpise. On the other hand, our estimates of average $$\theta$$ in the triple-tax arm may underestimate long-run attention because it may take time for people to update their heuristics in a modified decision environment. A complementary analysis that could test for long-run response would be to estimate whether consumers are more attentive in states with larger sales taxes. However, since the variation in tax rates across states is substantially lower than the tripling of taxes considered in our experiment, such an analysis would require a sample size that is approximately 45 times larger than ours to be well-powered. Unfortunately, such a test cannot currently be feasibly implemented with a lab-in-field approach like our own.26 An alternative and better-powered approach to testing endogenous response to stake size makes use of variation in willingness to pay rather than tax rates. Since the total tax is determined by $$t=\tau p$$, variation in either tax rates or maximal acceptable purchase prices may be used to generate variation in stakes. We operationalize this test by dividing all consumers (from all three arms) into three bins corresponding to their module 2 valuation ($$p_{2}^{ik}<5$$, $$p_{2}^{ik}\in[5,10)$$, and $$p_{2}^{ik}\geq10$$), and then estimating an average $$\theta$$ for each bin. Note that we partition consumers using module 2 prices to avoid endogeneity issues arising from the fact that the module 1 prices will depend on a person’s attention to the tax.27 Columns (1)–(3) of Table 3 report the results of this estimation. Column (1) presents estimates for the standard-tax arm, column (2) presents estimates for the triple-tax arm, and column (3) presents estimates for the pooled data. When pooling data, we allow for different baselines of average $$\theta$$ for the different arms but we assume that the impact of moving to a higher bin is the same across the arms. Although we are underpowered for this analysis in the standard-tax arm, the table shows that when pooling the data, or when restricting to the triple-tax arm, consumers in the second and third bin have a higher average $$\theta$$ than consumers in the first bin. The differences in average $$\theta$$ are approximately $$0.1$$2 for second versus first bin and 0.15 for third versus first bin, in both the triple-tax arm or pooled analysis. We do not detect a difference for average $$\theta$$ between the second and third bin, although we also cannot reject a moderate one. This suggests that attention may not increase linearly with price and that consumers employ different attention strategies for very low-price products below $5 versus moderate-price products above $5. Table 3 Average $$\theta$$ (weight placed on tax) for different product valuations 1 2 3 4 5 6 Standard Triple Pooled Standard Triple Pooled Middle $$p_2$$ bin –0.097 0.117** 0.123** –0.077 0.097** 0.104*** (0.147) (0.054) (0.054) (0.101) (0.038) (0.038) High $$p_2$$ bin 0.115 0.147** 0.154** 0.168 0.069 0.072 (0.185) (0.074) (0.074) (0.152) (0.053) (0.053) Std. tax cons. 0.266* 0.156 (0.140) (0.098) Triple tax cons. 0.402*** 0.395*** (0.054) (0.054) Individual fixed effects No No No Yes Yes Yes Observations 40,651 39,378 58,478 40,651 39,378 58,478 1 2 3 4 5 6 Standard Triple Pooled Standard Triple Pooled Middle $$p_2$$ bin –0.097 0.117** 0.123** –0.077 0.097** 0.104*** (0.147) (0.054) (0.054) (0.101) (0.038) (0.038) High $$p_2$$ bin 0.115 0.147** 0.154** 0.168 0.069 0.072 (0.185) (0.074) (0.074) (0.152) (0.053) (0.053) Std. tax cons. 0.266* 0.156 (0.140) (0.098) Triple tax cons. 0.402*** 0.395*** (0.054) (0.054) Individual fixed effects No No No Yes Yes Yes Observations 40,651 39,378 58,478 40,651 39,378 58,478 Notes: This table displays GMM estimates of the relationship between average $$\theta$$ and the valuation of the good considered, applying the methodology discussed in Section 4.3. $$\theta$$ is defined as the “weight” that consumers place on the sales tax, with $$\theta=0$$ corresponding to complete neglect of the tax and $$\theta=1$$ corresponding to full optimization. Columns (1)–(3) estimate the model $$\bar{\theta}_{ik}=\alpha_{0}^{\text{1x}}\mathbf{1}_{\text{1x}}+\alpha_{0}^{\text{3x}}\mathbf{1}_{\text{3x }}+\alpha_{p_{2}\in[5,10)}\mathbf{1}_{p_{2}\in[5,10)}+\alpha_{p_{2}\geq10}\mathbf{1}_{p_{2}\geq10}$$. We assume that $$\alpha_{p_{2}\in[5,10)}$$ and $$\alpha_{p_{2}\geq10}$$ do not change across the standard- and triple-tax arms, but we allow for different baseline values $$\alpha_{0}^{\text{1x }}$$ and $$\alpha_{0}^{\text{3x }}$$. Columns (4)–(6) control for individual fixed effects, estimating the model $$\bar{\theta}_{ik}=\theta_{i}+\alpha_{p_{2}\in[5,10)}\mathbf{1}_{p_{2}\in[5,10)}+\alpha_{p_{2}\geq10}\mathbf{1}_{p_{2}\geq10}$$. We model the two moment conditions for each arm separately, and we use the two-step GMM estimator to approximate the efficient weighting matrix for the over-identified model. All specifications condition on module 2 price ($$p_{2}$$) being greater than 1. Standard errors, clustered at the subject level, are reported in parentheses. * $$p<0.1$$, ** $$p<0.0$$5, *** $$p<0.01$$. Table 3 Average $$\theta$$ (weight placed on tax) for different product valuations 1 2 3 4 5 6 Standard Triple Pooled Standard Triple Pooled Middle $$p_2$$ bin –0.097 0.117** 0.123** –0.077 0.097** 0.104*** (0.147) (0.054) (0.054) (0.101) (0.038) (0.038) High $$p_2$$ bin 0.115 0.147** 0.154** 0.168 0.069 0.072 (0.185) (0.074) (0.074) (0.152) (0.053) (0.053) Std. tax cons. 0.266* 0.156 (0.140) (0.098) Triple tax cons. 0.402*** 0.395*** (0.054) (0.054) Individual fixed effects No No No Yes Yes Yes Observations 40,651 39,378 58,478 40,651 39,378 58,478 1 2 3 4 5 6 Standard Triple Pooled Standard Triple Pooled Middle $$p_2$$ bin –0.097 0.117** 0.123** –0.077 0.097** 0.104*** (0.147) (0.054) (0.054) (0.101) (0.038) (0.038) High $$p_2$$ bin 0.115 0.147** 0.154** 0.168 0.069 0.072 (0.185) (0.074) (0.074) (0.152) (0.053) (0.053) Std. tax cons. 0.266* 0.156 (0.140) (0.098) Triple tax cons. 0.402*** 0.395*** (0.054) (0.054) Individual fixed effects No No No Yes Yes Yes Observations 40,651 39,378 58,478 40,651 39,378 58,478 Notes: This table displays GMM estimates of the relationship between average $$\theta$$ and the valuation of the good considered, applying the methodology discussed in Section 4.3. $$\theta$$ is defined as the “weight” that consumers place on the sales tax, with $$\theta=0$$ corresponding to complete neglect of the tax and $$\theta=1$$ corresponding to full optimization. Columns (1)–(3) estimate the model $$\bar{\theta}_{ik}=\alpha_{0}^{\text{1x}}\mathbf{1}_{\text{1x}}+\alpha_{0}^{\text{3x}}\mathbf{1}_{\text{3x }}+\alpha_{p_{2}\in[5,10)}\mathbf{1}_{p_{2}\in[5,10)}+\alpha_{p_{2}\geq10}\mathbf{1}_{p_{2}\geq10}$$. We assume that $$\alpha_{p_{2}\in[5,10)}$$ and $$\alpha_{p_{2}\geq10}$$ do not change across the standard- and triple-tax arms, but we allow for different baseline values $$\alpha_{0}^{\text{1x }}$$ and $$\alpha_{0}^{\text{3x }}$$. Columns (4)–(6) control for individual fixed effects, estimating the model $$\bar{\theta}_{ik}=\theta_{i}+\alpha_{p_{2}\in[5,10)}\mathbf{1}_{p_{2}\in[5,10)}+\alpha_{p_{2}\geq10}\mathbf{1}_{p_{2}\geq10}$$. We model the two moment conditions for each arm separately, and we use the two-step GMM estimator to approximate the efficient weighting matrix for the over-identified model. All specifications condition on module 2 price ($$p_{2}$$) being greater than 1. Standard errors, clustered at the subject level, are reported in parentheses. * $$p<0.1$$, ** $$p<0.0$$5, *** $$p<0.01$$. This analysis is consistent with attention increasing in the absolute tax $$p\tau$$. However, this result could also be obtained if consumers willing to pay the most for the products are also the most attentive. Columns (4)–(6) report an analogous test, ruling out this possibility through the inclusion of individual fixed effects (Online Appendix D.2 formally documents how we modify our GMM strategy). While estimates of attention are somewhat lower than in the first three columns, we again find greater inattention when $$p_{2}^{ik}<5$$ than when $$p_{2}^{ik}\geq5$$. In summary, our findings are consistent with attention allocation that is endogenous to stake size, whether variation in stake size is generated through experimentally manipulated tax rates or through naturally occurring variation in the prices at which consumers are marginal. This finding becomes important when comparing average attention found in our experiment to the attention predicted to occur at existing market prices. Subjects in our experiment valued the considered products somewhat lower, on average, than the prices posted on Amazon.com (average Amazon.com price: $10.15; average module 2 willingness to pay: $6.09). As we document in Online Appendix E.3, consumers who are marginal at market prices have average values of $$\theta$$ approximately 0.1 higher than other consumers. When extrapolating the quantitative estimates of this article into new settings, differences in marginal valuations between our experiment and the setting of interest must be similarly accommodated. 4.6. Sources and correlates of consumer mistakes 4.6.1 Do consumers know the tax rates? To assess consumers’ knowledge of the sales tax rates, and whether underestimation of the tax rates generates some of the under-reaction, we included the followin