demand systems with unobserved heterogeneity and …pendakur/hixtrix13.pdf · 2005. 6. 9. · are...

Report

Post on 24-Feb-2021

4 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Demand Systems With Unobserved Heterogeneity andUnrestricted Engel Curves

Arthur Lewbel and Krishna Pendakur*

Abstract

We propose a class of �exible utility derived consumer demand systems, called "Pseudo-

Marshallian" demands, that can have the following properties: 1. Budget shares are

nearly linear in parameters. 2. Engel curve shapes for each good are almost com-

pletely unrestricted and can have any rank. 3. Like the Almost Ideal Demand System

(AID), the model has budget shares that are nearly linear in log prices, but, unlike

the AID system, our model is not restricted to linearity in log de�ated total expendi-

tures, e.g., it can be a polynomial or spline of any order in de�ated expenditures and

in other observed heterogeneity characteristics. 4. Error terms correspond to unob-

served preference heterogeneity. 5. Demand functions are estimated using ordinary

GMM, and, like the AID system, an approximate model can be estimated by linear

regression methods.

1 Introduction

Recent empirical nonparametric and semiparametric work with large consumer ex-

penditure data sets reveals income expansion paths (Engel curves) with signi�cant

curvature and variation across goods. For example, some goods have Engel curves

that are close to linear or quadratic, while others are more S shaped (see, e.g. Banks,

*Department of Economics, Boston College, USA and Department of Economics, Simon FraserUniversity, CANADA. This version is June, 2005. We would like to thank David Donaldson forsuggesting Theorem 2, and Stefan Sperlich.

Blundell and Lewbel 1997 and Blundell, Chen and Kristensen 2003). Typical para-

metric demand models cannot encompass this variety of shapes. We propose a class

of �exible cost (expenditure) functions characterizing utillity, and associated budget

share demand functions, that have the following properties.

1. Budget share demand functions are nearly linear in parameters.

2. Budget share Engel curves for each good are almost completely unrestricted,

that is, for one price regime, Engel curves can have almost any functional form, and

the resulting demand system can have any rank up to J � 1, where J is the numberof goods.

3. Like Deaton and Muellbauer�s (1980) Almost Ideal Demand (AID) System ,

our models have budget shares that, up to an income de�ator, are linear or close to

linear in log prices. However, unlike the AID system, our models are not restricted to

linearity in log de�ated total expenditures y, e.g., they can be polynomials or splines

of any order in y.

4. Error terms in our model can interpreted as unobserved preference hetero-

geneity parameters. This unobserved preference heterogeneity is locally coherent and

invertible.

5. Observable parameters that a¤ect tastes, such as demographic variables, can

a¤ect demands in the same way as the unobserved heterogeneity, or more generally

can also appear in the model interacting in general ways with y, e.g., the demand

functions can be polynomials or splines of any order in both y and in demographic

variables.

6. The demand functions can be estimated using ordinary nonlinear instrumental

variables via GMM. Like the AID, approximate versions of our models can be esti-

mated by linear regression methods. We �nd almost no empirical di¤erence between

the exact GMM and approximate linear regression estimates.

The methodology begins by specifying a cost (expenditure) function and uses

Shephard�s lemma to obtain Hicks demands. Then, instead of obtaining Marshallian

demands from Hicks demands by solving for utility in terms of total expenditure, we

solve for demands as functions of log prices and log de�ated (or real) total expendi-

tures y, where the de�ator is similar to a Stone index. We call the results Pseudo-

Marshallian demand functions. These Pseudo-Marshallian demands circumvent the

di¢ culty of �nding simple analytic expressions for indirect utility or Marshallian de-

mands. A similar idea is Browning�s (2001) �M-demands,�which expresses demand

functions in terms of prices and the quantity of one good, instead of in terms of

prices and total expenditures, though in our case the goal is obtaining simple �exible

functional forms rather than coping with missing or mismeasured total expenditures.

De�ne a J�vector of logged prices p, a J�vector of budget shares w, and loggedtotal expenditure x. Assume an individual chooses w by maximizing a regular utility

function subject to a linear budget constraint. We wish to explicitly include both

observable and unobservable sources of preference heterogeneity in the model, so let

z be an L� vector of observable demographic (or other) characteristics that a¤ect

preferences, and let " be a J�vector of unobserved preference characteristics (tasteparameters). De�ne the log cost or expenditure function x = C(p; u; z; ") which

equals the minimum log-expenditure required for an individual with characteristics

z; " to attain utility level u when facing log prices p.

Two models serve to illustrate this paper�s methodology. The �rst is

C(p; u; z; ") = u+ p0a+ p0Bs(u; z) + (p0Ap)1=2 + p0"

where a is a J�vector of constants with 10ja = 1, A is a J � J symmetric matrixof constants with 10jA = 00J , B is a J � L matrix of constants with 10jB = 00L, and

s(u; z) is an L�vector of arbitrary functions such as polynomials, splines, logistictransforms of polynomials (to remain bounded), or almost any other functional form

we choose. In our empirical application, we let s(u; z) = (u�1, u, u2,..., u5, z, uz)0.

By Shephard�s lemma, this cost function has Hicksian (compensated) budget shares

w = a + Bs(u; z) + Ap(p0Ap)�1=2 + " and it can be readily checked from these

formulas that C(p; u; z; ") = u+ p0w, which means that

w = a+Bs(y; z) +Ap(p0Ap)�1=2 + "

where

y = x� p0w.

Also, in the base time period where p is the zero vector, we get Engel curves w =

a+Bs(x; z) + ", so Engel curves in this price regime may have almost any shape.

This model is an example of what we call Pseudo-Marshallian demand functions,

meaning that budget shares are expressed in terms of log de�ated expenditures y

instead of log nominal expenditures x. Unlike Deaton and Muellbauer�s (1980) AID

system, here the logged Stone index p0w is not an approximation, but is instead the

exact, correct logged de�ator for x. This model does not have a closed form expression

for indirect utility or ordinary Marshallian demand function except for very special

choices of s(x; z), but it can still be readily estimated because it expresses budget

shares as functions of observables p, z, and y.

The key is de�ning a cost function that makes y ordinally equivalent to u, thereby

allowing us to replace u with y in the Hicksian demand functions. The resulting model

has budget shares that are �exible in prices, additive in random utility parameters

(the errors terms "), and has Engel curves that are almost unrestricted functions,

such as polynomials of arbitrary degree, in total expenditures and in other observable

characteristics.

A second example is the closely related speci�cation

C(p; u; z; ") = u+ p0a+ p0Bs(u; z) + p0Ap=2 + p0"

This cost function has the simpler Hicksian budget shares w = a+Bs(u; z)+Ap+"

and for this model we obtain C(p; u; z; ") = u+p0w+p0Ap=2, which means that we

have Pseudo-Marshallian budget share demand functions

w = a+Bs(y; z) +Ap+ "

where now

y = x� p0(w �Ap=2).

Log de�ated expenditures y are more complicated in this model, depending on A

instead of only being a function of observed variables as in the previous model. How-

ever, in this model the Pseudo-Marshallian demand functions are completely linear

in parameters given y, while still maintaining �exibility in prices, additive random

utility parameter errors, and arbitrary Engel curve functional forms.

We provide general theorems showing the restrictions on cost functions required

to construct pseudo-Marshallian demands. We then show that, with approximations

similar to those used for AID system estimation, approximate versions of these mod-

els are fully linear, and can be estimated by least squares regression. We show that

instrumental variable and GMM estimators can be used to deal with the endogeneity

that results from having a regressor y that depends on w, though we also �nd that

accounting for this endogeneity has very little e¤ect empirically. We provide exten-

sions and additional mathematical properties of pseudo Marshallian demands, which

are relevant for evaluating these models and for other applications of our general

methodology.

2 Tricks With Hicks

Ignore preference heterogeneity for now, that is, consider z and " as �xed. We will

reintroduce them again later. De�ne preferences to be regular if they can be repre-

sented by a log cost function C(p; u) that satis�es: exp[C(p; u)] is concave, increas-

ing, di¤erentiable and homogeneous of degree one in (unlogged) prices exp(p) and

is monotonically increasing and di¤erentiable in u. In most cases, explicit functions

we provide for C(p; u) will not be globally regular, that is, they may only be regular

over some limited range of values of p; u, in which case the reported results will only

be valid over these limited ranges. Shephard�s lemma relates Hicksian (compensated)

budget shares to regular cost functions by

w = !(p; u) = rpC(p; u):

The Stone de�ator is de�ned as the price index equal to the share-weighted geo-

metric mean of prices, that is, as exp(p0w). De�ne log stone-de�ated expenditure, y,

as y = x � p0w. If y = u (or more generally if y is ordinally equivalent to u) then

real expenditures, which hold utility constant when prices change, are equal to Stone

de�ated expenditures. De�ne the Stone de�ator to be exact if u = y = x � p0w, orequivalently if

u = x� p0w = C(p; u)� p0[rpC(p; u)]: (1)

Exactness of the Stone de�ator implies w = !(p; x � p0w), which transforms theHicksian demands into a Pseudo-Marshallian demand system, which is based solely

on observables. Let 1J denote the J�vector of ones.

Theorem 1: Assume C(p; u) is regular. De�ne d(p; u) � C(p; u)�u. The Stonede�ator is exact for C(p; u) if and only if d(�p+1J�; u) = �d(p; u)+� for any scalars

� and �.

Proofs are in the Appendix. Theorem 1 can be equivalently stated as the require-

ment that ed be linearly homogeneous in ep, which follows from ordinary cost function

homogeneity, and also that d be linearly homogeneous p, which makes u = y. Both

homogeneity conditions must hold for the Stone index to be the exact de�ator of a

regular cost function.

Proofs are in the Appendix. By ordinality, instead of de�ning exactness by y = u,

we could have de�ned it by y = h(u) for any strictly monotonically increasing function

h, and Theorem 1 would then hold with d(p; u) de�ned by C(p; u) � h(u) There isno gain in generality from doing so, because by ordinality, C(p; u) � h(u) has thesame indi¤erence curves, and hence the same Marshallian and Pseudo-Marshallian,

demand functions, as C(p; h�1(u)) � u = eC(p; u) � u, and in this representationof the cost function (i.e., in this cardinalization of utility) we would have y = u as

before. We will for convenience write cost functions in the cardinalization that makes

u = x� p0w instead of h(u) = x� p0w.The following Corollary and theorem illustrates the restrictiveness of the con-

straint that d(�p+ 1J�; u) = �d(p+ 1J�; u), as required for the Stone de�ator.

Corollary 1: Assume the Stone de�ator is exact for C(p; u). Then Hicksiandemands !(p; u) = rpC(p; u) are homogeneous of degree zero in p and in exp(p).

Cost function regularity implies that Hicksian demands do not change when all

prices are scaled by a constant factor. Corollary 1 shows that Stone index exactness

requires that Hicksian demands also not change when all prices undergo the same

power transformation, for example, when all prices are squared. This also rules out

Hicksian demands that are linear in p.

Theorem 2: Consider the case where J = 2. Then d(�p1 + �; �p2 + �; u) =

�d(p1; p2; u)+� for any scalars � and � if and only if d(p1; p2; u) = m(u)p1+[1�m(u)]p2for some function m(u).

Theorem 2 shows that, with J = 2 goods, the only cost functions C(p; u) that have

an exact Stone de�ator are linear in p, which we may write as C(p; u) = u+ p0m(u)

for any J�vector of functionsm(u) that satisfym(u)01J = 1. This has an exact Stonede�ator for any J , but also has Hicksian budget shares m(u) that are independent of

Systems having more than J = 2 goods can be nonlinear in p. One example is

C(p; u) = u+ p0m(u) + [p0M(u)p]1=2

where m(u) is as above andM(u) is a J by J symmetric matrix-valued function of u

withM(u)1J = 0J . The �rst model described in the introduction is a special case of

this example. Note that, as the above example shows, we could add a term of the formhp0 eApi1=2 u to the log cost function of the �rst model in the introduction, which wouldthen add the term eApy(p0 eAp)�1=2 to the Pseudo-Marshallian demands, allowing for�exible interactions between prices and real Stone de�ated total expenditures in the

demand functions.

Another example is

C(p; u) = u+ p0m(u) + [p0M(u)p] =p0n(u)

where m(u) and M(u) are as above and n(u) is a J�vector function of u withn(u)01J = 0. The functions m(u), M(u), and n(u) can depend in arbitrary ways

on u, with cost function regularity only imposing inequality constraints on these

functions.

3 Obtaining Flexible Linearity by Generalizing the De�ator

In the models of the previous section, Pseudo-Marshallian shares require the unattrac-

tive feature of not changing when prices are squared, and require nonlinear terms like

(p0Ap)1=2 in the cost function to have any relative price e¤ects. We can avoid these

drawbacks by generalizing the form of the permitted de�ator, thereby relaxing the ho-

mogeneity restriction required by Stone de�ator exactness. Speci�cally, now consider

log real total expenditures de�ned by

y = x� p0w � t(p) (2)

for some function t(p). Exactness for the more general de�ator p0w + t(p) requires

y = u with y given by equation (2), which implies

u = x� p0w � t(p) = C(p; u)� p0[rpC(p; u)]� t(p). (3)

Theorem 3: Assume C(p; u) is regular. Again let d(p; u) � C(p; u)� u. Thereexists a function t(p) that makes the de�ator p0w+t(p) exact for C(p; u) if and only if

d(p+1J�; u) = d(p; u)+� andrud(�p; u) = �rud(p; u) for any scalars � and �. Also,

if the de�ator p0w+ t(p) is exact for some cost function, then t(p+ 1J�) = t(p) + �.

Theorem 3 describes the general conditions required for exactness of de�ators

of the form p0w + t(p). Comparing Theorems 3 and 1 shows that the di¤erence

between this exactness and Stone de�ator exactness is that the latter requires linear

homogeneity of d(p; u), while the former only needs linear homogeneity of rud(p; u).

The following theorem provides a convenient way to construct such cost functions.

Theorem 4: Assume the function C(p; u) satis�es the homogeneity conditionsrequired to be a log cost function with an exact Stone de�ator, as given by Theorem

1. Assume C(p; u) = C(p; u)+T (p) is a regular log cost function. Then the de�ator

p0w + t(p) is exact for C(p; u), where t(p) = T (p)� p0 [rpT (p; u)].

It follows from Theorem 4 that we may take any log cost function described in the

previous section, and add to it a function T (p) that satis�es T (p+ 1J�) = T (p) + �,

to obtain a log C(p; u) that satis�es the necessary price homogeneities for exactness

of p0w + t(p). In particular, this means that C(p; u) = u + p0m(u) + T (p) will

satisfy the necessary price homogeneities for exactness of p0w + t(p) where t(p) =

T (p) � p0 [rpT (p; u)], given any function T (p) having the property that exp [T (p)]

is homogeneous of degree zero in exp(p). An example of particular interest is

C(p; u) = u+ p0m(u) + p0Ap=2

where m(u)01J = 1, A is symmetric, and A1J = 0J . This has Hicks demands

!(p; u) =m(u)+Ap and, by Theorem 3, has Pseudo-Marshallian demandsw =m(y)+

Ap where y = x�p0 (w �Ap=2). This is the model of Pendakur and Sperlich (2005),although they estimate its ordinary Marshallian demands, leaving m nonparametric

and numerically solving for u. The second model described in the introduction has

this form. By allowing the de�ator to depend on Ap=2, we can have prices appear

linearly and �exibly in the Pseudo-Marshallian demand functions.

4 Preference Heterogeneity

We wish to explicitly include preference heterogeneity in the model, both from ob-

servable sources z and unobservable sources ". A simple way to include these in the

models without interfering with required price homogeneities is to include them in

the functions m(u). Example models provided in the previous sections then have the

general form

C(p; u; z; ") = u+ p0m(u; z; ") + (p0Ap)1=2 (4)

C(p; u; z; ") = u+ p0m(u; z; ") + p0Ap=2

where m(u; z; ")01J = 1. Equation (4) has Stone index exactness, while the other

equation has exactness of the de�ator p0 (w �Ap=2). Note that u may also be im-plicitly a function of z; ", since by ordinality, replacing u with h(u; z; ") for any func-

tion h that is strictly monotonically increasing in its �rst element leaves Marshallian

demands unchanged.

A convenient general form for m(u; z; ") is

m(u; z; ") = a+Bs(u; z) + "

where s(u; z) are polynomials, splines, or other known functions of u and z. This then

yields the models described in the introduction. These models are

C(p; u; z; ") = u+ p0a+ p0Bs(u; z) + (p0Ap)1=2 + p0" (5)

which has a Stone index de�ator and Pseudo-Marshallian demands

w = a+Bs(y; z) +Ap(p0Ap)�1=2 + ", y = x� p0w (6)

and

C(p; u; z; ") = u+ p0a+ p0Bs(u; z) + p0Ap+ p0" (7)

which has a more general de�ator, but linear in parameters Pseudo-Marshallian de-

mands

w = a+Bs(y; z) +Ap+ ", y = x� p0w � p0Ap=2. (8)

The complete set of properties required for de�ator exactness and cost func-

tion regularity of these models are a01J = 1, A1J = 0J, 10JB = 0K, "01J = 0,

p0Brus(u; z) > �1, and concavity of exp[C(p; u)] (a su¢ cient condition for concav-ity is that A be negative semide�nite). These constraints are assumed to hold for

every value that x;p; z; " can take on, and hence every value that u;p; z; " can take

on.

5 Models For Estimation

The model we estimate has cost function

C(p; u; z; ") = u+p0�a+Ap=2 + b�1u

�1 + b1u+ b2u2 + :::+ b5u

5 +Cz+Dzu+ "�

(9)

where each bk is a J�vector of constants with b0k1J = 0, and C and D are J by L

matrices of parameters with 10JC = 0L and 10JD = 0L. This model has log real total

expenditures y = x� p0w � p0Ap=2 and pseudo-Marshallian budget shares

w = a+Ap+ b�1u�1 + b1y + b2y

2 + :::+ b5y5 +Cz+Dzy + ". (10)

This model is the special case of equations (7) and (8) that has s(y; z) = (y�1; y; y2; :::; yK ; z0; z0y)0

and B = (b�1 : b1 : ::: : b5 : C : D). Typical elements of z would include house-

hold size, age, and composition. This model has Engel curves that are high order

polynomials. Unlike Marshallian demands, our pseudo-Marshallian demands can be

polynomials of any degree K without being bound by Gorman (1981) and Lewbel

(1991) type rank restrictions. Note, however, that monotonicity of the cost function

does place inequality constraints on s(y; z), which restrict the range of possible pa-

rameter values and the range of values of p; y; z; " for which these demand functions

satisfy regularity. Instead of polynomials, we could alternatively have taken s(y; z) to

be splines, or bounded functions such as trigonometric functions or logistic transfor-

mations of polynomials, which would automatically bound estimated budget shares.

In our model the D parameters allow for interactions between y and z. Demand

functions having interactions of z and y with p are also possible, e.g., by adding the

termhp0 eApi1=2 u to the log cost function. Other possible extensions are discussed in

the Appendix.

6 Approximate Fully Linear Model

The demand functions (8) are linear in parameters except for the term p0Ap in the

de�ator. A similar nonlinearity appears in Deaton and Muellbauer�s (1980) AID

system and Banks, Blundell, and Lewbel�s (1997) QUAID system, and can be dealt

with in an analogous way by replacing it with an observable approximation.

In the models given by equations (7) and (8), consider approximating log real

expenditure y with nominal total expenditures de�ated by an average Stone price

index, that is, replace y = x� p0w � p0Ap=2 with ey de�ned byey = x� p0w (11)

for some set of budget shares w. Then equation (8) becomes

w � a+Ap+Bs(ey; z) + ".and, in particular, our budget share demands (10) become

w � a+Ap+ b�1ey�1 + b1ey + :::+ bKeyK +Cz+Dzey + ". (12)

For K = 1 and D zero, this is identical to the popular approximate Almost Ideal

Demand System (AID). Note, however, that our exact model with K = 1 di¤ers from

the exact AID, in that the exact AID has a de�ator for x is that is quadratic in p,

while our model�s exact de�ator includes both a quadratic in p and the Stone index.

Unlike the AID system, our model permits s(ey; z) to have any functional form, suchas a polynomial of any order. A special case is the model w � a+Ap+ b1ey+b2ey2+Cz+ ", which is the model estimated by Blundell, Pashardes, and Weber (1993).

Their motivation for this model was by analogy with the Almost Ideal, but if this

model was really to be Marshallian with a, b1, and b2 constant then, as they show,

utility maximization would require linear rank restrictions on these coe¢ cients, which

they did not impose. The above approximate model therefore provides a rationale for

the model they actually estimated.

The approximate model (12) can be estimated by linear regression methods, with

linear cross-equation symmetry restrictions on theA coe¢ cients. A natural choice for

w is the sample average of budget shares across consumers. A better approximation to

y would be to let w be each consumer�s own w, so each consumer has their own Stone

index de�ator based on their own budget shares. As discussed later, this introduces

endogeneity, but dealing with the endogeneity makes the estimator be just linear three

stage least squares with linear cross-equation constraints on the A coe¢ cients.

In our later empirical application, we estimate the approximate model withw = w

using seemingly unrelated regressions ignoring endogeneity, and we estimate the exact

model accounding for endogeneity. We �nd empirically that these approximate model

estimates did not di¤er much from the exact model estimates.

7 Applying the Model

Consider evaluating the cost for each individual of a price change from p0 to p1. A

consumer surplus measure for this price change is the log cost of living index, which

for our general model (4) is given by

C(p1; u; z; ")� C(p0; u; z; ") = (p1 � p0)0 [a+m(u; z; ")] + p01Ap1=2� p00Ap0=2.

If C(p0; u; z; ") is the cost function of a household that has budget shares w, then

this expression simpli�es to

C(p1; u; z; ")� C(p0; u; z; ") = (p1 � p0)0 (w �Ap0) + p01Ap1=2� p00Ap0=2.= (p1 � p0)0w + (p1 � p0)0A(p1 � p0)=2:

The �rst term in this cost of living index is (p1�p0)0w the share-weighted geomet-ric mean of the price changes. Such indices are commonly used on the grounds that

they are appropriate for small price changes and they allow for arbitrary unobserved

preference heterogeneity across households. In our model, the presence of the second

term, which depends upon A, allows us to explicitly model substitution e¤ects, and

so consider large price changes, while also accounting for the behavioural importance

of both observed and unobserved heterogeneity.

De�ne semielasticities to be derivatives of budget shares with respect to log prices

p, log real total expenditures y, log nominal total expenditures x, and demographic

characteristics (or other observed taste shifters) z. The semielasticity of a budget

share can be converted into an ordinary elasticitiy by dividing by that budget share.

We present semielasticities because they are easier to present algebraically. Our gen-

eral model (4), and the speci�cations (7) and (9) have Hicksian demands

!(p; u; z; ") = a+Ap+m(u; z; ")

= a+Ap+Bs(u; z) + "

= a+Ap+ b1u+ b2u2 + :::+ bKu

K +Cz+Dzy + "

so the Hicksian price semielasticities are

rp0!(p; u; z; ") = A.

These are equivalently the price semielasticities holding real expenditures y �xed, and

the real total expenditures semielasticities are

ry!(p; y; z; ") = rym(y; z; ") = Brys(y; z)

= b1 + 2b2y + :::+KbKyK�1 +Dz.

Marshallian elasticities can be calculated by writing the general form of the de-

mand functions as

w = a+Ap+m(x� p0w � p0Ap=2; z; ") (13)

and taking the total derivative of w with respect to p and x. Details regarding these

and other elasticity calculations are given in the Appendix.

One could also numerically solve x = C(p; u; z; ") for u, with C given by equations

(4), and substitute the result into the Hicksian demand functions, to obtain Marshal-

lian demands. The appendix also provides partial characterizations and closed form

approximations of these Marshallian demands.

8 Estimation

We are estimating demand systems with J goods, so as usual we can drop the last

equation (the J�th good) from the system and just do estimation on the remaining

system of J � 1 equations. The parameters of the the J�th good are then recoverablefrom the adding up constraint that budget shares sum to one. Assume this is done

in all of the following discussion.

The approximately linear model of equations (12) and (11) is the simplest one

to estimate. Assuming " is homoskedastic and uncorrelated with p and s(ey; z), thismodel �ts the form of ordinary SUR (Seemingly Unrelated Regressions). Without im-

posing symmetry of the A matrix, estimating each equation separately by ordinary

least squares is consistent and equivalent to SUR. Symmetry of the A matrix imposes

linear cross equation equality constraints on the coe¢ cients. The SUR estimator is

asymptotically equivalent to maximum likelihood assuming " is normal. This estima-

tor remains consistent if " is heteroskedastic, but in that case it will be more e¢ cient

to use the Generalized Method of Moments (GMM), based on the moments de�ned

by p and s(ey; z) uncorrelated " and by " having mean zero. SUR estimates from the

linear approximate model can be used to obtain good parameter starting values for

exact model estimation. If we let ey = x� p0w, then endogeneity of w means that "

will not be uncorrelated with s(ey; z). However, in our empirical application we �ndthat the bias from ignoring this endogeneity is small, probably because variation in

the endogeneous portion of y; that is, variation in the p0" component of p0w, is small

relative to variation in x and the other components of y.

The exact model of equation (8) is

w = a+Ap+Bs [x� p0w � p0Ap=2; z] + " (14)

and the version we estimate has matrix B and functions s described by equation

(10). A complication for estimation of these models is that w, which is endogenous,

appears on the right side of this expression. One possible estimator is to ignore

this endogeneity and simply estimate the system of equations (14) by nonlinear least

squares.

To formally account for endogeneity, nonlinearity, and heteroskedasticity, we use

an instrumental variables estimator. Let r be an M�vector of observable variablesthat are uncorrelated with ", which will be used as instruments for estimation. If

E(" j x;p; z) = 0J then r can include any bounded functions of p, z, and x. However,if unobserved heterogeneity is correlated with some observed characteristics such as x

or elements of z, then those elements should be excluded from r. Let r = (r1,...,rM)0.

It then follows from E(" j r) = 0J that

E [(w � a�Ap�Bs (x� p0w � p0Ap=2; z)) r`] = 0J , ` = 1,...,M (15)

If " is homoskedastic, then one possible estimator based on these moment conditions is

nonlinear three stage least squares. Otherwise, allowing for generally heteroskedastic

", parameters may be estimated by applying GMM to this set of moment conditions.

By adding up, the moments associated with the J�th good are super�uous, so with

M instruments we will have (J � 1)M moments, which is a rather large number of

moments. The matrix A is constrained to be symmetric.

Writing this system linearly asw = a+Ap+Bs(y; z)+" suggests that good instru-

ments r should be highly correlated with s(y; z) and p. We assume E(" j p; x; z) = 0Jand take r to be s(y; z) and p, using y = x� p0

�w �Ap=2

�where w is the average

budget shares across consumers in our sample, and A is the estimated value of A

based on estimation of the linear approximate model. Speci�cally, the instruments

we use for the model (10) are therefore 1, y�1, y, y2,...,y5, z, yz, and p. Note that

use of w and possible inconsistency of A in the construction of y only a¤ects the

quality of the instruments r and hence e¢ ciency of the GMM estimation, but does

cause inconsistency of the GMM, because r remains uncorrelated with " given any

choice of these parameters w and A. Cross terms such as xp could also be included

as additional instruments. Reasonable starting values for the GMM estimator may

be obtained from the linear approximate model.

Since we have assumed E(" j p; x; z) = 0J , we could interpret Bs(y; z) as a sieveapproximation to a general unknown vector of smooth functionsm(y; z), where s(y; z)

are elements of a spanning basis. Ai and Chen (2003) provide associated rates, optimal

instrument construction, and limiting distribution theory for this GMM estimator,

which can attain the semiparametric e¢ ciency bound for parameters such as A.

The parametric GMM estimator can be readily modi�ed to deal with possible

measurement error or endogeneity in some regressors, by suitably modifying r. For

example, if log total-expenditure x su¤ers from measurement error then one might

use functions of income (eg, log income) instead of functions of x in r to instrument

the problem. Similarly, if unobserved preference heterogeneity is correlated with

some observed taste shifters (i.e., elements of z), then those may be excluded from

the instrument list and replaced by, e.g, nonlinear functions x and of the remaining

elements of z. However, in these cases one would need to take care in interpreting the

resulting model residuals b", because they will then contain both unobserved preferenceheterogeneity and measurement error. With panel data one might separate these two

e¤ects by modeling the unobserved preference heterogeneity using standard random

or �xed e¤ects methods.

These estimators do not impose the inequality constraints implied by regularity

of the cost function, or equivalently by utility maximization. If the model is correctly

speci�ed then imposing these constraints would not be binding asymptotically. These

inequalities can be checked using the elasticity calculations, e.g., the estimated Slutsky

matrix can be checked for negative semide�niteness.

9 Empirical Work

The data used in this paper come from the following public use sources: (1) the Family

Expenditure Surveys 1969, 1974, 1978, 1982, 1984, 1986, 1990, 1992 and 1996; (2)

the Surveys of Household Spending 1997, 1998 and 1999; and (3) Browning and

Thomas (1999), with updates and extensions to rental prices from Pendakur (2002).

Price and expenditure data are available for 12 years in 5 regions (Atlantic, Quebec,

Ontario, Prairies and British Columbia) yielding 60 distinct price vectors. Prices are

normalised so that the price vector facing residents of Ontario in 1986 is (1; :::; 1),

that is, these observations de�ne the base price vector.

Table 1 gives summary statistics for 19276 observations of rental-tenure house-

holds located in cities with at least 30,000 residents. The empirical analysis uses

annual expenditure in nine expenditure categories: food-in, food-out, rent, cloth-

ing, household operation, household furnishing & equipment, private transportation

operation, public transportation and personal care. Personal care is the left-out

equation, yielding eight expenditure share equations which depend on 9 log-prices

and log-expenditure. These expenditure categories account for about three quarters

of the current consumption of the households in the sample.

We estimate expenditure-share equations of the form (10) corresponding to the

cost function (9), where z is a the vector of 7 demographic characteristics reported

in Table 1. For our speci�cation of m, we use powers of y in [�1; :::; 5] and allow forinteractions between z and y, so that

w = a+Ap+ b�1y�1 + b1y + b2y

2 + :::+ b5y5 +Cz+Dzy + ":

To estimate, we need instruments orthogonal the behavioural heterogeneity parame-

ters " to deal with endogeneity of log real-expenditure y. As discussed in the previous

section, we use (y)k as an instrument for yk, where y = x� p0�w �Ap=2

�, w is the

average budget shares across consumers in our sample, and A is a pre-estimate of

A. This instrument y ranges from 7.68 to 11.57, and has somewhat lower vari-

ance than log expenditure. To mitigate numerical di¢ culties associated with our

high-order polynomial in y, we subtract 7.5 from log-expenditure (and thus from log

real-expenditure and its instrument as well).

When symmetry is not imposed on A, the model is exactly identi�ed, and when

symmetry is imposed on A, then the model has (J � 1) (J � 2) =2 = 28 overidentify-ing moments. With symmetry constraints, the model has 188 parameters. The model

is estimated by the GMM, with starting values obtained by seemingly unrelated re-

gressions estimation of the linear approximate model. We obtained convergence in

�ve iterations, due to the near linearity of the model. We obtain a test statistic for

overidentifying restrictions of 1836, distributed under the null as a �228. Since the

overidentifying restrictions are symmetry restrictions, this is a joint test of instru-

ment validity and Slutsky symmetry. It is likely that this rejection of symmetry

is due to not in part to constraining A to be constant across all consumers. This

constraint could be relaxed by letting A depend upon z.

Tables 2-4 in the appendix show estimated coe¢ cients for our model. The left

side of each table gives GMM estimates for the exact model, and the right side of

the table gives estimates for the linear approximate model (estimated using SUR)

which ignores both the endogeneity of y and the appearance of p0Ap=2 in y. Table

2 gives parameters describing the shape of the Engel curve, a and B. Table 3 gives

parameters describing demographic e¤ects, C and D. Table 4 gives parameters

describing price e¤ects, A. A summary of own price e¤ects is:

j wj Ajj

food-in 0.22 -0.47

food-out 0.06 -0.45

rent 0.33 0.097

clothing 0.10 0.029

operation 0.08 -0.033

furnish/equip 0.05 0.046

private trans 0.09 0.023

public trans 0.03 -0.002

Note that the Hicks elastiticity of the quantity of good j with respect to its own price

is (Ajj=wj)� 1.Turning to income e¤ects, Figures 1 and 2 show estimated budget share equations

from GMM estimates for 8 of the 9 budget shares for a household where z = 0, a

single childless adult aged 40. Each budget share function is evaluated at the base

price vector p = 0J (that faced by residents of Ontario in 1986) at each percentile of

the distribution of y for single childless adults. In each curve, there are 99 dots rep-

resenting 99 percentiles. At di¤erent price vectors, the expenditure-share equations

over y are shifted vertically by Ap, and so have the same shape over log real expendi-

ture y. For households with di¤erent demographic characteristics, expenditure-share

equations are shifted vertically by Cz and have a linear e¤ect Dzy added to them.

The estimated expenditure-share equation for Food at Home is almost linear,

which is consistent with many other parametric and nonparametric studies that �ne

nearly linear food shares. The Rent expenditure-share equation exhibits substantial

nonlinearity beyond what a quadratic model could capture. The private transporta-

tion operation equation is approximately S-shaped, and the clothing equation appears

more like two linear pieces joined with a curve. Figure 2 suggests two in�ection points

for the public transportation equation and one in�ection point at very low expenditure

for the food out equation with near linearity above the median..

Figures 3 and 4 overlay a quadratic version of our model for selected expenditure

shares. This quadratic version is given by

w = a+Ap+ b1y1 + b2y

2 +Cz+Dzy + ";

and, in comparison with our baseline model, lacks both the term in the reciprocal

of log real-expenditure, and higher-order terms in log real-expenditure. This model

di¤ers from the Quadratic Almost Ideal (QAI) model of Banks, Blundell, and Lewbel

(1997) in that it has a di¤erent de�ator for total expenditures, and does not require

the coe¢ cient of squared log real-expenditures to depend on prices. Our speci�cation

also allows for more general demographic e¤ects than is typical in applied QAI models,

which usually only have terms similar to C, but not D.

Here, we see that non-quadratic features may be important in a few expenditure-

share equations. At the lower end of the expenditure distribution, the rent share

is mapped poorly by the quadratic model, overstating rent shares by as much as

5 percentage points for the poorest households. Similarly, the expenditure share of

poorer households on private transportation operation is understated by the quadratic

model, and indeed is predicted to be negative for the very poorest households. The

private transportation operation equation is also over-estimated for the second quintile

of the expenditure distribution by as much as 3 percentage points.

Turning to Figure 4, we see some small e¤ects at the bottom of the expendi-

ture distribution: public transportation shares are overstated and food out shares

overstated by the quadratic model.

Figures 5 and 6 overlay SUR estimates of the approximate model

w � a+Ap+ b�1ey�1 + b1ey + b2ey2 + :::+ b5ey5 +Cz+Dzey + "where approximate log real-expenditure ey = x � p0w is treated as though it were

exogenous and exact. The message of these �gures is that the SUR does a remarkably

good job of getting the shape of expenditure-share equations right. The bias due

to the endogeneity of approximate log real expenditure is small and to measurement

error resulting from the exclusion of p0Ap=2 in that approximation is pretty small.

10 Conclusions

We have provided a utility derived consumer demand system that is as �exible in

price responses and as close to linear in parameters as the Almost Ideal Demand

System, but also allows for virtually unrestricted �exibility in Engel curve shapes for

each good. Error terms in the model correspond to coherent and invertible unobserved

preference heterogeneity parameters. Demand functions are estimated by GMM, and,

like the AID system, an approximate model can be estimated by linear regression. We

empirically estimate the model allowing Engel curves to be high order polynomials,

and �nd some signi�cant departures from linear or quadratic demands.

The model is an application of a new general methodology we propose for con-

structing demand systems, which we call pseudo-Marshallian demands. These are

essentially the Hicks demands associated with cost functions having the property that

utility can be represented by total expenditures de�ated by an easily constructed price

de�ator, such as a variant of a Stone index.

References

Ai, Chunrong and Xiaohong Chen, 2003, E¢ cient Estimation of Models with

Conditional Moment Restrictions Containing Unknown Functions, Economet-

rica, 71(6), pp 1795-1944.

Banks, Blundell, R. and A. Lewbel. 1997. Quadratic Engel curves and consumer

demand, Review of Economic studies 79, 527�539.

Beckert, W. and R. Blundell, 2004, Invertibility of Nonparametric Stochastic

Demand Functions, Birkbeck Working Papers in Economics and Finance No.

406.

Blackorby, C., and D. Donaldson. 1993, "Adult-Equivalence Scales and the

Economic Implementation of Interpersonal Comparisons of Well-Being." Social

Choice and Welfare 10, 335-61.

Blundell, Richard, Xiaohong Chen and Dennis Kristensen, 2003, Semi-

Nonparametric IV Estimation of Shape-Invariant Engel Curves, Centre for Mi-

crodata Methods and Practice, working paper CWP15/03.

Blundell, R., A. Duncan and K. Pendakur. 1998. "Semiparametric Estimation

of Consumer Demand", Journal of Applied Econometrics 13, pp 435-461.

Blundell, R. , and A. Lewbel. "The Information Content of Equivalence Scales."

Journal of Econometrics 50 (1991): 49-68.

Blundell, R. W., P. Pashardes, and G. Weber (1995), �What do we learn about

Consumer Demand Patterns from Micro Data?�, American Economic Review,

vol. 83, 570-597.

Brown, D. and R. Matzkin, 1995, "Estimation of a Random Utility Model From

Data on Consumer Demand,"

Browning, Martin and Irena Thomas, 1999, Prices for the FAMEX, unpublished

working paper, Department of Economics, McMaster University.

Browning, Martin, 2001, Modelling Commodity Demands and Labour Supply

with M-Demands, unpublished working paper, Institute of Economics, Univer-

sity of Copenhagen, 21 pages.

Deaton, A. S., and J. Muellbauer (1980), �An Almost Ideal Demand System,�

American Economic Review, vol. 70, 312-326.

Gorman, W. M. (1981), "Some Engel Curves," in Essays in the Theory and

Measurement of Consumer Behaviour in Honor of Sir Richard Stone, ed. by

Angus Deaton, Cambridge: Cambridge University Press.

Lewbel, A. "Household Equivalence Scales and Welfare Comparisons." Journal

of Public Economics 39 (1989): 377-91.

Lewbel, A. (1991), �The Rank of Demand Systems: Theory and Nonparametric

Estimation,�Econometrica, 59, 711-730.

Pendakur, K. 1999. "Estimates and Tests of Base-Independent Equivalence

Scales", Journal of Econometrics 88(1), pp 1-40.

Pendakur, Krishna. 2002. "Taking Prices Seriously in the Measurement of In-

equality", Journal of Public Economics, October 2002, 86(1), pp 47-69.

Pendakur, Krishna and Stefan Sperlich, 2005, Semiparametric Estimation of

Consumer Demand Systems with Almost Partially Linear Price E¤ects, unpub-

lished working paper, Department of Economics, Simon Fraser University.

11 Appendix

This appendix provides proofs, extensions, and additional mathematical properties of

our models, which are relevant for evaluating these models and for other applications

of our general methodology.

11.1 Proofs

Proof of Theorem 1: The condition that d(�p+1J�; u) = �d(p; u)+� is equivalentto d(p; u) being linearly homogeneous in p and exp[d(p; u)] linearly homogeneous

in exp(p). The latter condition is required by cost function regularity. For the

former, given equation (1) we have C(p; u) � u = p0 [rpC(p; u)]. This is equivalent

to d(p; u) = p0 [rpd(p; u)], which by the Euler Theorem shows that d(p; u) � u islinearly homogeneous in p of degree one.

Proof of Corollary 1: Homogeneity of degree zero in exp(p) follows from or-

dinary cost function regularity, and homogeneity of degree zero in p follows from

equalling the derivative of a linearly homogeneous function.

Proof of Theorem 2: Since d(�p1 + �; �p2 + �; u) = �d(p1; p2; u) + � holds

for any � and �, let � = 1=(p2 � p1) and � = �p1=(p2 � p1) to obtain d(0; 1; u) =[d(p1; p2; u)= (p2 � p1)]� [p1= (p2 � p1)]. The theorem holds withm(u) = 1�d(0; 1; u).

Proof of Theorem 3: The condition that d(p+1J�; u) = d(p; u)+� is equivalentto exp[d(p; u)] linearly homogeneous in exp(p), which is required by cost function

regularity. Given equation (3) we have C(p; u) � u � t(p) = p0 [rpC(p; u)]. This is

equivalent to d(p; u)� t(p) = p0 [rpd(p; u)]. Taking the derivative of this expression

with respect to u gives

rud(p; u) = p0 [rprud(p; u)] (16)

which by the Euler Theorem shows that rud(p; u) is linearly homogeneous in p,

that is, rud(�p; u) = �rud(p; u). In the other direction, if rud(p; u) is linearly

homogeneous then equation (16) holds. Integrate this equation with respect to u

(de�ning t(p) be the constant of integration) and continue following the above steps

in reverse to obtain equation (3). Given exactness, Equation (3) implies t(p) =

u�C(p; u)+p0!(p; u). Ordinary homogeneity of cost functions and Marshallian de-mands requires that exp [u� C(p; u) + p0!(p; u)] be linearly homogeneous in exp(p),so t(p+ 1J�) = t(p) + �.

Proof of Theorem 4: C(p; u) = C(p; u) + T (p), so

C(p; u)� p0[rpC(p; u)] = C(p; u)� p0[rpC(p; u)] + T (p)� p0 [rpT (p; u)]

= u+ T (p)� p0 [rpT (p; u)]

where the second equality follows from Stone de�ator exactness for C(p; u) from

Theorem 1, and this expression by (3) implies exactness of the de�ator p0w+T (p)�p0 [rpT (p; u)].

Proof of Theorem 5: Let R be a scalar valued function of rpC(p; u) and p

that is strictly monotonic in u. Since R(p;!(p; u)) is strictly monotonic in u, it can

be inverted to obtain u = S(p; R), and we may de�ne a function g by g(w;p) =

C(p; S(p; R(w;p)))� S(p; R(w;p)).

11.2 Extensions

Instead of Stone index or related de�ators, we could more generally de�ne de�ated

log total expenditures by

y = x� g(w;p) (17)

for any de�ator function g. Exactness of the de�ator g(w;p) would then require that

y = u, and therefore that

C(p; u) = u+ g(rpC(p; u);p) (18)

The resulting Pseudo-Marshallian demand functions would then be

w = ! [p; x� g(w;p)] . (19)

where !(p; u) = rpC(p; u). At this level of generality, there exists a de�ator g(w;p)

that is exact for almost any log cost function C(p; u), as the following Theorem shows.

Theorem 5: Assume C(p; u) is regular. A su¢ cient condition for existence of afunction g(w;p) that is an exact de�ator for C(p; u) is the existence of some scalar

valued function of rpC(p; u) and p that is strictly monotonic in u.

Theorem 5 provides an extremely weak su¢ cient condition for existence of g,

and hence existence of pseudo-Marshallian demands. For example, this condition

is satis�ed if any good or combination of goods has a budget share that is strictly

increasing or strictly decreasing in total expenditures (and hence in utility). Moreover,

this condition is su¢ cient but not necessary, e.g., if preferences are homothetic, then

C(p; u) = u+t(p) for some function t and we may let g(w;p) = �t(p). The only kindsof situations where g might fail to exist is when all budget shares are independent of

u for some (but not all) ranges of values of u.

To give an example of Theorem 5, suppose that the Hicksian budget share function

for good 1, w1 = !1(p; u), is invertible in u, so u = !�11 (p; w1). Then a de�ator g

that makes u = y is g(w;p) = C(p; !�11 (p; w1))� !�11 (p; w1).The function g satisfying equation (18) is not unique, e.g., it can be freely trans-

lated by an ordinal transformation of u, and in the previous example, di¤erent in-

vertible budget shares could be used to obtain di¤erent expressions for g. For appli-

cations, the main point of Pseudo-Marshallian demands is not existence, but rather

convenience for demand estimation, which is why this paper focused on examples

where g has simple closed forms, like the log Stone de�ator.

One potentially interesting generalization of the cost function (4), is the general

class of models

x = C(p; u; z; ") = u+ c(p; z; ")0 em(u; z; ") + d(p; z; ") (20)

where a is a J�vector of constants with a01J = 1, em(u; z; ") and c(p; z; ") areJ�vector valued functions, d(p; z; ") is a scalar valued function, 1+ru em(u)0c(p; z; ") >0, em(u)01J = 1, exp[c(p; z; ")] is di¤erentiable and homogeneous of degree one in

prices, exp [d(p; z; ")] is homogeneous of degree zero in prices, rpc(p; z; ") is nonsin-

gular, and exp[C(p; u; z; ")] is concave. By Shephard�s lemma the Hicksian budget

shares for this class are

w = !(p; u; z; ") = rpc(p; z; ")0 em(u; z; ") +rpd(p; z; ") (21)

Solving this equation for em(u; z; ") and substituting the result into equation (20)gives the Pseudo-Marshallian demand functions

w = rpc(p; z; ")0 em(y; z; ") +rpd(p; z; ")

where real log total expenditures y are given by

y = x� d(p; z; ") + c(p; z; ")0 [rpc(p; z; ")0]�1[w � d(p; z; ")]

Assuming d(p; z; ") and c(p; z; ") are independent of " and em(y; z; ") is additive in aconstant vector a and in " gives the model

x = C(p; u; z; ") = u+ c(p; z)0 [a+m(u; z) + "] + d(p; z) (22)

which has real log expenditures and pseudo Marshallian demand functions

y = x� d(p; z) + c(p; z)0 [rpc(p; z)0]�1[w � d(p; z)] (23)

w = rpc(p; z)0 [a+m(y; z)] +rpd(p; z) + e". (24)

This makes y depend only on observables, and has Pseudo-Marshallian demand func-

tions that are additive in errors e" = rpc(p; z)0". These errors are conditionally

mean zero if " is conditionally mean zero, so this general model can still be esti-

mated by GMM given parameterizations of the functions c(p; z), d(p; z), andm(y; z).

Our estimated models are the special case of equation (22) where c(p; z) = p,

d(p; z) = p0Ap=2, and m(y; z) = Bs(y; z). The generalizations above show how

further interactions between p, z, and y can be introduced in our model, if empiri-

cally required.

11.3 Coherency and Invertibility

Coherency of a structural model is de�ned as the property that, for each value of

the exogenous variables and errors, there exists a unique corresponding value of the

endogenous variables, and invertibility is the property that a unique value of errors is

associated with each value of the endogenous and exogenous variables. In the demand

system context (see, e.g., Brown and Matzkin (1995) or Beckert and Blundell (2004)),

coherency requires that a unique value of budget shares w be associated with each

possible value of p; x; z; ", and invertibility requires that a unique value of errors "

be associated with each possible value of p; x; z;w.

Consider the class of cost functions (22). Invertibility of this class (and hence of

our estimated models) is established by observing that, by equation (24),

" = [rpc(p; z)0]�1[w �rpd(p; z)]� a�m(y; z)

where y is given by equation (23). Coherency follows from C(p; u; z; ") having all

of the properties of a regular cost function, which ensures existence of Marshallian

demands (treating " as preference parameters), even though we do not have a closed

form analytic expression for them. This coherency is local in the sense that it requires

joint restrictions on variables, errors and parameters to attain the assumed regularity

of C.

11.4 Closure Under Unit Scaling

A desirable feature of demand models is that they be closed under unit scaling, that

is that a change in the units that goods are measured in only changes the values of

the parameters or functions that de�ne the model. The general model (22) is closed

under unit scaling. To see this, observe that changing the units of measurement is

equivalent to replacing p with p+k for some J�vector of constants k. This changes

the cost function to

C(p; u; z; ") = u+ c(p+ k; z)0 [a+m(u; z) + "] + d(p+ k; z)

= u+ c�(p; z)0 [a+m(u; z) + "] + d�(p; z)

so changing the units just changes the functions c(p; z) and d(p; z) but leaves the

general form of the model unchanged.

The general form of the model we estimated, equation (4) is also closed under unit

scaling, since

C(p; u; z; ") = u+ (p+ k)0 [a+A(p+ k)=2 +m(u; z; ")]

= u+ k0m(u; z; ") + k0(a+Ak=2) + p0 [(a+Ak) +Ap=2 +m(u; z; ")]

= u� + p0 [a� +Ap=2 +m�(u�; z; ")]

where a� = a+Ak, u� = h(u; z; ") = u+k0m(u; z; ")+k0(a+Ak=2), andm�(u�; z; ") =

m(h�1(u�; z; "); z; "), where h�1 denotes the inverse of h(u; z; ") with respect to its

�rst element, and invertibility of h(u; z; ") follows from strict monotonicity of the

cost function with respect to u. This reinforces the point made earlier that u may

also be implicitly a function of z; ", since by ordinality, replacing u with h(u; z; ")

for any function h that is strictly monotonically increasing in its �rst element leaves

Marshallian demands unchanged.

Although the general form of our model is closed under unit scaling, speci�c

parametric functional forms for m might not be, since the functional form for m�

need not be the same as the functional form for m (though they are the same when

m is linear in its arguments). When m� does not have the same functional form

as m, closure under unit scaling can be attained by introducing k as an additional

parameter vector to be estimated. The presence of a implies that, without loss of

generality, we may take k01J = 0. In our empirical application, attempts to estimate

a parameter k were always either completely insigni�cant or failed to converge.

11.5 Additional Elasticity Calculations

This section provides elasticity calculations in addition to those provided in the section

on applying the model. Marshallian semielasticities are obtained by taking total

derivatives and solving. With

w = a+Ap+m [x� p0(w �Ap=2); z; "] ,

Marshallian total expenditure semielasticities are

rxw(p; x; z; ") = ry!(p; y; z; ") [rx(x� p0(w(p; x; z; ")�Ap=2))]= rym(y; z; ") [1� p0rxw(p; x; z; ")]

= rym(y; z; ")� [rym(y; z; ")]p0rxw(p; x; z; ")

= [IJ +rym(y; z; ")p0]�1rym(y; z; ")

where IJ is the J by J identity matrix. Marshallian price e¤ects are most easily

obtained from the above Hicksian price e¤ects and Marshallian total expenditure

e¤ects by the Slutsky matrix

rp0w(p; x; z; ") = rp0!(p; u; z; ")� [rxw(p; x; z; ")]!(p; y; z; ")0.

From the general model, Hicksian demographic semielasticities are

rz0!(p; u; z; ") = rz0m(u; z; ") = Brz0s(u; z).

These are equivalently the demographic semielasticities holding real expenditures y

�xed. Marshallian demographic semielasticities are obtained by

rz0w(p; x; z; ") = rz0m(y; z; ")�rym(y; z; ")p0rz0w(p; x; z; ")

= [IJ +rym(y; z; ")p0]�1rz0m(y; z; ").

The derivative of log nominal expenditures x with respect to utility u is

ruC(p; u; z; ") = 1 + p0 [rum(u; z; ")] = 1 + p

0 [rym(y; z; ")]

and the derivative of the indirect utility function with respect to x equals 1=ruC(p; u; z; ").

Some of the above elasticity expressions depend on " either directly or through w.

Given estimated model parameters, " for each consumer can be estimated as the model

residuals (the di¤erence between �tted and observed w). If each consumer is only

observed onece, the empirical distribution of the estimated residuals " can be used

as an estimate of the population distribution of unobserved preference heterogeneity.

Empirical moments or quantiles of " are generally consistent estimates of population

moments or quantiles. If the same consumers are observed more than once (panel

data), then each observation of the same consumer will likely yield a di¤erent ", which

can either be interpreted as time varying preferences, or the mean of the estimated

" for each individual could be modeled as a �xed preference e¤ect, with remaining

variation attributed to measurement error.

Similarly, in any price regime, features of the distribution of y = x�p0 (w �Ap=2)may be estimated by calculating the corresponding features of the distribution esti-

mated y across consumers observed in that price regime. For example, to estimate the

mean and variance of the average elasticities of budget shares with respect to x, based

on the general model (4) and the linear in coe¢ cients speci�cation (??), we may takethe sample mean and variance of each element of [IJ + [Brys(yi; zi)]p

0]�1 [Brys(yi; zi)]

divided by the corresponding element of wi across all consumers i that faced prices

p, where yi = xi�p0 (wi �Ap=2), and parameters are replaced with their estimatedvalues. Moreover, since yi is ordinally equivalent to ui, social welfare calculations can

be based directly estimates of each consumer�s yi.

If Marshallian demand functions are of direct interest, evaluated at points other

than those observed in the sample using the above formulas, they can be obtained

numerically by, e.g., numerically solving x = C(p; u; z; ") for u, with C given by equa-

tion (4), and substituting the result into the Hicksian demand functions. However, as

the above equations show, this will often not be necessary for evaluating the e¤ects

on demand or welfare of price, expenditure, or demographic changes.

11.6 Shape Invariance

A property of some demand functions that is relevant for the construction of equiv-

alence scales, and has been found to at least approximately hold empirically in some

consumer demand applications, is shape invariance. See, e.g., Blundell and Lewbel

(1991), and Blundell, Duncan and Pendakur (1998), Pendakur (1999), and Blundell,

Chen, and Kristensen, (2003). In our notation, shape invariance can be de�ned as

Marshallian Engel curves of the form

w = h [x� t0(z; ")] + t1(z; ")

for some function t0 and some J�vector valued functions h and t. In a graph ofengel curves t1(z; ") is a vector of vertical shifts of each budget share and t0(z; ") is

a horizontal shift of x. Utility derived demand functions possess shape invariance if

and only if

C(p; u; z; ") = h0[p; h(u; z; ")] + h1(p; z; ") (25)

for some functions h0, h1, and h2. Shape invariance is the only observable implica-

tion of equivalence scale exactness or independence of base of equivalence scales (see

Lewbel 1989, Blackorby and Donaldson 1993, and Pendakur 1999), which is the same

as equation (25) with the restriction that h(u; z; ") = h(u) for some function h.

A special case of the class of cost functions (22) is

x = C(p; u; z; ") = u+ c(p)0 [a+m0(u) +m1(z) + "] + d(p; z) (26)

which satis�es the above de�nition of shape invariance. The cost function (9) is

a special case of equation (26) when D is a matrix of zeroes. In that case, the

Marshallian Engel curves of this model simplify to w = h [x� �0(Cz+ ")] +Cz+ "for some J�vector valued function h. This shows that our model with zeroD providesa functional form for vertical shifts t1(z; "), and restricts the horizontal shift t0(z; ")

to equal a weighted sum of the vertical shifts. Nonzero D in our model violates shape

invariance by providing more �exibility with respect to z than simple horizontal and

vertical shifts.

An alternative model speci�cation that also provides interactions between y and

z while satisfying shape invariance is the cost function

C(p; u; z; ") = u+ p0�a+Ap=2 + b1(u� dz) + :::+ bK(u� dz)K +Cz+ "

�(27)

where d is a J�vector of constants with d01J = 0. This again has y = x � p0w �p0Ap=2 and has pseudo-Marshallian budget shares

w = a+Ap+ b1(y � dz) + :::+ bK(y � dz)K +Cz+ ". (28)

The cost function (27) is a special case of equation (26), and so is shape invariant.

This model has Marshallian Engel curves of the form w = h(x� dz��0") +Cz+ "for some J�vector valued function h, and so has vertical shifts t1(z; ") = Cz+ ",

and horizontal shifts dz � �0". We empirically estimated this model, but do notreport results because this model yielded very similar Engel curves and elasticities,

but had multiple local minima and other numerical problems that were likely due to

the nonlinearity dz: This alternative model might be prefered in applications involving

equivalence scale calculations, where independence of base is desired.

11.7 Indirect Utility and Marshallian Demands

Our model does not have an explicit analytic expression for indirect utility or for

Marshallian demands, except under special cases, but this section provides some par-

tial characterizations of these functions. Implicitly solving the cost function equation

(9) for u gives indirect utility

u = V [x� p0(Cz+ ");p0Dz;p]

where the function V is a root of a K 0th order polynomial which depends upon the

parameters a, A, b1; :::bK . This can be substituted into the associated Hicksian

demand functions w = a+Ap+b1u+ :::+bKuK +Cz+Dzu+" to get Marshallian

demands. Similarly, solving the cost function (27) for u gives indirect utility

u� = v[x� dz� p0(Cz+ ");p] (29)

where v is also a root of a K 0th order polynomial, and u� = u�dz is an unobservablemonotonic transformation of utility. Applying Roy�s identity to equation (29) shows

that Marshallian demands for this model have the form

w = h[x� dz� p0(Cz+ ");p] +Cz+ " (30)

where the J�vector valued function h is the negative of the ratio of the derivativesof v with respect to it�s second and �rst arguments.

The following approximation may provide additional insight into the structure

of Marshallian demands. Substituting y = x � p0w � p0Ap=2 into (8) and Taylorexpanding the result around the point p = 0J gives, with the general cost function

(7), w � a+Ap+Bs(x; z) + ", and substituting this expression into (??) yields

y � x� p0 (a+Ap=2 +Bs(x; z) + ") : (31)

Finally, substituting this approximate expression for y into the Pseudo-Marshallian

demand function (8) gives approximate Marshallian demands

w � a+Ap+Bs [x� p0 (a+Ap=2 +Bs(x; z) + ") ; z] + "

These expressions show that while our prefered model speci�cation has no interaction

terms between p and z,y, there do exist interactions between p and z,x.

11.8 Engel Curves

Evaluating equation (8) at some arbitrary price vector p = p gives Pseudo-Marshallian

Engel curves

w = �+Bs(y; z) + " (32)

with

y = x� � �0w (33)

where � = a+Ap, � = p, and = ��0A�=2. The Pseudo-Marshallian Engel curvesfor models (9) and (27) are therefore w = �+ b1y+ :::+ bKyK +Cz+Dzy+ " and

w = �+ b1(y � dz) + :::+ bK(y � dz)K +Cz+ " with y = x� � �0w. Based on

equation (31), approximate Marshallian Engel curves are given by equation (32) with

y � x� � �0 [�+Bs(x; z) + "] (34)

By equation (30), the exact Marshallian Engel curves for the (27) model have the

shape invariant form

w = h(x� dz� �0") +Cz+ "

where d = d� �0C, for some J�vector valued function h.

11.9 Data and Parameter Estimates

Table 1: The Data Min Max Mean Std Dev

expenditure shares food-in 0.00 0.81 0.22 0.09

food-out 0.00 0.64 0.06 0.10

rent 0.01 0.95 0.33 0.06

clothing 0.00 0.59 0.10 0.06

operation 0.00 0.64 0.08 0.05

furnish/equip 0.00 0.65 0.05 0.06

private trans 0.00 0.59 0.09 0.08

public trans 0.00 0.45 0.03 0.04

log-expenditure 6.58 11.61 9.38 0.64

childless single adult dummy 0 1 0.36 0.48

childless adult couple dummy 0 1 0.22 0.41

log household size no children 0 1.95 0.21 0.37

w/ children 0 2.40 0.46 0.64

single parent dummy 0 1 0.06 0.23

age of head (less 40) no children -15 24 1.11 9.73

w/ children -15 24 -1.42 5.05

Table 2: Engel Curve Parameters, GMM and

SUR

GMM-exact SUR-approximate

Commodity Parameter Estimate Std Err Estimate Std Err

Food at Home a 0.344 0.751 0.303 0.177

Food at Home y1 0.052 1.340 0.135 0.346

Food at Home y2 -0.243 1.141 -0.319 0.321

Food at Home y3 0.137 0.499 0.174 0.153

Food at Home y4 -0.033 0.108 -0.041 0.036

Food at Home y5 0.003 0.009 0.004 0.003

Food at Home yinv -0.016 0.151 -0.015 0.033

Food Out a 0.251 0.135 0.210 0.135

Food Out y1 -0.416 0.270 -0.323 0.265

Food Out y2 0.385 0.253 0.299 0.246

Food Out y3 -0.159 0.121 -0.122 0.117

Food Out y4 0.033 0.028 0.025 0.027

Food Out y5 -0.003 0.003 -0.002 0.003

Food Out yinv -0.037 0.024 -0.029 0.025

Rent a -0.441 0.786 -0.372 0.231

Rent y1 1.679 1.411 1.540 0.453

Rent y2 -1.215 1.208 -1.077 0.421

Rent y3 0.383 0.532 0.312 0.200

Rent y4 -0.057 0.116 -0.039 0.047

Rent y5 0.003 0.010 0.001 0.004

Rent yinv 0.124 0.158 0.109 0.043

Clothing a 0.168 0.290 0.169 0.107

Clothing y1 -0.095 0.498 -0.120 0.209

Clothing y2 -0.002 0.416 0.041 0.195

Clothing y3 0.029 0.181 0.001 0.093

Clothing y4 -0.010 0.039 -0.002 0.022

Clothing y5 0.001 0.003 0.000 0.002

Clothing yinv -0.007 0.063 -0.008 0.020

HH Operation a -0.044 0.114 -0.076 0.127

HH Operation y1 0.124 0.236 0.214 0.248

HH Operation y2 -0.154 0.233 -0.253 0.230

HH Operation y3 0.095 0.117 0.147 0.110

HH Operation y4 -0.024 0.029 -0.038 0.026

HH Operation y5 0.002 0.003 0.004 0.002

HH Operation yinv 0.007 0.020 0.010 0.023

HH Furn/Eq a -0.129 0.131 -0.058 0.129

HH Furn/Eq y1 0.320 0.263 0.123 0.253

HH Furn/Eq y2 -0.298 0.246 -0.098 0.235

HH Furn/Eq y3 0.147 0.117 0.054 0.112

HH Furn/Eq y4 -0.031 0.027 -0.011 0.026

HH Furn/Eq y5 0.002 0.002 0.001 0.002

HH Furn/Eq yinv 0.034 0.023 0.038 0.024

Priv Trans a 0.747 0.226 0.617 0.180

Priv Trans y1 -1.560 0.401 -1.255 0.353

Priv Trans y2 1.411 0.346 1.092 0.328

Priv Trans y3 -0.567 0.155 -0.407 0.156

Priv Trans y4 0.104 0.034 0.067 0.037

Priv Trans y5 -0.007 0.003 -0.004 0.003

Priv Trans yinv -0.110 0.047 -0.093 0.033

Pub Trans a -0.018 0.084 0.072 0.094

Pub Trans y1 0.088 0.171 -0.092 0.185

Pub Trans y2 -0.045 0.160 0.124 0.171

Pub Trans y3 0.000 0.076 -0.081 0.082

Pub Trans y4 0.005 0.018 0.024 0.019

Pub Trans y5 -0.001 0.002 -0.003 0.002

Pub Trans yinv 0.013 0.014 -0.001 0.017

Table 3: Demographic E¤ects, GMM and SUR

GMM-exact SUR-

approximate

Commodity Parameter Estimate Std Err Estimate Std Err

Food at Home childless couple 0.009 0.014 0.013 0.011

Food at Home head age (kids) 0.002 0.001 0.001 0.001

Food at Home head age (no kids) 0.003 0.000 0.003 0.000

Food at Home log size (kids) 0.220 0.010 0.214 0.007

Food at Home log size (no kids) 0.200 0.018 0.199 0.014

Food at Home single parent -0.015 0.013 -0.003 0.011

Food at Home y1*childless couple -0.009 0.006 -0.010 0.005

Food at Home y1*head age (kids) 0.000 0.000 0.000 0.000

Food at Home y1*head age (no kids) -0.001 0.000 -0.001 0.000

Food at Home y1*log size (kids) -0.047 0.004 -0.046 0.003

Food at Home y1*log size (no kids) -0.040 0.007 -0.041 0.006

Food at Home y1*single parent 0.000 0.006 -0.005 0.005

Food Out childless couple -0.016 0.008 -0.017 0.009

Food Out head age (kids) 0.000 0.000 0.000 0.001

Food Out head age (no kids) -0.001 0.000 -0.001 0.000

Food Out log size (kids) -0.018 0.005 -0.023 0.006

Food Out log size (no kids) -0.035 0.010 -0.038 0.011

Food Out single parent 0.006 0.006 0.001 0.008

Food Out y1*childless couple 0.005 0.004 0.005 0.004

Food Out y1*head age (kids) 0.000 0.000 0.000 0.000

Food Out y1*head age (no kids) 0.000 0.000 0.000 0.000

Food Out y1*log size (kids) -0.011 0.003 -0.008 0.003

Food Out y1*log size (no kids) 0.000 0.005 0.003 0.005

Food Out y1*single parent -0.009 0.003 -0.007 0.004

Rent childless couple 0.017 0.017 0.019 0.015

Rent head age (kids) -0.001 0.001 -0.001 0.001

Rent head age (no kids) -0.001 0.000 -0.001 0.000

Rent log size (kids) -0.196 0.011 -0.174 0.010

Rent log size (no kids) -0.188 0.020 -0.183 0.019

Rent single parent 0.016 0.018 0.000 0.014

Rent y1*childless couple 0.000 0.007 -0.001 0.006

Rent y1*head age (kids) 0.000 0.000 0.000 0.000

Rent y1*head age (no kids) 0.001 0.000 0.001 0.000

Rent y1*log size (kids) 0.073 0.005 0.063 0.004

Rent y1*log size (no kids) 0.063 0.008 0.059 0.008

Rent y1*single parent 0.008 0.008 0.015 0.007

Clothing childless couple 0.005 0.006 0.004 0.007

Clothing head age (kids) 0.003 0.000 0.003 0.000

Clothing head age (no kids) 0.000 0.000 0.000 0.000

Clothing log size (kids) 0.002 0.005 0.000 0.005

Clothing log size (no kids) 0.061 0.007 0.061 0.009

Clothing single parent -0.018 0.008 -0.014 0.007

Clothing y1*childless couple -0.003 0.002 -0.003 0.003

Clothing y1*head age (kids) -0.002 0.000 -0.002 0.000

Clothing y1*head age (no kids) 0.000 0.000 0.000 0.000

Clothing y1*log size (kids) 0.001 0.002 0.001 0.002

Clothing y1*log size (no kids) -0.029 0.003 -0.029 0.004

Clothing y1*single parent 0.025 0.004 0.023 0.003

HH Operation childless couple -0.008 0.009 -0.013 0.008

HH Operation head age (kids) 0.000 0.000 0.000 0.000

HH Operation head age (no kids) 0.001 0.000 0.001 0.000

HH Operation log size (kids) 0.043 0.006 0.034 0.005

HH Operation log size (no kids) 0.038 0.011 0.034 0.010

HH Operation single parent -0.008 0.008 -0.008 0.008

HH Operation y1*childless couple 0.005 0.004 0.007 0.004

HH Operation y1*head age (kids) 0.000 0.000 0.000 0.000

HH Operation y1*head age (no kids) 0.000 0.000 0.000 0.000

HH Operation y1*log size (kids) -0.026 0.003 -0.021 0.002

HH Operation y1*log size (no kids) -0.026 0.005 -0.023 0.004

HH Operation y1*single parent 0.002 0.004 0.003 0.004

HH Furn/Eq childless couple -0.024 0.009 -0.030 0.008

HH Furn/Eq head age (kids) -0.001 0.001 -0.001 0.000

HH Furn/Eq head age (no kids) 0.000 0.000 0.000 0.000

HH Furn/Eq log size (kids) 0.018 0.006 0.015 0.005

HH Furn/Eq log size (no kids) 0.020 0.011 0.023 0.010

HH Furn/Eq single parent 0.032 0.008 0.028 0.008

HH Furn/Eq y1*childless couple 0.007 0.004 0.011 0.004

HH Furn/Eq y1*head age (kids) 0.001 0.000 0.000 0.000

HH Furn/Eq y1*head age (no kids) 0.000 0.000 0.000 0.000

HH Furn/Eq y1*log size (kids) -0.013 0.003 -0.011 0.002

HH Furn/Eq y1*log size (no kids) -0.011 0.005 -0.011 0.004

HH Furn/Eq y1*single parent -0.015 0.004 -0.013 0.004

Priv Trans childless couple 0.024 0.012 0.035 0.012

Priv Trans head age (kids) -0.001 0.001 -0.001 0.001

Priv Trans head age (no kids) -0.001 0.000 0.000 0.000

Priv Trans log size (kids) -0.074 0.007 -0.069 0.008

Priv Trans log size (no kids) -0.115 0.016 -0.118 0.014

Priv Trans single parent 0.001 0.009 0.008 0.011

Priv Trans y1*childless couple -0.008 0.005 -0.012 0.005

Priv Trans y1*head age (kids) 0.000 0.000 0.000 0.000

Priv Trans y1*head age (no kids) 0.000 0.000 0.000 0.000

Priv Trans y1*log size (kids) 0.029 0.003 0.026 0.003

Priv Trans y1*log size (no kids) 0.053 0.007 0.052 0.006

Priv Trans y1*single parent -0.016 0.005 -0.020 0.005

Pub Trans childless couple -0.006 0.006 -0.010 0.006

Pub Trans head age (kids) 0.000 0.000 0.000 0.000

Pub Trans head age (no kids) 0.000 0.000 -0.001 0.000

Pub Trans log size (kids) -0.002 0.004 -0.003 0.004

Pub Trans log size (no kids) 0.012 0.007 0.016 0.008

Pub Trans single parent -0.016 0.005 -0.017 0.006

Pub Trans y1*childless couple 0.001 0.003 0.003 0.003

Pub Trans y1*head age (kids) 0.000 0.000 0.000 0.000

Pub Trans y1*head age (no kids) 0.000 0.000 0.000 0.000

Pub Trans y1*log size (kids) -0.004 0.002 -0.003 0.002

Pub Trans y1*log size (no kids) -0.007 0.003 -0.008 0.003

Pub Trans y1*single parent 0.007 0.002 0.007 0.003

Table 3: Price E¤ects, GMM and SUR

GMM-exact SUR-

approximate

Commodity Price Estimate Std Err Estimate Std Err

Food at Home Food at Home -0.047 0.010 0.027 0.002

Food Out Food at Home 0.037 0.007 0.008 0.007

Food Out Food Out -0.045 0.006 -0.036 0.010

Rent Food at Home -0.072 0.004 -0.017 0.003

Rent Food Out 0.042 0.003 0.045 0.002

Rent Rent 0.097 0.005 0.109 0.005

Clothing Clothing 0.029 0.007 0.034 0.008

Clothing Food at Home 0.012 0.007 0.002 0.004

Clothing Food Out 0.009 0.005 -0.033 0.006

Clothing Rent 0.003 0.002 -0.029 0.003

HH Operation Clothing -0.045 0.006 0.014 0.005

HH Operation Food at Home 0.058 0.006 -0.042 0.006

HH Operation Food Out 0.005 0.005 -0.049 0.004

HH Operation HH Operation -0.033 0.007 0.007 0.005

HH Operation Rent -0.046 0.003 0.007 0.003

HH Furn/Eq Clothing 0.004 0.004 0.027 0.007

HH Furn/Eq Food at Home 0.022 0.005 0.015 0.005

HH Furn/Eq Food Out -0.049 0.004 0.052 0.006

HH Furn/Eq HH Furn/Eq 0.046 0.004 0.027 0.004

HH Furn/Eq HH Operation 0.042 0.005 -0.052 0.003

HH Furn/Eq Rent -0.033 0.002 -0.075 0.004

Priv Trans Clothing -0.027 0.002 -0.008 0.002

Priv Trans Food at Home 0.026 0.004 0.042 0.003

Priv Trans Food Out 0.004 0.003 0.052 0.004

Priv Trans HH Furn/Eq -0.004 0.002 -0.002 0.002

Priv Trans HH Operation 0.000 0.003 0.001 0.002

Priv Trans Priv Trans 0.023 0.004 0.042 0.005

Priv Trans Rent -0.024 0.003 -0.035 0.008

Pub Trans Clothing 0.025 0.002 -0.006 0.003

Pub Trans Food at Home -0.053 0.003 -0.022 0.002

Pub Trans Food Out -0.007 0.002 0.020 0.004

Pub Trans HH Furn/Eq 0.002 0.002 0.007 0.002

Pub Trans HH Operation -0.020 0.002 -0.005 0.002

Pub Trans Priv Trans 0.007 0.002 0.000 0.002

Pub Trans Pub Trans -0.002 0.002 -0.037 0.002

Pub Trans Rent 0.046 0.002 -0.045 0.003

top related

demand systems with unobserved heterogeneity and …pendakur/hixtrix13.pdf · 2005. 6. 9. · are...

Documents

we examine whether economic and military … examine whether...

(rÉhabilitation rapide aprÈs chirurgie) ·...

methodology for determining geothermal potential

evaluating the roadmap out of lockdown for england

casing design methodology for casing while drilling

knowledge integration to overcome ontological heterogeneity:...

evaluating leniency with missing information on undetected...

teil b: serviceteil978-3-658-22426-4/1.pdf · 634...

cost estimation methodology for netl assessments of...

methodology for extracting dynamic standard load...

papr reduction analysis using combined methodology of

evaluating suitability of glutaraldehyde tanning in

evaluation of fossil power plants with ccs: methodology

evaluating an open source ea management tool · fakultät...

an insight into the hadith methodology of jamal al-din ahmad...

playful mathematics methodology book innovative math...

pilotprojekt mobilitätsfördernde pflegeintervention...

finite element based acoustic analysis of dissipative...

characterization of -cell heterogeneity in the islets of

methodology and representation in the study of lexical...