Henriette Engelhardt, T omas K ogel and Alexia Prsk a w etz abs+pap/Paper-Engel… · omas K ogel...

34

Transcript of Henriette Engelhardt, T omas K ogel and Alexia Prsk a w etz abs+pap/Paper-Engel… · omas K ogel...

Page 1: Henriette Engelhardt, T omas K ogel and Alexia Prsk a w etz abs+pap/Paper-Engel… · omas K ogel and Alexia Prsk a w etz Max Planc k Institute for Demographic Researc h, Dob eraner

Fertility and female employment reconsidered:

A macro-level time series analysis�

Henriette Engelhardt, Tomas K�ogel and Alexia Prskawetz

Max Planck Institute for Demographic Research,

Doberaner Str. 114, D-18057 Rostock, Germany

May 13, 2001

Abstract

Our paper examines causality and parameter instability in the long-

run relation between fertility and female employment. This is done

by cross-national comparison of macro-level time series data. By ap-

plying error correction models { a combination of Granger-causality

tests with recent econometric time series techniques { we �nd causal-

ity in both directions. This �nding is consistent with simultaneous

movements of both variables brought about by common exogenous

factors such as social norms, social institutions and �nancial incen-

tives. We �nd a signi�cant negative correlation until the 1970s, re-

spectively 1980s (depending on the country under investigation) and

an insigni�cant or weaker correlation afterwards. This result is consis-

tent with a recent hypothesis in the demographic literature according

to which changes in the institutional context like childcare availabil-

ity and attitudes towards working mothers might have reduced the

incompatibility between childrearing and female employment.

�For helpful comments and suggestions we would like to thank Juha Alho, Horst Entorf,

Ulf Christian Ewert, Tomas Frejka, Jan Hoem, Johannes Huinink, Stefan Klasen, Michaela

Kreyenfeld, Thomas Lindh, Warren Sanderson and participants at workshops in Heidelberg

and Rostock. We are especially grateful to Tom DiPrete who essentially contributed to

improve the exposition of the paper. The views expressed in this paper are the authors' own

views and do not necessarily represent those of the Max Planck Institute for Demographic

Research.

1

Page 2: Henriette Engelhardt, T omas K ogel and Alexia Prsk a w etz abs+pap/Paper-Engel… · omas K ogel and Alexia Prsk a w etz Max Planc k Institute for Demographic Researc h, Dob eraner

1 Introduction

The relationship between fertility and female labor force participation is a

long-standing question in demography. A negative association between these

two variables is mainly argued for as evidence for the incompatibility of

raising children and staying in the workforce in today's society where the

place of work and home are mostly spatially separated. Therefore, decreasing

fertility is associated with increasing female employment and rising female

employment is associated with falling fertility. Up to now, it is unclear

whether these mutual relations are causal in one or the other direction.

The question \What causes what?" received renewed attention in the

demographic literature during the last years. This renewed interest resulted

from recent studies which have shown that a simple cross-country correlation

coe�cient between the total fertility rate and the female labor force partici-

pation rate turned from a negative value before the 1980s to a positive value

thereafter. The question then arises whether there is a causal relationship at

all.

Several studies go beyond measures of correlation and explicitly attempt

to test for the existence and direction of causality between fertility and female

employment. Due to substantive and methodological shortcomings these

studies have found con icting results. Our paper aims to clarify the liter-

ature in three speci�c ways. First, we apply methods that are designed to

avoid the problem (referred to as \spurious regression" in the time series

literature) that frequently plagues the analysis of variables with trends and

that arguably a�icts existing e�orts to estimate the causal relation between

female labor force participation and fertility. Second, we estimate what are

called \error correction models" which are the appropriate models to test

for causality between stochastic trending time series. Third, we explicitly

test for \parameter instability," i.e., the possibility that the causal relation

between total fertility rate and female labor force participation has changed

over time.

Our results are twofold: (i) We �nd that there was a long-run causal re-

lation linking the total fertility rate and female employment until the end of

the 1970s or the begin of the 1980s (depending on the country of investiga-

tion). The results suggest that the causation was in both directions, but in

any case, we statistically rule out the possibility that the well-known corre-

lation between these variables was spurious. We argue that the bidirectional

causality is consistent with the view that comovements of fertility and fe-

2

Page 3: Henriette Engelhardt, T omas K ogel and Alexia Prsk a w etz abs+pap/Paper-Engel… · omas K ogel and Alexia Prsk a w etz Max Planc k Institute for Demographic Researc h, Dob eraner

male employment are caused by common exogenous factors. These factors

can be social norms and social institutions, but also opportunity costs of

childrearing and family income. (ii) Our empirical results show a signi�cant

negative correlation between fertility and female employment from the 1960s

to about the beginning of the 1970s for all countries of our study. How-

ever, for the non-Mediterranean countries in our sample the causal relation

becomes weaker (though still signi�cant negative) at about the end of the

1970s. Finally, we �nd considerable country-level heterogeneity in the rela-

tionship between fertility and female employment in the 1980s and 1990s. In

some countries, the relation becomes weaker to the point of insigni�cance,

while in others the relation remains signi�cantly negative.

Our results are supportive for a recent hypothesis in the demographic

literature according to which societal level responses have eased the incom-

patibility between childrearing and female employment in most developed

countries (Brewster and Rindfuss 2000; Rindfuss, Benjamin and Morgan

2000; Rindfuss and Brewster 1996). For the Mediterranean country in our

sample { Italy { we �nd the opposite result, that is, the negative correla-

tion between fertility and female employment becomes even stronger across

time. This �nding coincides with the argument in Rindfuss et al. (2000)

and Brewster and Rindfuss (2000) that we are only likely to see increasing

female employment no longer to depress fertility in those countries that suc-

ceeded in minimizing the incompatibility between childrearing and female

work. In Mediterranean countries this incompatibility between female em-

ployment and childrearing still persists.

The set up of our paper is as follows. In section 2 we �rstly discuss

the possible relationships between fertility and female employment from a

micro theoretical point of view. Secondly, we discuss the gap in the existing

macro-level time series literature that we aim to close with our paper. In

section 3 we describe the data and in section 4 we explain the econometric

methodology that is applied in the paper. We present the results in section

5 and conclude in section 6.

3

Page 4: Henriette Engelhardt, T omas K ogel and Alexia Prsk a w etz abs+pap/Paper-Engel… · omas K ogel and Alexia Prsk a w etz Max Planc k Institute for Demographic Researc h, Dob eraner

2 Theoretical and methodological considera-

tions

2.1 Micro explanations

At the individual level, numerous studies have shown a negative associa-

tion between fertility and female labor force participation (e.g., Lehrer and

Nerlove 1986; Brewster and Rindfuss 2000). On average, women in gainful

employment tend to have fewer children, and women with children spend

less time in the labor market. Weller (1977: 43) lists four possible expla-

nations for this negative association between fertility and female labor force

participation:

1. women's fertility a�ect their labor force participation;

2. women's labor force participation a�ect their fertility;

3. both women's fertility and their labor force participation a�ect each

other; and

4. the observed negative relationship is spurious and is caused by common

antecendents of both variables.

According to the mentioned role incompatibility hypothesis both women's

fertility and their labor force participation a�ect each other reciprocally be-

cause of the strain between mother and worker roles. Nothing in this hy-

pothesis suggests causality in one direction rather than the other (Lehrer

and Nerlove 1986).

From the point of view of economic theory fertility and female employ-

ment are simultaneously determined by the same basic economic variables.

More speci�cally, female labor market participation and fertility are both

choice variables which households choose simultaneously given their exoge-

nous constraints. If both variables uctuate to some extent synchronously,

then { according to the logic of economic theory { this must be caused entirely

by external variables which determine both variables exogenously. Examples

of such external variables are the real female wage, the unemployment rate

and { recently accepted in some work by economists { social norms.

Many researchers would not go as far as economic theory and would argue

that at least part of the correlation between fertility and female employment

4

Page 5: Henriette Engelhardt, T omas K ogel and Alexia Prsk a w etz abs+pap/Paper-Engel… · omas K ogel and Alexia Prsk a w etz Max Planc k Institute for Demographic Researc h, Dob eraner

is not determined by external variables. Some of these researchers view

fertility and female employment more as the result of a sequential decision

process rather than the result of a simultaneous decision problem. If these

variables are indeed the result of a sequential decision process, then it is quite

possible that one variable exogenously causes the other variable.

2.2 Macro studies

Given the discussed explanations of the fertility/employment nexus on the

micro level, it is no wonder that previous empirical research has mainly con-

centrated on micro-level data (for an extensive review of the micro literature

see Cramer 1980; Lehrer and Nerlove 1986; Spitze 1988). Nevertheless, with

existing cross-sectional as well as longitudinal survey data it is very di�-

cult to resolve the fertility/employment question since the empirical �ndings

depend on the underlying decision model (sequential or simultaneous), and

therefore on the applied statistical model. Furthermore, work intentions may

cause actual fertility behavior and fertility intentions may cause actual work

behavior, that is, future events may cause present behavior (Bernhardt 1993;

N�i Bhrolch�ain 1993). Macro-level studies { and especially cross-national com-

parisons { are an alternative way to resolve the fertility/employment puzzle

because they do not require the detailed individual-level data (see Rindfuss

and Brewster 1996: 262, who also stress the value of a cross-national assess-

ment). However, relationships at the individual and the aggregate level may

be di�erent (e.g., N�i Bhrolch�ain 1993).

Existing macro studies can be divided into studies which analyze macro-

level data on a cross-country basis and studies which apply methods of time-

series analysis. Various authors (Ahn and Mira 1999; Brewster and Rindfuss

2000; Esping-Andersen 1999; Rindfuss et al. 2000) �nd that in OECD coun-

tries the cross-country correlation between the total fertility rate (TFR) and

the female labor market participation rate (FLP ) turned from a negative

value before the 1980s to a positive value thereafter. The countries that now

have the lowest levels of fertility are those with relatively low levels of female

labor force participation and the countries with higher fertility levels tend

to have relatively high female labor force participation rates. Following the

graphical presentation in the literature (e.g., Rindfuss et al. 2000), Figure 1

illustrates this change for 12 selected OECD countries.1

1Countries included are Belgium, Canada, Finland, France, Italy, Japan, Norway, Swe-

5

Page 6: Henriette Engelhardt, T omas K ogel and Alexia Prsk a w etz abs+pap/Paper-Engel… · omas K ogel and Alexia Prsk a w etz Max Planc k Institute for Demographic Researc h, Dob eraner

Figure 1: Correlation between the total fertility rate and female labor force

participation rate, 1960-1994

1960 1965 1970 1975 1980 1985 1990-1

-0.5

0

0.5

1

year

corr

elat

ion

coef

fici

ent

Several recent papers have suggested a weakening link between fertility

and female employment due to greater availability of market child care, family

policies (such as state mandated maternity leave) and changing attitudes

towards working mothers (Brewster and Rindfuss 2000; Rindfuss et al. 2000;

Rindfuss and Brewster 1996). For that reason, they argue that changes in the

institutional context at the macro-level must have enabled women in some

countries to better combine work and childrearing.

The cross-section studies, however, do not explicitly address the causality

question. This is done in studies which apply formal Granger causality tests

to aggregate time-series in di�erent countries (Cheng 1996; Klijzing et al.

1988; Michael 1985; Zimmermann 1985). The basic logic underlying these

tests is quite simple: \We say, that [some time series] Yt is causing Xt if

we are better able to predict Xt using all available information than if the

information apart from Yt had been used" (Granger 1969: 428). Causality

between two time-series can be either unidirectional (X causes Y or the

reverse), bidirectional (X causes Y and the reverse { this is termed feedback)

den, Switzerland, United Kingdom, United States and West-Germany. The data sources

can be found in Appendix A.

6

Page 7: Henriette Engelhardt, T omas K ogel and Alexia Prsk a w etz abs+pap/Paper-Engel… · omas K ogel and Alexia Prsk a w etz Max Planc k Institute for Demographic Researc h, Dob eraner

or non existent in which case the two time series move independently or

contemporaneously.

Table 1: Summary of macro-level time-series studies

author method data results

Zimmermann (1985) modi�ed Granger / German time series TFR ! FLP

�rst di�erences 1960-79

Michael (1985) standard Granger / US time series TFR FLP

levels 1948-80 AFR! FLP

Klijzing et al. (1988) indirect Granger / Dutch survey data FER! LFP

�rst di�erences 1977-84 (FER LFP )

Cheng (1996) modi�ed Granger / US time series TFR! FLP

�rst di�erences 1948-93

Notes: TFR { total fertility rate; FLP { female labor force participation rate;

AFR { age-speci�c fertility rate; FER { lagged (by 10 months) number of

children born (per month); LFP { percentage of women participating in the labor

market (per month).

Table 1 provides a summary of empirical results of macro studies with

time series data. Analyzing German time-series from 1960-79 Zimmermann

(1985) concludes from a modi�ed Granger-causality test applied to �rst dif-

ferences of all variables that increasing female employment does not cause

decreasing fertility; rather the reduction in birth rate causes the increase in

female labor force participation. Applying standard Granger-causality tests

to the levels of US time-series data from 1948-80 Michael (1985) �nds that

female labor force participation positively causes fertility, but not the other

way around. However, this result seems to be sensitive to the de�nition of

fertility. With age-speci�c fertility rates Michael �nds that fertility nega-

tively a�ects female labor force participation, but not the other way around.

Klijzing et al. (1988) use monthly data from a Dutch survey for a seven-year

period (1977-84). They calculate the average value for each month in the

data of this survey and apply Sims' indirect Granger-causality test to the

�rst di�erences of these averages. They �nd that labor force participation

has no in uence on subsequent fertility decision-making and that fertility de-

cisions do have an impact on female labor force participation. However, with

standard Granger tests they �nd causality in both directions. Cheng (1996)

applies a modi�ed version of the Granger-causality method to �rst di�erences

of aggregate US data for 1948-93. He �nds unidirectional negative causality

7

Page 8: Henriette Engelhardt, T omas K ogel and Alexia Prsk a w etz abs+pap/Paper-Engel… · omas K ogel and Alexia Prsk a w etz Max Planc k Institute for Demographic Researc h, Dob eraner

running from fertility to female employment.

Obviously, the time series literature has not yet found an agreement nei-

ther on the presence nor on the direction of causality between fertility and

female employment. In our view, this might be due to two issues that are

not yet addressed in the literature. First, the literature has not yet taken

into consideration several important recent developments in the econometric

time series literature (see section 4). In particular, Michael (1985) does not

consider non-stationarity of the time series, that is whether the mean and/or

the variance of the time series are changing over time. This is problematic

because, if the time series are non-stationary, then there is the possibility

of \spurious" causality results. Cheng (1996), Klijzing et al. (1988) and

Zimmermann (1985) account for non-stationarity. However, by only apply-

ing Granger-causality tests to �rst di�erences, they refrain from the use of

valuable information about a possible long-run relationship between the vari-

ables.2 As a consequence, the results in these studies might be wrong.

Our paper contains two advances over these earlier papers that attempted

to determine causality. First, we use more recent data, which is important

because the relationship between TFR and FLP may have shifted in re-

cent years. Second, we employ more sophisticated econometric methods to

overcome de�ciencies in earlier e�orts. Besides the methodological issues

as related to the stationarity assumption of the time series we allow for a

further methodological correction. We consider the possibility of parameter

instability in the long-run relation between fertility and female employment

(as suggested by inspection of Figure 1) and structural breaks in the trend

of the variables. Clearly, it would be desirable to include in the regressions

socio-demographic variables that caused this change in behavior. However,

in our view this would be too complex { if not impossible { and, for that

reason, we approximate this change in behavior with the inclusion of dummy

variables. The use of such a \minimal" approach is a standard procedure in

applied econometrics.

2Though Cheng (1996) �nds absence of cointegration, that is, absence of a long-run

relation between fertility and female employment, we have shown that this results is not

robust. While he uses the correct Granger-causality method given this result, he uses a

unit root test on the OLS residuals of the cointegrating relation. We applied an error

correction model to the same data and found cointegration. We believe this to be due to

the well-known fact that error correction models are considered superior compared to the

residual based approach which tends to reject cointegration too often (see Kremers et al.

1992).

8

Page 9: Henriette Engelhardt, T omas K ogel and Alexia Prsk a w etz abs+pap/Paper-Engel… · omas K ogel and Alexia Prsk a w etz Max Planc k Institute for Demographic Researc h, Dob eraner

3 Data

We assembled annual time series of the total fertility rate (TFR) and female

labor force participation rate (FLP ).3 We assembled annual data from 1960-

94 for �ve European countries (France (FRA), West-Germany (FRG), Italy

(ITA), Sweden (SWE), and the United Kingdom (UK)). Moreover, we were

able to assemble annual data from 1948-1995 for the USA. Longer time series

are preferable because they increase the power of cointegration tests (Otero

and Smith 2000). For the purpose of comparison of the results of the USA

with the results of the European countries we also show the results for a

truncated time series for the USA that extends over 1960-94. For FLP we

choose the female labor force divided by the female population both of age

15 to 64. Hence, women of age 65 and older are not included in our measure

of FLP .

There are two possible measurement problems with our data. First, to

avoid problems arising from age structure changes we used as a measure of

fertility the total fertility rate (TFR) which constitutes an age standard-

ized measure of fertility (that is, the TFR is una�ected from age structure

changes). However, the FLP is not age standardized. Second, the FLP for

women 15 to 65 contains women of age 50 to 65 which cannot give births.

To check whether these two possible problems a�ected our empirical results,

we applied the same regressions also to the fertility rate of women of age

20-24 (respectively 25-34) and the female labor force participation rate of

women of age 20-24 (respectively 25-34) with annual US time series from

1948-1995.4 With age speci�c data one can avoid the two possible problems.

As a by-product it is possible to check whether age structure changes a�ect

the empirical results.

In Figure 2 and Figure 3 we plot the time series of TFR and FLP for

each country for 1960-94. As is well-known, the time series of TFR show

for most developed countries a kink in the 1960s. Some researchers argue

3See Appendix A for de�nition and source of data. In a few time series the value for a

single year was missing. In those cases we calculated the missing value as the average of

the year that preceded the missing year and the year that followed the missing year.4We neglected fertility of women of age below 20 and above 34 since the magnitude of

their fertility is relatively small. Unfortunately, data on fertility of women of age 25-29

and 29-34 were only available since 1976. Therefore, we restricted attention to the sum of

these two age groups. In addition, this data limitation made it impossible to construct an

age standardized measure of the female labor force participation rate (which would be an

alternative to avoid the �rst mentioned possible problem).

9

Page 10: Henriette Engelhardt, T omas K ogel and Alexia Prsk a w etz abs+pap/Paper-Engel… · omas K ogel and Alexia Prsk a w etz Max Planc k Institute for Demographic Researc h, Dob eraner

Figure 2: Time-series of the total fertility rate for six countries, 1960-94

1960 1965 1970 1975 1980 1985 19900

0.5

1

1.5

2

2.5

3

3.5

4

year

FRAFRGITASWEUKUSA

Figure 3: Time-series of female labor force participation rate for six countries,

1960-94

1960 1965 1970 1975 1980 1985 19900

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

year

FRAFRGITASWEUKUSA

10

Page 11: Henriette Engelhardt, T omas K ogel and Alexia Prsk a w etz abs+pap/Paper-Engel… · omas K ogel and Alexia Prsk a w etz Max Planc k Institute for Demographic Researc h, Dob eraner

that this kink represents the di�usion of the use of the contraceptive pill

(e.g. Goldin and Katz 2000). Other researchers argue the kink is the result

of changing social norms. One can see that Italy is an exceptional case

since its fertility decline was very slow in comparison to most countries in

the developed world. The Swedish TFR shows a small hill around 1990.

The demographic literature o�ers some explanations for this hill which are,

however, outside of the scope of our paper (see, e.g., Andersson 1999, and

Hoem 1990, who explain the increase of the Swedish total fertility rate at the

end of the 1980s with newly enacted leave and wage compensation policies).

As is also well-known, the time series of FLP show a clear upwards trend

in most developed countries. Again Italy is an exception. There the rise

of female employment is rather modest. However, high education levels of

younger Italian women (not shown) seem to indicate a future change in the

FLP even in Italy. The Swedish FLP is distinguished by its relatively higher

level over the entire time period from 1960 to 1994.

We have opted to include Italy and Sweden in our set of countries since

each of them represents an extreme position in the spectrum of family poli-

cies and norms that may in uence the fertility/employment relation. The

exceptional behavior of the Italian time series is often explained with tradi-

tional norms and the view of the family as a private domain in which the

government does not intervene with many state services. On the other ex-

treme of a spectrum of family policy and norms is clearly Sweden. The policy

in the `nation of individuals' (Chesnais 1996; Hantrais 1997) tend to be both

supportive of women's desires and concerned with children's care. France

and the United Kingdom provide a somewhat weaker illustration of `nations

of individuals'. West-Germany as a `nation of families' shares a strong com-

mitment toward families, backed by monetary allowances for housing, child

bene�t packages, and well-paid maternal leave. To summarise: The time

series in Figure 2 and 3 show that our set of six countries is representative

for most developed countries.

4 Econometric methodology

Before introducing the speci�c method we apply, we shortly review some

of the recent econometric time series techniques that need to be taken into

account when testing for causality.

11

Page 12: Henriette Engelhardt, T omas K ogel and Alexia Prsk a w etz abs+pap/Paper-Engel… · omas K ogel and Alexia Prsk a w etz Max Planc k Institute for Demographic Researc h, Dob eraner

Cointegration

When estimating with time series data the relation between two trending

variables one often gets spurious regression results, that is, a seemingly sig-

ni�cant e�ect even though the variables are actually unrelated. An illus-

trative example is the fertility rate and the number of storks in a region.

Both variables have mostly a downward trend. Due to their trends, when re-

gressing the number of storks on the fertility rate with OLS one might �nd a

signi�cant positive e�ect even though it would be daring to argue that such a

causal relation exists in reality. Often detrending (that is, including a smooth

function of calendar time as a further element on the right hand side of the

regression equation) helps to eliminate spurious regression results. But as a

recent econometric literature (started by Granger and Newbold 1974) shows,

detrending does not help in case the variables are di�erence-stationary, also

labeled I(1). A series is di�erence-stationary if its mean and its variance

are constant over time after �rst di�erencing, but not in levels. We found

that FLP and TFR series appear to be di�erence stationary. Consequently,

analyses that attempt to estimate causation and that control for trends by

including a function of calendar time on the right side of the equation will not

generally give unbiased standard errors for inference of the causal relation.

If two time series are trending, the important question to determine is

whether the relation between these trends is \tight" enough so as to warrant

the conclusion that either there is a direct causal relationship between them,

or that there is some external force that is jointly determining both variables.

Cointegration tests can be applied to detect whether a relation between two

I(1) variables is `true' or spurious. These tests aim to detect synchronous

movements in deviations from the trend of both variables. More technically,

cointegration tests aim to detect whether there exists a linear combination

of two I(1) variables which is stationary (possibly after detrending). In case

such a stationary linear combination exists, the variables are said to be coin-

tegrated and the possibility of a spurious correlation due to trends can be

excluded.

Granger-causality

Granger (1969) introduced tests to detect causality between time series (hence-

forth labeled as Granger-causality test). The Granger causality test is typ-

ically based on the estimation of a vector autoregressive (VAR) model with

variables in levels or in �rst di�erences:

12

Page 13: Henriette Engelhardt, T omas K ogel and Alexia Prsk a w etz abs+pap/Paper-Engel… · omas K ogel and Alexia Prsk a w etz Max Planc k Institute for Demographic Researc h, Dob eraner

�Yt = �0 +

mYXi=1

�Y;i�Yt�i +

nYXj=1

�Y;j�Xt�j + uX;t; (1)

�Xt = �0 +

mXXk=1

�X;k�Yt�k +

nXXl=1

�X;l�Xt�l + uY;t; (2)

where �Zt = Zt � Zt�1; 8Z = Y;X; with all variables being in logarithms.

The �'s and �'s are constants, the m's and n's are the optimal numbers of

lags of the series Y and X and the ut's are serially uncorrelated random

disturbances with zero mean.5 For given values of the lag lengths (i.e, the

m's and n's) it can, for example, be tested, whether X Granger-causes Y

by testing the hypothesis H0: �Y;1 = �Y;2 = ::: = �Y;nY = 0 against the

alternative Ha: not H0. The joint signi�cance of the coe�cients can be

tested by a Wald test with the F-statistic.

Engle and Granger (1987) have shown that the Granger-causality test

can be seriously wrong if the time series are I(1) and cointegrated. For

that reason, the literare suggests to test �rst (with so-called unit root tests)

whether the variables are I(1). If the variables turn out to be I(1), the recent

literature suggest then to use a single error correction model (ECM) to test

for cointegration and causality upon application of , e.g., a method of Boswijk

(1994).

Testing for cointegration in a single error correction model

When trends are cointegrated, it follows that there is a long-run relation

between them. This long-run relation can be expressed in terms of what is

called an error correction model. An error correcting model explains changes

in one series in terms of three components: (1) random shocks, (2) recent

changes in the two series, and (3) an \error-correction" term that pushes the

series back into its long-run relation with the other series. The signi�cance

of the parameter for the error-correction term amounts to a statistical test

for the existence of a long-run causal relation between the two series. We

implement this error correction model below and we further allow for this

long-run relation to change historically, in order to test the hypothesis that

the employment-fertility con ict has changed in recent years at least for some

5Henceforth the ut's denote any disturbance term with zero mean.

13

Page 14: Henriette Engelhardt, T omas K ogel and Alexia Prsk a w etz abs+pap/Paper-Engel… · omas K ogel and Alexia Prsk a w etz Max Planc k Institute for Demographic Researc h, Dob eraner

countries in our sample. An ECM can be tested with OLS and is written in

the following form:6

�Yt = �0 + �Y Yt�1 + YXt�1 + 'Y t+

mYXi=1

�Y;i�Yt�i +

nYXj=1

�Y;j�Xt�j + uY;t;

(3)

where the �Y 's, �Y 's, �0, �Y ; Y and 'Y are constants, t is a trend term and

mY and nY are again the optimal number of lags of the �rst di�erences (we

choose mY and nY so that the Schwarz criterion was minimized, but limited

mY and nY to a maximum of 2). The purpose of the lags is to correct for

autocorrelation in the random disturbance term. Henceforth we include the

trend term in the ECM only if its coe�cient is signi�cant according to the

t-statistic (the t-statistic is reliable if Y and X are cointegrated).

Boswijk shows that one can test in (3) for cointegration between Y and

X with a joint Wald test of the hypothesis H0: �Y = Y = 0: In this

Wald test one has to compare the �2-statistic that results from a test of H0

with the critical values tabulated in Boswijk in Table B.1-B.5 (the tables are

for various cases, that is, with and without trend term in (3) and so forth).

Cointegration is accepted if the �2-statistic is larger than the relevant critical

value.

If there is absence of cointegration, then it is appropriate to test for

causality upon application of the standard Granger-causality test in �rst

di�erences, that is, to test for joint signi�cance of the lags in (1)-(2).

Testing for causality in a single error correction model

In case of cointegration, absence of weak exogeneity of X for Y implies long-

run causality of X for Y (Hall and Milne 1994). A test for weak exogeneity

in an ECM can be applied in two stages (Boswijk 1994). In the �rst stage

one may estimate the following equation with OLS:

Yt = Y + �YXt + �Y t + uY;t; (4)

Xt = X + �XYt + �Xt+ uX;t; (5)

6An analogous equation to (3) exist for �Xt as the dependent variable.

14

Page 15: Henriette Engelhardt, T omas K ogel and Alexia Prsk a w etz abs+pap/Paper-Engel… · omas K ogel and Alexia Prsk a w etz Max Planc k Institute for Demographic Researc h, Dob eraner

where the 's, �'s and �'s are constants. The relation in (4) and (5) is

labeled as long-run relation between Y and X. Note that this relationship

involves three elements: a slope coe�cient, a time trend, and a random error.

This relation is \long-term" in the sense that Yt � �YXt is stationary once

a smooth function of time (in our case, a linear time trend) is controlled

for. The crucial point here is that a speci�c linear combination of these

variables obeys an equilibrium relation in the long run. The residuals of

the estimates of (4) (that is, �̂Y;t = Yt � ̂Y � �̂YXt � �̂Y t) and (5) (that

is, �̂X;t = Xt � ̂X � �̂XYt � �̂Xt), where a hat on top of a variable denotes

the estimate of the corresponding parameter) are saved and labeled as error

correction terms. In the second stage the following equations are tested with

OLS:

�Yt = �0 + �Y �̂Y;t�1 +

mYXi=1

�Y;i�Yt�i +

nYXj=1

�Y;j�Xt�j + uY;t; (6)

�Xt = �0 + �X �̂X;t�1 +

mXXk=1

�X;k�Yt�k +

nXXl=1

�X;l�Xt�l + uX;t; (7)

where the �'s and �'s are constants, the �̂t�1's denote the lagged error cor-

rection term, the m's and n's are again the optimal numbers of lags of the

�rst di�erences chosen according to the Schwarz criterion. The variable X

(resp. Y ) is weakly exogenous for Y (resp. X), if �Y (resp. �X) is insignif-

icant according to the t-statistic. Otherwise long-run causality of X for Y

cannot be rejected. Intuitively, long-run causality implies that a deviation

in the long-run relation between X and Y , that is, �̂Y;t 6= 0 for some t, will

impact on the value of �Yt (and analogously for �̂X;t 6= 0).

If �Y as well as �X are signi�cant { which means that X long-run causes

Y as well as Y long-run causes X {, then one can interpret this as evidence for

that there exists a Z vector containing exogenous variables which is common

to Y and X and which \long-run causes" the cointegration relation of Y and

X.

Contrary to long-run causality, short-run causality ofX for Y prevails if in

(6) (resp. in (7)) the lags of �X (resp. of �Y ) are jointly signi�cant.7 How-

7X is termed strongly exogenous for Y , if there is absence of short-run causality and

long-run causality of X for Y and, in addition, uY;t in (6) and uX;t in (7) are uncorrelated

with each other (and analogously for Y in relation to X).

15

Page 16: Henriette Engelhardt, T omas K ogel and Alexia Prsk a w etz abs+pap/Paper-Engel… · omas K ogel and Alexia Prsk a w etz Max Planc k Institute for Demographic Researc h, Dob eraner

ever, in the subsequent analysis we only test for long-run causality (hence-

forth just labeled \causality"). This also agrees with Granger (1988) who

questions the usefulness of the concept of short-run causality for di�erence

stationary time series.8

In addition to merely testing for causality between two time series one

may also test for parameter instability in the long run relation between TFR

and FLP .

Testing for parameter instability in the long-run relation between

two variables within a single error correction model

Parameter instability occurs when the long-run relation between X and Y

changes. We allow two forms of parameter instability in our model. First, the

parametric relationship between X and Y (as represented by the coe�cient

for X) can change. Second, the linear time trend can change. To test for

parameter instability one may estimate the following equation with non-

linear least squares (NLS) (the estimation procedure { but not the test for

parameter instability { follows Boswijk 1994):

�Yt = �(Yt�1 � Y � �Y;1Xt�1 � �Y;2 DUMtB Xt�1 � �Y t) (8)

+

mYXi=1

�Y;i�Yt�i +

nYXj=1

�Y;j�Xt�j + uY;t;

with DUMtB =

�1 if t > tB0 otherwise

;

where �; Y ; �Y;1; �Y;2; �Y are constants. tB denotes the possible date of a

structural break in the slope parameter of X. The hypothesis of a break in

this slope parameter is accepted if �Y is signi�cant according to the t-statistic.

Inspired by a literature that suggests methods to test whether a time series is

di�erence-stationary (see Maddala and Kim 1998), we suggest to choose the

date of a possible break in the slope endogenously. More speci�c, we suggest

to estimate (8) for various values of tB and compare the corresponding t-

statistics for testing �Y = 0. The value of tB for which the absolute value of

the t-statistic is maximized may be chosen as the date of a possible break in

the slope. In case a break in the slope is accepted, we suggest to test the sign

8In case of stationary time series one can apply system (1)-(2) to test for causality

without the need to distinguish between short-run and long-run causality.

16

Page 17: Henriette Engelhardt, T omas K ogel and Alexia Prsk a w etz abs+pap/Paper-Engel… · omas K ogel and Alexia Prsk a w etz Max Planc k Institute for Demographic Researc h, Dob eraner

and the signi�cance of the correlation between X and Y before the break

and after it by testing the following equation with NLS:9

�Yt = �[Yt�1 � Y � �Y;1(DUM0 �DUMtB )Xt�1 (9)

��Y;2(DUMtB �DUMT )Xt�1 � �Y t]

+

mYXi=1

�Y;i�Yt�i +

nYXj=1

�Y;j�Xt�j + uY;t;

where@Yt�1

@Xt�1

j long�run =

��Y;1 for t < tB�Y;2 otherwise

;

and DUM0 = 1; 8t�[0; T ] and DUMT = 0; 8t�[0; T ], with the subscript 0

and respectively T as the dates at which the sample starts and respectively

ends. The (long-run) correlation between X and Y is signi�cant before the

break (resp. after the break) if �Y;1 = 0 (resp. �Y;2 = 0) is rejected according

to the t-statistic.

We have, for didactical reasons, deferred the exposition on how to test

for parameter instability to the end of this section. However, in practice

one needs to test for parameter instability in the �rst place, in order not to

distort the results of the cointegration and causality tests.

5 Empirical application

Prior to any test of causality we applied the Augmented Dickey-Fuller unit

root test to the time series of TFR and FLP . These tests showed that each

time series is di�erence-stationary. Therefore, we rule out the use of time

series models that attempt to estimate causal relation by simply regressing

one series on the other series, controlling for a time trend. Instead, an error-

correction model is called for. Therefore, we estimated equation (8) for (i)

�TFRt and alternatively (ii) �FLPt as the dependent variable. We found

a signi�cant trend in the long-run relation between TFR and FLP for each

country when we allow for breaks in the trend. Therefore, in the �rst step we

9Equation (9) is nothing else than equation (8) with the dummies rearranged. This

rearrangement does not change the content of the estimation results, but { in our view {

simpli�es the interpretation of the results. To give an example, in (9) the coe�cient in

front of the term (DUM0�DUMtB)Xt�1 represents the coe�cient of the slope before the

break (that is, from the date of the start of the sample to the date of the break).

17

Page 18: Henriette Engelhardt, T omas K ogel and Alexia Prsk a w etz abs+pap/Paper-Engel… · omas K ogel and Alexia Prsk a w etz Max Planc k Institute for Demographic Researc h, Dob eraner

determined the dates and the number of breaks in the trend for each equation.

To determine the exact date of a break in the trend, we followed the procedure

for determining endogenously the break in the slope as was explained in the

last section. That is, we compared the t-statistics of dummies for various

possible dates of the break and chose the date with the highest absolute value

of the t-statistic. Following Kim (1997), we chose the number of breaks in

the trend that minimized the Schwarz criterion. According to our estimates,

there were breaks in the trends in the long-run relation, with the timing of

the breaks varying somewhat by country. In the next step we tested for

instability in the long-run relation between TFR and FLP and tested for

an endogenous break in the slope parameter as explained in the previous

section.10

Our estimates of the stability of the slope coe�cient that links TFR and

FLP into a cointegrated trend has changed once for the countries under

investigation. The statistically optimal estimate for this break varies some-

what depending upon whether we model the long-run relation as an e�ect of

TFR on FLP or as an e�ect of FLP on TFR. We report the estimates of

these break points in Table 2. In our view, the important issue here is not the

exact timing of the break (we do not actually believe that the break occurred

at a single distinct point in history). Rather, the important point is that, at

least for some of the countries in our analysis, the long-run relation between

TFR and FLP has changed in recent history. Table 2 summarizes the dates

of breaks in the slope and suggests that for each country the correlation

between TFR and FLP has changed across time.

Next we tested for cointegration between TFR and FLP by estimating

equation (3) for (i) �TFRt and alternatively (ii) �FLPt as the dependent

variable (we include the breaks as determined in the step before). Table 3

summarizes the cointegration test results.11

10Once we allowed for a break in the correlation between TFR and FLP , then in a few

cases some breaks in the trend became insigni�cant according to the t-statistic and the

Schwarz criterion suggested too many breaks in the trend. In these few cases we abolished

the insigni�cant breaks in the trend.11As suggested by the results in Table 2 the coe�cient Y in equation (3) had in fact

to be split up into two coe�cients Y;1 and Y;2 with the former (latter) one referring to

periods before (after) the structural break. Therefore, we tested in (3) for cointegration

upon application of a Wald test of the joint hypothesis: H0: �Y = Y;1 = YY 2 = 0.

Alternatively, in case of insigni�cance of the slope after the break, one could estimate H 0

0:

�Y = Y;1 = 0: However, the results of testing H 0

0are the same as the results of testing

H0 and are therefore not shown in the paper.

18

Page 19: Henriette Engelhardt, T omas K ogel and Alexia Prsk a w etz abs+pap/Paper-Engel… · omas K ogel and Alexia Prsk a w etz Max Planc k Institute for Demographic Researc h, Dob eraner

Table 2: Endogenous dates of break in slope in time series regressions for six

countries, 1960-94, and additionally for the US, 1948-95 and US age speci�c

time series.

Dependent Variable

Country �TFRt �FLPtFRA 1976 1982

FRG 1985 1991

ITA 1981 1980

SWE 1970 1973

UK 1977 1986

USA 1970 1984

USA (1948-95) 1970 1977

USA20-24 (1948-95) 1971 1981

USA25-34 (1948-95) 1970 1967

Note: USA20-24 (respectively USA25-34) indicates that we apply age speci�c time series

of fertility and female labor force participation for women of age 20-24 (respectively 25-34).

Table 3 shows cointegration between TFR and FLP (mostly at the 1%

signi�cance level) for all countries with the only exception being the case of

Sweden when TFR is the dependent variable. Hence, there is strong evidence

for absence of spurious regression results for the association between TFR

and FLP in aggregate time series data.

In order to test for the direction of causality between TFR and FLP

we used the two stage procedure as explained in the previous section. For

illustration, assume there is one break in the slope and in the trend of each

time series. In this case we estimated in a �rst stage the following equations

with OLS:12

TFRt = TFR+�TFR;1(DUM 0�DUM tB;TFR)FLP t +

�TFR;2(DUM tB;TFR�DUMT )FLP t + (10)

�TFR;1(DUM 0�DUM t ~B;TFR)(t=100)+

�TFR;2(DUM t ~B;TFR�DUMT )(t=100);

12In these equations the dummies for the trend are rearranged in the same way as

are the dummies of the slope in (9). E.g. in (10) the coe�cient in front of the term

(DUM0�DUMt~B;TFR

)(t=100) represents the coe�cient of the trend between the start of

the sample and the date of the break in the trend t ~B;TFR.

19

Page 20: Henriette Engelhardt, T omas K ogel and Alexia Prsk a w etz abs+pap/Paper-Engel… · omas K ogel and Alexia Prsk a w etz Max Planc k Institute for Demographic Researc h, Dob eraner

Table 3: Test for cointegration between TFR and FLP in time series regres-

sions for six countries, 1960-94, and additionally for the US, 1948-95 and US

age speci�c time series.

Dependent Variable

Country �TFRt �FLPtFRA yes*** yes***

FRG yes** yes***

ITA yes*** yes***

SWE no yes***

UK yes*** yes***

USA yes** yes***

USA (1948-95) yes*** yes***

USA20-24 (1948-95) yes*** yes***

USA25-34(1948-95) yes*** yes***

Notes: *, **, *** 15%, 5%, 1% signi�cance level. \yes" means rejection of H0: no cointe-

gration.

FLP t = FLP+�FLP;1(DUM 0�DUM tB;FLP )TFRt +

�FLP;2(DUM tB;FLP�DUMT )TFRt + (11)

�FLP;1(DUM 0�DUM t ~B;FLP)(t=100)+

�FLP;2(DUM t ~B;FLP�DUMT )(t=100) ;

where the 's, �'s and �'s are constants, DUMtB;i ; 8i = TFR; FLP denotes

the break in the slope, DUMt ~B;i; 8i = TFR; FLP denotes the break in the

trend with the subscript tB;i and t ~B;i indicating the date of the break in

the slope and the trend respectively. As was the case in the cointegration

tests, we included in the estimations of equations (10) and (11) the breaks

which were signi�cant according to the estimations of (8). The residuals

that resulted from estimation of (10) (resp. (11)) were saved and used in the

estimation of (6) (resp. (7)) with �TFRt (resp. �FLPt) as the dependent

variable. Table 4 summarizes the results of the tests for causality.

The table shows clear evidence of causality in both directions for almost

all cases. The only country were we found no causality at all is Sweden. Note

that in case of Sweden when �TFR is the dependent variable we applied

the standard Granger-causality test in �rst di�erences because in this case

cointegration was rejected.

In our view Table 4 is, by and large, consistent with simultaneous move-

20

Page 21: Henriette Engelhardt, T omas K ogel and Alexia Prsk a w etz abs+pap/Paper-Engel… · omas K ogel and Alexia Prsk a w etz Max Planc k Institute for Demographic Researc h, Dob eraner

Table 4: Testing for causality between FLP and TFR for six countries, 1960-

94, and additionally for the US, 1948-95 and US age speci�c time series.

Country FLP ! TFR TFR! FLP

FRA yes*** yes**

FRG yes** no

ITA yes* yes**

SWE no no

UK yes** yes***

USA no yes**

USA (1948-95) yes*** yes**

USA20-24 (1948-95) yes*** yes***

USA25-34 (1948-95) yes*** yes*

Notes: *, **, *** 15%, 5%, 1% signi�cance level. \yes" means rejection of H0: weak

exogeneity.

ments of both variables brought about by common exogenous third vari-

ables. These third variables are presumably �nancial incentives, social norms

and/or social institutions which help to combine work and family.

Table 5 summarizes the �ndings from the estimates of the long-run rela-

tion in (9), that is, { for our illustrative example with one break in the trend

and for the case with TFR as the dependent variable:

TFRt�1 = ̂TFR + �̂TFR;1(DUM0 �DUMtB;TFR)FLPt�1 (12)

+�̂TFR;2(DUMtB;TFR �DUMT )FLPt�1

+�̂TFR;1 (DUM0 �DUMt ~B;TFR)(t=100)

+�̂TFR;2(DUMt ~B;TFR�DUMT )(t=100)

(and similar for the case with FLP as the dependent variable). The quanti-

tative estimation results are shown in Appendix B.

Our estimation results (Appendix B) show a signi�cant negative correla-

tion between TFR and FLP before the break in the slope for all countries in

all equations with the exception of Sweden in case with TFR as the depen-

dent variable. Furthermore, for all countries the magnitude of this relation

in the equation with TFR as the dependent variable exceeds the one in the

equation with FLP as the dependent variable. Table 5 shows that in almost

all cases (that is, for all countries almost always no matter whether TFR

or FLP is the dependent variable) the correlation between TFR and FLP

21

Page 22: Henriette Engelhardt, T omas K ogel and Alexia Prsk a w etz abs+pap/Paper-Engel… · omas K ogel and Alexia Prsk a w etz Max Planc k Institute for Demographic Researc h, Dob eraner

Table 5: Relation between TFR and FLP after the break in the slope for six

countries, 1960-94, and additionally for the US, 1948-95 and US age speci�c

time series.

�TFRt �FLPtCountry { 0 + { 0 +

FRA x x

FRG x x

ITA x x

SWE x

UK x x

USA x x

USA (1948-95) x x

USA20-24 (1948-95) x x

USA25-34 (1948-95) x x

Notes: { indicates that the relation between TFR and FLP after the break is signi�cant

negative and its magnitude exceeds the one before the break; 0 indicates that the relation

between TFR and FLP is either signi�cant negative and weaker as compared to the period

before the break, or insigni�cant; + indicates a signi�cant and positive relation between

TFR and FLP after the break. The signi�cance level is in any case the 15% level. For

each country we indicate with a cross the valid case.

becomes smaller in magnitude or even insigni�cant after the break. However,

the opposite holds for Italy where the negative relation between TFR and

FLP becomes even stronger after the break. Note that in case of Sweden

when TFR is the dependent variable we found no cointegration and accord-

ing to econometric theory there is no long-run relation in case of absence of

cointegration.

To conclude, we �nd for all non-Mediterranean countries evidence for

a signi�cant negative relation between TFR and FLP before the break in

the slope and an insigni�cant or signi�cant negative, but weaker relation

after the break. This �nding is consistent with the view in Rindfuss and

Brewster (1996), Rindfuss et al. (2000) and Brewster and Rindfuss (2000)

that changes in childcare availability and attitudes towards working moth-

ers might have reduced the incompatibility between childrearing and female

employment. Moreover, our empirical results show that for Italy the relation

between TFR and FLP did neither become insigni�cant nor weaker { ac-

tually it became even bigger in magnitude as compared to the period before

22

Page 23: Henriette Engelhardt, T omas K ogel and Alexia Prsk a w etz abs+pap/Paper-Engel… · omas K ogel and Alexia Prsk a w etz Max Planc k Institute for Demographic Researc h, Dob eraner

the break. This latter result is consistent with the view in Brewster and

Rindfuss and Rindfuss et al. that in the Mediterranean countries there were

not such changes of family policies and/or social norms which reduced the

incompatibility between childrearing and employment for women.13

A glance at the time plot of the Italian FLP might even give an expla-

nation for our �ndings. The time series clearly reveals that in Italy the FLP

for women fell from 1960 to 1965. It might be that in the year 1960 (and

possibly earlier) in Italy men's wage income was not enough to support the

family and so often women had also to participate in the labor market, but

could not restrict their fertility (for example, because at that time the con-

traceptive pill was not introduced). As in the early 1960s the wage income of

the men grew, often women had not anymore to work to support the family

and, as a consequence, reduced their employment. Clearly, such a mechanism

would reduce the correlation between TFR and FLP in the initial periods

of our sample.

6 Conclusion

In this study we applied recent econometric time series techniques to test

for causality between fertility and female employment with macro-level time

series data of six developed countries. Compared to previous research we

introduced two new methodological elements: (i) we allowed for parameter

instability and (ii) we used an error correction model.

The existing literature mostly found unidirectional causation and con-

icting results on the direction between fertility and female employment. In

the light of our empirical results { which show causality in both directions {

we suggest that previous research tended to reject causality too often. The

failure to account for parameter instability (either in the long-run relation be-

tween fertility and female employment and/or the trend of each time series)

may partly explain why previous research rejected causality too often (see

Table 1). Moreover most of the previous research either ignored di�erence-

stationarity or applied Granger-causality tests to the �rst di�erences of the

time series without testing for cointegration.

Our result of causation in both directions indicates that ignoring di�erence-

stationarity did probably not lead to wrong causality results since this failure

13We would expect for other Mediterranean countries (such as Greece, Spain and Por-

tugal) the empirical results to be similar to the result for Italy.

23

Page 24: Henriette Engelhardt, T omas K ogel and Alexia Prsk a w etz abs+pap/Paper-Engel… · omas K ogel and Alexia Prsk a w etz Max Planc k Institute for Demographic Researc h, Dob eraner

tends to �nd \spurious" causality rather than to reject causality too often

(indeed Michael 1985, who applies Granger-causality tests to the levels and

ignores di�erence-stationarity as well as parameter instability, �nds some ev-

idence for causality in both directions). Nevertheless, in case of di�erence

stationary time series an error correction model is by far the most appropriate

model and testing for cointegration is imperative in this case.

The application of a Granger-causality test to the �rst di�erences of the

time series is a special case of the application of an error correction model.

More speci�cally, it is a Granger-causality test without the inclusion of the

error correction term. However, not including the error correction term might

lead to serious problems because the error correction term contains the in-

formation about a possible long-run relation between fertility and female

employment. For that reason, Granger-causality tests applied to �rst dif-

ferences can only test for \instantaneous causation" and the information on

long-run causation is lost in this framework. From a statistical point of view,

this loss of information might be an additional explanation why the existing

literature tended to reject causality too often. From a substantive point of

view, if the correlation between fertility and female employment is the re-

sult of intentions (as, for example, economists would argue), then testing for

instantaneous causation is not useful. Clearly, if individuals \decide" about

their number of children, their timing of childbearing and their labor market

participation, then a respond to shocks takes time and, therefore, a test of

long-run causality seems much more useful.

Allowing for parameter instability not only prevented us from rejecting

causality too often. Omission of a break in the trend can also lead to biased

estimates of the coe�cient of the relationship between fertility and female

employment. Obviously, the same is true for the failure to account for insta-

bility in the correlation between fertility and female employment. Michael

(1985), for example, �nds a positive e�ect from female employment on fer-

tility with macro-level data. We also found in our empirical applications a

positive coe�cient for some countries when we did not account for structural

breaks (the results are not shown). However, when we accounted for param-

eter instability, we found for all countries a negative coe�cient before the

break. Empirical results that show a positive correlation between fertility

and female employment already in the 1960s and the 1970s might { in our

view { indicate biased estimates due to the failure to account for parameter

instability (it is possible that for Nordic countries the situation is di�erent).

Besides the improvements of the estimation results, accounting for pa-

24

Page 25: Henriette Engelhardt, T omas K ogel and Alexia Prsk a w etz abs+pap/Paper-Engel… · omas K ogel and Alexia Prsk a w etz Max Planc k Institute for Demographic Researc h, Dob eraner

rameter instability is also an interesting �nding in its own right. Most im-

portantly, allowing for instability of the correlation between fertility and

female employment revealed that for all non-Mediterranean countries the

correlation became weaker over time. This result with time series data com-

plements and supports recent studies which found for OECD countries that

the cross-country correlation between fertility and female employment turned

from a negative value before the 1980s to a positive value thereafter. Our

empirical �nding supports therefore the view of a recent demographic liter-

ature which argues that societal level responses reduced the incompatibility

between fertility and female employment.

While those latter studies as well as our study do not distinguish between

full and part-time employment, the availability of part-time employment is

clearly a further element of societal level responses which also might have

reduced the incompatibility between childrearing and female labor market

participation. As more data about availability of part-time employment will

become accessible in the future, we suggest for future work to estimate the

contribution of availability of part-time employment to the weakening asso-

ciation between fertility and female employment.

To illustrate the importance to apply more sophisticated econometric

methods, as suggested in our paper, we conclude by showing the results of a

simple OLS regression of TFR on FLP with a trend and a constant included

on the right hand side.14 We applied this regression to all six countries of

our sample and for the time interval 1960-94.

Table 6: OLS regressions of TFR on FLP

Country Coe�cient of FLP P-value of coe�cient of FLP

FRA {2.47 0.00

FRG 2.38 0.00

ITA {0.85 0.00

SWE {1.73 0.00

UK {0.40 0.73

USA {4.11 0.00

Note: All regressions include a constant and a trend.

Though for �ve out of our six countries the OLS coe�cients on FLP are

14Results from simple OLS regressions of FLP on TFR with a trend and a constant

included on the right hand side are very similar.

25

Page 26: Henriette Engelhardt, T omas K ogel and Alexia Prsk a w etz abs+pap/Paper-Engel… · omas K ogel and Alexia Prsk a w etz Max Planc k Institute for Demographic Researc h, Dob eraner

negative (cf. Table 6), a closer look at the level and the signi�cance of the

estimates reveals several counterintuitive results. The positive coe�cient on

FLP for West-Germany seems clearly against the general predictions of de-

mographic theories. Though recent demographic �ndings suggest a reduction

in the incompatibility between childrearing and female employment, none of

these theories actually predicts a positive causal e�ect from FLP on TFR.

Comparing the estimates for Italy, Sweden and UK implies the lowest incom-

patibility between childrearing and female employment for the UK (where

the coe�cient on FLP is insigni�cant), followed by Italy and Sweden ranking

last. Again, these results are against the general �ndings in the demographic

literature so far where Sweden is always regarded as the country with the

highest compatibility between childrearing and female employment. Finally,

according to the OLS estimates the USA turns out to be the country with

the by far strongest incompatibility between childrearing and female employ-

ment in our sample. Again, one would rather expect Italy to be a country

that ranks lowest on the incompatibility between childrearing and female

employment. To conclude: The empirical results in our paper are much

more intuitive and are supported by the view of a recent demographic liter-

ature which argues that societal level responses reduced the incompatibility

between fertility and female employment. The importance to apply more

sophisticated econometric methods that go beyond simple OLS regressions

is therefore evident.

26

Page 27: Henriette Engelhardt, T omas K ogel and Alexia Prsk a w etz abs+pap/Paper-Engel… · omas K ogel and Alexia Prsk a w etz Max Planc k Institute for Demographic Researc h, Dob eraner

References

Andersson, G. 1999. \Trends in childbearing and nuptiality in Sweden, 1961-1997."

Paper presented at the Annual Meeting of the Population association of America,

New York.

Ahn N. and P. Mira. 1999. \A note on the changing relationship between fertility and

female employment rates in developed countries." Working paper,

http://www.feda.es/hojas/publicado.html

Bernhardt, E.M. 1993. \Fertility and employment." European Sociological Review 9:

25-42.

Brewster, K.L. and R.R. Rindfuss. 2000. \Fertility and women's employment in indus-

trialized nations." Annual Review of Sociology 26: 271-296.

Boswijk, H.P. 1994. \Testing for an unstable root in conditional and structural error

correction models." Journal of Econometrics 63: 37-60.

Cheng, B.S. 1996. \An investigation of cointegration and causality between fertility and

female labour force participation." Applied Economics Letters 3: 29-32.

Chesnais, J.C. 1996. \Fertility, family, and social policy in contemporary western Eu-

rope." Population Development Review 22: 729-739.

Cramer, J.C. 1980. \Fertility and female employment: Problems of causal direction."

American Sociological Review 45: 167-190.

Engle, R.F. and C.W.J. Granger. 1987. \Co-integration and error-correction: represen-

tation, estimation and testing." Econometrica 55: 251-276.

Esping-Andersen, G. 1999. Social foundations of postindustrial economies. Oxford: Ox-

ford University Press.

Goldin C. and L.F. Katz. 2000. \The power of the pill: Oral contraceptives and women's

career and marriage decisions." National Bureau of Economic Research working

paper no. 7527, Cambridge MA.

Granger, C.W.J. 1969. \Investigating causal relations by econometric models and cross-

spectral methods." Econometrica 37: 424-438.

27

Page 28: Henriette Engelhardt, T omas K ogel and Alexia Prsk a w etz abs+pap/Paper-Engel… · omas K ogel and Alexia Prsk a w etz Max Planc k Institute for Demographic Researc h, Dob eraner

Granger, C.W.J. 1988. \Some recent developments in a concept of causality." Journal

of Econometrics 39: 199-211.

Granger, C.W.J. and P. Newbold. 1974. \Spurious regression in econometrics", Journal

of Econometrics 26: 1045-1066.

Hall, S.G. and A. Milne. 1994. \The relevance of p-star analysis to UK monetary policy."

Economic Journal 104: 597-604.

Hantrais, L. 1997. \Exploring relationships between social policy and changing family

forms within the European Union." European Journal of Population 13: 339-379.

Hoem, J. 1990. \Social policy and recent fertility change in Sweden." Population and

Development Review 16: 735-748.

Kim, I.-M. 1997. \Detecting the number of structural breaks", Economics Letters 57:

145-148.

Klijzing, E., J. Sieger, N. Keilman and L. Groot. 1988. \Static versus dynamic analysis

of the interaction between female labour force participation and fertility." European

Journal of Population 4: 97-116.

Kremers, J., N.R. Ericsson and J.D. Juan. 1992. \The power of cointegration tests."

Oxford Bulletin of Economics & Statistics 54: 325-48.

Lehrer, E. and M. Nerlove. 1986. \Female labor force behavior and fertility in the United

States." Annual Review of Sociology 12: 181-204.

Maddala, G.S. and I.-M. Kim. 1998. Unit roots, cointegration, and structural change.

Cambridge, UK: Cambridge University Press.

Michael, R.T. 1985. \Consequences of the rise in female labor force participation rates:

questions and probes." Journal of Labor Economics 3: 117-146.

N�i Bhrolch�ain, M. 1993. \How to anser the fertility/employment question (and how

not to)." In: M�aire N�i Bhrolch�ain (ed.), New perspectives on fertility in Britain.

Studies on Medical and Population Subjects 55: 151-157.

Rindfuss, R.R., K. Benjamin and S.P. Morgan. 2000. \The changing institutional context

of low fertility." Working paper, University of North Carolina.

28

Page 29: Henriette Engelhardt, T omas K ogel and Alexia Prsk a w etz abs+pap/Paper-Engel… · omas K ogel and Alexia Prsk a w etz Max Planc k Institute for Demographic Researc h, Dob eraner

Rindfuss, R.R. and K.L. Brewster. 1996. \Childrearing and fertility." Population and

Development Review 22 (Supplement): 258-289.

Spitze, G. 1988. \Women's employment and family relations: A review." Journal of

Marriage and the Family 50: 595-618.

Otero, J. and J. Smith. 2000. \Testing for cointegration: Power versus frequency of

observation-further Monte Carlo results." Economics Letters 67: 5-9.

Weller, R.H. 1977. \Wife's employment and cumulative family size in the United States,

1970 and 1960." Demography 14: 43-65.

Zimmermann, K.F. 1985. Familien�okonomie. Berlin: Springer.

29

Page 30: Henriette Engelhardt, T omas K ogel and Alexia Prsk a w etz abs+pap/Paper-Engel… · omas K ogel and Alexia Prsk a w etz Max Planc k Institute for Demographic Researc h, Dob eraner

Appendix A: The data

1. TFR: total fertility rate

De�nition: Sum of age speci�c fertility rates

Data sources: German Federal Statistical O�ce, section population

and labor force (in case of West Germany), New Cronos CD-Rom (from

Eurostat) (1998)(in case of Western European countries other than

West-Germany) and the Demographic Yearbook of The United Nations

CD-Rom (1997)(for all other countries).

2. FLP : female labor force participation in female population of age 15

to 64

De�nition: Logarithm of female labor force including female unem-

ployed divided by logarithm of female population of age 15 to 64

Data source: Comparative welfare states data set http://www.lis.ceps.lu/

(except West-Germany after 1989 and US where the sources are: Ger-

man Federal Statistical O�ce, microcensus and US Bureau of Labor

statistics http://stats.bls.gov/).

30

Page 31: Henriette Engelhardt, T omas K ogel and Alexia Prsk a w etz abs+pap/Paper-Engel… · omas K ogel and Alexia Prsk a w etz Max Planc k Institute for Demographic Researc h, Dob eraner

AppendixB:Estimationresultsoflong-runrelation

15

FRA

TFRt�

1=�

0:19�

1:66�

(DUM59�

DUM76)FLPt�

1�

1:32�

(DUM76�

DUM94)FLPt�

1�

0:55�

(DUM59�

DUM73)(t=100)

(�

8:42)

(�

5:41)

(�

3:30)

1:01�

(DUM73�

DUM76)(t=100);

(�

4:67)

FLPt�

1=�

0:68�

0:17�

(DUM59�

DUM82)TFRt�

1�

0:05(DUM82�

DUM94)TFRt�

1+0:59�

(DUM59�

DUM82)(t=100)

(�

8:34)

(�

1:15)

(9:64)

+0:32�

(DUM82�

DUM89)(t=100)+0:43�

(DUM82�

DUM94)(t=100):

(4:97)

(7:94)

FRG

TFRt�

1=�

0:05�

1:30�

(DUM59�

DUM85)FLPt�

1�

0:69(DUM85�

DUM94)FLPt�

1�

1:67�

(DUM68�

DUM85)(t=100),

(�

2:26)

(�

0:96)

(�

8:69)

FLPt�

1=�

0:68�

0:03�

(DUM59�

DUM91)TFRt�

1+0:11�

(DUM91�

DUM94)TFRt�

1�

0:15�

(DUM65�

DUM67)(t=100)

(�

2:49)

(1:64)

(�

2:41)

+0:14�

(DUM79�

DUM85)(t=100)+0:25�

(DUM85�

DUM89)(t=100)+0:55�

(DUM89�

DUM90)(t=100)+0:32�

(DUM90�

DUM94)(t=100):

(5:64)

(8:63)

(8:25)

(6:40)

15Asmentionedbefore,inourapplicationwe�ndforsomecountriesmorethanonebreakinthetrend.Thenotationfor

thecasewithmultiplebreaksinthetrendisastraightforwardextensionofthenotationusedinthetext(forexample,the

coe�cientinfrontoftheterm(DUM

64�

DUM

78)(t=100)representsthecoe�cientofthetrendfrom1965to1978).Moreover,

weexcludedintheestimationequationthetrendforaparticulartimeintervalifinpriorestimationsthetrendwasinsigni�cant

forthattimeinterval(accordingtoatleastthe15%signi�cancelevel).

31

Page 32: Henriette Engelhardt, T omas K ogel and Alexia Prsk a w etz abs+pap/Paper-Engel… · omas K ogel and Alexia Prsk a w etz Max Planc k Institute for Demographic Researc h, Dob eraner

ITA

TFRt�

1=0:30�

0:85�

(DUM59�

DUM81)FLPt�

1�

1:05�

(DUM81�

DUM94)FLPt�

1�

1:78�

(DUM64�

DUM74)(t=100)

(�

2:26)

(�

2:72)

(�

3:66)

2:69�

(DUM74�

DUM87)(t=100)�

2:19�

(DUM87�

DUM94)(t=100);

(�

3:52)

(�

3:57)

FLPt�

1=�

0:67�

0:49�

(DUM59�

DUM80)TFRt�

1�

0:70�

(DUM80�

DUM94)TFRt�

1+0:30�

(DUM74�

DUM92)(t=100)

(�

8:48)

(�

6:40)

(�

3:62)

0:28�

(DUM92�

DUM93)(t=100):

(�

2:13)

SW

E

FLPt�

1=�

0:26�

0:35�

(DUM59�

DUM73)TFRt�

1�

0:17�

(DUM73�

DUM94)TFRt�

1+0:38�

(DUM67�

DUM91)(t=100):

(�

4:10)

(�

1:58)

(5:13)

UK

TFRt�

1=0:13�

1:25�

(DUM59�

DUM77)FLPt�

1�

0:97�

(DUM77�

DUM94)FLPt�

1�

0:46�

(DUM64�

DUM71)(t=100)

(�

7:15)

(�

3:34)

(�

3:27)

1:35�

(DUM71�

DUM77)(t=100)�

0:26�

(DUM77�

DUM86)(t=100);

(�

8:97)

(�

2:59)

FLPt�

1=�

0:75�

0:10�

(DUM59�

DUM86)TFRt�

1�

0:04(DUM86�

DUM94)TFRt�

1+0:80�

(DUM59�

DUM94)(t=100):

(�

2:95)

(�

1:40)

(8:18)

32

Page 33: Henriette Engelhardt, T omas K ogel and Alexia Prsk a w etz abs+pap/Paper-Engel… · omas K ogel and Alexia Prsk a w etz Max Planc k Institute for Demographic Researc h, Dob eraner

USA(1960-94)

TFRt�

1=0:21�

1:10�

(DUM59�

DUM70)FLPt�

1�

0:80�

(DUM70�

DUM94)FLPt�

1�

0:77�

(DUM64�

DUM76)(t=100)

(�

5:25)

(�

2:86)

(�

3:42)

0:45�

(DUM86�

DUM94)(t=100);

(�

3:42)

FLPt�

1=�

0:43�

0:30�

(DUM59�

DUM84)TFRt�

1�

0:21�

(DUM84�

DUM94)TFRt�

1+0:20�

(DUM64�

DUM76)(t=100)

(�

6:69)

(�

5:52)

(2:09)

+0:51�

(DUM76�

DUM94)(t=100):

(5:34)

USA(1948-95)

TFRt�

1=0:39�

0:79�

(DUM47�

DUM70)FLPt�

1�

0:35�

(DUM70�

DUM95)FLPt�

1+1:28�

(DUM52�

DUM61)(t=100)

(�

7:46)

(2:34)

(�

3:72)

0:53�

(DUM64�

DUM73)(t=100)+0:32�

(DUM86�

DUM95)(t=100);

(�

3:24)

(3:79)

FLPt�

1=�

0:91�

0:09�

(DUM47�

DUM77)TFRt�

1�

0:02(DUM77�

DUM95)TFRt�

1+1:30�

(DUM47�

DUM89)(t=100)

(�

4:09)

(�

0:68)

(19:91)

+1:23�

(DUM89�

DUM95)(t=100):

(20:72).

33

Page 34: Henriette Engelhardt, T omas K ogel and Alexia Prsk a w etz abs+pap/Paper-Engel… · omas K ogel and Alexia Prsk a w etz Max Planc k Institute for Demographic Researc h, Dob eraner

USA20-24(1948-95)

FERt�

1=�

2:21�

0:91�

(DUM47�

DUM71)FLPt�

1�

0:03(DUM71�

DUM95)FLPt�

1+1:09�

(DUM53�

DUM61)(t=100)

(�

9:64)

(�

0:16)

(4:58)

0:99�

(DUM64�

DUM71)(t=100);

(�

5:22)

FLPt�

1=�

1:27�

0:31�

(DUM47�

DUM81)FERt�

1�

0:30�

(DUM81�

DUM95)FERt�

1�

0:94�

(DUM51�

DUM53)(t=100)

(�

14:65)

(�

13:87)

(�

1:91)

+0:73�

(DUM53�

DUM89)(t=100)+0:60�

(DUM89�

DUM95)(t=100):

(10:55)

(9:41).

USA25-34(1948-95)

FERt�

1=�

1:88�

0:56�

(DUM47�

DUM70)FLPt�

1�

0:32�

(DUM70�

DUM95)FLPt�

1+1:28�

(DUM50�

DUM61)(t=100)

(�

8:30)

(2:89)

(4:23)

0:64�

(DUM64�

DUM76)(t=100)+0:29�

(DUM84�

DUM95)(t=100);

(�

4:80)

(3:73)

FLPt�

1=�

2:40�

1:06�

(DUM47�

DUM67)FERt�

1�

0:96�

(DUM67�

DUM95)FERt�

1+1:24�

(DUM47�

DUM90)(t=100):

(�

7:31)

(�

7:58)

(7:79)

Notes:*,**,***15%,5%,1%signi�cancelevel,t-statisticsinparenthesisand

DUM

i=

�1

if

t>

i

0

otherwise

;8

i�[60;94];FER

representstheagespeci�cfertilityrate.

34