
The next three chapters are about SIPP data collection procedures. This chapter describes the current data collection procedures, highlighting those features most likely to influence the quality of the data, e.g., interview mode, respondent rules, the structure of the interview and the questionnaire, interviewer characteristics and training, and quality assurance procedures. Chapters 5 and 6 present available information on how these features of the data collection process affect quality. The results of experiments in which alternative procedures were tested are also covered in Chapters 5 and 6. Chapter 5 covers nonresponse error and Chapter 6 covers measurement or response error.
References that provide useful information about current data collection strategies and procedures, as well as changes since the start of interviewing in 1983, include Kalton et al. (1986) and SIPP Working Papers 8401 (Nelson et al., 1985) and 8830 (Kasprzyk, 1988). Persons interested in more detail may also wish to consult the SIPP-5010 Interviewers Manual (U. S. Bureau of Census, 1988b), which is periodically updated, and may be obtained by writing or calling Customer Services, Data User Services Division, U. S. Bureau of the Census, Washington, DC 20233, (301) 457-4100.
Interviews of sample persons and households take place at 4-month intervals. Information on core items, such as labor force participation and income recipiency and amounts, is collected for each of the 4 months included in the reference period. Interviews are conducted in the month immediately following the 4-month reference period and the majority of them are completed in the first 2 weeks of that month.
For SIPP panels between 1984 and 1991, the preferred mode of interview and the one used in the great majority of interviews was face-to-face. Telephone interviewing was permitted to follow-up for information not obtained at the initial face-to-face interview, to interview persons who will not or cannot participate otherwise, and to interview sample persons who have moved to locations more than 100 miles from a SIPP sample PSU. For the 1984 panel, about 5.3 percent of the interviews were conducted by telephone because the households were inaccessible for face-to-face interviews. The corresponding rate for the 1985 panel was 6.3 percent. For the most part, the proportion of interviews conducted by telephone increased from the second through the final wave of each panel.
The mode of data collection was switched to maximum telephone interviewing in 1992 commencing with wave 7 of the 1990 panel and wave 4 of the 1991 panel. For the 1992 and 1993 panels, waves 1, 2, and 6 were conducted primarily using face-to-face interviewing as before. The remaining waves were conducted through telephone interviewing, to the extent feasible. Section 4.5 discusses the studies that monitored effects of maximum telephone interviewing on data quality.
The length of a household interview depends in part on the number of adults (persons 15 and over) in the household. Table 4.1 shows the median length of interview (in minutes) by wave and number of household members age 15 and over for the 1985 and 1993 panels. The median for all households ranged from a low of 27 minutes to a high of 40 minutes for the eight waves. For the 1985 panel, the high occurred in the first wave and the low in the second wave, which was the only wave after the first which had no topical modules. For the 1993 panel, the high again occurred in the first wave and there is no significant evidence that telephone interviewing shortens interview times. The interview time for the subsequent waves vary, depending on the topical modules included in each wave.
Table 4.1 Median household interview times (in minutes)
by wave and number of persons aged 15 and over in household: 1985 and 1993 panels
| Number of persons aged 15 and over |
1985 panel wave: | |||||||
1 |
2 |
3 |
4 |
5 |
6 |
7 |
8 |
|
One Two Three Four Five Six Seven or more |
24 41 52 64 71 92 116 |
18 27 39 51 56 67 90 |
21 38 51 61 71 87 99 |
21 38 51 62 71 76 98 |
23 41 54 68 73 83 96 |
20 34 45 57 66 78 101 |
21 39 50 59 67 88 98 |
22 40 52 63 71 81 99 |
All households |
40 |
27 |
37 |
37 |
39 |
33 |
37 |
38 |
Source: Table prepared by A. Feldman-Harkins, Division of Housing and Household Economic Statistics, U.S. Bureau of Census.
| Number of persons 15 and over |
1993 panel wave: |
||||||||
1 |
2 |
3 |
4 |
5 |
6 |
7 |
8 |
9 |
|
One Two Three Four Five Six Seven or more |
27 45 59 73 90 94 125 |
20 35 49 60 77 96 120 |
20 35 50 65 77 103 109 |
19 32 45 55 71 82 85 |
17 30 42 53 68 78 90 |
22 40 55 68 89 104 113 |
19 33 45 53 73 78 93 |
17 30 42 50 68 71 84 |
20 37 50 61 77 92 103 |
All households |
40 |
32 |
32 |
27 |
21 |
35 |
30 |
27 |
32 |
Mode of interview1 |
Per |
Per |
Tele |
Tele |
Tele |
Per |
Tele |
Tele |
Tele |
1Per = face-to-face interview, Tele = telephone interviews.
Response to the survey is voluntary. The respondent rule for SIPP is that all persons aged 15 and over who are present at the time of the interview should report for themselves unless not physically or mentally able to do so. Proxy informants are accepted for persons absent or incapable of responding. Exhibit 4.1 shows the hierarchy of preferred proxy respondents established for SIPP. The procedure for identifying a proxy with telephone interviews is the same as personal visits.
Exhibit 4.1 SIPP Respondent Rules

During the first wave of the 1985 panel, 67 percent of the sample persons for whom data were obtained reported for themselves. In subsequent waves this figure was quite stable at the level of 63-64 percent (Kasprzyk and McMillen, 1987). Similar results have been observed for other panels. However, for persons completing all interviews, Table 4.2 shows that the percentage reporting for themselves in every wave was about 40 percent for the 1984 panel and about 36 percent for the 1992 panel. The distributions of sample persons by patterns of self and proxy response for the first three waves of the 984 and 1992 SIPP panels are shown in Table 4.3.
Table 4.2 Persons aged 15 and over1 by number of proxy interviews: 1984 and 1992 panels
Number of proxy interviews |
1984 Panel | 1992 Panel | ||
Number of persons |
|
Number of persons |
|
|
0 1 2 3 4 5 6 7 8 |
9,822 2,764 1,887 1,492 1,465 1,378 1,468 1,722 2,570 |
40.0 11.2 7.7 6.1 6.0 5.6 6.0 7.0 10.5 |
11,223 3,724 2,922 2,131 2,487 2,291 2,804 3,684 -- |
35.9 11.9 9.3 6.8 8.0 7.3 9.0 11.8 -- |
Total |
24,568 |
100.0 |
31,266 |
100.0 |
1Age determined as of first interview.
Souce: Data for the 1984 panel was from Kasprzyk and McMillen (1987).
Table 4.3 Self (S) and proxy (P) response patterns for waves 1 to 3: 1992 panels
| Wave | Percent 1984 |
Percent 1992 |
||
1 |
2 |
3 |
||
S S S S P P P P |
S S P P S S P P |
S P S P S P S P |
49.4 5.9 6.0 6.2 5.8 3.5 5.1 18.1 |
50.5 7.7 4.6 7.2 4.5 3.6 3.1 18.8 |
All patterns |
100.0 |
100.0 |
||
Source: Data for the 1984 panel was from Kalton et al. (1986).
The SIPP "following rules," which determine which persons should be interviewed after the initial wave, are considered to be a part of the sample selection procedure and were described in Section 3.4. The preferred procedure is for the same interviewer to interview sample persons and households at each wave, but because of interviewer turnover and moves of sample persons to other locations, this is not always possible.
This section describes the SIPP data collection instruments and how they are used in the interviews for successive waves of a panel. Copies of the data collection instruments for each completed panel are included in a SIPP Informational Booklet, which may be obtained by writing or calling Customer Services, Data User Services Division, Bureau of the Census, Washington, DC 20233, (301) 457-4100. The primary data collection instruments for SIPP are the control card and the questionnaire. The Control Card is the basic field record for the sample unit and remains in the Regional Office during the life of the panel. Questionnaires for each wave are sent to a central point for processing.
For each sample address, a control card is completed at the first (wave 1) interview and updated at each subsequent wave. At the start of the wave 1 interview, the interviewer uses the control card to record basic demographic characteristics for each person residing at the sample address, plus a few characteristics of the household and the housing unit. The telephone number of the household is recorded both for use in telephone interviewing in subsequent waves (introduced in February 1992) and for callbacks needed to obtain information not available at the time of the interview. To assist in following movers, each household is asked for the name and telephone number of a nonhousehold member who would be likely to know the new location of households or persons who have moved.
At the close of the first-wave interview, the interviewer transcribes a few key data items for each adult to the control card, such as names of employers, sources of income received, and assets, to aid in collecting data for these items in subsequent interviews. Social Security numbers are recorded for household members who have them, for possible use in connection with estimation and research projects requiring linkage of data from SIPP interviews with data from administrative records. A separate section of the control card is used to record information about household noninterviews when they occur.
In interviews for the second and later waves, the interviewer begins by updating the control card to identify changes in the household roster and in the demographic characteristics of persons, e.g., a change in marital status. Selected data items from the control card are transcribed to the current-wave questionnaire to help the interviewer determine which specific questions need to be asked for each person. At the close of each interview, except in the final wave, changes in the data items used for tracking employment, income, and assets are recorded on the control card.
A different version of the questionnaire is used for each wave and at each wave a separate questionnaire is completed for each adult. Core items, mainly about labor force participation and sources and amounts of income, are included on the questionnaire for each wave. Except in wave 1 for all panels and wave 2 for the 1984 and 1985 panels, each questionnaire has included one or more fixed and/or variable topical modules.
The interview proceeds on a person-by-person basis until a questionnaire has been completed for each adult in the household. The core items for a person are divided into four sections:
Section 4 is followed by the topical modules. After they are completed, the interview proceeds to the next adult in the household. Exhibit 4.2 shows the main steps in completing or updating the control card and completing the questionnaires for a single interview at a sample address.
Other instruments used in the data collection process are advance letters and respondent aids. Before each wave of interviewing, an advance letter signed by the Director of the Census Bureau is sent to each sample address. The letter informs the residents that an interviewer will be calling in a few days, explains the purpose of the survey, gives the authority for conducting the survey, and strongly encourages the recipient(s) to participate. Exhibit 4.3 shows the version of the letter used for the 1993 panel.
Respondent aids, sometimes called flash cards, play an important role in the interview. A key one is the Calendar of Reference Months, which is frequently kept in sight during the entire interview to make clear the specific months and weeks for which respondents are being asked to provide information. Other aids are lists of the categories used to respond to questions on topics such as race, origin or descent, types of income, and types of assets. These cards are shown to respondents at the appropriate stages in the interview. There are also aids for use primarily by interviewers, such as an Age Verification Chart, which the interviewer can use to determine whether reported ages and dates of birth are consistent. Checklists for interviewers remind them what steps are needed in order to prepare for an interview and what needs to be done after the interview has been completed. Respondent aids are left with the respondents after the initial face-to-face interviews. If the respondents cannot find them at the time of telephone interviews, the interviewers would read the categories to the respondents.
Exhibit 4.3 Advance Letter to Respondents: 1993 Panel

Exhibit 4.3 (continued) Reverse Side of Advance Letter

The quality of SIPP data is affected by certain broad strategies that were adopted in designing the data collection instruments and the interview process. These strategies are identified in this section. Some of them could be applied in any survey; others are applicable only in longitudinal surveys. Available information on how these strategies affect quality and some results of experiments to test alternative strategies are presented in Chapters 5 and 6.
One strategy is redundancy, i.e., collecting more than the minimum information needed to meet the surveys data requirements and using the redundant information either to check for internal consistency or to impute missing information. The most obvious example of this in the SIPP is the Annual Roundup, a topical module used in waves 5 and 8 (waves 6 and 9 in the 1984 panel) to collect calendar year information on certain kinds of income that have already been covered on a month-by-month basis in prior interviews. The timing of the Annual Roundup is such that most respondents will have already completed their tax returns for the prior year. For some types of income, the availability of such records might lead to more accurate reporting.
Another aspect of strategy is sequencing of the items. As described in the preceding section, all types of income received are identified for each person before the person is asked for amounts and other details about each type. An alternative would be to collect detailed information on each type of income recipiency identified before proceeding to other types. Likewise, the decision to collect all data for each person before proceeding to the next person is an aspect of sequencing. An alternative would be to cover each topic for the entire household before proceeding to the next topic.
Use of records, such as pay vouchers, tax returns, and bank statements, during the interview may help respondents to provide more accurate information. One would expect the use of copies of tax returns and wage and tax statements (Form W-2) to be especially helpful in completing the Annual Roundup modules. Interviewers are instructed to encourage respondents to use records. The introduction to Section 5, Amounts Section, in the SIPP Interviewers Manual states: "Wherever possible the use of available records showing amounts received should be encouraged in order to ensure the best possible responses" (U.S. Bureau of the Census, 1988b).
An important strategy question that applies only to longitudinal surveys is whether each interview should be independent or dependent with respect to preceding interviews. A mixed strategy is followed in SIPP. Questions on sources of income are asked in a dependent mode. Respondents are asked about sources of income reported in the previous interview, and then are asked about new sources of income during the 4 months covered by the current interview. Starting with the 1986 panel, questions on occupation and industry of employed persons have also been asked in a dependent mode. Respondents are asked whether their activities or duties have changed during the past 8 months. A negative response would eliminate the detailed occupation and industry questions. On the other hand, questions on assets in wave 7 have been asked without reference to information on assets reported a year earlier, in wave 4. An exception was made for half of the 1984 panel. This Asset Feedback Experiment is discussed in Section 6.3.1.
The formal titles of Census Bureau interviewers and supervisors are field representatives (FRs) and supervisory field representatives (SFRs), respectively. The FRs locate sample addresses and collect most of the data. The SFRs do follow-up work to convert potential refusals, handle assignments that are sometimes partially completed by FRs, or act as team leaders for a group of FRs.
As of September 1995, there were 335 FRs and about 50 SFRs working on the 1992 and 1993 panels. The turnover rate for SIPP FRs in Fiscal Year 1995 was 19 percent. This rate was calculated by dividing the number of FRs trained in Fiscal Year 1995 (64) by the total number of FRs (335). The average monthly assignment size in Fiscal Year 1995 was about 14 households. FRs have the entire month to complete the work, although they are encouraged to finish about 90 percent during the first 2 weeks; the last 2 weeks are spent following up movers and potential refusal cases. Both FRs and SFRs are paid on an hourly basis. As of September 1995, the starting salary ranged from $7.55 to $8.48, excluding any increases reflecting high cost-of-living geographic areas.
Training for SIPP FRs includes home study, classroom training, on-the-job training, and refresher training. Training is continuous and includes:
The work of FRs is monitored and feedback provided to them in several ways.
FR Response Rates: |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
FRs receive annual ratings based on these standards. SFR may take account of extenuating circumstances when they assign adjective ratings.
There is a concern that the current method for calculating response rates is not sensitive to increases in the number of noninterviews for unlocated movers (Type D noninterview). A Type D noninterview reduces the eligible workload by one and the response rate is computed based on the reduced workload (in the denominator). With this method, increases in the number of Type D noninterviews have little impact on the response rate, and therefore may offer little incentive for FRs to find unlocated movers. The Census Bureau is reviewing alternative methods to calculate response rates and to provide incentives for locating movers (see Section 4.5)
This section is about response errors that are associated with the data collection phase to SIPP. The level of measurement error is affected by virtually all design features including: length of reference periods, interview mode, respondent rules, errors associated with interviewers, questionnaire length and structure, and use of records by respondents. This section discusses evaluation studies in each of theses areas and improvements to the data collection procedures to enhance data quality.
SIPP interviews are conducted at four monthly intervals. In the core portion of the SIPP interview, respondents are asked to report on labor force activity, income recipiency, and amounts of income for the 4 months immediately preceding the month of the survey interview. For most items, they are asked to report separate data for each of the months included in the 4-month reference period. Although respondents sometimes refer to records when answering these questions, they often rely only on memory.
Memory errors in surveys have been the subject of many evaluation studies and experiments (e.g., Neter and Waksberg, 1965). Experience suggests that memory errors can be reduced by the use of relatively short reference periods and, in longitudinal surveys, by using some form of dependent interviewing such as bounded recall, where interviewers at each round of a survey have access to information reported by respondents on previous rounds.
The frequency of interviews and length of reference periods was a key issue in the design of SIPP and was addressed in two studies conducted as part of the ISDP. In the first of these studies (Olson, 1980; Kasprzyk, 1988), a single interview using a 6-month recall period was compared with two consecutive interviews, both using 3-month reference periods. The same 6 calendar months were covered in both sets of interviews. Not surprisingly, the proportion of respondents who reported some income from sources such as wages, Aid to Families with Dependent Children (AFDC), and unemployment compensation for the first 3 months of this period was greater for the interviews that used a 3-month reference period.
Hill (1986) used an additive error model to analyze the joint effects of length of recall and time in sample (number of waves) on earnings data for the first 9 months of the 1984 panel. The analysis was restricted to men aged 25 to 55 who reported for themselves in all three interviews, had at least 2 months with some employment, and had no self-employment and no imputed data - a total of 1,378 sample persons. He found evidence of significant differential reporting bias and response variance associated with both length of recall and time in sample.
The choice between 4-month and 6-month reference periods was considered for the 1996 SIPP redesign. Since a 6-month recall period involves only two interviews per year, SIPP could afford to increase sample size and reduce sampling error. However, a longer recall may reduce reports for small sources of income and incomes received for short durations, and exacerbate the seam effect observed at the transition between waves of data collection (see Section 6.1 for a discussion of the seam problem). Huang (1993) presented the results of a Recall Length Study to examine this choice. This report presents a literature review and the results of simulation studies on the effects of length of recall period on sampling error and estimates of poverty and program participation. The conclusion is that the increase in the precision of estimates cannot compensate for the increase in biases. The recommendation is therefore to maintain the 4-month reference period. More recent findings on recall bias from a record-check study are discussed in Chapter 6.
Until February 1992, face-to-face interview was the preferred mode of interviewing for SIPP. Telephone interviews were allowed when a sample person had moved more than 100 miles from a SIPP primary sampling unit or when the interview cannot otherwise be completed. Under this maximum personal visit interviewing mode, about 95 percent of all interviews were face-to-face.
In a continuing effort to improve the efficiency of the SIPP design, tests of a maximum telephone mode, i.e., conducting interviews by telephone except when circumstances make it impossible, have been built into the ongoing survey operations, in order to determine how the quality of results and the cost of data collection might be affected by shifting to greater use of telephone interviews.
An off-line feasibility test of maximum telephone interviewing in a single wave was conducted in June 1985 with 279 households, in two regional offices, that had been dropped from the 1984 panel because of budget cuts. Subsequently, an on-line experiment was carried out in two phases. From August to November 1986, half of the national sample for the 1986 panel was designated for maximum telephone interviewing in either the second or third wave of the panel. The allocation to the two modes was by segment and most of the telephone interviews were done by regular SIPP field representatives, calling from their homes. In the second phase, from February to April 1987, some of the households assigned to the telephone mode in the first phase were assigned to that mode again for the wave 4 interviews. As a result, there were three experimental groups and a control group, as follows:
Group |
Proportion of |
Maximum |
| Experimental A | One-fourth | Waves 3 and 4 |
| Experimental B | One-eighth | Waves 2 and 4 |
| Experimental C | One-eighth | Wave 2 only |
| Control | One-half | Not used |
The experimental procedures and some results have been discussed by Durant and Gbur (1988); Carmody et al. (1988); Kasprzyk (1988); Petroni et al. (1989); Gbur and Petroni (1989); and Waite (1990). Household and person noninterview rates did not differ significantly for the experimental and control groups; item nonresponse rates were slightly but not significantly higher in the telephone interviews. The proportion of persons whose data were supplied by proxy respondents was higher for the maximum telephone group, 37.8 percent versus 35.3 percent. In interviewer debriefings, about three-fourths of the interviewers felt that telephone interviewing could be used successfully in SIPP, but others believed it would not work because of the sensitive nature of the questions and because it would be more difficult to establish and maintain respondent rapport and trust.
The encouraging results from this experiment, and the cost saving benefits, led to a decision to make greater use of telephone interviewing in SIPP. Maximum telephone interviewing began in February 1992 with wave 7 of panel 1990 and wave 4 of panel 1991. For the 1992 and 1993 panels, wave 1, 2, and 6 were conducted primarily using face-to-face interview as before, but the remaining waves were conducted using telephone interviewing to the extent feasible. For waves in which telephone interviewing is the preferred mode, FRs were instructed to record the reason for obtaining a personal versus telephone interview. The reasons are mostly the following: respondent requests, no telephone or unlisted number, reluctant households (as determined by the FR), elderly persons, people with hearing problems, and language barriers.
Since implementation, the Census Bureau has monitored the effects of telephone interviewing on cost savings and on data quality. Analyses of cost showed savings in terms of cost per household unit, interviewing hours per unit, and miles per unit (Allen, 1993). The effects of telephone interviewing on data quality showed no adverse effects on response rate (King, 1995) and on record usage by respondents during interviews (Huggins, 1994).
King (1995) found that although both Type A (refusal, no one home, etc.) and Type D (unlocated movers) nonresponse rates have risen during the 1990s, there is no consistent evidence that telephone interviewing is the cause of the increases. She compared the nonresponse rates by wave for the 1986-1993 panels and concluded that the increase in Type A rate cannot be attributed to telephone interviewing because most of the Type A nonresponse occurred in the first two waves which continued to rely on face-to-face interviews. The Type D nonresponse rate showed an increase of approximately 12 percent in the unlocated mover rate per wave in panels where telephone interviewing is used heavily, but there is no evidence of the telephone interviewing as the cause of the increase.
James (1994) reviewed the studies by Ikeda (1993) and Sutterlin (1993a) which evaluated SIPP estimates from the Education Financing module. Estimates of educational cost are derived based on data from the "telephone interview" wave (wave 5 of Panel 1991 and wave 8 of panel 1990) and from a "personal visit" wave (wave 5 of panel 1990). These estimates were compared with estimates from administrative data from the College Board and Department of Education. The results showed no evidence of an effect from telephone interviewing.
As explained in Section 4.1, all sample persons aged 15 and over who are present at the time of an interview are asked to respond for themselves unless not physically or mentally competent to do so. Proxy responses are accepted for those who are absent or incapable of responding (see Exhibit 4.1 above). Under this rule, about 65 percent of sample persons report for themselves at each interview wave. It has been generally believed that the level of accuracy for self-response is higher than for proxy response (but see Moore, 1988, for a contrary view); however, achieving a higher proportion of self-response would increase data collection costs and might lead to some increase in person nonresponse rates.
In the ISDP, the current SIPP respondent rule was experimentally compared with a maximum self-response rule, which required callbacks to interview sample persons not present at the first interview for a household (Coder, 1980; Kasprzyk, 1988). Household and person noninterview rates were slightly higher for the maximum self-response group and the estimated costs of data collection were about 4 to 6 percent higher. Under the self-response rule, records were consulted more often by persons answering wage and salary questions and response rates for hourly wage rates were higher. There was not conclusive evidence favoring either set of respondent rules, so the less stringent requirement was adopted, with the addition of a callback procedure to obtain certain critical data items if they were not obtained at the initial interview of a household.
Under the current respondent rules, most unmarried college students living away from home while attending college are considered to be members of their parents households and information for them is usually obtained from proxy respondents. Roman and OBrien (1984) describe the results of an ISDP study in which data for students obtained by proxy from members of their parents households were compared with data collected from them by self-response at their school residences. The analysis was limited by flaws in the implementation of the test; however, it suggested that proxy response led to greater underreporting of recipiency and more nonresponse on amounts of income and expenses, especially for smaller amounts.
Hill (1987) used data from a single month of the 1984 panel in a model-based analysis of the effects of proxy versus self-response on reported earnings for a sample of men aged 25 to 55. He estimated that self-reports of earnings were about 20 percent below proxy reports. The validity of Hills results depends on the correctness of his assumptions concerning a common behavioral structure for proxy and self-respondents. Furthermore, without an external measure of validity, it cannot be determined which type of report is of higher quality.
The above discussion has been in terms of the effects of self and proxy responses on SIPP cross-sectional estimates. There is an additional concern about the effects of changing the informant between waves (e.g., from self respondent to proxy respondent, or from one proxy respondent to another) or longitudinal estimates, particularly on changes between waves (Kalton et al., 1989). No research has been conducted on this issue.
The evidence currently available on the effects of accepting proxy reporting does not clearly indicate the advisability of changing SIPP respondent rules regarding proxy or selfrespondents. There are, therefore, no plans at present to change these rules.
Formal interviewer variance studies carried out in connection with U.S. population censuses have shown that when the data are collected by interviewers (as opposed to the use of self-administered questionnaires), interviewer variance can contribute a substantial proportion of the total mean square error for small area counts and sample estimates (see, for example, U.S. Bureau of the Census, 1968). There have been fewer studies of the contribution of interviewer variance to error in household surveys, but a study carried out in connection with the National Crime Survey in eight cities demonstrated that it can be an important source of error for some variables (Bailey et al., 1978). An analysis by Tepping and Boland (1972) of interviewer variance in the Current Population Survey gave estimates of 0.50 or greater for the ratio of interviewer (correlated response) variance to sampling variance for several items included in the survey.
There have been no formal interviewer variance studies in connection with SIPP; however, the findings cited from other surveys and from censuses suggest that interviewer variance could be a significant source of error for some items. The contribution of interviewer variance to total error depends on the size of interviewer assignments. The larger the assignments, the greater the effect on total error. In SIPP the average interviewer assignment is about 14 per month, or 56 households per wave (each interviewed three times during a 12-month period), as compared to approximately 21 households per month or 125 households per 6-month interviewing cycle in the Central Cities Sample for the National Crime and Victimization Survey. Monthly interviewer assignments in the Current Population Survey presently average between 25 and 30 households.
The continuing SIPP reinterview program (see Section 4.4) provides some information on measurement errors attributable to interviewers. Selected questions from five content groups: job, income, benefits, health insurance, and assets are asked again in the reinterviews. The original and reinterview responses are compared and differences reconciled. When differences are confirmed, the reinterviewer judges whether or not the original interviewer was responsible for the difference in responses. Pennie (1988), summarized results of this process for 1984 through 1987. The rates presented are based on the number of content groups in each reinterview with one or more confirmed differences. Overall difference rates for the five content groups ranged from a high of 3.1 percent in 1984 to 2.4 percent in 1987. The proportion of confirmed differences attributed to interviewers stayed within a fairly narrow range and averaged 43.3 percent for the 4-year period. Although the manner in which the rates were calculated and presented does not permit precise comparisons between content groups, it appears that the health insurance and asset groups have relatively high difference rates as compared to the other three groups. The health insurance questions were modified after the 1984 panel, but the difference rates for that content group remained relatively high.
Reinterviewers also check the original interviewers control card listing of persons present in the household and usual residents of the household who were temporarily absent at the time of the interview. The gross errors detected in this household composition check have been consistently small, averaging 0.23 percent of persons listed in the initial interview over the 4-year period (Pennie, 1988). The average net error rate was -0.05 percent, representing a very small undercount in the initial interviews. However, results of demographic analyses and other types of evidence suggest that the proportion of persons missed in interviewed households may be considerably larger than is indicated by the reinterview results (see, for example, Hainer et al., 1988; Shapiro et al., 1993). The low figure just cited may simply mean that reinterviewers are not much more successful than initial interviewers in obtaining complete listings of household members. The Census Bureau is examining methods to modify the roster questions to improve coverage (see Section 3.5).
Standards have been established for each section of the interview that is checked in the reinterviews. Failure rates have been low; generally, less than 2 percent of the interviewers for each section checked. The usual response to failures is to discuss the errors found with the interviewers. Occasionally interviewers are reassigned or are dismissed or suspended.
Hill (1989) undertook a detailed analysis of reinterview data for waves 2 and 3 of the 1984 panel (1,559 cases), using a data file containing reinterview reports merged with data for the same respondents from the public use file. He found the highest overall discrepancy rates, 5 to 7 percent, for items relating to Medicare and health insurance coverage. For many items included in the reinterview, he found that a high proportion of the discrepancies were procedural, i.e., the question was asked by the interviewer and not by the reinterviewer, or vice versa. He attributed this finding to unnecessarily complex skip patterns in the relevant sections of the questionnaire.
Hill also examined the relationship between response inconsistencies and respondent characteristics, using a Poisson regression procedure. He concluded that "Data quality appears to be significantly lower for low income, black, and either very young or very old respondents." His general conclusion was that the SIPP reinterview results can be valuable in understanding nonsampling error issues, and he recommended some changes to make the reinterview program even more useful.
The ISDP included two methodological experiments which had a direct influence on the length and structure of the SIPP core modules (Olson, 1980; Kasprzyk, 1988). The first of these experiments compared the use of short and long forms to collect income data for each household member. With the short form, respondents were asked a series of direct questions about receipt of selected types of income. For all "yes" answers, questions on amounts were asked before proceeding to the next income type. The long form made extensive use of cues and probes in an attempt to help respondents recall types of income received by putting them in a broader context. Questions on amounts were not asked until all sources of income had been determined, and a longer set of questions was used to determine the amounts.
Several kinds of data were used in the analysis. Except for Social Security income, which was reported at a higher level on the long forms, the comparison of income reporting levels was inconclusive (Olson, 1980, pp. 6-46). In debriefing sessions, the consensus of interviewers was that the short form was easier for respondents, but that respondents thought more carefully about their answers when the long form was used (Olson, 1980, pp. 2-22). Not surprisingly, the long forms took longer to complete. The form ultimately adopted for SIPP was a variation of the long form used in the ISDP experiment.
The second experiment compared a household screening format with a person-by-person format for obtaining information on income recipiency and amounts. The household format was intended to reduce burden by asking a single household respondent, for each income type, whether any household members had received that type of income. For affirmative responses, the household members who had received that type of income were identified. Data on amounts were obtained only after completion of screening for all household members. The person-by-person format used a separate form for each adult household member. No significant differences in income recipiency rates and item nonresponse were observed for the two formats. Use of the person-by-person format added about 5 minutes per household to the length of the interview. The latter format, with moderate changes, was adopted for SIPP (Kasprzyk, 1988).
One of the SIPP data collection strategies is to encourage respondents to use financial records to aid response. In a debriefing of a sample of 1985 panel respondents in connection with reinterviews following wave 8 interviews, 56 percent of the respondents said they had referred to their W-2 forms and completed tax forms in the final (wave 8) interview. In response to a question about the use of records such as pay stubs and bank statements in all interviews, 61 percent said they routinely referred to these kinds of records (Meier, 1988). Anecdotal information from interviewer debriefings suggests that the frequency of record use is significantly affected by the extent to which individual interviewers encourage it (Chapman, 1988).
The usage of records, as reported by interviewers, is somewhat lower. For persons responding to questions in the tax and annual roundup module, interviewers indicate in check boxes on the questionnaire which ones refer to W-2 forms and tax forms during the interview. Table 4.4 shows the results for the 1984 and 1985 panels (Altman, 1989) which indicate that records were used by around 35-45 percent of the respondents.
Table 4.4 Rate of record usage reported by interviewers
| Wave and panel | Percent referring to: |
|
| W-2 Form | Tax form worksheet | |
Wave 6, 1984 Panel Wave 9, 1984 Panel Wave 5, 1985 Panel Wave 8, 1985 Panel |
34.9 43.2 35.6 36.1 |
34.0 41.6 35.5 36.3 |
Source: Altman (1989).
Huggins (1994) summarized the studies which compared the rate of record usage between personal visit waves and telephone visit waves, and concluded that there is no evidence to suggest that the rate of record usage is affected by telephone interviewing. Jones (1993) compared the percentage of records used for all income sources reported on the Income Source Summary sheet for personal visit waves (wave 2 of 1991 panel and wave 5 of 1990 panel) and for telephone interview waves (wave 7 of 1990 panel and wave 3 of 1992 panel). She found that the average record usage rate is about 20 percent and there is no significant difference by interviewing mode. The percentages reported for the 1990-1993 panels, however, are lower, at about 20 percent. James (1992) found similar results comparing the rates for wave 4 of 1990 (personal visit wave) and wave 4 of the 1991 panel (telephone interview wave).
Guarino (1996) studied the rate of record usage by respondents in the 1990-1993 panel. She found that the rate of respondents using at least one type of record is about 20 percent for each panel. Within each panel, she observed that record usage appears to increase as the waves progress. She attributed this to a possible correlation between record usage and attrition. Those respondents who refuse to use records in the initial waves are more likely to drop out of the survey.
The Census Bureau has examined different methods to improve the data collection procedure and data quality. This section summarizes some of the efforts in this area.
SIPP Quality Profile (NEW !) Page
SIPP Main Page
Census Home Page