Skip to main content Accessibility help
×
  • Cited by 90

Book description

Secondary data play an increasingly important role in epidemiology and public health research and practice; examples of secondary data sources include national surveys such as the BRFSS and NHIS, claims data for the Medicare and Medicaid systems, and public vital statistics records. Although a wealth of secondary data is available, it is not always easy to locate and access appropriate data to address a research or policy question. This practical guide circumvents these difficulties by providing an introduction to secondary data and issues specific to its management and analysis, followed by an enumeration of major sources of secondary data in the United States. Entries for each data source include the principal focus of the data, years for which it is available, history and methodology of the data collection process, and information about how to access the data and supporting materials, including relevant details about file structure and format.

Refine List

Actions for selected content:

Select all | Deselect all
  • View selected items
  • Export citations
  • Download PDF (zip)
  • Save to Kindle
  • Save to Dropbox
  • Save to Google Drive

Save Search

You can save your searches here and later view and run them again in "My saved searches".

Please provide a title, maximum of 40 characters.
×

Contents

Bibliography
Chapter 1. An Introduction to Secondary Data Analysis
Black, N. 2003. Secondary use of personal data for health and health services research: Why identifiable data are essential. Journal of Health Services Research & Policy 8 CrossRef | Google Scholar | PubMed(Suppl 1):36–40.
Bulmer, M I A, Sturgis, P J, Allum, N. 2006. The secondary analysis of survey data. Thousand Oaks, CA Google Scholar: Sage.
Ciol, M A, Hoffman, J M, Dudgeon, B J, Shumway-Cook, A, Yorkston, K M, Chan, L. 2006. Understanding the use of weights in the analysis of data from multistage surveys. Archives of Physical Medicine and Rehabilitation 87 CrossRef | Google Scholar | PubMed(2):299–303.
Vaus, D. 2002. Social surveys. Thousand Oaks, CA Google Scholar: Sage.
Elder, G H, Pavalko, E K, Clipp, E C. 1993. Working with archival data: Studying lives. Thousand Oaks, CA CrossRef | Google Scholar: Sage.
Heaton, J. 2004. Reworking qualitative data. Thousand Oaks, CA CrossRef | Google Scholar: Sage.
Hinds, P, Vogel, R, Clarke-Steffen, L. 1997. The possibilities and pitfalls of doing a secondary analysis of a qualitative data set. Qualitative Health Research 7 CrossRef | Google Scholar(3):408–424.
Huston, P, Naylor, C D. 1996. Health services research: Reporting on studies using secondary data. Canadian Medical Association Journal 155 Google Scholar | PubMed(12):1697–1702.
James, J, Sorensen, A. 2000. Archiving longitudinal data for future research: Why qualitative data add to a study's usefulness. Forum: Qualitative Social Research 1 Google Scholar(3). Available online at: http://qualitative-research.net/fqs/fqs-eng.htm. Accessed July 6, 2006.
Jiang, Y, Alastair, J S, Wild, C J. 2006. Secondary analysis of case-control data. Statistics in Medicine 25 CrossRef | Google Scholar | PubMed:1323–1339.
Lee, A J, McMurchy, L, Scott, A J. 1997. Re-using data from case-control studies. Statistics in Medicine 16 CrossRef | Google Scholar | PubMed:1377–1389.
Lowrance, W W. 2003. Learning from experience: Privacy and the secondary use of data in health research. Journal of Biolaw and Business 6 Google Scholar | PubMed(4):30–60.
Kiekolt, K J, Nathan, L E. 1985. Secondary analysis of survey data. Thousand Oaks, CA CrossRef | Google Scholar: Sage.
Pearl, J. 2000. Causality. New York Google Scholar: Cambridge University Press.
Phillips, C V, Goodman, K J. 2006. Causal criteria and counterfactuals: Nothing more (or less) than scientific common sense. Emerging Themes in Epidemiology 3 CrossRef | Google Scholar | PubMed:5. Available online at: http://www.ete-online.com/content/3/1/5. Accessed September 20, 2006.
Rothman, K J, Greenland, S. 1998. Modern epidemiology. 2nd ed. Philadelphia Google Scholar: Lippincott.
Sieber, J E, ed. 1991. Sharing social science data: Advantages and challenges. Thousand Oaks, CA CrossRef | Google Scholar: Sage.
Stewart, E, Kamins, M A. 1992. Secondary research: Information sources and methods. 2nd ed. Thousand Oaks, CA Google Scholar: Sage.
Tomlinson-Keasey C A. 1993. Opportunities and challenges posed by archival data sets. In: Funder, D C, Parke, R D, Tomlinson-Keasey, C A, Widaman, L, eds. Studying lives through time: Personality and development. Washington, DC CrossRef | Google Scholar: American Psychological Association, pp. 65–92.
Yeager K R, Roberts A R. 2006. Mental illness, substance dependence, and suicidality: Secondary data analysis. In: Roberts, A R, Yeager, K R, eds. Foundations of evidence-based social work practice. New York Google Scholar: Oxford, pp. 314–322.
Chapter 2. Health Services Utilization Data
Bartlett, D L, Ezzati-Rice, T M, Stokley, S, Zhao, Z. 2001. Comparison of NIS and NHIS/NIPRCS vaccination coverage estimates. American Journal of Preventive Medicine 20 CrossRef | Google Scholar | PubMed(4 Suppl):25–27.
Bradley, C J, Neumark, D, Oberst, K, Luo, Z, Brennan, S, Schenk, M. 2005. Combining registry, primary, and secondary data sources to identify the impact of cancer on labor market outcomes. Medical Decision Making 25 CrossRef | Google Scholar | PubMed(5):534–547.
Gilchrist, V J, Stange, K C, Flocke, S A, McCord, G, Bourguet, C C. 2004. A comparison of the National Ambulatory Medical Care Survey (NAMCS) measurement approach with direct observation of outpatient visits. Medical Care 42 CrossRef | Google Scholar | PubMed(3):276–280.
Hing, E, Gousen, S, Shimizu, I, Burt, C. 2003. Guide to using masked design variables to estimate standard errors in public use files of the National Ambulatory Medical Care Survey and the National Hospital Ambulatory Medical Care Survey. Inquiry 40 CrossRef | Google Scholar | PubMed(4):401–415.
Luo, Y H, Zack, M. 2005. Computations of confidence intervals for estimates in the United States National Hospital Discharge Survey, 1979–2000. Preventing Chronic Disease 2 Google Scholar | PubMed(3):A06. Epub 2005 Jun 15.
Merrill, R M, Dearden, K A. 2004. How representative are the surveillance, epidemiology, and end results (SEER) program cancer data of the United States? Cancer Causes & Control 15 CrossRef | Google Scholar | PubMed(10):1027–1034.
Smith, P J, Battaglia, M P, Huggins, V J, Hoaglin, D C, Roden, A, Khare, M, Ezzati-Rice, T A, Wright, R A. 2001. Overview of the sampling design and statistical methods used in the National Immunization Survey. American Journal of Preventive Medicine 20 CrossRef | Google Scholar | PubMed(4 Suppl):17–24.
Steiner, C, Elixhauser, A, Schnaier, J. 2002. The Healthcare Cost and Utilization Project: An overview. Effective Clinical Practice 5 Google Scholar | PubMed(3):143–151.
Zell, E R, Ezzati-Rice, T M, Battaglia, M P, Wright, R A. 2000. National Immunization Survey: The methodology of a vaccination surveillance system. Public Health Reports 115 CrossRef | Google Scholar | PubMed(1):65–77.
Chapter 3. Health Behaviors and Risk Factors Data
Andresen, E M, Catlin, T K, Wyrwich, K W, Jackson-Thompson, J. 2003. Retest reliability of surveillance questions on health related quality of life. Journal of Epidemiology and Community Health 57 CrossRef | Google Scholar | PubMed(5):339–343.
Bowlin, S J, Morril, B D, Nafziger, A N, Jenkins, P L, Lewis, C, Pearson, T A. 1993. Validity of cardiovascular disease risk factors assessed by telephone survey: The Behavioral Risk Factor Survey. Journal of Clinical Epidemiology 46 CrossRef | Google Scholar | PubMed(6):561–571.
Brener, N D, Kann, L, Kinchen, S A, Grunbaum, J A, Whalen, L, Eaton, D, Hawkins, J, Ross, J G. 2004. Methodology of the Youth Risk Behavior Surveillance System. MMWR Recommendations and Reports 53 Google Scholar | PubMed(RR-12):1–13.
Gentry, E M, Kalsbeek, W D, Hogelin, G C, Jones, J T, Gaines, K L, Forman, M R, Marks, J S, Trowbridge, F L. 1985. The Behavioral Risk Factor Surveys: II. Design, methods, and estimates from combined state data. American Journal of Preventive Medicine 1 CrossRef | Google Scholar | PubMed(6):9–14.
Johnston, L D, O'Malley, P M. 1997. The recanting of earlier reported drug use by young adults. NIDA Research Monograph 167 Google Scholar | PubMed:59–80.
O'Malley, P M, Bachman, J G, Johnston, L D. 1983. Reliability and consistency in self-reports of drug use. International Journal of the Addictions 18 CrossRef | Google Scholar | PubMed(6):805–824.
Nelson, D E, Holtzman, D, Bolen, J, Stanwyck, C A, Mack, K A. 2001. Reliability and validity of measures from the Behavioral Risk Factor Surveillance System (BRFSS). Social and Preventive Medicine 46 Google Scholar(Suppl 1):S3–S42.
Remington, P L, Smith, M Y, Williamson, D F, Anda, R F, Gentry, E M, Hogelin, G C. 1988. Design, characteristics, and usefulness of state-based behavioral risk factor surveillance 1981–1987. Public Health Reports 103 Google Scholar(4):366–375.
Chapter 4. Data on Multiple Health Topics
Fronstin, P. 2000. Counting the uninsured: A comparison of national surveys. Employee Benefit Research Institute Issue Brief 225 Google Scholar:1–19.
Himes, J H, Faricy, A. 2001. Validity and reliability of self-reported stature and weight of US adolescents. American Journal of Human Biology 13 CrossRef | Google Scholar | PubMed(2):255–260.
James, M K, Miller, M E, Anderson, R T, Worley, A S, Longino, C F Jr. 1997. Benefits of linkage to the National Death Index in the Longitudinal Study of Aging. Journal of Aging and Health 9 CrossRef | Google Scholar(3):298–315.
Korenman, S, Goldman, N, Fu, H. 1997. Misclassification bias in estimates of bereavement effects. American Journal of Epidemiology 145 CrossRef | Google Scholar | PubMed(11):995–1002.
Kuczmarski, M F, Kuczmarski, R J, Najjar, M. 2001. Effects of age on validity of self-reported height, weight, and body mass index: Findings from the Third National Health and Nutrition Examination Survey, 1988–1994. Journal of the American Dietetic Association 101 CrossRef | Google Scholar | PubMed(1):28–34.
Vargas, C M, Burt, V L, Gillum, R F, Pamuk, E R. 1997. Validity of self-reported hypertension in the National Health and Nutrition Examination Survey III, 1988–1991. Preventive Medicine 26 CrossRef | Google Scholar | PubMed(5 Pt 1):678–685.
Yabroff, K R, Gordis, L. 2003. Assessment of a national health interview survey-based method of measuring community socioeconomic status. Annals of Epidemiology 13 CrossRef | Google Scholar | PubMed(10):721–726.
Chapter 5. Fertility and Mortality Data
Adams, M. 2001. Validity of birth certificate data for the outcome of the previous pregnancy, Georgia, 1980–1995. American Journal of Epidemiology 154 CrossRef | Google Scholar | PubMed(10):883–888.
Baumeister, L, Marchi, K, Pearl, M, Williams, R, Braveman, P. 2000. The validity of information on “race” and “Hispanic ethnicity” in California birth certificate data. Health Services Research 35 Google Scholar(4):869–883.
Campbell, A A, Mosher, W D. 2000. A history of the measurement of unintended pregnancies and births. Maternal and Child Health Journal 4 CrossRef | Google Scholar | PubMed(3):163–169.
Cowper, D C, Kubal, J D, Maynard, C, Hynes, D M. 2002. A primer and comparative review of major US mortality databases. Annals of Epidemiology 12 CrossRef | Google Scholar | PubMed(7):462–468.
DiGiuseppe, D L, Aron, D C, Ranbom, L, Harper, D L, Rosenthal, G E. 2002. Reliability of birth certificate data: A multi-hospital comparison to medical records information. Maternal and Child Health Journal 6 CrossRef | Google Scholar | PubMed(3):169–179.
Johansson, L A, Westerling, R, Rosenberg, H M. 2006. Methodology of studies evaluating death certificate accuracy were flawed. Journal of Clinical Epidemiology 59 CrossRef | Google Scholar | PubMed(2):125–131.
Kaufmann, R B, Morris, L, Spitz, A M. 1997. Comparison of two question sequences for assessing pregnancy intentions. American Journal of Epidemiology 145 CrossRef | Google Scholar | PubMed(9):810–816.
Klerman, L V. 2000. The intendedness of pregnancy: A concept in transition. Maternal and Child Health Journal 4 CrossRef | Google Scholar | PubMed(3):155–162.
Lydon-Rochelle, M T, Cardenas, V, Nelson, J L, Tomashek, K M, Mueller, B A, Easterling, T R. 2005. Validity of maternal and perinatal risk factors reported on fetal death certificates. American Journal of Public Health 95 CrossRef | Google Scholar | PubMed(11):1948–1951.
Lydon-Rochelle, M T, Holt, V L, Nelson, J C, Cardenas, V, Gardella, C, Easterling, T R, Callaghan, W M. 2005. Accuracy of reporting maternal in-hospital diagnoses and intrapartum procedures in Washington State linked birth records. Paediatric and Perinatal Epidemiology 19 CrossRef | Google Scholar | PubMed(6):460–471.
Martin, J A, Hoyert, D L. 2002. The national fetal death file. Seminars in Perinatology 26 CrossRef | Google Scholar | PubMed(1):3–11.
Mosher, W D. 1998. Design and operation of the 1995 National Survey of Family Growth. Family Planning Perspectives 30 CrossRef | Google Scholar | PubMed(1):43–46.
Northam, S, Knapp, T R. 2006. The reliability and validity of birth certificates. Journal of Obstetric, Gynecologic and Neonatal Nursing 35 CrossRef | Google Scholar | PubMed(1):3–12.
Rothwell, C J. 2004. Reengineering vital registration and statistics systems for the United States. Preventing Chronic Disease 1 Google Scholar | PubMed(4):A03. Epub 2004 Sep 15.
Shulman, H B, Gilbert, B C, Msphbrenda, C G, Lansky, A. 2006. The Pregnancy Risk Assessment Monitoring System (PRAMS): Current methods and evaluation of 2001 response rates. Public Health Reports 121 CrossRef | Google Scholar | PubMed(1):74–83.
Chapter 6. Medicare and Medicaid Data
Cooper, G S, Yuan, Z, Stange, K C, Amini, S B, Dennis, L K, Rimm, A A. 1999. The utility of Medicare claims data for measuring cancer stage. Medical Care 37 CrossRef | Google Scholar | PubMed(7):706–711.
Chattopadhyay, A, Bindman, A B. 2005. Accuracy of Medicaid payer coding in hospital patient discharge data: Implications for Medicaid policy evaluation. Medical Care 43 CrossRef | Google Scholar | PubMed(6):586–591.
Doctor, J N, Chan, L, MacLehose, R F, Patrick, D L. 2000. Weighted health status in the Medicare population: Development of the Weighted Health Index for the Medicare Current Beneficiary Survey (WHIMCBS). Journal of Outcome Measurement 4 Google Scholar(4):721–739.
Fisher, E S, Baron, J A, Malenka, D J, Barrett, J, Bubolz, T A. 1990. Overcoming potential pitfalls in the use of Medicare data for epidemiologic research. American Journal of Public Health 80 CrossRef | Google Scholar | PubMed(12):1487–1490.
Gyllstrom, M E, Jensen, J L, Vaughan, J N, Castellano, S E, Oswald, J W. 2002. Linking birth certificates with Medicaid data to enhance population health assessment: Methodological issues addressed. Journal of Public Health Management and Practice 8 CrossRef | Google Scholar | PubMed(4):38–44.
Hoffman, E D, Klees, B S, Curtis, C A. 2002. Overview of the Medicare and Medicaid programs. Health Care Financing Review Statistical Supplement Google Scholar | PubMed1–348.
Jones, N III, Jones, S L, Miller, N A. 2004. The Medicare Health Outcomes Survey program: Overview, context, and near-term prospects. Health and Quality of Life Outcomes 2 CrossRef | Google Scholar | PubMed:33. Available online at: http://www.hqlo.com/content/2/1/33. Accessed July 6, 2006.
Koroukian, S M, Cooper, G S, Alfred, Rimm A A. 2002. Ability of Medicaid claims data to identify incident cases of breast cancer in the Ohio Medicaid population. Health Services Research 38 CrossRef | Google Scholar(3):947–960.
Morgan, R O, Wei, I I, Virnig, B A. 2004. Improving identification of Hispanic males in Medicare: Use of surname matching. Medical Care 42 CrossRef | Google Scholar | PubMed(8):810–816.
Saydah, S H, Geiss, L S, Tierney, E, Benjamin, S M, Engelgau, M, Brancati, F. 2004. Review of the performance of methods to identify diabetes cases among vital statistics, administrative, and survey data. Annals of Epidemiology 14 CrossRef | Google Scholar | PubMed(7):507–516.
Wang, P S, Walker, A, Tsuang, M, Orav, E J, Levin, R, Avorn, J. 2000. Strategies for improving comorbidity measures based on Medicare and Medicaid claims data. Journal of Clinical Epidemiology 53 CrossRef | Google Scholar | PubMed(6):571–578.
Weiner, M, Stump, T E, Callahan, C M, Lewis, J N, McDonald, C J. 2003. A practical method of linking data from Medicare claims and a comprehensive electronic medical records system. International Journal of Medical Informatics 71 CrossRef | Google Scholar(1):57–69.
Wunsch, H, Harrison, D A, Rowan, K. 2005. Health services research in critical care using administrative data. Journal of Critical Care 20 CrossRef | Google Scholar | PubMed(3):264–269.
Zhan, C, Miller, M R. 2003. Administrative data based patient safety research: A critical review. Quality and Safety in Health Care 12 CrossRef | Google Scholar | PubMed(Suppl 2):58–63.
Chapter 7. Other Sources of Data
Barrett, R E. 1994. Using the 1990 U.S. Census for Research. Thousand Oaks, CA Google Scholar: Sage.
Best, A E. 1999. Secondary data bases and their use in outcomes research: A review of the Area Resource File and the Healthcare Cost and Utilization Project. Journal of Medical Systems 23 CrossRef | Google Scholar | PubMed(3):175–181.
Davis, J. 1991. The NORC General Social Survey: A user's guide. Thousand Oaks, CA Google Scholar: Sage.
A Guide to the Data Resources of the Henry A. Murray Research Center of Radcliffe College, A Center for the Study of Lives. 1996. Cambridge, MA Google Scholar: Radcliffe Institutes for Advanced Study.
Mays, V M, Ponce, N A, Washington, D L, Cochran, S D. 2003. Classification of race and ethnicity: Implications for public health. Annual Review of Public Health 24 CrossRef | Google Scholar | PubMed:83–110.
Heijden, P J M, Puijenbroek, E P, Buuren, S, Hofstede, J W. 2002. On the assessment of adverse drug reactions from spontaneous reporting systems: The influence of under-reporting on odds ratios. Statistics in Medicine 21 CrossRef | Google Scholar | PubMed:2027–2044.
Young, C H, Savola, K L, Phelps, E. 1991. Inventory of longitudinal studies in the social sciences. Thousand Oaks, CA Google Scholar: Sage.

Metrics

Usage data cannot currently be displayed.