Skip to content

Scripts to download and build panel data files for IPEDS.

License

Notifications You must be signed in to change notification settings

adamrossnelson/StataIPEDSAll

Repository files navigation

1. Project StataIPEDSAll Introduction

These files will download, build, and save to the local computer data from the US DOE's Integrated Postsecondary Education Data System. This project aims to be reverse compatible to Version 13. Use of version control. Also uses saveold. Intended as reverse compatible, but not tested. Notes about assumptions or limitations maintained in the do files.

For a related/similar repo built for R see btskinner/downloadipeds. If you might be interested in an old fashioned spreadsheet consider this very user friendly option from Jon Boeckenstedt.

Brief Demonstration Video Here.

I provide a YouTube overview of IPEDS:
Image of IPEDS Video

1.1. Table of Contents

1.2. IPEDS Update Cycles (Revised releases)

Occasionally IPEDS provides preliminary release data, and then later provides revised data. They publish information about their schedule. IPEDS conventions place "rv" in csv filenames to denote revised data. As I am able, I update these do files to reference revised data.

If I'm behind on an update send me a note. Or branch the project, revise on your own, and submit a pull request. For beginners, contact me and I'll help you learn how. Updates logged in do file headers. An aggregated Testing & Development Log is available below.

More from IPEDS Twitter about the survey cycle.

More On Why It Makes Sense To Use Code When Collecting These Data
Image of Data Automation Video

1.3. Additional Background

IPEDS provides an origins report which documents the history of survey survey items.

A recent article (Miller & Shedd, 2019) provides a helpful overview of the history of these data. Miller, E. S., & Shedd, J. M. (2019). The history and evolution of IPEDS. New Directions for Institutional Research, 2019(181), 47–58. Retrieved from: https://doi.org/10.1002/ir.20297.

Additional reading about the landscape of higher education data from IHEP.

1.4. Supplemental Items

Located at adamrossnelson/StataIPEDSAll/tree/master/supp are useful supplemental routines that will enable quicker analysis of IPEDS data.

  • abbrev.ado Quickly abbreviates college & university names. For example, "echnology" -> "Tech." or "Academy" -> "Acad."
  • stdstring.ado Quickly standardizes a string variable to be the same across all units through a panel.
  • standardgroups.do Quickly writes group indicators. (e.g. inB10, etc.)

Formerly in Supplemental Items was drill_down_picker_builder.do which prepares a spreadsheet that can populate a drill-down-pick list from IPEDS institutions. This item has migrated to: adamrossnelson/nces_inst_pick_list_builder.

2. Usage

Do File Name & Description Suggested Naming Convention Where YY is the last year of the survey file
IPEDSDirInfo02to18.do
Produces a panel data set that includes IPEDS directory information survey files.
IPEDSDirInfo02to18.log
IPEDSDirInfo02to18.dta
IPEDSInstChar02to17.do
Produces a panel data set that includes IPEDS institutional characteristics survey files. This series contains test score, admission, and other similar data points previously found in the institutional characteristics survey files. IPEDS introduced separate survey file for "Admissions and Test Scores" in 2014.
IPEDSInstChar02to17.log
IPEDSInstChar02to17.dta
IPEDS12MoEnrl02to18.do
Produces a panel data set that includes IPEDS 12 month enrollment survey files. This series contains 12-month unduplicated headcount. IPEDS frequently modified variable naming conventions in this series. Details related to variable naming conventions identified in do file.
IPEDS12MoEnrl02to18.log
IPEDS12MoEnrl02to18.dta
IPEDS12MoInstrAct02to16.do
Produces a panel data set that includes IPEDS 12 month instructional activity survey files. This series contains data on instructional activity measured in total credit and/or contact hours delivered by institutions during a 12-month period. The credit hour and contact hour activity data are used to derive 12-month full-time equivalent enrollments for both undergraduate and graduate levels.
IPEDS12MoInstrAct02to16.log
IPEDS12MoInstrAct02to16.dta
IPEDSGradRt02to16.do
Produces a panel data set that includes IPEDS graduation rates. This series contains the graduation rate status for the cohort of full-time, first-time degree/certificate-seeking undergraduates in both four year and two year institutions. This routine builds data for four-year institutions only, however.
IPEDSGrad02to16.log
IPEDSGrad02to16.dta
IPEDSFallEnrl02to16SrsA.do
Produces a panel data set that includes IPEDS fall enrollment rates. This series contains the fall enrollment rates status including race/ethnicity, gender, attendance status, and level of student.
IPEDSFallEnrl02to16SrsA.log
IPEDSFallEnrl02to16SrsA.dta
IPEDSFallEnrl02to16SrsB.do
Produces a panel data set that includes IPEDS fall enrollment rates. This series contains the fall enrollment rates status including age, gender, attendance status, and level of student. This routine builds data undergraduate level of study only, however.
IPEDSFallEnrl02to16SrsB.log
IPEDSFallEnrl02to16SrsB.dta

3. Run From Online

No installation required. Run these do files from the Stata’s command line. Code provided below.

Recommend using the naming conventions from the above table when prompted for a log file name and location. Note that close of this issue - removal of vestigial suffix will elimiate the use of the YYtoYY suffix on these do files.

do https://raw.githubusercontent.com/adamrossnelson/StataIPEDSAll/master/IPEDSDirInfo02to18.do
do https://raw.githubusercontent.com/adamrossnelson/StataIPEDSAll/master/IPEDSInstChar02to18.do
do https://raw.githubusercontent.com/adamrossnelson/StataIPEDSAll/master/IPEDS12MoEnrl02to18.do
do https://raw.githubusercontent.com/adamrossnelson/StataIPEDSAll/master/IPEDS12MoInstrAct02to17.do
do https://raw.githubusercontent.com/adamrossnelson/StataIPEDSAll/master/IPEDSGrad02to16.do
do https://raw.githubusercontent.com/adamrossnelson/StataIPEDSAll/master/IPEDSFallEnrl02to16SrsA.do
do https://raw.githubusercontent.com/adamrossnelson/StataIPEDSAll/master/IPEDSFallEnrl02to16SrsB.do

4. Contextual Note

Contextual note: The universe of higher education data is vast and complicated. This illustration from http://www.studentclearninghouse.org is a helpful illustration of the range and complexity.

Higher Education Data Illustration

Other similar data sources:

College-Insight from Institute for College Access & Success www.college-insigt.org

College Scorecard data - My Github Repo

College Scorecard data - College Scorecard Website

Integrated Postsecondary Education Data System - My Github Repo

Integrated Postsecondary Education Data System - IPEDS Website

Common Data Set - Wesbite

Common Data Set - Discussion List Archives

5. Testing And Development Log

Date Developer Description
01Apr2017 Adam Ross Nelson Initial build.
01Sep2017 Adam Ross Nelson GitHub rebuild.
03Jan2018 Adam Ross Nelson Added test & dev log.
11Jan2018 Naiya Patel Tested "Run from Online" command lines - Success.
09Feb2018 Adam Ross Nelson Added IPEDSGradRt02to16.do.
25Feb2018 Adam Ross Nelson Closed issue.
20Mar2018 Adam Ross Nelson Splified ReadMe.md usage section.
20Mar2018 Niya Patel Added FallEnrl02to16SrsA.do & FallEnrl02to16SrsA.do.
02Apr2018 Adam Ross Nelson Tested "Run from Online" command lines - Success.
03May2018 Adam Ross Nelson Test on Linux Ubuntu - Success.
03May2018 Adam Ross Nelson Closed issue.
17Jul2018 Adam Ross Nelson Closed issue.
15Jan2019 Adam Ross Nelson Updated to include 2017 data files.
26Jan2019 Adam Ross Nelson Added to Supp sub dir and related doc.
17Aug2019 Adam Ross Nelson Closed issue
15Oct2019 Adam Ross Nelson Began updates to include 2018 data.
26May2022 Adam Ross Nelson Added reference to Miller and Shedd, 2019.

About

Scripts to download and build panel data files for IPEDS.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published