In Illinois, at the end of every semester, each school is required to submit a set of data to the Office of I&I for transmission to the Illinois State Board of Education (ISBE). The state then uses this data to produce the annual State School Report Cards. The purpose of this project is to collect student and teacher data from KIPP Chicago’s different data systems and put it in a format that conforms to ISBE’s reporting requirements.
This document provides a detailed description of each data file and how to find it. The document also points out possible shortcomings and issues with 1) the code, and 2) the report submission process. There are also tips on how and when to begin preparing for the process.
CPS will send detailed documentation on what is required for the report to whoever is designated as the “data liaison”. You may need to collect information from the COO, Chief of Staff, Talent Department, and HR, so put in your requests early. For more information on required data, read section 3, “Data Sources and how to access”. Note: Some data requirements may change from year to year.
Store all data collected from outside KIPP Chicago’s database within a Google Cloud Storage bucket. For example, all collected data for the 19-20 midyear report was saved in the raw_data_storage bucket under ISBE_Student_Courses/19-20_files.
You can find previous instructions on the Midyear ISBE Reporting Process from CPS in Google Cloud Storage, within the raw_data_storage bucket under ISBE_Student_Courses/19-20_files/cps_report_instructions. These files will be downloaded to the documentation folder after running the 02-gcs_files.R script in the data folder.
Some files that are needed for this report can be accessed directly from our database, but other files need to be collected from different departments within KIPP Chicago. Below is a list of the files that were required for the 2020 Midyear report.
All data sources listed in the 01-bq_files.R script are stored in KIPP Chicago’s BigQuery database. To access these data sets you will need credentials for KIPP Chicago’s Google Cloud Platform. For more information, please check out the 01-bq_files.R script.
All data sources listed in the 02-gcs_files.R script were collected from different departments within KIPP Chicago as well as other outside sources. Below is a list of all files and where to find them. The specific files for 2020 listed in this section can be found in KIPP Chicago’s Google Cloud Storage in the raw_data_storage bucket under ISBE_Student_Courses/19-20_files.
File Description
File Name: course_local_number_state_ids.csv
This csv matches ISBE State Course Codes with local course IDs and local course titles for all KIPP Chicago classes. For more information, see the “ISBE State Course Codes” tab in the “Instructions - 2020 Course Assignment” file located in the documentation folder.
File Location
This file was created manually and should be downloaded from the Google Cloud Storage raw_data_storage bucket under ISBE_Student_Courses/19-20_files. As schools add or drop classes (or change course names), this file will need to be manually updated.
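One way to spot when the mapping file has fallen out of date is to compare the local course IDs currently in use against the ones in the csv. The sketch below uses base R with made-up data. The column names local_course_id and state_course_code and the placeholder codes are assumptions for illustration, not the actual layout of course_local_number_state_ids.csv.

```r
# Hypothetical contents of the mapping file (placeholder codes, not real ISBE codes).
course_map <- data.frame(
  local_course_id   = c("MATH5", "ELA5"),
  state_course_code = c("STATE-0001", "STATE-0002"),
  stringsAsFactors  = FALSE
)

# Hypothetical list of course sections currently scheduled at the schools.
sections <- data.frame(
  local_course_id  = c("MATH5", "ELA5", "ART5", "ART5"),
  stringsAsFactors = FALSE
)

# Local courses that have no row in the mapping file yet.
unmapped <- setdiff(unique(sections$local_course_id), course_map$local_course_id)
print(unmapped)
```

Any ID that shows up in unmapped would need a new row added to the csv by hand before the report is built.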
File Description
File Name: zenefits_teacher_data_isbe_midyear_reporting.csv
This csv contains a subset of HR information for everyone working at KIPP Chicago.
File Location
This is a custom report created from Zenefits (KIPP Chicago’s HR SIS). This information can be pulled by either someone in HR or the Chief of Staff.
File Description
File Name: 19_20_IEIN_numbers.csv
This csv contains identifying information for teachers. Information includes name, date of birth, IEIN number and serving school.
File Location
This file can be collected from the head of talent, COO or HR. Note: first ask both HR and the COO if they have an existing report with this information before putting in a request to talent. The teacherid column is required to match the information in the cc table and will need to be input by hand.
Note: The users table will need to be connected to the schoolstaff table in order to find the correct teacher id for each teacher. The teacher id column is the id column in the schoolstaff table (the users and schoolstaff tables are located in BigQuery).
users.user_dcid = schoolstaff.user_dcid THEN schoolstaff.id = cc.teacherid
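The join path above can be sketched in base R as follows. The toy data frames stand in for the BigQuery tables. Only user_dcid, id, and teacherid come from the note above; every other column and value is invented for illustration.

```r
# Toy stand-ins for the users, schoolstaff, and cc tables.
users <- data.frame(
  user_dcid  = c(101, 102),
  first_name = c("Ana", "Ben"),       # made-up names
  last_name  = c("Lopez", "Reed"),
  stringsAsFactors = FALSE
)
schoolstaff <- data.frame(
  id        = c(9001, 9002),          # the value that cc.teacherid refers to
  user_dcid = c(101, 102),
  stringsAsFactors = FALSE
)
cc <- data.frame(
  teacherid     = c(9001, 9001, 9002),
  course_number = c("MATH5", "SCI5", "ELA6"),  # hypothetical column
  stringsAsFactors = FALSE
)

# Step 1: users.user_dcid = schoolstaff.user_dcid
teacher_lookup <- merge(users, schoolstaff, by = "user_dcid")

# Rename schoolstaff.id so it lines up with cc.teacherid.
names(teacher_lookup)[names(teacher_lookup) == "id"] <- "teacherid"

# Step 2: schoolstaff.id = cc.teacherid
teacher_courses <- merge(teacher_lookup, cc, by = "teacherid")
print(teacher_courses)
```

The teacher_lookup data frame is the part that helps when filling in the teacherid column by hand, since it lists each teacher's name next to the id used in the cc table.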
File Description
File Name: kipp_staff_member_start_after_20190819.csv
Some teachers start after the first day of school. In order to provide the correct start date for those teachers we need a file that lists the start dates for every teacher who started after the first day of school.
File Location
This file can be collected from HR.
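As a rough illustration of how this file might be applied, the base R sketch below gives every teacher the first day of school as a default start date and then overrides it for anyone listed in the late-start file. The first day of school (2019-08-19) is inferred from the file name, and the column names teacherid and start_date are assumptions rather than the file's confirmed layout.

```r
# Assumed first day of school, inferred from the file name.
first_day <- as.Date("2019-08-19")

# Toy data: all teachers, plus the late starters provided by HR.
teachers <- data.frame(teacherid = c(9001, 9002, 9003))
late_starts <- data.frame(
  teacherid  = 9002,
  start_date = as.Date("2019-10-07")  # made-up date
)

# Left join so teachers without a late start keep a row (start_date is NA).
teachers <- merge(teachers, late_starts, by = "teacherid", all.x = TRUE)

# Everyone not in the late-start file began on the first day of school.
teachers$start_date[is.na(teachers$start_date)] <- first_day
print(teachers)
```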
File Description
File Name: Ex. 400044_ascend_current_students_aspen.csv
File Name: Ex. 400044_ascend_former_students_aspen.csv
These files contain the legal name, date of birth, state ID, CPS ID, and school for all KIPP Chicago students. This includes both current students and transferred students (Note: our report needs to include students who have transferred within the year).
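Because the current and former student files share a naming pattern, they can be stacked into one data frame. The base R sketch below writes two tiny example files to a temporary folder and reads them back. The columns cps_id and last_name are assumptions, and treating the leading number in the file name as the school ID is an inference from the example file names, not something the export guarantees.

```r
# Create two small example files in a temporary folder (made-up contents).
demo_dir <- file.path(tempdir(), "aspen_demo")
dir.create(demo_dir, showWarnings = FALSE)
write.csv(data.frame(cps_id = "50000001", last_name = "Lopez"),
          file.path(demo_dir, "400044_ascend_current_students_aspen.csv"),
          row.names = FALSE)
write.csv(data.frame(cps_id = "50000002", last_name = "Reed"),
          file.path(demo_dir, "400044_ascend_former_students_aspen.csv"),
          row.names = FALSE)

# Find every current/former student export in the folder.
files <- list.files(demo_dir,
                    pattern = "_(current|former)_students_aspen\\.csv$",
                    full.names = TRUE)

# Read one file and tag each row with details taken from the file name.
read_one <- function(path) {
  df <- read.csv(path, colClasses = "character", stringsAsFactors = FALSE)
  file_name <- basename(path)
  df$school_id <- sub("_.*$", "", file_name)  # leading number
  df$status <- sub("^.*_(current|former)_students_aspen\\.csv$", "\\1", file_name)
  df
}

# Stack all files into a single data frame.
all_students <- do.call(rbind, lapply(files, read_one))
print(all_students)
```

Reading every column as character keeps leading zeros in IDs intact, which matters when the IDs are later used as join keys.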
File Location
These files (8 in total) are downloaded from ASPEN. In order to log in to ASPEN you will need your CPS username and password (the same credentials that you use to log in to your CPS email address). Below is a step-by-step guide for finding this information in ASPEN.
File Description
File Name: cps_id_corrections.csv
List of students whose IDs differ between ASPEN and PowerSchool. For reporting purposes we need to use the CPS ID listed in ASPEN for each student, but in order to join data accurately we need to use the IDs listed in PowerSchool.
File Location
This file was created manually. First, I ran an anti_join in R between the student information files and the students table (joining on CPS Student ID). The output shows students in ASPEN with no match in PowerSchool. Next, I searched for each of these students in PowerSchool and noted the differing CPS Student ID. This process will need to be repeated for any new report.
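The matching step can be sketched as below. This version uses base R so it runs without extra packages; it keeps the same rows that dplyr's anti_join would return when joining on the ID column. All column names (cps_id, aspen_cps_id, powerschool_cps_id) and values are hypothetical.

```r
# Toy ASPEN student list and PowerSchool students table.
aspen <- data.frame(
  cps_id    = c("50000001", "50000002", "50000003"),
  last_name = c("Lopez", "Reed", "Shaw"),   # made-up names
  stringsAsFactors = FALSE
)
powerschool_students <- data.frame(
  cps_id = c("50000001", "50000003", "50000020"),
  stringsAsFactors = FALSE
)

# Anti-join: ASPEN rows whose CPS ID has no match in PowerSchool.
unmatched <- aspen[!(aspen$cps_id %in% powerschool_students$cps_id), ]
print(unmatched)

# After looking each student up by hand, record both IDs in a corrections table.
corrections <- data.frame(
  aspen_cps_id       = "50000002",
  powerschool_cps_id = "50000020",
  stringsAsFactors   = FALSE
)

# Build a join key that uses the PowerSchool ID wherever a correction exists.
aspen$join_id <- aspen$cps_id
idx <- match(aspen$cps_id, corrections$aspen_cps_id)
has_fix <- !is.na(idx)
aspen$join_id[has_fix] <- corrections$powerschool_cps_id[idx[has_fix]]
print(aspen)
```

Keeping the original cps_id column alongside join_id means both IDs stay available after the join.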
File Description
File Name: Ex. enrollment_academy_aspen_400146.csv
This file lists enrollment information for KIPP Chicago students. If a student starts school after the first day of class, then KIPP Chicago’s enrollment record (the record kept in PowerSchool) will likely be different from the official ASPEN record. The report needs to list the enrollment date from ASPEN in order to be accepted.
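A minimal base R sketch of this substitution is shown below, assuming both sources can be joined on a shared student ID. The column names student_id, entry_date, and aspen_entry_date are hypothetical, and the dates are made up.

```r
# Toy PowerSchool enrollment records.
powerschool <- data.frame(
  student_id = c("A", "B"),
  entry_date = as.Date(c("2019-08-19", "2019-08-19")),
  stringsAsFactors = FALSE
)

# Toy ASPEN enrollment export; student B started late according to ASPEN.
aspen_enrollment <- data.frame(
  student_id       = c("A", "B"),
  aspen_entry_date = as.Date(c("2019-08-19", "2019-09-03")),
  stringsAsFactors = FALSE
)

# Left join so every PowerSchool student keeps a row.
report <- merge(powerschool, aspen_enrollment, by = "student_id", all.x = TRUE)

# Flag students whose two records disagree, for review.
report$date_mismatch <- !is.na(report$aspen_entry_date) &
  report$entry_date != report$aspen_entry_date

# The report uses the ASPEN date; a missing ASPEN date stays NA for follow-up.
report$report_entry_date <- report$aspen_entry_date
print(report)
```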
File Location
The file is located in ASPEN. In order to log in to ASPEN you will need your CPS username and password (the same credentials that you use to log in to your CPS email address). Below is a step-by-step guide for finding this information in ASPEN.
File Description
File Name: cps_name_replacement_aspen.csv
List of names that were still incorrect the day before the submission was due. Because we are using names from ASPEN we should not need this file in future submissions.
1. Make sure the ProjectTemplate package is installed in R.
2. Install the required libraries (see the config/global.dcf file for all required libraries).
3. Gather the data files described above (refer to the raw_data_storage bucket under ISBE_Student_Courses/19-20_files if you have any doubts).
4. Open the src folder and run the 01-A_write_submission_files.R file. This will produce the report files in the required format for ISBE and write them to the output\final_reports folder. Note: if you’d like to see the final files in R, open the isbe_midyear_report_400146, isbe_midyear_report_400044, isbe_midyear_report_400163, and isbe_midyear_report_400180 data frames.
5. Open the src folder and run the 02-A_evaluate_cps_validation_period_errors.Rmd file. This file will produce data frames that show all unique errors by school, as well as data frames that list all unique name errors and date of birth errors.
6. Use the 03-A_produce_write_submission_files_with_error_fixes.R file to fix problems with the final reports that cannot be corrected in the original code (Note: use this file cautiously or not at all).

Below are links that lay out the exact validation information set by ISBE.