# Data Sources

This page explains where the major fields on California School Trends come from.

The short version:

- Most academic performance fields come from [**CAASPP** research files](https://caaspp-elpac.ets.org/caaspp/ResearchFileListSB?ps=true&lstTestYear=2025&lstTestType=B&lstCounty=00&lstDistrict=00000&lstFocus=a).
- Most school and district identity fields come from the [**California Department of Education (CDE) School Directory**](https://www.cde.ca.gov/schooldirectory).
- Whole-population low-income and English learner counts come from [**CALPADS student-group data**](https://www.cde.ca.gov/ds/sd/sd/filescupc.asp).
- Board and election fields come from [**CEDA**](https://scholars.csus.edu/esploro/outputs/dataset/California-Elections-Data-Archive-CEDA/99257830890201671), county-certified election sources, and current district board pages.
- District boundary maps come from the [California school district areas GeoJSON published through California's GIS data portal](https://gis.data.ca.gov/datasets/CDEGIS::california-school-district-areas-2023-24.geojson?outSR=%7B%22latestWkid%22%3A4326%2C%22wkid%22%3A4326%7D).

## Core rule

We try to keep a clear separation between:

- **direct source fields** from public state or district datasets
- **derived fields** we calculate from those source rows

Examples of derived fields:

- 2-year trend deltas
- combined reading + math scores
- percentile and rank displays
- page-level summaries

## School page fields

### Identity and location

<table>
  <thead>
    <tr>
      <th>Field family</th>
      <th>Source</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td>School name, CDS code, district, city, grade span, school level, charter status</td>
      <td><strong>CDE School Directory</strong> public school files</td>
    </tr>
    <tr>
      <td>Street address, city, ZIP, latitude, longitude</td>
      <td><strong>CDE School Directory</strong> school files</td>
    </tr>
    <tr>
      <td>Map point shown on site</td>
      <td>CDE directory coordinates by default, with reviewed overrides from the site's map-location override layer when available</td>
    </tr>
  </tbody>
</table>

Notes:

- Some school map points are refined with reviewed overrides so the point better matches the campus or school POI.
- When schools share the same or nearly the same coordinates, the site may still apply a small display-only separation on maps.

### Academic performance

<table>
  <thead>
    <tr>
      <th>Field family</th>
      <th>Source</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td><code>readingScore</code>, <code>mathScore</code></td>
      <td><strong>CAASPP</strong> latest proficiency rates</td>
    </tr>
    <tr>
      <td><code>readingHistory</code>, <code>mathHistory</code></td>
      <td><strong>CAASPP</strong> historical proficiency rates</td>
    </tr>
    <tr>
      <td>Race and ethnicity performance breakdowns</td>
      <td><strong>CAASPP</strong> student-group rows</td>
    </tr>
    <tr>
      <td>Low-income subgroup proficiency charts</td>
      <td><strong>CAASPP</strong> economic-status subgroup rows</td>
    </tr>
    <tr>
      <td>English learner subgroup proficiency charts</td>
      <td><strong>CAASPP</strong> English-language-fluency subgroup rows</td>
    </tr>
    <tr>
      <td>Grade distribution charts</td>
      <td><strong>CAASPP</strong> grade-level score distributions</td>
    </tr>
  </tbody>
</table>

Derived from CAASPP:

- 2-year reading and math trend deltas
- statewide and district rank fields
- performance summaries shown near the top of school pages

### School context

<table>
  <thead>
    <tr>
      <th>Field family</th>
      <th>Source</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td>Student enrollment</td>
      <td><strong>CDE enrollment</strong> files</td>
    </tr>
    <tr>
      <td>Student / teacher ratio, teacher FTE</td>
      <td><strong>CDE staffing / student-teacher ratio</strong> files</td>
    </tr>
    <tr>
      <td>Graduation rate</td>
      <td><strong>CDE ACGR</strong> files</td>
    </tr>
    <tr>
      <td>FRPM percentage</td>
      <td><strong>CDE FRPM</strong> files</td>
    </tr>
    <tr>
      <td>Per-student spending</td>
      <td><strong>California education finance files</strong>, summarized into per-student spending metrics for the site</td>
    </tr>
  </tbody>
</table>

### Student-group counts shown in “Other Stats”

<table>
  <thead>
    <tr>
      <th>Field family</th>
      <th>Source</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td>Low-income student count and percentage</td>
      <td><strong>CALPADS UPC</strong></td>
    </tr>
    <tr>
      <td>English learner count and percentage</td>
      <td><strong>CALPADS UPC</strong></td>
    </tr>
  </tbody>
</table>

Important:

- These are **whole-population demographic counts**, not CAASPP-tested subgroup counts.
- On the site, “Low-Income Students” and “English Learners” in the `Other Stats` table are based on **California’s CALPADS student-group data**.

## District page fields

### Identity and coverage

<table>
  <thead>
    <tr>
      <th>Field family</th>
      <th>Source</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td>District name, CDS code, district type, county</td>
      <td><strong>CDE School Directory</strong> district files</td>
    </tr>
    <tr>
      <td>District center point</td>
      <td>Public district boundary and map geometry data, reduced to a center point for display</td>
    </tr>
    <tr>
      <td>District school list</td>
      <td><strong>CDE School Directory</strong> school files, grouped under each district</td>
    </tr>
  </tbody>
</table>

### Academic performance

<table>
  <thead>
    <tr>
      <th>Field family</th>
      <th>Source</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td>District reading and math scores</td>
      <td><strong>CAASPP</strong> district-level proficiency rows when available; otherwise weighted school fallback</td>
    </tr>
    <tr>
      <td>District reading and math history</td>
      <td><strong>CAASPP</strong> district-level history or weighted school fallback</td>
    </tr>
    <tr>
      <td>District subgroup charts</td>
      <td><strong>CAASPP</strong> district subgroup rows</td>
    </tr>
    <tr>
      <td>District proficiency tracker</td>
      <td><strong>CAASPP</strong> grade 3 reading rows</td>
    </tr>
  </tbody>
</table>

Derived fields:

- district 2-year trend deltas
- district statewide rank / percentile
- district reading rank
- district math rank

### District context

<table>
  <thead>
    <tr>
      <th>Field family</th>
      <th>Source</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td>Total students</td>
      <td>Built from current district metric rows / school totals</td>
    </tr>
    <tr>
      <td>Student / teacher ratio</td>
      <td>Derived from total students and teacher FTE</td>
    </tr>
    <tr>
      <td>Spend per student</td>
      <td><strong>California education finance files</strong>, summarized into per-student spending metrics for the site</td>
    </tr>
    <tr>
      <td>Low-income student count and percentage</td>
      <td><strong>CALPADS UPC</strong></td>
    </tr>
    <tr>
      <td>English learner count and percentage</td>
      <td><strong>CALPADS UPC</strong></td>
    </tr>
  </tbody>
</table>

## Board and election fields

Board data comes from more than one source family because no single statewide file cleanly covers every district’s current seated board.

### Historical election context

<table>
  <thead>
    <tr>
      <th>Field family</th>
      <th>Source</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td>Historical board election candidates and results</td>
      <td><strong>California Elections Data Archive (CEDA)</strong></td>
    </tr>
    <tr>
      <td>County-certified board result enrichment for supported counties</td>
      <td>County election sources, including Contra Costa and San Francisco</td>
    </tr>
    <tr>
      <td>Election-year metadata such as elected year, term context, margin, appointed/incumbent hints</td>
      <td>Derived from normalized election tables built from those public result sources</td>
    </tr>
  </tbody>
</table>

### Current board roster fields

<table>
  <thead>
    <tr>
      <th>Field family</th>
      <th>Source</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td>Current member names, roles, emails, board page URLs</td>
      <td>Public district board pages and district trustee profile pages</td>
    </tr>
    <tr>
      <td>District board website</td>
      <td>District web profile / board page discovery pipeline</td>
    </tr>
  </tbody>
</table>

Notes:

- Board rosters are not inferred from election data alone.
- When the site has a current board roster with emails, that usually comes from the district’s published board pages or trustee profile pages.
- Current board pages can override or refine what historical election data implies.

## Search, rankings, and research pages

These pages mostly reuse the same underlying fields described above.

### Search

<table>
  <thead>
    <tr>
      <th>Page</th>
      <th>Source</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td>School search</td>
      <td>Site school index built from normalized school payloads</td>
    </tr>
    <tr>
      <td>District search</td>
      <td>Site district index built from normalized district payloads</td>
    </tr>
  </tbody>
</table>

### Research pages

<table>
  <thead>
    <tr>
      <th>Research page family</th>
      <th>Primary source fields</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td>Spending pages</td>
      <td>District finance fields + district reading/math fields</td>
    </tr>
    <tr>
      <td>Student-teacher ratio pages</td>
      <td>Staffing ratio fields + district reading/math fields</td>
    </tr>
    <tr>
      <td>Performance distribution pages</td>
      <td>School and district reading/math proficiency fields</td>
    </tr>
    <tr>
      <td>Low-income leader pages</td>
      <td>CAASPP low-income subgroup performance fields</td>
    </tr>
    <tr>
      <td>English learner leader pages</td>
      <td>CAASPP English learner subgroup performance fields</td>
    </tr>
    <tr>
      <td>High low-income-share school pages</td>
      <td>CALPADS UPC low-income share + overall school reading/math proficiency</td>
    </tr>
  </tbody>
</table>

## Map and boundary data

<table>
  <thead>
    <tr>
      <th>Field family</th>
      <th>Source</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td>District polygons / boundaries</td>
      <td><a href="https://gis.data.ca.gov/datasets/CDEGIS::california-school-district-areas-2023-24.geojson?outSR=%7B%22latestWkid%22%3A4326%2C%22wkid%22%3A4326%7D">California school district areas GeoJSON published through California's GIS data portal</a></td>
    </tr>
    <tr>
      <td>School point markers</td>
      <td><strong>CDE School Directory</strong> coordinates by default, with reviewed map-location overrides when needed</td>
    </tr>
  </tbody>
</table>

## Source systems and files

### Academic performance

- [**CAASPP / ELPAC research files**](https://caaspp-elpac.ets.org/caaspp/ResearchFileListSB?ps=true&lstTestYear=2025&lstTestType=B&lstCounty=00&lstDistrict=00000&lstFocus=a)
- [**CAASPP Student Groups codebook**](https://caaspp-elpac.ets.org/caaspp/researchfiles/StudentGroups.zip)

### School and district directory

- [**CDE School Directory** school files](https://www.cde.ca.gov/schooldirectory/report?rid=dl1&tp=txt)
- [**CDE School Directory** district files](https://www.cde.ca.gov/schooldirectory/report?rid=dl2&tp=txt)

### Enrollment

- [**CDE enrollment files**](https://www3.cde.ca.gov/demo-downloads/census/cdenroll2425.txt)

### Staffing and student-teacher ratio

- [**CDE staffing files**](https://www3.cde.ca.gov/demo-downloads/staff/strat2425.txt)

### Graduation

- [**CDE ACGR files**](https://www3.cde.ca.gov/demo-downloads/acgr/acgr25.txt)

### FRPM

- [**CDE FRPM files**](https://www.cde.ca.gov/ds/ad/documents/frpm2425.xlsx)

### CALPADS student-group counts

- [**CDE CALPADS UPC source files**](https://www.cde.ca.gov/ds/sd/sd/filescupc.asp)
- [**2024–25 CALPADS UPC workbook**](https://www.cde.ca.gov/ds/ad/documents/cupc2425-k12.xlsx)

### Election and board sources

- [**California Elections Data Archive (CEDA)**](https://scholars.csus.edu/esploro/outputs/dataset/California-Elections-Data-Archive-CEDA/99257830890201671)
- **District board pages**
  - current district trustee and board profile pages published by districts themselves

## Important caveats

### Demographic percentages may not use the same denominator as the page’s enrollment field

Some pages show:

- enrollment from one official state file
- low-income or English learner percentages from **CALPADS UPC**

Those official denominators can differ slightly even within the same school year. The counts and percentages shown for student groups come from the **same CALPADS row**.

### Some district scores use a fallback

When direct districtwide CAASPP rows are missing or suppressed, the site may use a weighted school-based fallback for district reading and math summaries.

That fallback is built year by year from the district's school-level CAASPP proficiency rows:

- each school's proficiency rate is weighted by that school's tested student count for the same subject and year
- those weighted values are summed across the district's schools
- the total is divided by the combined tested student count to produce a district-level proficiency estimate

The site then uses that same fallback history to derive the district's current reading and math scores, so the top-line score and the trend chart stay on the same basis.

This keeps the fallback tied to public CAASPP data, but it is still an estimate derived from school rows rather than a direct published districtwide CAASPP row.
