Schedule of Changes
Overview
In order to create PLDF3, a series of programs was run for each state's data and in them, the newkey was initally created from the last year each library reported, then sorted by newkey and year. The data were subsequently printed to a file and this file was examined for anomalies. In effect, a library's FSCSKEY in the last year of the library's reporting determines that library's newkey—with resolution of collisions discussed by methods described below. Next these programs were edited to make changes in the newkey as a result of that examination. The programs were run recusively and the files examined until all anomalies appeared to be resolve. In some case this process was repeated a number of times. When the printout looked correct, it was initally thought to send it to the State Data Coordinators for examination. This idea was not particularly effective because the lists were often long and daunting, and these days SDCs have more than enough to do. There were no replies to these emails although subsequently I found questions about the odd case here or there generally got a reply. As a general principle, after many year-to-year changes in the FSCSKEY, the lack of systematic assignment of keys has been addressed and anomalies each year are increasingly rare. And, as an additional general principle, the newkey assigned to each library is the FSCSKEY of the last year of the data for that library. Given that few changes in FSCSKEYs assigned to libraries have been changed, there have been few changes in newkeys recently. But, there are a few each year and they cause small adjustments here and there in the number of libraries reporting by state in earlier years. If you had a copy of the table below, you would find evidence of this effect. Such changes for FY 2005 are noted below.
In the case of the states listed below with asterisks, there are no changes in the FSCSKEY from year to year so the newkey is the same as the FSCSKEY. So far, in these states writing the program to create the state's PLDF3 file has generally been straightforward because there have only been a few cases of ambiguity and one case where untangling of the early years of the FSCSKEY was quite difficult. However, the very complexity of the task with nearly 165,000 cases has meant that each year—even this year—changes in the FSCSKEY for individual libraries missed in earlier years have been found and adjustments made in the newkey. The code that is copied into each of the various state "schedules" copied below has been changed in realively few cases for the FY 2005 data.
The most important variable is newkey
As we saw on the index page, the FSCSKEY is usually stable after about 1996 or 1997 so in those cases, the stable FSCSKEY is the newkey for that library. If there is ambiguity in the FSCSKEYs, the FSCSKEY from last year of the reported data—usually 2005— is that library's newkey and it is included in the List of Libraries. Otherwise, by definition, the FSCSKEY and newkey should be the same for the year listed here. In the List of Libraries the last year also supplies the ZIP, state, CITY, and LIBNAME. Hence, of the many forms of LIBNAME, CITY, and so forth, one sees, this latest one will be what appears in the List. If a LIBNAME changes when the 2006 data come out, then it will be changed in that updated List. One characteristic of the CITY and LIBNAME now is the use of upper case. I have been told it is a postal regulation that makes it easier for machines to read. It sure makes it hard for people to read them which might explain why so much of what shows up in my mailbox isn't mine.
The purpose of newkey is to be a unique number for any one library through all years of the data. The newkeys for libraries that die are not reused. When two libraries merge, they become a new library and get a different newkey. If they divorce, they get their old newkeys back but there will be years missing for that library during the years it was merged. In some cases, libraries died and the FSCSKEY was reassigned by NCES and using the last year of reporting resulted in two different libraries having the same newkey. These kinds of collisions have been resolved by creating a separate newkey for the library that closed or where collusions occur for any other reason starting with XX9975 and assigned sequentially with 'XX' standing for the two character state code.
span is a variable that indicates a characteristic of the data for each library. It has three possible values: A indicates the library reports data for all years of the data. Given that states started reporting in different years, A refers to reporting all possible years for that state so the years covered is a function of the first year reporting to FSCS. S libraries report some years. E is a special case of S where the library reports at least the end years of the span of years but are missing some years in the middle. A libraries would be useful in measuring behavior of all libraries through the years while A and E libraries would be useful in measuring changes from the first to last years of a span of years. There are 7,765 A libraries, 1,774 S libraries and 354 E libraries. Minnesota did not report data in 2001, as a result, all Minnesota libraries are coded with a span of S.
first year is the first year this library reports data to FSCS.
last year is the last year this library reports data. Last year is most often 2004 for the current dataset.
The changes documented in the Schedules are primarily to newkey.
It is a temptation to correct obvious errors that are noted in the printouts and not just the FSCSKEYs. Inspection of the files find clear typographic errors in the demographic variables used in the construction of PLDF3 and, too, there are not-so-obvious errors that the SDCs will be aware of. In addition, there are changes in formats used for the names of systems and libraries. Why not “correct” them? The temptation to correct all these errors must be resisted. The Hippocratic Oath of data is: First Do No Harm. There is no telling what mischief might result from an attempt to correct what is a digital text in an archival sense—these datasets constitute a text worthy of preservation. Creating a superset with new variables as with newkey allows the old variables to be preserved and the new “corrected” (we hope) variables to exist at the same time. Anyone using this edition may then make such changes as he or she fancies.
Those looking at the Schedules will note that there are cases where changes for CITY for 2002 were commented out. There are two examples where the SDCs agreed that the CITY was wrong in 2002 and it was obvious by inspection. Iowa had three examples, one: LAPORTE CITY was corrected to LA PORTE CITY. The changes were made so that the List of Libraries, which is drawn from the last years of the data, will be as accurate as possible to anyone using it and not ambiguous when checking the data file itself. As I was going over the data in 2003, I decided changing the CITY was not a good idea so I didn't do subsequently but left the code in as a reminder of the folly of such changes. So, these changes have been reversed so that the errors in those files are, again, as they were in the orginal NCES data.
The schedules are pasted from the SAS code for each state and are in ASCII for manipulation. The programs themselves are available to anyone who wants to see them.
There are a total number of 164,872 observations in PLDF3 through the FY 2005 data. More information on them can be found in the revision history of PLDF3.
There are currently no changes for California, the District of Columbia, Georgia, Guam, Hawaii, Louisiana, the Marianas, Maryland, Mississippi, Nevada, Ohio, or the Virgin Islands.
The changes in each file are in order by the FSCSKEY except for Colorado, Oregon, and Washington which are in order by the CITY. The FSCSKEY is often in order by CITY at least in the early years. In retrospect, I should not have rearranged the data by CITY but left them in the order they were in the data so that my coding could be more easily followed for anyone checking for errors.
How to Read the Schedules of Changes
The programs creating newkey from FSCSKEY first create a dummy newkey from the FSCSKEY for 2003 and then changes are made in the newkey in the program. The programs are written in SAS and the code changes are what appear in the Schedule of Changes. They are pasted from the programs without editorial changes. They should be readable by people who do not know SAS. Here is an example:
/* OTIS */
if year = 1987 and FSCSKEY = 'CO0071' then newkey = 'CO0091';
if year = 1988 and FSCSKEY = 'CO0102' then newkey = 'CO0091';
These are the changes for Otis which were used as an example from the List of Libraries. The “/*” and “*/” indicate comments and are used here as an easy means of finding lines of code related to different libraries. These comments, typically, have the CITY. The first line indicates that SAS is to look for any case where the year is 1987 and the FSCSKEY is CO0071. In those cases (and this is only one where those two conditions hold), change the newkey to CO0091 from the dummy CO0071 that was used to seed this variable. The last year of Otis was 1989, you will recall. And there was no other library that was using that FSCSKEY in a later year so it is used as the newkey. In 1988, Otis had the FSCSKEY of CO0102 and this, too, was changed to CO0091. The semicolon is a SAS end of command.
The Schedules of Changes are available as text files by state. Most of the state Schedules are in order by FSCSKEYs and years changed in the code—often in pairs, that is, in order by the sort of FSCSKEY, then year and often there were pairs of changes in different FSCSKEYs. This method gives the changes no apparent order but they are in order by the data. In a handful of cases, I sorted the libraries by CITY, thinking this would make things easier to find. I regret having done that because rather than making changes easier to find, it has made them harder to find when you do have the data. The most likely use of these files is for people to check my work and sorting by CITY cannot make this process easier.
There is a spreadsheet (OpenOffice, xls) giving a fuller list of the number of reporting libraries by state and year than what is given below.
FY 2005 Changes
There are only a few cases of changes in the FSCSKEYs in the FY 2005 data that affect the newkey. Most were where I had missed changes in annual FSCSKEYs in individual libraries. That is, one library was in more than one FSCSKEY over the years and I had not corrected the newkeys. There was one case of an apparent anomaly in the way one library was assigned an FSCSKEY. Given that the software that assigns FSCSKEYs has improved so much over the years, this case is profoundly odd. It is impossible but, nonetheless, it occurred. That case occurred with data from Oregon. See Oregon for the table that lists data for the Wagner Community Library. Otherwise the few changes are noted in the code included in the various state schedules with a comment indicating the changes were made for the FY 2005 data. See the page below for Illinois, Oregon, and Pennsylvania for cases where anomalies in assignment of FSCSKEYs required adjustments in the newkey and the underlying code required to produce PLDF3. Alabama had a minor spelling error in a library name corrected in FY 2005.
Links
Schedules of Changes
| state |
Number of Observations in PLDF3 |
Number of Libraries Ever Reporting to FSCS |
Number of Libraries in 2001 |
Number of Libraries in 2002 |
Number of Libraries in 2003 |
Number of Libraries in 2004 |
Number of Libraries in 2005 |
| Alabama |
3,509 |
224 |
207 |
207 |
208 |
208 |
207 |
| Alaska |
1,533 |
118 |
86 |
85 |
85 |
88 |
89 |
| Arizona |
1,036 |
107 |
35 |
35 |
90 |
91 |
86 |
| Arkansas |
720 |
54 |
43 |
47 |
47 |
48 |
48 |
| California * |
3,123 |
181 |
179 |
179 |
179 |
179 |
179 |
| Colorado |
2,243 |
144 |
116 |
115 |
115 |
115 |
115 |
| Connecticut |
3,498 |
196 |
194 |
194 |
194 |
194 |
194 |
| Delaware |
512 |
37 |
37 |
21 |
21 |
21 |
21 |
| District of Columbia * |
18 |
1 |
1 |
1 |
1 |
1 |
1 |
| Florida |
1,724 |
158 |
72 |
72 |
69 |
70 |
78 |
| Georgia * |
946 |
58 |
57 |
58 |
58 |
58 |
58 |
| Guam * |
5 |
1 |
1 |
0 |
1 |
0 |
0 |
| Hawaii * |
18 |
1 |
1 |
1 |
1 |
1 |
1 |
| Idaho |
2,021 |
116 |
106 |
106 |
104 |
104 |
104 |
| Illinois |
11,090 |
637 |
629 |
626 |
629 |
626 |
623 |
| Indiana |
4,528 |
242 |
239 |
239 |
238 |
239 |
239 |
| Iowa |
9,954 |
545 |
537 |
538 |
539 |
540 |
540 |
| Kansas |
5,464 |
333 |
321 |
323 |
325 |
325 |
325 |
| Kentucky |
2,084 |
116 |
116 |
116 |
116 |
116 |
116 |
| Louisiana * |
1,168 |
67 |
65 |
65 |
65 |
66 |
67 |
| Maine |
4,356 |
281 |
273 |
274 |
273 |
269 |
272 |
| Marianas * |
5 |
1 |
0 |
0 |
1 |
0 |
0 |
| Maryland * |
432 |
24 |
24 |
24 |
24 |
24 |
24 |
| Massachusetts |
6,641 |
378 |
371 |
370 |
370 |
370 |
370 |
| Michigan |
6,872 |
414 |
381 |
383 |
384 |
384 |
383 |
| Minnesota |
2,419 |
152 |
0** |
142 |
140 |
140 |
140 |
| Mississippi * |
861 |
52 |
49 |
49 |
49 |
49 |
50 |
| Missouri |
2,685 |
182 |
150 |
148 |
168 |
151 |
149 |
| Montana |
1,527 |
85 |
79 |
79 |
79 |
79 |
79 |
| Nebraska |
4,666 |
286 |
272 |
275 |
275 |
276 |
270 |
| Nevada * |
402 |
28 |
23 |
22 |
22 |
22 |
22 |
| New Hampshire |
4,349 |
240 |
229 |
230 |
230 |
231 |
230 |
| New Jersey |
5,551 |
321 |
309 |
309 |
308 |
306 |
306 |
| New Mexico |
1,365 |
105 |
80 |
89 |
90 |
92 |
87 |
| New York |
14,290 |
788 |
750 |
751 |
752 |
753 |
754 |
| North Carolina |
1,499 |
106 |
76 |
76 |
75 |
75 |
75 |
| North Dakota |
1,510 |
98 |
82 |
82 |
83 |
83 |
83 |
| Ohio * |
4,751 |
251 |
250 |
250 |
250 |
250 |
251 |
| Oklahoma |
2,109 |
121 |
115 |
110 |
112 |
112 |
113 |
| Oregon |
2,327 |
138 |
125 |
124 |
125 |
125 |
125 |
| Palau |
1 |
1 |
0 |
0 |
0 |
0 |
0 |
| Pennsylvania |
8,227 |
514 |
459 |
451 |
453 |
455 |
458 |
| Rhode Island |
892 |
51 |
48 |
48 |
48 |
48 |
49 |
| South Carolina |
770 |
44 |
41 |
41 |
42 |
42 |
42 |
| South Dakota |
2,118 |
141 |
126 |
125 |
125 |
125 |
124 |
| Tennessee |
2,602 |
200 |
184 |
184 |
184 |
184 |
186 |
| Texas |
9,669 |
585 |
540 |
557 |
552 |
555 |
553 |
| Utah |
1,328 |
72 |
70 |
72 |
72 |
72 |
71 |
| Vermont |
3,516 |
213 |
188 |
189 |
188 |
189 |
184 |
| Virgin Islands * |
4 |
1 |
1 |
1 |
1 |
0 |
0 |
| Virginia |
1,617 |
93 |
90 |
90 |
90 |
90 |
91 |
| Washington |
1,286 |
74 |
65 |
64 |
64 |
66 |
65 |
| West Virginia * |
1,751 |
100 |
97 |
97 |
97 |
97 |
97 |
| Wisconsin |
6,809 |
394 |
379 |
380 |
380 |
380 |
381 |
| Wyoming |
437 |
23 |
23 |
23 |
23 |
23 |
23 |
| TOTALS (ALL) |
164,874 |
9,893 |
8,991** |
9,137 |
9,214 |
9,207 |
9,198 |
* indicates states with no changes.
** Minnesota did not report in 2001.

December 27, 2007
Back to the Schedule of Changes
Back to PLDF3 main page
Back to NCES index
Back to the state spreadsheets
Return to NCLIS Homepage