Quick Links
Public Libraries in the US
Analysis of 2001 data
Analyzing Trends
Trends Results
Tables

Characteristics of the Distribution of Public Libraries in the US, 2001


For this analysis, the same five variables listed on the Public Libraries in the US main page: BKVOL (book and serial volumes held), POPU_LSA (population of the library's legal service area), TOTINCM (total library income), TOTSTAFF (total library staff), and TOTOPEXP (total library expenditures) have been analyzed using all 9,133 PLDF3 libraries reporting in 2001 as well as the "1990" dataset for those cases where span is 'A'. To make the presentation of results more complex, the results in the following tables for PLDF3 and the subset of "1990" data present measures based first on the raw values, then present these same data for the log transforms of those five variables of these two datasets. Transforming values by taking their logarithms is a respected technique used both make the data more symmetrical and to provide more tools for analysis. Distributions with logarithmic transforms are commonly examined with skewed distributions.

PLDF3 Libraries in 2001 (raw values)

Basic Descriptive Statistics, PLDF3 Dataset,
2001 Data
Variable N Skewness Kurtosis Mean 3rd Quartile Median
BKVOL 8,772 25.86 1,078 85,402 65,090 26,909
POPU_LSA 8,991 15.44 347 30,401 20,535 6,799
TOTINCM 8,759 23.02 901 $919,035 $559,129 $142,410
TOTOPEXP 8,764 23.79 976 $845,660 $507,679 $129,707
TOTSTAFF 8,791 20.10 722 14.80 10.55 3.38

The skewness and kurtosis of a Gaussian ("normal") distribution are 0. Skewness is a measure of the shape of the distribution and this distribution is positively skewed meaning there are many small values and fewer very large values. Kurtosis is a measure of how sharp the peak is in a distribution. With the high peak in the small values the distributions for these variables have large values for kurtosis. These statistics, among others, for these values indicate that the distributions are not "normal." In fact, I have never seen a Gaussian, "normal," distribution in any library data. The arithmetic mean and 3rd Quartile figures indicate another important characteristic of these libraries: the large libraries are so large that they pull the arithmetic mean above the 75th percentile figure. See the discussion of "heavy tails" on the main page for another way of looking at this characteristic.

"1990" Libraries for 2001, Span = 'A' (raw values)

These libraries have a similar structure as the PDLF3 group.

Basic Descriptive Statistics, "1990" Dataset
2001 data, Span = 'A'
Variable N Skewness Kurtosis Mean 3rd Quartile Median
BKVOL 8,143 25.19 1,018 89,742 68,500 28,776
POPU_LSA 8,282 15.22 335 31,751 21,675 7,343
TOTINCM 8,129 22.50 855 $965,580 $601,095 $159,317
TOTOPEXP 8,140 23.27 927 $888,962 $540,745 $144,098
TOTSTAFF 8,158 19.76 693 15.5 11.15 3.75

What are the characteristics of logarithmic tranformations of these data?

Basic Descriptive Statistics of Logarithmic Tranformations,
PLDF3 Dataset, 2001 Data
Variable N Skewness Kurtosis Mean 3rd Quartile Median
Log of BKVOL 8,772 0.62 1.05 10.35 11.08 10.20
Log of POPU_LSA 8,991 0.21 0.00 8.84 9.93 8.82
Log of TOTINCM 8,759 0.15 -0.01 11.96 13.23 11.87
Log of TOTOPEXP 8,762 0.16 0.05 11.88 13.14 11.77
Log of TOTSTAFF 8,685 0.31 0.04 1.33 2.37 1.25

Basic Descriptive Statistics of Logarithmic Tranformations,
"1990" Dataset 2001 data, Span = 'A'
Variable N Skewness Kurtosis Mean 3rd Quartile Median
Log of BKVOL 8,143 0.66 1.06 10.42 11.13 10.27
Log of POPU_LSA 8,282 0.24 0.01 8.92 9.98 8.90
Log of TOTINCM 8,129 0.16 0.01 12.06 13.30 11.98
Log of TOTOPEXP 8,139 0.19 0.03 11.97 13.20 11.88
Log of TOTSTAFF 8,094 0.29 0.05 1.39 2.42 1.34

In two of the variables the N changes between the raw and log values. The differences are libraries with 0 (zero) reported for those variables. The logarithm of 0 has no meaning.

The logarithm of a book or of dollar figures is not a notion one encounters every day but we are concerned not with the books or dollars in this kind of analysis but the underlying characteristics of the libraries--of all the libraries whose data we have--and what these data tell us about these libraries. What we see here is that the skewness and kurtosis have been reduced so a logarithmic transform makes the data more symmetrical--they are still not "normal," however. This result is about what is expected and with information from the correlations on the page with summary 2001 data, there are hints about underlying structures that will be dealt with in time.

Valid XHTML 1.0!


May 20, 2004
Analysis of 2001 data
Analyzing Trends
Trends Results
Tables
Public Libraries in the United States
Back to NCES index
NCLIS 30th Anniversary logo Return to NCLIS Homepage