11.520: A Workshop on Geographic Information Systems |
11.188: Urban Planning and Social Science Laboratory |
In this exercise, you will explore the spatial patterns of the housing and
socio-economic characteristics of communities in and around
Before starting the hands-on work, read through the entire assignment to get a sense of the datasets, analytic approach, and processing steps. Then, make sure you can access the datasets in the 11.520 Locker. As explained in the lab exercises, you can find the locker by navigating through the sub-directories on network drive Z:, or you can attach the class locker to a new drive letter - we suggest Drive M: (for maps). In the Room 37-312 lab, you can attach the locker by typing the command "attach -Dm 11.520" into a DOS command window. This command will mount the Andrew File System locker //afs/athena.mit.edu/course/11/11.520/ on network drive M: On the CRON machines in Buildings 9 and 10, you can mount the class locker as drive M by running the batch file 'attach_m.bat' in the class locker via: Z:\athena.mit.edu\course\11\11.520\attach_m.bat
The census tract boundaries are saved in a 'shapefile' called msa5_tr90.shp and may be found in the M:\data directory. This shapefile contains only the bouandary geometry and it must be paired with a dbf-formatted file (called msa5_tr90_data.dbf) in order to relate the census data to specific census tracts. The attribute table for the census tract shapefile only contains a few geographic identifiers (like county, track number, etc.). Some additional socioeconomic data for these tracts have been pulled from the 1990 Census SF3A datasets and are stored in the same M:\data directory in a dbf-formatted table called msa5_tr90_data.dbf. This file must be 'linked' or 'joined' to the attrbiute table for the tract boundaries by a common field called "STCNTYTR" before you will be able to generate thematic maps using the census data. (STCNTYTR is the abbreviation for STate-CouNTY-TRact.) Use the ArcMap help files to see how to 'join' the data table to the attribute table if you want to get started with the homework before we show you how to do this in class.
The msa5_tr90_data.dbf table includes 60+ variables from the much longer list of all variables in the decennial Census. Take a look at the dictionary for the specific census data fields in msa5_tr90_data.dbf. (Note: this list is a subset of the full Census Bureau's listing and technical documentation for the hundreds of population and housing variables from the 1990 census. This technical document is archived in the class locker as M:\data\census90\census90stf3td.pdf. More details about the 1990 US Census are available in Tom Grayson's and Annie Thompson's notes on "Making Sense of the Census". Be aware, however, that some of the online references in these notes are no longer available and you will need ONLY the shorter list of 60+ variables mentioned above in order to do the homework. During the next few weeks we will have additional exercises using US census data, but we will use the 2000 US census for those exercises and will provide the relevant technical documentation. Also, note that small-area comparison of census data across decades can be tricky due to changes in census tract and block group boundaries. MIT's Rotch Library has CDs from a third-party firm, GeoLytics, that has reconciled past census data to year 2000 boundaries. Should you wish to analyze trends in census data for your individual project later in the semester, you will likely want to use the GeoLytics CD.)
Besides the census data, which will be used primarily in Problem #1, you
will need a map of major roads and shopping centers for Problem #2. The
shopping center coverage (for the
A map should always have a purpose. A good map should deliver the information that you want readers to understand. Therefore the map should be very intuitive without requiring reading the discussion of the map in your paper or report. Try to give the map to your friends who have no training in GIS to see if they can recognize the message you were trying to deliver and find the evidence to be compelling.
1. [20 points] Create a thematic (or chloropleth) map showing the population density of the MSA.
You should calculate density as persons (P0010001)
per acre (landacre).
Normalizing by the 'landacre' variable in the census
data is more reliable than using the 'area' variable since the tracts extend
into
Classify the data into approximately five categories. The classification method you use is up to you, and can include customized category breaks. Experiment with the available methods and pick the one you feel best clarifies your exploratory interest (for example, do you want to differentiate within high density areas, or characterize the full range of densities?) In a sentence or two, explain your focus, your choice of classification, and why your cartographic technique makes sense.
2. [20 points] Map the homeownership ratio--the ratio of owner occupied housing units to the total occupied housing units. Remember that the 'tenure' variables count the number of owner-occupied and renter-occupied housing units. Be careful about what is the numerator and what is the denominator. Just as for the previous map, you will also need to exclude tracts that lack adequate data. Again, include a brief few sentences explaining your choice of classification scheme.
3. [20 points] Map another Census attribute of your choosing with interesting spatial patterns using the same process as described in number two.
For this problem, you are asked to investigate the relationships among the locations of shopping centers, major roads, and residential clusters. After doing some exploratory mapping as you did in the first problem, you are asked to dig a bit further into the data, develop a few specific measures that carefully exclude incomplete or inapplicable data, and then develop maps that successfully visualize the results and reasoning behind your analysis.
The shopping center data are stored in a shapefile at M:\data\shopcntrs.shp. (These data are proprietary and not to be used or redistributed for non-MIT purposes.) Included in these data are characteristics such as square footage of retail space (totalsf as a text field and squarefeet as a numeric integer) and type of center (propertysu). Explore these two variables to try to determine if a relationship exists between them. To do this you may want to calculate the average size of each type of shopping center. Note that not all observations include a value in the totalsf or squarefeet fields. Note that these shopping 'centers' do not include places like Central Square (Cambridge) where commercial/retail activity is present among individually owned parcels and buildings along a city street. This dataset focuses on shopping center developments where a large tract of land or strip mall under common ownership is divided up into clusters of businesses.
1. [20 Points] Create one map showing the relationship(s) between shopping center location and the location of major roads and population centers. Be sure to use different symbols and/or sizes on your map for the different types of shopping centers. Likewise differentiate major and not-so-major roads -- use the class field in the majmhda1 attribute table. Use high population density as an indicator of where population centers are located and shade the tracts based on population density (as you computed it in problem 1. Be sure to turn the tract outlines off so they don't clutter the map. In fact, it will take some effort to develop maps with good symbolization and cartographic choices so that they are both readable and informative.
2. [20 Points] Buffer the major roads and create a second map that examines whether certain types of shopping centers tend to be inside the buffer. Select the Interstate Highways and Routes 2 and 3 (not 3A) from the major roads layer. Use the buffer tools to create a one kilometer buffer around these selected roads and then calculate the share of shopping centers by type (among those within the 5-county region) that lie within the buffer. Map your results and include in your "layout" a map of the entire area, as well as a more detailed map zoomed into an area of the region with interesting spatial patterns. In addition to your two map views, create and display a table showing, for each type of shopping center (propertysu), the number of such shopping centers within the 5-county msa5_tr90 region, and the number and percentage of each type of shopping center that fall within the buffer areas.
Explain in a couple of paragraphs, separate from the maps, (a) the steps you took to select those roads and shopping centers that you included when computing your statistics, and (b) your interpretation of any general pattern that you observe regarding the location of shopping centers, major roads, and population centers. In particular, be sure that your discussion covers:
Howework Requirement
Don't just turn in the maps! Write up a short report that integrates the maps, tables and the explicit answers to both questions. Use the maps and tables in the paper to illustrate and amplify your verbal reasoning rather than simply to produce maps without a stated context and purpose.
Please submit your homework (in WORD, RTF, TEXT and/or PDF format) using the Stellar homework turnin capability at http://stellar.mit.edu/S/course/11/fa09/11.520 or http://stellar.mit.edu/S/course/11/fa09/11.188
by 2 PM Wednesday, October 7, 2009.
Created and Modified:1993-2008 by Raj Singh, Thomas H. Grayson, Annie Kinsella Thompson, Joseph Ferreira, Myounggu Kang, Jschung Chung
, Jinhua Zhao, Mike Flaxman, and Yi Zhu
Last modified: 22 September, 2009, [jf]
Back to the CRON Home Page.
Back to the 11.520/11.188 Home Page.