11.521 | Spatial
Database Management and Advanced Geographic Information Systems |
11.523 | Fundamentals
of Spatial Database Management |
Tips and Tricks working with East Boston parcel data
create view v_ebosvalue as select pid, 1.0*to_number(lotsize) as lotsize, 1.0*to_number(living_area) as living_area, 1.0*to_number(fy2005_land) as fy2005_land from bos05eb where lotsize > 0 and living_area > 0 and fy2005_land > 0;
Part I: Review of 'Zoning Variance' data base
In next week's Lab #4, we will map the zoning variances. Today, we will focus on understanding and running basic queries of the zoning databases plus ways of using the ideas in the book chapter that we read to accumulate and utilize knowledge about how to interpret ownership types among the owners of property seeking a zoning variance.
Often, the data that we wish to study cannot be linked to mappable objects in a straightforward (one-to-one) manner. For example, we may wish to study those zoning variances involving vacant land under city control that are proposed for use as housing. Figuring out which variances are controlled by the city is a database manipulation task in and of itself and understanding the neighborhoods proximate to the proposed site might require a side effort - using GIS - to tag each zoning variance with, say, the census tract that sourrounds its location. We want to help you understand - and gain some hands-on experience - with relational and spatial database management techniques that can facilitate such multi-stage analyses. Moreover, we want to help you understand how the architecture of the city's information infrastructure can affect the types of analyses that are possible and the extent to which decentralized access to city data can be an effective empowerment tool.
Schema of ZONING table (and related lookup tables with codes for Decision, Use, NSA, and Neighbrhd) |
|
Schema of 1980 Boston Census data (grouped by neighborhood statistical area, NSA) |
|
Annotated SQL queries of ZONING table |
|
Annotated SQL queries illustrating use of lookup tables to categorize ownership of properties seeking zoning variances |
|
Stages of evolution of the ZONING variance database (for next week's discussion) |
The ZONING table and associated lookup tables are in our Oracle database and accessible to everyone in the class. In addition, the same tables are available as an MS-Access database in the class data locker: http://mit.edu/11.521/data/11.521_lab3.mdb.
Part II: Distributed Access to Parcel-Level City Databases (Using lookup tables to merge local data with large, 'read-only' official databases).
Most of today's lecture focuses on the Book Chapter reading for today - http://mit.edu/11.521/papers/techcity_7ferreira.pdf which is Chapter 7 of "High Technology and Low-Income Communities: Prospects for the Positive Use of Advanced Information Technology," edited by Donald A. Schön, Bish Sanyal, and William J. Mitchell, MIT Press, 1997
This chapter explains some of the complexities and technical strategies involved in trying to empower communities by providing local access to parcel-level city databases. We would like to facilitate distributed access to centrally maintained databases - while allowing some capacity on the part of end-users to make changes in the 'official' data. Our example uses parcel-level databases of land use and ownership - a very detailed data layer that has recently become much more standardized and accessible. To emphasize how often it is necessary to do extensive data processing before such data can be utilized effectively, we consider the problem of addressing the 'spelling errors' in owner names that complicate efforts to determine ovnership patterns from typical parcel databases. For example, there are at least 17 different spellings of the Boston Redevelopment Authority in the 'owner' field of the official parcel database for Boston.
Much of the lecture explains how one can use local 'lookup' tables and SQL 'update' queries to correct the spelling errors (and construct other ownership groupings). After illustrating the techniques, we discus how these methods can provide a useful alternative to 'top-down' and 'bottom up' strategies for fixing the spelling errors. In particular, they provide a 'middle-out' alternative that allows decentralized accumulation of useful knowledge that can be readily linked to official datasets outside the user's control. Here are some of the queries that we will run in class and the powerpoint slides that outline the methods and ideas in the book chapter.
In parts of Lab #3 and Problem Set A, you will use similar methods to categorize the owners of the properties in the zoning variance database. A set of notes about these queries is available in the SQL Notes section of the class web page as this link: Grouping zoning applicants via 'lookup' tables.
Home | Syllabus
| Lectures | Labs
| CRON | 37-312
lab | MIT
Last Modified 18 Feburary 2010 (jf)