MIT ID Data Problems

UPDATED: 4/9/1998
MIT ID Home Page


This document defines some of the data problems that have been identified with the deployment of MIT ID and proposes solutions for dealing with them going forward.

Duplicate ID Problem

Description: Two or more people with the same MIT ID.
Cause: This usually occurs when a search (manual or automated) incorrectly identifies a person and an existing MIT ID number is reused.
Resolution: All records associated with this MIT ID must be manually reviewed and the MIT ID modified where appropriate.
Proposed Solution: The MIT ID Database does not allow duplicate MIT ID's. If duplicate IDs exist across departmental systems only one of the records will appear in the MIT ID Database as determined by the feed from the Warehouse. The best time to identify Duplicate IDs is during the integration of multiple data sets in the Warehouse. Each case will need to be analyzed and cleaned up separately.

Multiple ID Problem

Description: An individual is assigned more than one MIT ID.
Cause: Usually assigned by different departments when there is inadequate searching of existing records prior to MIT ID assignment.
Resolution: Manually determine which MIT ID is "correct" and then change the ID for all records containing the "incorrect" ID(s).
Proposed Solution: It is proposed that a process be put into place where "incorrect" MIT IDs may be marked as obsolete and that they may be "linked" to the "correct" ID.  Different algorithms will need to be employed to determine which ID should be the correct one. To prevent further multiple ID problems, an ID lookup operation must be performed before each individual is added to the MIT ID Database.