Estimated passenger itinerary data for calendar year 2007 can be downloaded via http from:

Please note that the file is approximately 2.5G, so the download could be quite slow. In the uncompressed file, there will be a CSV for each calendar month, with the filenames:

In total, the uncompressed files take up approximately 27G of disk space. Each monthly data file has a header row describing the data columns. Each data row provides identifying information for the itinerary, including the first flight, and in the case of one-stop itineraries, the second flight, along with the estimated seating capacities for each flight. The last piece of information is the number of passengers estimated to have traveled on the itinerary. Note that the estimated number of passengers may be non-integral, because we allow non-integral estimates of the seating capacities. In order to consider integral seating capacities and passenger counts, these values should be rounded appropriately. For reference, we have listed and briefly described each of the data columns below. Using these fields, the itineraries can be easily joined with the ASQP flight data provided by BTS.

If there are any problems with the data, please contact Douglas Fearing (dfearing "at" mit.edu) or Vikrant Vaze (vikrantv "at" mit.edu). The First_Flight_ID and Second_Flight_ID columns identify the flights within the NSFNATS internal database, so please include these values when describing the problem.