MIT: Independent Activities Period: IAP

IAP 2014



Real Time Big Data Analytics @ Twitter

Karthik Ramasamy, Sanjeev Kulkarni

Jan/13 Mon 07:00PM-09:00PM 4-237

Enrollment: Unlimited: Advance sign-up required

Tech Talk: Real Time Big Data Analytics @ Twitter: 7pm - 8pm

Demo: Storm @ Twitter: 8pm - 9pm

Location: 4-237

Food will be served!

RSVP here.

Twitter is all about real time - real time conversations, real time trends, real time search and real time content dissemination. Twitter has invested in a massive data pipeline that collects, aggregates, processes large volumes of data in real time. At the heart of the pipeline are several components that power the real-time processing.  In this talk, we will give an overview of real time analytics, discuss the twitter real time data pipeline and how various components are assembled together for extracting analytics. We will also discuss the challenges we faced and lessons we have learned while building this infrastructure at Twitter.

In our second hour we'll discuss Storm, a real time fault tolerant and distributed stream data processing system. Storm is currently used to run various critical computations in Twitter at scale and in real-time, and is at the heart of nearly every user interaction and revenue decision that is made at Twitter. We'll give an overview of Storm concepts, architecture and present use cases from actual deployments at Twitter.

Sponsor(s): Student Information Processing Board, Electrical Engineering and Computer Science
Contact: Karthik Ramasamy, sipb-iap-twitter@mit.edu