From health care to social media the world generates a tremendous amount of data every day, often too much to be processed on a single computer or even some-times a single data centre. In this graduate seminar we will learn about technologies and systems behind Big Data. In particular, we will discuss what challenges exist in processing and storing massive amounts of data. We will explore how these challenges are being solved in real-world systems as well as the limitations inherent in these designs. The evolution of these technologies will be explored by reading both current and historically significant research papers.
This is a graduate seminar where we will learn about technologies and systems behind Big Data.
- What are the challenges in processing and storing massive amounts of data?
- How are these challenges being solved in existing systems?
- What are the limitations of existing systems?
- We will learn by reading latest research papers, by watching technical talks and by discussing the material.
- Operating System Fundamentals
- Big Data File Systems
- Big Data Bases (NoSQL)
- Cloud Computing
- GPGPU (CUDA and OpenCL)
- Big Data Processing Applications (Hadoop and Spark)