COLUMBIA UNIVERSITY COMS 6998.005

Information

Prereqs

Assignments

Grading

Confirmed Speakers

The course will involve invited speakers from industry and academia to compare and contrast research and practice, as well as transitioning between the two disciplines.

Overview

Data is eating the world and developing next generation data-driven applications and systems for working with data is more important than ever before. In addition, the lines between research, applications, and industry are blurring more than ever.

This course will survey modern research in data management – from large scale data processing, modern database engines, to data cleaning and visualization, to secure data management. To ground the discussion, we will host invited speakers that have (or are) transitioned their research work from academia to industry. Depending on timing and interest, select students may be invited to join the speakers for more in-depth discussions over dinner after class.

Students are expected to actively participate in discussions.
Course capped at 25. If waitlist is huge, a small assignment will be used to choose participants.

Schedule

1/17: Introduction

1/24: Crowdsourcing

1/31: Secure Databases

2/07:

2/21: TileDB

2/14:

2/28: Instabase

3/07:

3/14: Spring Recess

3/21:

3/28:

4/04: 3 Timeless ideas in SQLite

4/11:

4/18:

4/25: Project presentations