Percentages are of your total class grade.
The major portion of your grade is based on the research project. Students will organize into teams of 1-3 students and work on a semester-long project. Some possible ideas are described below.
Good class projects can vary dramatically in complexity, scope, and topic. The only requirement is that they be related to something we have studied in this class and that they contain some element of research – e.g., that you do more than simply engineer a piece of software that someone else has described or architected. To help you determine if your idea is of reasonable scope, we will arrange to meet with each group several times throughout the semester.
At the beginning of the 2nd to 4th lectures, each group will give a 5-minute presentation about their proposed project to the class. The presentation should contain:
Teams can meet with the instructor after their presentations for further discussion and feedback.
Click here to sign up. Click “next” until you get to the appropriate week.
Your research prospectus will contain an overview of the research problem, your hypothesis, a first pass at related work, a description of how you plan to complete the project, and the metrics you will use to decide whether it worked.
Your prospectus should follow the example:
Submission:
prospectus_<lastname1>_.._<lastnameN>.pdf
Your team will prepare and present a project poster at the end-of-course poster session. This gives you an opportunity to present a short demo of your work and show what you have accomplished in the class!
Submission
You will prepare a conference-style report on your project with a maximum length of 15 pages (10 pt font or larger, one or two columns, 1 inch margins, single or double spaced – more is not better). Your report should expand upon your prospectus: introduce and motivate the problem your project addresses, describe related work in the area, discuss the elements of your solution, and present results that measure the behavior, performance, or functionality of your system (with comparisons to other related systems as appropriate).
Because this report is the primary deliverable upon which you will be graded, do not treat it as an afterthought. Plan to leave at least a week to do the writing, and make sure you proofread and edit carefully!
Submission
report_<lastname1>_<lastname2>.._<lastnameN>.pdf
The following are examples of possible projects – they are by no means a complete list and you are free to select your own projects. In general, projects can be of three varieties:
Understand how scientific articles use and talk about data. Two possible directions:
Arachnid is a new explanation engine that automatically generates cleaning programs based on user specifications of data quality. It is an extension to ideas from Scorpion. Contact Eugene for a copy of Arachnid. Some possible projects:
Precision Interfaces automatically generates interaction interfaces from program logs. It supports any parsable language that can be represented as an abstract syntax tree. Extend the system in interesting ways.
Smoke is the fastest lineage-enabled database engine. It captures the relationships between output and input records as efficient lineage indexes. It turns out that these indexes can be used to express and speed up interactive applications such as visualizations. Extend or use it in interesting ways.
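To make the idea of a lineage index concrete, here is a minimal Python sketch. It is not Smoke's implementation or API; the function `group_sum_with_lineage`, the sample rows, and the index layout are all hypothetical, chosen only to illustrate the concept. It computes a group-by sum and, in the same pass, records which input rows fed each output group (backward lineage) and which output group each input row fed (forward lineage).

```python
from collections import defaultdict


def group_sum_with_lineage(rows, key, value):
    """Compute SUM(value) GROUP BY key, recording lineage in the same pass.

    Hypothetical illustration only; not how Smoke is actually built.
    """
    totals = defaultdict(int)
    backward = defaultdict(list)  # output group -> ids of contributing input rows
    forward = {}                  # input row id -> output group it contributed to
    for rid, row in enumerate(rows):
        group = row[key]
        totals[group] += row[value]
        backward[group].append(rid)
        forward[rid] = group
    return dict(totals), dict(backward), forward


# Made-up sample data for the sketch.
rows = [
    {"region": "east", "product": "a", "sales": 10},
    {"region": "west", "product": "a", "sales": 5},
    {"region": "east", "product": "b", "sales": 7},
]

totals, backward, forward = group_sum_with_lineage(rows, "region", "sales")

# Simulated interaction: the user clicks the "east" bar in a bar chart.
# The backward index returns the contributing rows without rescanning the input.
selected = [rows[rid] for rid in backward["east"]]
print(totals)
print(selected)
```

In a visualization, a lookup like `backward["east"]` is what would let a linked view (for example, a per-product breakdown of the clicked bar) update without re-running the original query over all input rows.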