Instructor:
Min Song
Office:
Room 5604 – GITC Building – 5th Floor
Office
Hours: Check my Web site for hours, other
times by appointment
Web
Site: http://web.njit.edu/~song/
Telephone:
973-596-5291 (email is much better if
have to leave a message)
E-mail:
min.song@njit.edu
Course:
CIS 679
I
am on campus extensively, but send me an e-mail to make sure that I am
available in my office.
OVERVIEW
This course is designed to introduce graduate students to
the foundations of database systems, focusing on basics such as data modeling,
query optimization, query processing, and transactions. This is not a course on
database design or SQL programming (though we will discuss these issues
briefly). It is designed for students who have taken 6.033 (or equivalent); no
prior database experience is assumed though students who have taken an
undergraduate course in databases are encouraged to attend.
Classes will consist of lectures and discussions based on
readings from the database literature. There will be a semester long project,
as well as two exams and a few (2 or 3) problem sets.
OBJECTIVES
This course seeks to provide an understanding of the issues
in managing database systems as an essential organizational resource. Students
will learn the components of enterprise data architecture, data storage configurations,
and information retrieval methods. The course proceeds from the relational
model to the multidimensional model, object-relational techniques, and web
accessed data.
TOPICS
• The variety and
complexity of current data management systems and evolving data management
technology
• The entity
relationship model
• Normalization
• Relational
integrity and concurrency control
• Comparison of
normalized and denormalized models
• Limitations inherent
in the relational model and possible solutions including object-oriented
databases, object-relational databases, and multi-dimensional databases
• Large text
files, multi-media and embedded information needed for a complete information
set
•
• Techniques for
managing the design, development, and maintenance of large database systems and
data warehouses; methods for handling terabyte data sets and integrating the
data with internal and external data sources, including data cleansing
• Role and
responsibilities of the database administrator; maintaining the database,
privacy and security, recovery, and tuning
• Retrieving
information using SQL and other methods
• Data mining
• Machine learning
DISCUSSION:
The objectives of the course will be achieved by discussion
and activities including the following;
• Design, build
and implement a database
• Exercise the
database built under various conditions
• Query the
database using SQL
• Use SQL to
demonstrate implementation problems
• Evaluate file
storage and transfer methods
• Sort and merge
files
• Interview real
or mock users
• Case discussions
to demonstrate management issues
• Lectures
• Team projects
• In-class student
presentations
Lecture and readings from original research papers.
presented in the text listed below and other possible sources
Semester-long project and paper.
PREREQUISITES
Prior database experience is not required.
GRADING
Grades are assigned based on 3 problem sets, midterm, a
final project, and class participation. The grading breakdown is as follows:
* Problem Sets:
20% total
* Midterm:
20%
* Final Project:
45%
* Class Participation:
15%
Late assignments will not be accepted.
COLLABORATION POLICY
For problem sets, you are allowed and expected to discuss
your answers with other students, but please write up your own answers and list
your collaborators. Copying solutions from other students is never allowed. For
the group project, you will work in teams and hand in only one written report.
TEXT
The course readings will primarily be drawn from the 4th
Edition of ``Readings in Database Systems'', edited by Stonebraker and
Hellerstein.
The text will be available from the Bookstore.
NOTE: The Third Edition of "Readings In Database
Systems" is a substantially different text (it does not include the same
readings.) There may be several other readings that will be posted on the
course web site.
SUPPLEMENTAL
For an excellent introduction to the basics of database
systems, including extensive coverage of, the textbook "Database
Management Systems" from Ramakrishnan and Gehrke is excellent. It is
available from Amazon.
Class
|
Topic
|
|
Assignment Due/ Project Progress
|
|
Data Modeling,
DBMS Architecture and Database Evolution |
Chapter 1 |
|
|
|
#1 – |
|
|
|
|
Continued |
Chapter 1 |
Project team |
|
|
#2 – |
|
|
|
|
Query Processing |
Chapter 2 |
||
|
#3 – |
|
|
|
|
Query Processing cont. |
Chapter 2 |
Problem set1 |
|
|
#4 – |
|
|
|
|
Data storage and
Access Methods |
Chapter 3 |
Project proposal |
|
|
#5 – |
|
|
|
|
Data storage and
Access Methods
cont. |
Chapter 3 |
Problem set2 |
|
|
#6 – |
|
|
|
|
Transaction
Processing |
Chapter 4 |
||
|
#7 – |
|
|
|
|
|
Transaction
Processing cont.
|
Chapter 4
|
Problem set3 |
|
#9 – Mar 8 - Mar 11 |
Midterm – online
(not in class exam) -- Miterm Review |
|
|
|
Spring Break |
March 12-18 |
|
|
|
Data Warehousing |
Chapter 7 |
Project progress report |
|
|
#10 – |
|
|
|
|
Data Warehousing
cont. |
Chapter 7 |
|
|
|
#11 – |
|
|
|
|
Data Mining |
Chapter 8 |
||
|
#12 – |
|
|
|
|
Data Mining cont. |
Chapter 8 |
||
|
#13 – |
|
|
|
|
Web Services,
Extensible Systems, Stream-based Data Management |
Chapter 9, Chapter 10 |
|
|
|
#14 – |
|
|
|
|
Web Services,
Extensible Systems, Stream-based Data Management cont. |
Chapter 9, Chapter 10 |
|
|
|
#15 – |
|
|
|
|
May 10 |
Project
Presentation Due |
In Class |
NO Extensions |