Syllabus

Note:

This is tentative and we are refining during the first week.

Lecture No. Date Topics Exams and Homework
1 09-SEP

Introduction, Course Overview, Foundational Concepts

  • Course overview, assignments, exams, grading, logistics, ...
  • Topics
    • Data, database systems, data models, programming languages, application architectures, user roles.
    • Relational model, keys, schema, schema diagrams.
    • Introduction to SQL, schema, DDL, basic queries.
    • Overview of the Entity-Relationship Model.
  • Reference material: Database Systems Concepts (Silberschatz)
    • Chapter 1.
    • Sections 2.1 through 2.4
    • Sections 3.1, 3.2, 3.3
    • Chapter 6: Sections 6.1, 6.2

2 16-SEP

Relational Model and Algebra
Intro to SQL, Data Modeling,and Applications

  • Topics
    • Relational Model, Relational Algebra.
    • SQL: Data Definition Language
    • SQL: Advanced SELECT clause, Set operations, aggregate functions, updates.
    • Data modeling: ER-Model, entities, relationships, keys, cardinality.
    • Introduction to Pandas. Pandas and SQL.
  • Reference material: Database Systems Concepts (Silberschatz)
    • Sections 3.4 - 3.10
    • Sections 2.4 - 2.6 (continued)
    • Sections 6.3 - 6.8

3 23-SEP

Intermediate SQL,Data Modeling, Applications

  • Topics:
    • SQL: Join, Views, Integrity Constraints, Indexes
    • Data Modeling: Design issues, modeling to relational, advanced topics.
    • Patterns: Inheritance, Aggregation, ...
  • Reference material: Database Systems Concepts (Silberschatz)

    4 30-SEP

    Advanced SQL, Data Modeling Patterns, Applications

    • Topics
      • Functions, Procedures, Triggers.
      • Recursive queries, Window Functions.
      • Cursors, Connections, Transactions, Isolation Levels.
    • Reference material: Database Systems Concepts (Silberschatz)
      • Chapter 5

    5 7-OCT

    Example Use Cases and Scenario Implementation

    • Modeling and implementing Columbia University course, sections, ... ...
    • Improving and optimizing IMDB data, ... ...
    • Other models TBD

    6 14-OCT

    Normalization, De-normalization, Design Patterns

    • Topics
      • Relational design objectives and issues.
      • Functional dependencies.
      • Normal forms.
      • Normalization tradeoffs and examples.
    • Reference material: Database Systems Concepts (Silberschatz)
      • Chapter 7.

    7 21-OCT

    Data Storage, Indexes, Query Processing, Transactions

    • Topics
      • Data storage media, formats and data structures.
      • Indexes.
      • Query processing.
      • Transactions, isolation/currency control, recover.
    • Reference material: Subset of material in chapters 12-19.

    8 28-OCT

    No SQL, Cloud Databases (1)

    • Topics
      • No SQL concepts and motivation.
      • CAP "Theorem". Consistency, eventual consistency.
      • Scale-up versus Scale-out.
      • Graph databases, Neo4j.
    • Reference material:

    9 04-NOV

    NoSQL, Cloud Databases (II)

    • Topics
      • Topics
        • Document DBs: MongoDB, DynamoDB
        • Key Value: Redis
        • Time Series DB: TBD
      • Reference material:

    10 11-NOV

    Big Data, Data Insight (I)

    • Topics
      • Data warehose, data lake
      • De-normalization
      • OLAP
    • Reference material:

    .

    11 18-NOV

    Big Data, Insights (II)

    • Topics
      • MapReduce, Hadoop, Sparq, AWS Glue
      • Dashboard, analysis techniques
    • Reference material

    12 25-NOV

    University Holiday

    • Topics
      • TBD
    • Reference material
      • TBD

    13 2-DEC

    Python Ecosystem, examples

    • Topics
      • Jupyter
      • Pandas
      • Plotly, Map Plot Lib
      • ScitKit
      • ... ...
    • Reference material
      • TBD

    14 9-DEC

    Overflow, Summary

    • Topics
    • Reference material
      • TBD