Image: ESA - C.Carreau (SEMPDN9OY2F)
Overview | Assignments | Lab sessions | Hall of Fame

NETS 212: Assignments

Homework assignments and links to relevant materials will be published here.

Homework 0: Using the Virtual Machine Image

This very simple assignment will show you how to use the virtual machine image we have prepared for you. You will need the following:

Please keep in mind that any code you save in the virtual machine is lost if you delete the VM image itself, unless you check your code into your git repository!

Homework 1: TED Talk Search

In this assignment, you will index a large data set of TED talks, upload it to DynamoDB, and then implement a little web frontend that can be used to search for talks. You will need the following:
  • The handout for this assignment
  • The TED talk data set (will be downloaded automatically by Maven)
  • Amazon's DynamoDB Local (already installed in your VM)
You should continue to use the VM image you downloaded for Homework 0.

Homework 2: Linked TED Talk Portal

In this assignment, you will learn basic and intermediate Apache Spark data manipulation, and extend your search service from Homework 1 into something more interesting for looking at TED talks. You will need the following:
  • The handout for this assignment
  • The RDDs in Java handout
  • The TED talk data set (will be downloaded automatically by Maven)
You should continue to use the VM image you downloaded for Homework 0.

Homework 3: SocialRank

In this assignment, you will enhance your Apache Spark RDD skills to do social network analysis. You will also gain some experience with Amazon's Elastic MapReduce. You will need the following:
  • The handout for this assignment
  • The Twitter and LiveJournal data sets (in a public S3 bucket)
You should continue to use the VM image you downloaded for Homework 0.