Computer Science 501
Data Structures and Algorithm Analysis
Fall 2015, The College of Saint Rose
You have two main tasks: to complete an empirical study of sorting algorithms, and to learn more about Java iterators by completing a program that iterates over subsets to solve an interesting problem. The empirical study is due in two weeks, while the iterator lab is due in one week.
You may work alone or in a group of up to four on this lab. There is a lot of work here, so everyone is encouraged to find some classmates to work together. Of course, collaboration with your partner is unrestricted. You may discuss the lab with your classmates and give and receive some help, but your submission must be your own work (or that of you and your teammates, if you choose to form a group).
Getting Set Up
To get your BlueJ environment set up for this week's lab assignment, start BlueJ and choose "New Project" from the "Project" menu. Navigate to your folder for this course and choose the name "Lab7" (no spaces) for the project.
Create a document where you will record your answers to the lecture assignment and lab questions. If you use plain text, call it "lab7.txt". If it's a Word document, you can call it whatever you'd like, but when you submit, be sure you convert it to a PDF document "lab7.pdf" before you submit it.
Lecture Assignment Questions
We will usually discuss these questions at the start of class on the lab due date, so no credit can be earned for late submissions of lecture assignment questions.
Sorting Algorithm Empirical Study
Your two week task is to complete an empirical study of sorting algorithms. Your analysis should include all combinations of these sorting algorithms:
with these input data distributions:
Please note that this means you will be performing 24 case studies. As with the previous timing lab, automation of the gathering of timing statistics will be essential. Controlling your experiments with command-line parameters and managing multiple runs with scripts will make your task much more manageable.
Timings and Analysis
Use your program(s) to generate basic operation counts and timing data for each sorting algorithm on an appropriate range of data sizes and distributions. Compare your results with your expectations based on our efficiency analysis of these algorithms. If you find discrepencies, how might they be explained? Please include as many of the details of your test environment as you can: the type of processor or processors in the computer including clock speed, cache sizes, memory sizes, the operating system and version running on the computer, and the Java version you are using.
Tips, Tricks, Precautions, and Suggestions
10000 selection random .034693
might indicate for an input size of 10,000, using a selection sort on random input took .034693 seconds.
Include the source code for all programs and scripts you used to generate your timings.
Each case study in your writeup is worth 2 points, which will include your raw timing and operation count data, the graphs of your timing results and operation counts, and your analysis of the results, including a comparison with your theoretical expectations. When different operations have different complexities, please be sure to address that.
The Two Towers Problem
The one-week part of your assignment is the laboratory at the end of Chapter 8 in Bailey. You will gain experience writing your own generally-useful Iterator class and use it to solve an interesting problem.
Notes and Hints
When you implement your SubsetIterator that extends AbstractIterator, be sure to import structure5.* and java.util.Iterator at the top of your file. Also remember that your SubsetIterator should use generic types. It is far from obvious how to accomplish this, so here is what your class header should look like for the SubsetIterator:
public class SubsetIterator<E,T extends Vector<E>> extends AbstractIterator<T>
During the implementation of your SubsetIterator's get and next methods, you will need to create a Vector<E> and return it as a T to satisfy the required return types specified by AbstractIterator<T> for those methods. Java's syntax for this is again not entirely obvious. This should work:
T subset = (T)new Vector<E>();
For testing, write a main method of your SubsetIterator that creates a Vector<Integer> with the Integers from 0 through 7, creates a SubsetIterator<Integer,Vector<Integer>> with this Vector<Integer>, and then prints out all subsets returned and a count of the subsets returned. Make sure you end up with 256 subsets printed. Please leave this main method in your program when you submit.
Write the main method to solve the two towers problem in a separate class called TwoTowers, proceeding as specified in the lab description in the text.
Submitting
Before 6:00 PM, Wednesday, October 28, 2015, and 6:00 PM, Wednesday, November 4, 2015 (empirical study only), submit your lab for grading. There are two things you need to do to complete the submission: (i) Copy your file with the answers to the lecture assignment and lab questions into your project directory. Be sure to use the correct file name. If you prepared your answers in Word, export to a PDF file and submit that. (ii) Email a copy of your lab (a .7z or .zip file containing your project directory) to terescoj AT strose.edu. Please use a meaningful subject line such as "Joe Student Lab7 Submission".
Grading
This assignment is worth 100 points, which are distributed as follows:
> Feature | Value | Score |
LA Question 1 (9.10) | 3 | |
LA Question 2 (9.12) | 4 | |
LA Question 3 (n-story building) | 5 | |
General sorting algorithm test framework | 5 | |
Java code for specific sorting algorithms | 5 | |
Presentation and analysis of timing results | 48 | |
"Two Towers" program design and style | 2 | |
"Two Towers" program documentation | 4 | |
"Two Towers" correctness: SubsetIterator | 8 | |
"Two Towers" correctness: TwoTowers | 6 | |
Question 1 (Thought Question 1) | 1 | |
Question 2 (Thought Question 2) | 2 | |
Question 3 (complexity) | 1 | |
Question 4 (20-block time) | 1 | |
Question 5 (21-block expectation and time) | 1 | |
Question 6 (22- through 25-block times) | 1 | |
Question 7 (complexity match?) | 1 | |
Question 8 (40- and 60-block estimates) | 2 | |
Total | 100 | |