CS 345 section 001 (assignments, notes, exams, paper links): Fall 2015

The copyrighted material downloadable from this page is to be used only by the students enrolled in CS 345 under Prof. Gerbessiotis. Distribution of this material outside this group is NOT allowed for any reason.


C1. Midterm Performance


C2. Solutions to assignments and exams

Homeworks are posted in the handouts section (section B). Only sample solutions might be posted in this area.


C3. Statistics

Statistics for the class exam will become available through the CS345 exam statistics link. Edited on 10/30. √


C4. Exams, Statistics, and solutions to selected exams


C5. Previous Exams

The information available here is provided as is. There is no guarantee that any future exam will look like previous exams (and it will not). Sample solutions are provided for some but not all exams. Occasionally an exam problem might reappear as a problem on another exam or even as a homework problem, and you might find a solution elsewhere.


C6. Lecture Summaries aka Supplementary Notes and Paper links

DISCLAIMER: The included material DOES NOT substitute for the textbook for this class. It should be used in conjunction with the textbook and the material presented in class. If a statement in these "notes" seems to be incorrect, report it to the instructor so that it can be fixed immediately. These "notes" are distributed to the students of CS 345 offered by Dr. Gerbessiotis at the New Jersey Institute of Technology; distribution outside this group of students is prohibited. The material below will be uploaded in due time; an upload message will appear as soon as the corresponding document is uploaded.

SUBJECTS

Paper and Code Links

L0
Paper Presentation Schedule (Fall 2015) The link on the left gives the schedule of the presentations as text; the link on the right, Fall-2015-Presentation-Schedule, gives it in Word format. The following links will be populated with the presentations and their one-page summaries. Last modified on 12/07 at 09:30. √
L1
Google paper (Brin and Page) in http://www-db.stanford.edu/~backrub/google.html The pdf version of this paper, which might include additional material, is mirrored locally in Google (Brin and Page).
L2
Google Cluster Architecture (Barroso et al) in http://research.google.com/archive/googlecluster-ieee.pdf Also mirrored locally in pdf in Google Cluster Architecture (Barroso et al).
L3
Who Links to Whom (Bharat et al) in http://theory.lcs.mit.edu/~ruhl/papers/2001-icdm.pdf Also mirrored locally in pdf in Who Links to Whom (Bharat et al).
L4
WebCrawlers
  1. WebSphinx (CMU,USA) CMU's WebSPHINX. A local copy is available at this link.
  2. UbiCrawler (Italy) http://ubi.imc.pi.cnr.it/projects/ubicrawler/.
  3. WIRE (Chile) http://www.cwr.cl/projects/WIRE/.
  4. Paper: Parallel Web Crawlers Cho and Garcia-Molina (Stanford).
L5
Code Links
  1. Recursive directory descent recdir.c.
L6
Google Architecture Review (2009) in http://research.google.com/people/jeff/WSDM09-keynote.pdf Also mirrored locally in pdf in Google Architecture Review (Jeff Dean).
L7
HW 8 papers (selection)
  1. Google Crawler for Deep Web PDF.
  2. PageRank related paper PDF.
  3. Aardvark Social Network search PDF.
  4. L3 paper PDF.
  5. L6 paper PDF.
  6. Google File System PDF.
  7. BigTable PDF.
  8. Kleinberg's HITS algorithm PDF.
  9. Google's MAPREDUCE PDF.
  10. Web as a Graph PDF.
  11. Twitter PDF.
  12. Semantic Web MS Word. PDF.
  13. The Datacenter as a computer by Barroso and Holzle PDF.
  14. PEW Center Survey http://www.pewinternet.org/pdfs/PIP_Internet_and_Daily_Life.pdf (this link might not be active any more).
  15. Host Survey http://news.netcraft.com/archives/web_server_survey.html.
  16. Web size http://www.vldb.org/conf/2001/P069.pdf.
  17. Search Engines (query statistics) http://www.webology.org/2004/v1n2/a4.html.
  18. Zmap Zmap PDF.
  19. Internet Census 2012 Internet Census 2012.
  20. Designs, Lessons and Advice from Building Large Distributed Systems Jeff Dean 2009 LADIS (Large Scale Distributed Systems and Middleware) Keynote talk.
  21. Case Study Google's Green Data Centers.
  22. Case Study Dell: Power Efficiency Comparison.
  23. Case Study High Performance Datacenter Networks.
  24. Collection of Papers on AWS (Data Center centric) Web-page of James Hamilton (AWS Team), papers from 2011-2014.
L8
Paper presentation Schedule available through link L0 above.
L9
NSA Data Center
  1. Wired article on NSA's Utah Data Center (by James Bamford, 2012).
  2. Wired article on NSA's Utah Data Center Electricity issues (by Klint Finley, Oct 2013).
  3. Wired article on NSA's Utah Data Center and Taxes (by Klint Finley, May 2013).
  4. Wired article on NSA's Utah Data Center Water Usage (by Robert McMillan, Mar 2014).
  5. Fox13 Salt Lake City's article on NSA's Utah Data Center Water Bill (by Ben Winslow, Apr 2014).
  6. Forbes Article (by Kashmir Hill, Jul 2013).
L10
Notes by the instructor: Notes on parallel and multithreaded computing (Dec 2009 print). A more recent compilation is this one: Notes on parallel and multithreaded computing (Nov 4, 2014 print). There should be no or minimal differences between the two prints; they were compiled with different versions of LaTeX and pdfLaTeX, the more recent print with the more recent versions. (Note that the chapters on web searching there are from 2008/2009 and obsolete; they have been superseded by newer material, including the notes of this CS345 course.)

Last modified Dec 7, 2015 09:31