CS 345 section 001 (assignments, notes, exams, paper links): Fall 2015
The copyrighted material downloadable from this page is to be
used only by the students enrolled in CS 345 under Prof. Gerbessiotis.
Distribution of this material outside this group is
NOT allowed for any reason.
C1. Midterm Performance
C2. Solutions to assignments and exams
Homework assignments are posted in the handouts section (Section B). Only sample solutions may be
posted in this area.
-
HW1 solutions.
Uploaded on 10/05. √
-
HW2 solutions.
Uploaded on 10/16. √
-
HW3 solutions.
Uploaded on 10/22. √
-
HW4 solutions.
Uploaded on 10/30. √
-
HW5 solutions: No solutions.
HW6 is a programming mini-project. √
-
HW6 solutions: No solutions.
HW7 is a programming mini-project. √
-
HW7 solutions: No solutions.
HW7 is a paper presentation. Check Section C6.L0 for the presentation schedule
and other information. √
C3. Statistics.
Statistics for the class exam will become available through
CS345 exam statistic link.
Edited on 10/30. √
C4. Exams, Statistics, and solutions to selected exams
C5. Previous Exams
The information available here is provided as is. There is no guarantee that any future exam will look like
previous exams (nor will it be so). Sample solutions are provided for some but not all exams.
Occasionally a problem from one exam might reappear in another exam or
even as a homework problem, and you might find a solution elsewhere.
C6. Lecture Summaries aka Supplementary Notes and Paper links
DISCLAIMER:
The included material DOES NOT substitute for the textbook
for this class. It should be used in conjunction with the textbook and the
material presented in class. If a statement in these "notes" seems to be
incorrect, report it to the instructor so that it can be fixed immediately.
These "notes" are distributed to the students of CS345
offered by Dr. Gerbessiotis at the New Jersey Institute of Technology;
distribution outside this group of students is prohibited.
The material below will be uploaded in due time; an upload message will
appear as soon as the corresponding document is uploaded.
SUBJECTS
Paper and Code Links
- L0
-
Paper Presentation Schedule (Fall 2015)
The link on the left gives the schedule of the presentations as text;
the link on the right is the
Fall-2015-Presentation-Schedule (Word).
The following links will be populated with the presentations and their one-page summaries.
Last modified on 12/07 at 09:30. √
- L1
- Google paper (Brin and Page) in
http://www-db.stanford.edu/~backrub/google.html
The PDF version of this paper, which might include additional material, is mirrored locally
in Google (Brin and Page).
- L2
- Google Cluster Architecture (Barroso et al) in
http://research.google.com/archive/googlecluster-ieee.pdf
Also mirrored locally in pdf in
Google Cluster Architecture (Barroso et al).
- L3
- Who Links to Whom (Bharat et al) in
http://theory.lcs.mit.edu/~ruhl/papers/2001-icdm.pdf
Also mirrored locally in pdf in
Who Links to Whom (Bharat et al).
- L4
- WebCrawlers
- WebSphinx (CMU,USA)
CMU's WebSPHINX. A local copy is available at
this link.
- UbiCrawler (Italy)
http://ubi.imc.pi.cnr.it/projects/ubicrawler/.
- WIRE (Chile)
http://www.cwr.cl/projects/WIRE/.
- Paper: Parallel Web Crawlers
Cho and Garcia-Molina (Stanford).
- L5
- Code Links
- Recursive directory descent
recdir.c.
- L6
- Google Architecture Review (2009) in
http://research.google.com/people/jeff/WSDM09-keynote.pdf
Also mirrored locally in pdf in Google Architecture Review (Jeff Dean).
- L7
- HW 8 papers (selection)
- Google Crawler for Deep Web
PDF.
- PageRank related paper
PDF.
- Aardvark Social Network search
PDF.
- L3 paper
PDF.
- L6 paper
PDF.
- Google File System
PDF.
- BigTable
PDF.
- Kleinberg's HITS algorithm
PDF.
- Google's MAPREDUCE
PDF.
- Web as a Graph
PDF.
- Twitter
PDF.
- Semantic Web
MS Word.
PDF.
- The Datacenter as a computer by Barroso and Holzle
PDF.
- PEW Center Survey
http://www.pewinternet.org/pdfs/PIP_Internet_and_Daily_Life.pdf (this link might not be active any more).
- Host Survey
http://news.netcraft.com/archives/web_server_survey.html.
- Web size
http://www.vldb.org/conf/2001/P069.pdf.
- Search Engines (query statistics)
http://www.webology.org/2004/v1n2/a4.html.
- Zmap
Zmap PDF.
- Internet Census 2012
Internet Census 2012.
- Designs, Lessons and Advice from Building Large Distributed Systems
Jeff Dean 2009 LADIS (Large Scale Distributed Systems and Middleware) Keynote talk.
- Case Study
Google's Green Data Centers.
- Case Study
Dell: Power Efficiency Comparison.
- Case Study
High Performance Datacenter Networks.
- Collection of Papers on AWS (Data Center centric)
Web-page of James Hamilton (AWS Team), papers from 2011-2014.
- L8
- The paper presentation schedule is available through link L0 above.
- L9
- NSA Data Center in
Wired article on NSA's Utah Data Center (by James Bamford, 2012)
Wired article on NSA's Utah Data Center Electricity issues (by Klint Finley, Oct 2013)
Wired article on NSA's Utah Data Center and Taxes (by Klint Finley, May 2013)
Wired article on NSA's Utah Data Center Water Usage (by Robert McMillan, Mar 2014)
Fox13 Salt Lake City's article on NSA's Utah Data Center Water Bill (by Ben Winslow, Apr 2014)
Forbes Article (by Kashmir Hill, Jul 2013)
- L10
- Some of the instructor's notes in
Notes on parallel and multithreaded computing (Dec 2009 print)
A more recent compilation (built with a newer version of LaTeX and pdfLaTeX) is this
one:
Notes on parallel and multithreaded computing (Nov 4, 2014 print)
There should be no differences, or only minimal ones, between the two prints; they differ
in the version of LaTeX and pdfLaTeX used to compile them.
(Note that the chapters on web searching there date from 2008/2009 and are obsolete; they have
been superseded by newer material, including the notes of this CS345 course.)
Last modified Dec 7, 2015 09:31