Teaching

Organization

Exam:	Oral exams will take place on July 31 and September 18. Registration: Contact our secretary Mrs. Gerber (This email address is being protected from spambots. You need JavaScript enabled to view it., room 36/430).
Lecture:	Tuesdays, 11:45-13:15 in 36-438.
Exercise:	The exercise will be held either on Mondays, 10:00-11:30, or on Wednesdays, 15:30-17:00. Room: 36-438.
Contact:	Prof. Dr.-Ing. Jens Schmitt M.Sc. Paul Nikolaus or follow us on Twitter

Lecture Overview

The objective of this lecture is to introduce the basics of performance management in communication networks on multiple time-scales. The contents are as follows:

Introduction & Motivation
Long-Term Performance Management
- Network Design
- Traffic Modelling
Medium-Term Performance Management
- Traffic Engineering / Routing
- Content Distribution / Caching
Short-Term Performance Management
- Packet-Level Dynamics
- Packet Scheduling
Conclusion and Outlook

Material & Slides

The slides are only accessible from within the university network (131.246.*). You can use SSH or VPN for remote access.

Chapter	Title	Last Update	Slides
0	Organization	April 25, 2017	PDF
1	Introduction	April 28, 2017	PDF
2	Long-Term Performance Management	May 19, 2017	PDF
3	Medium-Term Performance Management	May 30, 2017	PDF
4	Short-Term Performance Management	July 11, 2017	PDF uncorr-vs-indep

Exercises

The exercise sheets are only accessible from within the university network (131.246.*). You can use SSH or VPN for remote access.

Exercise	Time	Download
1	May 15 at 10:00	PDF ex_1_data_1.dat ex_1_data_2.dat Positions Solution(1-2)
2	May 24 at 15:30	PDF Solution(1-3)
3	June 26 at 10:00	PDF Solution(1)
4	July 10 at 10:00	PDF

Organisatorisches

News:	Die Klausurergebnisse vom 25.9.2017 sind online!
Nachklausur:	Am 25.09.2017, um 09:00 Uhr in Gebäude 1, Raum 106
Vorlesung:	Montags, 15:30 in Raum 46-210
Kontakt:	Prof. Dr.-Ing. Jens Schmitt M.Sc. Carolina Nogueira M.Sc. Matthias Schäfer oder folgt uns auf Twitter

Vorlesungsübersicht

Diese Vorlesung soll die Aufgaben, der Aufbau und die Arbeitsweise moderner Kommunikationssysteme näher bringen. Insbesondere liegt dabei der Fokus auf der Architektur und den Protokollen, welche heute vor allem im Internet im Einsatz sind. Anhand eines Schichtenmodells, werden von Anwendungsprotokollen (z.B. HTTP, FTP, SMTP, ...) bis zu den Grundlagen der Datenübertragung (z.B. Signalverarbeitung, Kodierungstheorie) viele Aspekte vernetzter Systeme behandelt.

Der Vorlesungsaufbau ist dabei wie folgt:

Anwendungsschicht: Softwarearchitektur, Web, FTP, E-Mail, DNS, Peer-to-peer Anwendungen
Transportschicht: Multiplexing, UDP, TCP
Vermittlungsschicht: IP, Routing, Forwarding
Sicherungsschicht: Fehlererkennung & -korrektur, Mehrfachzugriff, Ethernet
Physikalische Schicht: Theoretische Grundlagen der Datenübertragung, Übertragungsmedien, Kodierungstechniken

Vorlesungsunterlagen

Downloads zu dieser Vorlesung sind nur aus dem Universitätsnetz verfügbar (131.246.*). Um außerhalb des Universitätsnetzes auf das Material zugreifen zu können, nutzt bitte Dienste wie VPN oder SSH.

Titel	Letztes Update	Folien
0. Organisation	26. April 2017	PDF
1. Introduction	26. April 2017	PDF
2. Physical Layer	05. Mai 2017	PDF
3. Link Layer	29. Mai 2017	PDF
4. Network Layer	11. Juni 2017	PDF
5. Transport Layer	02. Juli 2017	PDF
6. Application Layer	14. Juli 2017	PDF

Klausur

Die Nachklausur findet am Montag, dem 25.09.2017 um 9 Uhr in Gebäude 1, Raum 106 statt.

Um an der Klausur teilnehmen zu dürfen benötigt ihr die Zulassung. Voraussetzung um die Zulassung dieses Semester zu erhalten waren:

Teilnahme an mindestens 9 Übungsterminen
Bestehen der Zwischenklausur (25%)
Korrigieren einer anderen Zwischenklausur

Die Liste der Neuzulassungen dieses Semester findet ihr hier: PDF. Wer seine Zulassung schon in einem vergangenen Semester erhalten hat, behält diese natürlich. Achtung: Wer seine Zulassung nicht hat darf nicht an der Klausur teilnehmen und wird davon zwangsabgemeldet. Bitte überprüft daher die Liste auf Korrektheit und klärt Unstimmigkeiten bitte mit eurem Übungsleiter ab.

Klausurergebnisse

Die Klausurergebnisse vom 11.8.2017 könnt ihr hier einsehen: PDF.

Die Klausurergebnisse vom 25.9.2017 könnt ihr hier einsehen: PDF.

Übungen

Übungsanmeldung:	bis 1. Mai 2017, 23:59
Übungsbeginn:	2. Mai Woche (KW 19)

#	Termine	Letztes Update	PDF
1	KW 19+20	8. Mai	PDF
2	KW 21+22	22. Mai	PDF
3	KW 23+24+25	3. Juni	PDF
4	KW 29	11. Juli	PDF

Zwischenklausur

Die Zwischenklausur findet dieses Jahr in einem Peer-Review-Modus statt, d.h. ihr korrigiert euch gegenseitig unter Anleitung. Dies hat den großen Vorteil, dass die ihr vor der richtigen Klausur schon Einblick erhaltet, wie Klausuren aus der Sicht des Korrigierenden aussehen und was von euch erwartet wird. Ablaufen wird das Ganze wie folgt:

Die Zwischenklausur wird in den Übungen vom 27. bis 29. Juni geschrieben. In der Woche danach, d.h. in den Übungen vom 4. bis 6. Juli werden die Klausuren dann von den Studenten (unter Anleitung) korrigiert. In den Übungen vom 11. bis 13. Juli könnt ihr dann eine halbe Übung lang Einsicht in eure korrigierte Klausur nehmen und die Korrektur überprüfen. Falls ihr in einer der Wochen nicht an eurem Termin teilnehmen könnt, müsst ihr in der gleichen Woche in eine der anderen Gruppen gehen.
Jeder Student bekommt eine eindeutige Teilnehmernummer (KoSy ID) zugewiesen, die er dann auf seiner Klausur einträgt. Auch beim Korrigieren schreibt ihr eure Nummer in das entsprechende Feld vorne auf der Klausur. Damit ist zum einen gewährleistet, dass wir überprüfen können, wer geschrieben und korrigiert hat und zum anderen wird so die Anonymität zwischen Korrektor/in und Scheiber/in gewahrt.
Um die Zulassung zu erhalten muss jeder Student a) mitschreiben b) mitkorrigieren und c) 25% der Punkte bekommen. Das Ganze wird danach anhand der Teilnehmer/Korrektornummern überprüft.
Sollte in der Einsicht jemand mit der Korrektur berechtigt nicht einverstanden sein bzw. herausfinden, dass der Korrektor unfair/böswillig korrigiert hat, wird das Problem an Prof. Schmitt persönlich übergeben und es kann zum Verlust der Zulassung des Verursachers führen.
Um die Distanz zwischen Korrektor und Schreiber zu maximieren, vermischen wir die Klausuren über die verschiedenen Übungsgruppen zur Korrektur. Wir werden aber danach dennoch Stichprobenartig überprüfen, ob korrekt korrigiert wurde. Sollten wir Klausuren finden, bei denen offensichtlich absichtlich zu viele Punkte vergeben wurden, wird der Fall an Prof. Schmitt übergeben und der Korrektor verliert die Zulassung.

Bringt für die Klausur bitte einen nicht-programmierbaren Taschenrechner mit (normaler Schulrechner). Außerdem gelten während dem Schreiben, Korrigieren und Einsehen Klausurbedingungen. Wer beim Schummeln, Fotografieren, etc erwischt wird verliert sofort die Zulassung.

Course Overview

Organization

News:	Website online
Exam:	To be announced.
Lecture:	On Tuesdays 11:45h - 13:15h in room 48-453
Exercises:	On Thursdays 10:00h - 11:30h in room 48-453
Contact:	Derek Sedlack

Course Material

Exercise Material

Organization

News:	Next deadline: July 12
Level:	Bachelor (89-4111) / Master (89-4271)
Contact:	M.Sc. Matthias Schäfer and follow us on Twitter

Important Dates

Kick-off Meeting:	April 25, 5:15pm
Registration (FCFS):	April 26, 4am - 11:59pm
Visualization Mock-up:	May 19, 11:59pm
Visualization Draft:	June 9, 11:59pm
Report & Presentation Draft:	June 30, 11:59pm
Final Visualization, Report, Presentation:	July 7, 11:59pm
Presentation Day:	July 12

Presentation Day

Each seminar participant will give a talk which should not exceed 30 minutes. In these 30 minutes, the students should a) present their topic and b) give a short demonstration of their visualization.

The presentations will be held in room 36-232 (AG Hagen) starting from 9am. The tentative schedule for the presentation day is as follows:

ID	Student	Title
09:00-10:30: Performance
P1	Sriharsha	EC-Cache: Load-Balanced, Low-Latency Cluster Caching with Online Erasure Coding
P3	Johannes	Update or Wait: How to Keep Your Data Fresh
P5	Markus	Silo: Predictable Message Latency in the Cloud
10:45-12:15: Security
S2	Marco	Fingerprinting Wi-Fi Devices Using Software Defined Radios
S3	Philipp	ARMageddon: Cache Attacks on Mobile Devices
S5	David	Leave Your Phone at the Door: Side Channels that Reveal Factory Floor Secrets

Registration

In order to participate in this year's seminar, you have to register. The registration will open on April 26, at 4am and will close on the same day at 11:59pm. No registrations will be accepted after the registration deadline has passed. There are 10 topics available and the registration is on a first-come first-served basis. As soon as the 10 topics are gone, the registration will be closed. However, if there are no topics available anymore, you can send an email to This email address is being protected from spambots. You need JavaScript enabled to view it. to get on the waiting list. Please include your three favorite topics and your study program in your email. Please also go through the introduction slides before registering. You can register for the seminar here:

Seminar Registration (only from within the Uni network/VPN)

Content

The DISCO group offers a seminar with focus on performance and security aspects of communication networks this summer. You will be able to practice working on current research literature, presenting a scientific topic and your programming ability. The latter is crucial for your grade as all presentations have to be supported by an interactive visualization of an important aspect of your paper.

Here is the list of papers for this year's seminar. Note that this list is tentative and may be subject to changes:

ID	Author(s) and Title	Student	Supervisor	PDF
Performance
P1	Rashmi et al.: "EC-Cache: Load-Balanced, Low-Latency Cluster Caching with Online Erasure Coding"	Sriharsha	Daniel	PDF
P2	Ghaderi: "Randomized Algorithms for Scheduling VMs in the Cloud"	Manish	Paul	PDF
P3	Sun et al.: "Update or Wait: How to Keep Your Data Fresh"	Johannes	Paul	PDF
P4	Zhu et al.: "PriorityMeister: Tail Latency QoS for Shared Networked Storage"		Steffen	PDF
P5	Jang et al.: "Silo: Predictable Message Latency in the Cloud"	Markus	Steffen	PDF
Security
S1	Bultel et al.: "A Prover-Anonymous and Terrorist-Fraud Resistant Distance-Bounding Protocol"	Sachinkumar	Carolina	PDF
S2	Vo-Huu et al.: "Fingerprinting Wi-Fi Devices Using Software Defined Radios"	Marco	Carolina	PDF
S3	Lipp et al.: "ARMageddon: Cache Attacks on Mobile Devices"	Philipp	Daniel	PDF
S4	Karapanos et al.: "Sound-Proof: Usable Two-Factor Authentication Based on Ambient Sound"	Simon	Matthias	PDF
S5	Hojjati et al.: "Leave Your Phone at the Door: Side Channels that Reveal Factory Floor Secrets"	David	Matthias	PDF

Topic

This year's project will focus on web caching. A web cache is what makes your browser fast (or not, if it fails) by holding copies of popular websites in your browser cache, on a computer close to you (aka. a content delivery network), and in large server farms (aka. web accelerators).

At the heart of browser performance lies the problem of selecting which web objects you'd like to have copies of, which is known as a caching policy. This project will center on the performance evaluation of various caching policies. We will experiment with various caching policies and evaluate their performance on request traces of one of the most popular websites in the world.

We will learn about basic concepts of performance evaluation, implementation, and surveying the academic literature.

Instructors

M.Sc. Daniel Berger
M.Sc. Matthias Schäfer

Follow #PEDSproject and @DISCO_Teaching.

Schedule and Slides

Here is a Google calendar with the tentative schedule.

Class time	Title	Link
28.11.2016 5pm	Introduction and overview (case study: delivery Wikipedia's content)	PDF
07.12.2016 1.30pm	I. Basic analysis (popularity distributions, CDFs, size distributions, etc)	PDF
14.12.2016 5.15pm	II. Statistics and simulation (hit ratios)	PDF
11.01.2017 5.30pm	III. More statistics and advanced simulation (classical caching policies)	PDF
18.01.2017 5.15pm	IV: Better than classical caching policies (the Filter policy)	PDF
25.01.2017 5.15pm	Q&A and general discussion	-
01.02.2017 5.15pm	V: Theory meets practice (bloom filters)	upcoming

Make your own slides by using the latex/ppt templates or by cloning (File->Make a copy) of this Google slide set.

Student Groups

Group A:

Ali Taha

Anusha Halsnad

Manish Kumar

Group B:

Oskar Frohn

Karla Schäfer

Ewa Krajnik

Selin Dursun

Group C:

Bindu Nagendra

Sriharsha Navile Basavaraju

Anonymized Real-World Trace Data

We will use traces of requests to a Wikipedia cache server. The trace format is:

seq# hashid size

where seq# is a consecutive request counter (per day), hashid is a long integer hash of the accessed page, and the size measures in bytes the size of the requested object.

[Download traces here.]

(Download available from inside the university network. These traces fit the request fornat of the webcachesim open cache simulator. )

Example C++ code to parse such a trace file. This does very little, only outputs the contents of each line of the trace.

#include <fstream>
#include <iostream>

using namespace std;

int main (int argc, char* argv[])
{
  const string traceFileName = argv[1];

  long time, hashid, size;
  ifstream infile;
  infile.open(traceFileName);
  
  while (infile >> time >> hashid >> size)
    cout << "time: " << time << " hashid: " << hashid << " size " << size << endl;
  infile.close();

  return 0;
}

Task I (due Dec 7)

based on one day ("Monday") of wikipedia requests from here (only accessible from within the university network)
your results:
- Most Popular object : hashed ID 330950210 with 2400065 requests in a single day
- Total number of requests: 198075867
- Total number of objects: 10549934
- Mean object size : 7010 Byes
- Minimum object size: 10 Bytes
- Maximum object size: 707143844 Bytes
- Object Hit ratio of caching 10GB most frequently ued : 0.745589
Group a: slides and their github repository.
Group b: n/a
Group c: slides and their github repository.

Task II (due Dec 14)

based on seven days of wikipedia requests from here (only accessible from within the university network)
A) data analysis tasks:
- Difference between daily 100 most-popular objects: the idea is to see how many of the very popular objects of day 1 are still among the very object of day 2. Formally speaking: let X denote the set of the 100 most popular objects of day 1; and let Y denote the set of the 100 most popular objects of day 2. Calculate the number of objects in the intersection of X with Y, and repeat this for day 2 and 3, 3 and 4, etc.
- Solution: 90, 93, 94, 98, 94, 91
- Popularity Plot: request count of 1st-most-popular, 10th-most-popular, 100th-most-popular, 1000th-most-popular object etc. (consider putting both x-axis and y-axis on a log scale). Create a plot which has this data (as a line) for each day.
- Object size histogram. Object size CDF. Create the CDF as a single plot with separate data (lines) for each day.
- a good example of the popularity plot was given by group c

B) simulation task: simulate LRU on both a 1-day-long trace and the 7-consecutive-days trace with 10 GB capacity.

This should be easy, if you reuse the webcachesim open cache simulator

Downloading, compiling, and running the simulator should be as easy as:

git clone This email address is being protected from spambots. You need JavaScript enabled to view it.:dasebe/webcachesim.git
cd webcachesim
cmake CMakeLists.txt
make
./webcachesim cache.07.01.tr 0 LRU 33.3219

As an alternative you can directly import from github into CLion IDE (first fork webcachesim into your git account).
Solution: object hit ratio for all days around 70% and byte hit ratio around 30%

Task III (due Jan 11)

A) Work with all 7 days worth of traces. Simulator should run through them consecutively.
- Simulate the Cache Policies: LRU, FIFO, GDSF (CS-students: LFU-DA, GDS, S2LRU)
- with Cache capacities: 100GB (CS-students: 1-1000GB)
- Measure both Object and Byte Hit ratio
- (CS-students: Measure throughput: how many requests per second is each policy able to process? )

B) Literature research

Find 10 classic/modern caching policies (besides FIFO and LRU) in the literature.

Add these policies to the following table:

Policy

Name

# Operations per Hit

# Operations per Miss

Memory

Overhead (besides the cached object)

LRU

O(1)

(List entry + unordered hash map entry) for each cached object

FIFO

none

O(1)

(List entry + unordered hash map entry) for each cached object

Task IV (due Jan 18)

Literature on the "one-hit-wonder" problem in Internet content delivery.
Evaluate a simple implementation of the FILTER cache policy (based on a hash map, e.g., the std::unordered_map)
- OHR/BHR
- throughput
- memory overhead

Task V (due Feb 1)

Read the literature on "bloom filters". Note that the Wikipedia entry talks mostly about checking whether something has been requested once or never, whereas it is necessary to count for our purposes (i.e., need an integer array instead of a bool array). This paper details the idea of building many hash functions from just two hash functions.
Familiarize yourself with 128bit-Murmurhash3. Split the 128bit hash into two 64 bit hashes (e.g., access the 128bit hash as an array of 64bit values) and use them to derive them any number of hash functions.
Implement a simple bloom filter (about 50LOC). Note that realistic values for the Filter policies' N parameter only need 8-10 bits, so an array of uint16 leaves plenty room. Be wary of overruns, however, some objects will get many requests (why not stop incrementing your bloom filter, once we cross the N threshold of the filter policy?)
Change Filter-LRU and/or Filter-FIFO to use your bloom filter instead of the std map.
Evaluate the hit ratio, throughput, memory overhead for different variants of the new and old Filter policies
- start with an array size of l=1MB and a number of k=8 hash functions (the slides denote the number of hash functions n, but that collides with the Filter's N, so let's call this k).
- also experiment with l=512KB, 2MB, 4MB, 8MB and k=4,16,32

Final Report Guidelines

4 pages of text per person, not counting space taken up by figures (i.e., 2 pages each half text/ half plot count as one page of text).
Figures and plots need to be legible, clear, with proper axis labels and legends.
Each figure should have a descriptive caption, and should be referenced and discussed in the text.
Correctly reference literature and sources, section at end of paper which lists all references.
We suggest you use Latex as it simplifies referencing and figure placement (use this template (DiscoReport.zip) ).
Content (brackets indicate percentage of overall text length that we suggest for this section):

Intro and Problem Statement (10%)
1. How does the Wikipedia use caching and why?
2. What is the optimal caching policy for the Wikipedia?
3. Description of traces, research questions, reference metrics in Section 4
Literature Research (10%)
1. Table of Classical Caching Policies
2. References, short discussion of policies
Statistics For A Week of Wikipedia Requests (10%)
1. Popularity distribution line-graph (Zipf's law)
2. Popularity over time line-graph (differences between days)
3. CDF plot of object sizes, min, mean, max object size
4. Cumulative count of one-hit wonders line-graph
Experimental Setup (10%)
1. Metrics to Evaluate CDN Performance
  1. Object hit ratio (OHR), Byte hit ratio (BHR), throughput, memory overhead
  2. Define each. Why is each relevant?
2. Simulator (how does it work, source)
3. Experiment hardware, OS, etc.
Performance Evaluation of Classical Caching Policies (20%)
1. For 10GB cache, BHR and OHR bar-plots for
  1. LFU (caching the most frequently used objects)
  2. LRU, FIFO, GDSF, LFU-DA, GDS, S2LRU
2. For 10GB cache, throughput box-plot, memory overhead bar-plot
  1. LRU, FIFO, GDSF, LFU-DA, GDS, S2LRU
3. Discussion
4. Trade-offs when choosing a classical caching policy
5. Plots for 1-1000GB
  1. Which policy would you choose if you had 1-50GB vs 50-500GB vs 500-1000GB?
Maximizing the BHR with Cache Filters (30%)
1. "Simple" Filter 10GB: BHR/OHR bar-plots, throughput box-plot, memory-overhead
  1. Discussion of "Simple" Filter:
  2. Parameter N, picking N*
  3. Comparison to classical policies, +/-
2. How to build a low-overhead high-throughput Filter: Bloom-Filter idea.
  1. Parameters of Bloom-Filter: which ones would you chose?
  2. Comparison to classical policies and "Simple" Filter
Conclusions and Future Work (10%)
1. What is missing to use your filter in practice? How can you still improve it?
2. Alternative ideas to boost the OHR or BHR that you can think of
References

General Reading References

Subcategories

Past Teaching

Teaching Overview

Page 7 of 10

Start
Prev

Teaching

University of Kaiserslautern

we are part of the Department of Computer Science at the TU Kaiserslautern

Write your thesis with a disco advisor

We offer a variety of bachelor and master theses at any point in the academic year. Also check out some of our completed theses. Read more...

Teaching

Quantitative Aspects of Distributed Systems Summer 2017

Organization

Lecture Overview

Material & Slides

Exercises

Kommunikationssysteme Sommer 2017

Organisatorisches

Vorlesungsübersicht

Vorlesungsunterlagen

Klausur

Klausurergebnisse

Übungen

Zwischenklausur

A Survey of Information Security WS16

Course Overview

Organization

Course Material

Exercise Material

Seminar: Distributed Computer Systems

Organization

Important Dates

Presentation Day

Registration

Content

Project: Performance Evaluation of Distributed Systems WS16

Topic

Instructors

Schedule and Slides

Student Groups

Anonymized Real-World Trace Data

Task I (due Dec 7)

Task II (due Dec 14)

Task III (due Jan 11)

Task IV (due Jan 18)

Task V (due Feb 1)

Final Report Guidelines

General Reading References

Subcategories

Past Teaching

Teaching Overview

Teaching

University of Kaiserslautern

Write your thesis with a disco advisor