The home page for the 2006/07 course
is here!
Security and Fault-tolerance in Distributed Systems
According to Lamport, a distributed system is one where the crash of a
computer that you've never heard of stops you from getting any work
done. This course presents methods for building dependable and secure
distributed systems. The emphasis is on fault-tolerant and
distributed cryptographic protocols. Topics include group
communication, failure detectors, reliable broadcast protocols,
distributed cryptography, threshold cryptosystems, Byzantine
agreement, quorums, replication and secure networked storage systems.
Applications to cluster computing, Internet services, and storage-area
networks will be presented.
The course presents principles and fundamental methods, and shows how
they are applied to real-world systems.
Lecturer. Dr.
Christian Cachin,
IBM Zurich Research Lab.
Dates.
| Lecture: |
Thursday, 14:15-16:00, IFW C42, starting 31.3.2005. |
| Exercise: |
Thursday, 16:15-17:00, IFW A36 (to be confirmed in first lecture). |
Web page. http://www.zurich.ibm.com/~cca/sft05/,
The course is part of the
Master
in Computer Science, in the
Major Program Information Security.
Prerequisites. Knowledge in information
security and/or network security, distributed systems, and cryptography.
- Introduction
- Dependability Concepts
- Quorums
- Consensus and Broadcast
- View-synchronous Group Communication
- Distributed Cryptography
- Byzantine Agreement
- Service Replication
- Data Storage
Books
- Hagit Attiya and
Jennifer Welch.
Distributed Computing: Fundamentals, Simulations and Advanced
Topics.
Wiley, 2nd edition, 2004.
- George Coulouris,
Jean Dollimore, and Tim Kindberg.
Distributed Systems: Concepts and Design.
Addison-Wesley, 3rd edition, 2001.
- Sape J. Mullender, editor.
Distributed Systems.
ACM Press & Addison-Wesley, 2nd edition, 1993.
Recommended articles (in order of topics)
- [opgapa03] David Oppenheimer,
Archana Ganapathi, and David A. Patterson.
Why do Internet
services fail, and what can be done about it?.
In Proc. 4th USENIX Symposium on Internet Technologies and Systems
(USITS '03), 2003.
- [patter02] David A. Patterson.
An introduction to dependability.
;login:, 27(4):61-65, August 2002.
- [chatou96] Tushar Deepak Chandra and Sam Toueg.
Unreliable failure detectors for reliable distributed systems.
Journal of the ACM, 43(2):225-267, 1996.
- [raynal05] Michel Raynal.
A short
introduction to failure detectors for asynchronous distributed systems
(Distributed Computing Column).
SIGACT News, 36(1):53-70, 2005.
- [hadtou93] Vassos Hadzilacos and Sam Toueg.
Fault-tolerant broadcasts and related problems.
In Sape J. Mullender, editor, Distributed Systems.
ACM Press & Addison-Wesley, New York, 1993.
Expanded version appears as Technical Report TR94-1425,
Department of Computer Science, Cornell University, Ithaca NY, 1994.
- [schnei90] Fred B. Schneider.
Implementing fault-tolerant services using the state machine approach: A
tutorial.
ACM Computing Surveys, 22(4):299-319, December 1990.
- [reibir94] Michael K. Reiter and
Kenneth P. Birman.
How to securely replicate services.
ACM Transactions on Programming Languages and Systems,
16(3):986-1009, May 1994.
- [cacpor02] Christian Cachin and
Jonathan A. Poritz.
Secure intrusion-tolerant replication on the Internet.
In Proc. International Conference on Dependable Systems and Networks
(DSN-2002), pages 167-176, June 2002.
- [cacsam04] Christian Cachin and
Asad Samar.
Secure distributed DNS.
In Proc. International Conference on Dependable Systems and Networks
(DSN-2004), pages 423-432, June 2004.
- [caslis02] Miguel Castro and
Barbara Liskov.
Practical Byzantine fault tolerance and proactive recovery.
ACM Transactions on Computer Systems, 20(4):398-461, November
2002.
- [schhas02] Frank Schmuck and
Roger Haskin.
GPFS: A shared-disk file system for large computing clusters.
In Proc. USENIX Conference on File and Storage Technologies (FAST
2002), 2002.
- [rikasw02] Erik Riedel, Mahesh
Kallahalla, and Ram Swaminathan.
A framework for evaluating storage system security.
In Proc. USENIX Conference on File and Storage Technologies (FAST
2002), 2002.
- [cachin04] Christian Cachin.
Security in storage networks: A
current perspective.
Talk at ZISC Information Security Colloquium, ETH, Zürich, November 2004.
Last updated Monday, 24-Aug-2009 11:34:21 CEST,
by Christian Cachin.